BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy1911
(342 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|110456454|gb|ABG74712.1| cathepsin B preproprotein-like protein [Diaphorina citri]
Length = 125
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 98/98 (100%), Positives = 98/98 (100%)
Query: 66 KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 125
KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE
Sbjct: 19 KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 78
Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN 163
NDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN
Sbjct: 79 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN 116
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/24 (100%), Positives = 24/24 (100%)
Query: 317 NCYNPSYESTYRFDLKKGKKAHMV 340
NCYNPSYESTYRFDLKKGKKAHMV
Sbjct: 1 NCYNPSYESTYRFDLKKGKKAHMV 24
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
Length = 347
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 81/155 (52%), Positives = 108/155 (69%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FDAR+KW +C SLR I DQ NCGSCWAVSVA A +DRLCIASN + G IS++ ++
Sbjct: 92 VPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSRELM 151
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GC GG+P AW F +G+VTGGDY+S +GCQPY +APCEHH++G NC+
Sbjct: 152 SCCSYCGFGCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEHHMEGSKPNCSA 211
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C+ C + S Y+ D +KGK A++V
Sbjct: 212 SPTEPTPACETTCTHGS-SLAYQKDRQKGKSAYLV 245
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/97 (53%), Positives = 64/97 (65%), Gaps = 4/97 (4%)
Query: 66 KKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGW 122
K A++VP Q I+++GP+VA F VY DF YKSGVY +H G HAV+V+GW
Sbjct: 240 KSAYLVPVGEKQTQLEIFKNGPIVAAFKVYEDFFMYKSGVYKRHPESPFRGRHAVKVIGW 299
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G +N +PYWLV NSW+ WGD G FKI RG NE D E
Sbjct: 300 GEQNGLPYWLVQNSWDYDWGDKGLFKIARG-NECDFE 335
>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
Length = 330
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/192 (46%), Positives = 115/192 (59%), Gaps = 6/192 (3%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGL--PRNFDAREKWPECPSLRHIADQ 209
G N D++ + R+ + L M Q +GL P+NFDARE+WP CP+L+ I DQ
Sbjct: 43 GHNFRDVDYSYVKRLCGTFLKGPKLPVM-VQYTEGLKLPKNFDAREQWPNCPTLKEIRDQ 101
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
+CGSCWA A AISDR+CI SN + +IS+Q ++ C +C GCNGG+P AW FW
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTCCDSCGMGCNGGYPSAAWDFWT 161
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
+G+VTGG YNS GC+PYT+ PCEHHV G CT G TP C C P Y Y+
Sbjct: 162 TDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGG-DTPNCDMKC-EPGYSPLYK 219
Query: 329 FDLKKGKKAHMV 340
D GK ++ V
Sbjct: 220 EDKHFGKTSYSV 231
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 73/99 (73%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K ++ VP + M +++++GP+ A F+VY DFL YKSGVYQH G ++G HA+++L
Sbjct: 223 HFGKTSYSVPSNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGGHAIKIL 282
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG EN +PYWL ANSWN WGD+G FKILRGE+ IE
Sbjct: 283 GWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIE 321
>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
Length = 330
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 90/192 (46%), Positives = 114/192 (59%), Gaps = 6/192 (3%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGL--PRNFDAREKWPECPSLRHIADQ 209
G N D++ + R+ + L M Q A L P NFDARE+WP CP+L+ I DQ
Sbjct: 43 GHNFHDVDYSYVKRLCGTLLKGPRLPVM-VQYADDLKLPTNFDAREQWPNCPTLKEIRDQ 101
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
+CGSCWA A AISDR+CI SN + +ISAQ ++ C C GCNGG+P AW FW
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSNAKVSVEISAQDLLTCCDGCGMGCNGGYPSAAWDFWS 161
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
+G+VTGG YNS GC+PYT+ PCEHHV G CT G TP C +C P Y +Y+
Sbjct: 162 SDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGG-DTPNCDMSC-EPGYSPSYK 219
Query: 329 FDLKKGKKAHMV 340
D GK ++ V
Sbjct: 220 QDKHFGKTSYSV 231
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 56/108 (51%), Positives = 77/108 (71%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ H+ K ++ VP + + M+++Y++GP+ F+VY DFL YKSGVYQH G +
Sbjct: 214 YSPSYKQDKHFGKTSYSVPSNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGVYQHVSGPA 273
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+++LGWG EN +PYWL ANSWN WGD+G FKILRGE+ IE
Sbjct: 274 LGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIE 321
>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
Length = 330
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 89/192 (46%), Positives = 116/192 (60%), Gaps = 6/192 (3%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKG--LPRNFDAREKWPECPSLRHIADQ 209
G N D++ G+ + + L M Q+A G LP+ FDARE+WPECP+L+ I DQ
Sbjct: 43 GHNFHDVDYGYVKNLCGTLLKGPKLPIM-VQSAGGMKLPKQFDAREQWPECPTLKEIRDQ 101
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
+CGSCWA A AISDR+CI + G + +IS+Q ++ C +C GCNGG+P AW FW
Sbjct: 102 GSCGSCWAFGAAEAISDRICIHTKGKVSVEISSQDLLTCCDSCGMGCNGGYPANAWEFWT 161
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
G+VTGG YNS GC+PYT+ PCEHHV G CT G TPEC C Y +Y+
Sbjct: 162 EQGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGG-DTPECVTQC-EAGYTPSYQ 219
Query: 329 FDLKKGKKAHMV 340
D GK ++ V
Sbjct: 220 KDKHYGKTSYGV 231
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 56/108 (51%), Positives = 70/108 (64%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ HY K ++ VP Q IY++GP+ F VY DF YKSGVYQH G +
Sbjct: 214 YTPSYQKDKHYGKTSYGVPSEEEQIQSEIYKNGPVEGAFIVYEDFPSYKSGVYQHVTGSA 273
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA++++GWG EN +PYWL ANSWN WGD+G FKILRG N IE
Sbjct: 274 LGGHAIKMIGWGEENGVPYWLCANSWNTDWGDNGFFKILRGSNHCGIE 321
>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
Length = 330
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 86/192 (44%), Positives = 115/192 (59%), Gaps = 6/192 (3%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGL--PRNFDAREKWPECPSLRHIADQ 209
G N D++ + ++ + L M Q +GL P+NFDARE+WP CP+L+ I DQ
Sbjct: 43 GHNFRDVDYSYVKKLCGTFLKGPKLPVM-VQYTEGLKLPKNFDAREQWPNCPTLKEIRDQ 101
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
+CGSCWA A AISDR+CI S+ + +IS+Q ++ C +C GCNGG+P AW FW
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSDAKVSVEISSQDLLTCCDSCGMGCNGGYPSAAWDFWA 161
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
G+VTGG YNS GC+PYT+ PCEHHV G C+ G TP C C P Y +Y+
Sbjct: 162 TEGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCSGEGG-DTPNCDMKC-EPGYSPSYK 219
Query: 329 FDLKKGKKAHMV 340
D GK ++ V
Sbjct: 220 QDKHFGKTSYSV 231
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 55/108 (50%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ H+ K ++ VP + + M +++++GP+ F+VY DFL YKSGVYQH G
Sbjct: 214 YSPSYKQDKHFGKTSYSVPSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSP 273
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+++LGWG EN +PYWL ANSWN WGD+G FKILRGE+ IE
Sbjct: 274 VGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIE 321
>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
Length = 330
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 87/192 (45%), Positives = 115/192 (59%), Gaps = 6/192 (3%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGL--PRNFDAREKWPECPSLRHIADQ 209
G N +++ + R+ + L M Q A GL P FDARE+WPECP+L+ I DQ
Sbjct: 43 GHNFHNVDYSYVRRLCGTMLKGPKLPIM-VQYAGGLKLPAEFDAREQWPECPTLKEIRDQ 101
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
+CGSCWA A AISDR+CI S G + +IS++ ++ C +C GCNGG+P AW FW
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSGGKISVEISSEDLLTCCDSCGMGCNGGYPSSAWDFWT 161
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
G+V+GG YNS GC+PYT++PCEHHV G CT G TPEC C Y +Y+
Sbjct: 162 KEGLVSGGLYNSHIGCRPYTISPCEHHVNGSRPPCTGEGG-DTPECISRC-EAGYSPSYK 219
Query: 329 FDLKKGKKAHMV 340
D GK ++ V
Sbjct: 220 QDKHYGKSSYSV 231
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 55/108 (50%), Positives = 69/108 (63%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ HY K ++ V +I ++GP+ F+VY DF+ YKSGVYQH G
Sbjct: 214 YSPSYKQDKHYGKSSYSVEGSVEQIQAEISKNGPVEGAFTVYEDFVMYKSGVYQHVSGSV 273
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA++VLGWG E+ IPYWL ANSWN WGD+G FKILRG N IE
Sbjct: 274 LGGHAIKVLGWGEEDGIPYWLCANSWNTDWGDNGFFKILRGSNHCGIE 321
>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 337
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 101/155 (65%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD R+KWP+CP+L I DQ +CGSCWA A++DR C SNG S++ ++
Sbjct: 83 LPENFDPRDKWPDCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSSEDLL 142
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C GCNGG P LAW +W H G+V+GG+YNS +GC+PY + PCEHHV G C+
Sbjct: 143 SCCPICGLGCNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPYEIPPCEHHVPGNRMPCS- 201
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C++NC N Y Y+ D + GK + V
Sbjct: 202 -GDTKTPKCQKNCEN-GYNVMYKKDKRYGKHVYSV 234
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 52/81 (64%), Positives = 65/81 (80%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
++Y++GP+ F+VYAD L YKSGVY+H GD++G HA+++LGWGVEND YWLVANSWN
Sbjct: 244 ELYKNGPVEGAFTVYADLLAYKSGVYKHIQGDALGGHAIKILGWGVENDNKYWLVANSWN 303
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRGEN IE
Sbjct: 304 TDWGDNGFFKILRGENHCGIE 324
>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
Length = 333
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 78/155 (50%), Positives = 102/155 (65%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+NFD+RE+WP CP+L+ I DQ +CGSCWA A AISDRLCI SNG + +IS++ ++
Sbjct: 79 LPKNFDSREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLL 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GCNGG+P AW FW G+V+GG Y+S GC+PYT+ PCEHHV G CT
Sbjct: 139 TCCDSCGMGCNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTG 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C C Y +Y+ D GK ++ V
Sbjct: 199 EGG-DTPQCILQC-ESGYTPSYKADKHYGKSSYSV 231
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 50/92 (54%), Gaps = 5/92 (5%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ HY K ++ VP Q IY++GP+ F+VY DFL YK+GVYQH G +
Sbjct: 214 YTPSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSA 273
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGD 143
+G HA++ W E + +S D WGD
Sbjct: 274 VGGHAIK--SWLGEEVCSLLALCHSDTD-WGD 302
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
Length = 335
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD+RE+WP CP++R I DQ +CGSCWA A+SDR+CIAS G + SA+ +V
Sbjct: 83 LPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIASGGKIHFRFSAEDLV 142
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W H G+V+GG + S GCQPY +APCEHHV G +C
Sbjct: 143 SCCHTCGFGCNGGFPGAAWSYWVHKGLVSGGPFGSNLGCQPYAIAPCEHHVNGTRPSCEG 202
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C + C + SY Y D + G K++ +
Sbjct: 203 EGG-KTPKCVKKCQD-SYTVPYAKDKRYGSKSYSI 235
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 52/98 (53%), Positives = 67/98 (68%), Gaps = 2/98 (2%)
Query: 64 YFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y K++ +PR ++I +GP+ F+VY D L YK GVYQH G +G HA+R+LG
Sbjct: 228 YGSKSYSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILG 287
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WGVEN+ YWL+ANSWN WGD+G FKILRGE+ IE
Sbjct: 288 WGVENNTKYWLIANSWNSDWGDNGFFKILRGEDHLGIE 325
>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
Length = 347
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 77/155 (49%), Positives = 101/155 (65%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDAR +WP CPS+ I DQS+CGSCWA A+SDR+CI S G +SA+++V
Sbjct: 94 LPKSFDARVEWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIKSKGKHKPFLSAENLV 153
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GCNGG+P AW +W + G+VTG YN+ GCQPY PCEHHV GPL +C
Sbjct: 154 SCCSSCGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHVIGPLPSCD- 212
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G ++TP CK NC P Y Y D G+K + +
Sbjct: 213 -GDVETPSCKTNC-QPGYNIPYEKDKWYGEKVYRI 245
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 52/87 (59%), Positives = 63/87 (72%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M ++ +GP+ F VYADF YKSGVYQH G +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 253 MLELMRNGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 312
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
WN WGD G FKI+RG+NE IE N
Sbjct: 313 WNSDWGDKGYFKIVRGKNECGIESDVN 339
>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
Length = 337
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 78/175 (44%), Positives = 108/175 (61%), Gaps = 9/175 (5%)
Query: 172 EDDDLETMGCQNAK-----GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISD 226
ED+ T+ + K GLP NFD R+KWP+CP+L + DQ +CGSCWA A++D
Sbjct: 63 EDEHFATLPIKTHKIDLIAGLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTD 122
Query: 227 RLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQ 285
R+C SNG SA+ +++C P C GC+GG P+LAW +W H G+V+GG YNS +GC+
Sbjct: 123 RVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCR 182
Query: 286 PYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
PY + PCEHHV G C+ G KTP+C + C Y+ Y+ D + GK + V
Sbjct: 183 PYEIPPCEHHVPGNRMPCS--GDTKTPKCTKKC-ESGYDVNYKQDKQYGKHVYTV 234
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 65/81 (80%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+++++GP+ F+VY+D L YKSGVY+H GD++G HAV++LGWGVEND YWL+ANSWN
Sbjct: 244 ELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDALGGHAVKILGWGVENDNKYWLIANSWN 303
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRGE+ IE
Sbjct: 304 SDWGDNGFFKILRGEDHCGIE 324
>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
Length = 339
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 80/155 (51%), Positives = 99/155 (63%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAREKWP C S+ I DQSNCGSCWA A AISDR+CIAS G +IS + +V
Sbjct: 88 LPESFDAREKWPYCSSIAEIRDQSNCGSCWAFGAAGAISDRICIASGGKHQPRISPEDLV 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GC GG+P AW +W NG+VTG YN+ + C+PY+ PCEHHV GP + CT
Sbjct: 148 DCCADCGMGCQGGYPAQAWEYWVRNGLVTGDLYNTTDTCRPYSFPPCEHHVVGPRKPCT- 206
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C + C P Y TY D G KA+ +
Sbjct: 207 -GDPTTPQCVKKC-QPEYPKTYENDKWYGLKAYSI 239
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/87 (57%), Positives = 58/87 (66%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
MR + +GPL F VYADF Y SGVY+H G +G HAVR++GWGVE+ YWL+ANS
Sbjct: 247 MRDLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGGHAVRLVGWGVEDGADYWLIANS 306
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
WN WGD G FKI RG NE IE N
Sbjct: 307 WNTDWGDGGYFKIRRGVNECGIESDAN 333
>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
Length = 330
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 78/156 (50%), Positives = 102/156 (65%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE+WP CP+L+ I DQ +CGSCWA A AISDR+CI SN + +IS++ ++
Sbjct: 79 LPEEFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLL 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GCNGG+P AW FW G+V+GG Y+S GC+PYT+APCEHHV G +CT
Sbjct: 139 TCCMSCGMGCNGGYPSAAWDFWTKEGLVSGGLYDSHIGCRPYTIAPCEHHVNGSRPSCTG 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
G TP+C C Y +Y+ D GK ++ VL
Sbjct: 199 EGG-DTPQCITKC-EAGYTPSYKEDKHFGKTSYTVL 232
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 70/108 (64%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ H+ K ++ V Q I+++GP+ F VY DF+ YKSGVYQH G +
Sbjct: 214 YTPSYKEDKHFGKTSYTVLSDEEQIQSEIFKNGPVEGAFIVYEDFVLYKSGVYQHVSGSA 273
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+++LGWGVE+ +PYWL ANSWN WGD+G FK LRG + IE
Sbjct: 274 VGGHAIKILGWGVEDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCGIE 321
>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
Length = 330
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 78/156 (50%), Positives = 101/156 (64%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FD+RE+WP CP+L+ I DQ +CGSCWA + AISDRLCI SN + +ISA+ ++
Sbjct: 79 LPKAFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAISDRLCIHSNAKVSVEISAEDLL 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GCNGG+P AW FW G+V+GG Y+S GC+PYT+ PCEHHV G CT
Sbjct: 139 TCCDSCGMGCNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTIPPCEHHVNGSRPPCTG 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
G TP+C C Y +YR D GK ++ VL
Sbjct: 199 EGG-DTPQCLSQC-EAGYTPSYREDKHYGKTSYSVL 232
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 56/108 (51%), Positives = 71/108 (65%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ HY K ++ V A Q IY++GP+ F+VY DF+ YKSGVYQH G +
Sbjct: 214 YTPSYREDKHYGKTSYSVLSDEAEIQYEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSA 273
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA++VLGWG EN +PYWL ANSWN WGD+G FK LRG + IE
Sbjct: 274 VGGHAIKVLGWGEENGVPYWLCANSWNTDWGDNGFFKFLRGSDHCGIE 321
>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
Length = 287
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 101/155 (65%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD R+KWP+CP+L I DQ +CGSCWA A++DR+CI SN SA+ +V
Sbjct: 44 LPENFDPRDKWPDCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 103
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C GCNGG P LAW +W H G+V+GG+YNS +GC+PY + PCEHHV G C
Sbjct: 104 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCN- 162
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C++ C + SY ++ D + GK + V
Sbjct: 163 -GDTKTPKCEKTCES-SYTVPFKKDKRYGKHVYSV 195
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 47/85 (55%), Positives = 64/85 (75%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
N +++++GP+ F+VY+D L YKSGVYQH G+++G HA+++LGWGVEN YWL+A
Sbjct: 201 NIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTHGNALGGHAIKILGWGVENGSKYWLIA 260
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSWN WGD+G KILRGE+ IE
Sbjct: 261 NSWNSDWGDNGFLKILRGEDHCGIE 285
>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
Length = 332
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 77/155 (49%), Positives = 102/155 (65%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR +WP+CP+L+ + DQ +CGSCWA A AISDRLCI SNG +ISA+ ++
Sbjct: 79 LPVDFDARVQWPQCPTLKEVRDQGSCGSCWAFGAAEAISDRLCIHSNGLMNVEISAEDLL 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GCNGG+P AW FW +G+V+GG Y+S GC+PY++APCEHHV G CT
Sbjct: 139 SCCDSCGMGCNGGYPSAAWEFWTTDGLVSGGLYDSHIGCRPYSIAPCEHHVNGSRPPCTG 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C + C Y Y D GK ++ V
Sbjct: 199 EGG-DTPQCTKKC-EAGYTPGYTQDKHYGKLSYSV 231
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 58/114 (50%), Positives = 72/114 (63%), Gaps = 2/114 (1%)
Query: 48 KKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQ 105
KK + Y P HY K ++ V Q IY++GP+ F+VY DFL YK+GVYQ
Sbjct: 208 KKCEAGYTPGYTQDKHYGKLSYSVDDSEKEIQLEIYKNGPVEGAFTVYEDFLLYKTGVYQ 267
Query: 106 HNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
H G ++G HA++VLGWG EN PYWL ANSWN WGD+G FKILRG + IE
Sbjct: 268 HVTGSAVGGHAIKVLGWGEENGTPYWLCANSWNTDWGDNGFFKILRGSDHCGIE 321
>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
Length = 330
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 113/192 (58%), Gaps = 6/192 (3%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGL--PRNFDAREKWPECPSLRHIADQ 209
G N D++ + R+ + L M Q A GL P FD+RE+WPECP+L+ I DQ
Sbjct: 43 GHNFRDVDYSYVRRLCGTMLKGPKLPIM-VQYAGGLKLPAQFDSREQWPECPTLKEIRDQ 101
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
+CGSCWA A AISDR+CI S + +IS++ ++ C C GCNGG+P AW FW
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSGSKVSVEISSEDLLTCCDACGMGCNGGYPSAAWDFWT 161
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
G+V+GG YNS GC+PYT+ PCEHHV G +C+ G TP+C +C Y TY
Sbjct: 162 KEGLVSGGLYNSHIGCRPYTIPPCEHHVNGSRPHCSGEGG-DTPKCVHSC-EAGYSPTYT 219
Query: 329 FDLKKGKKAHMV 340
D GK ++ V
Sbjct: 220 KDKHYGKSSYSV 231
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 69/108 (63%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY K ++ V +I ++GP+ F VY DF+ YKSGVYQH G +
Sbjct: 214 YSPTYTKDKHYGKSSYSVEASVEQIQAEISQNGPVEGAFIVYEDFVMYKSGVYQHTTGSA 273
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA++VLGWG E+ +PYWL ANSWN WG++G FKILRG + IE
Sbjct: 274 LGGHAIKVLGWGEEDGVPYWLCANSWNTDWGENGFFKILRGSDHCGIE 321
>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
Length = 344
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 83/191 (43%), Positives = 110/191 (57%), Gaps = 4/191 (2%)
Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
R ++ +DI + N L T + LP+ FDAR+ WP CPS+ I DQS
Sbjct: 56 RFKSVSDIRRMLGALPDPNGGYLPTLCTGYTPSLDELPKEFDARKHWPHCPSISEIRDQS 115
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
+CGSCWA A+SDR+CI S G +SA+++VAC +C GCNGG+P AW +W
Sbjct: 116 SCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCSSCGMGCNGGFPHSAWSYWKR 175
Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRF 329
+G+VTG YN+ +GCQPY PCEHHV GP +C G ++TP+CK C P Y Y
Sbjct: 176 SGIVTGDLYNTTDGCQPYEFPPCEHHVVGPRPSCG--GDVETPKCKTTC-QPGYNIPYNK 232
Query: 330 DLKKGKKAHMV 340
D GK + V
Sbjct: 233 DKWYGKTVYRV 243
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 53/90 (58%), Positives = 65/90 (72%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M+++ +HGP+ F VYADF YKSGVYQH G +G HAVR+LGWG EN +PYWL+ANS
Sbjct: 251 MKEVMDHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWGEENGVPYWLIANS 310
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRV 166
WN WGD+G FKI+RG NE IE N +
Sbjct: 311 WNSDWGDNGYFKIIRGRNECGIESDVNAGI 340
>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
Length = 330
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 78/155 (50%), Positives = 100/155 (64%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR++WP CP+L+ I DQ +CGSCWA A AISDR+CI SNG +IS++ ++
Sbjct: 79 LPKEFDARQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNGKVNVEISSEDLL 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GCNGG+P AW FW G+V+GG Y S GC+PYT+APCEHHV G CT
Sbjct: 139 TCCDSCGMGCNGGYPSAAWDFWASEGLVSGGLYESHIGCRPYTIAPCEHHVNGSRPPCTG 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TPEC + C Y +Y D GK ++ V
Sbjct: 199 EGG-DTPECVRQC-ESGYTPSYIQDKHYGKTSYSV 231
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 58/108 (53%), Positives = 72/108 (66%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ I HY K ++ VP Q IY++GP+ F+VY DFL YK+GVYQH G +
Sbjct: 214 YTPSYIQDKHYGKTSYSVPSDEQQIQTEIYKNGPVEGAFTVYEDFLLYKTGVYQHVSGSA 273
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA++VLGWG EN PYWL ANSWN WGD+G FKILRG + IE
Sbjct: 274 VGGHAIKVLGWGEENGTPYWLCANSWNTDWGDNGYFKILRGSDHCGIE 321
>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD RE+WP CP+L+ I DQ NCGSCWA A AISDR+CI S G + +ISA+ ++
Sbjct: 79 LPDSFDPREQWPNCPTLKQIRDQGNCGSCWAFGAAEAISDRICIQSGGKISLEISAEDLL 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C GC GG+P AW FW + G+VTGG ++S+ GC+PYTLAPCEHHV G C
Sbjct: 139 TCCDECGMGCFGGFPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAPCEHHVNGSRPPCQ- 197
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+++TP+C C N Y +Y D G++++ +
Sbjct: 198 -GEVETPKCVTQCNN-GYSLSYPKDKHFGQRSYSI 230
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 56/99 (56%), Positives = 73/99 (73%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ ++++ +P + M ++Y++GP+ A FSVYADFL YK+GVYQH GD +G HAV++L
Sbjct: 222 HFGQRSYSIPSQQEQIMTELYKNGPVEAAFSVYADFLLYKNGVYQHVTGDMLGGHAVKIL 281
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG EN PYWLVANSWN WGD G FKI RG +E IE
Sbjct: 282 GWGEENGTPYWLVANSWNSDWGDKGFFKIKRGNDECGIE 320
>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
Length = 347
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 76/157 (48%), Positives = 104/157 (66%), Gaps = 3/157 (1%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
+ LP NFDAR +WP CP+++ + DQ +CGSCWA A+SDR+CIASNG +ISA+
Sbjct: 93 RDLPTNFDARTQWPNCPTVKEVRDQGDCGSCWAFGAVEAMSDRICIASNGKVNAEISAED 152
Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
++AC +C GC GG+P AWR++ G+VTGG YNS +GCQPY + C+HHV G LQ C
Sbjct: 153 LLACCSSCGEGCQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYMIPACDHHVVGHLQPC 212
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ KTP+C + C +Y TY+ D GK ++ V
Sbjct: 213 P-KEEAKTPKCSKKC-EANYNVTYKDDKHYGKNSYSV 247
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/123 (48%), Positives = 77/123 (62%), Gaps = 1/123 (0%)
Query: 38 KKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN-AMRQIYEHGPLVAIFSVYADF 96
K++ K K KK + Y T HY K ++ V M +I +GP+ A F+VY DF
Sbjct: 214 KEEAKTPKCSKKCEANYNVTYKDDKHYGKNSYSVDSVEKIMTEIMTNGPVEAAFTVYEDF 273
Query: 97 LQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEA 156
L YKSGVYQH G +G HAV++LGWG +N PYW+VANSWN WG+ G F ILRG++E
Sbjct: 274 LSYKSGVYQHRTGQELGGHAVKILGWGEDNGTPYWIVANSWNPDWGNQGFFNILRGKDEC 333
Query: 157 DIE 159
IE
Sbjct: 334 GIE 336
>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 85/189 (44%), Positives = 113/189 (59%), Gaps = 19/189 (10%)
Query: 157 DIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCW 216
D+ NR A +EDD+ +P +FDAR WP C S+RHI DQ+NCGSCW
Sbjct: 72 DLRFVNQNRKPAVENEDDE--------GDDIPESFDARTHWPNCTSIRHIRDQANCGSCW 123
Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTG 275
AVS A+A+SDR+CI SNG IS+ V+C +C +GC+GGWP LA+ F+ + G VTG
Sbjct: 124 AVSTASALSDRICIESNGETQMHISSIDFVSCCESCGYGCDGGWPILAFDFYTYEGAVTG 183
Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGK----LKTPECKQNCYNPSYESTYRFDL 331
GDY S++GC+PY PC HH N T G+ KTP+C++ C SY+ Y D
Sbjct: 184 GDYGSKDGCRPYPFHPCGHH-----GNDTYYGECPKGAKTPKCRRRC-QRSYKKAYYMDK 237
Query: 332 KKGKKAHMV 340
G+ A+ V
Sbjct: 238 SYGEDAYEV 246
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 54/125 (43%), Positives = 82/125 (65%), Gaps = 7/125 (5%)
Query: 37 KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYA 94
K K +++ ++ KK Y+ S Y + A+ VP R+I ++GP+V F+VY
Sbjct: 217 KTPKCRRRCQRSYKKAYYMDKS-----YGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYE 271
Query: 95 DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
DF YK G+Y+H G + G HA++++GWGVEND+PYWL+ANSW++ WG+ G F+++RG N
Sbjct: 272 DFSYYKKGIYKHTAGQARGGHAIKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGIN 331
Query: 155 EADIE 159
E IE
Sbjct: 332 ECGIE 336
>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 85/189 (44%), Positives = 113/189 (59%), Gaps = 19/189 (10%)
Query: 157 DIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCW 216
D+ NR A +EDD+ +P +FDAR WP C S+RHI DQ+NCGSCW
Sbjct: 72 DLRFVNQNRKPAVENEDDE--------GDDIPESFDARTHWPNCTSIRHIRDQANCGSCW 123
Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTG 275
AVS A+A+SDR+CI SNG IS+ V+C +C +GC+GGWP LA+ F+ + G VTG
Sbjct: 124 AVSTASALSDRICIESNGETQMHISSIDFVSCCESCSYGCDGGWPILAFDFYTYEGAVTG 183
Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGK----LKTPECKQNCYNPSYESTYRFDL 331
GDY S++GC+PY PC HH N T G+ KTP+C++ C SY+ Y D
Sbjct: 184 GDYGSKDGCRPYPFHPCGHH-----GNDTYYGECPKGAKTPKCRRRC-QRSYKKAYYMDK 237
Query: 332 KKGKKAHMV 340
G+ A+ V
Sbjct: 238 SYGEDAYEV 246
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 54/125 (43%), Positives = 82/125 (65%), Gaps = 7/125 (5%)
Query: 37 KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYA 94
K K +++ ++ KK Y+ S Y + A+ VP R+I ++GP+V F+VY
Sbjct: 217 KTPKCRRRCQRSYKKAYYMDKS-----YGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYE 271
Query: 95 DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
DF YK G+Y+H G + G HA++++GWGVEND+PYWL+ANSW++ WG+ G F+++RG N
Sbjct: 272 DFSYYKKGIYKHTAGQARGGHAIKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGIN 331
Query: 155 EADIE 159
E IE
Sbjct: 332 ECGIE 336
>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
Length = 332
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 75/158 (47%), Positives = 103/158 (65%), Gaps = 5/158 (3%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
+ LP +FDARE WP CPS+R I DQ +CGSCWA A A+SDR+CI +N ISA+
Sbjct: 80 TEALPSDFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRICIHTNKNV--NISAE 137
Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
++++C +C +GCNGG+P AW++W G+V+GG Y S GCQPY + PCEHHV G Q
Sbjct: 138 NLLSCCYSCGFGCNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPYDIEPCEHHVNGTRQP 197
Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C G +TP+C + C N +Y Y DL G+ ++ +
Sbjct: 198 CAEGG--RTPKCHRTCENENYSVPYDKDLSFGRSSYSI 233
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 50/81 (61%), Positives = 61/81 (75%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I ++GP+ A FSVY+DF+ KSGVY+H G +G HA+R+LGWGVE PYWLVANSWN
Sbjct: 243 EIMDNGPVEAAFSVYSDFMNDKSGVYRHVKGSLLGGHAIRILGWGVEKGTPYWLVANSWN 302
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD GTFKILRG + IE
Sbjct: 303 TDWGDKGTFKILRGSDHCGIE 323
>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
Length = 344
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 83/191 (43%), Positives = 109/191 (57%), Gaps = 4/191 (2%)
Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
R ++ +DI + N L T + LP+ FDAR+ WP CPS+ I DQS
Sbjct: 56 RFKSVSDIRRMLGALPDPNGGHLPTLCTGYTPSLDELPKEFDARKYWPHCPSISEIRDQS 115
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
+CGSCWA A+SDR+CI S G +SA+++VAC +C GCNGG+P AW +W
Sbjct: 116 SCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCSSCGMGCNGGFPHSAWSYWKR 175
Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRF 329
+G+VTG YN +GCQPY PCEHHV GP +C G ++TP+CK C P Y Y
Sbjct: 176 SGIVTGDLYNPTDGCQPYEFPPCEHHVVGPRPSCE--GDVETPKCKTTC-QPGYNIPYNK 232
Query: 330 DLKKGKKAHMV 340
D GK + V
Sbjct: 233 DKWYGKTVYRV 243
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 54/90 (60%), Positives = 65/90 (72%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M+++ EHGP+ F VYADF YKSGVYQH G +G HAVR+LGWG EN +PYWL+ANS
Sbjct: 251 MKEVKEHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWGEENGVPYWLIANS 310
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRV 166
WN WGD+G FKI+RG NE IE N +
Sbjct: 311 WNSDWGDNGYFKIIRGRNECGIESDVNAGI 340
>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
Length = 338
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 74/151 (49%), Positives = 95/151 (62%), Gaps = 8/151 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD R+KWP CP+L + DQ +CGSCWA A++DR C SNG SA+ ++
Sbjct: 84 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLL 143
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G C
Sbjct: 144 SCCPICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCN- 202
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKK 336
G KTP+C++ C ES Y D +K K+
Sbjct: 203 -GDSKTPKCEKTC-----ESNYNVDYRKDKR 227
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 47/81 (58%), Positives = 64/81 (79%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+++++GP+ F+VY+D L YK+GVY+H GD++G HAV++LGWGVEN YWL+ANSWN
Sbjct: 245 ELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNKYWLIANSWN 304
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRGE+ IE
Sbjct: 305 SDWGDNGFFKILRGEDHCGIE 325
>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
Length = 338
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 74/151 (49%), Positives = 95/151 (62%), Gaps = 8/151 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD R+KWP CP+L + DQ +CGSCWA A++DR C SNG SA+ ++
Sbjct: 84 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLL 143
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G C
Sbjct: 144 SCCPICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCN- 202
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKK 336
G KTP+C++ C ES Y D +K K+
Sbjct: 203 -GDSKTPKCEKTC-----ESNYNVDYRKDKR 227
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 47/81 (58%), Positives = 64/81 (79%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+++++GP+ F+VY+D L YK+GVY+H GD++G HAV++LGWGVEN YWL+ANSWN
Sbjct: 245 ELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNKYWLIANSWN 304
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRGE+ IE
Sbjct: 305 SDWGDNGFFKILRGEDHCGIE 325
>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 333
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 74/160 (46%), Positives = 105/160 (65%), Gaps = 4/160 (2%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
++ LP++FD+R+KW CPS+R I DQ +CGSCW+ +I+DR+CI SNG IS
Sbjct: 77 EDTSDLPKSFDSRDKWRMCPSIREIRDQGSCGSCWSFGAVESITDRICIHSNGKVKVHIS 136
Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
A+ ++ C +C GCNGG+ AW +W +NG+VTGG Y+S +GCQPY + CEHHV+GP
Sbjct: 137 AEDLMTCCTSCGMGCNGGFLPQAWHYWVNNGIVTGGQYHSHKGCQPYEIPKCEHHVKGPF 196
Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ C +L TP+C Q C P Y T+ D GKK++ +
Sbjct: 197 KACG--KELPTPKCSQKC-QPGYNKTFNQDKHFGKKSYSI 233
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 53/99 (53%), Positives = 70/99 (70%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ KK++ + ++I +GP+ A F+VYADF YKSGVYQH G +G HAV++L
Sbjct: 225 HFGKKSYSITNNIQQIQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGGHAVKIL 284
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG EN+ PYWL+ANSWN WGD G FKI+RG++E IE
Sbjct: 285 GWGTENNTPYWLIANSWNPTWGDKGYFKIIRGKDECGIE 323
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 78/155 (50%), Positives = 99/155 (63%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAREKWP C S+ I DQS CGSCWA A A+SDR+CI S G ISA+ ++
Sbjct: 85 LPESFDAREKWPHCNSIHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVNISAEDLL 144
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GCNGG P AW +W +G+VTGG Y + +GC+PY+LAPCEHH +G L NCT
Sbjct: 145 DCCDSCGAGCNGGTPAAAWEYWKESGLVTGGLYGTNDGCKPYSLAPCEHHTKGSLPNCT- 203
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G + TP+C C Y Y+ D GKK + +
Sbjct: 204 -GTVPTPKCVHLC-RKGYGKDYQDDKHFGKKVYSI 236
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 59/109 (54%), Positives = 73/109 (66%), Gaps = 2/109 (1%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ KK + + Q I+++GP+ A F V ADFL YKSGVYQH+ D IG HA+R+L
Sbjct: 228 HFGKKVYSISSDEKQIQTEIFKNGPVEADFIVLADFLSYKSGVYQHHSDDVIGGHAIRIL 287
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEAN 169
GWG EN PYWL ANSWN+ WGDHG FKILRG++E IE N + N
Sbjct: 288 GWGTENGTPYWLAANSWNEDWGDHGYFKILRGKDECGIEEDINAGIPKN 336
>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
Length = 330
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR +WP CP+++ I DQ +CGSCWA A AISDR CI SNG + +ISA+ ++
Sbjct: 79 LPDSFDARLQWPNCPTIKEIRDQGSCGSCWAFGAAEAISDRYCIHSNGKVSVEISAEDLL 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG+P AW +W +G+VTGG Y S GC+PY++APCEHHV G CT
Sbjct: 139 SCCDACGMGCMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAPCEHHVNGTRPPCT- 197
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C C N Y +Y+ D + GK+ + V
Sbjct: 198 -GEGDTPKCVSEC-NAGYTPSYKKDKRFGKQTYSV 230
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 57/108 (52%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ + K+ + VP M ++Y++GP+ A FSVY DFL YK+GVYQH G
Sbjct: 213 YTPSYKKDKRFGKQTYSVPPKEQQIMTELYKNGPVEAAFSVYEDFLLYKTGVYQHVTGQM 272
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+++LGWG EN+ PYWLVANSWN WGD+G FKILRG++E IE
Sbjct: 273 LGGHAIKILGWGKENNTPYWLVANSWNTDWGDNGFFKILRGKDECGIE 320
>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
Length = 334
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 100/155 (64%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+RE+WP CP++ I DQ +CGSCWA A A+SDR CI SNG +ISA+ ++
Sbjct: 83 LPESFDSREQWPNCPTISEIRDQGSCGSCWAFGAAEAMSDRHCIHSNGKVNVEISAEDLL 142
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GCNGG+P AW +W G+VTGG YNS GCQPYT+A CEHH +G L C
Sbjct: 143 TCCDSCGMGCNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIASCEHHTKGKLPPCGD 202
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ + TP+C C Y +YR D GKK++ +
Sbjct: 203 I--VDTPQCVHMC-EKGYNVSYRADKYFGKKSYSI 234
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 62/81 (76%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ A F+VYADF+ YKSGVY+H G+ +G HAVR+LGWG E+ PYWLVANSWN
Sbjct: 244 EISTNGPVEAAFTVYADFVTYKSGVYRHVTGEEMGGHAVRILGWGTESGTPYWLVANSWN 303
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD G FKILRG +E IE
Sbjct: 304 TDWGDKGYFKILRGSDECGIE 324
>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
Length = 334
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 100/155 (64%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDARE+WP CP++R I DQ +CGSCWA A+SDR+CI S G ++SA+ +V
Sbjct: 83 LPENFDAREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRICIHSKGKVHFRVSAEDLV 142
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GCQPY ++PCEHHV G C
Sbjct: 143 SCCHTCGFGCNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNGTRGPCN- 201
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ KTP+C + C SY Y D GK ++ +
Sbjct: 202 -GEGKTPKCVKKC-QASYNVPYAKDKFFGKSSYSI 234
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 60/82 (73%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++++ +GP+ F+VY D L YK GVYQH G +G HA+R+LGWGVEND +WL+ANSW
Sbjct: 243 KELFTNGPVEGAFTVYEDLLNYKEGVYQHTAGKMLGGHAIRILGWGVENDTKFWLIANSW 302
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WGD+G FKILRG + IE
Sbjct: 303 NSDWGDNGYFKILRGSDHLGIE 324
>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
Length = 325
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 79/179 (44%), Positives = 108/179 (60%), Gaps = 6/179 (3%)
Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
R + +DI + N + + L T LP++FDAR++W CPS+ I DQS
Sbjct: 59 RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQS 118
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
+CGSCWA A+SDR+CI S G + +SA+++V+C +C GCNGG+P AW +W +
Sbjct: 119 SCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 178
Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
G+VTG YN+ GCQPY PCEHH GPL C G ++TP CK+ C YN SYE+
Sbjct: 179 QGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCD--GDVETPPCKRTCQAGYNVSYEN 235
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 35/58 (60%), Positives = 45/58 (77%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
M+++ +HGP+ F VYADF YKSGVYQH G +G HAVR+LGWG EN++PYWL+A
Sbjct: 254 MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIA 311
>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
Length = 343
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 89/212 (41%), Positives = 113/212 (53%), Gaps = 19/212 (8%)
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGF-----NNRVEANSSEDDDLETMGCQNAKGLPR 189
NS N W H F E MG N R+ S ED D+E +P
Sbjct: 45 NSLNTTWKAHRNFGNDIPLREIKKLMGVRRSLENFRLPEKSMEDIDIE---------IPE 95
Query: 190 NFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT 249
FD RE+WPECP+L+ I DQ +CGSCWA A+SDR+CI S G SA+ ++ C
Sbjct: 96 EFDPREQWPECPTLKEIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAEDLLTCC 155
Query: 250 PNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGK 308
+C +GCNGG P AW +W G+V+GG YNS +GCQPY + PCEHHV G + C G+
Sbjct: 156 SSCGFGCNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKPC---GE 212
Query: 309 LKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C + C Y+ Y D GK A+ V
Sbjct: 213 GDTPRCVKRC-EEGYDVPYGKDRHFGKSAYAV 243
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 53/103 (51%), Positives = 71/103 (68%), Gaps = 2/103 (1%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K A+ VP +++ +GP A +VY DFL Y++GVYQH G ++G HAVR+L
Sbjct: 235 HFGKSAYAVPGSVKAIQKELLLNGPAEAALTVYDDFLHYRTGVYQHVSGGALGGHAVRLL 294
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN 163
GWGVE+ PYWL+ANSWN WGD+G F+ILRG++E IE N
Sbjct: 295 GWGVEDGTPYWLLANSWNYDWGDNGYFRILRGQDECGIESDIN 337
>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
Length = 341
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 95/155 (61%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD R+KWP CP+L + DQ +CGSCWA A++DR C SNG SA+ ++
Sbjct: 87 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLL 146
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G C
Sbjct: 147 SCCPVCGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCN- 205
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C + C SY Y D + GK + V
Sbjct: 206 -GDSKTPKCHKTC-ESSYNVDYHKDKRYGKHVYSV 238
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 46/81 (56%), Positives = 64/81 (79%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
++Y++GP+ F+VY+D L YK+GVY+H G+++G HA+++LGWGVEN YWL+ANSWN
Sbjct: 248 ELYKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNKYWLIANSWN 307
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRGE+ IE
Sbjct: 308 SDWGDNGFFKILRGEDHCGIE 328
>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
Length = 332
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 105/155 (67%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR+ WP C S+ I DQ +CGSCWA A+SDR+CI SNG +SA+++V
Sbjct: 81 LPKEFDARKHWPNCTSIAEIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLV 140
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GC+GG+P AW +W + G+V+GG+Y S++GCQPY++APCEHHV GP C+
Sbjct: 141 SCCDSCGFGCDGGYPASAWDYWQNVGIVSGGNYGSKQGCQPYSIAPCEHHVPGPRPACS- 199
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C+ C S S Y DL G+ A+ +
Sbjct: 200 -GEGSTPDCRNQCDKRSGIS-YDKDLYYGESAYSL 232
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 49/82 (59%), Positives = 64/82 (78%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I ++GP+ A F+VY D + YK GVYQH G +G HA+++LGWGVEND PYWLVANSWN
Sbjct: 242 EILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGWGVENDTPYWLVANSWN 301
Query: 139 DHWGDHGTFKILRGENEADIEM 160
WG++G FKILRG++E IE+
Sbjct: 302 TDWGNNGFFKILRGKDECGIEI 323
>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
Length = 283
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 99/155 (63%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD R+KWPEC +L I DQ +CGSCWA A++DR+CI SN SA+ +V
Sbjct: 43 LPEIFDPRDKWPECLTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 102
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C GCNGG P LAW +W H G+V+GG+YNS +GC+PY + PCEHHV G C
Sbjct: 103 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCN- 161
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C++NC SY ++ D + GK + V
Sbjct: 162 -GDTKTPKCQKNC-ESSYNVPFKKDKRYGKHVYSV 194
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 44/80 (55%), Positives = 65/80 (81%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+++++GP+ A F+VY+D L YK+GVY+H G+++G HA++++GWGVEN+ YWL+ANSWN
Sbjct: 204 ELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWN 263
Query: 139 DHWGDHGTFKILRGENEADI 158
WGD+G FKILRGE+ I
Sbjct: 264 SDWGDNGFFKILRGEDHCGI 283
>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
Length = 331
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 74/158 (46%), Positives = 104/158 (65%), Gaps = 5/158 (3%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
+ +P FDARE WP CPS+R I DQ +CGSCWA A A+SDR+CI ++ ISA+
Sbjct: 79 TESIPDTFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRVCIHTHKNV--NISAE 136
Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
++++C C +GCNGG+P AWRFW + G+V+GG Y S +GCQPY + PCEHHV G +
Sbjct: 137 NLLSCCYTCGFGCNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNGTRKP 196
Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C G +TP+C + C N +Y +Y DL G+ ++ +
Sbjct: 197 CAEGG--RTPKCHKTCDNKNYPISYEKDLSFGRSSYSI 232
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/81 (61%), Positives = 61/81 (75%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
I +GP+ A FSVY+DF+ YKSGVY+H G +G HA+R+LGWG+E PYWLVANSWN
Sbjct: 242 DIMTNGPVEAAFSVYSDFMSYKSGVYRHVKGSLLGGHAIRILGWGMEKGTPYWLVANSWN 301
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+GTFKILRG + IE
Sbjct: 302 TDWGDNGTFKILRGSDHCGIE 322
>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
Length = 334
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 97/155 (62%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD R+KWP CP+L I DQ +CGSCWA A++DR C SNG SA+ ++
Sbjct: 82 LPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLL 141
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C GCNGG P AW +W H G+V+GG+YNS +GC PY + PCEHHV G C
Sbjct: 142 SCCPVCGLGCNGGIPSFAWEYWKHFGIVSGGNYNSSQGCLPYEIPPCEHHVPGNRIPCN- 200
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C ++C Y ++Y+ D K GK + V
Sbjct: 201 -GETSTPKCHRSC-RKEYTNSYKSDKKYGKHVYSV 233
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 47/81 (58%), Positives = 64/81 (79%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I+++GP+ F+VYAD L YKSGVY+H G+++G HA++++GWGVEN YWL+ANSWN
Sbjct: 243 EIFKNGPVEGAFTVYADLLTYKSGVYKHTEGEALGGHAIKIMGWGVENGNKYWLIANSWN 302
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRGE+ IE
Sbjct: 303 SDWGDNGFFKILRGEDHCGIE 323
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
Length = 337
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 96/147 (65%), Gaps = 5/147 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD+RE+WP CP++R I DQ +CGSCWA A+SDR+C+AS G + SA+ +V
Sbjct: 85 LPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCVASGGKIHFRFSAEDLV 144
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG + S GCQPY +APCEHHV G +C
Sbjct: 145 SCCHTCGFGCNGGFPGAAWSYWVRKGLVSGGPFGSNLGCQPYAIAPCEHHVNGTRPSCEG 204
Query: 306 LGKLKTPECKQNC---YNPSYESTYRF 329
G KTP+C + C YN Y+ RF
Sbjct: 205 EGG-KTPKCVKKCQESYNVPYQKDKRF 230
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/82 (58%), Positives = 59/82 (71%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ F+VY D L YK GVYQH G +G HA+R+LGWGVEN YWL+ANSW
Sbjct: 246 KEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVENGTKYWLIANSW 305
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WGD+G FKILRGE+ IE
Sbjct: 306 NSDWGDNGFFKILRGEDHLGIE 327
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 99/155 (63%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAREKW C S+ I DQS CGSCWA A A+SDR+CI S G ISA+ ++
Sbjct: 85 LPESFDAREKWSHCASIHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVDISAEDLL 144
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GCNGG+P AW +W +G+VTGG Y + +GC+PY+LAPCEHH +G L NCT
Sbjct: 145 DCCDSCGAGCNGGYPAAAWEYWKESGLVTGGLYGTSDGCKPYSLAPCEHHTKGSLPNCT- 203
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G + TP+C C Y Y+ D G+K + +
Sbjct: 204 -GTVPTPKCVHLC-RKGYGKDYQDDKHFGRKVYSI 236
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 57/91 (62%), Positives = 70/91 (76%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I+++GP+ A F+VYADFL YKSGVYQH GD +G HA+R+LGWG EN PYWLVANSWN
Sbjct: 246 EIFKNGPVEADFTVYADFLSYKSGVYQHQSGDVLGGHAIRILGWGTENGTPYWLVANSWN 305
Query: 139 DHWGDHGTFKILRGENEADIEMGFNNRVEAN 169
+ WGDHG FKILRG++E IE N + N
Sbjct: 306 EDWGDHGYFKILRGKDECGIEDDINAGIPKN 336
>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 79/179 (44%), Positives = 108/179 (60%), Gaps = 6/179 (3%)
Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
R + +DI + N + + L T LP++FDAR++W CPS+ I DQS
Sbjct: 59 RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQS 118
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
+CGSCWA A+SDR+CI S G + +SA+++V+C +C GCNGG+P AW +W +
Sbjct: 119 SCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 178
Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
G+VTG YN+ GCQPY PCEHH GPL C G ++TP CK+ C YN SYE+
Sbjct: 179 QGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCD--GDVETPPCKRTCQAGYNVSYEN 235
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 53/87 (60%), Positives = 66/87 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M+++ +HGP+ F VYADF YKSGVYQH G +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 254 MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 313
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
WN WGD+G FKI+RG+NE IE N
Sbjct: 314 WNTDWGDNGYFKIIRGKNECGIESDVN 340
>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
Length = 334
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 97/155 (62%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD R+KWP CP+L + DQ +CGSCWA A++DR+C SNG SA+ ++
Sbjct: 82 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRICTYSNGTKHFHFSAEDLL 141
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G C+
Sbjct: 142 SCCPICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRLPCS- 200
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C + C Y+ Y+ D GK + V
Sbjct: 201 -GDTKTPKCVKEC-ESGYKVPYKQDKHYGKHVYSV 233
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 64/81 (79%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
++Y++GP+ F+VYAD L YKSGVY+H GD++G HA++++GWGVEN YWL+ANSWN
Sbjct: 243 ELYKNGPVEGAFTVYADLLSYKSGVYKHVTGDALGGHAIKIMGWGVENGNKYWLIANSWN 302
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRGE+ IE
Sbjct: 303 SDWGDNGFFKILRGEDHCGIE 323
>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 79/179 (44%), Positives = 108/179 (60%), Gaps = 6/179 (3%)
Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
R + +DI + N + + L T LP++FDAR++W CPS+ I DQS
Sbjct: 59 RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQS 118
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
+CGSCWA A+SDR+CI S G + +SA+++V+C +C GCNGG+P AW +W +
Sbjct: 119 SCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 178
Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
G+VTG YN+ GCQPY PCEHH GPL C G ++TP CK+ C YN SYE+
Sbjct: 179 QGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCD--GDVETPPCKRTCQAGYNVSYEN 235
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 53/90 (58%), Positives = 67/90 (74%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M+++ +HGP+ F VYADF YKSGVYQH G +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 254 MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 313
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRV 166
WN WGD+G FKI+RG+NE IE N +
Sbjct: 314 WNTDWGDNGYFKIIRGKNECGIESDVNAGI 343
>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
Length = 341
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 95/155 (61%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD R+KWP CP+L + DQ +CGSCWA A++DR C SNG SA+ ++
Sbjct: 87 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLL 146
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G C
Sbjct: 147 SCCPVCGLGCNGGMPTLAWEYWKHFGLVSGGSYNSGQGCRPYEIPPCEHHVPGNRVPCN- 205
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C + C SY Y D + GK + V
Sbjct: 206 -GDSKTPKCHKTC-EASYSVDYHKDKRYGKHVYSV 238
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 44/81 (54%), Positives = 63/81 (77%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+++++GP+ F+VY+D L YK+GVY+H G+++G HA+++LGWGVEN Y L+ANSWN
Sbjct: 248 ELFKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNKYRLIANSWN 307
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRGE+ IE
Sbjct: 308 SDWGDNGFFKILRGEDHCGIE 328
>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
Length = 342
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 93/155 (60%), Gaps = 9/155 (5%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
+ LP +FDAR WP CP++ I DQ +CGSCWA A+SDR+CI SNG S
Sbjct: 85 DDGDDLPESFDARTAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFS 144
Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
A+ +V+C C +GCNGG+P AW +W H G+V+GG YNS EGC+PY + PCEHHV G
Sbjct: 145 AEDLVSCCHTCGFGCNGGFPGAAWSYWTHKGIVSGGSYNSNEGCRPYEIEPCEHHVNGTR 204
Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
C +TP CK C ES+Y D K K
Sbjct: 205 PPCK---NGRTPSCKHQC-----ESSYSVDYAKDK 231
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 71/111 (63%), Gaps = 6/111 (5%)
Query: 63 HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
H+ K++ + PR R+I +GP+ F+VY D + YKSGVY+H G +G HA+R+
Sbjct: 232 HFGSKSYSIRRNPR-EIQREIMTNGPVEGAFTVYEDLILYKSGVYKHVHGKELGGHAIRI 290
Query: 120 LGWGVEND--IPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
LGWGV D +PYWL+ NSWN WGD+G F+I+RGE+ IE + + A
Sbjct: 291 LGWGVWGDSKVPYWLIGNSWNTDWGDNGFFRIVRGEDHCGIESAISAGLPA 341
>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
Length = 329
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 80/193 (41%), Positives = 112/193 (58%), Gaps = 10/193 (5%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSN 211
G+N ++++ + + L + LP FDAR++WP CP+++ I DQ +
Sbjct: 43 GQNFYNVDLSYVQGLCGTLQNKPTLPELEHPAGVKLPDTFDARQQWPNCPTIQDIRDQGS 102
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHN 270
CGSCWA A AISDRLCI SN T +ISA+ +++C C GC GG+P AW +W +
Sbjct: 103 CGSCWAFGAAEAISDRLCIHSNAKITVEISAEDLLSCCEECGMGCFGGYPSAAWEYWAKS 162
Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYESTY 327
G+VTGG Y S +GC+PY++ PCEHHV G C G+ TP+C+ C Y P+YE
Sbjct: 163 GLVTGGLYGSNKGCRPYSIPPCEHHVNGTRPPCQ--GEGDTPKCQTKCIDGYTPAYEKDK 220
Query: 328 RFDLKKGKKAHMV 340
F GKK + V
Sbjct: 221 YF----GKKTYSV 229
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 58/108 (53%), Positives = 74/108 (68%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P ++ KK + VP + M ++Y++GP+ A FSVY DFL YKSGVYQH GD
Sbjct: 212 YTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGPVEAAFSVYEDFLLYKSGVYQHLTGDM 271
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+++LGWG EN+ PYWL ANSWN WG+ G FKILRG +E IE
Sbjct: 272 LGGHAIKILGWGKENNTPYWLAANSWNTDWGNQGFFKILRGGDECGIE 319
>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
Length = 334
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 98/155 (63%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD R+KWP CP+L + DQ +CGSCWA A++DR+C SNG SA+ ++
Sbjct: 82 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 141
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G C+
Sbjct: 142 SCCPICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPYEIPPCEHHVPGNRLPCS- 200
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C + C + +Y Y+ D GK + V
Sbjct: 201 -GDTKTPKCIKKCED-NYNVAYKQDKHYGKHIYSV 233
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 64/81 (79%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
++Y++GP+ F+VYAD L YKSGVY+H GD++G HA++++GWGVEN YWL+ANSWN
Sbjct: 243 ELYKNGPVEGAFTVYADLLSYKSGVYKHVAGDALGGHAIKIMGWGVENGNKYWLIANSWN 302
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRGE+ IE
Sbjct: 303 SDWGDNGFFKILRGEDHCGIE 323
>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 98/155 (63%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR +WP CP+L+ I DQ +CGSCWA A AISDR+CI SN + +IS++ ++
Sbjct: 79 LPTEFDARAQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNARVSVEISSEDLL 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GCNGG+P AW FW G+VTGG Y+S GC+PYT+ PCEHHV G CT
Sbjct: 139 TCCESCGMGCNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTG 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C C Y +Y+ D GK ++ V
Sbjct: 199 EGG-DTPQCINQC-ESGYTPSYKKDKHYGKTSYSV 231
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/108 (50%), Positives = 69/108 (63%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ HY K ++ V +IY++GP+ F VY DF YKSGVYQH G
Sbjct: 214 YTPSYKKDKHYGKTSYSVEANENQIQTEIYKNGPVEGAFMVYEDFPMYKSGVYQHVSGSL 273
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
IG HA+++LGWGVE+ +PYWL ANSWN WGD+G FKILRG + IE
Sbjct: 274 IGGHAIKILGWGVEDGVPYWLCANSWNTDWGDNGYFKILRGSDHCGIE 321
>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
Length = 335
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 77/157 (49%), Positives = 97/157 (61%), Gaps = 4/157 (2%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
K LP +FDAREKW C S+ I DQS CGSCWA A+SDR+CI S G ISA+
Sbjct: 82 KDLPESFDAREKWSHCNSIHVIRDQSTCGSCWAFGATEAMSDRVCIHSKGKVQVNISAED 141
Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
++ C +C GCNGG+P AW F+ +G+VTGG Y + +GCQPY PCEHH GPL NC
Sbjct: 142 LLTCCDSCGAGCNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPPCEHHTVGPLPNC 201
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
T G TP+C ++C YE +Y D KK + +
Sbjct: 202 T--GIKPTPQCVRDC-RKGYEKSYSEDKHYAKKVYTL 235
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 56/106 (52%), Positives = 74/106 (69%), Gaps = 2/106 (1%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY KK + + +I+++GP+ A F+VYADF+ YKSGVYQ + D++G HA+R+L
Sbjct: 227 HYAKKVYTLSADETQIKTEIFKNGPVEADFTVYADFVSYKSGVYQRHSDDALGGHAIRIL 286
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
GWG EN +PYWLVANSWN+ WGD G FKILRG +E IE N +
Sbjct: 287 GWGTENGVPYWLVANSWNEDWGDKGYFKILRGNDECGIEDDINAGI 332
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
Length = 334
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 102/159 (64%), Gaps = 3/159 (1%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+A +P +FDAR++WP CP++R I DQ +CGSCWA A+SDR+CI S G ++SA
Sbjct: 76 DAMEIPDDFDARKQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKGAVNVRLSA 135
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
+V+C +C GCNGG+P AW +W + G+V+GG + S +GC+PY +APCEHHV G
Sbjct: 136 DDLVSCCYSCGMGCNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPYEIAPCEHHVNGTRP 195
Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
CT KTP CKQ C Y Y+ D GK+A+ +
Sbjct: 196 PCTGDDN-KTPSCKQQC-EKGYNVPYKKDKNFGKEAYSI 232
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 50/94 (53%), Positives = 64/94 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ F VY D L YK GVYQH G+++G HA+R+LGWG E PYWL+ANSW
Sbjct: 241 KEIMTNGPVEGAFEVYEDLLSYKKGVYQHVKGEALGGHAIRILGWGTEKGTPYWLIANSW 300
Query: 138 NDHWGDHGTFKILRGENEADIEMGFNNRVEANSS 171
N WGD+GTFKILRGE+ IE + +SS
Sbjct: 301 NSDWGDNGTFKILRGEDHCGIESSIVAGIPKDSS 334
>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
Length = 330
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 103/156 (66%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 71 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 130
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 131 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 190
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C ++C P Y TY+ D G ++ V
Sbjct: 191 --GEGDTPKCSKSC-EPGYSPTYKQDKHYGYDSYSV 223
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 74/108 (68%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVYADFL YKSGVYQH G+
Sbjct: 206 YSPTYKQDKHYGYDSYSVSNNERDIMAEIYKNGPVEGAFSVYADFLLYKSGVYQHVTGEM 265
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLV NSWN WGD+G FKILRG++ IE
Sbjct: 266 MGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIE 313
>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
Length = 342
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 77/170 (45%), Positives = 107/170 (62%), Gaps = 13/170 (7%)
Query: 173 DDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIAS 232
DDD++ LP NFDARE+WP+CP+++ I DQ +CGSCWA AISDR+C+ +
Sbjct: 77 DDDMK---------LPENFDAREQWPKCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHT 127
Query: 233 NGYFTGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
NGY T ++SA+ +++C C GCNGG+P AW++W G+V+GG Y+S GC+PY++
Sbjct: 128 NGYITIEVSAEDLLSCCGLQCGEGCNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPYSIP 187
Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
PCEHHV G CT G TP+C + C Y Y+ D G A+ V
Sbjct: 188 PCEHHVNGSRPACTGEGG-DTPKCNKKC-EAGYSPDYKDDKHYGTTAYNV 235
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 65/117 (55%), Positives = 78/117 (66%), Gaps = 2/117 (1%)
Query: 45 KKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSG 102
K KK + Y P HY A+ VP M +IY++GP+ F VYADFLQYKSG
Sbjct: 209 KCNKKCEAGYSPDYKDDKHYGTTAYNVPSSEKEIMAEIYKNGPVEGAFIVYADFLQYKSG 268
Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
VYQH GD +G HA+RVLGWGVE+ +PYWL ANSWN WGD+G FKILRG++ IE
Sbjct: 269 VYQHVTGDMLGGHAIRVLGWGVEDGVPYWLAANSWNTDWGDNGFFKILRGKDHCGIE 325
>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
Length = 342
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 77/170 (45%), Positives = 100/170 (58%), Gaps = 6/170 (3%)
Query: 174 DDLETMG--CQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIA 231
D E +G Q +P+ FDAREKWP CP++ I DQ +CGSCWA A+SDR+CI
Sbjct: 73 DKQEVLGYLSQKVDDIPKEFDAREKWPNCPTINEIRDQGSCGSCWAFGAVEAMSDRVCIH 132
Query: 232 SNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
SNG + SA +V+C C +GCNGG+P AW +W G+V+GG Y S+ GC+PY +A
Sbjct: 133 SNGNVNFRFSADDLVSCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGRYGSKTGCRPYEIA 192
Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
PCEHHV G C KTP+C+ C Y Y D G K++ V
Sbjct: 193 PCEHHVNGTRAPCNH--DSKTPKCQHQC-EAGYNVEYSKDKHFGSKSYSV 239
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 69/101 (68%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K++ V R + +I +GP+ F+VY D + YKSGVYQH G +G HA+R+L
Sbjct: 231 HFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRIL 290
Query: 121 GWGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV + ++PYWL+ANSWND WGD G F+ILRGE+ IE
Sbjct: 291 GWGVWGKEEVPYWLIANSWNDDWGDKGFFRILRGEDHCGIE 331
>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 96/155 (61%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE+WP CP++R + DQ +CGSCWA A+SDR+CI SNG SA+++V
Sbjct: 91 LPETFDARERWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAENLV 150
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S GC PY +APCEHHV G C
Sbjct: 151 SCCWTCGFGCNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKE 210
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP C + C Y+ Y DL GK A+ +
Sbjct: 211 GG--KTPTCVKKC-EEGYKVPYAQDLHHGKSAYSI 242
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 82/134 (61%), Gaps = 2/134 (1%)
Query: 37 KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYAD 95
K+ K KK ++ +P + L H + + +RQ IY +GP+ F+VY D
Sbjct: 209 KEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYED 268
Query: 96 FLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANSWNDHWGDHGTFKILRGEN 154
F+ Y++GVY+H G ++G HA+R+LGWGV+N +IPYWLVANSWN WG G FKILRG +
Sbjct: 269 FIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSD 328
Query: 155 EADIEMGFNNRVEA 168
E IE N + A
Sbjct: 329 ECGIEGQINAGLPA 342
>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
Length = 340
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 79/193 (40%), Positives = 115/193 (59%), Gaps = 8/193 (4%)
Query: 152 GENEADIEMGFNNRVEAN--SSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQ 209
G N +++MG+ R+ M ++ K LP +FDARE+WP+CP+++ I DQ
Sbjct: 44 GHNFYNVDMGYLKRLCGTFLGGPKPPQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQ 102
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFW 267
+CGSCWA AISDR+CI +N + + ++SA+ ++ C + GCNGG+P AW FW
Sbjct: 103 GSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFW 162
Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
G+V+GG Y S GC+PY++ PCEHHV G CT G+ TP+C + C P Y TY
Sbjct: 163 TRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKCSKIC-EPGYSPTY 219
Query: 328 RFDLKKGKKAHMV 340
+ D G ++ V
Sbjct: 220 KQDKHYGYNSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
Length = 351
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 103/156 (66%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 92 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 151
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 152 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 211
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C ++C P Y TY+ D G ++ V
Sbjct: 212 --GEGDTPKCSKSC-EPGYTPTYKQDKHYGYNSYSV 244
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 59/108 (54%), Positives = 74/108 (68%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 227 YTPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 286
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLV NSWN WGD+G FKILRG++ IE
Sbjct: 287 MGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIE 334
>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
Length = 332
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 104/160 (65%), Gaps = 6/160 (3%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
NA+ +P FD+R +W CP+++ + DQ +CGSCWA + A A+SDR C+ASNG +S+
Sbjct: 76 NAQDIPDTFDSRTQWANCPTIKEVRDQGSCGSCWAEAAAEAMSDRTCVASNGKVQVHLSS 135
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
++++AC C GC+GG+P+ AW +W +G+VTGG Y S +GCQPY +APCEHH+ G
Sbjct: 136 ENLMACCETCGMGCHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPYEIAPCEHHINGSRP 195
Query: 302 NCTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C GK++ TP CK+ C Y T+ D K A+ V
Sbjct: 196 AC---GKIEPTPRCKKTC-ESGYNVTFNKDKHYAKSAYSV 231
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 67/99 (67%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY K A+ V +I +GP+ A F+VYADF YKSGVYQH G +G HAV+++
Sbjct: 223 HYAKSAYSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMI 282
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG+E PYWL+ANSWN WGD G FKILRG++E IE
Sbjct: 283 GWGMEGSTPYWLIANSWNSDWGDMGFFKILRGQDECGIE 321
>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
Length = 339
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 79/193 (40%), Positives = 115/193 (59%), Gaps = 8/193 (4%)
Query: 152 GENEADIEMGFNNRVEAN--SSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQ 209
G N +++MG+ R+ M ++ K LP +FDARE+WP+CP+++ I DQ
Sbjct: 44 GHNFYNVDMGYLKRLCGTFLGGPKPPQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQ 102
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFW 267
+CGSCWA AISDR+CI +N + + ++SA+ ++ C + GCNGG+P AW FW
Sbjct: 103 GSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFW 162
Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
G+V+GG Y S GC+PY++ PCEHHV G CT G+ TP+C + C P Y TY
Sbjct: 163 TRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKCSKIC-EPGYSPTY 219
Query: 328 RFDLKKGKKAHMV 340
+ D G ++ V
Sbjct: 220 KQDKHYGYNSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
Length = 340
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 102/156 (65%), Gaps = 4/156 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP CP+++ I DQ +CGSCWA A+SDRLCI +NG+ ++SA+ ++
Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAMSDRLCIHTNGHVNVEVSAEDLL 139
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C P C GCNGG+P AW++W G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 140 SCCGPLCGEGCNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPYSIPPCEHHVNGTRPKCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C + C P Y +Y+ D G ++ V
Sbjct: 200 GEGG-DTPKCSKTC-EPGYSPSYKEDKYYGYSSYSV 233
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 57/108 (52%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ +Y ++ VP M +IY++GP+ A FSV++DFL YKSGVY+H G+
Sbjct: 216 YSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKNGPVEAAFSVFSDFLTYKSGVYKHVAGEV 275
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWG EN +PYWLV NSWN WGD+G FKILRGE+ IE
Sbjct: 276 LGGHAIRILGWGKENGVPYWLVGNSWNVDWGDNGFFKILRGEDHCGIE 323
>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 78/179 (43%), Positives = 108/179 (60%), Gaps = 6/179 (3%)
Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
R + +DI + N + + L T LP++FDAR++W CPS+ I DQS
Sbjct: 59 RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQS 118
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
+CGSCWA A+SDR+CI S G + +SA+++V+C +C GCNGG+P AW +W +
Sbjct: 119 SCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 178
Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
G+VTG YN+ GCQPY PCEH+ GPL C G ++TP CK+ C YN SYE+
Sbjct: 179 QGIVTGDLYNTTNGCQPYEFPPCEHNTLGPLPVCD--GDVETPPCKRTCQAGYNVSYEN 235
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 53/87 (60%), Positives = 66/87 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M+++ +HGP+ F VYADF YKSGVYQH G +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 254 MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 313
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
WN WGD+G FKI+RG+NE IE N
Sbjct: 314 WNTDWGDNGYFKIIRGKNECGIESDVN 340
>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 98/155 (63%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDARE WP CP++R + DQ +CGSCWA A+SDR+CI S G SA+++V
Sbjct: 89 LPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAENLV 148
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S+ GC PY +APCEHHV G C
Sbjct: 149 SCCRTCGFGCNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGPCKE 208
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP C + C + Y+ Y DL +GK A+ +
Sbjct: 209 GG--KTPACVKKCED-GYKVPYAQDLHRGKSAYSL 240
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 50/92 (54%), Positives = 67/92 (72%), Gaps = 1/92 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANS 136
++IY +GP+ F+VY DF+ Y++GVY+H G ++G HA+R+LGWGV+N +IPYWLVANS
Sbjct: 249 QEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANS 308
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
WN WG G FKILRG +E IE N + A
Sbjct: 309 WNSDWGSDGFFKILRGSDECGIEGQINAGLPA 340
>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
Length = 331
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 72/165 (43%), Positives = 106/165 (64%), Gaps = 13/165 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR++WP CPS+ I DQ +CGSCWA A+SDR+CI SNG +SA+++V
Sbjct: 81 IPDEFDARKQWPNCPSITDIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLV 140
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GC+GG+P AW +W + G+V+GG+Y S++GCQPY++APCEHHV G C+
Sbjct: 141 SCCDSCGYGCDGGFPASAWDYWQNEGIVSGGNYGSKQGCQPYSIAPCEHHVPGSRPACS- 199
Query: 306 LGKLKTPECKQNC-------YNPSY---ESTYRFDLKKGKKAHMV 340
G TP+C+ C Y+ + E+ Y D K +A ++
Sbjct: 200 -GGGDTPDCRNQCDEGSGISYDQDHYYGETVYTLDEAKQIQAEIL 243
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 50/81 (61%), Positives = 64/81 (79%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I ++GP+ A F+VY D L YK GVYQH G+++G HA+++LGWGVEND PYWLVANSWN
Sbjct: 241 EILKNGPVEAAFTVYEDLLNYKEGVYQHVAGEALGGHAIKILGWGVENDTPYWLVANSWN 300
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG++G FKILRG +E IE
Sbjct: 301 TDWGNNGFFKILRGSDECGIE 321
>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
Length = 335
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 75/156 (48%), Positives = 100/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI SNG ++SA+ ++
Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG YNS GC+PY++ PCEHHV G CT
Sbjct: 140 TCCDGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y +Y+ D G ++ V
Sbjct: 200 --GEGDTPKCSKTC-EPGYSPSYKEDKHFGCSSYSV 232
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 65/83 (78%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ FSVY+DFL YKSGVYQH G+ +G HA+R+LGWGVEN PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG++ IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322
>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
Length = 338
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 95/155 (61%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD R+KWP CP+L I DQ +CGSCWA A++DR+C S+G SA+ ++
Sbjct: 84 LPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCTYSDGTKHFHFSAEDLL 143
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C GCNGG P LAW +W H G+V+GG YNS +GC PY + PCEHHV G C
Sbjct: 144 SCCPICGLGCNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPYEVPPCEHHVPGNRLPCN- 202
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C++ C Y ++ D GK + V
Sbjct: 203 -GDTKTPKCQKTC-EAGYNVPFKKDKHYGKHVYSV 235
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 53/99 (53%), Positives = 69/99 (69%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY K + V N +++++GP+ F+VY+D L YKSGVYQH G ++G HAV++L
Sbjct: 227 HYGKHVYSVSGNEDNIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTDGSALGGHAVKIL 286
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL+ANSWN WGD+G FKILRGE+ IE
Sbjct: 287 GWGVENGSKYWLIANSWNSDWGDNGFFKILRGEDHCGIE 325
>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
3.2 Angstrom Resolution
gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
Resolution
gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
Angstrom Resolution
Length = 317
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 64 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 123
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 124 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 183
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 184 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 216
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 199 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 258
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 259 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 306
>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
Length = 339
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 74/156 (47%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDARE WP CP+++ I DQ +CGSCWA AISDR+CI +NG+ ++SA+ ++
Sbjct: 80 LPENFDAREHWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGHVNVEVSAEDML 139
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y +Y+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYTPSYKEDKHYGCNSYSV 232
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 57/83 (68%), Positives = 67/83 (80%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ A FSV++DFLQYKSGVYQH G+ +G HAVR+LGWGVEND PYWLV NS
Sbjct: 240 MAEIYKNGPVEAAFSVFSDFLQYKSGVYQHVTGEMMGGHAVRILGWGVENDTPYWLVGNS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGDHG FKILRG + IE
Sbjct: 300 WNTDWGDHGFFKILRGRDHCGIE 322
>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
Length = 339
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
Length = 337
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 78/162 (48%), Positives = 102/162 (62%), Gaps = 5/162 (3%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
A+ +P +FDARE+WP C S+ +I DQS+CGSCWAV+ A ISDR CIASNG ISA+
Sbjct: 72 AENIPDHFDAREQWPNCVSIDNIRDQSDCGSCWAVAAAETISDRTCIASNGEVNVLISAE 131
Query: 244 HIVACTP---NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
+++C NC GC GG+P AWR+W HNG+VTGG Y SQ GC+PY++APC V G
Sbjct: 132 DLLSCCTGGYNCGDGCEGGYPIQAWRYWVHNGLVTGGSYESQYGCKPYSIAPCGQTVNGV 191
Query: 300 LQNCTLLGKLKTPECKQNCYNPS-YESTYRFDLKKGKKAHMV 340
++ TPEC + C + S Y Y D G A+ +
Sbjct: 192 TWPKCAADEVATPECVKQCTSKSDYAVPYDQDKHYGSSAYAI 233
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY A+ + + A Q I +GP+ F VY+DF QYKSG+Y+H G +G HAV++L
Sbjct: 225 HYGSSAYAIRQNVAQIQTEIMRNGPVEVGFLVYSDFYQYKSGIYKHVAGRELGGHAVKIL 284
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN PYWL ANSWN +WG+ G F+I RG NE IE
Sbjct: 285 GWGVENGTPYWLAANSWNVNWGEKGYFRIRRGTNECGIE 323
>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 78/179 (43%), Positives = 107/179 (59%), Gaps = 6/179 (3%)
Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
R + +DI + N + + L T LP++FDAR++W CPS+ I DQS
Sbjct: 59 RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTVNELPKSFDARKEWTHCPSISEIRDQS 118
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
+CGS WA A+SDR+CI S G + +SA+++V+C +C GCNGG+P AW +W +
Sbjct: 119 SCGSYWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 178
Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
G+VTG YN+ GCQPY PCEHH GPL C G ++TP CK+ C YN SYE+
Sbjct: 179 QGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCD--GDVETPPCKRTCQAGYNVSYEN 235
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 53/87 (60%), Positives = 66/87 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M+++ +HGP+ F VYADF YKSGVYQH G +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 254 MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 313
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
WN WGD+G FKI+RG+NE IE N
Sbjct: 314 WNTDWGDNGYFKIIRGKNECGIESDVN 340
>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
Length = 339
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
Length = 339
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
Length = 248
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 96/155 (61%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE+WP CP++R + DQ +CGSCWA A+SDR+CI SNG SA+++V
Sbjct: 26 LPETFDARERWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAENLV 85
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S GC PY +APCEHHV G C
Sbjct: 86 SCCWTCGFGCNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKE 145
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP C + C Y+ Y DL GK A+ +
Sbjct: 146 GG--KTPTCVKKC-EEGYKVPYAQDLHHGKSAYSI 177
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 66/104 (63%), Gaps = 2/104 (1%)
Query: 37 KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYAD 95
K+ K KK ++ +P + L H + + +RQ IY +GP+ F+VY D
Sbjct: 144 KEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYED 203
Query: 96 FLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANSWN 138
F+ Y++GVY+H G ++G HA+R+LGWGV+N +IPYWLVANSWN
Sbjct: 204 FIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 247
>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
AltName: Full=Cathepsin B1; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
Length = 339
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPESFDAREQWPQCPTVKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 74/108 (68%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
Length = 256
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 3 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 62
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 63 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 122
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 123 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 155
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 138 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 197
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 198 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 245
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
Length = 340
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 91/157 (57%), Gaps = 5/157 (3%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
K LP FD+ + WP CP++R I DQ +CGSCWA A+SDR+CI SN SA
Sbjct: 86 KDLPEEFDSSKNWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADD 145
Query: 245 IVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+V C C +GCNGG+P AW +W G+V+GG YNS EGC+PY + PCEHHV GP C
Sbjct: 146 LVTCCHTCGFGCNGGFPGAAWSYWTTRGIVSGGSYNSTEGCRPYEVEPCEHHVDGPRPPC 205
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP CK C P+Y Y D G ++ +
Sbjct: 206 H---SGSTPHCKHQC-QPNYSVDYEKDKHFGASSYSI 238
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/90 (53%), Positives = 65/90 (72%), Gaps = 3/90 (3%)
Query: 72 PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIP 129
PR N R+I +GP+ F+VY D + YK+GVYQH G +G HA+R++GWGV E+ +P
Sbjct: 242 PR-NIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGKQLGGHAIRIIGWGVWGESKVP 300
Query: 130 YWLVANSWNDHWGDHGTFKILRGENEADIE 159
YWL+ANSWN WGD+G F+ILRG++ IE
Sbjct: 301 YWLIANSWNTDWGDNGFFRILRGKDHCGIE 330
>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
Length = 332
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 103/160 (64%), Gaps = 6/160 (3%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
NA+ +P FD+R +W CP+++ + DQ +CGSCWA++ A+SDR+C+AS G ISA
Sbjct: 76 NAQDIPDTFDSRTQWANCPTIKEVRDQGSCGSCWALAAVEAMSDRICVASKGSTMAHISA 135
Query: 243 QHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
+ + +C +C GCNGG+P+ AW +W +G+VTGG Y S +GCQPY + PCEHH+ G
Sbjct: 136 EDLNSCCKSCGNGCNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPYEIKPCEHHINGSRP 195
Query: 302 NCTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C GKL+ TP CK++C Y T+ D K A+ V
Sbjct: 196 AC---GKLEPTPRCKKSC-ESGYNVTFAKDKHYAKTAYSV 231
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 53/99 (53%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY K A+ V +I +GP+ A F+VYADF YKSGVYQH G +G HAV+++
Sbjct: 223 HYAKTAYSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMI 282
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E PYWL+ANSWN WG+ G FKILRG++E IE
Sbjct: 283 GWGTEGSTPYWLIANSWNTDWGNMGFFKILRGQDECGIE 321
>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
Length = 261
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 2 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 61
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 62 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 121
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 122 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 154
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 137 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 196
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 197 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 244
>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
Length = 332
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 100/156 (64%), Gaps = 4/156 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FD+R WP CP++ I DQ +CGSCWA +SDR CI S G S++++V
Sbjct: 80 MPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSSENLV 139
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P A+++W H+G+V+GG +NS +GCQPY +APCEHHV GP C+
Sbjct: 140 SCCHLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVPGPRPKCSE 199
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
G TP+C + C N Y Y DL G KA+ ++
Sbjct: 200 GG--GTPKCVKRCEN-GYTVDYESDLHHGGKAYSIM 232
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I ++GP+ F+VY DFL YKSGVYQH G +G HA+R+LGWG EN PYWL ANSWN
Sbjct: 241 EIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRILGWGEENGTPYWLCANSWN 300
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRG + IE
Sbjct: 301 TDWGDNGLFKILRGSDHCGIE 321
>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
Length = 339
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 102/156 (65%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI +NG+ ++SA+ ++
Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDML 139
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y +Y+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPSYKEDKHYGCSSYSV 232
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 65/83 (78%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ A F+VY+DFL YKSGVYQH G+ +G HAVR+LGWGVE+ PYWLV NS
Sbjct: 240 MAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVEDGTPYWLVGNS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG + IE
Sbjct: 300 WNTDWGDNGFFKILRGRDHCGIE 322
>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
Length = 339
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
Length = 339
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
Length = 331
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 99/156 (63%), Gaps = 4/156 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FD+R WP CP++ I DQ +CGSCWA +SDR CI S G SA+++V
Sbjct: 79 LPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLV 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P A+++W H+G+V+GG +NS +GCQPY +APCEHHV GP C+
Sbjct: 139 SCCHLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVPGPRPKCSE 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
G TP+C + C Y Y DL G KA+ ++
Sbjct: 199 GG--GTPKCAKTC-EKGYIVDYESDLHHGGKAYSIM 231
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 50/81 (61%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I ++GP+ F+VY DFL YKSGVYQH G +G HA+RVLGWG EN PYWL ANSWN
Sbjct: 240 EIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWGEENGTPYWLCANSWN 299
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRG + IE
Sbjct: 300 TDWGDNGLFKILRGSDHCGIE 320
>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
Length = 254
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 1 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 60
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 61 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 120
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 121 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 153
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 136 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 195
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 196 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 243
>gi|161343829|tpg|DAA06095.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 280
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/154 (46%), Positives = 105/154 (68%), Gaps = 2/154 (1%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
GLP NFDAR++WP CPS+ HI +Q NC S +A+SVA+A++DR+CI SN +SAQ
Sbjct: 60 TNGLPINFDARKRWPNCPSIGHIYNQGNCRSSYAISVASAVTDRICIHSNETKNPIMSAQ 119
Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH-HVQGPLQ 301
I++C C +GC+GG +W F+ +G V+GGDYNS +GCQPY + PC+ + + P
Sbjct: 120 QIISCCYLCGYGCDGGSQFESWDFYRRHGFVSGGDYNSNQGCQPYMIPPCKLINEKSPRH 179
Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
+CT + +TP C+ C NP+Y S+++ D+ KGK
Sbjct: 180 SCTTYNREETPACEIKCNNPNYYSSFKTDIYKGK 213
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 44/76 (57%), Gaps = 3/76 (3%)
Query: 57 TSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN---FGDSIG 113
+S Y K + V AM++I+++GP+ F +Y D + YKSGVYQ++ +GD
Sbjct: 203 SSFKTDIYKGKYYQVYPFMAMKEIFDNGPITTQFYMYRDLIDYKSGVYQYDEGFYGDFFT 262
Query: 114 LHAVRVLGWGVENDIP 129
+ +++GWG EN P
Sbjct: 263 VQGXKIIGWGEENGDP 278
>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
Length = 339
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 102/156 (65%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + C GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGSRCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
Length = 331
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 99/156 (63%), Gaps = 4/156 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FD+R WP CP++ I DQ +CGSCWA +SDR CI S G SA+++V
Sbjct: 79 MPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLV 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P A+++W H+G+V+GG +NS +GCQPY +APCEHHV GP C+
Sbjct: 139 SCCHLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVSGPRPKCSE 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
G TP+C + C Y Y DL G KA+ ++
Sbjct: 199 GG--GTPKCAKTC-EKGYIVDYESDLHHGGKAYSIM 231
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 50/81 (61%), Positives = 58/81 (71%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ F+VY DFL YKSGVYQH G +G HA+RVLGWG EN PYWL ANSWN
Sbjct: 240 EIMNNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWGEENGTPYWLCANSWN 299
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRG + IE
Sbjct: 300 TDWGDNGLFKILRGSDHCGIE 320
>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
Length = 246
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 98/155 (63%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDARE WP CP++R + DQ +CGSCWA A+SDR+CI S G SA+++V
Sbjct: 24 LPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAENLV 83
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S+ GC PY +APCEHHV G C
Sbjct: 84 SCCWTCGFGCNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGPCKE 143
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP C + C + Y+ Y DL +GK A+ +
Sbjct: 144 GG--KTPACVKKCED-GYKVPYAQDLHRGKSAYSL 175
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/62 (58%), Positives = 51/62 (82%), Gaps = 1/62 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANS 136
++IY +GP+ F+VY DF+ Y++GVY+H G ++G HA+R+LGWGV+N +IPYWLVANS
Sbjct: 184 QEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANS 243
Query: 137 WN 138
WN
Sbjct: 244 WN 245
>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
Length = 333
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 98/155 (63%), Gaps = 5/155 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFD+R+KWP CP++ I DQ +CGSCWA A+SDRLCI SN +SA++++
Sbjct: 84 IPENFDSRQKWPHCPTISLIRDQGSCGSCWAFGAVEAMSDRLCIHSNKIV--NVSAENLL 141
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GCNGG+P AW FW G+V+GG Y S +GCQPY +APCEHH G C+
Sbjct: 142 SCCYSCGFGCNGGFPGAAWSFWKKKGLVSGGLYGSHKGCQPYAIAPCEHHANGTRPPCS- 200
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G +TP+C C N Y Y D G+ ++ V
Sbjct: 201 -GGGRTPKCHTFCENEDYSLPYEKDKSFGRSSYSV 234
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 52/81 (64%), Positives = 63/81 (77%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ A FSVY+DFL YKSGVY+H G +G HA+R+LGWGVEN PYWLVANSWN
Sbjct: 244 EIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLLGGHAIRILGWGVENGTPYWLVANSWN 303
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+GTFKIL+G + IE
Sbjct: 304 TDWGDNGTFKILKGSDHCGIE 324
>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
Length = 339
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
Length = 339
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/142 (51%), Positives = 95/142 (66%), Gaps = 7/142 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDARE+W CP+++ I DQ +CGSCWA A+SDRLCI +NG+ ++SA+ ++
Sbjct: 80 LPENFDAREQWSNCPTIKQIRDQGSCGSCWAFGAVGAMSDRLCIHTNGHVNVEVSAEDLL 139
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEHHV G CT
Sbjct: 140 TCCGSQCGDGCNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPCEHHVNGSRPQCT 199
Query: 305 LLGKLKTPECKQNC---YNPSY 323
G+ TP+C ++C Y+PSY
Sbjct: 200 --GEGDTPKCTKSCEAGYSPSY 219
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 67/83 (80%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWGVEN +PYWLVANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDIMGGHAIRILGWGVENSVPYWLVANS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGE+ IE
Sbjct: 300 WNVDWGDNGLFKILRGEDHCGIE 322
>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
Length = 311
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 54/97 (55%), Positives = 67/97 (69%), Gaps = 2/97 (2%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFK 148
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FK
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFK 311
>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
sinensis]
gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 72/143 (50%), Positives = 90/143 (62%), Gaps = 2/143 (1%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+A LP+NFDAR KWP CPS+ I DQS CGSCWA A+SDRLCI SNG F +SA
Sbjct: 82 DAMRLPKNFDARTKWPHCPSISEIRDQSGCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSA 141
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
+++C NC +GC+GG+P +AW +WG +G+VTGG GC+ Y CEHHVQG
Sbjct: 142 VDLLSCCENCGYGCSGGYPAVAWDYWGAHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYP 201
Query: 302 NCTLLGKLKTPECKQNCYNPSYE 324
C TPEC Q+C P +
Sbjct: 202 PCP-HQYYPTPECVQHCDTPGID 223
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 48/83 (57%), Positives = 62/83 (74%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M++I GP+ A+F+VY DFLQYK GVY H++G + HA+R+LGWG E D+PYWL+ANS
Sbjct: 245 MKEIMLRGPVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWGEEGDVPYWLIANS 304
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN+ WG+ G K LRG NE IE
Sbjct: 305 WNEDWGEKGYMKFLRGLNECGIE 327
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
Length = 337
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 98/155 (63%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAREKW C S+ I DQS CGSCWA A A+SDR+CI S G ISA+ ++
Sbjct: 85 LPESFDAREKWSHCASINLIRDQSTCGSCWAFGAAEAMSDRVCIHSEGGIQVNISAEDLL 144
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GC+GG+P AW +W +G+V+ G Y + +GC+PY+LAPCEHH +G L NCT
Sbjct: 145 DCCDSCGAGCDGGYPAAAWEYWKESGLVSDGLYGTPDGCKPYSLAPCEHHTKGSLPNCT- 203
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G + TP+C C Y Y+ D GKK + +
Sbjct: 204 -GTVPTPKCVHLC-RKGYGKDYQHDKHFGKKVYSI 236
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 60/106 (56%), Positives = 76/106 (71%), Gaps = 2/106 (1%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ KK + + Q I+++GP+ A F+VYADFL YKSGVYQH+ GD +G HA+R+L
Sbjct: 228 HFGKKVYSISSNEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHHSGDVLGGHAIRIL 287
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
GWG EN PYWLVANSWN+ WGDHG FKILRG++E IE N +
Sbjct: 288 GWGTENGTPYWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGI 333
>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
Length = 335
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 102/156 (65%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI SNG ++SA+ ++
Sbjct: 80 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 139
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y +Y+ D G ++ +
Sbjct: 200 --GEGDTPKCSKIC-EPGYTPSYKEDKHFGCSSYSI 232
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 54/83 (65%), Positives = 66/83 (79%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+VY+DFLQYKSGVYQH GD +G HA+R+LGWGVEN PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG++ IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322
>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
Length = 296
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 26/34 (76%)
Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
N PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 246 NGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 279
>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
Length = 335
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI SNG ++SA+ ++
Sbjct: 80 LPKGFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 139
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y +Y+ D G ++ +
Sbjct: 200 --GEGDTPKCSKIC-EPGYTPSYKEDKHFGCSSYSI 232
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 54/83 (65%), Positives = 66/83 (79%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+VY+DFLQYKSGVYQH GD +G HA+R+LGWGVEN PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG++ IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322
>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
Length = 340
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 73/150 (48%), Positives = 99/150 (66%), Gaps = 4/150 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDARE+WP CP+++ I DQ +CGSCWA AISDR+CI +NG+ + ++SA+ ++
Sbjct: 80 LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDML 139
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKG 334
G TP+C + C P Y +Y+ D G
Sbjct: 200 GEGG-DTPKCSKIC-EPGYSPSYKEDKHYG 227
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 55/83 (66%), Positives = 67/83 (80%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I+++GP+ A F+VY+DFLQYKSGVYQH GD +G HAVR+LGWGVEN PYWLV NS
Sbjct: 241 MAEIFKNGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHAVRILGWGVENGTPYWLVGNS 300
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG++ IE
Sbjct: 301 WNTDWGDNGFFKILRGQDHCGIE 323
>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
EGFP fusion protein [synthetic construct]
Length = 578
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 74/166 (44%), Positives = 100/166 (60%), Gaps = 5/166 (3%)
Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
E +G LP +FDARE+W CP++ I DQ +CGSCWA A+SDR+CI +NG
Sbjct: 70 ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129
Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
++SA+ ++ C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEH
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
HV G CT G+ TP+C + C Y ++Y+ D G ++ V
Sbjct: 190 HVNGSRPPCT--GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 232
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 67/83 (80%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 300 WNVDWGDNGFFKILRGENHCGIE 322
>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
Length = 340
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 70/157 (44%), Positives = 98/157 (62%), Gaps = 4/157 (2%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
+ +P FDARE+WP+CP+++ I DQ +CGSCWA A+SDR+CI S G +SA++
Sbjct: 87 QAIPEAFDAREQWPDCPTIQEIRDQGSCGSCWAFGAVEAMSDRICIHSKGEVNAHLSAEN 146
Query: 245 IVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+V+C C +GCNGG+P AW W G+VTGG++NS +GCQPY + CEHH G C
Sbjct: 147 LVSCCYTCGFGCNGGFPGAAWSHWVKKGIVTGGNFNSSQGCQPYIIPACEHHTTGDRPPC 206
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ G TP+C + C + Y Y DL G ++ V
Sbjct: 207 SEGG--GTPKCLKTCED-GYTVDYTQDLHYGASSYSV 240
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 45/81 (55%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ +VY DF YKSGVYQH G ++G HA+R+LGWGVE +PYWL+ANSWN
Sbjct: 250 EIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGGHAIRILGWGVEEGVPYWLIANSWN 309
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G K+LRG++ IE
Sbjct: 310 TDWGDNGYIKLLRGKDHCGIE 330
>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
Length = 252
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 97/155 (62%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE WP CP++R + DQ +CGSCWA A+SDR+CI S G SA+++V
Sbjct: 28 LPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLV 87
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S GC PY +APCEHHV G C
Sbjct: 88 SCCWTCGFGCNGGFPGAAWHYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKE 147
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C + C + Y+ Y DL +GK A+ +
Sbjct: 148 GG--KTPKCVKKCED-GYKVPYEQDLHRGKSAYSL 179
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 37/65 (56%), Positives = 52/65 (80%), Gaps = 1/65 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANS 136
++IY +GP+ F+VY DF+ Y++GVY+H G ++G HA+R+LGWGV+N +IPYWLVANS
Sbjct: 188 QEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANS 247
Query: 137 WNDHW 141
WN W
Sbjct: 248 WNTDW 252
>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
Length = 339
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 73/150 (48%), Positives = 98/150 (65%), Gaps = 5/150 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDARE+WP CP+++ I DQ +CGSCWA AISDR+CI +NG+ ++SA+ ++
Sbjct: 80 LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDML 139
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + C GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKG 334
G+ TP+C + C P Y +Y+ D G
Sbjct: 200 --GEGDTPKCSKFC-EPGYTPSYKEDKHYG 226
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 54/83 (65%), Positives = 65/83 (78%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ A F+VY+DFL YKSGVYQH G+ +G HAVR+LGWGVEN PYWLV NS
Sbjct: 240 MAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVENGTPYWLVGNS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG + IE
Sbjct: 300 WNTDWGDNGFFKILRGRDHCGIE 322
>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
[Tribolium castaneum]
gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 97/155 (62%), Gaps = 5/155 (3%)
Query: 183 NAKGLPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
+ +P +FDARE WPEC S+ I DQ++CGSCWA A A+SDR+CI SN IS
Sbjct: 80 DVNAIPESFDAREAWPECASIIGDIRDQASCGSCWAFGAAEAMSDRICIHSNATVKVSIS 139
Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
+ + C C GCNGGWP AW +W G+VTGG Y +++GC+ YT+ PCEHH +G L
Sbjct: 140 TEDLNTCCYECGDGCNGGWPAEAWAYWAETGIVTGGKYETKDGCKAYTVPPCEHHTEGDL 199
Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
C + + TP+CK+ C + + Y+ DL+KG
Sbjct: 200 PACGDI--VPTPQCKKEC-DAGVDIEYKSDLRKGS 231
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/81 (61%), Positives = 60/81 (74%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ A F VY DFL YKSGVYQ G+ G HA+++LGWGVE+ PYWL ANSWN
Sbjct: 245 EIMTNGPVEADFDVYEDFLNYKSGVYQQTTGNYAGGHAIKILGWGVEDGTPYWLAANSWN 304
Query: 139 DHWGDHGTFKILRGENEADIE 159
+ WGD G FKILRG+NE IE
Sbjct: 305 EDWGDKGYFKILRGQNECGIE 325
>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
Length = 340
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 70/155 (45%), Positives = 97/155 (62%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
P NFD+R +WP CP++ I DQ +CGSCWA A+SDR+CI S G ++S++ +V
Sbjct: 88 FPENFDSRTQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRICIHSEGKVHFRVSSEDLV 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG + S +GCQPY +APCEHHV G +C
Sbjct: 148 SCCHTCGFGCNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAPCEHHVNGSRPSCEG 207
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C + C SY Y D GK ++ +
Sbjct: 208 EGG-KTPKCVKKC-QASYNVPYAKDKMYGKSSYSI 240
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ F+VY D L YK GVY H G +G HA+R+LGWGVE+ YWL+ANSW
Sbjct: 249 KEIMTNGPVEGAFTVYEDLLNYKEGVYHHVHGKMLGGHAIRILGWGVEDGTKYWLIANSW 308
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WGD+G FKILRGE+ IE
Sbjct: 309 NSDWGDNGFFKILRGEDHLGIE 330
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 70/155 (45%), Positives = 97/155 (62%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P++FD+R++WP CP++ I DQ +CGSCWA A+SDR+CI SNG SA +V
Sbjct: 88 IPKDFDSRKQWPHCPTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSADDLV 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GC+PY +APCEHHV G C
Sbjct: 148 SCCHTCGFGCNGGFPGAAWSYWVRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCEK 207
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
KTP C+ C SY+ Y+ D G +A+ +
Sbjct: 208 EYG-KTPRCQHKC-QASYKVDYKTDKHFGSRAYSI 240
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ +A+ + + + +I HGP+ F+VY D + YK GVY+H G +G HA+R++
Sbjct: 232 HFGSRAYSISKNVHDIQEEIMTHGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRII 291
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVE DIPYWLVANSWN WG++G FKILRG++ IE
Sbjct: 292 GWGVEKDIPYWLVANSWNTDWGNNGFFKILRGKDHCGIE 330
>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
Length = 340
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 79/192 (41%), Positives = 110/192 (57%), Gaps = 5/192 (2%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDL-ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
G N D++M + R+ L + + LP NFDARE WP CP+++ I DQ
Sbjct: 44 GHNFYDVDMSYVKRLCGTLLNGPKLPQRVHLAEEMDLPENFDARENWPNCPTIKEIRDQG 103
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT-PNCW-GCNGGWPQLAWRFWG 268
+CGSCWA AISDR+CI +NG ++SA+ ++ C C GCNGG+P AW FW
Sbjct: 104 SCGSCWAFGAVEAISDRVCIHTNGNVNVEVSAEDLLTCCHMECGDGCNGGFPAGAWNFWT 163
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
G+V+GG Y+S GC+PY++ PCEHHV G C G +TP+C + C P Y +Y+
Sbjct: 164 KKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCKGEGG-ETPKCSKTC-EPGYSPSYK 221
Query: 329 FDLKKGKKAHMV 340
D G ++ V
Sbjct: 222 EDKHYGYSSYGV 233
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 62/125 (49%), Positives = 79/125 (63%), Gaps = 2/125 (1%)
Query: 37 KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYA 94
K + + K K + Y P+ HY ++ VP M +IY++GP+ FSVY
Sbjct: 199 KGEGGETPKCSKTCEPGYSPSYKEDKHYGYSSYGVPSSEQEIMAEIYKNGPVEGAFSVYT 258
Query: 95 DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
DFL YKSGVYQH G+ +G HA+R+LGWGVEN PYWL ANSWN WGD+G FKILRG++
Sbjct: 259 DFLVYKSGVYQHVTGEEVGGHAIRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGQD 318
Query: 155 EADIE 159
IE
Sbjct: 319 HCGIE 323
>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
Length = 331
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 98/155 (63%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FD+R W CP++ I DQ +CGSCWA ++DR CI SNG SA+++V
Sbjct: 79 IPKEFDSRTAWSMCPTISEIRDQGSCGSCWAFGAVEVMTDRDCIHSNGTKNFHYSAENLV 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P A+++W H+G+V+GG +NS +GCQPY +APCEHHV GP C
Sbjct: 139 SCCHLCGFGCNGGFPGAAFQYWVHSGIVSGGAFNSTQGCQPYEIAPCEHHVSGPRPKCAE 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C +NC +Y Y DL G K + V
Sbjct: 199 GG--STPKCHKNC-ESNYVVDYESDLHHGSKHYSV 230
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 57/81 (70%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
I +GP+ F+VY DFL YKSGVYQH G +G HA+RVLGWG E+ PYWL ANSWN
Sbjct: 240 DIMTNGPVEGAFTVYVDFLHYKSGVYQHTHGLPLGGHAIRVLGWGEEDGTPYWLCANSWN 299
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G FKILRG + IE
Sbjct: 300 TDWGDNGYFKILRGSDHCGIE 320
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
Length = 340
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 70/150 (46%), Positives = 93/150 (62%), Gaps = 9/150 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAREKW CP++ I DQ +CGSCWA A+SDR+CI S G +SA +V
Sbjct: 88 IPTEFDAREKWSNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSQGKVNFHLSADDLV 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG++ SQ+GC+PY + PCEHHV G C+
Sbjct: 148 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGNFGSQQGCRPYEIEPCEHHVNGTRPPCS- 206
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
TP C+ C ES+Y+ D KK K
Sbjct: 207 --SGSTPRCQHVC-----ESSYKVDYKKDK 229
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/87 (51%), Positives = 62/87 (71%), Gaps = 2/87 (2%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND--IPYWL 132
+ ++I +GP+ F+VY D + YKSGVY+H G +G HA+R+LGWGV D IPYWL
Sbjct: 244 DIQKEIMNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGGHAIRILGWGVWGDEKIPYWL 303
Query: 133 VANSWNDHWGDHGTFKILRGENEADIE 159
+ANSWN WGD+G F+I+RG++ IE
Sbjct: 304 IANSWNTDWGDNGFFRIVRGKDHCGIE 330
>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 71/147 (48%), Positives = 94/147 (63%), Gaps = 5/147 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD R++WP CP+L+ I DQ +CGSCWA A AISDR+CI SN + +IS++ ++
Sbjct: 79 LPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLL 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GCNGG+P AW FW G+VTGG Y+S GC+PY++ PCEHHV G CT
Sbjct: 139 SCCDSCGMGCNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPPCTG 198
Query: 306 LGKLKTPECKQNC---YNPSYESTYRF 329
+ TP+C C Y P Y+ F
Sbjct: 199 E-EGDTPQCSNQCETGYTPGYKQDKHF 224
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K ++ +P M ++ ++GP+ F+VY DFL YKSGVYQH G ++G HA++VL
Sbjct: 223 HFGKNSYSLPSEEQQIMAELLKNGPVEGAFTVYEDFLLYKSGVYQHVSGSAVGGHAIKVL 282
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E PYWL ANSWN WG++G FKILRG++ IE
Sbjct: 283 GWGEEGGTPYWLAANSWNTDWGENGFFKILRGKDHCGIE 321
>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
Length = 340
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 102/156 (65%), Gaps = 4/156 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR++WP CP+++ I DQ +CGSCWA A+SDR+CI +NG+ ++SA+ ++
Sbjct: 80 LPESFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVGAMSDRVCIHTNGHVNVEVSAEDLL 139
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C C GCNGG+P AW++W G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 SCCGLECGDGCNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGTRPQCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C + C P Y +Y+ D G ++ V
Sbjct: 200 GEGG-DTPKCSKTC-EPGYSPSYKEDKHFGYDSYSV 233
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/83 (57%), Positives = 64/83 (77%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YK+GVY+H G+ +G HA+R+LGWG EN +PYWLV NS
Sbjct: 241 MAEIYKNGPVEGAFTVFSDFLMYKTGVYKHLAGEMLGGHAIRILGWGKENGVPYWLVGNS 300
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD G FKI+RGE+ IE
Sbjct: 301 WNVDWGDSGFFKIVRGEDHCGIE 323
>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
Length = 344
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 95/156 (60%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR+ WP CP++ I DQ +CGSCWA A+SDRLCI SN SA +V
Sbjct: 92 VPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRLCIHSNATIHFHFSADDLV 151
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GC+PY +APCEHHV G C
Sbjct: 152 SCCHTCGFGCNGGFPGAAWAYWTRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCD- 210
Query: 306 LGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ KTP C+ C SY+ Y+ D G K++ V
Sbjct: 211 -GEHGKTPSCRHEC-QKSYDVDYKTDKHFGSKSYSV 244
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 69/99 (69%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K++ V R + ++I ++GP+ F+VY D + YK GVYQH G +G HA+R+L
Sbjct: 236 HFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYKDGVYQHVHGRELGGHAIRIL 295
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN PYWL+ANSWN WG++G FK+LRGE+ IE
Sbjct: 296 GWGVENKTPYWLIANSWNTDWGNNGFFKMLRGEDHCGIE 334
>gi|86279341|gb|ABC88766.1| putative cathepsin B-like like proteinase [Tenebrio molitor]
Length = 301
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 7/156 (4%)
Query: 183 NAKGLPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
N +P +FDARE WPEC S+ I DQ++CGSCWA A+SDR+CI S+ +IS
Sbjct: 80 NLDAIPESFDAREAWPECTSIIGEIRDQASCGSCWAFGAVEAMSDRICIHSDASVKVRIS 139
Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
A+ + C +C GCNGGWP LAW +W G+VTGG Y EGC+ Y++ PC+HHV G L
Sbjct: 140 AEDLNDCCYDCGDGCNGGWPDLAWSYWSSTGIVTGGLYGVDEGCKAYSIKPCDHHVDGNL 199
Query: 301 QNCTLLGKL-KTPECKQNCYNPSYESTYRFDLKKGK 335
C G + +TP CK++C + S + Y+ DL++G
Sbjct: 200 GPC---GDIQRTPACKKSCDSTS-DLEYKSDLRRGS 231
>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
Length = 338
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 71/150 (47%), Positives = 100/150 (66%), Gaps = 5/150 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+RE+WP CP+++ I DQ +CGSCWA AISDR+CI +NG+ + ++SA+ ++
Sbjct: 80 LPESFDSREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDML 139
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGDQCGDGCNGGFPAEAWNFWTXXGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKG 334
G+ TP+C + C P Y +Y+ D G
Sbjct: 200 --GEGDTPKCSKIC-EPGYTPSYKEDKHYG 226
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 55/83 (66%), Positives = 66/83 (79%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ A FSVY+DFL YKSGVYQH G+ +G HAVR+LGWGVEN PYWLV NS
Sbjct: 240 MAEIYKNGPVEAAFSVYSDFLMYKSGVYQHVTGEMMGGHAVRILGWGVENGTPYWLVGNS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG++ IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322
>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
Length = 237
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 96/155 (61%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE WP CP++R + DQ +CGSCWA A+SDR+CI S G SA+++V
Sbjct: 28 LPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLV 87
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S GC PY +APCEHHV G C
Sbjct: 88 SCCWTCGFGCNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEVAPCEHHVNGTRGPCKE 147
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G KTP+C + C + Y+ Y DL GK A+ +
Sbjct: 148 GG--KTPKCVKKCED-GYKVPYAQDLHHGKSAYSL 179
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 1/91 (1%)
Query: 37 KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYAD 95
K+ K K KK + +P + L H + + +RQ IY +GP+ F+VY D
Sbjct: 146 KEGGKTPKCVKKCEDGYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVYED 205
Query: 96 FLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN 126
F+ Y++GVY+H G ++G HA+R+LGWGV+N
Sbjct: 206 FIAYRAGVYKHVAGKALGGHAIRILGWGVQN 236
>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
Length = 341
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 74/158 (46%), Positives = 97/158 (61%), Gaps = 5/158 (3%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
K LP +FD+R +WP CP+L+ + DQ CGSCWA A+SDR+CI S G ISA+
Sbjct: 87 KDLPASFDSRTQWPNCPTLKEVRDQGACGSCWAFGAVEAMSDRICIKSQGKENVHISAED 146
Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+ +C C GC GG+P AW ++ +G+VTGG YNS +GCQPYT+ C+HHV G LQ C
Sbjct: 147 LTSCCRTCGNGCEGGFPSAAWSYYKRDGLVTGGQYNSHQGCQPYTIKACDHHVVGKLQPC 206
Query: 304 TL-LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ +G TP+CK C Y TY D G A+ V
Sbjct: 207 SKDIG--PTPKCKHTC-EAGYNVTYEKDKHYGMSAYSV 241
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 54/98 (55%), Positives = 66/98 (67%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCN-AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
HY A+ V M +I +GP+ F+VYADF QYKSGVY+H G +G HA+++LG
Sbjct: 233 HYGMSAYSVHGVEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILG 292
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG EN YWLVANSWN WGD G FKILRG++E IE
Sbjct: 293 WGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIE 330
>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
Length = 335
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI S G ++SA+ ++
Sbjct: 80 LPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDML 139
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y +Y+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPSYKDDKHFGCSSYSV 232
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 54/83 (65%), Positives = 65/83 (78%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ FSVY+DFL YKSGVYQH G+ +G HA+R+LGWGVEND PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVENDTPYWLVGNS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD G FKILRG++ IE
Sbjct: 300 WNTDWGDKGFFKILRGQDHCGIE 322
>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
Length = 335
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI S G ++SA+ ++
Sbjct: 80 LPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDML 139
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y +Y+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPSYKDDKHFGCSSYSV 232
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 54/83 (65%), Positives = 65/83 (78%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ FSVY+DFL YKSGVYQH G+ +G HA+R+LGWGVEND PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVENDTPYWLVGNS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD G FKILRG++ IE
Sbjct: 300 WNTDWGDKGFFKILRGQDHCGIE 322
>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
Length = 338
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 89/155 (57%), Gaps = 5/155 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR WP+CP++ I DQ +CGSCWA A+SDR+CI SN SA +V
Sbjct: 86 LPEEFDARTAWPDCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDLV 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W H G+V+GG Y S+EGC+PY + PCEHHV G C
Sbjct: 146 SCCHTCGFGCNGGFPGAAWSYWTHKGIVSGGSYGSKEGCRPYEVEPCEHHVNGTRPPCH- 204
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C C Y Y D G KA+ V
Sbjct: 205 --SGSTPRCMHKC-ESGYSVDYAKDKHFGAKAYSV 236
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 69/101 (68%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ KA+ V R + R+I +GP+ F+VY D + YK+GVYQH G +G HA+R+L
Sbjct: 228 HFGAKAYSVNRNPLDIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGGHAIRIL 287
Query: 121 GWGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV +N +PYWL+ NSWN WGD+G F+ILRGE+ IE
Sbjct: 288 GWGVWGDNKVPYWLIGNSWNTDWGDNGFFRILRGEDHCGIE 328
>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
Length = 337
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 100/156 (64%), Gaps = 4/156 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDARE+WP CP+++ I DQ +CGSCWA AISDR+C+ SNG ++SA+ ++
Sbjct: 81 LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHSNGNANVEVSAEDLL 140
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C + C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 141 SCCGSECGDGCNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPACT 200
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ TP C++ C Y + Y+ D G ++ V
Sbjct: 201 GE-EGDTPTCRKKC-EEGYSTQYKDDKNYGSTSYSV 234
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 56/99 (56%), Positives = 69/99 (69%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
+Y ++ VP M +IY++GP+ FSVY DFL YKSGVYQH G+ +G HA+R+L
Sbjct: 226 NYGSTSYSVPSSEQEIMAEIYKNGPVEGAFSVYEDFLHYKSGVYQHVAGEMLGGHAIRIL 285
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN I YWL ANSWN WGD+G FK LRG+N IE
Sbjct: 286 GWGVENGIRYWLAANSWNIDWGDNGFFKFLRGKNHCGIE 324
>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 70/155 (45%), Positives = 95/155 (61%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FD+R +WP CP++ I DQ +CGSCWA A+SDR+CI SNG SA +V
Sbjct: 88 IPKEFDSRNQWPHCPTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSADDLV 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GC+PY +APCEHHV G C
Sbjct: 148 SCCHTCGFGCNGGFPGAAWGYWVRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCEK 207
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
KTP C+ C SY+ Y+ D G +A+ +
Sbjct: 208 EYG-KTPRCQHKC-QASYKVDYKTDKHFGSRAYSI 240
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 44/81 (54%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ F+VY D + YK GVY+H G +G HA+R++GWGVE D PYWL+ANSWN
Sbjct: 250 EIMTNGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKDTPYWLIANSWN 309
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG++G FKILRG++ IE
Sbjct: 310 TDWGNNGFFKILRGKDHCGIE 330
>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
Length = 340
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 99/156 (63%), Gaps = 4/156 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI SNG ++SA+ ++
Sbjct: 80 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGLQNVEVSAEDLL 139
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G C+
Sbjct: 140 TCCGFQCGEGCNGGFPSGAWNFWKKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCS 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C + C P Y +Y+ D G + V
Sbjct: 200 GEGG-DTPKCSKIC-EPGYSPSYKEDKHFGCDTYSV 233
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 73/108 (67%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ H+ + VP M +IY++GP+ A FSVY+DFL YKSGVYQH G+
Sbjct: 216 YSPSYKEDKHFGCDTYSVPSDEKEIMVEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEM 275
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HAVR+LGWGVEN PYWLV NSWN WGD+G FKILRG + IE
Sbjct: 276 VGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIE 323
>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
Length = 346
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 98/153 (64%), Gaps = 8/153 (5%)
Query: 182 QNAKGLPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI 240
++ + LP NFDARE+WPEC SL I DQSNCGSCWAVS A+ SDRLCIA+ G +
Sbjct: 85 ESNEALPENFDARERWPECSSLLGSIKDQSNCGSCWAVSAASVFSDRLCIATGGAVARNL 144
Query: 241 SAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
SA+ + C C GC+GG P+ AW F+ +G+VTGGDY S++GCQPY++ PC G
Sbjct: 145 SAEQLNTCCYRCGNGCDGGSPESAWYFFMRHGIVTGGDYGSEDGCQPYSIYPC-----GK 199
Query: 300 LQNCTLLGKLKTPECK-QNCYNPSYESTYRFDL 331
+N + TP+C + C N +Y YR DL
Sbjct: 200 GRNTCIEDDPDTPDCSIKTCTNSNYSKNYRADL 232
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY + + R + M+ +Y++GP+ A F VY DF+ YKSGVY + G G HA+++L
Sbjct: 233 HYVDTVYSLSRSEEDIMKDLYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKIL 292
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV++ YWL ANSW+ WG++G F+ILRG NE IE
Sbjct: 293 GWGVDDGTKYWLCANSWSRSWGENGLFRILRGNNECHIE 331
>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
Length = 339
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 100/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI +N + + ++SA+ ++
Sbjct: 80 LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C GCNGG+P AW F G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFLTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
Length = 359
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 72/148 (48%), Positives = 95/148 (64%), Gaps = 6/148 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI +NG ++SA+ ++
Sbjct: 103 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGNVNVEVSAEDLL 162
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 163 TCCGFQCGEGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 222
Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
G TP+C + C Y PSY+ F
Sbjct: 223 GEGG-STPKCSRICEAGYTPSYKEDKHF 249
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 59/108 (54%), Positives = 74/108 (68%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ H+ ++ VP M +IY++GP+ A FSVY+DFL YKSGVYQH G+
Sbjct: 239 YTPSYKEDKHFGCSSYSVPSSETEIMAEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEM 298
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HAVR+LGWGVE+ PYWLV NSWN WGD G FKILRG++ IE
Sbjct: 299 MGGHAVRILGWGVEDGTPYWLVGNSWNTDWGDSGFFKILRGQDHCGIE 346
>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 328
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 103/155 (66%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR++WP+C +++ I DQ +CGSCWA A AISDRLCI S + +ISA+ ++
Sbjct: 77 LPDSFDARKQWPDCRTIQQIRDQGSCGSCWAFGAAEAISDRLCIHSGSKISLEISAEDLL 136
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+P AW FW G+VTGG S+ GC+PY++APCEHHV G C
Sbjct: 137 SCCDECGMGCSGGYPSSAWEFWTKKGLVTGGLCGSEVGCRPYSIAPCEHHVNGTRPPCQ- 195
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G +TP+C++ C + Y ++Y D GK+++ +
Sbjct: 196 -GTQETPKCEKKCID-GYLTSYLKDKHFGKRSYSL 228
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/124 (45%), Positives = 83/124 (66%), Gaps = 2/124 (1%)
Query: 38 KKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYAD 95
+ ++ K +KK YL + + H+ K+++ +P + M ++Y++GP+ A F+VYAD
Sbjct: 195 QGTQETPKCEKKCIDGYLTSYLKDKHFGKRSYSLPSQQEQIMTELYKNGPVEAAFTVYAD 254
Query: 96 FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
FL YK+GVYQH G+ +G HA+++LGWG E+ PYWL ANSWN WGD G FKI RG +E
Sbjct: 255 FLLYKTGVYQHVTGEVLGGHAIKILGWGEESGTPYWLAANSWNGDWGDKGFFKIKRGNDE 314
Query: 156 ADIE 159
IE
Sbjct: 315 CGIE 318
>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 333
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 99/155 (63%), Gaps = 6/155 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR+KW +CPS+ I DQ +CGSCWA+ A+SDR C++ ISA++++
Sbjct: 82 IPDTFDARQKWSDCPSISDIRDQGSCGSCWALGAVEAMSDRYCVSFQENV--HISAENLM 139
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C GC GG+ Q AW +W +G+VTGG Y S EGCQPY + C HH GP +NCT
Sbjct: 140 TCCKFCGNGCAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLIPKCNHHEPGPYENCT- 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ KTP+C++ C Y ++Y DL G+KA+ V
Sbjct: 199 -GEGKTPQCERTC-RSGYTTSYEADLHYGEKAYAV 231
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 53/99 (53%), Positives = 73/99 (73%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR-CNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY +KA+ V R A++ +I +GP+ F+VY+DF YKSGVYQH G ++G HA+R+L
Sbjct: 223 HYGEKAYAVHREVEAIQTEIMTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRIL 282
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG EN +PYWL+ANSWN WGD G FK++RG+++ IE
Sbjct: 283 GWGTENGVPYWLIANSWNPSWGDKGYFKMIRGKDDCGIE 321
>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 331
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 69/147 (46%), Positives = 97/147 (65%), Gaps = 5/147 (3%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FDARE W EC + + DQS+CGSCWAV+ A+A+SDR CIAS G +SA+++
Sbjct: 79 VPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQGKLKVPVSAENL 138
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
++C +C +GC GG+P +AW +W G+ TGG Y S++GCQPY+L PCEHH +G C+
Sbjct: 139 LSCCDSCGYGCEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEHHTEGNKVQCS 198
Query: 305 LLGKLKTPECKQNCYNPS--YESTYRF 329
L TP CK C + + Y+S F
Sbjct: 199 TL-DYDTPSCKHKCDDSALNYKSELTF 224
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 51/85 (60%), Positives = 64/85 (75%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
N ++I +GP+ A F VY+DF+ YKSGVYQH G+ +G HAVR+LGWG E+ +PYWLVA
Sbjct: 237 NIQKEILTNGPVEAAFDVYSDFVNYKSGVYQHVAGEYLGGHAVRILGWGEESGVPYWLVA 296
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSWN+ WGD G FKI RG NE+ E
Sbjct: 297 NSWNEDWGDKGLFKIRRGNNESGFE 321
>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
Length = 340
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 100/156 (64%), Gaps = 4/156 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD+R++WP CP++ I DQ +CGSCWA AISDR+C+ +N + ++SA+ ++
Sbjct: 80 LPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139
Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C C GCNGG+P AWR+W G+V+GG Y+S GC+PYT+ PCEHHV G CT
Sbjct: 140 SCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G +TP C ++C P Y +Y+ D G ++ V
Sbjct: 200 GEGG-ETPRCSRHC-EPGYSPSYKEDKHYGITSYGV 233
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 73/108 (67%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ HY ++ VPR M +IY++GP+ F VY DFL YKSGVYQH G+
Sbjct: 216 YSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQ 275
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWL ANSWN WGD+G FKILRGE+ IE
Sbjct: 276 VGGHAIRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIE 323
>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 316
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 116/203 (57%), Gaps = 19/203 (9%)
Query: 148 KILRGENEADIE--------MGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPE 199
++ + E ADIE F NR N +DD E G + +P +FDAR WP
Sbjct: 25 QLFKAEPRADIEHLRRKVMKSKFINR--NNKPREDDTEIDGSK----IPDSFDARVTWPH 78
Query: 200 CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA-CTPNCWGCNGG 258
CPS+ +I DQS CGSCWA S A +SDR+CIAS+G+ ++SA I++ CT +GC+GG
Sbjct: 79 CPSISYIRDQSQCGSCWAFSSAEVMSDRVCIASHGHKKVELSADDILSCCTDGGYGCDGG 138
Query: 259 WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL-QNCTLLGKLKTPECKQN 317
WP AW+++ GVVTGG Y +++ C+PY + PC H NCT ++ TP+CK
Sbjct: 139 WPVSAWQYFVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSNCTQ--EIDTPDCKTT 196
Query: 318 CYNPSYESTYRFDLKKGKKAHMV 340
C Y +Y D GK A+ V
Sbjct: 197 C-QAGYPISYDDDKTYGKTAYSV 218
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 47/84 (55%), Positives = 62/84 (73%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+VA F+VY DF YK+G+Y+H G G HAVR+LGWG + +PYWLVANSW
Sbjct: 227 KEIMTYGPVVAAFTVYDDFFHYKTGIYKHVSGAEAGGHAVRILGWGQQGGVPYWLVANSW 286
Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
N WG++G F+ILRG +E IE G
Sbjct: 287 NTDWGENGYFRILRGSDECGIEDG 310
>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
Length = 337
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 73/157 (46%), Positives = 92/157 (58%), Gaps = 4/157 (2%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
K LP FDAR +WP+CPSL+ + DQ CGSCWA A +DRLCI S G +SA+
Sbjct: 84 KDLPDTFDARTQWPDCPSLKEVRDQGACGSCWAFGCVEAATDRLCIQSKGIVNAHLSAED 143
Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+ +C C GCNGG+ + AW + +G+VTGG YNS +GC PY + C+HHV G LQ C
Sbjct: 144 LTSCCRTCGNGCNGGFLEGAWNYLKRDGIVTGGPYNSHQGCLPYEIKACDHHVVGKLQPC 203
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP CK+ C Y +TY D K H V
Sbjct: 204 K--GDGPTPRCKKEC-ESGYNNTYSKDEHHAKTVHAV 237
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 52/98 (53%), Positives = 65/98 (66%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRC-NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
H+ K H V M +I +GP+ A F+VY+DF YKSGVY+H G +G HA++ LG
Sbjct: 229 HHAKTVHAVEGVEQIMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGHAIKTLG 288
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E+ YWLVANSWN WGD+G FKILRG +E IE
Sbjct: 289 WGNEDGKDYWLVANSWNPDWGDNGFFKILRGRDECGIE 326
>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
Length = 326
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 99/155 (63%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD R++WP C +L I DQ +CGSCWA +ISDR+CI S G + +ISA+ ++
Sbjct: 75 LPDSFDLRDQWPNCKTLSQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDLL 134
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GC+GG+P AW +W +G+VTGG YNS GC+PY++APCEHHV G C+
Sbjct: 135 SCCDQCGFGCSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPYSIAPCEHHVNGTRPPCS- 193
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C C P Y Y+ D G K + V
Sbjct: 194 -GEQDTPKCTGVCI-PKYSVPYKQDKHFGSKVYNV 226
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 55/99 (55%), Positives = 70/99 (70%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K + VP + M ++Y +GP+ A F+VY DF YKSGVYQH G ++G HAV++L
Sbjct: 218 HFGSKVYNVPSDQQQIMTELYTNGPVEAAFTVYEDFPLYKSGVYQHLTGSALGGHAVKIL 277
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG EN P+WLVANSWN WGD+G FKILRG +E IE
Sbjct: 278 GWGEENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIE 316
>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 328
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 97/155 (62%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR++WP CP++R I DQ +CGSCWA A+SDR+CI SNG S+ +V
Sbjct: 77 IPADFDARQQWPHCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSNGESNFHFSSDDLV 136
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GCNGG+P AW +W G+V+GG Y +++GC+PY + PCEHH G C
Sbjct: 137 SCCWTCGMGCNGGYPGAAWHYWVRKGLVSGGQYGTKQGCRPYEIPPCEHHTNGSRPACD- 195
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ TP+C ++C +Y+ Y DL G KA+ +
Sbjct: 196 ASEGNTPKCAKSC-ESNYKINYSNDLHFGSKAYSI 229
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 51/84 (60%), Positives = 63/84 (75%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I ++GP+ FSVYADF+ YK+GVYQH G +G HA+R+ GWGVEN+ PYWL+ANSWN
Sbjct: 239 EILQNGPVEGAFSVYADFVNYKTGVYQHIKGQFLGGHAIRIFGWGVENNTPYWLIANSWN 298
Query: 139 DHWGDHGTFKILRGENEADIEMGF 162
WGD GTFKILRG + IE G
Sbjct: 299 TDWGDSGTFKILRGSDHCGIESGI 322
>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
Length = 374
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 68/158 (43%), Positives = 101/158 (63%), Gaps = 4/158 (2%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
A LP +FDAR++W CP+++ + DQ +CGSCWA A+SDR+CIAS G IS++
Sbjct: 119 AVNLPDDFDARKEWTGCPTIKEVRDQGSCGSCWAFGAVEAMSDRICIASKGNVHAHISSE 178
Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
+++C +C GCNGG+P AW ++ G+V+GG Y + +GC+PY++APCEHHV G
Sbjct: 179 DLLSCCSSCGMGCNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAPCEHHVNGTRLP 238
Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C+ G+ TP+C++ C Y+ Y D G A+ V
Sbjct: 239 CS--GEGPTPKCERTC-EKGYKVKYEDDKNFGYTAYSV 273
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 54/83 (65%), Positives = 63/83 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I +GP+ F+VYADF YKSGVYQH G +G HA+RVLGWGVE+ PYWLVANS
Sbjct: 281 MTEIMTNGPVEGAFTVYADFPTYKSGVYQHVSGGELGGHAIRVLGWGVEDGTPYWLVANS 340
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG+NE IE
Sbjct: 341 WNSDWGDNGFFKILRGQNECGIE 363
>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
Length = 341
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 74/157 (47%), Positives = 100/157 (63%), Gaps = 6/157 (3%)
Query: 187 LPRNFDAREKWPE-CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP +FD+R +W CPS++ + DQ+NCGSCWA A++DR CIAS G T ISA+ +
Sbjct: 89 LPTSFDSRTQWGSMCPSVKEVRDQANCGSCWAFGAVEAMTDRTCIASKGAQTPHISAEDL 148
Query: 246 VAC-TPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+ C T C GCNGG+P AW +W + G+VTGG Y+S +GCQPY+LA CEHH GP + C
Sbjct: 149 LTCCTFTCGDGCNGGYPAAAWEYWKNQGIVTGGQYDSNQGCQPYSLAKCEHHTTGPYKPC 208
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ + TP CK++C Y TY D G ++ V
Sbjct: 209 GDI--VPTPACKRSCRQ-GYNVTYPNDKHFGASSYGV 242
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 44/81 (54%), Positives = 60/81 (74%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ A F+VY+DFL YKSGVYQH G +G HA++++GWGV++ YW+VANSWN
Sbjct: 251 EIMTNGPVEAAFTVYSDFLSYKSGVYQHTSGQPLGGHAIKIIGWGVQDGTDYWIVANSWN 310
Query: 139 DHWGDHGTFKILRGENEADIE 159
D WG+ G F I +G +E IE
Sbjct: 311 DSWGNDGFFWIKKGTDECGIE 331
>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 99/155 (63%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+NFD R +WP CP+L+ + DQ +CGSCWA A AISDR+CI SN + +IS++ ++
Sbjct: 79 LPKNFDPRLQWPNCPTLKEVRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLL 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GCNGG+P A FW G+V+GG Y+S GC+PY++ PCEHHV G C
Sbjct: 139 SCCESCGMGCNGGYPSAACDFWTKEGLVSGGLYDSHIGCRPYSIPPCEHHVNGTRPPCKG 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ TP+C C P Y Y+ D GK+++ V
Sbjct: 199 E-EGDTPQCTNQC-EPGYTPGYKQDKHFGKRSYSV 231
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 72/99 (72%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K+++ VP M+++Y++GP+ F+VY DFL YKSGVY+H G ++G HA++VL
Sbjct: 223 HFGKRSYSVPSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSGVYRHVSGSAVGGHAIKVL 282
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E IPYWL ANSWN WG++G FKI+RGE+ IE
Sbjct: 283 GWGEEGGIPYWLAANSWNTDWGENGFFKIVRGEDHCGIE 321
>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 70/155 (45%), Positives = 97/155 (62%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+RE+WP CP+++ + DQ CGSCWA A+SDR CI S G ISA+ ++
Sbjct: 4 VPDHFDSREQWPHCPTIKEVRDQGACGSCWAFGAVEAMSDRYCIKSEGKVMPHISAEDLL 63
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GCNGG+P+ AW W G+VTGG Y+S +GCQPY +A C+HHV G L+ C
Sbjct: 64 SCCETCGMGCNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPYKIAACDHHVVGKLKPCK- 122
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C++ C Y +Y D G+ A+ V
Sbjct: 123 -GDSPTPKCERKC-EAGYNVSYSDDKHFGQSAYSV 155
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 54/102 (52%), Positives = 68/102 (66%), Gaps = 2/102 (1%)
Query: 63 HYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ + A+ V A ++I +GP+ F+VYADF YKSGVYQH G ++G HA+++L
Sbjct: 147 HFGQSAYSVRSDPAEIQKEIMTNGPVEGAFTVYADFPTYKSGVYQHTSGSALGGHAIKIL 206
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
GWG EN PYWLVANSWN WGD G FKI RG +E IE G
Sbjct: 207 GWGEENGTPYWLVANSWNSDWGDEGFFKIKRGNDECGIESGI 248
>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
Length = 339
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 74/166 (44%), Positives = 100/166 (60%), Gaps = 5/166 (3%)
Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
E +G LP +FDARE+W CP++ I DQ +CGSCWA A+SDR+CI +NG
Sbjct: 70 ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129
Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
++SA+ ++ C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEH
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
HV G CT G+ TP+C + C Y ++Y+ D G ++ V
Sbjct: 190 HVNGSRPPCT--GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 67/83 (80%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 300 WNVDWGDNGFFKILRGENHCGIE 322
>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
Length = 341
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 73/158 (46%), Positives = 95/158 (60%), Gaps = 5/158 (3%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
K LP FD+R +WP CP+L+ + DQ CGSCWA A+SDR+CI S G ISA+
Sbjct: 87 KDLPATFDSRTQWPNCPTLKEVRDQGACGSCWAFGAVEAMSDRICIKSQGKENTHISAED 146
Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+ +C C GC GG+P AW ++ +G+VTGG YNS +GC PYT+ C+HHV G LQ C
Sbjct: 147 LTSCCRTCGNGCEGGFPSAAWSYYKKDGLVTGGQYNSHQGCLPYTIKACDHHVVGKLQPC 206
Query: 304 T-LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ +G TP+CK C Y TY D G A+ V
Sbjct: 207 SKSIG--PTPKCKHTC-EAGYNVTYEKDKHYGSSAYSV 241
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 54/98 (55%), Positives = 66/98 (67%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCN-AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
HY A+ V M +I +GP+ F+VYADF QYKSGVY+H G +G HA+++LG
Sbjct: 233 HYGSSAYSVHGVEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILG 292
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG EN YWLVANSWN WGD G FKILRG++E IE
Sbjct: 293 WGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIE 330
>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
Length = 341
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 71/153 (46%), Positives = 95/153 (62%), Gaps = 3/153 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR KW EC S+ HI +Q NC + WA+SV +AI+DR+CI S T S Q ++
Sbjct: 87 MPETFDARNKWFECVSIAHIWNQGNCAADWAISVTSAINDRICIKSKKNITAFYSPQKML 146
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GCNGG+ AW++W G+VTGGDY S EGCQP+ + PC H V +
Sbjct: 147 SCCDDCGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQPWLIPPCNHTVMDERSPSYM 206
Query: 306 LGKLK--TPECKQNCYNPSYESTYRFDLKKGKK 336
GK K TP+C NCYNP+Y + D+ KG +
Sbjct: 207 CGKYKSETPQCTLNCYNPNYSKPFLKDISKGIR 239
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 47/91 (51%), Positives = 56/91 (61%), Gaps = 2/91 (2%)
Query: 74 CNAM--RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
C+ M ++ +HGP AI VY DFL YKSG+YQH G +G V+V+GWGV + YW
Sbjct: 244 CSGMIRNELKKHGPATAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVYRGVQYW 303
Query: 132 LVANSWNDHWGDHGTFKILRGENEADIEMGF 162
L ANSW WGD G FKI RG NE E F
Sbjct: 304 LAANSWGTSWGDKGFFKIRRGYNECLFEDYF 334
>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
Length = 271
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 74/166 (44%), Positives = 100/166 (60%), Gaps = 5/166 (3%)
Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
E +G LP +FDARE+W CP++ I DQ +CGSCWA A+SDR+CI +NG
Sbjct: 2 ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 61
Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
++SA+ ++ C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEH
Sbjct: 62 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 121
Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
HV G CT G+ TP+C + C Y ++Y+ D G ++ V
Sbjct: 122 HVNGSRPPCT--GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 164
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 67/83 (80%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 172 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 231
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 232 WNVDWGDNGFFKILRGENHCGIE 254
>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
Full=RSG-2; Contains: RecName: Full=Cathepsin B light
chain; Contains: RecName: Full=Cathepsin B heavy chain;
Flags: Precursor
gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
Length = 339
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 74/166 (44%), Positives = 100/166 (60%), Gaps = 5/166 (3%)
Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
E +G LP +FDARE+W CP++ I DQ +CGSCWA A+SDR+CI +NG
Sbjct: 70 ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129
Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
++SA+ ++ C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEH
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
HV G CT G+ TP+C + C Y ++Y+ D G ++ V
Sbjct: 190 HVNGSRPPCT--GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 232
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 67/83 (80%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 300 WNVDWGDNGFFKILRGENHCGIE 322
>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
Length = 266
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 99/156 (63%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP+CP+++ I DQ +CGS WA AISDR+CI +N + + ++SA+ ++
Sbjct: 7 LPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 66
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C + GCNGG+P AW FW G+V+GG Y S GC+PY++ PCE HV G CT
Sbjct: 67 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPPCT 126
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 127 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 159
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 142 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 201
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 202 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 249
>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
Length = 339
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 73/148 (49%), Positives = 92/148 (62%), Gaps = 7/148 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE+W CP++ I DQ +CGSCWA AISDR CI +NG ++SA+ ++
Sbjct: 80 LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 139
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEHHV G CT
Sbjct: 140 TCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
G+ TP C ++C Y+PSY+ F
Sbjct: 200 --GEGDTPRCNKSCEAGYSPSYKEDKHF 225
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 66/83 (79%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWGVEN +PYWL ANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAANS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 300 WNLDWGDNGFFKILRGENHCGIE 322
>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
Length = 339
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 73/148 (49%), Positives = 92/148 (62%), Gaps = 7/148 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE+W CP++ I DQ +CGSCWA AISDR CI +NG ++SA+ ++
Sbjct: 80 LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 139
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEHHV G CT
Sbjct: 140 TCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
G+ TP C ++C Y+PSY+ F
Sbjct: 200 --GEGDTPRCNKSCEAGYSPSYKEDKHF 225
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 51/83 (61%), Positives = 64/83 (77%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++ P+ F+V++DFL YKSGVY+H GD +G HA+R+LGWGV N +PYWL ANS
Sbjct: 240 MAEIYKNDPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVGNGVPYWLAANS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 300 WNLDWGDNGFFKILRGENHCGIE 322
>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
Length = 340
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 100/156 (64%), Gaps = 4/156 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+R +WP CP++ I DQ +CGSCWA AISDR+C+ +N + ++SA+ ++
Sbjct: 80 LPDSFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139
Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C C GCNGG+P AWR+W G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 SCCGFECGMGCNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G +TP C ++C P Y +Y+ D G ++ V
Sbjct: 200 GEGG-ETPRCSRHC-EPGYSPSYKEDKHYGITSYGV 233
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 59/108 (54%), Positives = 73/108 (67%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ HY ++ VPR M +IY++GP+ F VY DFL YKSGVYQH G+
Sbjct: 216 YSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVTGEQ 275
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGV+N PYWL ANSWN WGD+G FKILRGE+ IE
Sbjct: 276 VGGHAIRLLGWGVDNGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIE 323
>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 341
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 14/156 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ--ISAQH 244
+P FDAR+KWP+CP++ + DQ CGSCWA A+SDR CI+ F Q ISA++
Sbjct: 86 IPDTFDARQKWPDCPTIGTVRDQGACGSCWAFGAVEAMSDRYCIS----FKEQVNISAEN 141
Query: 245 IVACTPNCW-GCNGGWPQLAWRFWG----HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
+++C C GC+GG+P AWR W + G+VTGG Y+S GCQPYT+ C+HH GP
Sbjct: 142 LLSCCETCGSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKCDHHEPGP 201
Query: 300 LQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
+NC+ G TP CK++C + SY+ +YR D GK
Sbjct: 202 YENCS--GSQSTPSCKRSCIS-SYDKSYRSDKHYGK 234
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ FSVYADF Y SGVYQH G +G HA+++LGWG EN +PYWLVANSWN
Sbjct: 249 EIMTNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGGHAIKILGWGTENGVPYWLVANSWN 308
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD G FKI+RG++E IE
Sbjct: 309 PSWGDSGFFKIIRGKDECGIE 329
>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
Length = 260
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 97/156 (62%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+W CP++ I DQ +CGSCWA A+SDR+CI +NG ++SA+ ++
Sbjct: 7 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 66
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEHHV G CT
Sbjct: 67 TCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGARPPCT 126
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C Y ++Y+ D G ++ V
Sbjct: 127 --GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 159
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 67/83 (80%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 167 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 226
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 227 WNADWGDNGFFKILRGENHCGIE 249
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
Length = 338
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/155 (45%), Positives = 92/155 (59%), Gaps = 5/155 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR+ WP CP++ I DQ +CGSCWA A+SDR+CI S G +SA +V
Sbjct: 86 IPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHLSADDLV 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GC+PY +APCEHHV G C+
Sbjct: 146 SCCHICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEHHVNGTRPPCS- 204
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C+ C SY Y D G K++ V
Sbjct: 205 --HGSTPSCQHKC-QASYSVEYAKDKNFGSKSYSV 236
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 61/84 (72%), Gaps = 2/84 (2%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVAN 135
++I +GP+ F+VY D + YKSGVYQH G +G HA+R+LGWGV E+ +PYWL+ N
Sbjct: 245 QEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLIGN 304
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SWN WGD+G F+ILRG++ IE
Sbjct: 305 SWNTDWGDNGFFRILRGQDHCGIE 328
>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
Length = 329
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/155 (45%), Positives = 92/155 (59%), Gaps = 5/155 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR+ WP CP++ I DQ +CGSCWA A+SDR+CI S G +SA +V
Sbjct: 86 IPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHLSADDLV 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GC+PY +APCEHHV G C+
Sbjct: 146 SCCHICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEHHVNGTRPPCS- 204
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C+ C SY Y D G K++ V
Sbjct: 205 --HGSTPSCQHKC-QASYSVEYAKDKNFGSKSYSV 236
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 37/68 (54%), Positives = 49/68 (72%), Gaps = 2/68 (2%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVAN 135
++I +GP+ F+VY D + YKSGVYQH G +G HA+R+LGWGV E+ +PYWL+ N
Sbjct: 245 QEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLIGN 304
Query: 136 SWNDHWGD 143
SWN WGD
Sbjct: 305 SWNTDWGD 312
>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
Length = 254
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 97/156 (62%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+W CP++ I DQ +CGSCWA A+SDR+CI +NG ++SA+ ++
Sbjct: 1 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 60
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEHHV G CT
Sbjct: 61 TCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGARPPCT 120
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C Y ++Y+ D G ++ V
Sbjct: 121 --GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 153
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 67/83 (80%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 161 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 220
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 221 WNADWGDNGFFKILRGENHCGIE 243
>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
Length = 346
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 69/182 (37%), Positives = 101/182 (55%), Gaps = 8/182 (4%)
Query: 160 MGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVS 219
+ ++ N DD++ +P +FD+R +WP CPS++ I DQS+CGSCWA
Sbjct: 72 VAIPSKYRVNEVTHDDIDD------SAIPSSFDSRTQWPNCPSIKSIRDQSSCGSCWAFG 125
Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
A A++DR+CIAS G +SA +++C C +GC+GG+P AW +W G+V+GG Y
Sbjct: 126 AAEAMTDRICIASKGAIQFTVSADDLLSCCDECGFGCDGGFPYAAWNYWVEKGIVSGGSY 185
Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
S+ GC+PY PCEHH G + T C+ C Y + Y D + G KA+
Sbjct: 186 TSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTCEHKC-QSGYATAYTNDKRYGAKAY 244
Query: 339 MV 340
V
Sbjct: 245 TV 246
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 66/100 (66%), Gaps = 2/100 (2%)
Query: 64 YFKKAHMVP-RCNAM-RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y KA+ V R A+ ++I HGP+ + VY DF Y G+Y+H G +G HAV+++G
Sbjct: 239 YGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKGIYKHTAGSYLGGHAVKMIG 298
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMG 161
WG EN IPYW+ +NSWN WG++G F+ILRG +E IE G
Sbjct: 299 WGTENGIPYWICSNSWNSDWGENGFFRILRGTDECGIESG 338
>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
Length = 225
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/142 (49%), Positives = 93/142 (65%), Gaps = 6/142 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD+R +WP+CP+++ I DQ +CGSCWA AISDR+CI S G +ISA+ ++
Sbjct: 13 LPENFDSRTQWPKCPTIQEIRDQGSCGSCWAFGAVEAISDRVCIHSKGKVNVEISAEDLL 72
Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C C +GCNGG+P AW FW G+V+GG + S GC+PYT+ PCEHHV G +CT
Sbjct: 73 SCCGMECGFGCNGGYPSGAWNFWTETGLVSGGLFKSHIGCRPYTIPPCEHHVNGSRPSCT 132
Query: 305 LLGKLKTPECKQNC---YNPSY 323
+ TP+C C Y PSY
Sbjct: 133 GE-EGDTPKCVMQCEAGYTPSY 153
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 36/75 (48%), Positives = 50/75 (66%), Gaps = 2/75 (2%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCNAMRQI--YEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ H+ ++ V A QI Y++GP+ F+VY DFLQYKSGVY+H GD+
Sbjct: 149 YTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGAFTVYEDFLQYKSGVYKHVTGDA 208
Query: 112 IGLHAVRVLGWGVEN 126
+G HA+R+LGWGVE+
Sbjct: 209 VGGHAIRILGWGVES 223
>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/156 (49%), Positives = 96/156 (61%), Gaps = 7/156 (4%)
Query: 183 NAKGLPRNFDAREKWPEC-PSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
NA+ +P +FDARE WP+C P + +I DQS CGSCWA A+SDR+CI SN IS
Sbjct: 80 NAQEIPDSFDAREAWPDCAPIIGNIRDQSTCGSCWAFGAVEAMSDRICIHSNATVKVNIS 139
Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
A+ + C C GCNGG P +AW W NG+VTGG+Y GC+ Y+ APCEHHV G L
Sbjct: 140 AEDPLDCCTICGMGCNGGMPAMAWLHWTVNGIVTGGNYEDTNGCKAYSFAPCEHHVDGDL 199
Query: 301 QNCTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGK 335
C G K TP+CK+ C + S TY+ DL G
Sbjct: 200 PPC---GPTKPTPDCKKECDSGS-SLTYQNDLTHGS 231
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 55/81 (67%), Positives = 63/81 (77%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ A FSVY DFL YKSGVYQH G+ G HA+++LGWGVEND PYWLVANSWN
Sbjct: 245 EIMTNGPVEASFSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVENDTPYWLVANSWN 304
Query: 139 DHWGDHGTFKILRGENEADIE 159
+ WGD G FKILRG NE IE
Sbjct: 305 EDWGDKGYFKILRGSNECGIE 325
>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
Length = 340
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/155 (45%), Positives = 93/155 (60%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD+R++WP CP++ I DQ +CGSCWA A+SDR+CI S G SA +V
Sbjct: 87 LPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GC+PY ++PCEHHV G C
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAH 206
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C C + SY Y D G K++ V
Sbjct: 207 GG--GTPKCSHVCQS-SYTVDYAKDKHFGSKSYSV 238
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K++ V R +I +GP+ F+VY D + YK GVYQH G +G HA+R+L
Sbjct: 230 HFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRIL 289
Query: 121 GWGVEND--IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV D IPYWL+ NSWN WGDHG F+ILRG++ IE
Sbjct: 290 GWGVWGDEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 330
>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
Length = 333
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 96/155 (61%), Gaps = 6/155 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD R KWP+C SL I DQ+NCGSCWA A A++DR+CIA G ISA+ I
Sbjct: 87 LPDNFDPRTKWPDCASLNEIRDQANCGSCWAFGSAEAMTDRICIAGKGNI--HISAEDIN 144
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GCNGG+P AW ++ GVV+GG Y + EGC PY+L C+HH G Q C
Sbjct: 145 DCCKSCGMGCNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDHHTTGKYQPCPA 204
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ + TP+C++ C Y +Y D +GKK++ V
Sbjct: 205 V--VPTPKCEKKCLT-GYPKSYSNDKTRGKKSYGV 236
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/83 (53%), Positives = 62/83 (74%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M+++ ++GP+ A F VY+DFL YK+GVY+H G G HAV+++G+G E+ YWLVANS
Sbjct: 243 MQELVDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGGHAVKIIGYGTESGQDYWLVANS 302
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN+ WGD G FKI +G++E IE
Sbjct: 303 WNEDWGDKGFFKIAKGKDECGIE 325
>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
Length = 387
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 96/155 (61%), Gaps = 1/155 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFD+RE WP+C S+R+I DQS+CGSCWA A+SDR+CIAS+G +SA ++
Sbjct: 105 IPENFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLL 164
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GCNGG P AWR+W +G+VTG +Y + GC+PY PCEHH + +
Sbjct: 165 SCCRSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCP 224
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C + TY D G A+ V
Sbjct: 225 HDLYPTPKCEKKCIADYTDKTYSEDKFYGASAYGV 259
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/84 (54%), Positives = 56/84 (66%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+++ HGPL F VY DFL Y GVY H G G HAV+++GWG+EN IPYW ANSW
Sbjct: 268 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLVGWGIENGIPYWTCANSW 327
Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
N WG+ G F+ILRG +E IE G
Sbjct: 328 NTDWGEDGFFRILRGVDECGIESG 351
>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
Length = 366
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 96/157 (61%), Gaps = 3/157 (1%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
K LP FDAR +W CP+++ I DQ +CGSCWA ++SDR+CI SNG ISA+
Sbjct: 112 KDLPDTFDARTQWSNCPTIKEIRDQGSCGSCWAFGAVESMSDRICIKSNGQQNAHISAED 171
Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+ +C +C GCNGG+ AW ++ +G+VTGG YNS +GCQPYT+ C+HHV G LQ C
Sbjct: 172 LTSCCRSCGNGCNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPYTVKACDHHVVGKLQPC 231
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ + TP CK C Y +Y D G A+ V
Sbjct: 232 SKK-EEHTPVCKHEC-ESGYNVSYTKDKHYGATAYSV 266
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 52/98 (53%), Positives = 65/98 (66%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCN-AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
HY A+ V M +I +GP+ F+VYADF QYKSGVY+H G +G HA++++G
Sbjct: 258 HYGATAYSVRGVQQIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGSPLGGHAIKIMG 317
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E YWLVANSWN WG+ GTFKILRG +E IE
Sbjct: 318 WGTEGGDDYWLVANSWNPDWGNQGTFKILRGRDECGIE 355
>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
Length = 333
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/184 (40%), Positives = 108/184 (58%), Gaps = 7/184 (3%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLE-TMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
G N A+ ++ + R+ L+ G + LP +FD+R WP CP++R I DQ
Sbjct: 44 GHNFANADVHYVKRLCGTHLNGPQLQKRFGFADDLDLPDSFDSRAAWPNCPTIREIRDQG 103
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NC-WGCNGGWPQLAWRFWG 268
+CGSCWA AISDR+C+ +NG ++SA+ +++C C GCNGG+P AWRFW
Sbjct: 104 SCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFKCGMGCNGGYPSGAWRFWT 163
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
G+V+GG Y+S GC+PY++ PCEHHV G +C + TP+C + C Y P+Y S
Sbjct: 164 ETGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPSCKGE-EGDTPKCMKTCEEGYTPAYGS 222
Query: 326 TYRF 329
F
Sbjct: 223 DKHF 226
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 59/125 (47%), Positives = 76/125 (60%), Gaps = 2/125 (1%)
Query: 37 KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYA 94
K ++ K K + Y P H+ ++ VP M IY++GP+ F VYA
Sbjct: 199 KGEEGDTPKCMKTCEEGYTPAYGSDKHFGATSYGVPSSEKEIMADIYKNGPVEGAFVVYA 258
Query: 95 DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
DF YKSGVYQH G+ +G HA+++LGWGVEN PYWL ANSWN WGD+G FKILRG++
Sbjct: 259 DFPLYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKD 318
Query: 155 EADIE 159
IE
Sbjct: 319 HCGIE 323
>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
Length = 342
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 66/159 (41%), Positives = 102/159 (64%), Gaps = 6/159 (3%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
++ G+P +FDAR +WP CPS+ I DQ++CGSCWA +V +ISDR+CIA++ T + S
Sbjct: 88 EDTAGIPESFDARTQWPHCPSISLIRDQADCGSCWAFAVGESISDRVCIATDANKTAEFS 147
Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
+ I+ C C +GC+GG+P AW ++ GVVTGG Y ++ C+PY ++PC +H
Sbjct: 148 VEDILTCCDECGFGCDGGFPDAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHPNETF 207
Query: 301 -QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
+NCT + TP CK +C Y +Y+ D +G+K++
Sbjct: 208 YRNCT---GVSTPSCKTSC-QKGYPVSYKDDKTRGRKSY 242
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 62/82 (75%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+ I +HGPLVA FSVY DF+ YK G+Y++ G G HAVR+LGWGVEN++ YW++ANSW
Sbjct: 253 KDILKHGPLVATFSVYEDFMYYKKGIYRYTHGGYEGGHAVRILGWGVENNVKYWIIANSW 312
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WG+ G F+++RG N+ IE
Sbjct: 313 NTDWGEDGFFRMVRGINDCGIE 334
>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
Length = 341
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/192 (36%), Positives = 109/192 (56%), Gaps = 11/192 (5%)
Query: 150 LRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQ 209
L G +E + ++ E DD++ + LP +FDAR +W CP++ I +Q
Sbjct: 58 LMGVHEESYKYPLPDKQEVLGESDDEI------SLADLPVDFDARLRWTSCPTISEIREQ 111
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
+CGSCWA++ + +SDRLCI SNG ++S +++C C + C GG+P AW +W
Sbjct: 112 GSCGSCWAIATTSVMSDRLCIGSNGVMNFRLSGLDMLSCCAICGFACQGGYPGAAWAYWA 171
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
G+V+GGDY SQ+GCQPYT+ PC+H G CT+ G ++ C+ C PSY+ ++
Sbjct: 172 RKGLVSGGDYGSQQGCQPYTIEPCDHSGNGSRPVCTVGGGVR---CQHLC-EPSYKVDFQ 227
Query: 329 FDLKKGKKAHMV 340
D K + +
Sbjct: 228 RDKNFASKVYSI 239
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/84 (58%), Positives = 60/84 (71%), Gaps = 2/84 (2%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVAN 135
++I +GP+ AI +VY DFL YK+GVY H G+ +G HAVR+LGWGV +PYWLVAN
Sbjct: 248 KEIMTNGPVQAILTVYEDFLSYKTGVYYHLEGEKVGPHAVRILGWGVWGTKKVPYWLVAN 307
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SW WGD+G F I RGEN DIE
Sbjct: 308 SWGSDWGDNGFFHIFRGENHCDIE 331
>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
Length = 340
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 69/155 (44%), Positives = 93/155 (60%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD+R++WP CP++ I DQ +CGSCWA A+SDR+CI S G SA +V
Sbjct: 87 LPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GC+PY ++PCEHHV G C
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAH 206
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G +TP+C C + Y Y D G K++ V
Sbjct: 207 GG--RTPKCSHVCQS-GYTVDYAKDKHFGSKSYSV 238
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K++ V R +I +GP+ F+VY D + YK GVYQH G +G HA+R+L
Sbjct: 230 HFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRIL 289
Query: 121 GWGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV E IPYWL+ NSWN WGDHG F+ILRG++ IE
Sbjct: 290 GWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 330
>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
Length = 340
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 69/155 (44%), Positives = 92/155 (59%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R++WP CP++ I DQ +CGSCWA A+SDR+CI S G SA +V
Sbjct: 87 IPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GC+PY ++PCEHHV G C
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAH 206
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C C SY Y D G K++ V
Sbjct: 207 GG--ATPKCSHVC-QSSYTVDYAKDKHFGSKSYSV 238
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K++ V R + +I +GP+ F+VY D + YK GVYQH G +G HA+R+L
Sbjct: 230 HFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRIL 289
Query: 121 GWGVEND--IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV D IPYWL+ NSWN WGD G F+ILRG++ IE
Sbjct: 290 GWGVWGDEKIPYWLIGNSWNTDWGDQGFFRILRGQDHCGIE 330
>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
Length = 340
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 98/156 (62%), Gaps = 4/156 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD R++WP CP++ I DQ +CGSCWA AISDR+C+ +N + ++SA+ ++
Sbjct: 80 LPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139
Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C C GCNGG+P AWR+W G+V+GG Y+S GC+ YT+ PCEHHV G CT
Sbjct: 140 SCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G +TP C ++C P Y +Y+ D G ++ V
Sbjct: 200 GEGG-ETPRCSRHC-EPGYSPSYKEDKHYGITSYGV 233
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 59/108 (54%), Positives = 71/108 (65%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ HY ++ VPR M +IY++GP+ F VY DFL YKSGVYQH G+
Sbjct: 216 YSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQ 275
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWL ANSWN WG G FKILRGE+ IE
Sbjct: 276 VGGHAIRILGWGVENGTPYWLAANSWNTDWGITGFFKILRGEDHCGIE 323
>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
Length = 339
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 72/148 (48%), Positives = 92/148 (62%), Gaps = 7/148 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE+W CP++ I DQ +CGSCWA AISDR CI +NG ++SA+ ++
Sbjct: 80 LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 139
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG Y+S GC PYT+ PCEHHV G CT
Sbjct: 140 TCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
G+ TP C ++C Y+PSY+ F
Sbjct: 200 --GEGDTPRCNKSCEAGYSPSYKEDKHF 225
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 52/83 (62%), Positives = 65/83 (78%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+L WGVEN +PYWL ANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWLAANS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 300 WNLDWGDNGFFKILRGENHCGIE 322
>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
Length = 330
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 69/155 (44%), Positives = 93/155 (60%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD+R++WP CP++ I DQ +CGSCWA A+SDR+CI S G SA +V
Sbjct: 77 LPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 136
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GC+PY ++PCEHHV G C
Sbjct: 137 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAH 196
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G +TP+C C + Y Y D G K++ V
Sbjct: 197 GG--RTPKCSHVCQS-GYTVDYAKDKHFGSKSYSV 228
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K++ V R +I +GP+ F+VY D + YK GVYQH G +G HA+R+L
Sbjct: 220 HFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRIL 279
Query: 121 GWGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV E IPYWL+ NSWN WGDHG F+ILRG++ IE
Sbjct: 280 GWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 320
>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
Length = 351
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/156 (48%), Positives = 93/156 (59%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R +WP CPS+ I DQS+CGSCWAVS A ISDR+CIAS G ISA I
Sbjct: 97 IPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASKGQTQVSISADDIN 156
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC GCNGG+P AWR + NG VTGG Y + GC+PY PCEHHV G
Sbjct: 157 ACCGMACGNGCNGGYPIEAWRHYVKNGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHYKPC 216
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
T +C+++C Y TY+ DL G+ A+ V
Sbjct: 217 PSDMYPTDKCERSC-QAGYSLTYKQDLHFGQSAYAV 251
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/102 (49%), Positives = 68/102 (66%), Gaps = 2/102 (1%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ + A+ V + ++I +GP+ F+VYADF Y GVY H G S+G HAV++L
Sbjct: 243 HFGQSAYAVSKKATEIQKEIMTNGPVEVAFTVYADFEVYSGGVYVHTAGASLGGHAVKML 302
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
GWGV+N PYWL ANSWN+ WG++G F+I+RG NE IE G
Sbjct: 303 GWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIEHGV 344
>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
Length = 331
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 109/199 (54%), Gaps = 11/199 (5%)
Query: 145 GTFKILRGENEADI--EMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPS 202
G K G +E DI +MG V D L K +P FDAR +WP+CP+
Sbjct: 38 GVNKRFEGLSEVDIRRQMG----VLQGGPLDIKLPEKDITPLKDVPDMFDARMQWPDCPT 93
Query: 203 LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQ 261
++ I DQ CGSCWA ++SDR CI N + ISA+ ++AC C GCNGG+
Sbjct: 94 IKEIRDQGACGSCWAFGAVESMSDRFCIHFNQ--SAHISAEDLMACCETCGMGCNGGYLG 151
Query: 262 LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNP 321
AWR++ H G+VTGG YNS+EGCQPY +A C+HHV G Q C + TP C + C
Sbjct: 152 AAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQPCA-SKEEHTPRCSKTC-EA 209
Query: 322 SYESTYRFDLKKGKKAHMV 340
Y+ ++ D G A+ V
Sbjct: 210 GYDVSFEKDKHFGASAYSV 228
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ F+VYADF YKSGVYQH G +G HA+R+LGWG EN PYWLVANSWN
Sbjct: 238 EIMTNGPVEGAFTVYADFPTYKSGVYQHTSGAMLGGHAIRILGWGTENGTPYWLVANSWN 297
Query: 139 DHWGDHGTFKILRGENEADIE 159
+ WG G FKI+RG+++ IE
Sbjct: 298 EDWGAMGYFKIIRGKDDCGIE 318
>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
Length = 322
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 73/166 (43%), Positives = 99/166 (59%), Gaps = 5/166 (3%)
Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
E +G LP +FDARE+W CP++ I DQ +CGS WA A+SDR+CI +NG
Sbjct: 53 ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSSWAFGAVEAMSDRICIHTNGRV 112
Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
++SA+ ++ C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEH
Sbjct: 113 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 172
Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
HV G CT G+ TP+C + C Y ++Y+ D G ++ V
Sbjct: 173 HVNGARPPCT--GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 215
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 67/83 (80%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 223 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 282
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 283 WNADWGDNGFFKILRGENHCGIE 305
>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
Length = 373
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 72/167 (43%), Positives = 95/167 (56%), Gaps = 6/167 (3%)
Query: 178 TMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIAS---NG 234
T+ + LP NFDARE WP+CP++R I DQ +CGSCWA AISDR CI S
Sbjct: 109 TLDVSALRVLPENFDAREHWPDCPTIREIRDQGSCGSCWAFGAVEAISDRTCIHSPEGKP 168
Query: 235 YFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE 293
++A +++C C GCNGG+P AW +W H G+VTGG+Y+S EGC PY + C+
Sbjct: 169 RVIAHLAADDVLSCCTECGAGCNGGFPGSAWSYWVHKGIVTGGNYDSDEGCMPYPIKACD 228
Query: 294 HHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
HHV G L C TP C + C Y+ + D G+ A+ V
Sbjct: 229 HHVNGTLGPCDKTIP-PTPRCVRMC-RKGYDVDFMDDKHYGRHAYSV 273
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 56/99 (56%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY + A+ VP Q I +GP+ A F+VY DFL YKSGVYQ + ++G HA+R+L
Sbjct: 265 HYGRHAYSVPAKAKQIQAEIMMNGPVEADFTVYEDFLHYKSGVYQRHTDSALGGHAIRLL 324
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN +PYWL ANSWN WGD G FKILRG +E IE
Sbjct: 325 GWGVENGVPYWLAANSWNTEWGDKGFFKILRGSDECGIE 363
>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
Length = 330
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 72/184 (39%), Positives = 108/184 (58%), Gaps = 7/184 (3%)
Query: 146 TFKILRGENEADIEMGFNNRVEANSSEDDDL-ETMGCQNAKGLPRNFDAREKWPECPSLR 204
T + G N +++M + ++ L E + LP +FD+R++WP CP++
Sbjct: 28 TLVVRAGHNFHNVDMSYLKKLCGTYLHGPKLPERFAFADDVELPDSFDSRKQWPSCPTIN 87
Query: 205 HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NC-WGCNGGWPQL 262
I DQ +CGSCWA AISDR+C+ +NG +ISA+ +++C C GCNGG+P
Sbjct: 88 EIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEISAEDLLSCCGFECGMGCNGGYPSG 147
Query: 263 AWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---Y 319
AW++W G+V+GG Y+S GC+PY++ PCEHH G C+ G +TPEC + C Y
Sbjct: 148 AWKYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNGTRPPCSGEGG-ETPECVKKCEDGY 206
Query: 320 NPSY 323
P+Y
Sbjct: 207 TPAY 210
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 75/114 (65%), Gaps = 2/114 (1%)
Query: 48 KKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
KK + Y P HY ++ +PR M +IY++GP+ F VY+DFL YKSGVYQ
Sbjct: 200 KKCEDGYTPAYKQDKHYGVTSYGIPRSEKEIMAEIYKNGPVEGAFVVYSDFLMYKSGVYQ 259
Query: 106 HNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
H G+ +G HA+R+LGWGV+N PYWL ANSWN WG+ G F+ILRG++ IE
Sbjct: 260 HVSGEEVGGHAIRILGWGVDNGTPYWLAANSWNTDWGEDGFFRILRGQDHCGIE 313
>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
Length = 346
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 99/159 (62%), Gaps = 11/159 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR KWP C S++HI DQ+NCGSCWAVS A+ +SDR+CIAS IS+ V
Sbjct: 94 IPESFDARTKWPNCTSIKHIRDQANCGSCWAVSTASVLSDRICIASKQKKQVHISSIDFV 153
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GC GGWP A+ ++ + GVVTGGDY S+ GC+PY PC HH N T
Sbjct: 154 SCCDSCGFGCEGGWPIDAFEYYSYQGVVTGGDYGSKTGCRPYPFHPCGHH-----GNETY 208
Query: 306 LGKL----KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TPEC + C Y+++YR D G+ + V
Sbjct: 209 YGECPKEESTPECVKQC-QKGYKNSYRRDKTWGEDYYEV 246
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 59/82 (71%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I GP+V+ F+VY DF Y G+Y+H G + G HA++++GWG E ++PYW++ANSW
Sbjct: 255 REIMRSGPVVSSFTVYDDFSYYVKGIYKHTAGKARGSHAIKIIGWGTEKNVPYWIIANSW 314
Query: 138 NDHWGDHGTFKILRGENEADIE 159
++ WG+ G F+++RG N IE
Sbjct: 315 HNDWGEKGFFRMVRGTNHCGIE 336
>gi|197129222|gb|ACH45720.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 236
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 68/142 (47%), Positives = 94/142 (66%), Gaps = 6/142 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD+R +WP CP++ I DQ +CGSCWA AISDR+C+ +N + ++SA+ ++
Sbjct: 80 LPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139
Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C C GCNGG+P AWR+W G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 SCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYSIPPCEHHVNGTRPPCT 199
Query: 305 LLGKLKTPECKQNC---YNPSY 323
G TP C ++C Y+PSY
Sbjct: 200 GEGG-STPRCSRHCEPGYSPSY 220
>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
Length = 340
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 69/155 (44%), Positives = 92/155 (59%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD+R++WP CP++ I DQ +CGSCWA A+SDR+CI S G SA +V
Sbjct: 87 LPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GC+PY ++PCEHHV G C
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCA- 205
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C C + SY Y D G K++ V
Sbjct: 206 -NGSGTPKCSHVCQS-SYTVDYAKDKHFGSKSYSV 238
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K++ V R +I +GP+ F+VY D + YK GVYQH G +G HA+R+L
Sbjct: 230 HFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRIL 289
Query: 121 GWGVEND--IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV + IPYWL+ NSWN WGDHG F+ILRG++ IE
Sbjct: 290 GWGVWGNEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 330
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
Length = 302
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 99/149 (66%), Gaps = 2/149 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDAR KW CPS+ + DQ NC S +A+SVA+A+SDR+CI SNG ++SAQ I+
Sbjct: 53 LPKSFDARAKWYMCPSIGMVYDQGNCKSSYAISVASAVSDRICIHSNGTVKPKLSAQQIL 112
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG +W F+ +G+V+GG+Y S EGCQPYT+ PC+ H + ++N
Sbjct: 113 SCCYLCGDGCSGGQHFESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQ-HTETAVENACS 171
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKG 334
L TPECK CYNP Y + Y D +G
Sbjct: 172 NKTLFTPECKVQCYNPDYGTRYVKDNHQG 200
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/91 (54%), Positives = 65/91 (71%)
Query: 69 HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI 128
+ VP AM++IYE+GP+ A F +Y DF+ Y+SGVY +N G + AV++LGWG EN
Sbjct: 203 YRVPAYTAMKEIYENGPITASFYMYQDFVNYQSGVYAYNSGKYVTTQAVKILGWGEENGT 262
Query: 129 PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
PYWL ANS+N +WGD+G KILRG NE IE
Sbjct: 263 PYWLAANSFNTYWGDNGFVKILRGANECYIE 293
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
Length = 351
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 74/176 (42%), Positives = 102/176 (57%), Gaps = 24/176 (13%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FD+RE+WP CP+L+ I DQ +CGSCWA + A+SDR+CI SN + ++SAQ ++
Sbjct: 79 LPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLL 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQ---------------------EGC 284
C +C GCNGG+P AW FW +G+V+GG Y+S GC
Sbjct: 139 TCCNSCGMGCNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGC 198
Query: 285 QPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+PYT+ PCEHHV G +C+ G TPEC C Y +Y+ D GK ++ V
Sbjct: 199 RPYTIPPCEHHVNGSRPSCSGEGG-DTPECIFRC-EAGYSPSYKQDKHFGKTSYSV 252
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 48/82 (58%), Positives = 63/82 (76%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++IY++GP+ F+VY DF+ YKSGVYQH G ++G HA+++LGWG EN +PYWL ANSW
Sbjct: 261 QEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSALGGHAIKMLGWGEENGVPYWLCANSW 320
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WGD+G FKILRG + IE
Sbjct: 321 NTDWGDNGFFKILRGADHCGIE 342
>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
Length = 333
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 72/184 (39%), Positives = 109/184 (59%), Gaps = 7/184 (3%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLE-TMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
G N A+ ++ + R+ + L+ G + LP +FD+R WP CP++R I DQ
Sbjct: 44 GHNFANADLHYVKRLCGTLLKGPQLQKRFGFADGLELPDSFDSRAAWPNCPTIREIRDQG 103
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN--CWGCNGGWPQLAWRFWG 268
+CGSCWA AISDR+C+ +NG ++SA+ +++C + GCNGG+P AW+FW
Sbjct: 104 SCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGDECGMGCNGGYPSGAWQFWT 163
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
G+V+GG Y+S GC+PY++ PCEHHV G C + TP+C + C Y+P+Y +
Sbjct: 164 ETGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPACKGE-EGDTPKCVKQCEEGYSPAYGT 222
Query: 326 TYRF 329
F
Sbjct: 223 DKHF 226
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 59/125 (47%), Positives = 78/125 (62%), Gaps = 2/125 (1%)
Query: 37 KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYA 94
K ++ K K+ + Y P H+ ++ VP M +IY++GP+ F VYA
Sbjct: 199 KGEEGDTPKCVKQCEEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYA 258
Query: 95 DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
DF YKSGVYQH G+ +G HA+++LGWGVEN PYWL ANSWN WGD+G FKILRG++
Sbjct: 259 DFPLYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKD 318
Query: 155 EADIE 159
IE
Sbjct: 319 HCGIE 323
>gi|60600065|gb|AAX26576.1| unknown [Schistosoma japonicum]
Length = 190
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 71/165 (43%), Positives = 99/165 (60%), Gaps = 3/165 (1%)
Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
R + +DI + N + + L T LP++FDAR++W CPS+ I DQS
Sbjct: 28 RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQS 87
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
+CGSCWA A+SDR+CI S G + +SA+++V+C +C GCNGG+P AW +W +
Sbjct: 88 SCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 147
Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPEC 314
G+VTG YN+ GCQPY PCEH+ GPL C G ++TP C
Sbjct: 148 QGIVTGDLYNTTNGCQPYEFPPCEHNTLGPLPVCD--GDVETPPC 190
>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
Length = 376
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 96/155 (61%), Gaps = 1/155 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+RE WP+C S+R+I DQS+CGSCWA A+SDR+CIAS+G +SA ++
Sbjct: 106 IPESFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLL 165
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GCNGG P AWR+W +G+VTG +Y + GC+PY PCEHH + +
Sbjct: 166 SCCRSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCP 225
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C + TY D G A+ V
Sbjct: 226 HDLYPTPKCEKKCIADYTDKTYSEDKFYGHSAYGV 260
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 56/85 (65%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+++ HGPL F VY DFL Y GVY H G G HAV+++GWG+E+ IPYW ANSW
Sbjct: 269 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIEDGIPYWTCANSW 328
Query: 138 NDHWGDHGTFKILRGENEADIEMGF 162
N WG+ G F+ILRG +E IE G
Sbjct: 329 NTDWGEDGFFRILRGVDECGIESGV 353
>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
Length = 340
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 69/150 (46%), Positives = 88/150 (58%), Gaps = 8/150 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R++WP CP++ I DQ CGSCWA A+SDR+CI S G SA +V
Sbjct: 87 IPEEFDSRKQWPNCPTIGEIRDQGECGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG+P AW +W G+V+GG Y S +GC+PY +APCEHHV G C
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEIAPCEHHVNGTRPPCGH 206
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
G TP+C C ES Y D K K
Sbjct: 207 GG--GTPKCSHVC-----ESGYTVDYAKDK 229
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K++ V R + +I +GP+ F+VY D + YK GVYQH G +G HA+R+L
Sbjct: 230 HFGSKSYSVKRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHQHGKELGGHAIRIL 289
Query: 121 GWGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV E IPYWL+ NSWN WGD+G F+ILRG++ IE
Sbjct: 290 GWGVWGEEKIPYWLIGNSWNTDWGDNGFFRILRGQDHCGIE 330
>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 398
Score = 144 bits (362), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 95/155 (61%), Gaps = 1/155 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAREKW +C SL++I DQS+CGSCWA A+SDR+CIASNG +SA ++
Sbjct: 121 IPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLL 180
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GC+GG P AW++W G+VTG ++ ++GC+PY PCEHH
Sbjct: 181 SCCKSCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCK 240
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C + E TY D G+ A+ V
Sbjct: 241 HDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGV 275
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 66/113 (58%), Gaps = 14/113 (12%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I HGP+ F VY DFL Y G+Y H G G HAV++LGWGVE +PYWLVANSW
Sbjct: 284 KEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSW 343
Query: 138 NDHWGDHGTFKILRGENEADIEMG--------------FNNRVEANSSEDDDL 176
N WG+ G F+I+RG +E IE ++ R ++ EDDD+
Sbjct: 344 NTDWGEDGFFRIIRGIDECGIESSVVGGLPKLNRTYKKYHRRYRLDNDEDDDI 396
>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
Length = 225
Score = 144 bits (362), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 98/156 (62%), Gaps = 4/156 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD+R +WP CP++R I DQ +CGSCWA ++SDR+C+ S G ++SA+ ++
Sbjct: 13 LPDNFDSRTQWPNCPTIREIRDQGSCGSCWAFGAVESMSDRVCVHSGGKQNVEVSAEDLL 72
Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C C GCNGG+P AW++W G+V+GG Y S GC+PYT+ PCEHHV G +C+
Sbjct: 73 SCCGFECGMGCNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPPCEHHVNGSRPSCS 132
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C Q C + Y Y D G+ A+ V
Sbjct: 133 GEGG-DTPKCVQKC-DSGYTPAYEKDKIYGQSAYSV 166
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 49/66 (74%), Gaps = 2/66 (3%)
Query: 64 YFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y + A+ VP + M +IY+ GP+ F+VY DFL YKSGVYQH+ G+++G HA+++LG
Sbjct: 159 YGQSAYSVPSSPESIMEEIYKDGPVEGAFTVYEDFLLYKSGVYQHHTGEAVGGHAIKILG 218
Query: 122 WGVEND 127
WG+EN+
Sbjct: 219 WGIENN 224
>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
Length = 294
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 75/195 (38%), Positives = 108/195 (55%), Gaps = 13/195 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C GC GG+P +AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGDGCQGGFPGVAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W G+VTGG + GCQPY CEHH +G C KTP+CKQ C Y++
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACG-TKIYKTPQCKQKC-QKGYKTP 228
Query: 327 YRFDLKKGKKAHMVL 341
Y D G++++ V+
Sbjct: 229 YEQDKHYGEESYNVI 243
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 31/43 (72%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
++I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++
Sbjct: 251 KEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRII 293
>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
Length = 351
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 75/156 (48%), Positives = 92/156 (58%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R +WP CPS+ I DQS+CGSCWAVS A ISDR+CIASNG ISA I
Sbjct: 97 IPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQLSISADDIN 156
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC GCNGG+P AWR + G VTGG Y + GC+PY PCEHHV G
Sbjct: 157 ACCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHYKPC 216
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
T +C+++C Y TY DL G+ A+ V
Sbjct: 217 PSNMYPTDKCERSC-QAGYALTYTQDLHFGQSAYAV 251
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/102 (50%), Positives = 67/102 (65%), Gaps = 2/102 (1%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ + A+ V + ++I HGP+ FSVY DF Y GVY H G S+G HAV++L
Sbjct: 243 HFGQSAYAVSKKVTEIQKEIMTHGPVEVAFSVYEDFEHYSGGVYVHTAGASLGGHAVKML 302
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
GWGV+N PYWL ANSWN+ WG++G F+I+RG NE IE G
Sbjct: 303 GWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIESGV 344
>gi|221221056|gb|ACM09189.1| Cathepsin B precursor [Salmo salar]
gi|221222300|gb|ACM09811.1| Cathepsin B precursor [Salmo salar]
Length = 207
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 97/156 (62%), Gaps = 4/156 (2%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKG--LPRNFDAREKWPECPSLRHIADQ 209
G N +++ + R+ + L TM Q A LP FD R++WP CP+L+ I DQ
Sbjct: 43 GPNFHNVDYSYVKRLCGTLLKGPKLPTM-VQYAGDVELPDTFDPRQQWPNCPTLKEIRDQ 101
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
+CGSCWA A AISDR+CI SN + +IS++ +++C +C GCNGG+P AW FW
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLSCCDSCGMGCNGGYPSAAWDFWT 161
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
G+VTGG Y+S GC+PY++ PCEHHV G CT
Sbjct: 162 TEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPPCT 197
>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
Length = 279
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 75/195 (38%), Positives = 107/195 (54%), Gaps = 13/195 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C GC GG+P +AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGDGCQGGFPGVAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W G+VTGG + GCQPY CEHH +G C KTP+CKQ C Y++
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228
Query: 327 YRFDLKKGKKAHMVL 341
Y D G +++ V+
Sbjct: 229 YEQDKHYGDESYNVI 243
>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
Length = 278
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 94/155 (60%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR+KWP CPS+ I DQS+C SCWAVS A+AI+DR+CI SNG ++SA IV
Sbjct: 63 LPESFDARQKWPNCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIV 122
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG P ++W +W GVVTGG + GC PY C H V P
Sbjct: 123 SCCAYCGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCP 182
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C+ Y TY D KGK ++ V
Sbjct: 183 RDIYPTPKCEKKCHA-GYNKTYEQDKVKGKSSYNV 216
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 41/57 (71%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
+ M +I ++GP+ IF ++ DFL YKSG+Y + G +G HA+RV+GWGVEN + YW
Sbjct: 222 DIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVNYW 278
>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
Length = 356
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 95/156 (60%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD+RE WP+CPS+ + DQ +CGSCWA + AISDR CI SN FT +S++ ++
Sbjct: 95 LPANFDSREAWPDCPSISEVRDQGSCGSCWAFGASEAISDRTCIHSNAAFTFDLSSEDLL 154
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C GCNGG+PQ AW +W NG+V+GG Y+ GCQPY + PCEHH +G CT
Sbjct: 155 SCCGYVCGNGCNGGFPQAAWEYWVQNGLVSGGLYHGT-GCQPYAIEPCEHHTEGDRPPCT 213
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ TP+C C + Y + D G A+ +
Sbjct: 214 GE-EGTTPKCSHKCVD-GYTGNFAQDKHYGSVAYRI 247
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 68/111 (61%), Gaps = 2/111 (1%)
Query: 63 HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY A+ +P M +IY++GP+ F VY DF YKSGVY H+ G ++G HA+RVL
Sbjct: 239 HYGSVAYRIPANEKAIMNEIYKNGPVEGAFIVYEDFPTYKSGVYSHHTGSALGGHAIRVL 298
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSS 171
GWG EN YWL NSWN WG++G FKI RG NE IE + A+ S
Sbjct: 299 GWGEENGEKYWLCGNSWNTDWGNNGFFKIKRGVNECGIESEMVGGIPASES 349
>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sj31; Flags: Precursor
gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
Length = 342
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/194 (39%), Positives = 107/194 (55%), Gaps = 13/194 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM N R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRNRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C GC GG+P +AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCQGGFPGVAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W G+VTGG + GCQPY CEHH +G C KTP+CKQ C Y++
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228
Query: 327 YRFDLKKGKKAHMV 340
Y D G +++ V
Sbjct: 229 YEQDKHYGDESYNV 242
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 60/82 (73%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 251 RDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+++RG +E IE
Sbjct: 311 NEDWGEKGLFRMVRGRDECSIE 332
>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 508
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 67/136 (49%), Positives = 84/136 (61%), Gaps = 2/136 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+NFDAR+ WP C S+ I DQS+CGSCWA A+SDRLCI SNG F +SA ++
Sbjct: 86 LPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLL 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GC GG+P +AW +W +G+VTGG GC+ Y CEHHVQG C
Sbjct: 146 SCCKDCGFGCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCP- 204
Query: 306 LGKLKTPECKQNCYNP 321
TPEC Q C P
Sbjct: 205 RELYPTPECVQQCDTP 220
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 87/182 (47%), Gaps = 26/182 (14%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M++I GP+ AIF++Y DFL+Y SGVY H G + HAVR+LGWG ++PYWL+A
Sbjct: 243 SIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGNVPYWLIA 302
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMG----CQNAKGLPRN 190
NSWN+ WG+ G K LRG NE I EDD +G C K +P
Sbjct: 303 NSWNEDWGEEGYMKFLRGYNECGI-------------EDDVTAVLGNAWSCPAIKVVPSK 349
Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
F SL + CG SDR + + +S +V+ P
Sbjct: 350 FI---------SLMKELYANACGCFRVYYTLLLYSDRYGSSYKIWLPTDVSGDLVVSFYP 400
Query: 251 NC 252
+C
Sbjct: 401 DC 402
>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
Length = 339
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/148 (48%), Positives = 91/148 (61%), Gaps = 7/148 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE+W CP++ I DQ +CGSCWA AISDR CI +NG ++SA+ ++
Sbjct: 80 LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 139
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEHHV G CT
Sbjct: 140 TCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
G+ T C ++C Y+PSY+ F
Sbjct: 200 --GEGDTHRCNKSCEAGYSPSYKEDKHF 225
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 66/83 (79%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWGVEN +PYWL ANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAANS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 300 WNLDWGDNGFFKILRGENHCGIE 322
>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
Length = 313
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 67/136 (49%), Positives = 85/136 (62%), Gaps = 2/136 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+NFDAR KWP C S+ I DQS+CGSCWA A+SDRLCI SNG F +SA ++
Sbjct: 86 LPKNFDARSKWPHCSSVSEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGSFNKSLSAVDLL 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GC GG+P +AW +W +G+VTGG GC+ Y C+HHVQG C
Sbjct: 146 SCCKDCGFGCRGGYPAVAWDYWRTHGIVTGGSKEDPSGCRSYPFPKCDHHVQGHYPPCPR 205
Query: 306 LGKLKTPECKQNCYNP 321
TPEC Q+C P
Sbjct: 206 Q-IYPTPECVQDCDTP 220
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 40/71 (56%), Positives = 54/71 (76%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M++I GP+ A+F+VY DFLQYKS VY H +G + HA+R+LGWG E D+PYWL+A
Sbjct: 243 SIMKEIMLRGPVEAVFTVYEDFLQYKSRVYFHAWGAPMSGHAIRILGWGEEGDVPYWLIA 302
Query: 135 NSWNDHWGDHG 145
NSWN+ WG+ G
Sbjct: 303 NSWNEDWGEKG 313
>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 352
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 95/155 (61%), Gaps = 1/155 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAREKW +C SL++I DQS+CGSCWA A+SDR+CIASNG +SA ++
Sbjct: 80 IPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLL 139
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GC+GG P AW++W G+VTG ++ ++GC+PY PCEHH
Sbjct: 140 SCCKSCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCK 199
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C + E TY D G+ A+ V
Sbjct: 200 HDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGV 234
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 56/82 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I HGP+ F VY DFL Y G+Y H G G HAV++LGWGVE +PYWLVANSW
Sbjct: 243 KEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSW 302
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WG+ G F+I+RG +E IE
Sbjct: 303 NTDWGEDGFFRIIRGIDECGIE 324
>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
Length = 344
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 102/171 (59%), Gaps = 5/171 (2%)
Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
D + + + + +P +FDARE+WP C S+ +I DQS+CGSCWA + A AISDR CIASNG
Sbjct: 70 DEDIVATEVSDAIPDHFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNG 129
Query: 235 YFTGQISAQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
+S++ +++C + GC GG+P AW++WG +G+VTGG Y SQ GC+PY++A
Sbjct: 130 AVNTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIA 189
Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
PC V G TP+C C N +Y + Y D G A+ V
Sbjct: 190 PCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAV 240
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 47/81 (58%), Positives = 61/81 (75%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I ++GP+ F+VY DF QY +GVY H G S+G HAV++LGWGV+N PYWLVANSWN
Sbjct: 250 EILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVANSWN 309
Query: 139 DHWGDHGTFKILRGENEADIE 159
+WG+ G F+I+RG NE IE
Sbjct: 310 INWGEKGYFRIIRGLNECGIE 330
>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
Length = 335
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/164 (44%), Positives = 100/164 (60%), Gaps = 5/164 (3%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
+ A +P ++D R+ WP+C S+ +I DQS+CGSCWAV+ A AISDR CIASNG +S
Sbjct: 68 ETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLS 127
Query: 242 AQHIVACTP---NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ 297
A+ I+ C NC GC GG+P AWR+W NG+VTGG + SQ GC+PY++APC +
Sbjct: 128 AEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETID 187
Query: 298 GPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
G + TP+C+ +C N SY Y D G A+ +
Sbjct: 188 GVTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFGASAYAI 231
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 62/99 (62%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+ + R +I HGP+ F VY DF YK+G+Y H G +G HAV++L
Sbjct: 223 HFGASAYAIGRSAKQIQTEILAHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGGHAVKML 282
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV+N PYWL ANSWN WG+ G F+ILRG +E IE
Sbjct: 283 GWGVDNGTPYWLAANSWNTVWGEKGYFRILRGVDECGIE 321
>gi|221219800|gb|ACM08561.1| Cathepsin B precursor [Salmo salar]
gi|221222296|gb|ACM09809.1| Cathepsin B precursor [Salmo salar]
Length = 205
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 97/156 (62%), Gaps = 4/156 (2%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKG--LPRNFDAREKWPECPSLRHIADQ 209
G N +++ + R+ + L TM Q A LP FD R++WP CP+L+ I DQ
Sbjct: 43 GPNFHNVDYSYVKRLCGTLLKGPKLPTM-VQYAGDVELPDTFDPRQQWPNCPTLKEIRDQ 101
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
+CGSCWA A AISDR+CI SN + +IS++ +++C +C GCNGG+P AW FW
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLSCCDSCGMGCNGGYPSAAWDFWT 161
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
G+VTGG Y+S GC+PY++ PCEHHV G CT
Sbjct: 162 TEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPPCT 197
>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
Length = 351
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/156 (48%), Positives = 91/156 (58%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R +WP CPS+ I DQS+CGSCWAVS A ISDR+CIASNG ISA I
Sbjct: 97 VPDSFDSRTQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQISISADDIN 156
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC GCNGG+P AWR + G VTGG Y + GC+PY PCEHHV G
Sbjct: 157 ACCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQEKSGCKPYPYPPCEHHVNGTHYKPC 216
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
T +C+ +C Y TY DL G+ A+ V
Sbjct: 217 PSNMYPTDKCEHSC-QAGYPLTYTQDLHFGQSAYAV 251
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 50/102 (49%), Positives = 67/102 (65%), Gaps = 2/102 (1%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ + A+ V + ++I HGP+ F+VY DF Y GVY H G S+G HAV++L
Sbjct: 243 HFGQSAYAVSKKPAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKML 302
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
GWGV+N PYWL ANSWN+ WG++G F+I+RG NE IE G
Sbjct: 303 GWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIESGV 344
>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
Length = 344
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 101/171 (59%), Gaps = 5/171 (2%)
Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
D + + + + +P FDARE+WP C S+ +I DQS+CGSCWA + A AISDR CIASNG
Sbjct: 70 DEDIVATEVSDAIPDRFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNG 129
Query: 235 YFTGQISAQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
+S++ +++C + GC GG+P AW++WG +G+VTGG Y SQ GC+PY++A
Sbjct: 130 AVNTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIA 189
Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
PC V G TP+C C N +Y + Y D G A+ V
Sbjct: 190 PCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAV 240
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 47/81 (58%), Positives = 61/81 (75%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I ++GP+ F+VY DF QY +GVY H G S+G HAV++LGWGV+N PYWLVANSWN
Sbjct: 250 EILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVANSWN 309
Query: 139 DHWGDHGTFKILRGENEADIE 159
+WG+ G F+I+RG NE IE
Sbjct: 310 INWGEKGYFRIIRGLNECGIE 330
>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
Length = 337
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 97/156 (62%), Gaps = 2/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P ++D R+ W +C S+ +I DQS+CGSCWAV+ A ISDRLCIASNG +SA+ ++
Sbjct: 78 IPESYDVRDHWSKCISVDNIRDQSDCGSCWAVAAAETISDRLCIASNGSINTFVSAEDLL 137
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC+GG+P AWR+W G+V+GG Y SQ GC+PY++APC V G
Sbjct: 138 SCCTSCGDGCDGGYPLQAWRYWVKQGLVSGGSYESQYGCKPYSIAPCGQTVNGVTWPKCP 197
Query: 306 LGKLKTPECKQNCYN-PSYESTYRFDLKKGKKAHMV 340
+ TPEC +C + SY Y D G A+ V
Sbjct: 198 AQEEATPECASHCTSKSSYSVAYEKDKHYGLSAYPV 233
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/99 (56%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY A+ V R A Q I +HGP+ A F VY+DF +YKSG+Y H G +G HAV++L
Sbjct: 225 HYGLSAYPVGRKEAQIQTEILQHGPVEAGFLVYSDFYRYKSGIYTHVSGQELGGHAVKIL 284
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWLVANSWN +WG+ G F+ILRG NE IE
Sbjct: 285 GWGVENGTKYWLVANSWNINWGEKGYFRILRGRNECGIE 323
>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
Length = 350
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/163 (44%), Positives = 100/163 (61%), Gaps = 9/163 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD RE +P+C SL+ + DQSNCGSCWA AISDR+CIAS +IS+++++
Sbjct: 86 LPESFDLREAYPKCESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENLL 145
Query: 247 AC---TPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY-----NSQEGCQPYTLAPCEHHVQ 297
+C T C GCNGG+ AW ++ G+V+G Y NS+ CQPY+ PC HHVQ
Sbjct: 146 SCCRGTFACGMGCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSHHVQ 205
Query: 298 GPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G Q CT L + TP+C C + +++Y DL KG ++ V
Sbjct: 206 GEYQACTDLPQFNTPKCYTECNSQYTQNSYEQDLHKGVSSYSV 248
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 50/84 (59%), Positives = 62/84 (73%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+IY++G A F+VY+DFL Y SGVYQ+ G +G HA+++LGWGVEN PYWL ANSWN
Sbjct: 258 EIYQYGSTTASFNVYSDFLTYSSGVYQNTSGSYMGGHAIKMLGWGVENGTPYWLCANSWN 317
Query: 139 DHWGDHGTFKILRGENEADIEMGF 162
WG++G FKILRG NE IE G
Sbjct: 318 SSWGENGFFKILRGSNECGIESGM 341
>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
Length = 384
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 96/155 (61%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR+ WPEC SLR+I DQS+CGSCWAV+ A+SDR+CI S G +SA ++
Sbjct: 121 IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLL 180
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GC GG P AW++W +G+VTG DY + GC+PY PCEHH
Sbjct: 181 SCCKTCGFGCFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEPCK 240
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C + C + +Y +Y+ D G++A+ V
Sbjct: 241 HDLYPTPKCYKQC-DKNYTKSYKADKYYGEQAYNV 274
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 44/88 (50%), Positives = 58/88 (65%), Gaps = 3/88 (3%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I GP+ A F VY DFL Y SG+Y+H G G HAV++LGWG++ + YWL ANSW
Sbjct: 283 KEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAANSW 342
Query: 138 NDHWGD---HGTFKILRGENEADIEMGF 162
N+ WG+ G F+ILRG +E IE G
Sbjct: 343 NNDWGEDVFSGYFRILRGADECGIESGI 370
>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
Length = 333
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 66/147 (44%), Positives = 95/147 (64%), Gaps = 6/147 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR++WP C ++ I DQ +CGSCWA A+SDRLCI SNG +SA++++
Sbjct: 82 LPDEFDARKQWPNCSTIGEIRDQGSCGSCWAFGAVEAMSDRLCIHSNGKLQVHLSAENLL 141
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG P+ AW +W G+V+GG+Y S++GCQPY++APCEH + G C
Sbjct: 142 SCCDSCGDGCLGGSPESAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHSIHGSSPACG- 200
Query: 306 LGKLKTPECKQNC---YNPSYESTYRF 329
G TP+CK+ C Y+ Y+ + +
Sbjct: 201 -GVTDTPKCKKQCEKGYSIPYDKAFYY 226
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 51/109 (46%), Positives = 72/109 (66%), Gaps = 7/109 (6%)
Query: 58 SIPLS---HYFKKAHMVPRCNAMR---QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
SIP +Y + + +P +A + +I ++GP+VA F VY D YK GVYQH G+
Sbjct: 217 SIPYDKAFYYGQPGYAIPN-DAQKIQAEILKNGPIVASFLVYEDLFSYKEGVYQHVAGEF 275
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
+G H +++ GWG+EN PYWLVANSWN WG++G FKI RG++E IE+
Sbjct: 276 LGGHVIKIFGWGIENGTPYWLVANSWNTDWGNNGFFKIPRGKDECGIEI 324
>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
Length = 343
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 67/136 (49%), Positives = 84/136 (61%), Gaps = 2/136 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+NFDAR+ WP C S+ I DQS+CGSCWA A+SDRLCI SNG F +SA ++
Sbjct: 86 LPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLL 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GC GG+P +AW +W +G+VTGG GC+ Y CEHHVQG C
Sbjct: 146 SCCKDCGFGCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCP- 204
Query: 306 LGKLKTPECKQNCYNP 321
TPEC Q C P
Sbjct: 205 RELYPTPECVQQCDTP 220
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 60/85 (70%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M++I GP+ AIF++Y DFL+Y SGVY H G + HAVR+LGWG ++PYWL+A
Sbjct: 243 SIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGNVPYWLIA 302
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSWN+ WG+ G K LRG NE IE
Sbjct: 303 NSWNEDWGEEGYMKFLRGYNECGIE 327
>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
Length = 333
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/184 (39%), Positives = 106/184 (57%), Gaps = 7/184 (3%)
Query: 152 GENEADIEMGFNNRVEANSSEDDDLE-TMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
G N A+ ++ + R+ L+ G + LP +FD+R WP CP++R + DQ
Sbjct: 44 GHNFANADLHYVKRLCGTHLNGPQLQKRFGFADGMELPDSFDSRAAWPNCPTIREVRDQG 103
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NC-WGCNGGWPQLAWRFWG 268
+CGSCWA AISDR+C+ +NG ++SA+ +++C C GCNGG+P AW+FW
Sbjct: 104 SCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFECGMGCNGGYPSGAWKFWT 163
Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
G+V+GG Y+S GC+PY++ PCEHHV G C + TP+C + C Y P Y S
Sbjct: 164 ETGLVSGGLYDSHLGCRPYSIPPCEHHVNGSRPACKGE-EGDTPKCVKQCEDGYAPVYGS 222
Query: 326 TYRF 329
F
Sbjct: 223 DKHF 226
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/125 (47%), Positives = 78/125 (62%), Gaps = 2/125 (1%)
Query: 37 KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYA 94
K ++ K K+ + Y P H+ ++ VP M +IY++GP+ F VYA
Sbjct: 199 KGEEGDTPKCVKQCEDGYAPVYGSDKHFGATSYGVPSSEKEIMAEIYKNGPVEGAFLVYA 258
Query: 95 DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
DF YKSGVYQH G+ +G HA+++LGWGVEN PYWL ANSWN WGD+G FKILRG++
Sbjct: 259 DFPMYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKD 318
Query: 155 EADIE 159
IE
Sbjct: 319 HCGIE 323
>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
Length = 341
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/141 (50%), Positives = 93/141 (65%), Gaps = 5/141 (3%)
Query: 182 QNAKG--LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ 239
+N KG +P +FDAR KWP+C SL+HI DQ+NCGSCWAVS A+A+SDR+CIASNG
Sbjct: 83 KNDKGEDIPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVH 142
Query: 240 ISAQHIVACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ 297
+SA I++C N C +GCNGGWP A+ ++ G VTGGDY + GC+PY PC HH +
Sbjct: 143 VSATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGK 202
Query: 298 GPLQNCTLLGKLKTPECKQNC 318
+ TP+C + C
Sbjct: 203 DTYYG-ECPNEATTPKCVRKC 222
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 46/96 (47%), Positives = 66/96 (68%), Gaps = 2/96 (2%)
Query: 66 KKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
K A+ VP R+I ++GP+V F+VY DF YK G+Y+H G + G HA++++GWG
Sbjct: 238 KDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG 297
Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
E +PYWL+ANSW++ WG++G F+ILRG N IE
Sbjct: 298 KEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIE 333
>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 338
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 67/146 (45%), Positives = 92/146 (63%), Gaps = 5/146 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR+ WP+CP++ I DQ CGSCWA A+SDR+CI S G +ISA ++
Sbjct: 84 LPSEFDARKAWPDCPTIGEIRDQGTCGSCWAFGATEAMSDRICIHSEGKEVVRISADDLL 143
Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C +GCNGG P+ AWR+W +G+V+GG Y S GC+PY + PCEHH G +C
Sbjct: 144 SCCGLFCGFGCNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPYEIPPCEHHTSGNRPDCK 203
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFD 330
G KTP+C++ C S++ Y+ D
Sbjct: 204 --GNSKTPKCQRQCVE-SFDGKYQAD 226
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 54/92 (58%), Positives = 63/92 (68%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M +I +GP+ A F VYADFL YKSGVYQH G +G HAV++LGWG EN +PYWL A
Sbjct: 242 DIMNEILVYGPVEADFIVYADFLTYKSGVYQHVKGGFLGGHAVKILGWGEENGVPYWLCA 301
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
NSWN WGD G FKILRG N IE N +
Sbjct: 302 NSWNTDWGDGGFFKILRGYNHCKIEADINAGI 333
>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
Length = 351
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/157 (47%), Positives = 91/157 (57%), Gaps = 3/157 (1%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FD+R WP CPS+ I DQS+CGSCWAVS A ISDR+CIASN ISA I
Sbjct: 96 AVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISADDI 155
Query: 246 VACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR + G VTGG Y + GC+PY PCEHHV G
Sbjct: 156 NACCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKP 215
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
T +C+++C Y TY+ DL G+ A+ V
Sbjct: 216 CPSNMYPTDKCERSC-QAGYALTYQQDLHFGQSAYAV 251
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/95 (50%), Positives = 65/95 (68%), Gaps = 2/95 (2%)
Query: 63 HYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ + A+ V + A ++I HGP+ F+VY DF Y GVY H G S+G HAV++L
Sbjct: 243 HFGQSAYAVSKKAAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKML 302
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
GWGV+N PYWL ANSWN+ WG++G F+I+RG NE
Sbjct: 303 GWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNE 337
>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
Length = 260
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 96/156 (61%), Gaps = 2/156 (1%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+PR FDAR+ + C + + DQ NC S WAV+VA+ SDRLCIASNG FT +SAQ++
Sbjct: 26 IPREFDARQYFGSCADVIGDVKDQGNCASSWAVAVASTFSDRLCIASNGQFTDNLSAQNL 85
Query: 246 VAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
++C GC+GG AW G+VTGG+++S EGCQPY + PC H+ G L+NC+
Sbjct: 86 LSCGDEEKMGCDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPYKIRPCNHYGNGNLKNCS 145
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
L + + C++ C N +Y+ Y DL K +M
Sbjct: 146 SLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMT 181
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
++I +GP+ A VY +F+ YK G+Y+ G+ IG H V+++GWGV+ D YWL NS
Sbjct: 191 QEIMTYGPVTAFMYVYENFMGYKEGIYKSTAGELIGYHHVKLIGWGVDGDGTEYWLAMNS 250
Query: 137 WNDHWGDHG 145
WN +WG +G
Sbjct: 251 WNSNWGTNG 259
>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
Length = 319
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 96/155 (61%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR+ WPEC SLR+I DQS+CGSCWAV+ A+SDR+CI S G +SA ++
Sbjct: 77 IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLL 136
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GC GG P AW++W +G+VTG DY + GC+PY PCEHH
Sbjct: 137 SCCKTCGFGCFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEPCK 196
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C + C + +Y +Y+ D G++A+ V
Sbjct: 197 HDLYPTPKCYKQC-DKNYTKSYKADKYYGEQAYNV 230
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 41/78 (52%), Positives = 55/78 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I GP+ A F VY DFL Y SG+Y+H G G HAV++LGWG++ + YWL ANSW
Sbjct: 239 KEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAANSW 298
Query: 138 NDHWGDHGTFKILRGENE 155
N+ WG+ G F+ILRG +E
Sbjct: 299 NNDWGEDGYFRILRGADE 316
>gi|227293|prf||1701299A cathepsin B
Length = 339
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/148 (47%), Positives = 91/148 (61%), Gaps = 7/148 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE+W CP++ I DQ +CGSCWA AISDR CI +NG ++SA+ ++
Sbjct: 80 LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 139
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG Y+S GC PYT+ PCEHHV G CT
Sbjct: 140 TCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGYYDSHIGCLPYTIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
G+ T C ++C Y+PSY+ F
Sbjct: 200 --GEGDTRRCNKSCEAGYSPSYKEDKHF 225
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 51/83 (61%), Positives = 64/83 (77%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+L WGVEN +PYW ANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWAAANS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 300 WNLDWGDNGFFKILRGENHCGIE 322
>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
Length = 342
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 75/195 (38%), Positives = 106/195 (54%), Gaps = 13/195 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C GC GG+P +AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGDGCQGGFPGVAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W G+VTGG + GCQPY CEHH +G C KTP+CKQ C Y++
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACG-TKIYKTPQCKQKC-QKGYKTP 228
Query: 327 YRFDLKKGKKAHMVL 341
Y D G + + V+
Sbjct: 229 YEQDKNYGDQRYNVI 243
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 62/82 (75%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+++RG +E IE
Sbjct: 311 NEDWGENGLFRMVRGRDECSIE 332
>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 67/134 (50%), Positives = 87/134 (64%), Gaps = 5/134 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD EKWPECPSL+ I DQS CGSCWA A A +DRLCIAS G ++S Q ++
Sbjct: 69 LPESFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSDQDLL 128
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C +GCNGGWP +AW ++ GV TGG+Y S++ C Y C+HHV+G C
Sbjct: 129 TCCESCGFGCNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAYEFPKCDHHVEGKYPPC-- 186
Query: 306 LGKLK-TPECKQNC 318
G+ + TPEC + C
Sbjct: 187 -GETQPTPECVEKC 199
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 72/99 (72%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR-CNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+F +A+ VP A++ ++ +GP+ FSVY DF+ YKSG+YQH G +G HAV+++
Sbjct: 212 HFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYEDFMTYKSGIYQHVAGKYLGGHAVKLV 271
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVE+ + YW +ANSWN+ WG++G F+I+ G+NE IE
Sbjct: 272 GWGVEDGVEYWKIANSWNEDWGENGYFRIIAGKNECGIE 310
>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
Length = 398
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 95/155 (61%), Gaps = 1/155 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+RE WP+C S++ I DQS+CGSCWA A+SDR+CIAS+G +SA ++
Sbjct: 120 IPESFDSRENWPKCESIKAIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLL 179
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GCNGG P AWR+W +G+VTG ++ + GC+PY PCEHH + +
Sbjct: 180 SCCRSCGFGCNGGDPLAAWRYWVKDGIVTGSNFTANSGCKPYPFPPCEHHSKKTHFDPCP 239
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C + TY D G A+ V
Sbjct: 240 HDLYPTPKCEKRCNAEYTDKTYSEDKFYGSSAYGV 274
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 46/84 (54%), Positives = 57/84 (67%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+++ HGPL F VY DFL Y GVY H G G HAV+++GWG+E+ IPYW VANSW
Sbjct: 283 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIEDGIPYWTVANSW 342
Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
N WG+ G F+ILRG +E IE G
Sbjct: 343 NTDWGEDGFFRILRGVDECGIESG 366
>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 414
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 73/157 (46%), Positives = 93/157 (59%), Gaps = 10/157 (6%)
Query: 182 QNAKGLPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI 240
+ + LP +FDAR +P C +RHI DQS+CGSCWA V A +DRLCI SNG FT +
Sbjct: 137 EELQDLPTDFDARTAFPNCSKVIRHIRDQSDCGSCWAFGVTEAFNDRLCIKSNGTFTELL 196
Query: 241 SAQHIVACTPNCWGCNGGWPQLAWRFWGHN-GVVTGGDYNSQ------EGCQPYTLAPCE 293
SA + AC P+ +GC+GG P LAW W HN G+ TGGDY ++ +GC PY PC
Sbjct: 197 SAGEMNACAPS-FGCDGGIPSLAWS-WVHNKGIATGGDYLAEDDMTKDDGCWPYDFPPCA 254
Query: 294 HHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
HHV +TP C + C+NP Y +T R D
Sbjct: 255 HHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDD 291
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 51/125 (40%), Positives = 67/125 (53%), Gaps = 7/125 (5%)
Query: 43 KKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNA-MRQIYEHGPLV------AIFSVYAD 95
K + R +L S+P + A R + + IY P V A F VY D
Sbjct: 283 KYTTTLRDDRHFLVESVPYEYSVNDAKNAIRTDGPVGPIYFCDPSVNFDQVSASFIVYED 342
Query: 96 FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
FL Y+SGVY+H G +G HAV+++GWG E YWLV NSWN+ WGD+G FKI G E
Sbjct: 343 FLAYRSGVYKHTSGKELGGHAVKIIGWGEETGQAYWLVVNSWNEDWGDNGLFKIALGNCE 402
Query: 156 ADIEM 160
D ++
Sbjct: 403 IDDDL 407
>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
Full=Cysteine protease-related 6; Flags: Precursor
gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
Length = 379
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 96/155 (61%), Gaps = 1/155 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+ WP+C S++ I DQS+CGSCWA A+SDR+CIAS+G +SA ++
Sbjct: 105 IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 164
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GCNGG P AWR+W +G+VTG +Y + GC+PY PCEHH + +
Sbjct: 165 SCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCP 224
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C + + TY D G A+ V
Sbjct: 225 HDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGV 259
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 57/84 (67%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+++ HGPL F VY DFL Y GVY H G G HAV+++GWG+++ IPYW VANSW
Sbjct: 268 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSW 327
Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
N WG+ G F+ILRG +E IE G
Sbjct: 328 NTDWGEDGFFRILRGVDECGIESG 351
>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
Length = 378
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 96/155 (61%), Gaps = 1/155 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+ WP+C S++ I DQS+CGSCWA A+SDR+CIAS+G +SA ++
Sbjct: 104 IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 163
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GCNGG P AWR+W +G+VTG +Y + GC+PY PCEHH + +
Sbjct: 164 SCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCP 223
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C + + TY D G A+ V
Sbjct: 224 HDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGV 258
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 57/84 (67%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+++ HGPL F VY DFL Y GVY H G G HAV+++GWG+++ IPYW VANSW
Sbjct: 267 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSW 326
Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
N WG+ G F+ILRG +E IE G
Sbjct: 327 NTDWGEDGFFRILRGVDECGIESG 350
>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
Length = 344
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 97/156 (62%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR +WP CPS+ +I DQS CGSCWA A A+SDR+CIAS+G T ++SA I+
Sbjct: 94 IPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDIL 153
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ-NCT 304
+C +C GC+GG+P AW ++ GVVTGG Y +++ C+PY + PC HH NCT
Sbjct: 154 SCCYDCGDGCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPPCGHHRNETFYGNCT 213
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ TP+C C Y +Y D GK ++ +
Sbjct: 214 QIA--DTPDCVTTC-QAGYPISYDDDKTFGKDSYTI 246
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 55/82 (67%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A F VY DF Y G+Y+H G G HAVR+LGWG E YWLVANSW
Sbjct: 255 KEIMTYGPVTAAFIVYEDFFHYHRGIYKHVSGGEEGGHAVRILGWGEEKGTAYWLVANSW 314
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WG++G F+ILRG NE IE
Sbjct: 315 NTDWGENGYFRILRGSNECGIE 336
>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
Length = 369
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 96/155 (61%), Gaps = 1/155 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+ WP+C S++ I DQS+CGSCWA A+SDR+CIAS+G +SA ++
Sbjct: 95 IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 154
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GCNGG P AWR+W +G+VTG +Y + GC+PY PCEHH + +
Sbjct: 155 SCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCP 214
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C + + TY D G A+ V
Sbjct: 215 HDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGV 249
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 57/84 (67%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+++ HGPL F VY DFL Y GVY H G G HAV+++GWG+++ IPYW VANSW
Sbjct: 258 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSW 317
Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
N WG+ G F+ILRG +E IE G
Sbjct: 318 NTDWGEDGFFRILRGVDECGIESG 341
>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 277
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 86/145 (59%), Gaps = 4/145 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE W C S+ I DQS CGSC A A+SDR+CI + G ISAQ ++
Sbjct: 25 LPESFDAREAWSHCDSIHLIRDQSTCGSCRAFGATEAMSDRICIHTKGRVQVNISAQDLL 84
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C GC GG+P AW ++ G+VTGG Y + +GCQPY PCEHH +GPL NCT
Sbjct: 85 TCCHQCGMGCFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPPCEHHTKGPLPNCT- 143
Query: 306 LGKLKTPECKQNCYNPSYESTYRFD 330
TP+C Q C YE +Y D
Sbjct: 144 -DTKPTPKCLQVC-RKGYEKSYSED 166
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 46/89 (51%), Positives = 56/89 (62%), Gaps = 4/89 (4%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQ-HNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+IY++GP+ A FSVY DFL YKSGVYQ H++ H + LGW ++ WLVANSW
Sbjct: 186 EIYKNGPVEADFSVYTDFLAYKSGVYQRHSYELWEARH--QNLGWALKRR-SVWLVANSW 242
Query: 138 NDHWGDHGTFKILRGENEADIEMGFNNRV 166
N WGD G FKI RG NE IE N +
Sbjct: 243 NQDWGDKGYFKIRRGNNECGIENDINAGI 271
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
Length = 337
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 93/155 (60%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR+KW CPS+ I DQS+C SCWAVS A+AI+DR+CI SNG ++SA IV
Sbjct: 86 LPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIV 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG P ++W +W GVVTGG + GC PY C H V P
Sbjct: 146 SCCAYCGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCP 205
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C+ Y TY D KGK ++ V
Sbjct: 206 RDIYPTPKCEKKCH-AGYNKTYEQDKVKGKSSYNV 239
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/87 (51%), Positives = 62/87 (71%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I ++GP+ IF ++ DFL YKSG+Y + G +G HA+RV+GWGVEN + YWL+ANS
Sbjct: 247 MMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIANS 306
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
WN+ WG+ G F++ RG NE IE N
Sbjct: 307 WNEGWGEKGYFRMRRGNNECGIEARIN 333
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
Length = 337
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 93/155 (60%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR+KW CPS+ I DQS+C SCWAVS A+AI+DR+CI SNG ++SA IV
Sbjct: 86 LPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIV 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG P ++W +W GVVTGG + GC PY C H V P
Sbjct: 146 SCCAYCGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCP 205
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C+ Y TY D KGK ++ V
Sbjct: 206 RDIYPTPKCEKKCH-AGYNKTYEQDKVKGKSSYNV 239
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 63/89 (70%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M +I ++GP+ IF ++ DFL YKSG+Y + G +G HA+RV+GWGVEN + YWL+A
Sbjct: 245 DIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIA 304
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFN 163
NSWN+ WG+ G F++ RG NE IE N
Sbjct: 305 NSWNEGWGEKGYFRMRRGNNECGIEARIN 333
>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
Length = 341
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 97/157 (61%), Gaps = 6/157 (3%)
Query: 187 LPRNFDAREKW-PECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP FD+RE+W CPS + I DQ+ CGSCWA +++DR+CIAS G ISAQ +
Sbjct: 88 LPTAFDSREQWGSTCPSTKEIRDQAACGSCWAFGAVESMTDRICIASKGSLRPHISAQDL 147
Query: 246 VACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+ C C GC+GG+P AW ++ G+VTGG+YNS +GCQPY+L C+HHV G C
Sbjct: 148 MTCCLFTCGSGCSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPYSLPNCDHHVSGQYPAC 207
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ G+ TP CK++C Y +TY D G A+ V
Sbjct: 208 S--GEGPTPACKKSC-EAGYNNTYSNDKHFGATAYSV 241
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 44/81 (54%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ F+VY D L YKSGVYQH G +G HA++++GWGVE+ + YW VANSWN
Sbjct: 251 EIMTNGPVEGAFTVYEDLLTYKSGVYQHTTGQVLGGHAIKIIGWGVESGVDYWWVANSWN 310
Query: 139 DHWGDHGTFKILRGENEADIE 159
+ WGD+G FKI +G +E IE
Sbjct: 311 NDWGDNGFFKIKKGVDECGIE 331
>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 93/155 (60%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR KW C S+ I DQS CGSCWA A+SDR+CI S G ISA+ ++
Sbjct: 85 LPESFDARAKWSHCDSIHLIRDQSTCGSCWAFGATEAMSDRICIHSKGKMQVNISAEDLL 144
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C GC GG+P AW W G+V+GG Y + +GC+PY+LAPCE+H + + NC
Sbjct: 145 DCCDTCGHGCKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEYHTKCRIPNCIP 204
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ + TPEC +C Y+ Y+ D G+K + +
Sbjct: 205 I--VHTPECVHHC-RKGYDKDYQEDKHFGQKVYSI 236
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 67/99 (67%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ +K + + R Q I+ +GP+ A F VY DFL YKSGVYQ + D G+HA+R+L
Sbjct: 228 HFGQKVYSISRDEKQIQTEIFTNGPVEADFHVYGDFLCYKSGVYQRHSNDGRGMHAIRIL 287
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG EN PYWL ANSWN++WGD G FKILR NE IE
Sbjct: 288 GWGTENGTPYWLAANSWNENWGDKGYFKILRRTNECGIE 326
>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
Length = 342
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 94/258 (36%), Positives = 128/258 (49%), Gaps = 42/258 (16%)
Query: 92 VYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVANSWNDHWGDHGTFKIL 150
V+ DFLQ+ +G+Y H G+ G +V LGWG+ E IP N W G K+
Sbjct: 3 VFDDFLQHTTGIYVHLAGNKQGHLSVGTLGWGMFEELIPK-------NSFW-TAGIPKVS 54
Query: 151 RGENEADIE-----MGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRH 205
R + + +GFN+ S E+ DL FDARE+WPEC S+
Sbjct: 55 RSFMLSTLVKDPEIIGFNDLGPTFSPENSDLSPF-----------FDARERWPECSSIPL 103
Query: 206 IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW----GCNGGWPQ 261
I D S C S WA + A ++SDRLCI S G +SAQ +++C GC GG P
Sbjct: 104 INDISECKSSWAFAAAESMSDRLCINSGGMIDTILSAQELLSCCTGVLSCGEGCAGGNPL 163
Query: 262 LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQNCTLLGKLKTPECKQN 317
AW++W +G+ TGG Y SQ GC+PY++APC + P N T L TP C++
Sbjct: 164 KAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTNTT----LPTPTCEKK 219
Query: 318 CYNPSYESTYRFDLKKGK 335
C + Y DL K +
Sbjct: 220 C-----KPGYPVDLDKDR 232
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 61/99 (61%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY +P + + +GP+ A +Y DFLQY +G+Y H G+ G +VR+L
Sbjct: 233 HYGVSVDQLPNRQIEIQSDVMLNGPVEATMEIYDDFLQYTTGIYVHLAGNKQGHLSVRIL 292
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG+ +PYWL+ANSW WG++GTF++LRG NE +E
Sbjct: 293 GWGMFEGVPYWLLANSWGKEWGENGTFRVLRGVNECGLE 331
>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
Length = 279
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 93/155 (60%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R+KWP C S+ I DQS CGSCWA A++DR+CI S G + ++SA ++
Sbjct: 27 IPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLI 86
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG+P +AW +W G+VTGG + GCQPY CEHH +G C
Sbjct: 87 SCCEDCGQGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGT 146
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
KTP+CKQ C Y++ Y D G++++ V
Sbjct: 147 K-IYKTPQCKQTC-QKGYKTPYEQDKHYGEESYNV 179
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 60/82 (73%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 188 RDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIANSW 247
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG +E IE
Sbjct: 248 NEDWGEKGLFRIVRGRDECSIE 269
>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 68/134 (50%), Positives = 89/134 (66%), Gaps = 3/134 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR KWP+C SL+HI DQ+NCGSCWAVS A+A+SDR+CIASNG +SA I+
Sbjct: 2 IPESFDARTKWPKCSSLKHIHDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61
Query: 247 ACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C N C +GCNGGWP A+ ++ G VTGGDY + GC+PY PC HH +
Sbjct: 62 SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYG-E 120
Query: 305 LLGKLKTPECKQNC 318
+ TP+C + C
Sbjct: 121 CPNEATTPKCVRKC 134
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 47/96 (48%), Positives = 67/96 (69%), Gaps = 2/96 (2%)
Query: 66 KKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
K A+ VP R+I ++GP+V F+VY DF YK G+Y+H G + G HA++++GWG
Sbjct: 150 KDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG 209
Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
EN +PYWL+ANSW++ WG++G F+ILRG N IE
Sbjct: 210 KENGVPYWLIANSWHNDWGENGYFRILRGSNHCGIE 245
>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 68/134 (50%), Positives = 89/134 (66%), Gaps = 3/134 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR KWP+C SL+HI DQ+NCGSCWAVS A+A+SDR+CIASNG +SA I+
Sbjct: 2 IPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61
Query: 247 ACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C N C +GCNGGWP A+ ++ G VTGGDY + GC+PY PC HH +
Sbjct: 62 SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYG-E 120
Query: 305 LLGKLKTPECKQNC 318
+ TP+C + C
Sbjct: 121 CPNEATTPKCVRKC 134
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 46/96 (47%), Positives = 66/96 (68%), Gaps = 2/96 (2%)
Query: 66 KKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
K A+ VP R+I ++GP+V F+VY DF YK G+Y+H G + G HA++++GWG
Sbjct: 150 KDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG 209
Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
E +PYWL+ANSW++ WG++G F+ILRG N IE
Sbjct: 210 KEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIE 245
>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
Length = 332
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 96/155 (61%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR+ WP C S+ I DQ +CGSCWA A+SDR+CI SNG +SA++++
Sbjct: 81 VPDEFDARKHWPNCSSITEIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLL 140
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GC GG + AW +W G+V+GG+Y S++GCQPY++APCEH + G C
Sbjct: 141 SCCDSCGYGCLGGSAENAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHSIPGSRPACE- 199
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+CK+ C Y Y DL G+ + +
Sbjct: 200 -GVRDTPKCKKQC-EKGYGIPYGDDLCYGQPGYTI 232
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 61/81 (75%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I ++GP+VA VY D YK+GVYQH G+ +G H +++LGWGVEND PYWLVANSWN
Sbjct: 242 EILKNGPIVASILVYEDLFSYKAGVYQHVAGEVLGGHVIKILGWGVENDTPYWLVANSWN 301
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG++G FKILRG +E IE
Sbjct: 302 TDWGNNGFFKILRGSDECGIE 322
>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
Length = 206
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 64/144 (44%), Positives = 95/144 (65%), Gaps = 7/144 (4%)
Query: 193 AREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-N 251
+RE+WP+CP+++ I DQ +CGSCWA A+SDR+CI S G ++SA+ +++C
Sbjct: 1 SREQWPDCPTIKEIRDQGSCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKLE 60
Query: 252 CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLK 310
C GCNGG+P AW FW ++G+V+GG Y S GC+PY+++PCEHHV G C+ G+++
Sbjct: 61 CGNGCNGGYPSGAWEFWTNDGLVSGGLYYSHIGCRPYSISPCEHHVNGSRPKCS--GEIE 118
Query: 311 TPECKQNC---YNPSYESTYRFDL 331
TP C + C Y+P Y + L
Sbjct: 119 TPRCSRRCEAGYSPKYSEDKHYGL 142
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 30/50 (60%), Positives = 38/50 (76%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN 126
M +IY++GP+ A V+ DFL YKSGVYQH G SIG HA+++LGWG EN
Sbjct: 155 MTEIYKNGPVEAALEVFKDFLLYKSGVYQHKTGGSIGGHAIKILGWGEEN 204
>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/147 (46%), Positives = 89/147 (60%), Gaps = 6/147 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD EKWPECPSL+ I DQS CGSCWA A A +DRLCIAS G ++S Q ++
Sbjct: 69 LPESFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSEQDLL 128
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C +GC+GGW +AWR++ GV TGG+Y S++ C Y+ CEHH +G C
Sbjct: 129 TCCDSCGFGCDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAYSFPKCEHHAEGKYPPCGE 188
Query: 306 LGKLKTPECKQNC---YNPSYESTYRF 329
+TPEC + C Y YE F
Sbjct: 189 --SQETPECVKQCQEGYPVEYEKDKHF 213
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 72/101 (71%), Gaps = 2/101 (1%)
Query: 63 HYFKKAHMVPR-CNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+F +A+ V +A++ ++ +GPL F VY DFL YKSG+YQH G +G HAV+++
Sbjct: 212 HFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYEDFLTYKSGIYQHVAGKYLGGHAVKLV 271
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMG 161
GWGVE+ I YW +ANSWN+ WG++G F+I+ G+ E IE+G
Sbjct: 272 GWGVEDGIEYWKIANSWNEDWGENGYFRIVAGKGECGIEVG 312
>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 96/156 (61%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPE-CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP FDAR++W CPSL + DQ CGSCWA A A++DR+CIA+ G +IS + +
Sbjct: 78 LPIEFDARKEWGSICPSLLEVRDQGECGSCWAFGAAEAMTDRICIATKGKNQVRISTEDL 137
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+ C +C +GCNGG+PQ AW F+ G+VTGG YNS +GCQPY + C+HHV C
Sbjct: 138 LTCCDSCGFGCNGGYPQSAWEFFKTKGIVTGGPYNSHKGCQPYAIPACDHHVPHSKNPCN 197
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G L TP+C++ C Y TY+ D G ++ +
Sbjct: 198 --GSLPTPKCEKVC-EKGYNITYKNDKHYGVTSYSI 230
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 66/83 (79%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
MR+I +GP+ A F+V+ADF YKSGVYQH G+ +G HA+++LGWGVEN+ PYWLVANS
Sbjct: 238 MREIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGGHAIKILGWGVENNTPYWLVANS 297
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG +E IE
Sbjct: 298 WNPSWGDNGFFKILRGSDECGIE 320
>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
Length = 328
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 67/158 (42%), Positives = 94/158 (59%), Gaps = 4/158 (2%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
K +P FDARE+WP CP + I DQ NCGSCWAVS A+ ++DR CI + G + S++
Sbjct: 73 TKEIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSE 132
Query: 244 HIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
++ AC C C GG A+ W G V+GG +NS EGCQPY++ CEHH++GP
Sbjct: 133 NVAACCTECGNACYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVEECEHHIEGPRPP 192
Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C G + C + C+ Y TY DL+ G +A+++
Sbjct: 193 CE--GDMPELVCSETCHE-EYGKTYEEDLEYGLEAYVL 227
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 54/98 (55%), Positives = 67/98 (68%), Gaps = 2/98 (2%)
Query: 64 YFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y +A+++P+ +I +GP+ A F+VY DFL YKSGVYQH G G HAVRV+G
Sbjct: 220 YGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLSYKSGVYQHETGLLDGYHAVRVIG 279
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E PYWLVANSWN WGD+G FKILRG +E + E
Sbjct: 280 WGEEEGTPYWLVANSWNTDWGDNGLFKILRGSDECEFE 317
>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
Length = 255
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/170 (41%), Positives = 100/170 (58%), Gaps = 9/170 (5%)
Query: 173 DDDLETMGCQNAKGLPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIA 231
D++ ET+ +PR FDAR+ + C + + DQ NC S WAV+VA+ +DRLCIA
Sbjct: 19 DNNYETV-------IPRTFDARQYFVSCSDVIGDVKDQGNCASSWAVAVASTFTDRLCIA 71
Query: 232 SNGYFTGQISAQHIVAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
SNG FT +SAQ++++C GC+GG AW G+VTGG+Y+S EGCQPY
Sbjct: 72 SNGQFTDNLSAQNLMSCGNEEKMGCDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKNR 131
Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
PC+H+ L NC+ L + + C++ C N +Y+ Y DL K +M
Sbjct: 132 PCDHYGDSSLTNCSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMT 181
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 44/65 (67%), Gaps = 1/65 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
++I +GP+ A+ VY +F+ YK G+Y+ G+ IG H V+++GWGV+ D YWL NS
Sbjct: 191 QEIMTYGPVTALMYVYENFMGYKKGIYKSTAGELIGYHHVKLIGWGVDEDGTEYWLAMNS 250
Query: 137 WNDHW 141
WN +W
Sbjct: 251 WNSNW 255
>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
Length = 356
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/150 (45%), Positives = 92/150 (61%), Gaps = 7/150 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R++WP CP++ I DQSNCGSCWA AISDR+CIA++G IS+ ++
Sbjct: 102 IPVEFDSRKQWPYCPTIGEIRDQSNCGSCWAFGAVEAISDRICIATDGRQKPHISSTDLL 161
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GC GG P AW FW G+VTGG+Y + +GC+PY APC HH G C+
Sbjct: 162 SCCKICGFGCQGGDPHQAWSFWVKYGLVTGGNYTTHDGCRPYPFAPCNHHSNGTYGPCSH 221
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
+ TP CK+ C +STY+ K K
Sbjct: 222 DLE-PTPVCKKAC-----QSTYKIQYNKDK 245
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 49/82 (59%), Positives = 60/82 (73%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+++ +GP+ F VY DFL YK+GVYQH+ G +G HAVR+LGWG EN +PYWL+ANSW
Sbjct: 263 KELMMNGPMEVAFEVYEDFLLYKTGVYQHHTGSVLGGHAVRLLGWGEENGVPYWLLANSW 322
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WGD G FKI RG NE IE
Sbjct: 323 NTEWGDKGFFKIYRGRNECGIE 344
>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
Length = 315
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/154 (48%), Positives = 106/154 (68%), Gaps = 2/154 (1%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
GLP NFD+R+KWP CPS+ HI +Q NC S +AV+ A+A SDR+CI SNG +SAQ
Sbjct: 58 TSGLPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSAQ 117
Query: 244 HIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE-HHVQGPLQ 301
I++C C GC+GG +W ++ +G V+GGDYNS +GCQPYT+ PC+ + + P
Sbjct: 118 QIISCCYLCGHGCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGH 177
Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
+CT + +TP C++ CYNP+Y +++R D+ KGK
Sbjct: 178 SCTTYHREETPICEKKCYNPNYYTSFRTDIYKGK 211
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 69/112 (61%), Gaps = 8/112 (7%)
Query: 50 KKRLYLP---TSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH 106
+K+ Y P TS Y K + + AM+ I+++GP+ F +Y D + YKSGVYQ+
Sbjct: 191 EKKCYNPNYYTSFRTDIYKGKYYKLSPYMAMKDIFDNGPITTQFYMYRDLVDYKSGVYQY 250
Query: 107 N----FGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
+ F D +H+V++ GWG EN +PYWLVANS+ WG +GTFKI RG +
Sbjct: 251 DEQSDF-DFFTVHSVKIFGWGEENGVPYWLVANSFGTDWGYNGTFKISRGND 301
>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
Length = 364
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/134 (48%), Positives = 86/134 (64%), Gaps = 4/134 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R++WP CPS+ +I DQ +CGSCWA A+SDR CI SNG +ISA+ ++
Sbjct: 112 IPNQFDSRKQWPHCPSISYIRDQGSCGSCWAFGAVEAMSDRYCIRSNGKIQVEISAEDLL 171
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C C GCNGG+P AW++W +G+VTGG Y S+ GC PY + PCEHHV G C+
Sbjct: 172 SCCGFECGDGCNGGFPGSAWKYWNSDGLVTGGLYGSKTGCLPYQIKPCEHHVPGDRPKCS 231
Query: 305 LLGKLKTPECKQNC 318
G TP C C
Sbjct: 232 EGG--GTPSCVSKC 243
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 50/81 (61%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I HGP+ F+VYADF YKSGVY+H G +G HA+R+LGWG EN + YWLVANSWN
Sbjct: 274 EIMTHGPVEGAFTVYADFPTYKSGVYKHVTGGVLGGHAIRILGWGSENGVAYWLVANSWN 333
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD G FKILRG +E IE
Sbjct: 334 TDWGDKGYFKILRGSDECGIE 354
>gi|149030259|gb|EDL85315.1| rCG52258, isoform CRA_b [Rattus norvegicus]
Length = 210
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/142 (47%), Positives = 89/142 (62%), Gaps = 4/142 (2%)
Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
E +G LP +FDARE+W CP++ I DQ +CGSCWA A+SDR+CI +NG
Sbjct: 70 ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129
Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
++SA+ ++ C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEH
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 295 HVQGPLQNCTLLGKLKTPECKQ 316
HV G CT G+ TP+C +
Sbjct: 190 HVNGSRPPCT--GEGDTPKCNK 209
>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
Length = 325
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 5/163 (3%)
Query: 179 MGCQNAKG--LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
+G +N +G +P +FDAR WP C SL HI DQ+NCGSCWAVS A A+SDR+CI++NG
Sbjct: 84 VGDENDEGDDIPESFDARTHWPNCSSLTHIRDQANCGSCWAVSTAAALSDRICISTNGTK 143
Query: 237 TGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH 295
ISA I+ C C +GC GGWP AW + G VTGG ++ C+ + PC HH
Sbjct: 144 QVNISATDILTCCYKCGYGCQGGWPIEAWEYVAREGAVTGGRLLAKSCCRSHPFPPCGHH 203
Query: 296 VQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
G+ +TP+C+ +C P Y+++Y D +GK A+
Sbjct: 204 GNETYYG-ECGGRARTPKCRTSC-TPGYKNSYSDDKIRGKDAY 244
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 36/64 (56%), Positives = 50/64 (78%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I ++GP+VA F+VYADF YK G+Y+H G + G HAV+V+GWG E D+PYW+V NSW
Sbjct: 255 REIMKNGPVVAAFTVYADFSYYKKGIYKHTAGRARGSHAVKVIGWGEEGDVPYWIVKNSW 314
Query: 138 NDHW 141
++ W
Sbjct: 315 HNDW 318
>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 64/140 (45%), Positives = 88/140 (62%), Gaps = 2/140 (1%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
++K +P++FDAR WP CPS+ I DQS+CGSCWA A+SDRLCI S+G F +SA
Sbjct: 82 DSKLIPKSFDARATWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSSGAFNKSLSA 141
Query: 243 QHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
+++C +C GC+GG+P +AW FW +G+VTGG GC+PY C+HH QG
Sbjct: 142 VDLLSCCKDCGDGCDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQHHSQGHYP 201
Query: 302 NCTLLGKLKTPECKQNCYNP 321
C TP+C ++C P
Sbjct: 202 PCPRR-IYPTPKCVKHCDTP 220
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/83 (55%), Positives = 62/83 (74%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M++I +GP+ A F V+ DF +YKSG+Y H +G S+G HA+R+LGWG EN +PYWL+ANS
Sbjct: 245 MKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEENGVPYWLIANS 304
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN+ WG+ G + LRG NE IE
Sbjct: 305 WNEDWGEKGYLRFLRGHNECGIE 327
>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 347
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 59/111 (53%), Positives = 78/111 (70%), Gaps = 1/111 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P++FD+R WPECPSL I DQS+CGSCWAV A++DR+CIAS G ISA ++
Sbjct: 95 IPKSFDSRTNWPECPSLYSIRDQSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLL 154
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
+C C +GC+GG P AW +W NG+VTG +Y S+ GC+PY PCEHH+
Sbjct: 155 SCCDECGFGCDGGDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHI 205
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 63/99 (63%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY + V + + ++I +GP+ F VY DF Y SG+Y+H GD +G HAV++L
Sbjct: 240 HYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKML 299
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG EN YW+ ANSWN WG++G F+ILRG +E IE
Sbjct: 300 GWGTENGTDYWICANSWNSDWGENGFFRILRGVDECQIE 338
>gi|161343853|tpg|DAA06107.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 217
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/154 (48%), Positives = 106/154 (68%), Gaps = 2/154 (1%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
GLP NFD+R+KWP CPS+ HI +Q NC S +AV+ A+A SDR+CI SNG +SAQ
Sbjct: 58 TSGLPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSAQ 117
Query: 244 HIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE-HHVQGPLQ 301
I++C C GC+GG +W ++ +G V+GGDYNS +GCQPYT+ PC+ + + P
Sbjct: 118 QIISCCYLCGHGCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGH 177
Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
+CT + +TP C++ CYNP+Y +++R D+ KGK
Sbjct: 178 SCTTYHREETPICEKKCYNPNYYTSFRTDIYKGK 211
>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
Length = 323
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 96/156 (61%), Gaps = 2/156 (1%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+PR FDAR+ + C + + + DQ NC S WAV+VA+ +DRLCIASNG FT +SAQ++
Sbjct: 64 IPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNL 123
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
++C GC+GG AW + G+VTGG+++S EGCQPY PC+H+ L NC+
Sbjct: 124 MSCGDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNCS 183
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
L + + C++ C N +Y+ Y DL K +M
Sbjct: 184 SLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMT 219
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 56/84 (66%), Gaps = 1/84 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
++I +GP+ A VY +F+ YK G+Y+ G+ IG H V+++GWGV+ D YWL NS
Sbjct: 229 QEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNS 288
Query: 137 WNDHWGDHGTFKILRGENEADIEM 160
WN +WG+ G FKILRG N IE+
Sbjct: 289 WNSNWGNDGLFKILRGYNFCSIEL 312
>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 323
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 96/156 (61%), Gaps = 2/156 (1%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+PR FDAR+ + C + + + DQ NC S WAV+VA+ +DRLCIASNG FT +SAQ++
Sbjct: 64 IPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNL 123
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
++C GC+GG AW + G+VTGG+++S EGCQPY PC+H+ L NC+
Sbjct: 124 MSCGDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNCS 183
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
L + + C++ C N +Y+ Y DL K +M
Sbjct: 184 SLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMT 219
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 56/84 (66%), Gaps = 1/84 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
++I HGP+ A VY +F+ YK G+Y+ G+ IG H V+++GWGV+ D YWL NS
Sbjct: 229 QEIMTHGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNS 288
Query: 137 WNDHWGDHGTFKILRGENEADIEM 160
WN +WG+ G FKILRG N IE+
Sbjct: 289 WNSNWGNDGLFKILRGYNFCSIEL 312
>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
Length = 344
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 71/164 (43%), Positives = 99/164 (60%), Gaps = 11/164 (6%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
+ +P +FDAR WP C SL HI DQ++CGSCWAVS A+A+SDR+CIAS G +S
Sbjct: 86 DDGDDIPESFDARTHWPNCSSLTHIRDQADCGSCWAVSTASALSDRICIASKGAKQVYVS 145
Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
A I++C +C GC+GG+ A++F+ G VTGGDY +++ C+PY PC HH
Sbjct: 146 ATDILSCCHSCGDGCDGGYVIDAFKFFAEQGAVTGGDYGAKDCCRPYPFHPCGHH----- 200
Query: 301 QNCTLLGKL----KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
N T G+ TPEC + C YE+ Y D +G+ A+ +
Sbjct: 201 GNETYYGECPEDGSTPECVRKC-QEGYETEYHEDRVRGEDAYRL 243
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+VA F V+ DF Y+ G+Y H G G HAV+++GWG E+ +PYW++ANSW
Sbjct: 253 KEIMRNGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGGHAVKIIGWGTEHGVPYWIIANSW 312
Query: 138 NDHWGDHGTFKILRGENEADIE 159
+ WG+ G F+++RG N+ IE
Sbjct: 313 HSDWGEDGYFRMVRGINDCGIE 334
>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
Length = 333
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 90/155 (58%), Gaps = 5/155 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD+RE+WP CP++R I DQ +CGSCWA A+SDR+CI S G ++SA+ ++
Sbjct: 83 LPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKVLFRVSAEDLL 142
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C NC GC+GG P W+ W G+V+GG + S +GC+PYT+ PC H G C
Sbjct: 143 TCCTNCGHGCDGGAPGAGWKHWIEKGLVSGGPFGSDQGCRPYTIEPCVHVENGAQSPCK- 201
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C + C P Y Y D GK + +
Sbjct: 202 --DSITPKCIKKCL-PGYNVPYAKDKSFGKSTYSI 233
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 49/82 (59%), Positives = 60/82 (73%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I+ +GP+ A F+V+ DF YK G+YQH G+ G HAVR+LGWGVEN YWL ANSW
Sbjct: 242 KEIFTNGPVEATFTVFDDFASYKHGIYQHTSGNLAGEHAVRILGWGVENGTKYWLAANSW 301
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WGD+G FKILRG N DIE
Sbjct: 302 NSDWGDNGYFKILRGSNHVDIE 323
>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
Length = 345
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 71/171 (41%), Positives = 100/171 (58%), Gaps = 5/171 (2%)
Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
D + + + +P +FDAR++WP C S+ +I DQS+CGSCWA + A AISDR CIASNG
Sbjct: 71 DEDIVATEVFDAIPDHFDARDQWPSCVSINNIRDQSDCGSCWAFAAAEAISDRTCIASNG 130
Query: 235 YFTGQISAQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
+S+Q +++C GC GG+P AW++W +G+VTGG Y SQ GC+PY++A
Sbjct: 131 AVNTLLSSQDLLSCCTGLLSCGNGCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIA 190
Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
PC V G TP+C + C N +Y + Y D G A+ V
Sbjct: 191 PCGQTVNGVTWPKCPDDTEPTPKCVEACTSNNTYPTPYLQDKHFGATAYAV 241
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 47/81 (58%), Positives = 61/81 (75%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I ++GP+ F+VY DF QY +GVY H G S+G HAV++LGWGV+N PYWLVANSWN
Sbjct: 251 EILKNGPVEVAFTVYEDFYQYTTGVYVHTSGASLGGHAVKILGWGVDNGTPYWLVANSWN 310
Query: 139 DHWGDHGTFKILRGENEADIE 159
+WG+ G F+I+RG NE IE
Sbjct: 311 VNWGEKGYFRIIRGLNECGIE 331
>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
Length = 341
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 72/143 (50%), Positives = 88/143 (61%), Gaps = 5/143 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P ++D R +W C SL HI DQ+NCGSCWAVS A A+SDR+CIAS G ISAQ +V
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GGWP A+RF GVVTGGDYN++ C+PY + PC HH
Sbjct: 151 SCCTWCGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYG-EC 209
Query: 306 LGKLKTPECKQNC---YNPSYES 325
+G TP CK+ C Y SY S
Sbjct: 210 VGMADTPRCKRRCLLGYPKSYPS 232
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 55/127 (43%), Positives = 80/127 (62%), Gaps = 6/127 (4%)
Query: 48 KKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
K++ L P S P Y+KKA+ + + I ++GP+VA ++VY DF Y+SG+Y+
Sbjct: 219 KRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYK 278
Query: 106 HNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNR 165
H G GLHAV+V+GWG E PYW+VANSW+D WG++G F++ RG N+ GF R
Sbjct: 279 HKAGRKTGLHAVKVIGWGEEKGTPYWIVANSWHDDWGENGFFRMHRGSNDC----GFEER 334
Query: 166 VEANSSE 172
+ A S +
Sbjct: 335 MAAGSVQ 341
>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 93/156 (59%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS AISDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSRCGSSWAVSAVGAISDRICIQSGGKQSVELSAIDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C NC GC+GG+P AW +W +G+VTGG + GCQPY CEHH G +C
Sbjct: 150 SCCENCGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHSIGKYPSCG- 208
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
KTP+CK+ C Y + Y D G A V+
Sbjct: 209 DKMYKTPQCKRKC-QKGYTTPYEHDKHYGGIAINVI 243
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A ++ DFL YKSG+Y++ G +G H VR++GWG+EN YWL AN+W
Sbjct: 251 KEIMMYGPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHYVRIIGWGIENGTAYWLAANTW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG NE IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECSIE 332
>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 323
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 95/156 (60%), Gaps = 2/156 (1%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P+ FDAR+ + C + + + DQ NC S WAV+VA+ +DRLCIASNG FT +SAQ++
Sbjct: 64 IPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGKFTDNLSAQNL 123
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
++C + GC+GG AW F G+VTGG Y+S EGCQPY PC+H+ L NC+
Sbjct: 124 MSCGDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYGDSSLTNCS 183
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
L + + C+ C N +Y+ Y DL K +M
Sbjct: 184 SLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMT 219
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 56/84 (66%), Gaps = 1/84 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVANS 136
++I +GP+ A VY +F+ YK GVY+ G+ IG H V+++GWGV E I YWL NS
Sbjct: 229 QEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYWLAMNS 288
Query: 137 WNDHWGDHGTFKILRGENEADIEM 160
WN +WG+ G FKILRG N IE+
Sbjct: 289 WNSNWGNDGLFKILRGYNFCSIEL 312
>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
Length = 260
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 96/156 (61%), Gaps = 2/156 (1%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+PR FDAR+ + C + + + DQ NC S WAV+VA+ +DRLCIASNG FT +SAQ++
Sbjct: 26 IPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNL 85
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
++C GC+GG AW + G+VTGG+++S EGCQPY PC+H+ L NC+
Sbjct: 86 MSCGDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNCS 145
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
L + + C++ C N +Y+ Y DL K +M
Sbjct: 146 SLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMT 181
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
++I +GP+ A VY +F+ YK G+Y+ G+ IG H V+++GWGV+ D YWL NS
Sbjct: 191 QEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNS 250
Query: 137 WNDHWGDHG 145
WN +WG+ G
Sbjct: 251 WNSNWGNDG 259
>gi|161343873|tpg|DAA06117.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 254
Score = 137 bits (345), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 71/154 (46%), Positives = 105/154 (68%), Gaps = 2/154 (1%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
GLP NFD+R+KWP CPS+ HI +Q NC S +AV+ A+A SDR+CI SN +SAQ
Sbjct: 60 TNGLPTNFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIHSNSTKNPIMSAQ 119
Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH-HVQGPLQ 301
I++C C +GC+GG +W F+ +G V+GG+YNS +GCQPYT+ PC+ + + P
Sbjct: 120 QIISCCYLCGYGCDGGSLFESWDFYRRHGFVSGGEYNSNQGCQPYTIPPCKLINEKPPGH 179
Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
+CT + +TP C++ C NP+Y +++R D+ +GK
Sbjct: 180 SCTTFNREETPTCEKKCNNPNYYTSFRADIYRGK 213
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 31/51 (60%)
Query: 57 TSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN 107
TS Y K + V AM++I+++GP+ F +Y D + YKSGVYQ++
Sbjct: 203 TSFRADIYRGKYYKVSPYMAMKEIFDNGPITTQFYMYRDLVDYKSGVYQYD 253
>gi|148704124|gb|EDL36071.1| cathepsin B, isoform CRA_b [Mus musculus]
Length = 237
Score = 137 bits (345), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 67/132 (50%), Positives = 83/132 (62%), Gaps = 4/132 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE+W CP++ I DQ +CGSCWA AISDR CI +NG ++SA+ ++
Sbjct: 74 LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 133
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG+P AW FW G+V+GG YNS GC PYT+ PCEHHV G CT
Sbjct: 134 TCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCT 193
Query: 305 LLGKLKTPECKQ 316
G+ TP C +
Sbjct: 194 --GEGDTPRCNK 203
>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 66/145 (45%), Positives = 91/145 (62%), Gaps = 6/145 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDARE WP C SL+ I DQS+CGSCWA A+SDR+CI S+ +SA+ +
Sbjct: 84 VPESFDARENWPRCDSLKQIRDQSSCGSCWAFGAVEAMSDRICIHSDQSNQVYVSAEDLN 143
Query: 247 ACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ-GPLQ 301
+C + GC+GG+ W +W +G+VTGG YNS +GC+ Y+L PCEHHV+ G
Sbjct: 144 SCCFGLFACGLGCDGGYVAEPWDYWRTDGIVTGGAYNSSQGCKDYSLEPCEHHVEVGSRP 203
Query: 302 NCTLLGKLKTPECKQNCYNPSYEST 326
C+ L TPEC ++CY S + T
Sbjct: 204 QCSSL-NFDTPECVRSCYESSLDYT 227
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/82 (58%), Positives = 59/82 (71%), Gaps = 1/82 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVENDIPYWLVANSW 137
+I ++GP+ A F+VY DFL YKSGVYQ D S+G HA++VLGWGVE YWL+ANSW
Sbjct: 248 EILKNGPIEAAFTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGVEEGTKYWLIANSW 307
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WGD+G FK LRG + IE
Sbjct: 308 NTDWGDNGYFKFLRGVDHCGIE 329
>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
Length = 342
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 74/195 (37%), Positives = 104/195 (53%), Gaps = 13/195 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C GC GG+P AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGDGCKGGFPGQAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W G+VTGG + GCQPY CEH +G C KTP+CKQ C Y++
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228
Query: 327 YRFDLKKGKKAHMVL 341
Y D G + + V+
Sbjct: 229 YEQDKHYGDQRYNVI 243
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 61/82 (74%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+++RG +E IE
Sbjct: 311 NEDWGEKGLFRMVRGRDECSIE 332
>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 78/196 (39%), Positives = 109/196 (55%), Gaps = 15/196 (7%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM + R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILLGGGKEDAEMKWKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
DQS CGS WAVS A+SDR+CI S G + ++SA +++C NC GC+GG+P AW +
Sbjct: 111 DQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKL-KTPECKQNCYNPSYES 325
W +G+VTGG + GCQPY CEHH G +C K+ KTP+CK+ C Y +
Sbjct: 171 WVSHGIVTGGSKENHTGCQPYPFPKCEHHSIGKYPSCG--DKIYKTPQCKRKC-QKGYTT 227
Query: 326 TYRFDLKKGKKAHMVL 341
Y D G + V+
Sbjct: 228 PYEHDKHYGGISINVI 243
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A ++ DFL YKSG+Y++ G +G H VR++GWG+EN YWL AN+W
Sbjct: 251 KEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAANTW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG NE IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECSIE 332
>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
Length = 261
Score = 137 bits (344), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 95/156 (60%), Gaps = 2/156 (1%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P+ FDAR+ + C + + + DQ NC S WAV+VA+ +DRLCIASNG FT +SAQ++
Sbjct: 28 IPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGKFTDNLSAQNL 87
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
++C + GC+GG AW F G+VTGG Y+S EGCQPY PC+H+ L NC+
Sbjct: 88 MSCGDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYGDSSLTNCS 147
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
L + + C+ C N +Y+ Y DL K +M
Sbjct: 148 SLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMT 183
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVANS 136
++I +GP+ A VY +F+ YK GVY+ G+ IG H V+++GWGV E I YWL NS
Sbjct: 193 QEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYWLAMNS 252
Query: 137 WNDHWGDHG 145
WN +WG +G
Sbjct: 253 WNSNWGTNG 261
>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/196 (39%), Positives = 109/196 (55%), Gaps = 15/196 (7%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM + R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILLGGGKEDAEMKWKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
DQS CGS WAVS A+SDR+CI S G + ++SA +++C NC GC+GG+P AW +
Sbjct: 111 DQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKL-KTPECKQNCYNPSYES 325
W +G+VTGG + GCQPY CEHH G +C K+ KTP+CK+ C Y +
Sbjct: 171 WVSHGIVTGGSKENHTGCQPYPFPKCEHHSIGKYPSCG--DKIYKTPQCKRKC-QKGYTT 227
Query: 326 TYRFDLKKGKKAHMVL 341
Y D G + V+
Sbjct: 228 PYEHDKHYGGISINVI 243
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 40/81 (49%), Positives = 57/81 (70%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ A ++ DFL YKSG+Y++ G +G H VR++GWG+EN YWL AN+WN
Sbjct: 252 EIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAANTWN 311
Query: 139 DHWGDHGTFKILRGENEADIE 159
+ WG+ G F+I+RG NE IE
Sbjct: 312 EDWGEKGYFRIVRGRNECSIE 332
>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
Length = 247
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/150 (44%), Positives = 93/150 (62%), Gaps = 4/150 (2%)
Query: 192 DAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN 251
D+RE+WP+CPS+ I DQ +CGSCWA A+SDR CI SNG ++S + +++C +
Sbjct: 1 DSREQWPDCPSISEIRDQGSCGSCWAFGAVEAMSDRHCIHSNGKVKIEVSPEDLLSCCSS 60
Query: 252 C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLK 310
C GC+GG+P AW FW G+ TGG +NS GCQPY + CEHH G C+ + +
Sbjct: 61 CGMGCDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPYEIPACEHHTTGDRPPCSDI--VD 118
Query: 311 TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C C Y ++YR D GKK++ +
Sbjct: 119 TPKCVHLC-EKGYNTSYRDDKHFGKKSYSI 147
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 57/99 (57%), Positives = 73/99 (73%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ KK++ + Q I+++GP+ FSVY+DF+ YKSGVYQH+ G+S+G HA+RVL
Sbjct: 139 HFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSDFINYKSGVYQHHSGESLGGHAIRVL 198
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG END+PYWL ANSWN WGD G FKILRG +E IE
Sbjct: 199 GWGYENDVPYWLCANSWNTDWGDKGYFKILRGSDECGIE 237
>gi|157058743|gb|ABV03129.1| cathepsin B-2744 [Pterocomma populeum]
Length = 244
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 99/169 (58%), Gaps = 3/169 (1%)
Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASN 233
D +T+ +P+ FDAR + C + + + DQ NC S WAV+VA+ +DRLCIA+
Sbjct: 13 DRKTVDANYRTDVPKEFDARRHFVSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIATG 72
Query: 234 GYFTGQISAQHIVAC--TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
G FT +SAQ++++C + GC+GG AW F NG+VTGG++NS EGCQPY P
Sbjct: 73 GKFTDNLSAQNLMSCGDSEKFVGCHGGSAFKAWEFTMGNGIVTGGNFNSNEGCQPYKNRP 132
Query: 292 CEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C+H+ + NC+ + + C++ C N +Y+ Y DL K +M
Sbjct: 133 CDHYGDSSMTNCSSFRRTQMSICREKCVNKNYKVKYEDDLHKTSVVYMT 181
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 36/50 (72%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND 127
++I +GP+ A+ VY +F+ YK G+Y+ GD +G H V+++GWGV++D
Sbjct: 191 QEIMTYGPVTALMYVYENFMGYKEGIYKSTVGDLVGYHHVKLIGWGVDDD 240
>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
Length = 335
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 98/156 (62%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI SNG ++SA+ ++
Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139
Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
GCNGG+P AW FW G+V+GG YNS GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y +Y+ D G ++ V
Sbjct: 200 --GEGDTPKCSKTC-EPGYSPSYKEDKHFGCSSYSV 232
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 65/83 (78%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ FSVY+DFL YKSGVYQH G+ +G HA+R+LGWGVEN PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG++ IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322
>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 338
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 67/161 (41%), Positives = 91/161 (56%), Gaps = 5/161 (3%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
Q +P +FD+RE+WP C S++ I DQS CGSCWA + SDR+CIASN IS
Sbjct: 82 QTNDPIPESFDSREQWPNCNSIKTIRDQSTCGSCWAFAATETYSDRICIASNQELQTSIS 141
Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
++ ++ C C GC GG+P AW++ GV TGG Y C+PY PC+HHV G
Sbjct: 142 SEDLLECCATCGNGCQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPPCDHHVVGQY 201
Query: 301 QNCTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C G +K TP+C + C + E TY+ DL K + +
Sbjct: 202 PPC---GPIKPTPKCVKQCNSQYTEKTYQQDLHHPSKVYQL 239
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 68/101 (67%), Gaps = 5/101 (4%)
Query: 63 HYFKKAHMVPRCNA---MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVR 118
H+ K + +P NA R+I HGP+ A F V +DFL YKSGVY + G H+V+
Sbjct: 231 HHPSKVYQLPN-NAEAIQREIMAHGPVQASFRVASDFLTYKSGVYIRDPKLKYEGGHSVK 289
Query: 119 VLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
++GWGVE PYWL+ANSWN+ WG++G FK+LRG+NE IE
Sbjct: 290 IIGWGVEQGTPYWLIANSWNEDWGENGLFKMLRGKNECGIE 330
>gi|350540002|ref|NP_001232104.1| putative cathepsin B variant 2 precursor [Taeniopygia guttata]
gi|197129221|gb|ACH45719.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 261
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 59/120 (49%), Positives = 82/120 (68%), Gaps = 2/120 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFD+R +WP CP++ I DQ +CGSCWA AISDR+C+ +N + ++SA+ ++
Sbjct: 80 LPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139
Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C C GCNGG+P AWR+W G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 SCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYSIPPCEHHVNGTRPPCT 199
>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
Length = 341
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 92/155 (59%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R +W +CP++R I DQ +CGSCWA A+SDR CI S ++A ++
Sbjct: 89 IPDHFDSRHRWHDCPTIREIRDQGSCGSCWAFGAVEAMSDRHCIHSGAKNIVHLAADDVL 148
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GCNGG+P AW +W H G+VTGG+Y+S EGC PY + C+HHV G L C
Sbjct: 149 SCCMSCGSGCNGGFPGAAWSYWVHKGIVTGGNYDSDEGCMPYPIKACDHHVNGTLGPCD- 207
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C + C Y + D GKK++ V
Sbjct: 208 KSIPPTPRCVRMC-RKGYNVDFADDKHYGKKSYSV 241
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 55/99 (55%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY KK++ VP +I +GP+ A F+VYADF YKSGVYQ + ++G HA+R+L
Sbjct: 233 HYGKKSYSVPSNVTQIQVEIMTNGPVEADFTVYADFPLYKSGVYQRHTDQALGGHAIRLL 292
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVE +PYWL ANSWN WGD G FKILRG +E IE
Sbjct: 293 GWGVEKGVPYWLAANSWNTEWGDKGFFKILRGSDECGIE 331
>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
Length = 383
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 94/155 (60%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR+ WPEC SLR++ DQS+CGSCWAV+ A+SDR+CI S G +SA ++
Sbjct: 123 IPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLL 182
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GC GG P AW++W G+VTG +Y + GC+PY PCEHH
Sbjct: 183 SCCKTCGFGCFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCK 242
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C + C + +Y +Y+ D G++ + V
Sbjct: 243 HDLYPTPKCVKKC-DKNYGKSYKADKYYGEQVYNV 276
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 57/85 (67%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I GP+ A F VY DFL Y G+Y+H G G HAV+VLGWG++ +PYWL ANSW
Sbjct: 285 KEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVPYWLAANSW 344
Query: 138 NDHWGDHGTFKILRGENEADIEMGF 162
N WG+ G F+ILRG NE IE G
Sbjct: 345 NTDWGEDGYFRILRGVNECGIESGI 369
>gi|552159|gb|AAA29434.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 240
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/143 (50%), Positives = 88/143 (61%), Gaps = 5/143 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P ++D R +W C SL HI DQ+NCGSCWAVS A A+SDR+CIAS G ISAQ +V
Sbjct: 95 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 154
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GGWP A+RF GVVTGGDYN++ C+PY + PC HH
Sbjct: 155 SCCTWCGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYG-EC 213
Query: 306 LGKLKTPECKQNC---YNPSYES 325
+G TP CK+ C Y SY S
Sbjct: 214 VGMADTPRCKRRCLLGYPKSYPS 236
>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
Full=Cysteine protease-related 5; Flags: Precursor
gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
Length = 344
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 102/171 (59%), Gaps = 5/171 (2%)
Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
D + + + + +P +FDAR++WP C S+ +I DQS+CGSCWA + A AISDR CIASNG
Sbjct: 70 DEDIVATEVSDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNG 129
Query: 235 YFTGQISAQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
+S++ +++C + GC GG+P AW++W +G+VTGG Y +Q GC+PY++A
Sbjct: 130 AVNTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIA 189
Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCYNP-SYESTYRFDLKKGKKAHMV 340
PC V G TP+C +C + +Y + Y D G A+ V
Sbjct: 190 PCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAV 240
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 47/81 (58%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ F+VY DF QY +GVY H G S+G HAV++LGWGV+N PYWLVANSWN
Sbjct: 250 EILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVANSWN 309
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG+ G F+I+RG NE IE
Sbjct: 310 VAWGEKGYFRIIRGLNECGIE 330
>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
Schistosoma japonicum [Schistosoma japonicum]
Length = 312
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 89/155 (57%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FD+R+ W C S+R I DQS+CGSCWA ++SDR+CI S G + ++SA +++
Sbjct: 92 LPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVNLL 151
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG P +AW +W G+VTGG + GCQPY C HH +
Sbjct: 152 SCCSRCGFGCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCE 211
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ TPEC Q C P Y Y D GK ++ V
Sbjct: 212 VKYYSTPECYQTC-QPDYAIQYENDKYYGKSSYYV 245
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 33/49 (67%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
+ M++I +GP+ A F VY DFL YK+GVY++ G +G HA+R+ G
Sbjct: 251 SIMKEILLNGPVEATFYVYDDFLNYKTGVYKYVTGSLLGGHAIRITWLG 299
>gi|552158|gb|AAA29433.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 236
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/143 (50%), Positives = 88/143 (61%), Gaps = 5/143 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P ++D R +W C SL HI DQ+NCGSCWAVS A A+SDR+CIAS G ISAQ +V
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GGWP A+RF GVVTGGDYN++ C+PY + PC HH
Sbjct: 151 SCCTWCGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYG-EC 209
Query: 306 LGKLKTPECKQNC---YNPSYES 325
+G TP CK+ C Y SY S
Sbjct: 210 VGMADTPRCKRRCLLGYPKSYPS 232
>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 304
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/195 (37%), Positives = 104/195 (53%), Gaps = 13/195 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM R + D ++E +P FD+R+KWP C S+ I
Sbjct: 23 RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 72
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C GC GG+P AW +
Sbjct: 73 DQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCKGGFPGQAWDY 132
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W G+VTGG + GCQPY CEH +G C KTP+CKQ C Y++
Sbjct: 133 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 190
Query: 327 YRFDLKKGKKAHMVL 341
Y D G + + V+
Sbjct: 191 YEQDKHYGDQRYNVI 205
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 61/82 (74%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 213 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIANSW 272
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG +E IE
Sbjct: 273 NEDWGEKGLFRIVRGRDECSIE 294
>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
E64c Complex
gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca073 Complex
gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca042 Complex
gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca059 Complex
gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca074me Complex
gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca075 Complex
gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca076 Complex
gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca077 Complex
gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca078 Complex
Length = 256
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 98/156 (62%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI SNG ++SA+ ++
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60
Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
GCNGG+P AW FW G+V+GG YNS GC+PY++ PCEHHV G CT
Sbjct: 61 TCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 120
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y +Y+ D G ++ V
Sbjct: 121 --GEGDTPKCSKTC-EPGYSPSYKEDKHFGCSSYSV 153
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 65/83 (78%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ FSVY+DFL YKSGVYQH G+ +G HA+R+LGWGVEN PYWLV NS
Sbjct: 161 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 220
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG++ IE
Sbjct: 221 WNTDWGDNGFFKILRGQDHCGIE 243
>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
Length = 339
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 100/156 (64%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+W CP+++ I DQ +CGSCWA +ISDR+CI +NG+ ++SA+ ++
Sbjct: 80 LPESFDAREQWSSCPTIKEIRDQGSCGSCWAFGAVESISDRICIHTNGHVNVEVSAEDML 139
Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGGQCGEGCNGGYPSAAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C ++C P Y S+Y+ D G ++ V
Sbjct: 200 --GEGDTPKCSKSC-EPGYSSSYKEDKHYGYSSYSV 232
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 57/99 (57%), Positives = 71/99 (71%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ VP M +IY++GP+ FSVY+DFL YKSGVYQH G+ +G HA+R+L
Sbjct: 224 HYGYSSYSVPGIEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 283
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG EN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 284 GWGTENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322
>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
[Rhipicephalus pulchellus]
Length = 346
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/170 (43%), Positives = 96/170 (56%), Gaps = 12/170 (7%)
Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI--AS 232
DL ++G LP NFD+RE WPEC ++ I DQ +CGSCWA A+SDR CI S
Sbjct: 85 DLSSLG-----PLPENFDSRENWPECTTIGEIRDQGSCGSCWAFGAVEAMSDRTCIHSPS 139
Query: 233 NGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
G +SA +++C C GCNGG+P AW FW G+VTGG+Y+S +GC PY +
Sbjct: 140 GGPKRVHLSADDLLSCCRTCGNGCNGGFPGSAWSFWVKTGIVTGGNYDSDDGCMPYPIKA 199
Query: 292 CEHHVQGPLQNCTLLGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C+HHV G L C K+ TP C C Y+ Y D GK ++ V
Sbjct: 200 CDHHVNGTLGPCD--KKIPPTPRCVHMC-RKGYDVDYHDDKHYGKSSYSV 246
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 55/99 (55%), Positives = 70/99 (70%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY K ++ VP Q I +GP+ A F+VY+DF+ YKSGVYQ + +++G HA+R+L
Sbjct: 238 HYGKSSYSVPSEEKQIQAEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGGHAIRLL 297
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN +PYWL ANSWN WGD G FKILRG +E IE
Sbjct: 298 GWGVENGVPYWLAANSWNTEWGDKGFFKILRGSDECGIE 336
>gi|146386348|gb|ABQ23962.1| cathepsin B [Oryctolagus cuniculus]
Length = 228
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/146 (47%), Positives = 94/146 (64%), Gaps = 5/146 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI +NG+ ++SA+ ++
Sbjct: 59 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNVEVSAEDML 118
Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 119 TCCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEHHVNGSRPACT 178
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFD 330
G+ TP C + C P Y +Y+ D
Sbjct: 179 --GEGDTPRCSKTC-EPGYSPSYKED 201
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
Length = 346
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 89/155 (57%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FD+R+ W C S+R I DQS+CGSCWA ++SDR+CI S G + ++SA +++
Sbjct: 92 LPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVNLL 151
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GCNGG P +AW +W G+VTGG + GCQPY C HH +
Sbjct: 152 SCCSRCGFGCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCE 211
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ TPEC Q C P Y Y D GK ++ V
Sbjct: 212 VKYYSTPECYQTC-QPDYAIQYENDKYYGKSSYYV 245
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 67/101 (66%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
+Y K ++ V + M++I +GP+ A F V+ DFL YK+GVY++ G +G HA+R++
Sbjct: 237 YYGKSSYYVTSDEVSIMKEILLNGPVEATFYVFDDFLNYKTGVYKYVTGSLLGGHAIRII 296
Query: 121 GWGVE--NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV N PYWL ANSWN WGD G FKILRG NE IE
Sbjct: 297 GWGVSTLNHTPYWLCANSWNKQWGDKGYFKILRGSNECGIE 337
>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 71/159 (44%), Positives = 97/159 (61%), Gaps = 11/159 (6%)
Query: 187 LPRNFDARE--KWPEC-PSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
LP +FD R+ KWP C SL H+ DQ +CGSCWA A A++DR+CIASNG +SA+
Sbjct: 214 LPTSFDPRDGSKWPACKDSLNHVRDQGSCGSCWAFGAAEAMTDRICIASNGQNNFYLSAE 273
Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
+ +C +C GC GG+P AW ++ G+VTGGD+NS +GC PY L C+HHV G Q
Sbjct: 274 DLTSCCDSCGMGCEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACDHHVTGKYQP 333
Query: 303 CTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C G ++ TP C +C N +T+ D G ++ V
Sbjct: 334 C---GDIQPTPACANSCQN---NATWSSDKHFGASSYSV 366
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 50/86 (58%), Positives = 65/86 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY +GP+ A + VYADF+ YKSGVYQH GD +G HAV+++GWGV+ PYW+VANS
Sbjct: 374 MTEIYTNGPVEASYDVYADFVSYKSGVYQHVTGDYLGGHAVKIIGWGVDGSTPYWIVANS 433
Query: 137 WNDHWGDHGTFKILRGENEADIEMGF 162
WN+ WG++G F ILRG +E IE G
Sbjct: 434 WNNDWGNNGFFNILRGSDECGIEDGI 459
>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 340
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/161 (41%), Positives = 92/161 (57%), Gaps = 6/161 (3%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
NA +P FDARE+WP C S++ I DQS CGSCWA + SDR+CIASN IS+
Sbjct: 84 NADPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWAFAATETFSDRICIASNQTLQTSISS 143
Query: 243 QHIVACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
+ ++ C + C GC GG+P AW + GV TGG Y C+PY PC+HHV G
Sbjct: 144 EDLLECCADYCGMGCKGGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQY 203
Query: 301 QNCTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
Q C G ++ TP+C + C + ++TY DL + + +
Sbjct: 204 QPC---GPIQPTPQCVKECNSEYTQNTYEKDLHFASQTYSI 241
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 46/83 (55%), Positives = 58/83 (69%), Gaps = 1/83 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVRVLGWGVENDIPYWLVANS 136
R+I HGP+ A F V ADFL YKSGVY N G H+V+++GWG E + PYWL+ANS
Sbjct: 250 REIMAHGPVQASFKVAADFLTYKSGVYIRNPKLKYEGGHSVKIIGWGKEGNTPYWLIANS 309
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN+ WG+ G F++LRG NE IE
Sbjct: 310 WNEDWGEKGLFRMLRGRNECGIE 332
>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
Length = 335
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 90/155 (58%), Gaps = 1/155 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR +WP C S+ +I DQS+CGSCWA + A A SDR CIASNG +SA+ ++
Sbjct: 81 IPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C NC +GC GG+P AW++ +G TGG Y +Q GC+PY+LAPC V
Sbjct: 141 SCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACP 200
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C C N +Y Y+ D G A+ V
Sbjct: 201 TDGYDTPACVNKCTNSNYNVAYKDDKHFGSTAYAV 235
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+ V + A Q I HGP+ A F+VY DF QYKSGVY H G+ +G HA+R+L
Sbjct: 227 HFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGEELGGHAIRIL 286
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG +N PYWLVANSWN +WG++G F+I+RG NE IE
Sbjct: 287 GWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325
>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
Length = 335
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 92/155 (59%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAREKWP C S+ I DQS+C SCWAV A+A++DR+CI SNG ++SA +V
Sbjct: 86 LPESFDAREKWPNCSSISEIPDQSSCSSCWAVGTASAMTDRICIHSNGEKKPRLSAVDLV 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C P C +GC GG+P +AW +W +G+V+GG + GC PY C H + P
Sbjct: 146 SCCPYCGYGCEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPYPFPKCSHLEETPGLAPCP 205
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C++ C Y T D KGK ++ V
Sbjct: 206 RELYATPKCEKQC-QAGYSKTSEEDKIKGKSSYNV 239
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/89 (47%), Positives = 59/89 (66%), Gaps = 2/89 (2%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M +I +GP+ I+ ++ DF YKSG+YQ+ G +G H + +GWGVEN + YWL A
Sbjct: 245 DIMMEIITNGPVSTIYYIFEDFTVYKSGIYQYTSGSLMGGHGI--IGWGVENGVKYWLAA 302
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFN 163
NSWN+ WG++G F+I RG NE IE N
Sbjct: 303 NSWNEGWGENGYFRIRRGTNECGIESRIN 331
>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
Length = 335
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 98/156 (62%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR++WP CP+++ I DQ +CGSCWA AISDR+CI SNG ++SA+ ++
Sbjct: 80 LPESFDARKQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139
Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
GCNGG+P AW FW G+V+GG YNS GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y +Y+ D G ++ V
Sbjct: 200 --GEGDTPKCSKTC-EPGYSPSYKEDKHFGCSSYSV 232
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 65/83 (78%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ FSVY+DFL YKSGVYQH G+ +G HA+R+LGWGVEN PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 299
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG++ IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322
>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
Complex
Length = 253
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 94/156 (60%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI SNG ++SA+ ++
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60
Query: 247 ACTPNCWGCNGGW--PQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C G P AW FW G+V+GG YNS GC+PY++ PCEHHV G CT
Sbjct: 61 TCCGGECGDGCNGGEPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 120
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y +Y+ D G ++ V
Sbjct: 121 --GEGDTPKCSKTC-EPGYSPSYKEDKHFGCSSYSV 153
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 54/83 (65%), Positives = 66/83 (79%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ FSVY+DFL YKSGVYQH G+ +G HA+R+LGWGVEN PYWLVANS
Sbjct: 161 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVANS 220
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG++ IE
Sbjct: 221 WNTDWGDNGFFKILRGQDHCGIE 243
>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
Length = 339
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 69/146 (47%), Positives = 94/146 (64%), Gaps = 5/146 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDARE+WP CP+++ I DQ +CGSCWA AISDR+CI +NG+ ++SA+ ++
Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNVEVSAEDML 139
Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
GCNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEHHVNGSRPACT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFD 330
G+ TP C + C P Y +Y+ D
Sbjct: 200 --GEGDTPRCSKTC-EPGYSPSYKED 222
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 52/81 (64%), Positives = 64/81 (79%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+IY++GP+ F+VY+DFL YKSGVYQH GD +G HA+R+LGWG EN +PYWLVANSWN
Sbjct: 242 EIYKNGPVEGAFTVYSDFLMYKSGVYQHTTGDIMGGHAIRILGWGEENGVPYWLVANSWN 301
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD G FKILRG++ IE
Sbjct: 302 TDWGDKGFFKILRGQDHCGIE 322
>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
(Schistosoma japonicum)
Length = 316
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 94/156 (60%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS C S WAVS A+SDR+CI S G + ++SA ++
Sbjct: 64 IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLI 123
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C NC GC+GG+P AW +W +G+VTGG + GCQPY CEHH +G +C
Sbjct: 124 SCCENCGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHSKGKYPSCGD 183
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
KTP+CK+ C Y++ Y D G + V+
Sbjct: 184 K-MYKTPQCKRKC-QKGYKTPYEHDKHYGGISINVI 217
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A ++ DFL YKSG+Y++ G +G H VR++GWG+EN YWL AN+W
Sbjct: 225 KEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAANTW 284
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG NE +E
Sbjct: 285 NEDWGEKGYFRIVRGRNECSVE 306
>gi|257215762|emb|CAX83033.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 233
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 70/176 (39%), Positives = 97/176 (55%), Gaps = 12/176 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C GC GG+P AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCKGGFPGQAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPS 322
W G+VTGG + GCQPY CEH +G C KTP+CKQ C++ S
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTCHSIS 225
>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
cantonensis]
Length = 394
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 92/156 (58%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR+ W C S+++I DQS+CGSCWA A+SDR+CIASN +SA ++
Sbjct: 121 IPETFDARQHWSNCQSIKNIRDQSSCGSCWAFGAVEAMSDRICIASNEKIQVTLSADDLL 180
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GC GG P AW++W +G+VTG ++ + +GC+PY PCEHH +
Sbjct: 181 SCCRTCGFGCEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPYPFPPCEHHSNKTRFDPCR 240
Query: 306 LGKLKTPECKQNCYNPSY-ESTYRFDLKKGKKAHMV 340
TP+C + C PSY E Y D G+ A+ V
Sbjct: 241 HDLYPTPKCSKKCV-PSYKEKNYDDDRFYGRTAYGV 275
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 56/84 (66%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I HGP+ F VY DFL Y G+Y H G G HAV+++GWG++ PYWL+ANSW
Sbjct: 284 KEILTHGPVEVAFEVYEDFLHYAGGIYVHTGGKLGGGHAVKLIGWGIDQGTPYWLIANSW 343
Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
N WG+ G F+ILRG +E IE G
Sbjct: 344 NTDWGEEGFFRILRGVDECGIESG 367
>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
Length = 343
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 70/159 (44%), Positives = 94/159 (59%), Gaps = 5/159 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P ++D R+ + +C S+ +I DQS+CGSCWAV+ A AISDR CIASNG +SA+ I+
Sbjct: 81 IPDHYDVRDDFSQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGVVNTLLSAEDIL 140
Query: 247 ACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
C + GC GG+P AW++W NG+VTGG Y SQ GC+PY++APC V G
Sbjct: 141 TCCIGEYYCGDGCEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWP 200
Query: 303 CTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
TP+C +C N SY Y D G A+ V
Sbjct: 201 KCPNSDADTPKCVDHCTSNSSYPIPYEKDKHYGATAYAV 239
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY A+ V R +I ++GP+ F+VYADF QYKSGVY H G +G HAV++L
Sbjct: 231 HYGATAYAVSRKVDQIQSEILKNGPVEVGFTVYADFYQYKSGVYVHVAGPELGGHAVKLL 290
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGV+N PYWL ANSWN +WG++G F+ILRG NE IE
Sbjct: 291 GWGVDNGTPYWLAANSWNTNWGENGYFRILRGVNECGIE 329
>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 333
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 67/159 (42%), Positives = 97/159 (61%), Gaps = 8/159 (5%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
A +P FDAR++W C ++ I DQ NCGSCWA S + A +DRLCIASNG F +SA+
Sbjct: 81 AGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQLLSAE 140
Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
H+ +C C GC GG+P AWR++ +G+VTGG++NS EGCQPY PC + Q+
Sbjct: 141 HVTSCCYRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTGNNSCSGQS 200
Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
K +C++ C+ + +YR D + +++ VL
Sbjct: 201 ------EKNHKCQKKCFGNT-SISYRGDRRYVERSPYVL 232
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 49/123 (39%), Positives = 75/123 (60%), Gaps = 5/123 (4%)
Query: 42 KKKKKKKKKKRLYLPTSIPLS---HYFKKA-HMVPRCNAMRQIYEHGPLVAIFSVYADFL 97
+ +K K +K+ + TSI Y +++ +++ N I +GP+ + F VY DF+
Sbjct: 199 QSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYDNMQNDIMTYGPIESSFDVYDDFI 258
Query: 98 QYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEA 156
YKSGVY + + +G H+V+ +GWGVE ++ YWL+ NSWN+ WGD G FKI RG NE
Sbjct: 259 SYKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMMNSWNNTWGDGGNFKIRRGTNEC 318
Query: 157 DIE 159
+E
Sbjct: 319 QVE 321
>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
Length = 342
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 73/180 (40%), Positives = 105/180 (58%), Gaps = 13/180 (7%)
Query: 171 SEDDDLE-----TMGCQNAK-GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAI 224
SED++L T+ QN +P +FD+R+KW +C S+ +I DQS CG CWA + A+
Sbjct: 68 SEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCGPCWAFAAVEAM 127
Query: 225 SDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
SDR+CI S G + ++SA +++C C GC GG+P AW +W G+VTG + G
Sbjct: 128 SDRICIQSKGKKSVELSAVDLLSCCTECGLGCQGGFPGAAWDYWVEEGIVTGSSKENHTG 187
Query: 284 CQPYTLAPCEHHVQGPLQNCTLLGK--LKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
CQPY CEHH +G C G+ KTP+C+Q C Y++ Y+ D GK ++ VL
Sbjct: 188 CQPYPFPKCEHHTKGKYPAC---GEKIYKTPKCQQKC-QKGYKTPYKKDKYYGKLSYNVL 243
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 49/92 (53%), Positives = 65/92 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I HGP+ A F+VY+DFL YKSG+Y+H G IG HAVR++GWGVE PYWL+ANSW
Sbjct: 251 KEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIGGHAVRIIGWGVEKKTPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIEMGFNNRVEAN 169
N+ WG+ G F+ILRG++ IE + N
Sbjct: 311 NEDWGEKGYFRILRGKDVCGIESAVTAGLPHN 342
>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
Length = 339
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 99/156 (63%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDARE+W CP+++ I DQ +CGSCWA +ISDR+CI +NG+ + ++SA+ ++
Sbjct: 80 LPKSFDAREQWSHCPTIKEIRDQGSCGSCWAFGAVESISDRICIHTNGHVSVEVSAEDLL 139
Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT
Sbjct: 140 TCCGGQCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPACT 199
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C + C P Y TY+ D G ++ +
Sbjct: 200 --GEGDTPKCSKTC-EPGYSPTYKEDKHFGYTSYSL 232
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 58/108 (53%), Positives = 74/108 (68%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT H+ ++ +P M +IY++GP+ FSVY+DFL YKSGVYQH GD
Sbjct: 215 YSPTYKEDKHFGYTSYSLPTNEWEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDM 274
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWG EN +PYWLVANSWN WGD G F+ILRG++ IE
Sbjct: 275 MGGHAIRILGWGEENGVPYWLVANSWNTDWGDGGFFRILRGQDHCGIE 322
>gi|157058735|gb|ABV03125.1| cathepsin B-16 [Aulacorthum solani]
Length = 246
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 62/147 (42%), Positives = 93/147 (63%), Gaps = 5/147 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR FDAR++W C ++ + DQ NCGSCWA ++A +DRLC+A++G F +S + I
Sbjct: 68 IPRTFDARKRWRHCKTIGEVRDQGNCGSCWAFGTSSAFADRLCVATDGDFNELLSPEEIA 127
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW+++ +G+VTGG+Y S EGC+PY + PC+HH QG +C+
Sbjct: 128 FCCHTCGFGCHGGYPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCQHHHQGN-NSCSD 186
Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
K C + CY + Y +RF
Sbjct: 187 KPMEKNHRCTRMCYGDQDLDYNDDHRF 213
>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
Length = 335
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 94/156 (60%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR +WP C S+ +I DQS+CGSCWA + A A SDR CIASNG +SA+ ++
Sbjct: 81 IPATFDARTQWPNCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL-QNCT 304
+C NC +GC+GG+P AW++ +G TGG Y +Q GC+PY+LAPC V +C
Sbjct: 141 SCCSNCGYGCDGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPDCP 200
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP C C N Y + Y+ D G A+ V
Sbjct: 201 DDG-YNTPACVNKCTNTKYNTAYKDDKHFGSTAYAV 235
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+ V + A Q I HGP+ A F+VY DF QYKSGVY H G +G HA+R+L
Sbjct: 227 HFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRIL 286
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG +N PYWLVANSWN +WG++G F+I+RG NE IE
Sbjct: 287 GWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325
>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 351
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 71/164 (43%), Positives = 94/164 (57%), Gaps = 8/164 (4%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI----ASNGYFT 237
+N K LP +FD R+KWP C +L I DQ +CGSCWA A A+SDRLCI S
Sbjct: 91 ENYKSLPASFDPRKKWPNCKTLFEIRDQGSCGSCWAFGAAEAMSDRLCIQQQTVSGRAVM 150
Query: 238 GQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
++SA +++C +C GCNGG+P AW FW H G+V+GG Y ++ C+ Y + PCEHHV
Sbjct: 151 VRLSADDLLSCCRDCGMGCNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEHHV 210
Query: 297 QGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G C G TP+CK C Y+ Y+ D K + V
Sbjct: 211 NGTRPPCE--GDAPTPKCKNVCQE-EYKVPYKKDKHYAVKVYSV 251
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/81 (58%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
++ HGP+ A F VYADF YKSGVYQH G +G HA++++GWG E+ +PYWL ANSWN
Sbjct: 261 ELITHGPVEADFEVYADFPTYKSGVYQHVSGALLGGHAIKLMGWGEEDGVPYWLCANSWN 320
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG+ G FKILRG+N IE
Sbjct: 321 TDWGEGGFFKILRGKNHCGIE 341
>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
Length = 342
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 93/156 (59%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KW +C S+ +I DQS CGSCWA + A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLL 149
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG+P AW +W +G+VTG + GCQPY CEHH G C
Sbjct: 150 SCCTECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECG- 208
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
KTP+C Q C Y++ Y+ D G+ ++ VL
Sbjct: 209 EKIYKTPKCHQKC-QKGYKTPYKKDKYYGRMSYNVL 243
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 45/87 (51%), Positives = 64/87 (73%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I HGP+ F+V++DFL YKSG+Y++ G IG HAVR++GWGVE PYWL+ANSW
Sbjct: 251 KEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHAVRIIGWGVEKKTPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIEMGFNN 164
N+ WG+ G F++LRG++E IE +
Sbjct: 311 NEDWGEKGYFRMLRGKDECGIESAVTS 337
>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
Length = 320
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 67/178 (37%), Positives = 96/178 (53%), Gaps = 20/178 (11%)
Query: 169 NSSEDD-----DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
N + DD D ET + +P+NFDAR WP+C S+R I +Q +CGSCWA
Sbjct: 56 NGARDDPAFFTDTETKNVTIPEQIPQNFDARIVWPQCESIRKIRNQGSCGSCWAFGAVET 115
Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
+SDRLCIASN + SAQ ++AC C GC GG+ AW++W +G+V+GGD+N+ +
Sbjct: 116 MSDRLCIASNATKKFEFSAQDLLACCKECGHGCGGGYSSRAWQYWVTDGIVSGGDFNTSQ 175
Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
GC PY++ TP C C NP Y+ Y D + G +++ +
Sbjct: 176 GCHPYSV--------------QAFRDSTTPNCSSFCTNPKYQKNYSEDKRYGARSYRI 219
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I GP+ A + VY DF Y++GVYQH G+ G H+V++LGWG EN YWLVANSW
Sbjct: 229 EIMTSGPVQASYVVYDDFYSYQNGVYQHVLGNVSGRHSVKILGWGRENGTDYWLVANSWG 288
Query: 139 DHWGD-HGTFKILRGENEADIE 159
WG G FK LRGEN DIE
Sbjct: 289 RDWGRLGGFFKFLRGENHCDIE 310
>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
Full=Cysteine protease-related 4; Flags: Precursor
gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
Length = 335
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/178 (41%), Positives = 97/178 (54%), Gaps = 2/178 (1%)
Query: 165 RVEANSSEDDDLETMGCQ-NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
R E + D+E + N +P FDAR +WP C S+ +I DQS+CGSCWA + A A
Sbjct: 58 RTEFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEA 117
Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
SDR CIASNG +SA+ +++C NC +GC GG+P AW++ +G TGG Y +Q
Sbjct: 118 ASDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQF 177
Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
GC+PY+LAPC V TP C C N +Y Y D G A+ V
Sbjct: 178 GCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAV 235
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 67/99 (67%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+ V + +I HGP+ A F+VY DF QYK+GVY H G +G HA+R+L
Sbjct: 227 HFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRIL 286
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG +N PYWLVANSWN +WG++G F+I+RG NE IE
Sbjct: 287 GWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325
>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
Length = 342
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 93/156 (59%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KW +C S+ +I DQS CGSCWA + A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFTAVEAMSDRICIESKGKKSVELSAVDLL 149
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG+P AW +W +G+VTG + GCQPY CEHH G C
Sbjct: 150 SCCTECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECG- 208
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
KTP+C Q C Y++ Y+ D G+ ++ VL
Sbjct: 209 EKIYKTPKCHQKC-QKGYKTPYKKDKYYGRMSYNVL 243
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 47/82 (57%), Positives = 64/82 (78%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I HGP+ A F+V++DFL YKSG+Y++ G IG HAVR++GWGVE PYWL+ANSW
Sbjct: 251 KEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+ILRG++E IE
Sbjct: 311 NEDWGEKGYFRILRGKDECGIE 332
>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
Length = 333
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 96/156 (61%), Gaps = 8/156 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR++W C ++ I DQ NCGSCWA S + A +DRLCIASNG F +SA+H+
Sbjct: 84 IPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQLLSAEHVT 143
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG+P AWR++ +G+VTGG++NS EGCQPY PC + Q+
Sbjct: 144 SCCYRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTGNNSCSGQS--- 200
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
K +C++ C+ + +YR D + +++ VL
Sbjct: 201 ---EKNHKCQKKCFGNT-SISYRGDRRYVERSPYVL 232
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/123 (39%), Positives = 74/123 (60%), Gaps = 5/123 (4%)
Query: 42 KKKKKKKKKKRLYLPTSIPLS---HYFKKA-HMVPRCNAMRQIYEHGPLVAIFSVYADFL 97
+ +K K +K+ + TSI Y +++ +++ N I +GP+ + F VY DF+
Sbjct: 199 QSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYDNMQNDIMTYGPIESSFDVYDDFI 258
Query: 98 QYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEA 156
YKSGVY + + +G H+V+ +GWGVE ++ YWL+ NSWN WGD G FKI RG NE
Sbjct: 259 SYKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMMNSWNSTWGDGGYFKIRRGTNEC 318
Query: 157 DIE 159
+E
Sbjct: 319 QVE 321
>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
Length = 342
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 93/156 (59%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KW +C S+ +I DQS CGSCWA + A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLL 149
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG+P AW +W +G+VTG + GCQPY CEHH G C
Sbjct: 150 SCCTECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECG- 208
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
KTP+C Q C Y++ Y+ D G+ ++ VL
Sbjct: 209 EKIYKTPKCHQKC-QKGYKTPYKKDKYYGRMSYNVL 243
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 47/82 (57%), Positives = 64/82 (78%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I HGP+ A F+V++DFL YKSG+Y++ G IG HAVR++GWGVE PYWL+ANSW
Sbjct: 251 KEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+ILRG++E IE
Sbjct: 311 NEDWGEKGYFRILRGKDECGIE 332
>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 337
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 89/155 (57%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FDAR++WP CP++ I DQS+CGSCWA A+SDRLCI +NG FT +ISA ++
Sbjct: 80 IPKAFDARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLI 139
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GC GG+P +AW FW G+VTGG + GC+ Y C HH C+
Sbjct: 140 SCCGYCGFGCQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHGSKKYPPCSH 199
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C Q C P ++ Y D + + V
Sbjct: 200 R-IYDTPNCVQKCDTP--DTDYATDKTRANITYNV 231
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/83 (60%), Positives = 63/83 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M++I +GP+ A F VY DFL YKSGVY H+ G +G HA+R+LGWG EN + YWL+ANS
Sbjct: 239 MKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANS 298
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WND WG+ G FK+LRG+NE IE
Sbjct: 299 WNDGWGEDGCFKMLRGKNECGIE 321
>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
Length = 335
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 93/156 (59%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR +WP C S+ +I DQS+CGSCWA + A A SDR CIASNG +SA+ ++
Sbjct: 81 IPDTFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL-QNCT 304
+C NC +GC GG+P AW++ +G TGG Y SQ GC+PY+LAPC V +C
Sbjct: 141 SCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYVSQFGCKPYSLAPCGETVGNTTWPDCP 200
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP C C N +Y Y+ D G A+ V
Sbjct: 201 QDG-YNTPSCVNKCTNNNYNIAYKDDKHFGSTAYAV 235
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+ V + A Q I HGP+ A F+VY DF QYKSGVY H G +G HA+R+L
Sbjct: 227 HFGSTAYAVGKKVAQIQAEILAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRIL 286
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG +N PYWLVANSWN +WG++G F+I+RG NE IE
Sbjct: 287 GWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325
>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
Precursor
gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
Length = 342
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 98/160 (61%), Gaps = 13/160 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P ++D R+ W C + +I DQ+NCGSCWAVS A AISDR+CIAS ISA I+
Sbjct: 87 IPPSYDPRDVWKNCTTF-YIRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C P C GC GGWP AW+++ ++GVV+GG+Y +++ C+PY + PC HH N T
Sbjct: 146 TCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHH-----GNDT 200
Query: 305 LLGKLK----TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ + TP CK+ C P YR D + GK A++V
Sbjct: 201 YYGECRGTAPTPPCKRKC-RPGVRKMYRIDKRYGKDAYIV 239
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 69/98 (70%), Gaps = 2/98 (2%)
Query: 64 YFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y K A++V + +I ++GP+VA F+VY DF YKSG+Y+H G+ G HAV+++G
Sbjct: 232 YGKDAYIVKQSVKAIQSEILKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIG 291
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG EN+ +WL+ANSW++ WG+ G F+I+RG N+ IE
Sbjct: 292 WGNENNTDFWLIANSWHNDWGEKGYFRIVRGSNDCGIE 329
>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
Length = 350
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/161 (44%), Positives = 94/161 (58%), Gaps = 13/161 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFDAREKW +C S+R I DQS+CGSCWAVS A +SDR CI S+G +SA I+
Sbjct: 95 IPENFDAREKWSQCDSIRTIRDQSHCGSCWAVSAAETMSDRTCIHSDGKINVGLSATDIL 154
Query: 247 AC--TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C T GC GG+P AWR++ +GV TGG Y ++ C+PY PC HH +N
Sbjct: 155 SCCGTTCGRGCRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHPCGHH-----RNEI 209
Query: 305 LLGK-----LKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ TP+C Q+C Y S Y D GK A+ +
Sbjct: 210 YYGECPKEIFPTPQCTQSC-QAGYASDYEDDKIYGKSAYAL 249
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 64/99 (64%), Gaps = 3/99 (3%)
Query: 64 YFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y K A+ +P R+I +GP+ A F VY DF +Y+SG+Y H G G HAV+++G
Sbjct: 242 YGKSAYALPNNEKAIQREIMTNGPVQAAFMVYEDFSRYRSGIYVHTAGRREGGHAVKLIG 301
Query: 122 WGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WGV++D YWL ANSWN WG++G F+I+RG + IE
Sbjct: 302 WGVDDDGNKYWLAANSWNSDWGENGYFRIVRGVDHCGIE 340
>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 340
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 62/147 (42%), Positives = 92/147 (62%), Gaps = 5/147 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P++FDAR+KW C ++ + DQ NCGSCWA++ ++A +DRLC+A+N F +SA+ I
Sbjct: 88 IPKHFDARKKWKRCHTIGKVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C +GCNGG+P AW + + G+VTGGDY S EGC+PY + PC + +G C
Sbjct: 148 FCCSSCGYGCNGGYPIKAWESFNNRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 206
Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
+ K C + CY + Y +RF
Sbjct: 207 KPREKNHRCTRTCYGNQDLDYNDDHRF 233
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 63/97 (64%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
+ + ++ + + + + +GP+ A F +Y DF YKSGVY + S +G HAV+++GW
Sbjct: 233 FTRDSYYLTYSSIQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENASYLGGHAVKLIGW 292
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E+ + YWL+ NSWN+ WGD+G FKI RG NE I+
Sbjct: 293 GEEHGVLYWLMVNSWNEGWGDNGLFKIRRGTNECGID 329
>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
Length = 342
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 98/160 (61%), Gaps = 13/160 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P ++D R+ W C + +I DQ+NCGSCWAVS A AISDR+CIAS ISA I+
Sbjct: 87 IPPSYDPRDVWKNCTTF-YIRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C P C GC GGWP AW+++ ++GVV+GG+Y +++ C+PY + PC HH N T
Sbjct: 146 TCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHH-----GNDT 200
Query: 305 LLGKLK----TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G+ + TP CK+ C P YR D + GK A++V
Sbjct: 201 YYGECRGTAPTPPCKRKC-RPGVRKMYRIDKRYGKDAYIV 239
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 68/98 (69%), Gaps = 2/98 (2%)
Query: 64 YFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y K A++V + +I +GP+VA F+VY DF YKSG+Y+H G+ G HAV+++G
Sbjct: 232 YGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIG 291
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG EN+ +WL+ANSW++ WG+ G F+I+RG N+ IE
Sbjct: 292 WGNENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIE 329
>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
Length = 342
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS C S WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAIDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C NC GC+GG +W +W +G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCENCGSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 42/82 (51%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A +Y DFL YKSG+Y++ G I HAVR++GWGVEN YWL AN+W
Sbjct: 251 KEIMMYGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG +E IE
Sbjct: 311 NEDWGEKGYFRIVRGRDECLIE 332
>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
Length = 337
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 88/155 (56%), Gaps = 4/155 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FDAR++WP CP++ I DQS+CGSCWA A+SDRLCI +NG FT +ISA ++
Sbjct: 80 IPKAFDARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLI 139
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GC GG+P AW FW G+VTGG + GC+ Y C HH C+
Sbjct: 140 SCCGYCGFGCQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHGSKKYPPCSH 199
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C Q C P ++ Y D + + V
Sbjct: 200 R-IYDTPNCVQKCDTP--DTDYATDKTRANITYNV 231
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 50/83 (60%), Positives = 63/83 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M++I +GP+ A F VY DFL YKSGVY H+ G +G HA+R+LGWG EN + YWL+ANS
Sbjct: 239 MKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANS 298
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WND WG+ G FK+LRG+NE IE
Sbjct: 299 WNDGWGEDGYFKMLRGKNECGIE 321
>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
Length = 309
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS C S WAVS A+SDR+CI S G + ++SA ++
Sbjct: 57 IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLI 116
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C NC GC+GG +W +W +G+VTGG + GC+PY C+H V+G + C
Sbjct: 117 SCCKNCGSGCDGGVTGYSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 175
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 176 -DKLYKTPQCKQTCQKGYNTSYE 197
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 63/99 (63%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HG + A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 201 HYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 260
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 261 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 299
>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
Length = 342
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS C S WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C NC GC+GG +W +W +G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKNCGSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 42/82 (51%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A +Y DFL YKSG+Y++ G I HAVR++GWGVEN YWL AN+W
Sbjct: 251 KEIMMYGPVEAYLHIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTSYWLAANTW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG +E IE
Sbjct: 311 NEDWGEKGYFRIVRGRDECLIE 332
>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 88/151 (58%), Gaps = 8/151 (5%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP +FDAR +P C + HI DQS CGSCWA V A +DRLC+ SNG FT +SA +
Sbjct: 60 LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSAGEM 119
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQGP 299
AC P+ +GC+GG+P AW + G+ TGGDY ++ +GC PY PC HH+
Sbjct: 120 NACAPS-YGCDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGDGCWPYDFPPCAHHINDT 178
Query: 300 LQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
G +TP C + C+NP Y ++ + D
Sbjct: 179 KYPKCPKGSYETPNCVEQCHNPKYSTSLKND 209
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 65/118 (55%), Gaps = 8/118 (6%)
Query: 43 KKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSG 102
K K R Y+ S P + NA I GP+ A + VY DFL YKSG
Sbjct: 201 KYSTSLKNDRHYMLESSPYQYSVN--------NAKNAIRTDGPVSASYLVYEDFLAYKSG 252
Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
VY+H G +G HAV+++GWG EN YWLV NSWN+ WGDHG FKI G + D ++
Sbjct: 253 VYKHTSGSYLGGHAVKIIGWGEENGEAYWLVVNSWNEDWGDHGLFKIALGNCQIDDDL 310
>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
Length = 337
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 70/155 (45%), Positives = 86/155 (55%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAREKWP C S+R I DQS+CGSCWAV+ A+SDR+CI SNG ++SA +V
Sbjct: 76 LPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAIDLV 135
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG P AW +W NG+VTGG + GC PY C H N
Sbjct: 136 SCCSYCGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCP 195
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C C Y+ TY D GK ++ V
Sbjct: 196 RYTYPTPSCYPYC-QAGYDKTYEKDKVYGKTSYNV 229
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/83 (55%), Positives = 59/83 (71%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I ++GP+ A F VY DF YKSG+Y H G G HA+R++GWGVEN + YWL ANS
Sbjct: 237 MEEIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGVENGVKYWLTANS 296
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WG++G F+ILRG +E IE
Sbjct: 297 WNVGWGENGYFRILRGTDECRIE 319
>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
Length = 342
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS C S WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAIDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C NC GC+GG +W +W +G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKNCGSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 42/82 (51%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A +Y DFL YKSG+Y++ G I HAVR++GWGVEN YWL AN+W
Sbjct: 251 KEIMMYGPVEAYLQIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTSYWLAANTW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG +E IE
Sbjct: 311 NEDWGEKGYFRIVRGRDECLIE 332
>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
Length = 342
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 92/156 (58%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KW +C S+ +I DQS CGSCWA + A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSSFDSRKKWRQCKSISNIRDQSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLL 149
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG+P AW +W +G+VTG + GCQPY CEHH G C
Sbjct: 150 SCCTECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECG- 208
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
KTP+C Q C Y++ Y D G+ ++ VL
Sbjct: 209 EKIYKTPKCHQKC-QKGYKTPYGKDKYYGRMSYNVL 243
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 47/82 (57%), Positives = 64/82 (78%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I HGP+ A F+V++DFL YKSG+Y++ G IG HAVR++GWGVE PYWL+ANSW
Sbjct: 251 KEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+ILRG++E IE
Sbjct: 311 NEDWGEKGYFRILRGKDECGIE 332
>gi|157058775|gb|ABV03145.1| cathepsin B-16D [Myzus persicae]
Length = 236
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 70/159 (44%), Positives = 92/159 (57%), Gaps = 12/159 (7%)
Query: 164 NRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
N + SED D N +PR FDAR KW C ++ + DQ NCGSCWAV+ ++A
Sbjct: 64 NNMNLYKSEDADY------NNTYIPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSA 117
Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
+DRLC+A+N F +SA+ I C C +GCNGG+P AW+ + G+VTGGDY S E
Sbjct: 118 FADRLCVATNADFNELLSAEEITFCCHTCGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGE 177
Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTP--ECKQNCY 319
GC+PY + PC + QG N T GK C + CY
Sbjct: 178 GCEPYRVPPCPNDDQG---NNTCAGKPMESNHRCTRMCY 213
>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 76/182 (41%), Positives = 103/182 (56%), Gaps = 20/182 (10%)
Query: 165 RVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAI 224
R+ ED D E +P ++D R+ W C + +I DQ+NCGSCWAVS A AI
Sbjct: 72 RLNLMVKEDPDPEV-------DIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAVSTAAAI 123
Query: 225 SDRLCIASNGYFTGQISAQHIVACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
SDR+CIAS ISA I+ C P C GC GGWP AW+++ ++GVV+GG+Y ++
Sbjct: 124 SDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKG 183
Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLK----TPECKQNCYNPSYESTYRFDLKKGKKAH 338
C+PY + PC HH N T G+ + TP CK+ C P YR D + GK A+
Sbjct: 184 VCRPYPIHPCGHH-----GNDTYYGECRGTAPTPPCKKEC-RPGVRKVYRIDKRYGKDAY 237
Query: 339 MV 340
+V
Sbjct: 238 IV 239
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 68/98 (69%), Gaps = 2/98 (2%)
Query: 64 YFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y K A++V + +I +GP+VA F+VY DF YKSG+Y+H G+ G HAV+++G
Sbjct: 232 YGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIG 291
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG EN+ +WL+ANSW++ WG+ G F+I+RG N+ IE
Sbjct: 292 WGNENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIE 329
>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
Length = 342
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332
>gi|189308076|gb|ACD86922.1| cysteine protease [Caenorhabditis brenneri]
Length = 228
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 86/145 (59%), Gaps = 1/145 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR +WP C S+ +I DQS+CGSCWA + A A SDR CIASNG +SA+ ++
Sbjct: 81 IPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C NC +GC GG+P AW++ +G TGG Y +Q GC+PY+LAPC V
Sbjct: 141 SCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACP 200
Query: 306 LGKLKTPECKQNCYNPSYESTYRFD 330
TP C C N +Y Y+ D
Sbjct: 201 TDGYDTPACVNKCTNSNYNVAYKDD 225
>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
Length = 309
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 77/179 (43%), Positives = 110/179 (61%), Gaps = 5/179 (2%)
Query: 160 MGFN-NRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAV 218
MG N + ++ N + D + + + ++ LP FD+R +WP CP++R I DQ +CG+CWA
Sbjct: 23 MGINYSELKPNVTPDLEPPFVVSKISENLPDEFDSRVRWPNCPTIREIRDQGSCGACWAF 82
Query: 219 SVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGD 277
+ A A+SDR+CI S+ SA ++++C +C GC G LAW W +G+V+GG
Sbjct: 83 AAAEAMSDRVCIHSSQTKHFHFSALNLLSCCDSCEKGCLGCDHHLAWDHWVKHGIVSGGS 142
Query: 278 YNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKK 336
Y S+EGCQPY L PCEHH GP +NCT G TP C + C P Y+ +Y DL GK+
Sbjct: 143 YGSKEGCQPYHLPPCEHHRAGPRRNCTKYG--PTPSCARVC-QPDYKISYEDDLHFGKQ 198
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 40/83 (48%), Positives = 56/83 (67%), Gaps = 2/83 (2%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
+I+ +GP+ A + Y DF Y+SG+Y H G + HAV+++GWG + + PYWLVANS
Sbjct: 213 EIFHNGPVEATMAAYEDFYTYESGIYHHIEGTFVCDHAVKIIGWGTDKKTNTPYWLVANS 272
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
+N WG++G FKI RG NE IE
Sbjct: 273 FNTDWGEYGFFKIKRGVNECGIE 295
>gi|157058773|gb|ABV03144.1| cathepsin B-16D [Sitobion avenae]
Length = 215
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 64/136 (47%), Positives = 88/136 (64%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR+FDAR KW C ++ + DQ NCGSCWAV+ ++A +DRLC+A++G F +SA+ I
Sbjct: 70 IPRHFDARRKWRHCQTIGEVRDQGNCGSCWAVATSSAFADRLCVATDGDFNQLLSAEEIT 129
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG+P AW + +G+VTGGDY S+EGC+PY + PC + G N T
Sbjct: 130 FCCHTCGFGCNGGYPIKAWERFKKHGLVTGGDYKSEEGCEPYRVPPCPYDESG---NNTC 186
Query: 306 LGKL--KTPECKQNCY 319
GK K C + CY
Sbjct: 187 AGKPMEKNHRCTRMCY 202
>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 87/155 (56%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R W +C + I DQ CGSCWA A AISDR+CIAS G +A+ ++
Sbjct: 94 IPATFDSRSNWSDCSVIGKIRDQGGCGSCWAFGAAEAISDRICIASKGATDVMYAAEDVL 153
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GCNGG+P A ++ G+VTGG Y +++ CQPYTL CEHHV G CT
Sbjct: 154 SCCLTCGNGCNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPYTLEACEHHVPGDRPPCTE 213
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP+C C Y+ D G KA+ V
Sbjct: 214 GG--GTPKCSHQCIPDYTTKAYKDDKVHGHKAYSV 246
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 50/95 (52%), Positives = 64/95 (67%), Gaps = 2/95 (2%)
Query: 67 KAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
KA+ VP ++I +GP+ A F+VY+DF YKSGVY+H G +G HA++++GWG
Sbjct: 242 KAYSVPNDVGKIQQEIMHYGPVEAAFTVYSDFPSYKSGVYRHTSGSELGGHAIKIIGWGT 301
Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
E YWL+ NSWN WGD GTFKILRG NE IE
Sbjct: 302 EGGDDYWLINNSWNSDWGDKGTFKILRGSNECGIE 336
>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
Length = 342
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 91/143 (63%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS C S WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSSVGAMSDRICIQSGGKQSVELSAIDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C NC GC+GG+ +W +W +G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKNCGSGCDGGYFLPSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL +TP+CKQ C YN SYE
Sbjct: 209 -DKLYETPQCKQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 57/82 (69%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+ I HGP+ A +Y DFL YKSG+Y++ G I HAVR++GWGVEN YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG NE IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECLIE 332
>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
Length = 342
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAIGAMSDRICIQSGGKQSVKLSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C NC GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCENCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
>gi|157058733|gb|ABV03124.1| cathepsin B-16a [Acyrthosiphon pisum]
Length = 274
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 61/147 (41%), Positives = 91/147 (61%), Gaps = 5/147 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR FDAR +W C ++ + DQ +CGSCWA++ ++A +DRLC+A+NG F +SA+ I
Sbjct: 84 IPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 143
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG+P AW+++ +G+VTGG+Y S EGC+PY + PC +G +C
Sbjct: 144 FCCHTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQDEEGK-SSCAG 202
Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
K C + CY + Y +RF
Sbjct: 203 KPIEKNHRCTRMCYGNQDLDYNEDHRF 229
>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
Length = 342
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332
>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
Length = 342
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
Length = 337
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 65/149 (43%), Positives = 86/149 (57%), Gaps = 5/149 (3%)
Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
E++G +N +P+ FDARE+WP CP++ I DQS+CGSCWA A+SDRLCI SNG F
Sbjct: 73 ESLGDEN---IPKTFDAREQWPHCPTIGQIRDQSSCGSCWAFGAVEAMSDRLCIHSNGTF 129
Query: 237 TGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH 295
T +S+ +V+C C +GC GG+P AW FW G+VTGG GC+ Y C HH
Sbjct: 130 TKSLSSIDLVSCCGYCGFGCQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHH 189
Query: 296 VQGPLQNCTLLGKLKTPECKQNCYNPSYE 324
C TP+C C P+ +
Sbjct: 190 GSKKYPPCPHR-IYDTPKCVPKCDTPNID 217
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 49/83 (59%), Positives = 62/83 (74%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M++I +GP+ A F VY DF YK GVY H+ G+ IG HA+R+LGWG EN PYWL+ANS
Sbjct: 239 MKEIMINGPVEAAFEVYEDFFGYKQGVYFHSTGEFIGGHAIRILGWGEENGTPYWLIANS 298
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN+ WG+ G FK+LRG+NE IE
Sbjct: 299 WNEGWGEDGYFKMLRGKNECGIE 321
>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
Length = 342
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 57/82 (69%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+ I HGP+ A +Y DFL YKSG+Y++ G I HAVR++GWGVEN YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG NE IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECLIE 332
>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
Length = 342
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 57/82 (69%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+ I HGP+ A +Y DFL YKSG+Y++ G I HAVR++GWGVEN YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG NE IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECLIE 332
>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 830
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 102/210 (48%), Gaps = 46/210 (21%)
Query: 149 ILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPS-LRHIA 207
++RG N+ I+ G+ + + LP +FDAR +P C + HI
Sbjct: 516 LMRGSNDKAIKKGY-----------------AIEELQDLPTDFDARTAFPNCSKVIGHIR 558
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFW 267
DQS CGSCWA V A +DRLCI SNG FT +SA + AC P+ GCNGG+P AW +
Sbjct: 559 DQSACGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGEMNACAPS-HGCNGGFPNSAWSWV 617
Query: 268 GHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQ------------------GPLQNC 303
G+ TGGDY ++ +GC PY PC HH+ +
Sbjct: 618 HDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPECPKVSCSGESPPATAETA 677
Query: 304 TLLG---KLKTPECKQNCYNPSYESTYRFD 330
T++ +TP C + C+NP Y +T R D
Sbjct: 678 TVIAYQNSYETPNCAEQCHNPKYTTTLRDD 707
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 40/67 (59%), Positives = 50/67 (74%)
Query: 86 LVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHG 145
+ A FSVY DFL YKSGVY+H G+ +G HAV+++GWG E+ YW+V NSWN+ WGDHG
Sbjct: 749 VSASFSVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEESGQAYWIVVNSWNEDWGDHG 808
Query: 146 TFKILRG 152
FKI G
Sbjct: 809 LFKIALG 815
>gi|161343827|tpg|DAA06094.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 207
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 60/129 (46%), Positives = 86/129 (66%), Gaps = 5/129 (3%)
Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
SED++ + + +PR FDAR+KW C ++ I DQ NCGSCWA++ ++A +DRLC+
Sbjct: 76 SEDENYDNL----LGRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAFADRLCV 131
Query: 231 ASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289
ASNG F +SA+ + C C +GCNGG+P AW + +G+VTGGDY S+EGC+PY +
Sbjct: 132 ASNGNFNQLLSAEELTFCCHKCGFGCNGGYPIKAWERFMKHGLVTGGDYKSREGCEPYRV 191
Query: 290 APCEHHVQG 298
PC + G
Sbjct: 192 PPCPYDELG 200
>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 278
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 68/151 (45%), Positives = 86/151 (56%), Gaps = 8/151 (5%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP +FDAR +P C + HI DQS CGSCWA V A +DRLCI S+G FT +SA +
Sbjct: 21 LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSHGTFTELLSAGEM 80
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQGP 299
AC P+ GCNGG+P AW + G+ TGGDY ++ +GC PY PC HHV
Sbjct: 81 NACAPS-HGCNGGFPNSAWSWVHDKGIATGGDYVAEDDMTKDDGCWPYDFPPCAHHVNDS 139
Query: 300 LQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
+TP C + C+NP Y +T R D
Sbjct: 140 KYPKCPKDSYETPNCAEQCHNPKYTTTLRDD 170
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 44/78 (56%), Positives = 55/78 (70%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+A I GP+ A F+VY DFL YKSGVY+H G+ +G HAV+++GWG E+ YWLV
Sbjct: 186 DAKNAIRTDGPVSASFTVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEESGQAYWLVV 245
Query: 135 NSWNDHWGDHGTFKILRG 152
NSWN+ WGDHG FKI G
Sbjct: 246 NSWNEDWGDHGLFKIALG 263
>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
Length = 339
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 91/158 (57%), Gaps = 10/158 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R++W +CP++R I DQ CGSCWA ++SDR CI S ++A ++
Sbjct: 88 IPAQFDSRQQWQDCPTIREIRDQGACGSCWAFGAVESMSDRHCIHSGAKNIVHLAADDVL 147
Query: 247 ACTPNCWGC----NGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
+C CWGC NGG+P AW +W G+VTGG+Y++ EGC PY + C+HHV G L
Sbjct: 148 SC---CWGCGSGCNGGFPGAAWSYWVEKGIVTGGNYDTDEGCMPYPVPSCDHHVNGTLGP 204
Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C TP+C + C Y ++ D GK ++ V
Sbjct: 205 CGQ--DPPTPKCVRLC-RKGYNIDFKDDKHYGKSSYSV 239
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY K ++ V +I ++GP+ F+VYADF YKSGVY+ + D++G HA+R+L
Sbjct: 231 HYGKSSYSVSSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRIL 290
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN +P+WLVANSWN WGD G FKILRG NE IE
Sbjct: 291 GWGVENGVPFWLVANSWNTEWGDKGYFKILRGSNECGIE 329
>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
Length = 326
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 92/155 (59%), Gaps = 13/155 (8%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P NFDAR +WP+C S++ I +QSNCGSCWA S A ISDR CIASNG IS ++
Sbjct: 84 PLNFDARTRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLT 143
Query: 248 CTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GC+GG+P A+++W GVVTGGDY GC+PY + PC NC
Sbjct: 144 CCGMSCGEGCDGGFPYRAFQWWARRGVVTGGDYLGT-GCKPYPIRPCNS------DNCV- 195
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
L+TP C+ +C P Y +TY D G A+ V
Sbjct: 196 --NLQTPPCRLSC-QPGYRTTYTNDKNYGNSAYPV 227
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 53/99 (53%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
+Y A+ VPR A Q IY +GP+VA F VY DF +YKSG+Y+H G S G HAV+++
Sbjct: 219 NYGNSAYPVPRTVAAIQADIYYNGPVVAAFIVYEDFEKYKSGIYRHIAGRSKGGHAVKLI 278
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E PYWL NSW WG+ GTF+ILRG +E IE
Sbjct: 279 GWGTERGTPYWLAVNSWGSQWGESGTFRILRGVDECGIE 317
>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 209 -DKLYKTPQCKQICQKGYNTSYE 230
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332
>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
Length = 338
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 64/136 (47%), Positives = 85/136 (62%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR FDAR KW C ++ + DQ NCGSCWAV+ ++A +DRLC+A+N F +SA+ I
Sbjct: 86 IPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSAFADRLCVATNADFNELLSAEEIT 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG+P AW+ + G+VTGGDY S EGC+PY + PC + QG N T
Sbjct: 146 FCCHTCGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQG---NNTC 202
Query: 306 LGKLKTP--ECKQNCY 319
GK C + CY
Sbjct: 203 AGKPMESNHRCTRMCY 218
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 62/97 (63%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
Y + + + + + + +GP+ A F VY DF YKSGVY + S +G HAV+++GW
Sbjct: 231 YTRDYYYLTYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENASYLGGHAVKLIGW 290
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E +PYWL+ NSWN+ WGDHG FKI RG NE ++
Sbjct: 291 GEEYGVPYWLMVNSWNEDWGDHGFFKIQRGTNECGVD 327
>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 61/147 (41%), Positives = 91/147 (61%), Gaps = 5/147 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR FDAR +W C ++ + DQ +CGSCWA++ ++A +DRLC+A+NG F +SA+ I
Sbjct: 88 IPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG+P AW+++ +G+VTGG+Y S EGC+PY + PC +G +C
Sbjct: 148 FCCHTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQDEEGK-SSCAG 206
Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
K C + CY + Y +RF
Sbjct: 207 KPIEKNHRCTRMCYGNQDLDYNDDHRF 233
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 56/83 (67%), Gaps = 1/83 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
+ + +GP+ A F VY DF YKSGVYQ + +G HAV+++GWGVE PYWL+ NS
Sbjct: 247 KDVMNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNS 306
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKI RG +E I+
Sbjct: 307 WNAQWGDNGLFKIRRGTDECGID 329
>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 341
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 93/160 (58%), Gaps = 13/160 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +D R+ W C S +I DQ+NCGSCWAVS A AISDR+CIA+ ISA +V
Sbjct: 86 IPEEYDPRKIWSNCTSF-YIRDQANCGSCWAVSTAAAISDRICIATKARKQVNISATDLV 144
Query: 247 AC-TPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C TP C +GC+GGW AW ++ + G+V+GG+Y S+ C+PY + PC HH N T
Sbjct: 145 TCCTPTCGFGCDGGWSIKAWEYFTYAGLVSGGEYRSKRCCRPYPIHPCGHH-----GNDT 199
Query: 305 LLG----KLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G + TP CK+ C P Y YR D + G A +
Sbjct: 200 YYGECPEEASTPSCKKKC-QPGYRKLYRMDKRYGTDAFQL 238
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 42/82 (51%), Positives = 62/82 (75%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+++ ++GP+ A F+VY DF YKSG+Y+H G+ G HAV+++GWG EN YWL+ANSW
Sbjct: 247 KELLKNGPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAVKMIGWGTENRTDYWLIANSW 306
Query: 138 NDHWGDHGTFKILRGENEADIE 159
+D WG++G F+I+RG N+ IE
Sbjct: 307 HDDWGENGYFRIIRGINDCGIE 328
>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 355
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 70/163 (42%), Positives = 92/163 (56%), Gaps = 10/163 (6%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
A P +FDAR W C S+ HI +Q NC + WA+SV +A++DR+CIAS G T S Q
Sbjct: 93 ANETPESFDARYHWFNCTSISHIWNQGNCAADWAISVTSAMNDRICIASQGNITALYSPQ 152
Query: 244 HIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE--------H 294
+V+C +C GC+GG+ AWR+ G+VTGGDY S EGCQP+ + PC
Sbjct: 153 KLVSCCEDCGNGCSGGYTAAAWRYILKKGIVTGGDYGSNEGCQPWLVQPCNASTTAADPS 212
Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKA 337
V GP C TP+C +CYN +E Y D+ K KK
Sbjct: 213 SVLGPHGVCG-GDPATTPKCDLSCYNARHEGKYLDDIIKAKKV 254
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 59/94 (62%)
Query: 66 KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 125
KK C+A + + +HGP V VY DFL YKSGVY H GD +GL +VR++GWG+E
Sbjct: 252 KKVFTFDGCSARKNLRKHGPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSVRMIGWGLE 311
Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+WL+ANSW WGD G FKI R NE IE
Sbjct: 312 GGQAFWLLANSWGTSWGDKGFFKIRRFVNECWIE 345
>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
Full=Cysteine protease-related 3; Flags: Precursor
gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
Length = 370
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 73/185 (39%), Positives = 99/185 (53%), Gaps = 13/185 (7%)
Query: 158 IEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWA 217
+++ F +E +S +L G + LP FDAREKWP+C +++ I +Q+ CGSCWA
Sbjct: 63 MDVKFAEPLEKDSDVASELFVRGEIVPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWA 122
Query: 218 VSVANAISDRLCIASNGYFTGQISAQHIVAC--TPNCWGCNGGWPQLAWRFWGHNGVVTG 275
A ISDR+CI SNG IS + I++C T +GC GG+ A RFW +G VTG
Sbjct: 123 FGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTG 182
Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
GDY GC PY+ APC +NC + TP CK C + Y+ D G
Sbjct: 183 GDYGGH-GCMPYSFAPCT-------KNCP---ESTTPSCKTTCQSSYKTEEYKKDKHYGA 231
Query: 336 KAHMV 340
A+ V
Sbjct: 232 SAYKV 236
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 64/101 (63%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVPRCNAMRQI----YEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVR 118
HY A+ V ++ +I Y +GP+ A + VY DF YKSGVY + G +G HAV+
Sbjct: 228 HYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVK 287
Query: 119 VLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
++GWGVEN + YWL+ANSW +G+ G FKI RG NE IE
Sbjct: 288 IIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIE 328
>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
Length = 342
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 209 -DKLYKTPQCKQICQKGYNTSYE 230
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332
>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 209 -DKLYKTPQCKQICQKGYNTSYE 230
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332
>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
Length = 278
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 70/155 (45%), Positives = 86/155 (55%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAREKWP C S+R I DQS+CGSCWAV+ A+SDR+CI SNG ++SA +V
Sbjct: 63 LPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAIDLV 122
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG P AW +W NG+VTGG + GC PY C H N
Sbjct: 123 SCCSYCGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCP 182
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C C Y+ TY D GK ++ V
Sbjct: 183 GYIYPTPSCYPYC-QAGYDKTYEEDKVYGKTSYNV 216
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
M++I ++GP+ A F VY DF YKSG+Y H G G HA+R++GWGVEN + YW
Sbjct: 224 MQEIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGVENGVNYW 278
>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 517
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 94/277 (33%), Positives = 129/277 (46%), Gaps = 44/277 (15%)
Query: 62 SHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
SHY+ + M++IY+ GP+VA F VY DFL Y SG Q G+ +
Sbjct: 140 SHYYVNQDEF---DIMQEIYQRGPVVAGFKVYHDFLYYISG--QFICGNKRCEEEENLTS 194
Query: 122 WGV------ENDIPYWLVANSWNDHWGD-------------HGTFKILRGENEADIEMGF 162
W V E + LV + G G G ++ +I +
Sbjct: 195 WEVNFAYVEEQEKKNALVKLNLKRRKGTKLKQKLCMQKEKIMGLNPYFSGMSKEEILIRM 254
Query: 163 NNRVEANSSE-DDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVA 221
++ +S+E D L K LP++FD+REKWPEC +R I DQSNCGSCWAVS A
Sbjct: 255 GTKLMNSSTEFDSKLSNNNEALIKKLPKHFDSREKWPECEWIRFIRDQSNCGSCWAVSAA 314
Query: 222 NAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ 281
+ ++DR CIAS G T IS + I+AC G + +W G+ TGG Y +
Sbjct: 315 SVMTDRHCIASKGQETPYISDEQILAC---------GMIPSPFNYWKKMGIATGGPYGDK 365
Query: 282 EGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
CQPY++APC C+ TP CK +C
Sbjct: 366 SCCQPYSIAPC--------SKCSYTA--STPSCKYDC 392
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 47/83 (56%), Positives = 59/83 (71%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY HGP+VA F VY DF Y SG+YQ ++G HA+R++GWG EN IPYWL+ANS
Sbjct: 421 MNEIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVAMGGHAIRIIGWGEENGIPYWLIANS 480
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN +G+ G F+I RG NE IE
Sbjct: 481 WNTTFGEKGFFRIRRGTNECRIE 503
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 24/94 (25%), Positives = 44/94 (46%), Gaps = 15/94 (15%)
Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
A +++ GC G + A+ +W +G+VTGG Y + C PY+++PC
Sbjct: 57 ALFVISRIAALVGCRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPCT-------- 108
Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
+ P+C++ C +++Y LK+ K
Sbjct: 109 --MCRPYMLAPKCQRTC-----QASYNLSLKRDK 135
>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 90/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL +TP+CKQ C YN SYE
Sbjct: 209 -DKLYETPQCKQTCQKGYNTSYE 230
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332
>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
Length = 342
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 62/156 (39%), Positives = 94/156 (60%), Gaps = 3/156 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R++WP+C S+ +I DQS CG+ WA + A+SDR+CI S G + ++SA ++
Sbjct: 90 IPTSFDSRKEWPQCKSISNIRDQSRCGAGWAFAAVQAMSDRICIESKGKKSVELSAVDLL 149
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC G+P +AW +W G+VTGG + GCQPY CEHH +G C
Sbjct: 150 SCCIECGLGCQMGFPGIAWDYWVQEGIVTGGSKENHTGCQPYPFPKCEHHTKGRYPECGE 209
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
+ +K P+C Q C Y++ Y D GK ++ +L
Sbjct: 210 IIYMK-PKCHQKC-QKGYKTPYEKDKYYGKVSYNLL 243
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 46/87 (52%), Positives = 64/87 (73%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I HGP+ A F V++DFL YKSG+Y+H G IG H VR++GWGVE + PYWL+ANSW
Sbjct: 251 KEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIGSHVVRIIGWGVEKETPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIEMGFNN 164
N+ WG+ G F++LRG++E IE +
Sbjct: 311 NEDWGEKGYFRMLRGKDECGIESAVTS 337
>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 70/144 (48%), Positives = 85/144 (59%), Gaps = 10/144 (6%)
Query: 187 LPRNFDAREKW-PECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP ++D REKW CPS I DQ +CGSCWA A +DR+CI SNG ISA+ +
Sbjct: 77 LPDSYDTREKWGSTCPSTTEIRDQGSCGSCWAFGAVEAFTDRICIQSNGAKNPHISAEDL 136
Query: 246 VACTPNCW---GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
+ C W GCNGG AW F+ + G VTGG YNS EGCQPY + CEHH G +
Sbjct: 137 LTCC-GFWCGFGCNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSCEHHTSGSKKP 195
Query: 303 CTLLGKLKTPECKQNC---YNPSY 323
C G TP+CK++C YN SY
Sbjct: 196 CE--GSEPTPKCKRSCREGYNVSY 217
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 66/81 (81%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+IY +GP+ A F+VY+DF YKSGVY++ G+++G HA+++LGWGVEN++PYWLVANSWN
Sbjct: 240 EIYLNGPVEAAFTVYSDFPNYKSGVYKYTTGNALGGHAIKILGWGVENNVPYWLVANSWN 299
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD G FKILRG NE IE
Sbjct: 300 PDWGDKGFFKILRGSNECGIE 320
>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE I+
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSID 332
>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
Length = 342
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
Length = 342
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 57/82 (69%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+ I HGP+ A +Y DFL YKSG+Y++ G I HAVR++GWGVEN YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG NE IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECLIE 332
>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 63/99 (63%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 87/134 (64%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FDAR+KW +C ++ + DQ NCGSCWA++ ++A +DRLC+A++ F +S + +
Sbjct: 88 IPKKFDARKKWRKCKTIGAVRDQGNCGSCWALATSSAFADRLCVATDADFNEFLSPEELT 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG+P AW + +G+VTGGDY S EGC+PY + PC HH +G +C+
Sbjct: 148 FCCHTCGYGCNGGYPIKAWERFKSHGLVTGGDYKSGEGCEPYRVPPCRHHAEGN-NSCSD 206
Query: 306 LGKLKTPECKQNCY 319
K C + CY
Sbjct: 207 KPMEKNHRCTRMCY 220
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 62/97 (63%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGW 122
Y + ++ + + + + +GP+ A F VY DF YKSGVY + + +G HAV+++GW
Sbjct: 233 YTRDSYYLTYGSIQKDVMNYGPIEASFDVYDDFPSYKSGVYIRSDNASYLGGHAVKLIGW 292
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E+ +PYWL+ NSWN WGD G FKI RG NE ++
Sbjct: 293 GEESGVPYWLMVNSWNTDWGDKGLFKIQRGTNECGVD 329
>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
Length = 340
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 89/156 (57%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFD+R+KWP C S+ I DQS CGSCWA A+SDR CI S G ++SA ++
Sbjct: 89 IPSNFDSRKKWPGCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 148
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG AW FW G+VTG + GC+PY CEHH +G C
Sbjct: 149 SCCESCGLGCEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCG- 207
Query: 306 LGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
K+ KTP CKQ C Y++ Y D +GK ++ V
Sbjct: 208 -SKIYKTPRCKQTC-QKKYKTPYTQDKHRGKSSYNV 241
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 67/82 (81%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I ++GP+ A F+VY DFL YKSG+Y+H G+++G HA+R++GWGVEN PYWL+ANSW
Sbjct: 250 KEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIANSW 309
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+I+RG +E IE
Sbjct: 310 NEDWGENGYFRIVRGRDECFIE 331
>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
Length = 342
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
>gi|161343845|tpg|DAA06103.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 261
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 84/134 (62%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR+FDAR KW C ++ + DQ NCGSCWA++ ++A +DRLC+A+N F +SA+ I
Sbjct: 88 IPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C +GCNGG+P AW + G+VTGGDY S EGC+PY + PC + +G C
Sbjct: 148 FCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 206
Query: 306 LGKLKTPECKQNCY 319
+ C + CY
Sbjct: 207 KPRESNHRCTRMCY 220
>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 43/82 (52%), Positives = 56/82 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+ I HGP+ A +Y DFL YKSG+Y++ G I HAVR++G GVEN YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGCGVENGTAYWLAANTW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG NE IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECLIE 332
>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 345
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 89/156 (57%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFD+R+KWP C S+ I DQS CGSCWA A+SDR CI S G ++SA ++
Sbjct: 94 IPSNFDSRKKWPGCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 153
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG AW FW G+VTG + GC+PY CEHH +G C
Sbjct: 154 SCCESCGLGCEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCG- 212
Query: 306 LGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
K+ KTP CKQ C Y++ Y D +GK ++ V
Sbjct: 213 -SKIYKTPRCKQTC-QKKYKTPYTQDKHRGKSSYNV 246
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 67/82 (81%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I ++GP+ A F+VY DFL YKSG+Y+H G+++G HA+R++GWGVEN PYWL+ANSW
Sbjct: 255 KEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIANSW 314
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+I+RG +E IE
Sbjct: 315 NEDWGENGYFRIVRGRDECFIE 336
>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332
>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
Length = 342
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 57/82 (69%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+ I HGP+ A +Y DFL YKSG+Y++ G I HAVR++GWGVEN YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG NE IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECSIE 332
>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/154 (44%), Positives = 91/154 (59%), Gaps = 11/154 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +F A+EKWP CPS+ I DQ NCGSCWAVS A+ +SDRLCIAS QISA+ ++
Sbjct: 71 LPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQTDKRQISAEDLL 130
Query: 247 ACT-PNC-----WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH-HVQGP 299
+C NC GC+GG+P AW++ +G+VTGG YN C+PY+ PC H + G
Sbjct: 131 SCCGINCELDGNGGCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPPCSHGNDSGK 190
Query: 300 LQNCT---LLGKLKTPECKQNCYNPSYESTYRFD 330
C + TP C + C+ P + TY D
Sbjct: 191 YSKCENDFFMLTEVTPSCTKKCH-PQFSRTYDVD 223
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+IY +GP+ A+F+V+ DFL YKSGVYQ G G HAV+++GWG EN +PYW NSWN
Sbjct: 244 EIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGWGTENGVPYWEAINSWN 303
Query: 139 DHWGDHGTFKILRGENEADIE 159
D WG +G FKILRG N DIE
Sbjct: 304 DGWGINGKFKILRGFNHLDIE 324
>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
>gi|393902164|gb|EFO13452.2| hypothetical protein LOAG_15077, partial [Loa loa]
Length = 186
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/155 (44%), Positives = 97/155 (62%), Gaps = 6/155 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDAR +WP C S+ +A+Q CGSCWA+S A+ +SDRLCIA+N QISA+ ++
Sbjct: 11 LPKHFDARLRWPLCWSVHVVANQGGCGSCWAISAASVMSDRLCIATNYSNQKQISAEDLI 70
Query: 247 ACTPNCWGCNGG-WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC G W A+ +W ++GVVTGGDY S EGC+PYT AP + P
Sbjct: 71 SCCTECGGCQGSHWALSAFIYWRNHGVVTGGDYGSFEGCKPYTTAP---NCGSPCSFEYY 127
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
K+ +P C++ C P Y +Y DL +KA+ +
Sbjct: 128 RRKI-SPACQKTC-QPLYGLSYEEDLISSQKAYWI 160
>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
Length = 309
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 57 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 116
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 117 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 175
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL KTP+C Q C YN SYE
Sbjct: 176 -DKLYKTPQCNQTCQKGYNTSYE 197
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 201 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 260
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 261 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 299
>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
Length = 343
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 76/208 (36%), Positives = 107/208 (51%), Gaps = 9/208 (4%)
Query: 132 LVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS--SEDDDLETMGCQNAKGLPR 189
L ++ D+ +H +F R E + E R+ + E E + P
Sbjct: 34 LSGQAFVDYINEHQSF--YRAEYSPEAEAFVKARIMDSKYLVEPKKEEVLEDVYGNDPPA 91
Query: 190 NFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT 249
+FDAR WPEC S+ I DQS+CGSCWAVS A A+SD +C+ SN IS I++C
Sbjct: 92 SFDARTHWPECRSIGTIRDQSSCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCC 151
Query: 250 P-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
+C +GC GGWP A+++ +GVVTGG Y ++ C+PY PC HH P G
Sbjct: 152 GISCGYGCQGGWPIEAYKWMQRDGVVTGGKYRQKKVCKPYAFYPCGHHQNDPYYGPCPGG 211
Query: 308 KLKTPECKQNC---YNPSYESTYRFDLK 332
TP+C++ C YN SY+ F +
Sbjct: 212 LWPTPKCRKTCQRKYNKSYQEDKHFATR 239
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 67/99 (67%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ +A+ +P N ++IY++GP+VA F VY DF YK G+Y H +G G HAV+V+
Sbjct: 235 HFATRAYYLPNNERNIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVV 294
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG EN YWL+ANSWN WG+ G F+I+RG NE IE
Sbjct: 295 GWGRENATDYWLIANSWNTDWGESGYFRIVRGTNECGIE 333
>gi|157058731|gb|ABV03123.1| cathepsin B-16D1 [Acyrthosiphon pisum]
Length = 243
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 84/134 (62%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR+FDAR KW C ++ + DQ NCGSCWA++ ++A +DRLC+A+N F +SA+ I
Sbjct: 86 IPRHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C +GCNGG+P AW + G+VTGGDY S EGC+PY + PC + +G C
Sbjct: 146 FCCYSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 204
Query: 306 LGKLKTPECKQNCY 319
+ C + CY
Sbjct: 205 KPRESNHRCTRMCY 218
>gi|312266|emb|CAA51531.1| cathepsin B-like enzyme [Gallus gallus]
Length = 156
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 63/147 (42%), Positives = 89/147 (60%), Gaps = 4/147 (2%)
Query: 196 KWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NC-W 253
+WP CP++ I DQ +CGSCWA ISDR+C+ +N + ++SA+ +++C C
Sbjct: 2 QWPNCPTISEIRDQGSCGSCWAFGSVEVISDRICVHTNAKVSVEVSAEDLLSCCGFECGM 61
Query: 254 GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPE 313
GCNGG+P AWR+W G+V+GG Y+S GC YT+ PCEHHV G CT G +TP
Sbjct: 62 GCNGGYPSGAWRYWTERGLVSGGLYDSHVGCAGYTIPPCEHHVNGSRPPCTGEGG-ETPR 120
Query: 314 CKQNCYNPSYESTYRFDLKKGKKAHMV 340
C ++C P Y +Y+ D G + V
Sbjct: 121 CSRHC-EPGYSPSYKEDKHYGSHIYGV 146
>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 99/160 (61%), Gaps = 14/160 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +D REK+ +C + +I DQ+NCGSCWAVS A AISDR+CIA+NG IS+ I+
Sbjct: 86 IPEEYDPREKF-KCSTF-YIRDQANCGSCWAVSTAAAISDRICIATNGEKQVNISSTDIL 143
Query: 247 A-CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C P C +GC GGW AW ++ + GVV+GG+Y ++ C+PY + PC HH N T
Sbjct: 144 TCCNPQCGFGCGGGWSIRAWEYFVYEGVVSGGEYLTKGVCRPYPIHPCGHH-----GNDT 198
Query: 305 LLG----KLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G + TP CK+ C P Y+ +R D ++GK A+ V
Sbjct: 199 YYGECPREAATPPCKKKC-QPGYKKIFRMDKRQGKVAYGV 237
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 68/101 (67%), Gaps = 6/101 (5%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI--PYWLVAN 135
R+I HGP+VA F+VY DF YK+GVY+H G G HAV+++GWGV++ YWL+AN
Sbjct: 246 REILRHGPVVASFAVYEDFSLYKTGVYKHTAGALRGYHAVKMMGWGVDSKTKAKYWLIAN 305
Query: 136 SWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDL 176
SW++ WG++G F+ +RG N+ +IE + V A + D L
Sbjct: 306 SWHNDWGENGYFRFIRGINDCEIE----DTVAAGIVDVDSL 342
>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 84/134 (62%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR+FDAR KW C ++ + DQ NCGSCWA++ ++A +DRLC+A+N F +SA+ I
Sbjct: 88 IPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C +GCNGG+P AW + G+VTGGDY S EGC+PY + PC + +G C
Sbjct: 148 FCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 206
Query: 306 LGKLKTPECKQNCY 319
+ C + CY
Sbjct: 207 KPRESNHRCTRMCY 220
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGW 122
Y + ++ + + + + +GP+ A F VY DF YKSGVY + +G HAV+++GW
Sbjct: 233 YTRDSYYLTYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGW 292
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E +PYWL+ NSWN WGD+G FKI RG NE I+
Sbjct: 293 GEEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGID 329
>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
Length = 340
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 84/134 (62%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR+FDAR KW C ++ + DQ NCGSCWA++ ++A +DRLC+A+N F +SA+ I
Sbjct: 88 IPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C +GCNGG+P AW + G+VTGGDY S EGC+PY + PC + +G C
Sbjct: 148 FCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 206
Query: 306 LGKLKTPECKQNCY 319
+ C + CY
Sbjct: 207 KPRESNHRCTRMCY 220
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGW 122
Y + ++ + + + + +GP+ A F VY DF YKSGVY + +G HAV+++GW
Sbjct: 233 YTRDSYYLTYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGW 292
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E +PYWL+ NSWN WGD+G FKI RG NE I+
Sbjct: 293 GEEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGID 329
>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/166 (43%), Positives = 99/166 (59%), Gaps = 14/166 (8%)
Query: 182 QNAKGLPRNFDAREKWPE-CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ- 239
+ A LP FD+R +W + C SL + DQSNCGSCWA A ++SDR CI GQ
Sbjct: 88 RQANNLPSEFDSRVQWGDKCSSLWEVRDQSNCGSCWAFGAAESLSDRHCI-----HLGQD 142
Query: 240 --ISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
+S Q++V C C +GC+GGWP+ A ++ +NG+VTG Y + CQ Y+LAPC HHV
Sbjct: 143 IRLSTQNLVTCCDECGFGCDGGWPEAAMDYYVNNGLVTGDLYGNNSWCQAYSLAPCAHHV 202
Query: 297 QGPLQ-NCTLLGKLKTPECKQNC-YNPSYESTYRFDLKKGKKAHMV 340
+ CT G+L TP C ++C N +Y Y DL KG KA+ +
Sbjct: 203 TSDVYPPCT--GELPTPPCVKSCDSNSTYTIPYPKDLHKGSKAYSI 246
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 51/83 (61%), Positives = 63/83 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I +GP+ F+VY DFL YKSGVYQH G +G HAV+++GWGVEN PYW++ NS
Sbjct: 254 MTEIQTNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGGHAVKMVGWGVENGTPYWIIVNS 313
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN+ WGD GTFKILRG+NE IE
Sbjct: 314 WNESWGDKGTFKILRGQNECGIE 336
>gi|157058771|gb|ABV03143.1| cathepsin B-16D [Aulacorthum solani]
Length = 201
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 56/113 (49%), Positives = 81/113 (71%), Gaps = 1/113 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR+FDAR KW C ++ + DQ NCGSCWA++ ++A +DRLC+A+NG F +SA+ I
Sbjct: 72 IPRHFDARRKWRHCQTIGKVRDQGNCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 131
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG 298
C C +GC+GG+P AW+ + +G+VTGG+YNS EGC+PY + PC + QG
Sbjct: 132 FCCHTCGFGCHGGYPIKAWKRFNKHGLVTGGNYNSGEGCEPYRVPPCPYDDQG 184
>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 341
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 78/213 (36%), Positives = 110/213 (51%), Gaps = 10/213 (4%)
Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS--SEDDDLETMGC 181
VE D+ L ++ D+ +H +F R E + E R+ + +E E +
Sbjct: 26 VEKDVEK-LTGQAFVDYINEHQSF--YRAEYSPEAEAFVKARIMDSKFLAEQKKEEVLAD 82
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
P +FDAR +WPEC S+ I DQS CGSCWAVS A A+SD +C+ SN IS
Sbjct: 83 VYGDDPPDSFDARTQWPECRSIGTIRDQSACGSCWAVSSAEAMSDEICVQSNSTIKVMIS 142
Query: 242 AQHIVACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
I++C +C +GC GGWP A+R+ +GVVTGG Y ++ C+PY+ PC H P
Sbjct: 143 DTDILSCCGLDCGYGCQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPYSFYPCGQHKDVP 202
Query: 300 LQNCTLLGKLKTPECK---QNCYNPSYESTYRF 329
G TP+C+ Q YN +Y+ F
Sbjct: 203 YYGPCPGGLWPTPKCRKSSQRKYNKTYQEDKHF 235
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 71/117 (60%), Gaps = 3/117 (2%)
Query: 45 KKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN-AMRQ-IYEHGPLVAIFSVYADFLQYKSG 102
K +K +R Y T H+ +++ +P ++RQ IY++GP+VA F VY D+ G
Sbjct: 216 KCRKSSQRKYNKTYQEDKHFATRSYSLPNNERSIRQEIYKNGPVVAAFKVYEDYSS-TGG 274
Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+Y H +G G HA +V+GWG EN YWL+ANSWN WG+ G ++I+R + +IE
Sbjct: 275 IYVHKWGIQTGAHADKVIGWGRENGTDYWLIANSWNTDWGEDGYYRIVRETDNCEIE 331
>gi|239793652|dbj|BAH72931.1| ACYPI000018 [Acyrthosiphon pisum]
Length = 239
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 55/116 (47%), Positives = 79/116 (68%), Gaps = 1/116 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR+FDAR KW C ++ + DQ NCGSCWA++ ++A +DRLC+A+N F +SA+ I
Sbjct: 88 IPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNTDFNELLSAEEIT 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
C +C +GCNGG+P AW + G+VTGGDY S EGC+PY + PC + +G +
Sbjct: 148 FCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHIH 203
>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 64/143 (44%), Positives = 89/143 (62%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ +W +W G+VTGG + C+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTSCRPYPFPKCDHFVKGKYRACG- 208
Query: 306 LGKL-KTPECKQNC---YNPSYE 324
KL +TP+CKQ C YN SYE
Sbjct: 209 -DKLYETPQCKQTCQKGYNTSYE 230
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332
>gi|157058753|gb|ABV03134.1| cathepsin B-84 [Acyrthosiphon pisum]
Length = 230
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 57/134 (42%), Positives = 85/134 (63%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R +W C ++ H+ +Q NCGSCWA A +DRLC+A+NG F ISA+ +
Sbjct: 44 VPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELT 103
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG+P AW+++ +GVVTGGDY++ +GCQPY + PC +G +C+
Sbjct: 104 FCCHTCGFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEG-HNSCSG 162
Query: 306 LGKLKTPECKQNCY 319
+ +C + CY
Sbjct: 163 QPTERNHKCSKKCY 176
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 34/65 (52%), Gaps = 3/65 (4%)
Query: 44 KKKKKKKKRLYLPTSIPL--SHY-FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYK 100
++ K K+ Y +I +HY K A+ + + +GP+ A F VY DF+ Y+
Sbjct: 166 ERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVYGPIEASFDVYDDFMNYE 225
Query: 101 SGVYQ 105
SGVYQ
Sbjct: 226 SGVYQ 230
>gi|161343837|tpg|DAA06099.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 255
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 84/134 (62%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P++FDAR KW C ++ + DQ NCGSCWA++ ++A +DRLC+A+N F +SA+ I
Sbjct: 88 IPKHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C +GCNGG+P AW + G+VTGGDY S EGC+PY + PC + +G C
Sbjct: 148 FCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 206
Query: 306 LGKLKTPECKQNCY 319
+ C + CY
Sbjct: 207 KPRESNHRCTRMCY 220
>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 64/134 (47%), Positives = 86/134 (64%), Gaps = 3/134 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P + +R KWP+C SL+ I DQ+NCGSCWAVS A+A+SDR+CIASNG +SA I+
Sbjct: 2 IPESPYSRTKWPKCSSLKPIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61
Query: 247 ACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C N C +GCNGGWP A+ ++ G VTGGDY + GC+PY PC HH +
Sbjct: 62 SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYG-E 120
Query: 305 LLGKLKTPECKQNC 318
+ TP+C + C
Sbjct: 121 CPNEATTPKCVRKC 134
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 41/82 (50%), Positives = 60/82 (73%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I ++GP+V F+VY DF YK G+Y+H G + G HA++++GWG E +PYWL+ANSW
Sbjct: 164 REIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKEGGVPYWLIANSW 223
Query: 138 NDHWGDHGTFKILRGENEADIE 159
++ WG++G F+IL G N IE
Sbjct: 224 HNDWGENGYFRILCGSNHCGIE 245
>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
Length = 332
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 91/151 (60%), Gaps = 14/151 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIA-----SNGYFTGQIS 241
+P FDAR++WP CPS+ I DQ +CGSCWA+ + RLC+ SNG +S
Sbjct: 81 IPDEFDARKQWPNCPSITDIRDQGSCGSCWALELL-----RLCLIVFVSHSNGKLQVHLS 135
Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
A+++V C +C GC GG P AW +W G+V+GG+Y S+EGCQPY++APCEHH+ G
Sbjct: 136 AENLVTCCGSCGAGCFGGDPGSAWEYWRDVGIVSGGNYGSKEGCQPYSIAPCEHHIPGSR 195
Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDL 331
C G+ T +C++ C Y Y DL
Sbjct: 196 PPCR--GEGHTADCRKQC-EKGYSIPYDKDL 223
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 47/82 (57%), Positives = 61/82 (74%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I ++GP+ A F VY D L YK GVY+H G +G HA+++LGWGVEN PYWL+ANSWN
Sbjct: 242 EILKNGPVEAAFFVYEDLLTYKEGVYKHVAGAPVGGHAIKILGWGVENGTPYWLIANSWN 301
Query: 139 DHWGDHGTFKILRGENEADIEM 160
WG++G FKILRG +E IE+
Sbjct: 302 TDWGNNGFFKILRGSDECGIEI 323
>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
Length = 373
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 85/154 (55%), Gaps = 12/154 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDARE WP+C S++ I +Q+ CGSCWA A ISDR+CI SNG IS + I+
Sbjct: 93 IPDTFDARENWPDCKSIKLIRNQATCGSCWAFGAAEVISDRICIQSNGTQQPIISVEDIL 152
Query: 247 AC--TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C T GC GG+ A RFW NG VTGGDYN GC PY+ APC+ + P T
Sbjct: 153 SCCGTTCGKGCQGGYSIEAMRFWKSNGAVTGGDYNGN-GCMPYSFAPCQ---KSPCVEST 208
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
TP CK C + + Y D G A+
Sbjct: 209 ------TPTCKTTCQSSYTTANYTTDKHYGTSAY 236
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 72/130 (55%), Gaps = 10/130 (7%)
Query: 63 HYFKKAHMVPRCNAM-----RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAV 117
HY A+ + N + +IY +GP+ A + VY DF QYKSGVY + G +G HAV
Sbjct: 230 HYGTSAYRLATTNNVVSTIQYEIYHNGPVEASYKVYEDFYQYKSGVYHYVSGKLVGGHAV 289
Query: 118 RVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV-----EANSSE 172
+++GWG END+ YWLVANSW +G+ G FKI RG NE IE V A +
Sbjct: 290 KIIGWGTENDVDYWLVANSWGIKFGEGGFFKIRRGTNECQIESNVVAGVAKLGTHAEKGD 349
Query: 173 DDDLETMGCQ 182
DDD C
Sbjct: 350 DDDGSATSCS 359
>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
Length = 342
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/158 (45%), Positives = 86/158 (54%), Gaps = 12/158 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAREKWP C S+R I DQSNCGSCWAVS A+ +SDRLCI SNG S I+
Sbjct: 89 LPETFDAREKWPNCTSIRTIRDQSNCGSCWAVSAASVMSDRLCIQSNGTIQSWASDTDIL 148
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C NC GC+GG P A+ F NGV TGG + C+PY PC H QN
Sbjct: 149 SCCWNCGMGCDGGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRH-----QNQKY 203
Query: 306 LGKL-----KTPECKQNCYNPSYESTYRFDLKKGKKAH 338
G TP+C++ C Y Y+ D G A+
Sbjct: 204 FGPCPKELWPTPKCRKMC-QLKYNVAYKDDKIYGNDAY 240
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 64/98 (65%), Gaps = 2/98 (2%)
Query: 64 YFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y A+ +P M++I+ +GP+V FSV+ADF YK GVY N G HAV+++G
Sbjct: 235 YGNDAYSLPNNETRIMQEIFTNGPVVGSFSVFADFAIYKKGVYVSNGIQQNGAHAVKIIG 294
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WGV++ + YWL+ANSWN+ WGD G + LRG+N IE
Sbjct: 295 WGVQDGLKYWLIANSWNNDWGDEGYVRFLRGDNHCGIE 332
>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 952
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 62/133 (46%), Positives = 76/133 (57%), Gaps = 2/133 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR WP CPS+ I DQS+CGSCWA A+SDRLCI S G F +SA +V
Sbjct: 639 LPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLV 698
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG+ +AW FW +G+VTGG GC+ Y CEH +G C
Sbjct: 699 SCCTECGCGCRGGYSPIAWDFWKTHGIVTGGSKEKPTGCRSYPFPSCEHRGKGQYPPCP- 757
Query: 306 LGKLKTPECKQNC 318
TPEC + C
Sbjct: 758 HQLYPTPECIKRC 770
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/159 (39%), Positives = 86/159 (54%), Gaps = 4/159 (2%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ K LP++FDAR KWP CPS+ I DQS+C S WA ++SDRLCI SNG F +SA
Sbjct: 47 DEKELPKSFDARTKWPHCPSISEIRDQSSCESFWAFGAVESMSDRLCIHSNGAFNKSLSA 106
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
+++C +C GC G+ +AW FW +G+VTGG GC+ + C H +G
Sbjct: 107 TDLLSCCEDCGLGCGAGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGHRRKGRYP 166
Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C TPEC + C P E Y D + ++ V
Sbjct: 167 PCP-RHIYPTPECIKQCDEP--EVNYEKDKTRANISYNV 202
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 61/85 (71%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M++I +GP+ A F +YADFL+Y GVY H +G I HA+R+LGWG ++ +PYWL+A
Sbjct: 208 SIMKEIMLNGPVEASFGIYADFLEYNGGVYFHCWGGPISRHAIRILGWGEDDGVPYWLIA 267
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSWN+ WG+ G + LRG NE IE
Sbjct: 268 NSWNEDWGEKGYVRFLRGHNECGIE 292
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 44/83 (53%), Positives = 57/83 (68%)
Query: 76 AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
M++I GP+ AI VY D L YKSGVY H +G +G H +R+LGWG E+ +PYWLVAN
Sbjct: 854 VMKEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWGEEDGVPYWLVAN 913
Query: 136 SWNDHWGDHGTFKILRGENEADI 158
SWN+ WG+ G ++LR NE I
Sbjct: 914 SWNEDWGEKGYMRVLRWRNECGI 936
>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
Length = 319
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 86/149 (57%), Gaps = 6/149 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FD+ EKWPECPS+ + DQS+C SCWA V +DR+CI S G ++SA+ ++
Sbjct: 69 LPKEFDSSEKWPECPSILEVRDQSSCASCWAFGVVEVATDRICIESKGKNQVRLSAEDVL 128
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C + C GG+ +AW + GVVTGG YNS E C+ Y PC H ++G C+
Sbjct: 129 ECCKDCGFQCQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSYPFPPCSHGIEGQYPQCST 188
Query: 306 LGKLKTPECKQNC---YNPSYE-STYRFD 330
+ P+C+ C Y YE Y+F
Sbjct: 189 KPPV-VPKCETTCQEGYPIEYEKDRYKFS 216
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 41/81 (50%), Positives = 53/81 (65%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I E+GP+ A F VY DF+ YKSG+Y H G + LH V+++GWG EN YW NSWN
Sbjct: 231 EIMENGPVDASFQVYEDFMTYKSGIYHHVEGKFMNLHTVKIIGWGEENGEAYWKAVNSWN 290
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG++G F+I G NE IE
Sbjct: 291 SEWGENGLFRIRLGTNECTIE 311
>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
Length = 335
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 57/134 (42%), Positives = 85/134 (63%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R +W C ++ H+ +Q NCGSCWA A +DRLC+A+NG F ISA+ +
Sbjct: 84 VPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELT 143
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG+P AW+++ +GVVTGGDY++ +GCQPY + PC +G +C+
Sbjct: 144 FCCHRCVFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGH-NSCSG 202
Query: 306 LGKLKTPECKQNCY 319
+ +C + CY
Sbjct: 203 QPTERNHKCSKKCY 216
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 74/134 (55%), Gaps = 4/134 (2%)
Query: 30 KKKEEEKKKKKKKKKKKKKKKKRLYLPTSIPL--SHY-FKKAHMVPRCNAMRQIYEHGPL 86
K E + ++ K K+ Y +I +HY K A+ + + +GP+
Sbjct: 192 KDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVYGPI 251
Query: 87 VAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHG 145
A F VY DF+ Y+SGVYQ S +G HAV+++GWGVE PYWL+ NSW + WGD G
Sbjct: 252 EASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPYWLMVNSWGEQWGDKG 311
Query: 146 TFKILRGENEADIE 159
FKILRG +E IE
Sbjct: 312 MFKILRGTDECGIE 325
>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 335
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 57/134 (42%), Positives = 85/134 (63%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R +W C ++ H+ +Q NCGSCWA A +DRLC+A+NG F ISA+ +
Sbjct: 84 VPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELT 143
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG+P AW+++ +GVVTGGDY++ +GCQPY + PC +G +C+
Sbjct: 144 FCCHRCGFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGH-NSCSG 202
Query: 306 LGKLKTPECKQNCY 319
+ +C + CY
Sbjct: 203 QPTERNHKCSKKCY 216
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 74/134 (55%), Gaps = 4/134 (2%)
Query: 30 KKKEEEKKKKKKKKKKKKKKKKRLYLPTSIPL--SHY-FKKAHMVPRCNAMRQIYEHGPL 86
K E + ++ K K+ Y +I +HY K A+ + + +GP+
Sbjct: 192 KDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVYGPI 251
Query: 87 VAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHG 145
A F VY DF+ Y+SGVYQ S +G HAV+++GWGVE PYWL+ NSW + WGD G
Sbjct: 252 EASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPYWLMVNSWGEQWGDKG 311
Query: 146 TFKILRGENEADIE 159
FKILRG +E IE
Sbjct: 312 MFKILRGTDECGIE 325
>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 551
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 61/150 (40%), Positives = 94/150 (62%), Gaps = 18/150 (12%)
Query: 188 PRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
P FD+R+ WP+C + I DQ+NCGSCWAVS A+ +SDR CIA++G FT +S ++
Sbjct: 290 PVEFDSRKHWPQCEKVISFIKDQANCGSCWAVSSASVMSDRTCIATDGQFTTLLSDAELL 349
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C +GCNGG+PQ +++W ++G+ TGG Y S + C+PY + PC NC+
Sbjct: 350 SCCTSCGYGCNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPYPIPPC--------SNCS- 400
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
+ +TP+C ++C STY L + +
Sbjct: 401 --ETRTPKCSKSCI-----STYPLSLNEDR 423
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/85 (55%), Positives = 60/85 (70%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M+ I +GP+VA SVY DFL YK GVY G +G HAVR++GWG +++IPYWLVA
Sbjct: 438 SMMKDISLYGPIVAGMSVYEDFLHYKEGVYTQESGIFLGGHAVRIIGWGEQDNIPYWLVA 497
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSWN +G+ G FKI RG +E IE
Sbjct: 498 NSWNTTFGEDGLFKIRRGFDECGIE 522
>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 340
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 89/156 (57%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGSCWA A+SDR CI S G ++SA ++
Sbjct: 89 IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 148
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG AW +W G+VTG + GC+PY CEHH +G C
Sbjct: 149 SCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCG- 207
Query: 306 LGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
K+ KTP CKQ C Y++ Y D +GK ++ V
Sbjct: 208 -SKIYKTPRCKQTC-QKKYKTPYTQDKHRGKSSYNV 241
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 67/82 (81%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I ++GP+ A F+VY DFL YKSG+Y+H G+++G HA+R++GWGVEN PYWL+ANSW
Sbjct: 250 KEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKTPYWLIANSW 309
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+I+RG +E IE
Sbjct: 310 NEDWGENGYFRIVRGRDECSIE 331
>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
Length = 340
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 92/149 (61%), Gaps = 9/149 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR FDAR+KW C ++ + DQ +CGSCWA ++A +DRLC+A++G F +SA+ I
Sbjct: 88 IPRTFDARKKWRHCRTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEIT 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW+++ +G+VTGG+Y S EGC+PY + PC +G N T
Sbjct: 148 FCCHTCGFGCHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCPRDDKG---NNTC 204
Query: 306 LGKL--KTPECKQNCY---NPSYESTYRF 329
GK K C + CY + Y +RF
Sbjct: 205 AGKPIEKNHRCTRMCYGDQDLDYNDDHRF 233
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/83 (53%), Positives = 55/83 (66%), Gaps = 1/83 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
+ + +GP+ A F VY DF YKSGVY+ S +G HAV+++GWGVE PYWL+ NS
Sbjct: 247 KDVMTYGPIEASFDVYDDFPSYKSGVYEKTENASYLGGHAVKLIGWGVEEGTPYWLMVNS 306
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD G FKI RG NE I+
Sbjct: 307 WNAQWGDKGLFKIRRGTNECGID 329
>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
Length = 350
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 95/171 (55%), Gaps = 19/171 (11%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P FDARE W CP+L+ I DQ CGSCWAV+ +A++DR+CI S G S + +++
Sbjct: 85 PNQFDAREHWKNCPTLKDIRDQGGCGSCWAVAAVSAMTDRMCILSKGKEHFYFSIKDVLS 144
Query: 248 CTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC--- 303
C C GC GG AW ++ G+V+GG Y S++GCQPYT+ PC H V G ++ C
Sbjct: 145 CCGYCGNGCEGGVLTRAWIYYKKIGIVSGGGYKSKQGCQPYTIPPCNHLVWGEIEQCKNI 204
Query: 304 TLLGKLK--------------TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ K K TPEC++ C N +Y+ Y D +GK + V
Sbjct: 205 PMTPKCKNIPVIPEQCKYIPITPECEKKC-NKNYKVCYSKDKHRGKSVYRV 254
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 43/89 (48%), Positives = 60/89 (67%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGW 122
H K + V + ++IYE+GP+ + F+VY DFL YK G+Y + G +GLH+V+++GW
Sbjct: 246 HRGKSVYRVKKSEIFKEIYEYGPVTSYFTVYEDFLNYKEGIYNYTSGQKLGLHSVKIIGW 305
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILR 151
G E I YWL ANS+N WGD G FKI+R
Sbjct: 306 GEERGIKYWLAANSFNTDWGDKGFFKIIR 334
>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
Length = 324
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/180 (39%), Positives = 97/180 (53%), Gaps = 13/180 (7%)
Query: 158 IEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWA 217
+++ F +E +S +L G + LP FDAREKWP+C +++ I +Q+ CGSCWA
Sbjct: 1 MDVKFAEPLEKDSDVASELFVRGEIVPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWA 60
Query: 218 VSVANAISDRLCIASNGYFTGQISAQHIVAC--TPNCWGCNGGWPQLAWRFWGHNGVVTG 275
A ISDR+CI SNG IS + I++C T +GC GG+ A RFW +G VTG
Sbjct: 61 FGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTG 120
Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
GDY GC PY+ APC +NC + TP CK C + Y+ D G+
Sbjct: 121 GDYGGH-GCMPYSFAPC-------TKNCP---ESTTPSCKTTCQSSYKTEEYKKDKHYGE 169
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 44/81 (54%), Positives = 57/81 (70%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+IY +GP+ A + VY DF YKSGVY + G +G HAV+++GWGVEN + YWL+ANSW
Sbjct: 202 EIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGVENGVDYWLIANSWG 261
Query: 139 DHWGDHGTFKILRGENEADIE 159
+G+ G FKI RG NE IE
Sbjct: 262 TSFGEKGFFKIRRGTNECQIE 282
>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 73/177 (41%), Positives = 92/177 (51%), Gaps = 11/177 (6%)
Query: 169 NSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRL 228
N +DD +T LP N+D R W C S I DQ+NCGSCWAVS A AISDR+
Sbjct: 76 NPVVNDDNDT-----GADLPENYDPRIVWKNCSSFHTIRDQANCGSCWAVSTAAAISDRI 130
Query: 229 CIASNGYFTGQISAQHIVACT-PNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQP 286
CIA+ G S I+ C C GC GGWP AW+F+ ++GVV+GG Y + C P
Sbjct: 131 CIATKGKKQVYASDTDILTCCGARCGLGCRGGWPIEAWKFFEYDGVVSGGPYLGKGCCSP 190
Query: 287 YTLAPCEHHVQGPLQ-NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVLM 342
Y L PC H NC +G TP CK+ C P + YR D + G+ +
Sbjct: 191 YPLHPCGRHGNDTFYGNC--VGMAPTPPCKRKC-QPGFRGMYRVDKRYGEPGRTYTL 244
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 65/96 (67%), Gaps = 3/96 (3%)
Query: 67 KAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWG 123
+ + +PR R I E G +VA+F+VY DF Y+SG+Y+H G + G HAV+++GWG
Sbjct: 240 RTYTLPRSEVKIRRDIKERGSVVAVFAVYEDFSHYQSGIYKHTAGRFTGGYHAVKMIGWG 299
Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+N YWL+ANSW+D WG++G F+++RG N IE
Sbjct: 300 KDNGTDYWLIANSWHDDWGENGFFRMIRGINNCGIE 335
>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
Length = 334
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/136 (47%), Positives = 85/136 (62%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFDAR+KW +C ++ + DQ NCG+CWA ++A +DRLCIA+NG F +SA+ +
Sbjct: 85 IPSNFDARKKWRKCSTVGKVRDQGNCGTCWAFGTSSAFADRLCIATNGEFNELLSAEELA 144
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C GC+GG+P AW + +G+VTGGDYNS EGCQPY + PC G N T
Sbjct: 145 FCCHKCGSGCHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPPCPFDEYG---NNTC 201
Query: 306 LGKL--KTPECKQNCY 319
GK K C + CY
Sbjct: 202 RGKPAEKNHRCTRMCY 217
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 59/97 (60%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
Y + A+ + + +GP+ A + VY DF YKSGVY S +G HAV+++GW
Sbjct: 230 YTRDAYYLNYQIIQNDLMTYGPIEASYDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGW 289
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E +PYWL+ NSWND WGD G FKI RG NE I+
Sbjct: 290 GEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 326
>gi|161343825|tpg|DAA06093.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 199
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 56/125 (44%), Positives = 85/125 (68%), Gaps = 5/125 (4%)
Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
SED++ + + + +P+ FDAR+KW C ++ + DQ NCGSCWA+S ++A +DRLC+
Sbjct: 76 SEDENYDNLFGR----IPKKFDARKKWRHCTTIGKVRDQGNCGSCWALSTSSAFADRLCV 131
Query: 231 ASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289
A+NG F +SA+ + C C +GCNGG+P AW + +G+VTGG+Y S EGC+PY +
Sbjct: 132 ATNGDFNQLLSAEELTFCCHKCGYGCNGGYPIKAWERFKKHGLVTGGEYKSGEGCEPYRV 191
Query: 290 APCEH 294
PC +
Sbjct: 192 PPCPY 196
>gi|161343859|tpg|DAA06110.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 260
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 58/133 (43%), Positives = 83/133 (62%), Gaps = 2/133 (1%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
PR FDAR+KW C ++ + DQ +CGSCWA ++A +DRLC+A++G F +SA+ I
Sbjct: 89 PRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEITF 148
Query: 248 CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
C C +GCNGG P AW+++ +G+VTGG+Y S EGC+PY + PC +G C
Sbjct: 149 CCHTCGFGCNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGK-NTCAGK 207
Query: 307 GKLKTPECKQNCY 319
+ K C + CY
Sbjct: 208 PREKNHRCTRMCY 220
>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With Ca074 Inhibitor
gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11017 Inhibitor
gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
Length = 254
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 87/155 (56%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGSCWA A+SDR CI S G ++SA ++
Sbjct: 3 IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 62
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG AW +W G+VTG + GC+PY CEHH +G C
Sbjct: 63 SCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCGS 122
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
KTP CKQ C Y++ Y D +GK ++ V
Sbjct: 123 K-IYKTPRCKQTC-QKKYKTPYTQDKHRGKSSYNV 155
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 67/82 (81%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I ++GP+ A F+VY DFL YKSG+Y+H G+++G HA+R++GWGVEN PYWL+ANSW
Sbjct: 164 KEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKAPYWLIANSW 223
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+I+RG +E IE
Sbjct: 224 NEDWGENGYFRIVRGRDECSIE 245
>gi|219565128|dbj|BAH04068.1| cathepsin B [Equus caballus]
Length = 162
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 55/107 (51%), Positives = 77/107 (71%), Gaps = 2/107 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDARE+WP CP+++ I DQ +CGSCWA AISDR+CI +NG+ + ++SA+ ++
Sbjct: 56 LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDML 115
Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
C + C GCNGG+P AW FW G+V+GG Y+S GC+PY++ P
Sbjct: 116 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPP 162
>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
pisum]
gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
Length = 339
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 91/147 (61%), Gaps = 5/147 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR FDAR++W C ++ + DQ +CGSCWA ++A +DRLC+A++G F +SA+ +
Sbjct: 87 IPRTFDARKRWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEELT 146
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C GCNGG+P AW+++ +G+VTGG+Y S +GC+PY + PC + G +C
Sbjct: 147 FCCHACGHGCNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPRNEDGK-SSCAG 205
Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
K K C + CY + Y+ +RF
Sbjct: 206 KPKEKNHRCTRMCYGNQDLDYDDDHRF 232
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 56/83 (67%), Gaps = 1/83 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
+ + +GP+ A F VY DF YKSGVYQ + +G HAV+++GWGVE PYWL+ NS
Sbjct: 246 KDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNS 305
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKI RG +E I+
Sbjct: 306 WNAQWGDNGLFKIRRGTDECRID 328
>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
Length = 360
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 64/162 (39%), Positives = 94/162 (58%), Gaps = 10/162 (6%)
Query: 158 IEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWA 217
+E F + E +++D++ ++ +P +FDAR+KWP+C S+ I DQS+CGSCWA
Sbjct: 66 MESRFLDNEEGEMLKEEDMDF-----SEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWA 120
Query: 218 VSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGG 276
VS A +SDRLC+ SNG +S I+AC PNC GC GG AW ++ + GV TGG
Sbjct: 121 VSSAETMSDRLCVQSNGTIKVLLSDTDILACCPNCGAGCGGGHTIRAWEYFKNTGVCTGG 180
Query: 277 DYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
Y +++ C+PY PC+ G TP+C++ C
Sbjct: 181 LYGTKDSCKPYAFYPCKDESYGKCPK----DSFPTPKCRKIC 218
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 64/104 (61%), Gaps = 7/104 (6%)
Query: 63 HYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
+Y A+ +P+ +I +GP+ A F +Y DF Y+ GVY + G +G HA++++
Sbjct: 231 YYANSAYRIPQNETWIKLEIMRNGPVTASFRIYPDFGFYEKGVYVTSGGRELGGHAIKII 290
Query: 121 GWGVE----NDIPYWLVANSWNDHWGD-HGTFKILRGENEADIE 159
GWG E D+PYWL+ANSW WG+ +G F+ILRG+N IE
Sbjct: 291 GWGTEKVNGTDLPYWLIANSWGTDWGENNGYFRILRGQNHCQIE 334
>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 319
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 86/155 (55%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFD+R+KWP C S+ I DQS CGS WA A+SDR CI S G ++SA ++
Sbjct: 67 IPSNFDSRKKWPGCKSIATIRDQSRCGSSWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 126
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C G GG+P LAW +W G+VTG + CQPY CEHH +G C
Sbjct: 127 SCCEHCGDGFEGGFPALAWDYWVKEGIVTGSSKENHTSCQPYPFPKCEHHTKGKYPAC-F 185
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
KTP C+ C SY++ Y D +GK + V
Sbjct: 186 EEIYKTPNCENTC-QKSYKTPYAQDKHRGKSRYNV 219
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 63/82 (76%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I ++GP+ A F VY DFL YKSG+Y+H G + HA+R++GWGVEN+ PYWL+ NSW
Sbjct: 228 KEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLVSWHAIRIIGWGVENNTPYWLIPNSW 287
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+ILRG +E IE
Sbjct: 288 NEDWGENGNFRILRGRHECSIE 309
>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
Length = 340
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 92/152 (60%), Gaps = 10/152 (6%)
Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
SED++ + + + +PR FDAR+KW C ++ I DQ NCGSCWA++ ++A +DRLC+
Sbjct: 76 SEDENYDNLFGR----IPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAFADRLCV 131
Query: 231 ASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289
SN F +SA+ + C C +GCNGG+P AW + +G+VTGGDY S EGC+PY +
Sbjct: 132 VSNEDFNQLLSAEELTFCCHKCGFGCNGGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRV 191
Query: 290 APCEHHVQGPLQNCTLLGKLKTP--ECKQNCY 319
PC + G N T GK C + CY
Sbjct: 192 PPCPYDESG---NNTCAGKPMEANHRCTRMCY 220
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
Y + ++ + + + + +GP+ A F VY DF YKSGVY + S +G HA +++GW
Sbjct: 233 YTRDSYYLTYGSIQKDVLTYGPVEASFDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGW 292
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E +PYWL+ NSWN WGD+G FKI RG NE I+
Sbjct: 293 GEEYGVPYWLMVNSWNADWGDNGLFKIQRGTNECGID 329
>gi|157058737|gb|ABV03126.1| cathepsin B-16 [Myzus persicae]
Length = 238
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 58/133 (43%), Positives = 83/133 (62%), Gaps = 2/133 (1%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
PR FDAR+KW C ++ + DQ +CGSCWA ++A +DRLC+A++G F +SA+ I
Sbjct: 67 PRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEITF 126
Query: 248 CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
C C +GCNGG P AW+++ +G+VTGG+Y S EGC+PY + PC +G C
Sbjct: 127 CCHTCGFGCNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGK-NTCAGK 185
Query: 307 GKLKTPECKQNCY 319
+ K C + CY
Sbjct: 186 PREKNHRCTRMCY 198
>gi|157058757|gb|ABV03136.1| cathepsin B-84 [Pterocomma populeum]
Length = 218
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 93/156 (59%), Gaps = 3/156 (1%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
G+P+ FDAR +W C ++ + DQ NCGSCWA + A +DRLCIA+ G F ISA+ +
Sbjct: 43 GIPKAFDARLEWKYCKTIGQVRDQGNCGSCWAHGTSGAFADRLCIATKGDFNELISAEEL 102
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C C GCNGG P AW+++ +GVVTGG+YN+ GCQPY + PC + +G +C+
Sbjct: 103 TFCCHLCGIGCNGGNPLRAWQYFKRHGVVTGGNYNTTNGCQPYRVPPCTNGDKGHY-SCS 161
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
K + +C + CY Y+ D K K A+ +
Sbjct: 162 GQQKERNHKCLKTCYGDK-TVDYKRDHYKTKDAYYL 196
>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
Length = 325
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 68/160 (42%), Positives = 91/160 (56%), Gaps = 11/160 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R +WP+C ++ I DQSNCGSCWA ++SDR CI + ISA +++
Sbjct: 80 IPDMFDSRTQWPDCKTIGLIEDQSNCGSCWAFGATESMSDRYCIHMKMHLL--ISAANLM 137
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYN----SQEGCQPYTLAPCEHHVQGPLQ 301
C NC GC GG+ AW +W G+VTGG YN + CQPY L CEHH+ G
Sbjct: 138 ECCRNCGNGCEGGFLGAAWNYWKQEGLVTGGLYNPSATESDTCQPYPLPSCEHHINGSKP 197
Query: 302 NCTLLGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C K+ KTPEC C+ Y ++Y DL G+ A+ V
Sbjct: 198 ACP--SKIAKTPECVHTCH-AGYPTSYEQDLHYGESAYSV 234
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 69/99 (69%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY + A+ V R +I +GP+ A F+VYADF YKSGVY+ + +G HAV+++
Sbjct: 226 HYGESAYSVRRRVAEIQTEIMTNGPVEAAFTVYADFPAYKSGVYKRHSLRQLGGHAVKMI 285
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E+ IPYWL+ANSWN WGDHG FKI+RG++E IE
Sbjct: 286 GWGEEDGIPYWLIANSWNSDWGDHGYFKIVRGQDECGIE 324
>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
Length = 356
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 64/160 (40%), Positives = 100/160 (62%), Gaps = 6/160 (3%)
Query: 184 AKGLPRNFDAREKW-PECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ LP+NFD+R++W +CPSL + DQS CGSCWA + A ++SDR+CI + ++S
Sbjct: 94 VENLPKNFDSRKQWGSKCPSLNEVRDQSTCGSCWAFAAAESLSDRICIHTGEDV--RLST 151
Query: 243 QHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
+++V+C +C GCNGG+P+ A +++ G+VTG + CQ Y+ PC HHV +
Sbjct: 152 ENLVSCCSSCGDGCNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAYSFPPCAHHV-ASTK 210
Query: 302 NCTLLGKLKTPECKQNCYNPS-YESTYRFDLKKGKKAHMV 340
G++ TPECK+ C + S + Y DL KG+K++ V
Sbjct: 211 YPPCKGEVPTPECKKKCDDDSKVKRPYNEDLYKGQKSYSV 250
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 52/83 (62%), Positives = 64/83 (77%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I +GP+ F+VY DF+ YKSGVYQH G+ +G HAV+++GWGVEND PYWL+ NS
Sbjct: 258 MTEIMNNGPVEVAFTVYEDFVTYKSGVYQHVTGEQLGGHAVKMIGWGVENDTPYWLIVNS 317
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN+ WGD GTFKILRG NE IE
Sbjct: 318 WNETWGDQGTFKILRGSNECGIE 340
>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 340
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 86/136 (63%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FDAR+KW C ++ + DQ NCGSCWA++ ++A +DRLC+A+N F +SA+ I
Sbjct: 88 IPKKFDARKKWRHCTTIGAVRDQGNCGSCWAIATSSAFADRLCVATNADFNQLLSAEEIT 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG+P AW + +G+VTGG+Y S EGC+PY + PC + G N T
Sbjct: 148 FCCHKCGYGCNGGYPIKAWERFKKHGLVTGGEYKSGEGCEPYRVPPCPYDESG---NNTC 204
Query: 306 LGKL--KTPECKQNCY 319
GK + C + CY
Sbjct: 205 SGKPMEQNHRCTRMCY 220
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 55/83 (66%), Gaps = 1/83 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
+ + +GP+ A F VY DFL YKSGVY + S +G HAV+++GWG E PYWL+ NS
Sbjct: 247 KDVMTYGPIEASFDVYDDFLSYKSGVYVRSENASYLGGHAVKLIGWGEEYGTPYWLMMNS 306
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD G FKI RG NE ++
Sbjct: 307 WNADWGDEGLFKIRRGTNECGVD 329
>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 73/172 (42%), Positives = 99/172 (57%), Gaps = 15/172 (8%)
Query: 176 LETMGCQNAKGLPRNFDAREKWPE-CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
LET+ Q A GLP FDAR +W + C SL + DQS CGSCWA A ++SDR CI
Sbjct: 83 LETVSAQ-ANGLPEEFDARVQWGDKCSSLWEVRDQSTCGSCWAFGAAESLSDRHCI---- 137
Query: 235 YFTGQ---ISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
GQ +S Q+++ C C GC+GGWP+ A ++ + G+VTG Y + CQ YT A
Sbjct: 138 -HLGQDIRLSTQNLLTCCAACGDGCDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFA 196
Query: 291 PCEHHVQGPLQ-NCTLLGKLKTPECKQNC-YNPSYESTYRFDLKKGKKAHMV 340
PC HHV + CT G+L TP C +C N ++ Y D+ +G KA+ +
Sbjct: 197 PCAHHVTSDIYPPCT--GELPTPPCINSCDSNSTHTIPYSKDIHRGSKAYGI 246
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 51/83 (61%), Positives = 64/83 (77%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ +VY DFL YK+GVYQH GD +G HAV+++GWGVEN PYW + NS
Sbjct: 254 MAEIYKNGPIEVALTVYEDFLTYKTGVYQHVTGDELGGHAVKMVGWGVENGTPYWTIVNS 313
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN+ WGD GTFKILRG+NE IE
Sbjct: 314 WNESWGDKGTFKILRGKNECGIE 336
>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
Length = 463
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 90/154 (58%), Gaps = 10/154 (6%)
Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P NFDAR +P C + H+ DQ +CGSCWA + A +DRLCI S G +S QH
Sbjct: 168 VPANFDARTAFPVCKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKGVMPLSTQHT 227
Query: 246 VAC--TPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNS---QEGCQPYTLAPCEHHVQG 298
+C +C +GCNGG P +AWR++ GVVTGGD+++ C PY + C HH +
Sbjct: 228 TSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDTLGKGTTCWPYEIPFCAHHAKA 287
Query: 299 PLQNC-TLLGKLKTPECKQNCYNPSY-ESTYRFD 330
P NC T + KTP+C+++C +Y E FD
Sbjct: 288 PFPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFD 321
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 68/127 (53%), Gaps = 4/127 (3%)
Query: 38 KKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAH----MVPRCNAMRQIYEHGPLVAIFSVY 93
+ +K K +K ++ Y +P KA + R R + HG + F VY
Sbjct: 297 RPRKTPKCRKDCEEAAYSEHVLPFDKDVHKASSSYSLRSRDAVKRDMMAHGTVTGAFMVY 356
Query: 94 ADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGE 153
DFL YKSGVY+H +G +G HA++++GWG E+ YW NSWN +WGD G FKI G+
Sbjct: 357 EDFLNYKSGVYKHVYGGPLGGHAIKIIGWGTEDGEEYWHAVNSWNTYWGDSGHFKIEMGQ 416
Query: 154 NEADIEM 160
D EM
Sbjct: 417 CGVDNEM 423
>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
Length = 334
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 62/136 (45%), Positives = 87/136 (63%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFDAR+KW +C ++ + DQ +CGSCWA ++A +DRLCIA++G F +SA+ +
Sbjct: 85 IPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELA 144
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW ++ +G+VTGGDY+S EGCQPY + PC G N T
Sbjct: 145 FCCHKCGFGCHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYG---NNTC 201
Query: 306 LGKL--KTPECKQNCY 319
GK K C + CY
Sbjct: 202 RGKPAEKNHRCTRMCY 217
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 62/98 (63%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLG 121
H+ + A+ + + + +GP+ A F VY DF YKSGVY S +G HAV+++G
Sbjct: 229 HWTRDAYYLTYTTIQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENASYLGGHAVKLIG 288
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E +PYWL+ NSWND WGD G FKILRG NE I+
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKILRGTNECGID 326
>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
Length = 333
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 64/139 (46%), Positives = 87/139 (62%), Gaps = 6/139 (4%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
A+ +P NFDAR+KW +C S+ + DQ +CGSCWA ++A +DRLCIA+ G F +SA+
Sbjct: 81 AQRIPSNFDARKKWKKCLSIGEVRDQGHCGSCWAFGTSSAFADRLCIATEGEFNELLSAE 140
Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
+ C C +GCNGG+P AW + +G+VTGG+Y+S EGCQPY + PC G N
Sbjct: 141 ELTFCCHKCGFGCNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPCPLDEYG---N 197
Query: 303 CTLLGKL--KTPECKQNCY 319
T GK K C + CY
Sbjct: 198 NTCHGKPMEKNHRCTRMCY 216
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 60/98 (61%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLG 121
HY + A+ + + +GP+ A F VY DF YKSGVY S +G HAV+++G
Sbjct: 228 HYTRDAYYLTYGTIQNDVLTYGPIEASFEVYDDFPSYKSGVYVKTENASYLGGHAVKLIG 287
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E +PYWL+ NSWND WGD G FKI RG NE I+
Sbjct: 288 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 325
>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
Cathepsin B
Length = 205
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 87 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 146
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 147 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 194
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 62/104 (59%), Gaps = 5/104 (4%)
Query: 239 QISAQHIVACTPNCWG--CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
++SA+ ++ C + G CNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV
Sbjct: 4 EVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHV 63
Query: 297 QGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G CT G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 64 NGSRPPCT--GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 104
>gi|741376|prf||2007265A cathepsin B
Length = 153
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 29 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 88
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 89 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 136
>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 339
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 87/147 (59%), Gaps = 5/147 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR FDAR +W C ++ + DQ CGSCWA ++A +DRLC+A++G F +SA+ +
Sbjct: 87 IPRTFDARRRWRHCKTIGEVRDQGYCGSCWAFGTSSAFADRLCVATDGDFNELLSAEELT 146
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C GCNGG+P AW+++ +G+VTGG+Y S EGC+PY + PC + G +C
Sbjct: 147 FCCHTCGNGCNGGYPIKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPRNEDG-TSSCAG 205
Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
K C + CY + Y +RF
Sbjct: 206 QPIEKNHRCTRMCYGNQDLDYNDDHRF 232
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 57/83 (68%), Gaps = 1/83 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
+ + +GP+ A F VY DF YKSGVYQ + +G HAV+++GWGVE IPYWL+ NS
Sbjct: 246 KDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPYWLMVNS 305
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W+ WGD+G FKI RG +E I+
Sbjct: 306 WSAQWGDNGLFKIRRGTDECGID 328
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 149 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 208
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 209 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 256
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 29/48 (60%), Positives = 37/48 (77%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
LP +FDARE+WP+CP+++ I DQ +CGSCWA AISDR+CI NG
Sbjct: 80 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHVNG 127
>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
Length = 209
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 85 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 144
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 145 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 192
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 62/104 (59%), Gaps = 5/104 (4%)
Query: 239 QISAQHIVACTPNCWG--CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
++SA+ ++ C + G CNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV
Sbjct: 2 EVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHV 61
Query: 297 QGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G CT G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 62 NGSRPPCT--GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 102
>gi|157058755|gb|ABV03135.1| cathepsin B-84 [Aulacorthum solani]
Length = 218
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 91/155 (58%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R +W C ++ H+ +Q NCGSCWA A +DRLC+A+NG ISA+ +
Sbjct: 44 VPEFFDSRLEWKYCKTIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEVNQLISAEEVT 103
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG P AW+++ +GVVTGGDYN+ +GCQPY + PC +G +C+
Sbjct: 104 FCCHRCGFGCNGGNPLRAWQYFKRHGVVTGGDYNTTDGCQPYRVPPCVKDDKG-HNSCSG 162
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ +C + CY Y+ D K K A+ +
Sbjct: 163 QPTERNHKCSKKCYGDD-TVDYKSDHYKTKDAYYL 196
>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
Length = 346
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 54/113 (47%), Positives = 74/113 (65%), Gaps = 1/113 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P++FDAR WP+C SLR + DQS CGS WAV+ AI DR+CIAS G +SA I+
Sbjct: 94 IPKSFDARTNWPKCASLRTVRDQSACGSGWAVAAVGAIMDRICIASEGKQQVILSADDIL 153
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG 298
+C C +GC GG AW +W +G+VTG +Y ++ GC+PY PCEH++
Sbjct: 154 SCCTECGYGCEGGDTYKAWNYWTTDGIVTGSNYTTKSGCKPYPYPPCEHYIDA 206
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 60/82 (73%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I HGP+ F VY DF Y SG+Y+H G+ +G+HAV++LGWG EN + YW+ ANSW
Sbjct: 256 QEIMNHGPVEVTFDVYEDFEHYSSGIYKHMAGEYVGVHAVKMLGWGTENGVDYWICANSW 315
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WG++G F+ILRGENE IE
Sbjct: 316 NSDWGENGFFRILRGENECGIE 337
>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
Length = 334
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 57/133 (42%), Positives = 82/133 (61%), Gaps = 3/133 (2%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P+ FD+RE W C + HI DQ NCGSCW+ S A +DRLC+++ G F +S + +
Sbjct: 86 PQQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNELLSPEELAF 145
Query: 248 CTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
C +C GC GG+P AWR++ GV TGGDY+++EGC+PY +APC ++ QG C
Sbjct: 146 CCKDCGNGCEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPC-YNKQGK-NTCGGK 203
Query: 307 GKLKTPECKQNCY 319
+ +C + CY
Sbjct: 204 PMERNHQCPKTCY 216
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 45/107 (42%), Positives = 64/107 (59%), Gaps = 2/107 (1%)
Query: 66 KKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVRVLGWG 123
K +++ + Q I +GP+ A F VY DF YKSG+Y+ H+V+++GWG
Sbjct: 228 KSEYVINSIKTIEQDIKTYGPVEASFDVYDDFSVYKSGIYRKTPNAKYQNGHSVKIIGWG 287
Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS 170
EN PYWL NSW+ WGDHGTFKI++G+NE IE + ++S
Sbjct: 288 QENGTPYWLAVNSWSKFWGDHGTFKIIKGKNECGIERAVTAGIPSSS 334
>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 337
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/157 (40%), Positives = 94/157 (59%), Gaps = 6/157 (3%)
Query: 164 NRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
++V N ++DD N + +P FDAR+KW C ++ + DQ NCGS WA+S ++A
Sbjct: 67 DKVNYNMYKNDDH----ADNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSA 122
Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
+DRLC+A+NG F +SA+ I C C GCNGG+P AW+ + ++G+VTGG+Y S E
Sbjct: 123 FADRLCVATNGDFNQLLSAEEITFCCHKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGE 182
Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY 319
GC+PY + PC + G C+ +C + CY
Sbjct: 183 GCEPYRVPPCPYDKDGK-NTCSGQPMESNHKCSKKCY 218
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/97 (40%), Positives = 57/97 (58%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
Y + + + + + +GP+ F VY DF YKSG+Y + S +G H+V+++GW
Sbjct: 231 YTRDDYYLTYRGIQKDVINYGPIETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGW 290
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E + YWL+ NSWN WGD G FKI RG NE ++
Sbjct: 291 GEEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECRVD 327
>gi|347546077|gb|AEP03186.1| cathepsin B [Diuraphis noxia]
Length = 239
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 52/106 (49%), Positives = 76/106 (71%), Gaps = 1/106 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FDAR+KW +C ++ + DQ CGSCWAVS ++A +DRLCIA++G F +SA I
Sbjct: 47 IPKTFDARKKWVQCDTIGRVRDQGQCGSCWAVSTSSAFADRLCIATDGDFNELLSADEIT 106
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
C C +GC+GG+P AW+ + +G+VTGGD++S EGC+PY + P
Sbjct: 107 FCCYTCGFGCDGGYPIKAWKQFSRHGLVTGGDFDSGEGCEPYRVPP 152
>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
Length = 332
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/136 (45%), Positives = 86/136 (63%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FDAR+KW +C ++ + DQ CGSCWA ++A +DRLCIA+NG F +SA+ +
Sbjct: 83 IPKFFDARKKWRKCFTIGEVRDQGKCGSCWAFGTSSAFADRLCIATNGEFNELLSAEELT 142
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW + +G+VTGGDY+S EGCQPY ++PC G N T
Sbjct: 143 FCCHKCGFGCHGGYPIKAWERFQKHGLVTGGDYDSGEGCQPYRVSPCPLDEYG---NNTC 199
Query: 306 LGK--LKTPECKQNCY 319
GK K C + CY
Sbjct: 200 RGKPAEKNHRCTRMCY 215
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 61/98 (62%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLG 121
H+ + A+ + R + +GP+ A + VY DF YKSGVY + +G HAV+++G
Sbjct: 227 HFTRDAYYLTFGIIQRDVMAYGPIEASYDVYDDFPSYKSGVYVRTENATYLGGHAVKLIG 286
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E +PYWL+ NSWND WGD G FKI RG NE I+
Sbjct: 287 WGEEYGVPYWLMVNSWNDQWGDKGLFKIRRGTNECGID 324
>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
Length = 245
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 121 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 180
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 181 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 228
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 55/131 (41%), Positives = 77/131 (58%), Gaps = 5/131 (3%)
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWG--CNGGWPQLAWRFWGH 269
C WA AISDR+CI +N + + ++SA+ ++ C + G CNGG+P AW FW
Sbjct: 11 CRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 70
Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRF 329
G+V+GG Y S GC+PY++ PCEHHV G CT G+ TP+C + C P Y TY+
Sbjct: 71 KGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKCSKIC-EPGYSPTYKQ 127
Query: 330 DLKKGKKAHMV 340
D G ++ V
Sbjct: 128 DKHYGYNSYSV 138
>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
Length = 512
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/162 (40%), Positives = 96/162 (59%), Gaps = 14/162 (8%)
Query: 191 FDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT 249
FDARE +P+C + H+ DQ +CGSCWA + A++DR CI S G +S QH +C
Sbjct: 241 FDAREAFPQCAEVIGHVRDQGDCGSCWAFASTEALNDRFCIKSGGRHREALSPQHTTSCC 300
Query: 250 P--NC--WGCNGGWPQLAWRFWGHNGVVTGGDYN---SQEGCQPYTLAPCEHHVQGPLQN 302
+C +GC+GG P++AWR++ ++GVVTGGDYN + + C PY + C HH +GP
Sbjct: 301 DLLHCLSFGCSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPK 360
Query: 303 CTLLGKL-KTPECKQNCYNPSYES---TYRFDLKKGKKAHMV 340
C G L K P+C+++C Y S ++ DL A+ V
Sbjct: 361 CE--GPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSV 400
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 60/99 (60%), Gaps = 1/99 (1%)
Query: 63 HYFKKAHMVP-RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
H+ A+ V R R++ E+G L F VY DFL YK GVY H G +G HAV+V+G
Sbjct: 392 HFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIG 451
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
+G E+ YWL NSWN++WGD GTFKI GE D E
Sbjct: 452 FGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGEAGIDKEF 490
>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 69/165 (41%), Positives = 91/165 (55%), Gaps = 6/165 (3%)
Query: 166 VEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAIS 225
+E + ++++ L + +P +FD+REKW +CPSLR I DQSNCGSCWAVS A +S
Sbjct: 75 IERSYNQENVLPVANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMS 134
Query: 226 DRLCIASNGYFTGQISAQHIVACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
DRLCI S G +SA I+AC +GC+GG+ AW++ GVVTGG Y +
Sbjct: 135 DRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGN 194
Query: 284 CQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
C+PY C H NC TP CK C Y YE+
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCP-SHPYATPACKPYCQYGYGKRYEN 238
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 56/84 (66%), Gaps = 1/84 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I + GP+ A F++Y DF Y GVY H G G H+++++GWGV+ + YWL+ANSW+
Sbjct: 259 EIMKKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIANSWS 318
Query: 139 DHWG-DHGTFKILRGENEADIEMG 161
WG D G F+++RG N DIE G
Sbjct: 319 TDWGEDGGYFRVVRGINNCDIEGG 342
>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
Length = 351
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/163 (40%), Positives = 94/163 (57%), Gaps = 12/163 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSC--W-----AVSVANAISDRLCIASNGYFTGQ 239
LP +F ARE+WP+CP++ Q G W A AISDR+CI +N + + +
Sbjct: 85 LPESFYAREQWPQCPTIXXXRAQPGRGGLTRWGSFLQAFGAVEAISDRICIHTNAHISVE 144
Query: 240 ISAQHIVACTPNCWG--CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ 297
+SA+ ++ C + G CNGG+P AW FW G+V+GG Y+S GC+PY++ PCEHHV
Sbjct: 145 VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVN 204
Query: 298 GPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G CT G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 205 GSRPPCT--GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 244
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 227 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEM 286
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 287 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 334
>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
Length = 512
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/162 (40%), Positives = 96/162 (59%), Gaps = 14/162 (8%)
Query: 191 FDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT 249
FDARE +P+C + H+ DQ +CGSCWA + A++DR CI S G +S QH +C
Sbjct: 241 FDAREAFPQCAEVIGHVRDQGDCGSCWAFASTEALNDRFCIKSGGRHREALSPQHTTSCC 300
Query: 250 P--NC--WGCNGGWPQLAWRFWGHNGVVTGGDYN---SQEGCQPYTLAPCEHHVQGPLQN 302
+C +GC+GG P++AWR++ ++GVVTGGDYN + + C PY + C HH +GP
Sbjct: 301 DLLHCLSFGCSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPK 360
Query: 303 CTLLGKL-KTPECKQNCYNPSYES---TYRFDLKKGKKAHMV 340
C G L K P+C+++C Y S ++ DL A+ V
Sbjct: 361 CE--GPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSV 400
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 60/99 (60%), Gaps = 1/99 (1%)
Query: 63 HYFKKAHMVP-RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
H+ A+ V R R++ E+G L F VY DFL YK GVY H G +G HAV+V+G
Sbjct: 392 HFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIG 451
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
+G E+ YWL NSWN++WGD GTFKI GE D E
Sbjct: 452 FGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGEAGIDKEF 490
>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
Length = 342
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 61/133 (45%), Positives = 75/133 (56%), Gaps = 2/133 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR WP CPS+ I DQS+CGSCWA A+SDRLCI S G F +SA +V
Sbjct: 86 LPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLV 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG+ +AW W +G+VTGG GC+ Y CEH +G C
Sbjct: 146 SCCTECGCGCRGGYSPIAWDLWKTHGIVTGGSKEKPTGCRSYPFPSCEHRGKGQYPPCPH 205
Query: 306 LGKLKTPECKQNC 318
TPEC + C
Sbjct: 206 Q-LYPTPECIKRC 217
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 44/83 (53%), Positives = 57/83 (68%)
Query: 76 AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
M++I GP+ AI VY D L YKSGVY H +G +G H +R+LGWG E+ +PYWLVAN
Sbjct: 244 VMKEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWGEEDGVPYWLVAN 303
Query: 136 SWNDHWGDHGTFKILRGENEADI 158
SWN+ WG+ G ++LR NE I
Sbjct: 304 SWNEDWGEKGYMRVLRWRNECGI 326
>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
Length = 276
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 152 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 211
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 212 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 259
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 57/101 (56%), Gaps = 6/101 (5%)
Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
+S I C + CNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G
Sbjct: 75 LSEVFITGCL---FSCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGS 131
Query: 300 LQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
CT G+ TP+C + C P Y TY+ D G ++ V
Sbjct: 132 RPPCT--GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 169
>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 341
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 74/177 (41%), Positives = 102/177 (57%), Gaps = 18/177 (10%)
Query: 172 EDDDLETMGCQNA--KGLPRNFDAREKWPE-CPSLRHIADQSNCGSCWAVSVANAISDRL 228
E D+ E + NA LP FDAR++W + C SL + DQSNCGSCWA +++DR
Sbjct: 75 EGDNGENLPVSNAVKADLPTAFDARQQWGDKCTSLWEVRDQSNCGSCWAFGAVESLTDRH 134
Query: 229 CIASNGYFTGQ---ISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGC 284
CI GQ +SAQ+++ C C GCNGG+P A ++ G+VTG YN+ C
Sbjct: 135 CI-----HLGQDIRLSAQNMLTCCATCGQGCNGGYPASAMSYYVKTGLVTGDLYNTTGWC 189
Query: 285 QPYTLAPCEHHVQGPLQ-NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
Q Y+ APC HHV PL CT G+L TP+C + C + S ++ + + KG KA+ V
Sbjct: 190 QAYSFAPCAHHVDTPLYPACT--GELPTPKCAKTCDSGSGQT---YTVHKGSKAYSV 241
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 51/83 (61%), Positives = 66/83 (79%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I +GP+ A F+VY DFL YKSGVY+H G ++G HA++++GWGVEN+ PYW+V NS
Sbjct: 249 MTEIQTNGPVEAAFTVYEDFLNYKSGVYKHVTGKALGGHAIKIVGWGVENNTPYWIVVNS 308
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+GTFKILRG+NE IE
Sbjct: 309 WNQTWGDNGTFKILRGKNECGIE 331
>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
Length = 348
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 69/165 (41%), Positives = 91/165 (55%), Gaps = 6/165 (3%)
Query: 166 VEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAIS 225
+E + ++++ L + +P +FD+REKW +CPSLR I DQSNCGSCWAVS A +S
Sbjct: 75 IERSYNQENVLPIANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMS 134
Query: 226 DRLCIASNGYFTGQISAQHIVACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
DRLCI S G +SA I+AC +GC+GG+ AW++ GVVTGG Y +
Sbjct: 135 DRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGN 194
Query: 284 CQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
C+PY C H NC TP CK C Y YE+
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCP-SHPYATPACKPYCQYGYGKRYEN 238
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 57/84 (67%), Gaps = 1/84 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I + GP+ A F++Y DF Y+ GVY H G G H+++++GWGV+ + YWL+ANSW+
Sbjct: 259 EIMQKGPVHATFNIYEDFEHYEGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIANSWS 318
Query: 139 DHWG-DHGTFKILRGENEADIEMG 161
WG D G F+++RG N DIE G
Sbjct: 319 TDWGEDGGYFRVVRGINNCDIEGG 342
>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
Length = 339
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/157 (43%), Positives = 92/157 (58%), Gaps = 7/157 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAREKWP C S+ I D S CGSCWAVS A+ +SDRLCI +NG +S+ I+
Sbjct: 88 LPERFDAREKWPHCASIGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILSSADIL 147
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC +C GC GG+P A+ + + GV +GG+Y + C+PY PC+ + GP C
Sbjct: 148 ACCGEDCGSGCEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCDGNY-GP---CP 203
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
G TP+C++ C Y Y D GK +H++L
Sbjct: 204 KEGAFDTPKCRKIC-QFRYPVPYEEDKVFGKNSHILL 239
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 68/97 (70%), Gaps = 3/97 (3%)
Query: 66 KKAHMVPRCNAMR---QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGW 122
K +H++ + N R +I+ +GP+ A F V+ DF+ YK G+Y+ +G IG+HA++++GW
Sbjct: 233 KNSHILLQDNEARIRQEIFINGPVGANFYVFEDFIHYKEGIYKQTYGKWIGVHAIKLIGW 292
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G EN YWLVANS+N WG++GTF+ILRG N IE
Sbjct: 293 GTENGTDYWLVANSYNYDWGENGTFRILRGTNHCLIE 329
>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 223
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 54/88 (61%), Positives = 69/88 (78%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I+++GP+ A F+ YADFL YKSGVYQH+ D IG HA+R+LGWG E++ PYWL+ANSWN
Sbjct: 132 EIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWGSEDNNPYWLLANSWN 191
Query: 139 DHWGDHGTFKILRGENEADIEMGFNNRV 166
+ WGDHG FK+LRG NE DIE N +
Sbjct: 192 EDWGDHGYFKMLRGVNECDIESFVNAGI 219
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 54/125 (43%), Positives = 74/125 (59%), Gaps = 4/125 (3%)
Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTG 275
A A+SDR+CI SNG ISA+ ++ C C GC+GG AW++W G+V+G
Sbjct: 1 AFGAVEAMSDRVCIHSNGRVQVDISAEDLMDCCDKCGSGCSGGVSAAAWQYWKDAGLVSG 60
Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
G YN+ +GC+PY+LAPCEH QG L C +G L TP+CK+ C YE +Y D K
Sbjct: 61 GLYNTTDGCKPYSLAPCEHSSQGSLPEC--VGTLPTPKCKRQC-REGYERSYDDDKYFAK 117
Query: 336 KAHMV 340
+ +
Sbjct: 118 NVYSI 122
>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
Length = 332
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 90/154 (58%), Gaps = 10/154 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAREKWP C S+ I +Q CG+CWAV+ + +SDRLCI S G F +++A+ ++
Sbjct: 85 IPEFFDAREKWPYCKSISTIKNQGLCGACWAVAAVSVMSDRLCIHSEGKFDVELAAEDLM 144
Query: 247 ACTPNCW-GCNGGWPQ-LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C +C GCNGG+ ++++W G+V+G YNS +GC+PY PC + P C
Sbjct: 145 GCCKDCGNGCNGGFLDGTSFQYWVDVGLVSGAAYNSTDGCKPYPFKPCLY----PFVGCH 200
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
KTP C +C Y+ TYR D G A+
Sbjct: 201 ---PEKTPSCTHHC-TEGYDGTYRRDKYYGSAAY 230
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
+Y A+ +P M Q I +GP+ + FSVY D YK+GVYQH G +G HAVR++
Sbjct: 224 YYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLI 283
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E +PYWL+ANS+ + WG+HG FK LRG N IE
Sbjct: 284 GWGKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIE 322
>gi|294877495|ref|XP_002768009.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239870149|gb|EER00727.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 180
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 85/154 (55%), Gaps = 8/154 (5%)
Query: 182 QNAKGLPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI 240
+ + LP +FDAR +P C + HI DQS CGSCWA V A +DRLCI S+G FT +
Sbjct: 27 EELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSDGAFTELL 86
Query: 241 SAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ------EGCQPYTLAPCEH 294
SA + ACT +GC GG P AW + G+ TGGDY ++ +GC PY PC H
Sbjct: 87 SAGEMNACTLF-FGCGGGDPYSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAH 145
Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
H+ G TP C + C+NP Y +T R
Sbjct: 146 HINDTKYPKCPEGLYPTPNCVEQCHNPKYTTTLR 179
>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
Length = 195
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 74/108 (68%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 71 YSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 130
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+R+LGWGVEN PYWLVANSWN WGD+G FKILRG++ IE
Sbjct: 131 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 178
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 53/87 (60%), Gaps = 3/87 (3%)
Query: 254 GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPE 313
GCNGG+P AW FW G+V+GG Y S GC+PY++ PCEHHV G CT G+ TP+
Sbjct: 5 GCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPK 62
Query: 314 CKQNCYNPSYESTYRFDLKKGKKAHMV 340
C + C P Y TY+ D G ++ V
Sbjct: 63 CSKIC-EPGYSPTYKQDKHYGYNSYSV 88
>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
Length = 171
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/129 (48%), Positives = 81/129 (62%), Gaps = 3/129 (2%)
Query: 213 GSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNG 271
GSCWA A AISDRLCI SNG + +IS++ ++AC +C GCNGG+P AW FW G
Sbjct: 1 GSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLACCDSCGMGCNGGYPSAAWDFWTDVG 60
Query: 272 VVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDL 331
+V+GG Y+S GC+PYT+ PCEHHV G CT G TP+C C Y +Y+ D
Sbjct: 61 LVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGG-DTPQCILQC-ESGYTPSYKADK 118
Query: 332 KKGKKAHMV 340
GK ++ V
Sbjct: 119 HYGKSSYSV 127
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 37/62 (59%), Gaps = 2/62 (3%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ HY K ++ VP Q IY++GP+ F+VY DFL YK+GVYQH G +
Sbjct: 110 YTPSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSA 169
Query: 112 IG 113
+G
Sbjct: 170 VG 171
>gi|312105965|ref|XP_003150617.1| hypothetical protein LOAG_15077 [Loa loa]
Length = 150
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/146 (45%), Positives = 92/146 (63%), Gaps = 6/146 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDAR +WP C S+ +A+Q CGSCWA+S A+ +SDRLCIA+N QISA+ ++
Sbjct: 8 LPKHFDARLRWPLCWSVHVVANQGGCGSCWAISAASVMSDRLCIATNYSNQKQISAEDLI 67
Query: 247 ACTPNCWGCNGG-WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC G W A+ +W ++GVVTGGDY S EGC+PYT AP + P
Sbjct: 68 SCCTECGGCQGSHWALSAFIYWRNHGVVTGGDYGSFEGCKPYTTAP---NCGSPCSFEYY 124
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDL 331
K+ +P C++ C P Y +Y DL
Sbjct: 125 RRKI-SPACQKTC-QPLYGLSYEEDL 148
>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
Length = 130
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 67/83 (80%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 31 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 90
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRGEN IE
Sbjct: 91 WNVDWGDNGFFKILRGENHCGIE 113
>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
Length = 324
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 61/147 (41%), Positives = 81/147 (55%), Gaps = 15/147 (10%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
+ +P FD R W +CPSL++I +Q NCGSCWA ++DRLCIAS G + SA
Sbjct: 80 EAIPETFDGRTHWSQCPSLKNIRNQGNCGSCWAFGSVEVMTDRLCIASKGKTKFEFSADD 139
Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
++AC C GC+GG P A+ +W G+V+GGDYNS EGCQPY +
Sbjct: 140 LLACCTACGKGCDGGAPYRAFEYWVAKGIVSGGDYNSNEGCQPY-------------EGS 186
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFD 330
L + TP+C C N Y + Y D
Sbjct: 187 AFLNSV-TPKCSTKCLNSKYTTPYAKD 212
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 57/82 (69%), Gaps = 1/82 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+V VY DF YKSGVYQH G+S+G HAV+++GWG E +PYWL+ANSW
Sbjct: 233 EIMNNGPVVTHMDVYEDFYSYKSGVYQHVSGNSMGGHAVKIIGWGTEKGVPYWLIANSWG 292
Query: 139 DHWGD-HGTFKILRGENEADIE 159
W D G +KILRG+N IE
Sbjct: 293 AKWADLDGFYKILRGKNHCKIE 314
>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
Length = 334
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 94/156 (60%), Gaps = 10/156 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR WP C SLR I +Q CGSCWAV+ A+ +SDR+CI SNG ++A+ ++
Sbjct: 87 IPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLM 146
Query: 247 ACTPNCW-GCNGGWPQ-LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C +C GCNGG+ ++++W G+V+GG YNS +GC+PY PCE+ P +C
Sbjct: 147 GCCVDCGNGCNGGFLDGTSFQYWVDAGLVSGGAYNSTDGCKPYPFKPCEY----PFNDCH 202
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ +P+C +C + + Y D GK A+ V
Sbjct: 203 V---EISPKCTHHCRD-GVDRHYSKDKLFGKVAYSV 234
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/96 (56%), Positives = 68/96 (70%), Gaps = 2/96 (2%)
Query: 66 KKAHMVPRCN-AMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
K A+ VPR A+R +I +GP+ A F VY D L YKSGVY+H +G+ IG HAVR++GWG
Sbjct: 229 KVAYSVPRDERAIRYEIMTNGPVEAGFDVYEDVLLYKSGVYRHVYGEQIGKHAVRIIGWG 288
Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ IPYWL+ANS+ D WGDHG FK +RG N IE
Sbjct: 289 RDGGIPYWLIANSYGDDWGDHGYFKFVRGSNHLGIE 324
>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
Length = 374
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 88/156 (56%), Gaps = 12/156 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD+RE+WPEC S++ I +Q+ CGSCWA A ISDR+CI SN T IS + I+
Sbjct: 97 LPDTFDSREQWPECKSIKLIRNQATCGSCWAFGAAEIISDRICIQSNATQTPIISVEDIL 156
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C +C GC GG+ A RFW +G VTGGDYN GC PY+ APC+ +C
Sbjct: 157 SCCGVSCGKGCQGGYSIEALRFWKSSGAVTGGDYNGA-GCMPYSFAPCKK------DSC- 208
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ TP CK C + + Y D G A+ +
Sbjct: 209 --AQGTTPSCKTTCQSSYKTAEYTKDKHFGTTAYKI 242
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 72/130 (55%), Gaps = 13/130 (10%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+ + A Q IY +GP+ A F VY DF +YKSGVYQ+ G +G HAV+++
Sbjct: 234 HFGTTAYKITNSVAAIQTEIYHNGPVEASFKVYEDFYKYKSGVYQYTSGKLVGGHAVKII 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS--------SE 172
GWG EN + YWL+ANSW +GD G FK+ RG NE IE N V + E
Sbjct: 294 GWGTENGVDYWLIANSWGTTFGDSGFFKMRRGTNEVGIE---GNVVAGTAKLGTHDEKRE 350
Query: 173 DDDLETMGCQ 182
DDD C
Sbjct: 351 DDDGAATSCS 360
>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
Length = 339
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/198 (34%), Positives = 108/198 (54%), Gaps = 19/198 (9%)
Query: 145 GTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLR 204
G F+ ++G E+ ++ ++ SS D+ + +P FDAREKWP C S+
Sbjct: 59 GEFRSIKGIYESPLDFTLPSKRLHASSLDEVV----------IPDRFDAREKWPFCQSIH 108
Query: 205 HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQ-L 262
+ +Q CGSCWAV+ + +SDRLCI S+G +++ + ++ C +C GCNGG+
Sbjct: 109 SVRNQGTCGSCWAVATVSVMSDRLCIHSDGEVNLELATEDLMGCCKDCGNGCNGGFLDGT 168
Query: 263 AWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPS 322
A+++W G+V+G YNS EGC+PY PC + P C + K P+C +C N
Sbjct: 169 AFQYWVDAGLVSGAPYNSSEGCKPYPFEPCSY----PFVGCH--HEKKNPKCLHHCIN-G 221
Query: 323 YESTYRFDLKKGKKAHMV 340
Y+ YR D G A+ +
Sbjct: 222 YDRKYRKDKFFGATAYKI 239
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 61/94 (64%), Gaps = 2/94 (2%)
Query: 68 AHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 125
A+ +P M Q I +GP+ F V+ DF Y SGVY+H G +G+HA+R++GWG E
Sbjct: 236 AYKIPNDARMIQLEIMTNGPVATGFEVFEDFYFYHSGVYKHVVGKKVGMHAIRIVGWGTE 295
Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
N PYWL+ANS+ D WGD G FK+LRG N IE
Sbjct: 296 NGTPYWLIANSYGDTWGDKGFFKMLRGSNHLGIE 329
>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
Length = 335
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 59/138 (42%), Positives = 84/138 (60%), Gaps = 2/138 (1%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
N +P FDAR +W C ++ + +Q NCGSCWA A +DRLCIA+NG F ISA
Sbjct: 80 NDSEIPNFFDARIQWSHCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATNGDFNELISA 139
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
+ + C C +GCNGG P AW+++ +GVVTGG+YN+ +GCQPY + PC +G
Sbjct: 140 EELTFCCHRCGFGCNGGNPLKAWQYFKRHGVVTGGNYNTTDGCQPYKVPPCVKDEEGH-N 198
Query: 302 NCTLLGKLKTPECKQNCY 319
+C+ +C ++CY
Sbjct: 199 SCSGQPTEPNHKCSRSCY 216
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/95 (46%), Positives = 59/95 (62%), Gaps = 1/95 (1%)
Query: 66 KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGV 124
K A+ + + +GP+ A F VY DF+ Y+SGVYQ +G HAV+++GWG
Sbjct: 231 KNAYYLNIDTMQKDTIAYGPIEASFDVYDDFVNYESGVYQKTEDAKYLGGHAVKMIGWGE 290
Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
E+ PYWL+ NSW + WG +G FKILRG NE IE
Sbjct: 291 EDGTPYWLMVNSWGEQWGANGMFKILRGTNECGIE 325
>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 332
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 61/172 (35%), Positives = 93/172 (54%), Gaps = 1/172 (0%)
Query: 170 SSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
+ ED L A+ +P FD+R WP CP+++ + DQS CGSCWA ++SDR+C
Sbjct: 62 TPEDQRLPLKVAPIAEAIPDTFDSRTNWPACPTIKEVRDQSACGSCWAFGAVESMSDRIC 121
Query: 230 IASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
IASN ++SA +++C +C GC+GG +W ++ + G+VTG YN+ C+PY
Sbjct: 122 IASNATKIVRLSASDLLSCCTSCGDGCDGGQLGPSWDYYKNKGIVTGYLYNTTGYCKPYD 181
Query: 289 LAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C HH P TP+C ++C +TY DL G+ ++ V
Sbjct: 182 FPACAHHEASPDYPDCPSTDYSTPKCTKSCVAGYTANTYTADLHYGQSSYSV 233
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 53/106 (50%), Positives = 68/106 (64%), Gaps = 8/106 (7%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY + ++ V R +A Q I HGP+ A F+VY+DF Y+SGVY+H G +G HA+ ++
Sbjct: 225 HYGQSSYSVGRTDAAIQTEILNHGPVEAAFTVYSDFPTYRSGVYKHTSGSVLGGHAISIV 284
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
GWG E+ PYWLV NSWN WGD G FKILRG + G NN V
Sbjct: 285 GWGTESGSPYWLVKNSWNPSWGDGGFFKILRG------DCGINNDV 324
>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
Length = 321
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 90/158 (56%), Gaps = 15/158 (9%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
NA+ +P +FDAR KWP C SL I DQ CGSCWA + ++SDR+CI S+G S
Sbjct: 79 NARDVPESFDARTKWPNCDSLNRIRDQGACGSCWAFASIESMSDRICIHSSGSAQFMFSP 138
Query: 243 QHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
+ +++C +C C GG+ A F+ + G+V+GGD NS EGC+PYT + H QG
Sbjct: 139 EDLLSCCTSCGDCGGGYMMSALDFYINEGIVSGGDVNSNEGCRPYT---ADAHDQG---- 191
Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+TP C ++C N Y ++Y D G ++V
Sbjct: 192 -------QTPACTKSCRN-GYSTSYSADKHYGSNDYVV 221
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 45/81 (55%), Positives = 61/81 (75%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
++ +GP++ F V+ DF Y SGVY+H G+S+G H V+++GWGVEN +PYWL+ANSW
Sbjct: 231 EVMTNGPIIVNFEVFQDFYNYVSGVYRHVSGESVGFHVVKIVGWGVENGVPYWLIANSWG 290
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGDHG FK+LRG+NE IE
Sbjct: 291 SSWGDHGFFKMLRGQNECGIE 311
>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
Length = 572
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 92/154 (59%), Gaps = 10/154 (6%)
Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FDAR +P C + H+ DQ +CGSCWA + A +DRLCI S G +SAQH
Sbjct: 277 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKGLMPLSAQHT 336
Query: 246 VAC--TPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNS---QEGCQPYTLAPCEHHVQG 298
+C +C +GCNGG P +AWR++ GVVTGGD+++ C PY + C HH +
Sbjct: 337 TSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAKA 396
Query: 299 PLQNC-TLLGKLKTPECKQNCYNPSY-ESTYRFD 330
P +C L KTP+C+++C +Y ++ + FD
Sbjct: 397 PFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFD 430
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/125 (40%), Positives = 68/125 (54%), Gaps = 4/125 (3%)
Query: 40 KKKKKKKKKKKKRLYLPTSIPLSHYFKKA----HMVPRCNAMRQIYEHGPLVAIFSVYAD 95
+K K +K +++ Y P KA + R + R + HGP+ F VY D
Sbjct: 408 RKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTHGPVSGAFMVYED 467
Query: 96 FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
FL YKSGVY+H G +G HA++++GWG EN YW NSWN +WGD G FKI G+
Sbjct: 468 FLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGDGGQFKIAMGQCG 527
Query: 156 ADIEM 160
D EM
Sbjct: 528 IDGEM 532
>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sm31; Flags: Precursor
gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
Length = 340
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 86/156 (55%), Gaps = 5/156 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFD+R+KWP C S+ I DQS CGSCW+ A+SDR CI S G ++SA ++
Sbjct: 89 IPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLL 148
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C GC GG AW +W G+VT + GC+PY CEHH +G C
Sbjct: 149 TCCESCGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCG- 207
Query: 306 LGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
K+ TP CKQ C Y++ Y D +GK ++ V
Sbjct: 208 -SKIYNTPRCKQTCQR-KYKTPYTQDKHRGKSSYNV 241
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 67/82 (81%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I ++GP+ A F+VY DFL YKSG+Y+H G+++G HA+R++GWGVEN PYWL+ANSW
Sbjct: 250 KEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIANSW 309
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+I+RG +E IE
Sbjct: 310 NEDWGENGYFRIVRGRDECSIE 331
>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
Length = 334
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 61/136 (44%), Positives = 86/136 (63%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR+KW +C ++ + DQ NCGSCWA ++A +DRLCIA++G F +S + +
Sbjct: 85 IPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW + +G+VTGG+Y+S EGCQPY ++PC G N T
Sbjct: 145 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPCPLDEYG---NNTC 201
Query: 306 LGKL--KTPECKQNCY 319
GK K C Q CY
Sbjct: 202 SGKPAEKNHRCTQMCY 217
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 59/98 (60%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
HY + A+ + + +GP+ A F VY DF YKSGVY + +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E +PYWL+ NSWND WGD G FKI RG NE +
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGTD 326
>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
Length = 256
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/157 (40%), Positives = 94/157 (59%), Gaps = 6/157 (3%)
Query: 164 NRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
++V N ++DD N + +P FDAR+KW C ++ + DQ NCGS WA+S ++A
Sbjct: 9 DKVNYNMYKNDDHA----DNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSA 64
Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
+DRLC+A+NG F +SA+ I C C GCNGG+P AW+ + ++G+VTGG+Y S E
Sbjct: 65 FADRLCVATNGDFNQLLSAEEITFCCHKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGE 124
Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY 319
GC+PY + PC + G C+ +C + CY
Sbjct: 125 GCEPYRVPPCPYDKDGK-NTCSGQPMEPNHKCSKKCY 160
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 49/83 (59%), Gaps = 1/83 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
Y + + + + + +GP+ A F VY DF YKSG+Y + S +G H+V+++GW
Sbjct: 173 YTRDDYYLTYRGIQKDVINYGPIEASFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGW 232
Query: 123 GVENDIPYWLVANSWNDHWGDHG 145
G E + YWL+ NSWN WGD G
Sbjct: 233 GEEYGVLYWLMVNSWNADWGDKG 255
>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
Length = 342
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 73/99 (73%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY K A+ VP + ++I HGP+ + F+VY+DFL YKSG+Y+H G IG+H VR++
Sbjct: 234 HYGKVAYNVPNNEDSIKKEIMMHGPVGSFFTVYSDFLNYKSGIYKHMKGTEIGVHTVRIV 293
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVE PYWL+ANSWN+ WG+ G F+ILRG++E DIE
Sbjct: 294 GWGVEKGTPYWLIANSWNEGWGEKGYFRILRGKDECDIE 332
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 88/155 (56%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R+KW +C S+ I DQS CGS WA + +SDR+CI S G + ++SA ++
Sbjct: 90 IPSTFDSRKKWSQCKSISSIHDQSRCGSGWAFAAVEVMSDRICIQSKGEKSVELSAVDLL 149
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG+P AW +W GVVTG + GCQPY CEH+ G C
Sbjct: 150 SCCRECGLGCLGGFPGSAWDYWVEEGVVTGSSGENHTGCQPYPFPKCEHNTTGKYPACG- 208
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+TP+C++ C Y++ Y+ D GK A+ V
Sbjct: 209 QKIYETPKCQKKC-QKGYKTPYKKDKHYGKVAYNV 242
>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 337
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 53/106 (50%), Positives = 75/106 (70%), Gaps = 1/106 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR KW C ++ + DQ NCGSCWAV+ ++A +DRLC+A+ G F +SA+ I
Sbjct: 88 IPEHFDARNKWVYCDTIGRVRDQGNCGSCWAVATSSAFADRLCVATTGDFNELLSAEEIT 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
C C +GC+GG+P AW+ + +G+VTGGDYNS EGC+PY + P
Sbjct: 148 FCCHTCGFGCHGGYPIKAWKRFSTHGLVTGGDYNSGEGCEPYRVPP 193
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGW 122
Y + + + + + + +GP+ A F VY DF YKSGVY + + +G HAV+++GW
Sbjct: 230 YTRDYYYLTYGSIQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNASYLGGHAVKLIGW 289
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E+ PYWL+ NSWN WGD+G FKI RG NE ++
Sbjct: 290 GEEDGTPYWLMVNSWNTQWGDNGFFKIRRGTNECGVD 326
>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
Length = 569
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 92/154 (59%), Gaps = 10/154 (6%)
Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FDAR +P C + H+ DQ +CGSCWA + A +DRLCI S G +SAQH
Sbjct: 274 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHT 333
Query: 246 VAC--TPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNS---QEGCQPYTLAPCEHHVQG 298
+C +C +GCNGG P +AWR++ GVVTGGD+++ C PY + C HH +
Sbjct: 334 TSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAKA 393
Query: 299 PLQNC-TLLGKLKTPECKQNCYNPSY-ESTYRFD 330
P +C L KTP+C+++C +Y ++ + FD
Sbjct: 394 PFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFD 427
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/125 (40%), Positives = 68/125 (54%), Gaps = 4/125 (3%)
Query: 40 KKKKKKKKKKKKRLYLPTSIPLSHYFKKA----HMVPRCNAMRQIYEHGPLVAIFSVYAD 95
+K K +K +++ Y P KA + R + R + HGP+ F VY D
Sbjct: 405 RKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTHGPVSGAFMVYED 464
Query: 96 FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
FL YKSGVY+H G +G HA++++GWG EN YW NSWN +WGD G FKI G+
Sbjct: 465 FLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGDGGQFKIAMGQCG 524
Query: 156 ADIEM 160
D EM
Sbjct: 525 IDGEM 529
>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
Length = 332
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 86/136 (63%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FDAR+KW +C ++ + DQ CGSCWA ++A +DRLCIA++G F +SA+ +
Sbjct: 83 IPKFFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGDFNELLSAEELT 142
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW + +G+VTGG+Y+S EGCQPY ++PC G N T
Sbjct: 143 FCCHTCGYGCHGGYPIKAWERFKKHGLVTGGNYDSSEGCQPYRVSPCPLDEYG---NNTC 199
Query: 306 LGK--LKTPECKQNCY 319
GK K C + CY
Sbjct: 200 RGKPAEKNHRCTRMCY 215
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
+ + A+ + + + +GP+ A + VY DF YKSGVY + +G HAV+++GW
Sbjct: 228 FTRDAYYLTYGTIQKDVMTYGPIEASYEVYDDFPSYKSGVYVRTENATYLGGHAVKLIGW 287
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E +PYWL+ NSWND WGD G FKI RG NE I+
Sbjct: 288 GEEYGVPYWLMVNSWNDQWGDRGLFKIRRGTNECGID 324
>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
Length = 345
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 62/127 (48%), Positives = 79/127 (62%), Gaps = 13/127 (10%)
Query: 166 VEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAIS 225
VE EDDD+ P +FDAR W C SLRHI DQ+NCGSCWAVS A+A+S
Sbjct: 84 VENADDEDDDI-----------PESFDARTHWANCTSLRHIRDQANCGSCWAVSTASALS 132
Query: 226 DRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGC 284
DR+CIAS G IS+ IV+C C +GC+GGWP A+ ++ G VT G+ S++GC
Sbjct: 133 DRICIASKGETQLHISSIDIVSCCKLCGYGCDGGWPIEAFDYFSRQGAVT-GETTSKDGC 191
Query: 285 QPYTLAP 291
+PY P
Sbjct: 192 RPYPFHP 198
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 44/76 (57%), Positives = 59/76 (77%)
Query: 84 GPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGD 143
GP+VA+F+VY DF YK G+Y H G + G HA++++GWGVEN +PYWL+ANSW+D WG+
Sbjct: 260 GPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHAIKIIGWGVENGLPYWLIANSWHDDWGE 319
Query: 144 HGTFKILRGENEADIE 159
G F+I+RG NE IE
Sbjct: 320 QGLFRIVRGINECGIE 335
>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
Length = 569
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 92/154 (59%), Gaps = 10/154 (6%)
Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FDAR +P C + H+ DQ +CGSCWA + A +DRLCI S G +SAQH
Sbjct: 274 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHT 333
Query: 246 VAC--TPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNS---QEGCQPYTLAPCEHHVQG 298
+C +C +GCNGG P +AWR++ GVVTGGD+++ C PY + C HH +
Sbjct: 334 TSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAKA 393
Query: 299 PLQNC-TLLGKLKTPECKQNCYNPSY-ESTYRFD 330
P +C L KTP+C+++C +Y ++ + FD
Sbjct: 394 PFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFD 427
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/125 (40%), Positives = 68/125 (54%), Gaps = 4/125 (3%)
Query: 40 KKKKKKKKKKKKRLYLPTSIPLSHYFKKA----HMVPRCNAMRQIYEHGPLVAIFSVYAD 95
+K K +K +++ Y P KA + R + R + HGP+ F VY D
Sbjct: 405 RKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTHGPVSGAFMVYED 464
Query: 96 FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
FL YKSGVY+H G +G HA++++GWG EN YW NSWN +WGD G FKI G+
Sbjct: 465 FLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGDGGQFKIAMGQCG 524
Query: 156 ADIEM 160
D EM
Sbjct: 525 IDGEM 529
>gi|189308104|gb|ACD86936.1| cysteine protease [Caenorhabditis brenneri]
Length = 210
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 57/111 (51%), Positives = 76/111 (68%), Gaps = 1/111 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR +WP C S+ +I DQS+CGSCWA + A A SDR CIASNG +SA+ ++
Sbjct: 81 IPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
+C NC +GC GG+P AW++ +G TGG Y +Q GC+PY+LAPC V
Sbjct: 141 SCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETV 191
>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
Length = 332
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 90/154 (58%), Gaps = 10/154 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAREKWP C S+ I +Q CG+CWAV+ + +SDRLCI S G F +++A+ ++
Sbjct: 85 IPEFFDAREKWPYCKSISTIKNQGLCGACWAVATVSVMSDRLCIHSEGKFDVELAAEDLM 144
Query: 247 ACTPNCW-GCNGGWPQ-LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C +C GCNGG+ ++++W G+V+G YN+ +GC+PY PC + P C
Sbjct: 145 GCCKDCGNGCNGGFLDGTSFQYWVDVGLVSGAAYNNTDGCKPYPFKPCLY----PFVGCH 200
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
KTP C +C Y+ TYR D G A+
Sbjct: 201 ---PEKTPSCTHHC-TEGYDGTYRRDKYYGSAAY 230
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
+Y A+ +P M Q I +GP+ + FSVY D YK+GVYQH G +G HAVR++
Sbjct: 224 YYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLI 283
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E +PYWL+ANS+ + WG+HG FK LRG N IE
Sbjct: 284 GWGKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIE 322
>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
Length = 372
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 68/180 (37%), Positives = 97/180 (53%), Gaps = 20/180 (11%)
Query: 156 ADIEMGF---NNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNC 212
+++EM F + + S +D+ L G +P +FDAR+ WP C S++ I +Q+ C
Sbjct: 46 SELEMKFKVMDLKFSEISPKDEPLTVQGVY----VPISFDARDHWPNCKSIKLIRNQAYC 101
Query: 213 GSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFWGHN 270
G+CWA A ISDR+CI S G IS + I++C + GC GG+P +FW ++
Sbjct: 102 GACWAFGAAEIISDRICIQSGGAHQPIISVEDILSCCGSSCGEGCKGGYPLEGLKFWMNS 161
Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
GVVTGGDYN GCQPYT PC + TP C++ C E+TY+ D
Sbjct: 162 GVVTGGDYNGT-GCQPYTFPPCS----------SCEASKSTPSCQKKCQTGYLEATYKND 210
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/81 (53%), Positives = 55/81 (67%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+IY +GP+ + V+ DF QYKSGVY + G G HAV+++GWG EN + YWLVANSW
Sbjct: 263 EIYNNGPVEVSYRVFEDFYQYKSGVYHYVSGKLTGAHAVKIIGWGTENKVDYWLVANSWG 322
Query: 139 DHWGDHGTFKILRGENEADIE 159
+G+ G FKI RG NE IE
Sbjct: 323 TDFGEKGFFKIRRGTNECGIE 343
>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
Length = 339
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 87/134 (64%), Gaps = 9/134 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R++WP C SLR I +Q CGSCWAV+ A+ +SDR+CI +NG I+A+ ++
Sbjct: 92 IPESFDSRDRWPNCDSLREIRNQGTCGSCWAVAAASVMSDRVCIHTNGTRNVAIAAEDLM 151
Query: 247 ACTPNCW-GCNGGWPQ-LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
C +C GC GG+ ++++W G+V+GG YNS EGC+PY PC + P +C
Sbjct: 152 GCCADCGNGCEGGFLDGTSFQYWVDAGLVSGGAYNSTEGCKPYPFKPCLY----PFTDCH 207
Query: 305 LLGKLKTPECKQNC 318
+ ++P+CK +C
Sbjct: 208 ---REESPKCKHHC 218
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 64/94 (68%), Gaps = 2/94 (2%)
Query: 68 AHMVPRCNAM--RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 125
A+ VPR + +I +GP+ F VY D YKSGVY+H +G+ +G HAVR++GWG E
Sbjct: 236 AYSVPRDERVIRYEIMTNGPVEGGFDVYEDVFLYKSGVYRHVYGEHVGKHAVRIIGWGRE 295
Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
IPYWL++NS+ + WGDHG FKI+RG N IE
Sbjct: 296 GGIPYWLISNSYGEDWGDHGYFKIVRGINHLGIE 329
>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
Length = 319
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 91/158 (57%), Gaps = 15/158 (9%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
N + +P+ FDAR+KWP+C SL I DQ +CGSCWA + +SDR+CI S+G SA
Sbjct: 77 NPQDIPKTFDARKKWPKCDSLNRIRDQGSCGSCWAFAAVETMSDRICIHSSGAKKFFFSA 136
Query: 243 QHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
+ +++C C C+GG+ A+ F+ GVV+GGD NS EGC+PYT + H +G
Sbjct: 137 EDLLSCCTACGSCSGGYMMAAFDFYIKQGVVSGGDLNSNEGCRPYT---ADAHDKG---- 189
Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C ++C Y ++Y D G K ++V
Sbjct: 190 -------VTPSCTKSC-RKGYPTSYSSDKHYGSKDYIV 219
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 62/99 (62%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY K ++V N +I +GP++ F VY DF Y SGVY H G+ G H V+++
Sbjct: 211 HYGSKDYIVDAGVSNIQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIV 270
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E + YWL+ANSW WG+HG FKILRG+NE IE
Sbjct: 271 GWGTEKEQDYWLIANSWGSSWGEHGFFKILRGKNECGIE 309
>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
Length = 334
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 83/134 (61%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR+KW +C ++ + DQ NCGSCWA ++A +DRLCIA++G F +S + +
Sbjct: 85 IPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW + +G+VTGG+Y+S EGCQPY + PC G C+
Sbjct: 145 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGN-NTCSG 203
Query: 306 LGKLKTPECKQNCY 319
K C Q CY
Sbjct: 204 KPAEKNHRCTQMCY 217
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 58/94 (61%), Gaps = 1/94 (1%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
HY + A+ + + +GP+ A F VY DF YKSGVY + +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
WG E +PYWL+ NSWND WGD G FKI RG NE
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNE 322
>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
Length = 317
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 68/158 (43%), Positives = 87/158 (55%), Gaps = 13/158 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR +WP C S++ I +Q+ CGSCWA A +SDR+CIAS G IS ++
Sbjct: 75 IPTYFDARTRWPNCRSIKMIRNQATCGSCWAFGAAEVMSDRICIASMGTKQPIISPTDLL 134
Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C N +GC G P A+R+W GVVTGGDY GC+PY APC CT
Sbjct: 135 SCCGNFCGYGCKGASPLQAFRWWNKKGVVTGGDYRG-SGCKPYPFAPCTA------LPCT 187
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVLM 342
K +TP C NC P+Y Y D G A++V M
Sbjct: 188 ---KSETPRCSLNC-QPAYSKAYSKDKYFGTPAYIVGM 221
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 61/84 (72%)
Query: 76 AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
A++ +GP+ A F VY DF Y+SGVY+H G +G HAV+++GWG++N PYWL+AN
Sbjct: 225 AIQTEITNGPVEAAFIVYDDFNHYRSGVYRHVAGKLVGGHAVKIIGWGIQNGAPYWLMAN 284
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SW +WG++G FK+LRG +E IE
Sbjct: 285 SWGPYWGENGFFKMLRGVDECGIE 308
>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 335
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 83/134 (61%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R +W C ++ + +Q NCGSCWA A +DRLCIA++G F ISA+ +
Sbjct: 84 VPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATDGEFNELISAEELT 143
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG P AW+++ +GVVTGG+YN+ +GCQPY + PC +G +C+
Sbjct: 144 FCCHTCGFGCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRVPPCVRDDEGH-NSCSG 202
Query: 306 LGKLKTPECKQNCY 319
+ +C + CY
Sbjct: 203 QPTERNHKCSKKCY 216
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 47/100 (47%), Positives = 62/100 (62%), Gaps = 2/100 (2%)
Query: 62 SHY-FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRV 119
+HY K A+ + + +GP+ A F VY DF Y+SGVYQ S +G HAV++
Sbjct: 226 NHYKTKDAYYLSNTTMQKDTMVYGPIEASFDVYDDFTSYESGVYQKTENASYLGGHAVKM 285
Query: 120 LGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+GWGVE PYWL+ NSW + WGD G FKILRG +E +E
Sbjct: 286 IGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGVE 325
>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 276
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 91/151 (60%), Gaps = 9/151 (5%)
Query: 170 SSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
+ +DDD N + +P FDAR+KW C ++ + DQ +CGS WA+S ++A SDRLC
Sbjct: 15 TGDDDD-------NYQEIPIKFDARKKWLRCKTIGEVRDQGHCGSDWAMSTSSAFSDRLC 67
Query: 230 IASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+A+NG F +SA+ I C C GC+GG+P AW+ + +G+VTGG+Y S EGC+PY
Sbjct: 68 VATNGDFNQLLSAEEITFCCHTCGDGCSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYR 127
Query: 289 LAPCEHHVQGPLQNCTLLGKLKTPECKQNCY 319
+ PC + QG C+ K C + CY
Sbjct: 128 VPPCPNDDQGN-NTCSGQPMEKNHRCTRMCY 157
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 60/106 (56%), Gaps = 1/106 (0%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
Y + + + + + +GP+ A F VY DF YKSG+Y + S +G H+V+++GW
Sbjct: 170 YTRDHYYLTYRGIQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGW 229
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
G E + YWL+ NSWN WGD G FKI RG NE ++ V A
Sbjct: 230 GEEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECGVDNSTTGGVPA 275
>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 332
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 81/143 (56%), Gaps = 6/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+RE W C S+ +I DQSNCGSCWAVS A +SDR+C+ S G IS I+
Sbjct: 95 IPESFDSREVWKSCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC C GCNGG AW + GVVTGG Y + C+PY L PC +H G +C
Sbjct: 155 ACCGSECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNH-GGKFWSCP 213
Query: 305 LLGKLKTPECKQNC---YNPSYE 324
+TP CK+ C Y YE
Sbjct: 214 RDHSFRTPACKKYCQYGYGKRYE 236
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/75 (52%), Positives = 51/75 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ ++GP+ A F Y DF Y G+Y H G G HAV+V+GWGVEN YW VANSW
Sbjct: 257 REMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVANSW 316
Query: 138 NDHWGDHGTFKILRG 152
+ WG++G F+ILRG
Sbjct: 317 STDWGENGYFRILRG 331
>gi|157058759|gb|ABV03137.1| cathepsin B-84 [Rhopalosiphum padi]
Length = 219
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 91/155 (58%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR +W C ++ + +Q NCGSCWA A +DRLC+A+NG F ISA+ +
Sbjct: 46 VPDFFDARIEWKYCKTIGEVRNQGNCGSCWAHGTTGAFADRLCVATNGDFNELISAEELT 105
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG P AW ++ +GVVTGG+YN+ +GCQPY + PC +G +C+
Sbjct: 106 FCCHTCGFGCNGGNPIRAWLYFKRHGVVTGGNYNTTDGCQPYKVPPCIRDEEGH-NSCSG 164
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ C ++CY + S Y+ K K A+ +
Sbjct: 165 QRTERNHRCSKSCYGNT-TSDYKNGHYKTKDAYYL 198
>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
Length = 339
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 88/155 (56%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR +WP+C ++ I DQ++CGSCWA + A+A+SDR+CI SNG +++A +
Sbjct: 86 LPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPL 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG+P AW +W G+VTGG + ++ GCQP+ C+H +
Sbjct: 146 SCCTYCGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCP 205
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP C + C Y TY D G ++ V
Sbjct: 206 HYTYPTPPCARAC-QTGYNKTYEQDKFYGNSSYNV 239
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 63/83 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M++I ++GP+ F+++ DF Y+SG+Y H G IG HAVR++GWGVEN + YWL+ANS
Sbjct: 247 MQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANS 306
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN+ WG++G F+++RG NE IE
Sbjct: 307 WNEEWGENGYFRMVRGRNECGIE 329
>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
Length = 255
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 57/112 (50%), Positives = 72/112 (64%), Gaps = 9/112 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFDAR WP+CPS+ HI DQS CGSCWA A+SDRLCIASNG ++SA+ ++
Sbjct: 15 IPDNFDARTNWPQCPSIAHIRDQSTCGSCWAFGAVEAMSDRLCIASNGTVKDELSAEDML 74
Query: 247 A-CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
+ C C GCNGG+P AWRF+ +G+ T Y PY PCEHH+
Sbjct: 75 SCCLVQCGMGCNGGFPTGAWRFFKMHGLTTESKY-------PYVFPPCEHHI 119
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 66/97 (68%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGW 122
++ K + V +I +GP+ A F+VY DFL Y+SGVY+H G +G HA++++GW
Sbjct: 146 YHGKSVYSVSPAKIQAEIMTNGPVEAAFTVYQDFLAYQSGVYRHVSGPELGGHAIKIMGW 205
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GVE YWLVANSWN+ WGD GTFKI RG++E IE
Sbjct: 206 GVEAGNKYWLVANSWNEDWGDKGTFKIARGDDECGIE 242
>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
Length = 340
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 58/99 (58%), Positives = 71/99 (71%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY K ++ VP Q I ++GP+ F+VYADF YKSGVY+ + D++G HA+R+L
Sbjct: 232 HYGKSSYSVPSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRIL 291
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEND+PYWLVANSWN WGD G FKILRG NE IE
Sbjct: 292 GWGVENDVPYWLVANSWNTEWGDKGYFKILRGSNECGIE 330
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 81/161 (50%), Gaps = 15/161 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIAD---QSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
+P FD+R++W + P H D + ++SDR CI S ++A
Sbjct: 88 IPAQFDSRQQWQDWP--HHPGDPGTKERADPVGHFGAVESMSDRHCIHSGAKNIVHLAAD 145
Query: 244 HIVACTPNCWGC----NGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
+++C CWGC NGG+P AW +W G+VTGG+Y++ EGC PY + C+HHV G
Sbjct: 146 DVLSC---CWGCGSGCNGGFPAAAWSYWVDKGIVTGGNYDTDEGCMPYPVPSCDHHVNGT 202
Query: 300 LQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
L C TP+C + C Y ++ D GK ++ V
Sbjct: 203 LGPCGQ--DPPTPKCVRLC-RKGYNVDFKDDKHYGKSSYSV 240
>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
Length = 134
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 61/110 (55%), Positives = 76/110 (69%), Gaps = 5/110 (4%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCNAMRQIYE----HGPLVAIFSVYADFLQYKSGVYQHNFG 109
Y P+ HY ++ V R A R+ ++ +GP+ A F+VY+DFLQYKSGVYQH G
Sbjct: 9 YSPSYKEDKHYGCSSYSVSR-GARRRSWQRSSKNGPVEAAFTVYSDFLQYKSGVYQHVAG 67
Query: 110 DSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
D +G HAVR+LGWGVEN PYWLV NSWN WGD+G FKILRG++ IE
Sbjct: 68 DMMGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIE 117
>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
Length = 375
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 88/154 (57%), Gaps = 12/154 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR++WP+C SL+ I +Q++CGSCWA A ISDR+CI SNG ISA+ I+
Sbjct: 95 LPDTFDARDQWPDCKSLKFIRNQASCGSCWAFGAAEVISDRVCIQSNGTQQPIISAEDIL 154
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C + GC GG+ A ++W ++GVVTGGDYN GC PY+ PC+ + P
Sbjct: 155 SCCGSTCGKGCQGGYTIEAMKYWMNSGVVTGGDYNGA-GCMPYSFPPCK---KSPCV--- 207
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
+ TP CK C + Y+ D A+
Sbjct: 208 ---EFSTPSCKTTCQEKYTTADYKNDKHFATSAY 238
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 63/101 (62%), Gaps = 4/101 (3%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+IY +GP+ A + V+ DF QYKSGVY H G+ +G HAV+++GWG EN + YWLVANSW
Sbjct: 253 EIYHNGPVEASYRVFEDFYQYKSGVYHHVSGNLVGGHAVKIIGWGTENGVDYWLVANSWG 312
Query: 139 DHWGDHGTFKILRGENEADIE----MGFNNRVEANSSEDDD 175
+G+ G FKI RG NE IE G N DDD
Sbjct: 313 TSFGEKGFFKIRRGTNECQIESNIVAGLAKLGTHNEKTDDD 353
>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
Length = 342
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 86/147 (58%), Gaps = 5/147 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FDAR++W C ++ + DQ NCGSCWA++ ++A +DRLCIA+N F +SA+ +
Sbjct: 90 IPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELT 149
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C + C+GG+P AW ++ +G+VTGGDY S EGC PY + PC G C
Sbjct: 150 FCCHLCGFACHGGYPIKAWSYFRRHGIVTGGDYQSGEGCAPYRVPPCFSEEDGN-NTCRG 208
Query: 306 LGKLKTPECKQNCYNP---SYESTYRF 329
K C + CY Y+ +RF
Sbjct: 209 QPMEKHHRCTRMCYGDQEIDYDDDHRF 235
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/97 (40%), Positives = 62/97 (63%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
+ + + + + + + +GP+ A VY DF YKSGVY+ + + +G HAV+++GW
Sbjct: 235 FTRDYYYLTYASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGW 294
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E+ +PYWL+ NSW++ WGD G FKI RG NE ++
Sbjct: 295 GEEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVD 331
>gi|161343831|tpg|DAA06096.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 194
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 63/128 (49%), Positives = 83/128 (64%), Gaps = 3/128 (2%)
Query: 171 SEDDDLETMGC-QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
SE D L T ++ LP ++D + W EC S+ I DQSNCGSCWA+S A+A S RLC
Sbjct: 48 SEKDTLLTYDSPAGSEPLPESYDVTQTWSECKSVVSIRDQSNCGSCWALSTASAFSGRLC 107
Query: 230 IASNGYFTGQISAQHIVACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
IASN F +S ++I +C C GCNGG P+ AW++ NG+ TGG+YNS EGCQPY
Sbjct: 108 IASNMDFNIVLSGEYINSCCNGKCGDGCNGGHPEKAWKYIKKNGLCTGGEYNSNEGCQPY 167
Query: 288 TLAPCEHH 295
++ PC +
Sbjct: 168 SIFPCPRN 175
>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
Length = 324
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/157 (40%), Positives = 91/157 (57%), Gaps = 18/157 (11%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
G+P +FDARE+WP C S+R I D+ CGSCWA + +SDRLC+AS G SA+
Sbjct: 82 SGIPDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAEE 141
Query: 245 IVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+V+C C GC GG+ +++W NG+ +GGDY S+ GC+PYT A V G
Sbjct: 142 VVSCCTACGGGCRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKPYTAA-----VSG----- 191
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+TP+C++ C + YE ++ DL+ A+ V
Sbjct: 192 ------ETPQCQKACVS-GYEKSWEKDLRHATSAYQV 221
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 43/82 (52%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I ++GP+ A VY DF Y +G+YQH G +G HAV+++GWG END+PYW+ ANSW
Sbjct: 230 REILDNGPVTAYMEVYEDFYSYGTGIYQHTSGSFVGGHAVKIIGWGSENDVPYWIAANSW 289
Query: 138 NDHWGDHGTFKILRGENEADIE 159
+G+ G F+ILRG N A IE
Sbjct: 290 GTGFGEDGFFRILRGSNCAGIE 311
>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 348
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 55/134 (41%), Positives = 81/134 (60%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR++WP C S++HI DQS+CGSCWAV+ A+A+SDR+C +NG +S ++
Sbjct: 94 IPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVAAASAMSDRVCALTNGRINRILSDTEVL 153
Query: 247 ACT-PNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C +C +GC GG+P A+ + G+ TGG Y ++ CQPY PC +H P
Sbjct: 154 SCCFGSCGFGCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGNHAHEPYYGPC 213
Query: 305 LLGKLKTPECKQNC 318
TP C++ C
Sbjct: 214 PDELWPTPTCRRTC 227
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 46/81 (56%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I GP+VA + VY DF YK GVY H G+ GLHAV+++GWG ND+PYWLVANSWN
Sbjct: 258 EIMTRGPVVATYKVYRDFDYYKKGVYIHREGEVTGLHAVKIIGWGKGNDVPYWLVANSWN 317
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD+G F+I+RG + +IE
Sbjct: 318 TDWGDNGYFRIVRGTDNCEIE 338
>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
Length = 332
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 81/143 (56%), Gaps = 6/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+RE W C S+ +I DQSNCGSCWAVS A +SDR+C+ S G IS I+
Sbjct: 95 IPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC C GCNGG AW + GVVTGG Y + C+PY L PC +H G +C
Sbjct: 155 ACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNH-GGKFWSCP 213
Query: 305 LLGKLKTPECKQNC---YNPSYE 324
+TP CK+ C Y YE
Sbjct: 214 RDHSFRTPACKKYCQYGYGKRYE 236
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 39/75 (52%), Positives = 50/75 (66%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ ++GP+ A F Y DF Y G+Y H G G HAV+V+GWGVEN YW VANSW
Sbjct: 257 REMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVANSW 316
Query: 138 NDHWGDHGTFKILRG 152
+ WG+ G F+ILRG
Sbjct: 317 STDWGEDGYFRILRG 331
>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 81/143 (56%), Gaps = 6/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+RE W C S+ +I DQSNCGSCWAVS A +SDR+C+ S G IS I+
Sbjct: 95 IPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC C GCNGG AW + GVVTGG Y + C+PY L PC +H G +C
Sbjct: 155 ACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNH-GGKFWSCP 213
Query: 305 LLGKLKTPECKQNC---YNPSYE 324
+TP CK+ C Y YE
Sbjct: 214 RDHSFRTPACKKYCQYGYGKRYE 236
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/75 (52%), Positives = 51/75 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ ++GP+ A F Y DF Y G+Y H G G HAV+V+GWGVEN YW VANSW
Sbjct: 257 REMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVANSW 316
Query: 138 NDHWGDHGTFKILRG 152
+ WG++G F+ILRG
Sbjct: 317 STDWGENGYFRILRG 331
>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
pulchellus]
Length = 338
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/159 (39%), Positives = 89/159 (55%), Gaps = 11/159 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR+ W +C S+ I DQS+CG+CWA AISDR+CI + G ISAQ ++
Sbjct: 83 LPESFDARQHWRKCNSIHVIRDQSSCGACWAFGAVEAISDRICIHTKGSVQVNISAQDLL 142
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
C C GC GG P AW F+ G+VTGG Y +++GCQPY++ + G P+
Sbjct: 143 TCCDYCRTGCKGGVPSYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRYTTTGLLPPPIN 202
Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ + + P CK+ C SY Y D G+K + +
Sbjct: 203 DLSPM-----PPCKREC-RKSYGKKYSEDKHYGEKVYTL 235
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 68/103 (66%), Gaps = 2/103 (1%)
Query: 63 HYFKKAHMVPRCNAM--RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY +K + + A +I+++GP+ A F+VYADF YKSGVYQ + G HA+R+L
Sbjct: 227 HYGEKVYTLSGDEAQIKTEIFKNGPVEADFAVYADFYSYKSGVYQAHSRVRCGSHAIRIL 286
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN 163
GWG EN +PYWL ANSW +HWGD G FKI RG NE IE N
Sbjct: 287 GWGTENGVPYWLAANSWTEHWGDKGYFKIRRGNNECGIEEDIN 329
>gi|402585445|gb|EJW79385.1| hypothetical protein WUBG_09708, partial [Wuchereria bancrofti]
Length = 190
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/147 (44%), Positives = 88/147 (59%), Gaps = 10/147 (6%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P FDAR +WP C S+ +A+Q CGSCWA+S A+ +SDRLCIA+N QISA+ +++
Sbjct: 49 PEQFDARLQWPLCWSVHQVANQGGCGSCWAISAASVMSDRLCIATNYSNQKQISAEDLIS 108
Query: 248 CTPNCWGCNGG-WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL- 305
C C GC G W A+ +W ++G+VTGGDY S EGC+PY AP + P C+
Sbjct: 109 CCAECGGCQGSNWALSAFIYWRNHGIVTGGDYGSFEGCKPYATAP---NCGSP---CSFE 162
Query: 306 -LGKLKTPECKQNCYNPSYESTYRFDL 331
K P C++ C P Y +Y DL
Sbjct: 163 YYRKKAAPICQKTC-QPLYGLSYEEDL 188
>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
Extends Along The Whole Active Site Cleft
Length = 205
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 65/83 (78%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ FSVY+DFL YKSGVYQH G+ +G HA+R+LGWGVEN PYWLV NS
Sbjct: 113 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 172
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG++ IE
Sbjct: 173 WNTDWGDNGFFKILRGQDHCGIE 195
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 49/82 (59%), Gaps = 3/82 (3%)
Query: 259 WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
+P AW FW G+V+GG YNS GC+PY++ PCEHHV G CT G+ TP+C + C
Sbjct: 27 FPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKCNKTC 84
Query: 319 YNPSYESTYRFDLKKGKKAHMV 340
P Y +Y+ D G ++ V
Sbjct: 85 -EPGYSPSYKEDKHFGCSSYSV 105
>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
Length = 379
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 101/196 (51%), Gaps = 3/196 (1%)
Query: 147 FKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHI 206
F+ G D F + +E L + + +P FDAR +WP CP++ I
Sbjct: 73 FRSFMGARAYDPWRYFMSVKRRQVNERRSLSSPSGFYSSSIPAEFDARLRWPNCPTIGEI 132
Query: 207 ADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWR 265
+Q +C SCWAV+ + +SDR+CI S ++SA ++++C C GC GG+P AW
Sbjct: 133 FEQGSCASCWAVAPTDVMSDRICIHSGSRHIVRLSAGNLLSCCKLCGKGCKGGFPGGAWM 192
Query: 266 FWGHNGVVTGGDYNSQEGCQPYTLAPC-EHHVQGPLQNCTLLGKLKTPECKQNCYNPSYE 324
W +G+VTGG Y+S GCQ Y PC + +G ++N EC++ C SY
Sbjct: 193 HWSKHGIVTGGSYSSDYGCQKYQFFPCYQPRTKGSIKNKCPKTDNTLLECRETC-RTSYN 251
Query: 325 STYRFDLKKGKKAHMV 340
+Y+ DL G+ + +
Sbjct: 252 KSYKQDLYYGESVYRI 267
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/81 (53%), Positives = 54/81 (66%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I E+GP+ A +Y DFL YK GVY+H G + HAV++ GWG E PYWL AN W+
Sbjct: 277 EIMENGPVQANLRIYEDFLHYKFGVYRHVHGQGLEYHAVKIFGWGTEGGTPYWLAANPWS 336
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG+ G FKILRG N A+IE
Sbjct: 337 KRWGNGGFFKILRGSNHAEIE 357
>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/140 (43%), Positives = 82/140 (58%), Gaps = 2/140 (1%)
Query: 166 VEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAIS 225
+E + ++++ L + +P +FD+REKW +CPSLR I DQSNCGSCWAVS A +S
Sbjct: 75 IERSYNQENVLPIANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMS 134
Query: 226 DRLCIASNGYFTGQISAQHIVACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
DRLCI S G +SA I+AC +GC+GG+ AW++ GVVTGG Y +
Sbjct: 135 DRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGN 194
Query: 284 CQPYTLAPCEHHVQGPLQNC 303
C+PY C H NC
Sbjct: 195 CKPYVFPQCGAHKGKAFNNC 214
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 56/84 (66%), Gaps = 1/84 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I + GP+ A F++Y DF Y GVY H G G H+++++GWGV+ + YWL+ANSW+
Sbjct: 259 EIMQKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIANSWS 318
Query: 139 DHWG-DHGTFKILRGENEADIEMG 161
WG D G F+++RG N DIE G
Sbjct: 319 TDWGEDGGYFRVVRGINNCDIEGG 342
>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
Length = 375
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 69/99 (69%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY + A+ VP+ M +++ GP A F++Y DF+QYKSGVY+H FG +G H+V+V+
Sbjct: 266 HYGRVAYTVPKDEERIMYEVFNFGPAQATFTMYTDFVQYKSGVYRHTFGVRVGTHSVKVM 325
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEND+ YWL ANSW WGD G FKI+RGE+ E
Sbjct: 326 GWGVENDVKYWLCANSWGAQWGDGGFFKIVRGEDHLSFE 364
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 56/156 (35%), Positives = 79/156 (50%), Gaps = 14/156 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR+KW CPS+ + +Q C S +AV+ + ++DR C+ S G A ++
Sbjct: 131 LPMSFDARQKWSYCPSMNMVRNQGCCDSSYAVAAVSTMTDRWCVHSEGKAQFNFGAYDVL 190
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GC+GG P W +W NG+ +GG + S EGCQ Y P C
Sbjct: 191 SCCHRCGFGCDGGVPSAVWHYWVENGITSGGAFGSHEGCQSY-----------PFDVCKK 239
Query: 306 LGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G TP C + C P Y TY D G+ A+ V
Sbjct: 240 SGDSNDTPRCLRFC-QPGYNVTYPEDKHYGRVAYTV 274
>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
Length = 349
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 55/133 (41%), Positives = 79/133 (59%), Gaps = 3/133 (2%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P+ FD+RE W C + HI DQ NCGSCW+ S A +DRLC+++ G F +S + +
Sbjct: 86 PKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAF 145
Query: 248 CTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
C +C GC GG+P AW+++ GV TGGDY+++EGC PY + PC + QG C
Sbjct: 146 CCMDCGKGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPC-YDEQGK-NTCGGK 203
Query: 307 GKLKTPECKQNCY 319
+ +C + CY
Sbjct: 204 PMERNHQCPKTCY 216
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 69/122 (56%), Gaps = 2/122 (1%)
Query: 51 KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNF- 108
K Y T++ + K +++ + Q + +GP+ A F VY DF YKSG+Y+
Sbjct: 213 KTCYGKTTVQDRYKTKNEYVINSIETIEQDLMTYGPVEASFDVYDDFSVYKSGIYRKTPK 272
Query: 109 GDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
G H+++++GWG EN PYWL NSW+ WGDHGTFKI++G NE IE + +
Sbjct: 273 AKYEGGHSIKIIGWGEENGTPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIPS 332
Query: 169 NS 170
S
Sbjct: 333 TS 334
>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
Length = 334
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 85/136 (62%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR+KW +C ++ + DQ CGSCWA ++A +DRLCIA++G F +SA+ +
Sbjct: 85 IPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELA 144
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW + +G+VTGG+Y+S EGCQPY + PC G N T
Sbjct: 145 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYG---NNTC 201
Query: 306 LGKL--KTPECKQNCY 319
GK K C + CY
Sbjct: 202 RGKPAEKNHRCTRMCY 217
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 60/98 (61%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
HY + A+ + I +GP+ A F VY DF YKSGVY + +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E +PYWL+ NSWND WGD G FKI RG NE I+
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 326
>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
Length = 339
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 83/154 (53%), Gaps = 4/154 (2%)
Query: 188 PRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
P FDAR+ WP C + H+ DQS CGSCWAVS A+ +SDRLC+ SNG +S I+
Sbjct: 85 PEKFDARDAWPYCREIIGHVRDQSRCGSCWAVSAASVMSDRLCVQSNGKIKLHVSDTDIL 144
Query: 247 ACTPNCWG--CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC G C+GGWP AW + GV TGGDY ++ C+PY PC +H
Sbjct: 145 ACCGEFCGDGCSGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPYAFHPCGNHENQVYYGVC 204
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
G TP C++ C Y Y+ D KK++
Sbjct: 205 PKGSWPTPRCEKFC-QRGYIKPYKKDKFYAKKSY 237
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 64/98 (65%), Gaps = 2/98 (2%)
Query: 64 YFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y KK++ +P I ++GP+ A F VY DF YK G+Y+H G G HAV+++G
Sbjct: 232 YAKKSYWLPNDEKEIRLDIMKNGPVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIG 291
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG +N YWL+ANSW+ WG+ G F+++RGEN+ +IE
Sbjct: 292 WGKDNGTDYWLIANSWSKDWGESGFFRMVRGENDCEIE 329
>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
Length = 122
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 53/83 (63%), Positives = 65/83 (78%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+ FSVY+DFL YKSGVYQH G+ +G HA+R+LGWGVEN PYWLV NS
Sbjct: 27 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 86
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKILRG++ IE
Sbjct: 87 WNTDWGDNGFFKILRGQDHCGIE 109
>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
Length = 278
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 87/155 (56%), Gaps = 2/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR +WP+C ++ I DQ++CGSCWA + A+A+SDR+CI SNG +++A +
Sbjct: 63 LPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPL 122
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC GG+P AW +W G+VTGG + ++ GCQP+ C+H +
Sbjct: 123 SCCTYCGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCP 182
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
P C + C Y TY D G ++ V
Sbjct: 183 HYTYPKPPCARAC-QTGYNKTYEQDKFYGNSSYNV 216
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 40/55 (72%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
M++I ++GP+ F+++ DF Y+SG+Y H G IG HAVR++GWGVEN + YW
Sbjct: 224 MQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVENGVNYW 278
>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
Length = 330
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 64/143 (44%), Positives = 80/143 (55%), Gaps = 7/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+RE W C S+ +I DQSN GSCWAVS A +SDR+C+ S G IS I+
Sbjct: 95 IPESFDSREVWKNCSSITYIRDQSNSGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC C GCNGG AW + GVVTGG Y + C+PY L PCE + G +C
Sbjct: 155 ACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYHLHPCE--ITGKFWSCP 212
Query: 305 LLGKLKTPECKQNC---YNPSYE 324
+TP CK+ C Y YE
Sbjct: 213 RDHSFRTPACKKYCQYGYGKRYE 235
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 32/64 (50%), Positives = 45/64 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ ++GP+ A F+ Y DF Y+ G+Y H++G G HAV+V+GWGVEN YW VANSW
Sbjct: 256 REMMKNGPVQAAFTTYEDFSFYRKGIYVHSYGRQRGAHAVKVVGWGVENGTKYWNVANSW 315
Query: 138 NDHW 141
+ W
Sbjct: 316 STDW 319
>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
Length = 356
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/160 (41%), Positives = 94/160 (58%), Gaps = 14/160 (8%)
Query: 183 NAKGLPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
+A LP++FD+R+++ +C + I DQSNCGSCWAVS A+ I DR+CIASNG IS
Sbjct: 104 DATKLPQHFDSRKQFTKCAKVIGTIQDQSNCGSCWAVSSASVIQDRICIASNGEQKVHIS 163
Query: 242 AQHIVAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
AQ I++C T GCNGG+P A+ + +GVVTG ++ +GC+PY P
Sbjct: 164 AQDILSCATDRSQGCNGGYPDEAFEHYAQSGVVTGSGNSANQGCKPYPFLP--------- 214
Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ T+ + TPEC + C N Y+ Y+ D G + V
Sbjct: 215 -HTTV--EYSTPECSKKCENYQYKKAYKQDKHFGMSVYNV 251
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 45/83 (54%), Positives = 58/83 (69%), Gaps = 2/83 (2%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
+I +GP+ A VY DF+ YKSGVYQ F +G HAVR++GWGV+ +PYWLVANS
Sbjct: 262 EIMNNGPVEANMIVYYDFMFYKSGVYQTVFPWPLGGHAVRIVGWGVDGPTKVPYWLVANS 321
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WG+ G F+I RG +E+ IE
Sbjct: 322 WNTDWGEDGYFRIRRGTDESYIE 344
>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
Length = 334
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 54/133 (40%), Positives = 79/133 (59%), Gaps = 3/133 (2%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P+ FD+R W C + HI DQ NCGSCW+ S A +DRLC+++ G F +S + +
Sbjct: 86 PQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAF 145
Query: 248 CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
C +C GC GG+P AW+++ GV TGGDY+++EGC PY + PC ++ QG C
Sbjct: 146 CCKDCGQGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPC-YNKQGK-NTCGGQ 203
Query: 307 GKLKTPECKQNCY 319
+ +C + CY
Sbjct: 204 PMERNHQCPKTCY 216
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 68/122 (55%), Gaps = 2/122 (1%)
Query: 51 KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNFG 109
K Y T++ + K + + + Q + +GP+ A F VY DF YKSG+Y+
Sbjct: 213 KTCYGKTTVQNRYKTKSEYSINSIKTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPK 272
Query: 110 DSI-GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
G H+++++GWG EN YWL NSW+ WG+HGTFKI++G NE IE + +
Sbjct: 273 AKYEGRHSIKIIGWGQENGTTYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPS 332
Query: 169 NS 170
+S
Sbjct: 333 SS 334
>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
Length = 215
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 83/139 (59%), Gaps = 2/139 (1%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
N + +PR FDAR+KW C ++ + DQ NC S WA+S ++A +DRLC+A+NG F +S
Sbjct: 1 DNYQEIPRKFDARKKWLRCKTIGEVRDQGNCASGWALSTSSAFADRLCVATNGDFNQLLS 60
Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
A+ I C C GC GG+P AW+ + +G+VTGG+Y S EGC+PY + PC + G
Sbjct: 61 AEEITFCCHTCGNGCYGGYPIRAWKSFKKHGLVTGGNYKSGEGCEPYRVPPCPYDEYGN- 119
Query: 301 QNCTLLGKLKTPECKQNCY 319
C+ C + CY
Sbjct: 120 NTCSGQPMESNHRCTRMCY 138
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 32/49 (65%), Gaps = 1/49 (2%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVE 125
+ + +GP+ A F VY DF YKSG+Y + S +G H+V+++GWG E
Sbjct: 165 KDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEE 213
>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
Length = 721
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 81/153 (52%), Gaps = 15/153 (9%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P +FDAR+ WP C S++ I DQ+ CGSCWA A ISDR+CI SNG IS + I+
Sbjct: 81 PTSFDARDYWPNCKSIKMIRDQAYCGSCWAFGAAEVISDRICIQSNGTDQPIISPEDILT 140
Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE--HHVQGPLQNCTL 305
C N GC GG+ A +FW GVVTGGD+ +GC PY+ C H Q
Sbjct: 141 CCTNSHGCQGGFVLEAMKFWKSKGVVTGGDFQG-DGCIPYSYGSCSDCHTAQ-------- 191
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
TP+CK C ++ Y+ D G A+
Sbjct: 192 ----TTPKCKNECQVKYTKNEYKEDKYYGSSAY 220
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 67/101 (66%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVPRCNAMR----QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVR 118
+Y A+ + NA+R +I +GP+ A + VY DF YKSGVY++ G +G HAV+
Sbjct: 214 YYGSSAYRLSTSNAVRTIQSEILRNGPVEATYQVYEDFYYYKSGVYEYISGRHMGGHAVK 273
Query: 119 VLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
++GWGVE ++ YWL+ANSW +G++G FK+ RG NE IE
Sbjct: 274 IIGWGVEENVNYWLIANSWGTGFGENGFFKMRRGNNECGIE 314
>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 64/143 (44%), Positives = 80/143 (55%), Gaps = 6/143 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R W C S+ +I DQSNCGSCWAVS A +SDR+C+ S G IS I+
Sbjct: 95 IPESFDSRVVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC C GCNGG AW + GVVTGG Y + C+PY L PC +H G +C
Sbjct: 155 ACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNH-GGKFWSCP 213
Query: 305 LLGKLKTPECKQNC---YNPSYE 324
+TP CK+ C Y YE
Sbjct: 214 RDHSFRTPACKKYCQYGYGKRYE 236
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/75 (50%), Positives = 50/75 (66%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ ++GP+ A Y DF Y+ G+Y H G G HAV+V+GWGVEN YW VANSW
Sbjct: 257 REMMKNGPVQAASITYEDFSFYRRGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVANSW 316
Query: 138 NDHWGDHGTFKILRG 152
+ WG+ G F+ILRG
Sbjct: 317 STDWGEDGYFRILRG 331
>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
Length = 323
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 86/156 (55%), Gaps = 15/156 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R +W C S+ I DQ+ CGSCWA S A ISDR+CIA+ G IS ++
Sbjct: 81 IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC N GC GG+P A+R+W GVVTGGD+ GC+PY APC + P +
Sbjct: 141 ACCGNSCGDGCKGGYPIQAFRWWNSRGVVTGGDFRGS-GCRPYPFAPC---ISCPEE--- 193
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
KTP C +C Y + Y D + G A+ V
Sbjct: 194 -----KTPTCSLSC-QFGYSTAYAKDKRFGVSAYAV 223
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 46/95 (48%), Positives = 64/95 (67%), Gaps = 2/95 (2%)
Query: 67 KAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
A+ V R A Q I +GP+V F++Y D +YKSGVY+H G +G HA++++GWG
Sbjct: 219 SAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT 278
Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+N IPYWL+ANSW +WG++G K+ RG NE IE
Sbjct: 279 QNGIPYWLIANSWGANWGENGFLKMRRGVNECGIE 313
>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 58/147 (39%), Positives = 85/147 (57%), Gaps = 5/147 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P+ FDAR++W C ++ + DQ NCGSCWA++ ++A +DRLCIA+N F +SA+ +
Sbjct: 90 IPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELT 149
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C + C+GG+P AW ++ +G+VTGG Y S EGC PY + PC G C
Sbjct: 150 FCCHLCGFACHGGYPIKAWSYFRRHGIVTGGGYQSGEGCAPYRVPPCFSEEDGN-NTCRG 208
Query: 306 LGKLKTPECKQNCYNP---SYESTYRF 329
K C + CY Y+ +RF
Sbjct: 209 QPMEKHHRCTRMCYGDQEIDYDDDHRF 235
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/97 (40%), Positives = 62/97 (63%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
+ + + + + + + +GP+ A VY DF YKSGVY+ + + +G HAV+++GW
Sbjct: 235 FTRDYYYLTYASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGW 294
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G E+ +PYWL+ NSW++ WGD G FKI RG NE ++
Sbjct: 295 GEEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVD 331
>gi|17510377|ref|NP_490763.1| Protein Y65B4A.2 [Caenorhabditis elegans]
gi|373220066|emb|CCD71920.1| Protein Y65B4A.2 [Caenorhabditis elegans]
Length = 421
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 60/157 (38%), Positives = 85/157 (54%), Gaps = 10/157 (6%)
Query: 174 DDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASN 233
D+LE N+ +P+NFDAR+KWP CPS+ ++ +Q CGSC+AV+ A SDR CI SN
Sbjct: 128 DELENF---NSSDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSN 184
Query: 234 GYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE 293
G F +S + I+ C C C GG P A +W + G+VTGG ++GC+PY+ +
Sbjct: 185 GTFKSLLSEEDIIGCCSVCGNCYGGDPLKALTYWVNQGLVTGG----RDGCRPYSF---D 237
Query: 294 HHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
P T + C + C N Y+ Y D
Sbjct: 238 LSCGVPCSPATFFEAEEKRTCMKRCQNIYYQQKYEED 274
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 50/88 (56%), Gaps = 10/88 (11%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQ----HNFGDSIGL-HAVRVLGWGVEND-IPYW 131
++I +GP F V +FL Y SGV++ F D I H VR++GWG +D YW
Sbjct: 327 KEILLYGPTTMAFPVPEEFLHYSSGVFRPYPTDGFDDRIVYWHVVRLIGWGESDDGTHYW 386
Query: 132 LVANSWNDHWGDHGTFKILRGENEADIE 159
L NS+ +HWGD+G FKI N D+E
Sbjct: 387 LAVNSFGNHWGDNGLFKI----NTDDME 410
>gi|227018340|gb|ACP18836.1| cysteine proteinase 3 [Chrysomela tremula]
Length = 190
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 55/107 (51%), Positives = 78/107 (72%), Gaps = 1/107 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFDARE WPEC S+R I DQS+CGSCWAV+ A A+SDR+CI S G +S + ++
Sbjct: 83 IPENFDARENWPECESIRMIRDQSDCGSCWAVAAAAAVSDRICIYSYGANQTIVSDEDLL 142
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPC 292
+C +C +GC+GG+ AW +W ++G+V+GG YNS GC+ Y++ PC
Sbjct: 143 SCCDDCGFGCDGGYSWEAWNYWKNDGIVSGGPYNSTRGCKAYSMQPC 189
>gi|358341865|dbj|GAA49436.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 515
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 86/158 (54%), Gaps = 8/158 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR+ W +CPS+R I QS+CGSCWA A+SDRLCI S + +SA ++
Sbjct: 81 IPMQFDARKYWLKCPSIREIRGQSSCGSCWAFGAVEAMSDRLCIHSGAKYQKGLSAVDLL 140
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG--PLQNC 303
+C C +GC+GG+P AW +W +G+VTGG + GC+ Y C H +G PL
Sbjct: 141 SCCWKCGYGCDGGFPAQAWNYWSTDGIVTGGSKENPSGCRSYPFPSCSHDERGRHPLCPS 200
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
+ TP C + C Y +L K ++ VL
Sbjct: 201 EI---YHTPRCTKKCDTDKLH--YSAELTKANSSYNVL 233
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVY 104
M +I +GP+ A+F VY DFLQY+ G+Y
Sbjct: 240 MMEIMNNGPVEAVFDVYEDFLQYEKGIY 267
>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
Length = 342
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 73/195 (37%), Positives = 102/195 (52%), Gaps = 13/195 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA-CTPNCWGCNGGWPQLAWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++ C GC GG+P AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCKDCGGGCKGGFPGQAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W G+VTGG + GCQPY CEH +G C KTP+CKQ C Y++
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228
Query: 327 YRFDLKKGKKAHMVL 341
Y D G + + V+
Sbjct: 229 YEQDKHYGDQRYNVI 243
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 61/82 (74%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+++RG +E IE
Sbjct: 311 NEDWGEKGLFRMVRGRDECSIE 332
>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 329
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 64/144 (44%), Positives = 82/144 (56%), Gaps = 22/144 (15%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG--YFTGQISAQH 244
+P +FD+R+KWP C S+ I DQS CGS WAVS AISDR+CI S G + G
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAISDRICIQSGGKQSYCGS----- 144
Query: 245 IVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 145 ---------GCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG 195
Query: 305 LLGKL-KTPECKQNC---YNPSYE 324
KL KTP+CKQ C YN SYE
Sbjct: 196 --DKLYKTPQCKQTCQKGYNTSYE 217
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY ++ V ++ Q I HGP+ A +Y DFL YKSG+Y++ G I HAVR++
Sbjct: 221 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 280
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN YWL AN+WN+ WG+ G F+I+RG NE IE
Sbjct: 281 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 319
>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
Length = 340
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 84/136 (61%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR+KW +C ++ + DQ CGSCWA ++A +DRLCIA++G F +S + +
Sbjct: 88 IPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW + +G+VTGG+Y+S EGCQPY + PC G N T
Sbjct: 148 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYG---NNTC 204
Query: 306 LGKL--KTPECKQNCY 319
GK K C + CY
Sbjct: 205 RGKPAEKNHRCTRMCY 220
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 60/98 (61%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
HY + A+ + I +GP+ A F VY DF YKSGVY + +G HAV+++G
Sbjct: 232 HYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 291
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E +PYWL+ NSWND WGD G FKI RG NE I+
Sbjct: 292 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 329
>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
Length = 335
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 92/171 (53%), Gaps = 9/171 (5%)
Query: 161 GFNNRV-EANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVS 219
G+ N + E +DD L T K +FDARE W C + H+ DQ NCGSCWA
Sbjct: 63 GYKNYLNEVEIKKDDPLYTKNNDTIK----HFDAREDWKICKQIGHVRDQGNCGSCWAFG 118
Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
A +DRLC+A+ G F Q+SA+ + C C GC GG P AW+++ +G+ TGGDY
Sbjct: 119 TTGAFADRLCVATGGGFNEQLSAEKLTFCCWTCGLGCQGGNPIKAWKYFKRHGITTGGDY 178
Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYR 328
S EGC PY + PC + QG C +C + CY N + E+ Y+
Sbjct: 179 GSNEGCAPYKVPPC-YDDQGEFL-CQGKPTEHNHKCPRACYGNSTVENRYK 227
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 70/111 (63%), Gaps = 2/111 (1%)
Query: 51 KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHN-F 108
+ Y +++ + K +++ + Q I ++GP+ A F VY DF+ YKSG+YQ
Sbjct: 214 RACYGNSTVENRYKVKSIYVLDSSKTIEQDIRKYGPVEASFDVYDDFITYKSGIYQKTPN 273
Query: 109 GDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G H+V+++GWG E+ IPYWL+ NSW+ WG+ GTF+I++G NE IE
Sbjct: 274 AFYVGGHSVKLIGWGEEDGIPYWLLVNSWSKFWGEQGTFRIIKGRNECGIE 324
>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
Length = 340
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 84/136 (61%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR+KW +C ++ + DQ CGSCWA ++A +DRLCIA++G F +S + +
Sbjct: 88 IPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 147
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW + +G+VTGG+Y+S EGCQPY + PC G N T
Sbjct: 148 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYG---NNTC 204
Query: 306 LGKL--KTPECKQNCY 319
GK K C + CY
Sbjct: 205 RGKPAEKNHRCTRMCY 220
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 60/98 (61%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
HY + A+ + I +GP+ A F VY DF YKSGVY + +G HAV+++G
Sbjct: 232 HYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 291
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E +PYWL+ NSWND WGD G FKI RG NE I+
Sbjct: 292 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 329
>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
Length = 121
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 53/87 (60%), Positives = 66/87 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M+++ +HGP+ F VYADF YKSGVYQH G +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 27 MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 86
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
WN WGD+G FKI+RG+NE IE N
Sbjct: 87 WNTDWGDNGYFKIIRGKNECGIESDVN 113
>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 405
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 84/154 (54%), Gaps = 4/154 (2%)
Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FDA EKWPEC + +I DQSNCGSCWAVS A +SDR+C+A+NG IS
Sbjct: 72 IPESFDAAEKWPECAEVFNNIRDQSNCGSCWAVSSAGVMSDRICVATNGKVKVSISGIAT 131
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCT 304
+C GCNGG ++A+ + NG TG + + +GCQPY C HHV C
Sbjct: 132 ASCVGGD-GCNGGLEEVAFEKFIENGFPTGSEVDKHQGCQPYPFKHCAHHVNSTEYPPCD 190
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
+ + K C C Y+ Y DL GK+ +
Sbjct: 191 SVPEYKADTCSHEC-QKDYDRKYEEDLYYGKEQY 223
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 53/86 (61%), Gaps = 2/86 (2%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVRVLGWGVENDIPYWLVANS 136
R+I +GP+ F+VY FL Y G+Y+ G+ I G HAVRV+GWGVEN YW +ANS
Sbjct: 233 REIMTNGPVAVSFTVYESFLYYSGGIYRSTPGERIKGYHAVRVVGWGVENGTKYWKIANS 292
Query: 137 WNDHWGDHGTFK-ILRGENEADIEMG 161
WN+ WG G +E+DIE G
Sbjct: 293 WNEQWGRERLLPHTPAGVDESDIEDG 318
>gi|157058761|gb|ABV03138.1| cathepsin B-84 [Myzus persicae]
Length = 220
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 55/134 (41%), Positives = 82/134 (61%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R +W C ++ + +Q NCGSCWA A +DRLCIA++G F ISA+ +
Sbjct: 47 VPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATDGEFNELISAEELT 106
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GCNGG P AW+++ +GVVTGG+YN+ +GCQP + PC +G +C+
Sbjct: 107 FCCHTCGFGCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPSRVPPCVRDDEG-HNSCSG 165
Query: 306 LGKLKTPECKQNCY 319
+ +C + CY
Sbjct: 166 QPTERNHKCSKKCY 179
>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
Length = 334
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 84/136 (61%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR+KW +C ++ + DQ CGSCWA ++A +DRLCIA++G F +S + +
Sbjct: 85 IPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW + +G+VTGG+Y+S EGCQPY + PC G N T
Sbjct: 145 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYG---NNTC 201
Query: 306 LGKL--KTPECKQNCY 319
GK K C + CY
Sbjct: 202 RGKPAEKNHRCTRMCY 217
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 60/98 (61%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
HY + A+ + I +GP+ A F VY DF YKSGVY + +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E +PYWL+ NSWND WGD G FKI RG NE I+
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 326
>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
Length = 287
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 93/166 (56%), Gaps = 8/166 (4%)
Query: 174 DDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASN 233
DD ++ +N+ L + FDARE+WPEC S+ I D S C S WA + A ++SDRLCI S
Sbjct: 16 DDGPSVPTENSD-LSQFFDARERWPECMSIPQINDISECKSSWAFAAAESMSDRLCINSG 74
Query: 234 GYFTGQISAQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289
G +SAQ +++C GC GG AW++WG +G+ TGG Y SQ GC+PY++
Sbjct: 75 GTINTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSI 134
Query: 290 APCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
APC V L TP C++ C + ++ Y D+ K +
Sbjct: 135 APCGKTVGNVTYPACTNTTLPTPSCEKKC---TSKNGYPVDIDKDR 177
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 60/99 (60%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY +P + + +GP+ F VY DFLQY +G+Y H G+ G +VR+L
Sbjct: 178 HYGASVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRIL 237
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG+ +PYWL+ANSW WG++GTF+ LRG NE +E
Sbjct: 238 GWGMYEGVPYWLLANSWGKEWGENGTFRALRGTNECGLE 276
>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 303
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 67/82 (81%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I ++GP+ A F+VY DFL YKSG+Y+H G+++G HA+R++GWGVEN PYWL+ANSW
Sbjct: 213 KEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKTPYWLIANSW 272
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+I+RG +E IE
Sbjct: 273 NEDWGENGYFRIVRGRDECSIE 294
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 68/145 (46%), Gaps = 28/145 (19%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGSC A A+S+R CI S G ++SA +
Sbjct: 89 IPSSFDSRKKWPRCKSIATIRDQSRCGSCCAFGAVEAMSERSCIQSGGKQNVELSAVDL- 147
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
G+VTG + GC+PY CEH +G C
Sbjct: 148 -----------------------EGIVTGSSKENNTGCEPYPFPKCEHFTKGQYPPCG-- 182
Query: 307 GKL-KTPECKQNCYNPSYESTYRFD 330
K+ KTP CK C Y+++Y D
Sbjct: 183 SKIYKTPRCKTTC-QKRYKTSYAQD 206
>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
Length = 194
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/128 (46%), Positives = 81/128 (63%), Gaps = 3/128 (2%)
Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGV 272
SCWAVS A A+SDR+CIAS G ISAQ +V+C C +GC+GGWP AW+F+ GV
Sbjct: 1 SCWAVSSAAAMSDRICIASKGVKQVLISAQDMVSCCSYCGYGCDGGWPIKAWQFFAREGV 60
Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLK 332
VTGG+Y Q C+PY + PC HH + P +TP CK+ C Y++TY+ D +
Sbjct: 61 VTGGNYGRQGCCRPYEITPCGHHGREPYYG-ECYDDAQTPRCKRKC-QSGYKTTYKKDKR 118
Query: 333 KGKKAHMV 340
G+KA+ +
Sbjct: 119 YGRKAYQL 126
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 2/74 (2%)
Query: 47 KKKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVY 104
K+K + Y T Y +KA+ +P R+I HGP+VA ++VY DF Y G+Y
Sbjct: 102 KRKCQSGYKTTYKKDKRYGRKAYQLPNSVKAIQREIMMHGPVVAGYTVYEDFSYYTKGIY 161
Query: 105 QHNFGDSIGLHAVR 118
+H G G HAV+
Sbjct: 162 KHTAGRETGGHAVK 175
>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 337
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 82/155 (52%), Gaps = 3/155 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD+RE+W CPS++ I DQS C S WA++ AISDR+CI +NG ++SA +V
Sbjct: 84 LPDYFDSREQWKNCPSIKRIYDQSQCYSSWAMASVAAISDRICIQTNGTVKVELSAIELV 143
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GCN G+ + AW +W NG+VTG + GC PY C+H C
Sbjct: 144 SCCSKCAVGCNFGYSESAWYYWVENGLVTGESNGNNSGCLPYPFPKCDHGSSDSYPMCGY 203
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ P C C P Y Y D GK A+ V
Sbjct: 204 V-VYTPPVCNGTC-RPGYPIPYNDDKHFGKSAYQV 236
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 52/103 (50%), Positives = 70/103 (67%), Gaps = 2/103 (1%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K A+ V + + R+I +GP+ A +Y DF+ YKSGVY+H G I + +VR++
Sbjct: 228 HFGKSAYQVKQNESDIRREIMLYGPVEASIFIYDDFVDYKSGVYKHLTGRLITIQSVRII 287
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN 163
GWG+EN IPYWL ANSWN+ WG +G FKILRG NE +IE N
Sbjct: 288 GWGIENGIPYWLCANSWNEEWGLNGFFKILRGSNECEIEAFVN 330
>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
Length = 335
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/173 (39%), Positives = 92/173 (53%), Gaps = 9/173 (5%)
Query: 161 GFNNRV-EANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVS 219
G+ N + E +DD L T K +FDARE W C + H+ DQ NCGSCWA
Sbjct: 63 GYKNYLNEVEIKKDDPLYTKNNNKIK----HFDARENWKICKQIGHVRDQGNCGSCWAFG 118
Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
A +DRLC+A+ G F Q+SA+ + C C GC GG P AW+++ G+ TGGDY
Sbjct: 119 TTGAFADRLCVATGGGFNEQLSAEKLTFCCWTCGLGCQGGNPIKAWKYFKRRGITTGGDY 178
Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFD 330
S EGC PY + PC + QG C +C + CY N + E+ Y+ +
Sbjct: 179 GSNEGCAPYKVPPC-YDDQGEFL-CQGKPTEHNHKCPRACYGNSTVENRYKVE 229
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 41/83 (49%), Positives = 58/83 (69%), Gaps = 1/83 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGVENDIPYWLVANS 136
+ I +GP+ A F VY DF+ YKSG+YQ +G H+V+++GWG E+ IPYWL+ NS
Sbjct: 242 QDIRTYGPVEASFDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWGEEDGIPYWLLVNS 301
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W+ WG+ GTF+I++G NE IE
Sbjct: 302 WSKFWGEQGTFRIIKGRNECGIE 324
>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
Length = 332
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 86/153 (56%), Gaps = 7/153 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
L + FDARE+WPEC S+ I D S C S WA + A ++SDRLCI S G +SAQ ++
Sbjct: 72 LSQFFDARERWPECTSIPQINDISECKSSWAFAAAESMSDRLCINSGGMINTILSAQELL 131
Query: 247 ACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
+C GC GG AW++WG +G+ TGG Y +Q GC+PY++APC V
Sbjct: 132 SCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSIAPCGKTVGNVTYP 191
Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
L TP C++ C + ++ Y D+ K +
Sbjct: 192 ACTNTTLPTPSCEKKC---TSKNGYPVDIDKDR 221
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 40/80 (50%), Positives = 55/80 (68%)
Query: 80 IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
+ +GP+ F VY DFLQY +G+Y H G+ G +VR+LGWG+ +PYWL+ANSW
Sbjct: 242 VMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMYEGVPYWLLANSWGK 301
Query: 140 HWGDHGTFKILRGENEADIE 159
WG++GTF+ LRG NE +E
Sbjct: 302 EWGENGTFRALRGTNECGLE 321
>gi|56758470|gb|AAW27375.1| unknown [Schistosoma japonicum]
Length = 217
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 75/118 (63%), Gaps = 1/118 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS CGS WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+C C GC+GG+ +W +W G+VTGG + GC+PY C+H V+G + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRAC 207
>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
Length = 209
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/123 (48%), Positives = 79/123 (64%), Gaps = 1/123 (0%)
Query: 38 KKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN-AMRQIYEHGPLVAIFSVYADF 96
K K + +KK + Y T HY ++++ V N M ++ GP+ A F+VY+DF
Sbjct: 77 KGDGKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSSVNDIMEELVTRGPVEAAFTVYSDF 136
Query: 97 LQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEA 156
LQY SGVY+H G ++G HAV++LG+GVEN YWLVANSWN WGD G FKILRG +E
Sbjct: 137 LQYHSGVYRHTTGSALGGHAVKILGYGVENGDKYWLVANSWNPDWGDQGFFKILRGVDEC 196
Query: 157 DIE 159
IE
Sbjct: 197 GIE 199
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/109 (44%), Positives = 68/109 (62%), Gaps = 4/109 (3%)
Query: 233 NGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
N +SA ++AC +C GCNGG+P AW + H+GVVTGG YNS++GCQPY +A
Sbjct: 5 NATVHAHVSANELLACCESCGDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAA 64
Query: 292 CEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C+HHV G L+ C G KTP C++ C Y T++ D G++++ V
Sbjct: 65 CDHHVVGKLKPCK--GDGKTPRCEKKC-EAGYNVTFKDDKHYGQRSYSV 110
>gi|339242631|ref|XP_003377241.1| cathepsin B [Trichinella spiralis]
gi|316973973|gb|EFV57514.1| cathepsin B [Trichinella spiralis]
Length = 199
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/157 (38%), Positives = 90/157 (57%), Gaps = 13/157 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ +D R+ +P C + I DQSNCGSCWAVS A+ +SDR CIA+NG +S + ++
Sbjct: 54 LPKEYDVRKAYPHCKYINFIKDQSNCGSCWAVSSASVMSDRHCIATNGTEQPFLSEEELI 113
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG+ A+ +W G+ +GG Y + GC+PY++APC NC
Sbjct: 114 SCCKTCGLGCDGGYVSHAFEYWVEKGLPSGGAYGWKTGCKPYSIAPC--------NNCD- 164
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVLM 342
+ +TP+CK C P Y T + D G K + +
Sbjct: 165 --EAETPKCKNTCI-PEYPLTPKDDKYFGNKIMLRIF 198
>gi|255087666|ref|XP_002505756.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
gi|226521026|gb|ACO67014.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
Length = 273
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/140 (42%), Positives = 74/140 (52%), Gaps = 11/140 (7%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIA-DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
A GLP +FDAR KWP C L +A DQ NCGSCWA++ A +SDR CI S G ++S
Sbjct: 15 ALGLPESFDARTKWPTCAHLIGVARDQGNCGSCWAMAPAEVMSDRACIQSGGEIDAELSP 74
Query: 243 QHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
++AC +GC GG A+ F NGVVTGG ++ Q C PY APC H +
Sbjct: 75 FQLLACAQGSFGCEGGESADAYEFAKSNGVVTGGGFDDQNTCAPYPFAPCHHPCE----- 129
Query: 303 CTLLGKLKTPECKQNCYNPS 322
TP C C S
Sbjct: 130 -----VFPTPACPATCVGGS 144
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 13/87 (14%)
Query: 79 QIYEHGPLVAIFS-VYADFLQYKSGVYQHN-----FGDSIGLHAVRVLGWG------VEN 126
+IY +GP+ + +Y +F YKSGV++ + G + G H V+V+GWG E
Sbjct: 174 EIYHNGPVSSYAGDIYEEFYAYKSGVFRESPSVAQRGANHGGHVVKVIGWGKADPAKGEG 233
Query: 127 DIPYWLVANSWNDHWGDHGTFKILRGE 153
+ YW+V NSW + WGD G +I GE
Sbjct: 234 EGYYWIVVNSWLN-WGDDGVGRIAVGE 259
>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
Length = 334
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 84/136 (61%), Gaps = 6/136 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFDAR+KW +C ++ + DQ +CGSCWA ++A +DRLCIA++G F +S + +
Sbjct: 85 IPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG P AW + +G+VTGG+Y+S EGCQPY + PC G N T
Sbjct: 145 FCCHKCGFGCSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPCPLDEYG---NNTC 201
Query: 306 LGKL--KTPECKQNCY 319
GK K C + CY
Sbjct: 202 SGKPAEKNHRCTRMCY 217
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 60/98 (61%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
HY + A+ + + +GP+ A F VY DF YKSGVY + +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQYDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E +PYWL+ NSWND WGD G FKI RG NE I+
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 326
>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
Length = 334
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 67/108 (62%), Gaps = 1/108 (0%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P+ FD+R W C + HI DQ NCGSCW+ S A +DRLC+++ G F +S + +
Sbjct: 86 PQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELTF 145
Query: 248 CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
C +C GC GG P AW ++ GV TGGDYN++EGC PY + PC +
Sbjct: 146 CCKDCGQGCGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRN 193
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 68/122 (55%), Gaps = 2/122 (1%)
Query: 51 KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNFG 109
K Y T++ + K + + + Q I +GP+ A F Y D YKSG+Y+ +
Sbjct: 213 KTCYGKTTVQNRYKTKSEYYINSIKTIEQDIKTYGPVEASFDCYDDLSVYKSGIYRKSPN 272
Query: 110 DSI-GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
G H+++++GWG E+ PYWL NSW+ WGDHGTFKI++G NE IE + +
Sbjct: 273 AKYKGGHSIKIIGWGQEDGTPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIPS 332
Query: 169 NS 170
+S
Sbjct: 333 SS 334
>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
Length = 347
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/176 (39%), Positives = 90/176 (51%), Gaps = 10/176 (5%)
Query: 167 EANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISD 226
E + ED DL A LP +FDAREKWPECPS+ I DQS G CWAVS A ++D
Sbjct: 81 EMDQQEDIDL-------AVSLPESFDAREKWPECPSIGLIRDQSAGGGCWAVSSAEVMTD 133
Query: 227 RLCIASNGYFTGQISAQHIVACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGC 284
R+CI SNG +S I++C C GC G P+ A+ + GV +GG Y ++ C
Sbjct: 134 RICIQSNGTKQVYVSETDILSCCGQRCGSGCTSGVPRQAFNYAIRKGVCSGGPYGTKGVC 193
Query: 285 QPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+PY PC +H P G TP C++ C Y Y D G K ++
Sbjct: 194 KPYPFYPCGYHAHLPYYGPCPDGMWPTPTCEKAC-QSDYTVPYNDDRIFGSKTIVL 248
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 44/83 (53%), Positives = 62/83 (74%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I+ +GPLVA ++VY DF YK+G+Y G + G HAV+++GWG EN + YWL+ANSW
Sbjct: 256 REIFNNGPLVATYTVYEDFAYYKNGIYMTGLGRATGAHAVKIIGWGEENGVKYWLIANSW 315
Query: 138 NDHWGDHGTFKILRGENEADIEM 160
N WG++G F++LRG N DIE+
Sbjct: 316 NTDWGENGFFRMLRGTNLCDIEL 338
>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
Length = 168
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 52/98 (53%), Positives = 68/98 (69%), Gaps = 2/98 (2%)
Query: 64 YFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y K++ VP ++I +GP+ F+VY D +QYK GVYQH G +G HA+R+LG
Sbjct: 62 YGAKSYSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYKDGVYQHVTGKMLGGHAIRILG 121
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WGVEND+PYWL+ANSWN WG++G FKILRG + IE
Sbjct: 122 WGVENDVPYWLIANSWNTDWGNNGFFKILRGSDHCGIE 159
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Query: 274 TGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKK 333
+GG + S +GC PY +APCEHHV G C + KTP+C ++C SY Y D
Sbjct: 5 SGGPFGSNQGCHPYKIAPCEHHVNGTRPACNGE-EGKTPKCIKHC-QASYTVAYEQDKSY 62
Query: 334 GKKAHMV 340
G K++ V
Sbjct: 63 GAKSYSV 69
>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
Length = 288
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 52/86 (60%), Positives = 66/86 (76%), Gaps = 1/86 (1%)
Query: 75 NAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLV 133
NAM+ +IY++GP+V F VY DF QY+SGVY+H G G HAVRV+GWGVEN + YWL
Sbjct: 193 NAMKAEIYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKGSHAVRVIGWGVENGVKYWLC 252
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
ANSWN+ WG++G FKI+RGEN +E
Sbjct: 253 ANSWNERWGENGFFKIVRGENHVGVE 278
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/173 (35%), Positives = 94/173 (54%), Gaps = 14/173 (8%)
Query: 169 NSSEDDDLETMGCQ-NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDR 227
N SE ++L + Q + + LP +FDAR+KWP CPSL I Q +CGSC+AVS A I+DR
Sbjct: 28 NESELNNLPRLQNQRSVRALPASFDARQKWPYCPSLNQIRSQGSCGSCYAVSTAAVITDR 87
Query: 228 LCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
CI S G + ++C +C+ C+GG+ + +W G+ +GG Y+S +GC+PY
Sbjct: 88 YCIHSGGERQFYFGSTGYLSCCTDCYKCDGGYVHKTFDYWVKYGLTSGGPYHSGQGCKPY 147
Query: 288 TLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
G Q+ ++ K C + C Y TY DLK G ++++
Sbjct: 148 PFG-------GATQDVNIVLK-----CDRQC-QAGYPLTYSQDLKHGASSYIL 187
>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 82/134 (61%), Gaps = 11/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHCTV-GGVQQLRISAAHLL 148
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+C +C +GC+GG+P AWR++ +G+ + CQPY C+HH +G C+
Sbjct: 149 SCCKDCGYGCDGGYPDAAWRYYVSHGLAS-------SYCQPYPFPHCDHHGGKGKKPPCS 201
Query: 305 LLGKLKTPECKQNC 318
TP+C C
Sbjct: 202 KY-DFHTPKCNTTC 214
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 59/82 (71%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP V F VY+DF YK+GVY+H GD +G HAVR++GWG N PYW +ANSW
Sbjct: 240 RELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSW 299
Query: 138 NDHWGDHGTFKILRGENEADIE 159
+ WG +G F ILRG++E IE
Sbjct: 300 DTDWGMNGHFLILRGKDECGIE 321
>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
Length = 426
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 92/184 (50%), Gaps = 7/184 (3%)
Query: 147 FKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHI 206
FK R + + M + + + LE + + LP++FDAR+KWP CPS+ ++
Sbjct: 103 FKYTRNQTAVEEYMEHIRKFFESDAMKRHLEELENYKSSSLPKHFDARQKWPNCPSISNV 162
Query: 207 ADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRF 266
+Q CGSC+AV+ A SDR CI SNG F +S + I+ C C C GG P A +
Sbjct: 163 PNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEEDIIGCCSVCGNCYGGDPLKALTY 222
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W + G+VTGG ++GC+PY+ + P T + C + C N Y+
Sbjct: 223 WVNQGLVTGG----RDGCRPYSF---DLSCGVPCSPATFFEAEEKRTCMRRCQNIYYQQK 275
Query: 327 YRFD 330
Y D
Sbjct: 276 YEED 279
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 66/121 (54%), Gaps = 17/121 (14%)
Query: 50 KKRLYLPTSIPLSHY----FKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVY 104
K+R+ +PT I H+ +K ++ N ++ +I +GP F V +FL Y SGV+
Sbjct: 301 KERVKVPTII--GHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVF 358
Query: 105 Q----HNFGDSIGL-HAVRVLGWGVENDIP-YWLVANSWNDHWGDHGTFKILRGENEADI 158
+ F D I H VR++GWG +D YWL NS+ +HWGD+G FKI N D+
Sbjct: 359 RPFPLDGFDDRIVYWHVVRLIGWGESDDGQHYWLAVNSFGNHWGDNGIFKI----NTDDM 414
Query: 159 E 159
E
Sbjct: 415 E 415
>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 80/137 (58%), Gaps = 11/137 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR++WP C S++ + DQS CGSCWA A A+SDRLCIA+ G T + +
Sbjct: 75 IPESFDARQQWPNCESIKEVRDQSTCGSCWAFGAAEAMSDRLCIAT-GKQTRISTEDLLT 133
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ----GPLQ 301
C C GCNGG+P AW ++ + G+VTG + C+PYT PC+HHV GP
Sbjct: 134 CCGITCGMGCNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPYTFPPCDHHVDDGKYGPCG 193
Query: 302 NCTLLGKLKTPECKQNC 318
+ TP C ++C
Sbjct: 194 D-----SQPTPACVKSC 205
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/84 (58%), Positives = 63/84 (75%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I GP+ A F+VY DFL YKSGVYQ+ G ++G HAV+++GWGVE ++PYWLV NSWN
Sbjct: 236 EIMTFGPVEASFTVYEDFLTYKSGVYQNVAGANLGGHAVKIIGWGVEKNVPYWLVVNSWN 295
Query: 139 DHWGDHGTFKILRGENEADIEMGF 162
+ WG++G FKILRG N IE G
Sbjct: 296 EGWGENGLFKILRGSNHVGIEGGI 319
>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
Length = 422
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 91/184 (49%), Gaps = 7/184 (3%)
Query: 147 FKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHI 206
FK R + + M + + + LE + + LP+ FDAR+KWP CPS+ ++
Sbjct: 99 FKYTRNQTAVEEYMEHIRKFFESDAMKRHLEELDNYKSSDLPKAFDARQKWPNCPSISNV 158
Query: 207 ADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRF 266
+Q CGSC+AV+ A SDR CI SNG F +S + I+ C C C GG P A +
Sbjct: 159 PNQGGCGSCFAVAAAGVASDRACIHSNGTFKALLSEEDIIGCCSVCGNCYGGDPLKALTY 218
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W + G+VTGG ++GC+PY+ + P T + C + C N Y+
Sbjct: 219 WVNQGLVTGG----RDGCRPYSF---DLSCGVPCSPATFFEAEEKRTCMRRCQNIYYQQR 271
Query: 327 YRFD 330
Y D
Sbjct: 272 YEED 275
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 65/121 (53%), Gaps = 17/121 (14%)
Query: 50 KKRLYLPTSIPLSHY----FKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVY 104
K+R+ +PT I H+ +K ++ N ++ +I +GP F V +FL Y SGV+
Sbjct: 297 KERVKVPTII--GHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVF 354
Query: 105 Q----HNFGDSIGL-HAVRVLGWG-VENDIPYWLVANSWNDHWGDHGTFKILRGENEADI 158
+ F D I H VR++GWG E+ YWL NS+ HWGD+G FKI N D+
Sbjct: 355 RPFPLDGFDDRIVYWHVVRLIGWGQSEDGTHYWLAVNSFGSHWGDNGLFKI----NTDDM 410
Query: 159 E 159
E
Sbjct: 411 E 411
>gi|161343881|tpg|DAA06121.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 182
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 53/111 (47%), Positives = 76/111 (68%), Gaps = 2/111 (1%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P+ FDAR+ + C + + + DQ NC S WAV+VA+ +DRLCIA+NG FT +SAQ++
Sbjct: 66 IPKEFDARQYFFNCANVIGDVKDQGNCASSWAVAVASTFTDRLCIATNGTFTQNLSAQNL 125
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH 295
++C + GCNGG AW F G+VTGG+++S EGCQPY PC+H+
Sbjct: 126 MSCGDDEKSGCNGGSAFKAWEFITGKGIVTGGNFDSNEGCQPYKNRPCDHY 176
>gi|308485822|ref|XP_003105109.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
gi|308257054|gb|EFP01007.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
Length = 410
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 92/184 (50%), Gaps = 7/184 (3%)
Query: 147 FKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHI 206
FK R + + M + + + LE + + LP++FDAR+KWP CPS+ ++
Sbjct: 87 FKYTRNQTAVEEYMEHIRKFFESDAMKRHLEELENYKSSDLPKHFDARQKWPNCPSISNV 146
Query: 207 ADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRF 266
+Q CGSC+AV+ A SDR CI SNG F +S + I+ C C C GG P A +
Sbjct: 147 PNQGGCGSCFAVAAAGVASDRACIHSNGTFKALLSEEDIIGCCSVCGNCYGGDPLKALTY 206
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W + G+VTGG ++GC+PY+ + P T + C + C N Y+
Sbjct: 207 WVNQGLVTGG----RDGCRPYSF---DLSCGVPCSPATFFEAEEKRTCMRRCQNIYYQQK 259
Query: 327 YRFD 330
Y D
Sbjct: 260 YEED 263
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 65/121 (53%), Gaps = 17/121 (14%)
Query: 50 KKRLYLPTSIPLSHY----FKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVY 104
K+R+ +PT I H+ +K ++ N ++ +I +GP F V +FL Y SGV+
Sbjct: 285 KERVKVPTII--GHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVF 342
Query: 105 Q----HNFGDSIGL-HAVRVLGWGVENDIP-YWLVANSWNDHWGDHGTFKILRGENEADI 158
+ F D I H VR++GWG D YWL NS+ +HWGD+G FKI N D+
Sbjct: 343 RPFPLDGFDDRIVYWHVVRLIGWGESGDGQHYWLAINSFGNHWGDNGLFKI----NTDDM 398
Query: 159 E 159
E
Sbjct: 399 E 399
>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
Length = 353
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 85/157 (54%), Gaps = 16/157 (10%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P FDARE WP+C + +I +Q CGSCWA + A +SDRLC+A+NG + S + +
Sbjct: 73 IPATFDAREYWPQCKDVIGNIRNQGKCGSCWAFAAAEVMSDRLCVATNGSVKFEFSPEDL 132
Query: 246 VACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+ C C C GG+ AW+++ G+V+GGDYN+ GCQPY+ + V
Sbjct: 133 INCCETCGKKCKGGYSYYAWKYYTSTGLVSGGDYNTSRGCQPYSKSNFNDGV-------- 184
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
+PEC + C N Y ++Y D G + +L
Sbjct: 185 ------SPECSKTCQNTKYPTSYLNDRHFGDGTYYIL 215
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/77 (55%), Positives = 50/77 (64%), Gaps = 1/77 (1%)
Query: 84 GPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGD 143
GP++A F VY DF Y+ GVY H G +G HAV+++GWG EN YWLVANSW WG
Sbjct: 230 GPVMAGFDVYEDFKLYREGVYVHTSGALLGSHAVKIIGWGTENGWAYWLVANSWGKDWGA 289
Query: 144 -HGTFKILRGENEADIE 159
G FKI RG NE IE
Sbjct: 290 LGGVFKIRRGTNECKIE 306
>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
Length = 320
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 52/83 (62%), Positives = 62/83 (74%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY++GP+V F V+ADF QYKSGVY+H G + G HAVRV+GWGVEN + YWLVANS
Sbjct: 228 MTEIYQNGPVVVQFEVFADFYQYKSGVYRHVTGATEGWHAVRVIGWGVENGVKYWLVANS 287
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W WGD G FK +RGEN IE
Sbjct: 288 WGVRWGDKGFFKFVRGENHLGIE 310
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/160 (38%), Positives = 84/160 (52%), Gaps = 14/160 (8%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
++ + LP +FD+R+KWP CPSL I DQ CGSC+ VS A AI+DR CI S G
Sbjct: 76 RSVRSLPESFDSRQKWPNCPSLNQIRDQGCCGSCYVVSTAAAITDRYCIHSGGQKQFTFG 135
Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
A +AC +C+ C+GG+ W++W +G+ + G Y S +GC Y + V PL
Sbjct: 136 ATDYLACCTDCFKCDGGYVGKTWQYWVDSGLTSEGPYKSGQGCNSYPFG--SYCVNDPL- 192
Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
P C + C Y TY DLK G A+ V+
Sbjct: 193 ----------PTCSRTC-QAGYPLTYSQDLKYGGSAYRVM 221
>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
Length = 342
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 101/195 (51%), Gaps = 13/195 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM N R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRNRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-AWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C G G AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W G+VTGG + GCQPY CEH +G C KTP+CKQ C Y++
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228
Query: 327 YRFDLKKGKKAHMVL 341
Y D G + + V+
Sbjct: 229 YEQDKHYGDQRYNVI 243
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 61/82 (74%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+++RG +E IE
Sbjct: 311 NEDWGEKGLFRMVRGRDECSIE 332
>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 80/134 (59%), Gaps = 12/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
L FDA E WPECP++ I DQS+CGSCWAV+ A+AISDR C G +ISA ++
Sbjct: 92 LQDRFDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTL-GGVRDLRISAGDLM 150
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCT 304
+C C +GCNGG+P++AW ++ +G+V+ E CQPY C HHV L C+
Sbjct: 151 SCCDVCGFGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS 203
Query: 305 LLGKLKTPECKQNC 318
G+ TP C C
Sbjct: 204 --GEYDTPTCNSTC 215
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/82 (52%), Positives = 54/82 (65%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ +GP FSVYADF+ Y GVY+H G +G HAVR++GWG N PYW +ANSW
Sbjct: 241 RELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWGELNGEPYWKIANSW 300
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WG +G F I RG +E IE
Sbjct: 301 NREWGMNGYFLIARGVDECGIE 322
>gi|239790489|dbj|BAH71802.1| ACYPI000009 [Acyrthosiphon pisum]
Length = 178
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 53/111 (47%), Positives = 75/111 (67%), Gaps = 2/111 (1%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+PR FDAR+ + C + + + DQ NC S WAV+VA+ +DRLCIASNG FT +SAQ++
Sbjct: 64 IPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNL 123
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH 295
++C GC+GG AW + G+VTGG+++S EGCQPY PC+H+
Sbjct: 124 MSCGDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHY 174
>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
Length = 386
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 79/134 (58%), Gaps = 13/134 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAREKWPECPSLR I DQ CGSCWAVS A+A++DR C+ S G + ++
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG AW+FW G+ +GG NS++GC PY P+ C +
Sbjct: 185 SCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPY-----------PIGECRI 233
Query: 306 LGKLK-TPECKQNC 318
G+ + TP+C C
Sbjct: 234 PGEDEDTPKCSNKC 247
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 45/83 (54%), Positives = 59/83 (71%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I+ +GP+ A F Y D YKSG+Y+H +G G HAV++LGWGVEN + YWLVANS
Sbjct: 277 MEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANS 336
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W WG++G FK++RGEN IE
Sbjct: 337 WGREWGENGFFKMVRGENHCGIE 359
>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 102/195 (52%), Gaps = 13/195 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-AWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C G G AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W G+VTGG + GCQPY CEH +G C KTP+CKQ C Y++
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228
Query: 327 YRFDLKKGKKAHMVL 341
Y+ D G +++ V+
Sbjct: 229 YKQDKHYGDESYNVI 243
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 61/82 (74%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 251 KEIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+++RG +E IE
Sbjct: 311 NEDWGEKGLFRMVRGRDECSIE 332
>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 340
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 101/193 (52%), Gaps = 8/193 (4%)
Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKG-LPRNFDAREKWPECPSLRHIADQ 209
R + DIE F +E + + ++T+ + +PR+FDAR W C ++R I D+
Sbjct: 52 RFRSSKDIEKMFRKYIEIENIQTKHIKTISHNSINMEIPRSFDARYHWINCSTIRQIHDE 111
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC--TPNCWGCNGGWPQLAWRFW 267
S C + WA++ ++ISDR+CI SNG + Q+SA+ ++C +P GC G +W
Sbjct: 112 SLCRADWAIATVDSISDRICIRSNGRISVQLSARDAISCGFSP---GCFHGSEVEVLVYW 168
Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
G+VTGG Y Q GCQPY L C +H + +C + P+C C + Y TY
Sbjct: 169 ITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFLDCN-NNTFEFPQCTNECQD-GYNKTY 226
Query: 328 RFDLKKGKKAHMV 340
D G++ + V
Sbjct: 227 DDDKFYGERIYNV 239
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 54/86 (62%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGVENDIPYWLV 133
+ ++I +GP++A SV DFL YKSGVY ++G +R++GWG E IPYWL
Sbjct: 245 DIQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIPYWLC 304
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
ANSWN+ WGD+G KI RG IE
Sbjct: 305 ANSWNEEWGDNGYVKIQRGVQAGYIE 330
>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
Length = 386
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 79/134 (58%), Gaps = 13/134 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAREKWPECPSLR I DQ CGSCWAVS A+A++DR C+ S G + ++
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG AW+FW G+ +GG NS++GC PY P+ C +
Sbjct: 185 SCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPY-----------PIGECRI 233
Query: 306 LGKLK-TPECKQNC 318
G+ + TP+C C
Sbjct: 234 PGEDEDTPKCSNKC 247
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY + A+ +P M +I+ +GP+ A F Y D YKSG+Y+H +G G HAV++L
Sbjct: 261 HYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLL 320
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN + YWLVANSW WG++G FK++RGEN IE
Sbjct: 321 GWGVENGVKYWLVANSWGREWGENGFFKMVRGENHCGIE 359
>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
Length = 386
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 79/134 (58%), Gaps = 13/134 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAREKWPECPSLR I DQ CGSCWAVS A+A++DR C+ S G + ++
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG AW+FW G+ +GG NS++GC PY P+ C +
Sbjct: 185 SCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPY-----------PIGECRI 233
Query: 306 LGKLK-TPECKQNC 318
G+ + TP+C C
Sbjct: 234 PGEDEDTPKCSNKC 247
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY + A+ +P M +I+ +GP+ A F Y D YKSG+Y+H +G G HAV++L
Sbjct: 261 HYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLL 320
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN + YWLVANSW WG++G FK++RGEN IE
Sbjct: 321 GWGVENGVKYWLVANSWGREWGENGFFKMVRGENHCGIE 359
>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
Length = 334
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 56/134 (41%), Positives = 81/134 (60%), Gaps = 2/134 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR+KW +C ++ + DQ +CGSCWA ++A +DRLCIA++G F +S + +
Sbjct: 85 IPSYFDARKKWRKCLTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C C +GC+GG+P AW + +G+VTGG+Y S EGCQPY + PC G C+
Sbjct: 145 FCCHKCGFGCSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPPCPLDEYGN-NTCSG 203
Query: 306 LGKLKTPECKQNCY 319
K C + CY
Sbjct: 204 KPTEKNHRCTRMCY 217
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 60/98 (61%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
HY + A+ + + +GP+ A F VY DF YKSGVY + +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WG E +PYWL+ NSWND WGD G FKI RG NE I+
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 326
>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
Length = 323
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 85/156 (54%), Gaps = 15/156 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R +W C S+ I DQ+ CGSCWA S A ISDR+CIA+ G IS ++
Sbjct: 81 IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
AC N GC G +P A+R+W GVVTGGD+ GC+PY APC + P +
Sbjct: 141 ACCGNSCGDGCKGRYPIQAFRWWNSRGVVTGGDFRGS-GCRPYPFAPC---ISCPEE--- 193
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
KTP C +C Y + Y D + G A+ V
Sbjct: 194 -----KTPTCSLSC-QFGYSTAYAKDKRFGVSAYAV 223
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 46/95 (48%), Positives = 64/95 (67%), Gaps = 2/95 (2%)
Query: 67 KAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
A+ V R A Q I +GP+V F++Y D +YKSGVY+H G +G HA++++GWG
Sbjct: 219 SAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT 278
Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+N IPYWL+ANSW +WG++G K+ RG NE IE
Sbjct: 279 QNGIPYWLIANSWGANWGENGFLKMRRGVNECGIE 313
>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
Length = 386
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 79/134 (58%), Gaps = 13/134 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAREKWPECPSLR I DQ CGSCWAVS A+A++DR C+ S G + ++
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG AW+FW G+ +GG NS++GC PY P+ C +
Sbjct: 185 SCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPY-----------PIGECRI 233
Query: 306 LGKLK-TPECKQNC 318
G+ + TP+C C
Sbjct: 234 PGEDEDTPKCSNKC 247
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY + A+ +P M +I+ +GP+ A F Y D YKSG+Y+H +G G HAV++L
Sbjct: 261 HYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLL 320
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWGVEN + YWLVANSW WG++G FKI+RGEN IE
Sbjct: 321 GWGVENGVKYWLVANSWGREWGENGFFKIVRGENHCGIE 359
>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 80/134 (59%), Gaps = 12/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
L FDA E WP+CP++ I DQS+CGSCWAV+ A+AISDR C G +ISA ++
Sbjct: 92 LQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTL-GGVRDLRISAGDLM 150
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCT 304
+C C +GCNGG+P++AW ++ +G+V+ E CQPY C HHV L C+
Sbjct: 151 SCCDVCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS 203
Query: 305 LLGKLKTPECKQNC 318
G+ TP C C
Sbjct: 204 --GEYDTPTCNSTC 215
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 54/82 (65%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ +GP FSVYADFL Y GVY+H G +G HAVR++GWG N PYW +ANSW
Sbjct: 241 RELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWGELNGEPYWKIANSW 300
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WG +G F I RG +E IE
Sbjct: 301 NREWGMNGYFLIARGVDECGIE 322
>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
Length = 332
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 74/99 (74%), Gaps = 2/99 (2%)
Query: 63 HYFKKAH-MVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K + ++ +C+A++ IY++GP+ + F VYADF YKSGVYQ + +G+HA+++L
Sbjct: 222 HFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVYADFPSYKSGVYQQHMIKFMGVHAIKIL 281
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E+ +PYWLVANSWN WGD G FKILRG++E IE
Sbjct: 282 GWGTEDGVPYWLVANSWNVGWGDKGYFKILRGKDECGIE 320
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 80/155 (51%), Gaps = 12/155 (7%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P +F RE W C S+R I DQS CGSCWA + A +ISDR+CI +NG ISA+ ++A
Sbjct: 88 PESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLA 147
Query: 248 CTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
C C GC+G + +V +++GCQPY+L PC + NCT
Sbjct: 148 CCHTCGHGCDGRCHCSSVAILQGRRLVP-EPVRTEDGCQPYSLPPC-------VPNCT-- 197
Query: 307 GKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
TP+C+ C YE +Y D K + +L
Sbjct: 198 HPEPTPKCQHVC-RKGYEKSYEEDKHFAKNVYRLL 231
>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
Length = 396
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 61/148 (41%), Positives = 86/148 (58%), Gaps = 8/148 (5%)
Query: 185 KGLPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
K LP +F+A E++ EC S + HI DQS CGSCWA + A +DRLCI S G FT +S
Sbjct: 137 KDLPVSFNATEEFKECSSVIGHIRDQSACGSCWAFAPTEAFNDRLCIKSAGNFTSLLSPG 196
Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQ 297
++ AC+ GC+GG AW++ GVVTGGDY+++ +GC PY + PC H+
Sbjct: 197 NVAACSKTS-GCHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPYDIPPCAHYTN 255
Query: 298 GPLQNCTLLGKLKTPECKQNCYNPSYES 325
L K P C+++C N Y++
Sbjct: 256 STLYPKCPKTKYDFPTCQESCPNKKYDT 283
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A + VY DFL YKSGVY+ +++G HAV+++GWG + YWLV NSW
Sbjct: 308 KEIMTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGWGED----YWLVVNSW 363
Query: 138 NDHWGDHGTFKILRGE 153
N +WGD+G FKI G+
Sbjct: 364 NKNWGDNGMFKIGCGQ 379
>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
Length = 342
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 87/155 (56%), Gaps = 11/155 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR+KW +CPSL I +Q CGSCWA+S A+A++DR CI S G A ++
Sbjct: 87 LPESFDARQKWSQCPSLNVIRNQGCCGSCWAISAASAMTDRWCIKSKGKEQFSFGATDML 146
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
AC C GC GG+ AW+FW GV +GG YNS++GC PY + C+ +
Sbjct: 147 ACCHACGDGCKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCDASGE-------- 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ TP+C + C + + D + G+ A+ +
Sbjct: 199 --EADTPKCSKRCQSGYNVTDVWQDRRYGRVAYSI 231
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 52/98 (53%), Positives = 66/98 (67%), Gaps = 2/98 (2%)
Query: 64 YFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y + A+ +P M +IY +GP+ A F Y D YKSGVY+H +G G HAV+++G
Sbjct: 224 YGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMG 283
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WGVEN + YWLVANSW D WGD+G FKI+RGEN IE
Sbjct: 284 WGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIE 321
>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 192
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 71/106 (66%), Gaps = 2/106 (1%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ KK + + +IY++GP+ A FSVYADF YKSGVYQ + + +G HA+R+L
Sbjct: 81 HFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADFPSYKSGVYQRHSEEMLGGHAIRIL 140
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
GWG E+ +PYWLVANSWN+ WGD G FKI RG +E IE N +
Sbjct: 141 GWGTEDGVPYWLVANSWNEDWGDKGYFKIRRGNDECGIEDDINAGI 186
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/87 (48%), Positives = 54/87 (62%), Gaps = 3/87 (3%)
Query: 254 GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPE 313
GCNGG+P AW+F+ +VTGG Y +++GCQPY PCEHH GPL NCT G TPE
Sbjct: 6 GCNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPPCEHHTVGPLPNCT--GIKPTPE 63
Query: 314 CKQNCYNPSYESTYRFDLKKGKKAHMV 340
C + C Y+ +Y D GKK + +
Sbjct: 64 CAKTC-REGYQKSYTRDKHFGKKVYSI 89
>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
Length = 342
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 87/155 (56%), Gaps = 11/155 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR+KW +CPSL I +Q CGSCWA+S A+A++DR CI S G A ++
Sbjct: 87 LPESFDARQKWSQCPSLNVIRNQGCCGSCWAISAASAMTDRWCIKSKGKEQFSFGATDML 146
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
AC C GC GG+ AW+FW GV +GG YNS++GC PY + C+ +
Sbjct: 147 ACCHACGDGCKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCDASGE-------- 198
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ TP+C + C + + D + G+ A+ +
Sbjct: 199 --EADTPKCSKRCQSGYNVTDVWQDRRYGRVAYSI 231
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 52/98 (53%), Positives = 66/98 (67%), Gaps = 2/98 (2%)
Query: 64 YFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
Y + A+ +P M +IY +GP+ A F Y D YKSGVY+H +G G HAV+++G
Sbjct: 224 YGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMG 283
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
WGVEN + YWLVANSW D WGD+G FKI+RGEN IE
Sbjct: 284 WGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIE 321
>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
Length = 329
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 85/156 (54%), Gaps = 13/156 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R +W EC S++ I DQ+ CGSCWA A ISDR CI + G IS ++
Sbjct: 85 VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C +C GC GG+P A R+W GVVTGGDY+ GC+PY +APC NC
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA-GCKPYPIAPCTS------GNCP 197
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ KTP C +C Y + Y D G A+ V
Sbjct: 198 ---ESKTPSCSMSC-QSGYSTAYAKDKHFGVSAYAV 229
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 69/99 (69%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+ VP+ A Q IY +GP+ A FSVY DF +YKSGVY+H G +G HA++++
Sbjct: 221 HFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKII 280
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E+ PYWLVANSW +WG+ G FKI RG+++ IE
Sbjct: 281 GWGTESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIE 319
>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
Length = 332
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 49/103 (47%), Positives = 65/103 (63%), Gaps = 1/103 (0%)
Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
FD+RE W C + I DQ NCGSCWA A +DRLC+++ G F +S + + C
Sbjct: 89 FDSRENWKSCKQIGRIRDQGNCGSCWAFGTTGAFADRLCVSTGGKFNELLSPEDVAFCCQ 148
Query: 251 NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPC 292
NC GC GG+P AW+++ GV TGGDY+S+EGC PY + PC
Sbjct: 149 NCGKGCEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPC 191
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 68/111 (61%), Gaps = 2/111 (1%)
Query: 51 KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNF- 108
K Y T++ + K +++ N M Q + ++GP+ A F+++ D YKSG+YQ
Sbjct: 213 KTCYGSTTVQKRYKVKNEYVLNSPNTMEQDLIKYGPIEASFNLFDDLSAYKSGIYQKTPK 272
Query: 109 GDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ H+++++GWG EN +PYWL NSW+ WG+ GTF+I++G NE IE
Sbjct: 273 AKFLSGHSIKIIGWGKENGVPYWLAVNSWSKFWGEQGTFRIIKGRNECGIE 323
>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 246
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 72/108 (66%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y P+ HY K ++ V + +IY++GP+ F+VY DF+ YK+GVYQH G +
Sbjct: 130 YTPSYKQDKHYGKTSYSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSA 189
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G HA+++LGWG EN IPYWL ANSWN WG++G FKILRG N IE
Sbjct: 190 LGGHAIKILGWGEENGIPYWLCANSWNTDWGNNGFFKILRGSNHCGIE 237
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 53/125 (42%), Positives = 79/125 (63%), Gaps = 3/125 (2%)
Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTG 275
A + A+SDR+CI SN + ++SA+ +++C +C GCNGG+P AW FW +G+V+G
Sbjct: 25 AFGASEAMSDRICIHSNAKISVELSAEDLLSCCESCGMGCNGGYPSAAWDFWTKDGLVSG 84
Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
G Y+S GC+PYT+ PCEHHV G +C+ G +TP+C C Y +Y+ D GK
Sbjct: 85 GLYDSHIGCRPYTIPPCEHHVNGSRPSCSGEGG-ETPQCVYRC-EAGYTPSYKQDKHYGK 142
Query: 336 KAHMV 340
++ V
Sbjct: 143 TSYSV 147
>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
Length = 366
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 85/156 (54%), Gaps = 13/156 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R W EC S++ I DQ+ CGSCWA A ISDR CI + G IS ++
Sbjct: 122 IPASFDSRTHWSECKSIKLIRDQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 181
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C +C GC GG+P A R+W GVVTGGDY+ GC+PY +APC NC
Sbjct: 182 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA-GCKPYPIAPCTS------GNCP 234
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ KTP C +C Y + Y D G A+ V
Sbjct: 235 ---ESKTPSCSLSC-QSGYTTAYAKDKHFGTSAYAV 266
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+ V R + +I +GP+ A F+VY DF +YKSGVY+H G ++G HA++++
Sbjct: 258 HFGTSAYAVARKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKII 317
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E+ PYWLVANSW + WG+ G F+I RG+++ IE
Sbjct: 318 GWGTESGSPYWLVANSWGNSWGESGFFRIFRGDDQCGIE 356
>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 354
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 51/85 (60%), Positives = 61/85 (71%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ A F+VY DF YKSGVYQH G +G HA+++LGWGVE YWLVANSWN
Sbjct: 265 EIMTNGPVEADFTVYEDFPTYKSGVYQHTTGGVLGGHAIKILGWGVEEGTKYWLVANSWN 324
Query: 139 DHWGDHGTFKILRGENEADIEMGFN 163
+ WGD+G FKILRG NE IE N
Sbjct: 325 NEWGDNGFFKILRGSNECGIESDIN 349
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 39/83 (46%), Positives = 49/83 (59%), Gaps = 4/83 (4%)
Query: 249 TPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
TP C CNGG+P AW ++ G+VTGG +NS +GCQPY + C+HHV G C G
Sbjct: 166 TPECKHKCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQ--G 223
Query: 308 KLKTPECKQNCYNPSYESTYRFD 330
+ TPECK C SY + Y D
Sbjct: 224 EGPTPECKHKC-EASYSTPYEQD 245
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 28/59 (47%), Positives = 36/59 (61%), Gaps = 2/59 (3%)
Query: 260 PQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
P AW ++ G+VTGG +NS +GCQPY + C+HHV G C G+ TPECK C
Sbjct: 117 PGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQ--GEGPTPECKHKC 173
>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
Length = 321
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/113 (49%), Positives = 74/113 (65%), Gaps = 10/113 (8%)
Query: 57 TSIPLSH--YFKKA--HMVP------RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH 106
T+IP+S Y+ K+ H+ P + ++IY +GP+ FSVY DF+ YKSGVY H
Sbjct: 194 TNIPISSQLYYAKSFSHISPWMFWERVADIQQEIYTNGPVQGGFSVYQDFMNYKSGVYSH 253
Query: 107 NFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G +G HA++++GWGVE + YWLVANSW+ WG GTFKILRG NE IE
Sbjct: 254 KTGSFLGGHAIKIIGWGVEGGVDYWLVANSWSTDWGIDGTFKILRGHNECGIE 306
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 66/133 (49%), Gaps = 26/133 (19%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
GLP NFD+R++W +C + I +Q CGSCWA S + ++SDR CIASNG +S Q +
Sbjct: 85 GLPTNFDSRQQWGKC--IHPIRNQEQCGSCWAFSASESLSDRFCIASNGKVDVILSPQDM 142
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
V+C N GC+GG AW + + G+V + C PY +
Sbjct: 143 VSCDYNDMGCDGGNLDNAWWWMKNKGIVP-------DSCMPY-----------------V 178
Query: 306 LGKLKTPECKQNC 318
G P C NC
Sbjct: 179 SGGGNVPACPSNC 191
>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
marinkellei]
Length = 333
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 80/134 (59%), Gaps = 12/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
L FDA E WP CP++ I DQS+CGSCWAV+ A+A+SDR C G +ISA ++
Sbjct: 92 LEDKFDAAEAWPNCPTITEIRDQSSCGSCWAVAAASAMSDRYCTL-GGVRDLRISAGDLM 150
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCT 304
+C C +GCNGG+P++AW F+ +G+V+ E CQPY C HHV L C+
Sbjct: 151 SCCDVCGYGCNGGFPEVAWVFYVVHGLVS-------EYCQPYPFPSCAHHVNSSDLAPCS 203
Query: 305 LLGKLKTPECKQNC 318
G KTP+C C
Sbjct: 204 --GDYKTPKCNSTC 215
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 54/82 (65%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ +GP F VYADF+ Y GVY+H GD +G HAVR++GWG N PYW +ANSW
Sbjct: 241 RELLLNGPFEVAFEVYADFMAYTGGVYKHVAGDLLGGHAVRLVGWGELNGEPYWKIANSW 300
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WG +G F I RG NE IE
Sbjct: 301 NHEWGMNGYFLIARGVNECGIE 322
>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
Length = 358
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 99/184 (53%), Gaps = 14/184 (7%)
Query: 165 RVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAI 224
R S+E+D+ + + +P +FDAR+KWP C + + DQS+CGS + A
Sbjct: 77 RSHEQSTENDNSQVF-----EEIPNSFDARQKWPSCSQIGAVRDQSDCGSAAHLVAAEIA 131
Query: 225 SDRLCIASNGYFTGQISAQHIVACTP-------NCWGCNGGWPQLAWRFWGHNGVVTGGD 277
SDR CI SNG F +SAQ ++C + WGC+G WP+ ++W +G+ TGG+
Sbjct: 132 SDRTCIFSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGN 191
Query: 278 YNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKK 336
Y+ Q GC+PYT+ PC+ + G TP C++ C N ++ +Y+ D GK
Sbjct: 192 YDDQFGCKPYTIYPCDKKYPNGTTSVPCPG-YHTPVCEERCTSNITWPISYKQDKHFGKA 250
Query: 337 AHMV 340
+ V
Sbjct: 251 HYNV 254
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 62/107 (57%), Gaps = 3/107 (2%)
Query: 56 PTSIPLSHYFKKAHMVP---RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI 112
P S +F KAH + +I +GP++A F +Y DF YKSG+Y H GD
Sbjct: 238 PISYKQDKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQE 297
Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G +++GWGV+N +PYWL + W +G++G +ILRG NE +IE
Sbjct: 298 GGMDTKIIGWGVDNGVPYWLCVHQWGTDFGENGFVRILRGVNEVNIE 344
>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 100/195 (51%), Gaps = 13/195 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-AWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C G G AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W G+VTGG + GCQPY CEH +G C KTP+CKQ C Y++
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228
Query: 327 YRFDLKKGKKAHMVL 341
Y D G + + V+
Sbjct: 229 YEQDKHYGDQRYNVI 243
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 62/82 (75%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+++RG +E IE
Sbjct: 311 NEDWGENGLFRMVRGRDECSIE 332
>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 100/195 (51%), Gaps = 13/195 (6%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-AWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C G G AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDY 170
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
W G+VTGG + GCQPY CEH +G C KTP+CKQ C Y++
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228
Query: 327 YRFDLKKGKKAHMVL 341
Y D G + + V+
Sbjct: 229 YEQDKHYGDQRYNVI 243
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 62/82 (75%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+++RG +E IE
Sbjct: 311 NEDWGENGLFRMVRGRDECSIE 332
>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
Length = 356
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 96/179 (53%), Gaps = 10/179 (5%)
Query: 171 SEDDDLETMGCQNA-KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
S D+ E G N +P +FD+R+KWP C + + DQS+CGS + SDR C
Sbjct: 75 SNDEVSEKTGNDNVLVDIPSSFDSRQKWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTC 134
Query: 230 IASNGYFTGQISAQHIVACTP-------NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
IASNG F +SAQ ++C + WGC+G WP+ ++W +G+ TGG+YN Q
Sbjct: 135 IASNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYNDQF 194
Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
GC+PY++ PC+ + G TP C+++C N ++ Y+ D GK + V
Sbjct: 195 GCKPYSIYPCDKKYANGTTSVPCPG-YHTPTCEEHCTSNITWPIAYKQDKHFGKAHYNV 252
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 61/107 (57%), Gaps = 3/107 (2%)
Query: 56 PTSIPLSHYFKKAHMVP---RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI 112
P + +F KAH + +I +GP++A F +Y DF YK+G+Y H GD
Sbjct: 236 PIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQE 295
Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G +++GWGV+N +PYWL + W +G++G + LRG NE +IE
Sbjct: 296 GGMDTKIIGWGVDNGVPYWLCVHQWGTDFGENGFVRFLRGVNEVNIE 342
>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
Length = 333
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 80/134 (59%), Gaps = 12/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
L FDA E WP+CP++ I DQS+CGSCWAV+ A+A+SDR C G +ISA ++
Sbjct: 92 LQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTL-GGVRDLRISAGDLM 150
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCT 304
+C C +GCNGG+P++AW ++ +G+V+ E CQPY C HHV L C+
Sbjct: 151 SCCDVCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS 203
Query: 305 LLGKLKTPECKQNC 318
G+ TP C C
Sbjct: 204 --GEYDTPTCNSTC 215
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/82 (52%), Positives = 54/82 (65%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ +GP FSVYADF+ Y GVY+H G +G HAVR++GWG N PYW +ANSW
Sbjct: 241 RELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWGELNGEPYWKIANSW 300
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WG +G F I RG +E IE
Sbjct: 301 NHEWGMNGYFLIARGVDECGIE 322
>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
Length = 332
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/161 (41%), Positives = 86/161 (53%), Gaps = 21/161 (13%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR KWP+C S++ I +Q+NCGSCWA A ISDR+CIA+ G IS +V
Sbjct: 87 IPETFDARTKWPKCKSIKLIRNQANCGSCWAFGAAEVISDRICIATKGARQPVISPMDMV 146
Query: 247 ACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL---APCEHHVQGPLQ 301
C C +GC+GG+ A R+W +GVVTGGDY +GC+PY A C V
Sbjct: 147 DCCGEYCGYGCDGGYSIQALRWWVFDGVVTGGDYQG-DGCKPYQFCNSAGCPDAV----- 200
Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVLM 342
TPEC +C Y + Y D G A+ V M
Sbjct: 201 ---------TPECALSC-QSKYNTEYAKDKNFGTSAYYVGM 231
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/103 (48%), Positives = 67/103 (65%), Gaps = 5/103 (4%)
Query: 75 NAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLV 133
NA++ I +GP+ A F VY DF +YKSGVY++ G +G HA++++GWG EN YWL+
Sbjct: 234 NAIQTDIMTNGPVEASFKVYEDFYKYKSGVYKYIAGKMLGGHAIKIIGWGTENGTAYWLI 293
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDL 176
ANSW WG++G FKI RG NE IE N V A ++ D L
Sbjct: 294 ANSWGTKWGENGFFKIRRGVNECGIE----NNVVAGKADVDTL 332
>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
Length = 330
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 86/156 (55%), Gaps = 13/156 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R W EC S++ I +Q+ CGSCWA A ISDR CI + G IS ++
Sbjct: 86 IPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 145
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C +C GC GG+P A R+W GVVTGGDY+ GC+PY +APC +C
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA-GCKPYPIAPCTS------GSCP 198
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ KTP C +C P Y + Y D G A+ V
Sbjct: 199 ---ESKTPACSLSC-QPGYTTAYAKDKHFGTSAYAV 230
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 67/99 (67%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+ V + + +I +GP+ A F+VY DF +YKSGVY+H G ++G HA++++
Sbjct: 222 HFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKII 281
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E+ PYWLVANSW WG+ G FKI RG+++ IE
Sbjct: 282 GWGTESGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIE 320
>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 398
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/127 (44%), Positives = 71/127 (55%), Gaps = 9/127 (7%)
Query: 182 QNAKGLPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI 240
+ K LP +FDAR +P+C + H+ DQS CG CWA V A +DRLCI SNG FT +
Sbjct: 135 EELKDLPTDFDARTAFPKCSKVIGHVRDQSACGDCWAFGVTEAFNDRLCIKSNGTFTKLL 194
Query: 241 SAQHIVACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDY------NSQEGCQPYTLAPC 292
SA + AC P+ GC GG+P AW + G+ TGGDY +GC PY PC
Sbjct: 195 SAGEMNACAPSLKDPGCRGGFPYSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPYDFPPC 254
Query: 293 EHHVQGP 299
H + P
Sbjct: 255 AHFFKDP 261
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 49/109 (44%), Positives = 65/109 (59%), Gaps = 8/109 (7%)
Query: 52 RLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
R ++ S+P ++F +A I GP+ A F VY DFL YKSGVY+H G
Sbjct: 291 RYFMVESVP--YHFSAD------DAKNAIRTDGPVSATFYVYEDFLAYKSGVYKHTSGSL 342
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
+G HAV+++GWG + YWLV NSWN+ WGDHG FKI G+ D E+
Sbjct: 343 LGAHAVKIIGWGEDGGEAYWLVVNSWNEGWGDHGLFKIALGDCGIDNEL 391
>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 80/134 (59%), Gaps = 11/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRYCTV-GGVQQLRISAAHLL 148
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+C +C +GC+GG+P AW ++ +G+ + CQPY C HH +G C+
Sbjct: 149 SCCKDCGYGCDGGYPGTAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201
Query: 305 LLGKLKTPECKQNC 318
TP+C C
Sbjct: 202 KY-DFHTPKCNTTC 214
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 60/82 (73%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP V F VY+DFL YK+GVY+H GD +G HAVR++GWG N PYW +ANSW
Sbjct: 240 RELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSW 299
Query: 138 NDHWGDHGTFKILRGENEADIE 159
+ WG +G F ILRG++E IE
Sbjct: 300 DTDWGMNGHFLILRGKDECGIE 321
>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 59/130 (45%), Positives = 79/130 (60%), Gaps = 12/130 (9%)
Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
FDA E WPECP++ I DQS+CGSCWAV+ A+AISDR C G +ISA +++C
Sbjct: 1 FDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTL-GGVRDLRISAGDLMSCCD 59
Query: 251 NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCTLLGK 308
C +GCNGG+P++AW ++ +G+V+ E CQPY C HHV L C+ G+
Sbjct: 60 VCGFGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS--GE 110
Query: 309 LKTPECKQNC 318
TP C C
Sbjct: 111 YDTPTCNSTC 120
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 33/61 (54%), Positives = 42/61 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ +GP FSVYADF+ Y GVY+H G +G HAVR++GWG N PYW +ANSW
Sbjct: 146 RELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWGELNGEPYWKIANSW 205
Query: 138 N 138
N
Sbjct: 206 N 206
>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
Length = 330
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 69/99 (69%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+ V R A Q I +GP+ A F+VY DF +YKSGVY+H G ++G HA++++
Sbjct: 222 HFGASAYAVARSVAAIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKII 281
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E+ PYWLVANSW +WG+ G FKILRG+++ IE
Sbjct: 282 GWGTESGSPYWLVANSWGTNWGESGFFKILRGDDQCGIE 320
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 86/156 (55%), Gaps = 13/156 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R +W EC S++ I +Q+ CGSCWA A ISDR CI + G IS ++
Sbjct: 86 IPASFDSRTQWSECKSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQPIISPDDLL 145
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C +C GC GG+P A R+W GVVTGGDY+ GC+PY +APC NC
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA-GCKPYPIAPCTS------GNCP 198
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ KTP C +C Y + Y D G A+ V
Sbjct: 199 ---ESKTPACSLSC-QSGYSTAYAKDKHFGASAYAV 230
>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 80/134 (59%), Gaps = 11/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHCTV-GGVQQLRISAAHLL 148
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+C +C GC+GG+P AWR++ +G+ + CQPY C HH +G C+
Sbjct: 149 SCCKDCGDGCDGGYPDAAWRYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201
Query: 305 LLGKLKTPECKQNC 318
TP+C C
Sbjct: 202 KY-DFHTPKCNTTC 214
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 45/83 (54%), Positives = 58/83 (69%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP V F V++DFL YK+GVY+H GD +G HAVR++GWG N PYW +ANSW
Sbjct: 241 RELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIANSW 300
Query: 138 NDHWGDHGTFKILRGENEADIEM 160
+ WG +G F LRG NE IE
Sbjct: 301 DTDWGMNGHFLFLRGNNECGIEF 323
>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
Length = 347
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/144 (45%), Positives = 80/144 (55%), Gaps = 6/144 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTG-QISAQHI 245
LP FDARE WPEC ++ I DQS CGSCWA + A+SDR+CI SN Q+SA +
Sbjct: 86 LPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATDL 145
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNC 303
+AC C +GC GGW +AW +W NG+VTGG+Y C PY PC HH +G
Sbjct: 146 LACCTTCGFGCVGGWGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPP 205
Query: 304 TLLGKLKTPECKQNC---YNPSYE 324
TP+C C Y YE
Sbjct: 206 CPEKMYSTPQCVSECQKGYATKYE 229
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
++I+ GP+ A +VY DF Y GVY+H G+ +G HA+R+LGWGVE D PYWL ANS
Sbjct: 250 KEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAANS 309
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEAN 169
WN WG+ G F+ILRG + IE + + N
Sbjct: 310 WNPSWGEKGFFRILRGSDHCGIESDVSAGLPVN 342
>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
Length = 396
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/187 (36%), Positives = 97/187 (51%), Gaps = 18/187 (9%)
Query: 158 IEMGFNNRVEANSSE-DDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCW 216
+++ F E+ SE DDLE + LP FD+R +WP C S++ I DQ+ CGSCW
Sbjct: 58 MDVRFAEVPESEKSEKSDDLEF---ETLIQLPTAFDSRVQWPNCNSIKLIRDQTYCGSCW 114
Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFWGHNGVVT 274
A + A ISDR+CI SNG IS + I++C + GC GG+ A ++W ++GVVT
Sbjct: 115 AFAAAEIISDRICIQSNGTQQPIISPEDILSCCGSSCNNGCQGGYTIEAMKYWMNSGVVT 174
Query: 275 GGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKG 334
GGDY GC PY+ PC T P CK C SY++ + L
Sbjct: 175 GGDYQGA-GCIPYSFRPCS----------TCKEPKDAPSCKTTC-QASYKAKSAYRLPTT 222
Query: 335 KKAHMVL 341
++ ++
Sbjct: 223 TSSNAIV 229
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 66/112 (58%), Gaps = 4/112 (3%)
Query: 48 KKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN 107
K K LPT+ + A + + +IY +GP+ + VY DF YKSGVY H
Sbjct: 212 KAKSAYRLPTTTSSNAIVANAVQMIQ----TEIYNNGPVEVAYQVYDDFYHYKSGVYYHV 267
Query: 108 FGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+GD HAV+++GWG E + YWLVANSW+ +G++G FKI RG NE IE
Sbjct: 268 YGDKPSGHAVKIIGWGTEKKVDYWLVANSWSTTFGENGFFKIRRGTNECGIE 319
>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
Length = 348
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 88/149 (59%), Gaps = 7/149 (4%)
Query: 152 GENEADIE--MGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPS-LRHIAD 208
G N DI+ +GF + + D ++T + AK +P +FDAREKWPEC + I D
Sbjct: 42 GTNSLDIKSRLGF---LGLHPDPDYKIQTKHHKIAKSIPESFDAREKWPECKDVIGKIRD 98
Query: 209 QSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFW 267
Q CGSCWA + ++DRLCI + G S ++++ C +C C GG+ AW ++
Sbjct: 99 QGTCGSCWAFASTEVMTDRLCIGTKGETKFVFSPENLLTCCEDCRLECVGGYTAKAWDYY 158
Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
+ G+V+GGDYNS EGCQPY+ A ++ V
Sbjct: 159 INEGIVSGGDYNSSEGCQPYSKASFQYAV 187
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 49/82 (59%), Gaps = 10/82 (12%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP++A F+V+ D + YKSG+ N V +L WG E +PYWL+ANSW
Sbjct: 227 EILTNGPVMATFNVFEDIIYYKSGIQLSN---------VSILRWGTEEGVPYWLIANSWG 277
Query: 139 DHWGDHGTF-KILRGENEADIE 159
WGD G F KI RG NE IE
Sbjct: 278 TWWGDLGGFIKIKRGTNECAIE 299
>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
Length = 228
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 50/83 (60%), Positives = 63/83 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M++I +GP+ A F VY DFL YKSGVY H+ G +G HA+R+LGWG EN + YWL+ANS
Sbjct: 130 MKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANS 189
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WND WG+ G FK+LRG+NE IE
Sbjct: 190 WNDGWGEDGYFKMLRGKNECGIE 212
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 63/125 (50%), Gaps = 4/125 (3%)
Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTG 275
A A+SDRLCI +NG FT +ISA +++C C +GC GG+P AW FW G+VTG
Sbjct: 1 AFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGYCGFGCQGGFPPTAWDFWQTEGIVTG 60
Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
G + GC+ Y C HH C+ TP C Q C P ++ Y D +
Sbjct: 61 GSKENPTGCRSYPFPRCSHHGSKKYPPCSHR-IYDTPNCVQKCDTP--DTDYATDKTRAN 117
Query: 336 KAHMV 340
+ V
Sbjct: 118 ITYNV 122
>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 250
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/86 (56%), Positives = 65/86 (75%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A V++DFL YKSGVY+H G + +H+VR++GWG+ENDIPYWL ANSW
Sbjct: 157 KEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIENDIPYWLCANSW 216
Query: 138 NDHWGDHGTFKILRGENEADIEMGFN 163
N+ WG +G FKILRG NE +IE N
Sbjct: 217 NEDWGLNGYFKILRGSNECEIESFVN 242
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 60/123 (48%), Gaps = 6/123 (4%)
Query: 216 WAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTG 275
WAV+ A +ISDR CI +NG Q+SA +++C+ N GC G+ + +W +W NG+VTG
Sbjct: 30 WAVASAASISDRTCIQTNGTMKVQLSAIELISCSKNKLGCQIGFSEFSWDYWLKNGLVTG 89
Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
GC PY C+H C + P C + C Y Y+ D G+
Sbjct: 90 ----DPTGCLPYPFPKCDHRSSNSYPKCGYI-TYTAPPCTKTC-RSGYPIPYKADKHYGR 143
Query: 336 KAH 338
+
Sbjct: 144 VIY 146
>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 347
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/144 (45%), Positives = 80/144 (55%), Gaps = 6/144 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTG-QISAQHI 245
LP FDARE WPEC ++ I DQS CGSCWA + A+SDR+CI SN Q+SA +
Sbjct: 86 LPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATDL 145
Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNC 303
+AC C +GC GGW +AW +W NG+VTGG+Y C PY PC HH +G
Sbjct: 146 LACCTTCGFGCVGGWGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPP 205
Query: 304 TLLGKLKTPECKQNC---YNPSYE 324
TP+C C Y YE
Sbjct: 206 CPEKMYSTPQCVSECQKGYATKYE 229
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
++I+ GP+ A +VY DF Y GVY+H G+ +G HA+R+LGWGVE D PYWL ANS
Sbjct: 250 KEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAANS 309
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEAN 169
WN WG+ G F+ILRG + IE + + N
Sbjct: 310 WNPSWGEKGFFRILRGSDHCGIESDVSAGLPVN 342
>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
Length = 182
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 82/134 (61%), Gaps = 2/134 (1%)
Query: 37 KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYAD 95
K+ K KK ++ +P + L H + + +RQ IY +GP+ F+VY D
Sbjct: 49 KEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYED 108
Query: 96 FLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANSWNDHWGDHGTFKILRGEN 154
F+ Y++GVY+H G ++G HA+R+LGWGV+N +IPYWLVANSWN WG G FKILRG +
Sbjct: 109 FIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSD 168
Query: 155 EADIEMGFNNRVEA 168
E IE N + A
Sbjct: 169 ECGIEGQINAGLPA 182
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 39/70 (55%), Gaps = 3/70 (4%)
Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
G+V+GG Y S GC PY +APCEHHV G C G KTP C + C Y+ Y D
Sbjct: 16 GIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGG--KTPTCVKKC-EEGYKVPYAQD 72
Query: 331 LKKGKKAHMV 340
L GK A+ +
Sbjct: 73 LHHGKSAYSI 82
>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/130 (44%), Positives = 79/130 (60%), Gaps = 12/130 (9%)
Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
FDA E WP+CP++ I DQS+CGSCWAV+ A+AISDR C G +ISA +++C
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTL-GGVRDLRISAGDLMSCCD 59
Query: 251 NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCTLLGK 308
C +GCNGG+P++AW ++ +G+V+ E CQPY C HHV L C+ G+
Sbjct: 60 VCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS--GE 110
Query: 309 LKTPECKQNC 318
TP C C
Sbjct: 111 YDTPTCNSTC 120
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 34/61 (55%), Positives = 42/61 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ +GP FSVYADFL Y GVY+H G +G HAVR++GWG N PYW +ANSW
Sbjct: 146 RELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWGELNGEPYWKIANSW 205
Query: 138 N 138
N
Sbjct: 206 N 206
>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/130 (44%), Positives = 79/130 (60%), Gaps = 12/130 (9%)
Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
FDA E WP+CP++ I DQS+CGSCWAV+ A+AISDR C G +ISA +++C
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTL-GGVRDLRISAGDLMSCCD 59
Query: 251 NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCTLLGK 308
C +GCNGG+P++AW ++ +G+V+ E CQPY C HHV L C+ G+
Sbjct: 60 VCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS--GE 110
Query: 309 LKTPECKQNC 318
TP C C
Sbjct: 111 YDTPTCNSTC 120
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/61 (55%), Positives = 42/61 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ +GP FSVYADFL Y GVY+H G +G HAVR++GWG N PYW +ANSW
Sbjct: 146 RELLLNGPFEVSFSVYADFLAYTGGVYKHVAGIFLGGHAVRIVGWGELNGEPYWKIANSW 205
Query: 138 N 138
N
Sbjct: 206 N 206
>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 333
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 57/154 (37%), Positives = 81/154 (52%), Gaps = 6/154 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD+RE+W +CPS+ I DQS C S WAV+ A +ISDR CI +NG Q+SA ++
Sbjct: 84 LPDYFDSREQWKDCPSINIIHDQSKCDSGWAVASAASISDRTCIQTNGTMKVQLSAIELI 143
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
+C+ N GC G+ + +W +W NG+VTG GC PY C+H C +
Sbjct: 144 SCSKNKLGCQIGFSEFSWDYWLKNGLVTG----DPTGCLPYPFPKCDHRSSNSYPKCGYI 199
Query: 307 GKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
P C + C Y Y+ D G+ + +
Sbjct: 200 -TYTAPPCTKTC-RSGYPIPYKADKHYGRVIYSL 231
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 49/86 (56%), Positives = 65/86 (75%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A V++DFL YKSGVY+H G + +H+VR++GWG+ENDIPYWL ANSW
Sbjct: 240 KEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIENDIPYWLCANSW 299
Query: 138 NDHWGDHGTFKILRGENEADIEMGFN 163
N+ WG +G FKILRG NE +IE N
Sbjct: 300 NEDWGLNGYFKILRGSNECEIESFVN 325
>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 145
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 64/91 (70%)
Query: 69 HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI 128
+ +P AM++IYE+GP+ A F +Y DF+ Y+SGVY N G + AV++LGWG EN
Sbjct: 46 YRIPGYTAMKEIYENGPITASFYMYQDFVNYQSGVYAFNSGKYVTTQAVKILGWGEENGT 105
Query: 129 PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
PYWL ANS+N +WGD+G KILRG NE IE
Sbjct: 106 PYWLAANSFNTYWGDNGFVKILRGANECYIE 136
>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
Length = 350
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 80/156 (51%), Gaps = 13/156 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R W C S+ ++ DQS CGSCWAVS A+ +SDR+C+ + G +S I+
Sbjct: 94 IPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDIL 153
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C GC GG+ LAW + GVVTGG Y + C+PY PC H G +C
Sbjct: 154 SCCGRMCGDGCEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLH-HGRRYDCP 212
Query: 305 LLGKLKTPECKQNC---YNPSYE-------STYRFD 330
TP CK C Y YE STY D
Sbjct: 213 WDHSFSTPACKPYCQFGYGKRYEKDKFFVKSTYILD 248
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 34/65 (52%), Positives = 44/65 (67%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ ++GP+ A F Y DF YK G+Y H G G HAV+++GWGVEN YW VANSW
Sbjct: 256 REMMKNGPVQAAFITYEDFSPYKGGIYVHVKGRERGAHAVKLIGWGVENGTKYWTVANSW 315
Query: 138 NDHWG 142
+D WG
Sbjct: 316 HDDWG 320
>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 59/138 (42%), Positives = 79/138 (57%), Gaps = 10/138 (7%)
Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FD+R WPEC + I DQS CGSCWA + A+SDR+CI SN +S+Q +
Sbjct: 81 IPESFDSRTAWPECTQIIGMIRDQSRCGSCWAFAAVEAMSDRICIHSNATKKLLVSSQDL 140
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNS-QEGCQPYTLAPCEHHVQGPLQNCT 304
+ C GCNGGWP +AW W NG+VTGG Y + ++GC+ Y L C+ H C
Sbjct: 141 LTCG-TAGGCNGGWPAVAWSDW-TNGIVTGGLYGALEQGCKSYFLEGCDDHP----NKCR 194
Query: 305 LLGKLKTPECKQNCYNPS 322
+ TP C + C PS
Sbjct: 195 --NYVSTPACVEQCDEPS 210
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 43/81 (53%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ A VY DF QY+SG+YQ + G HAV++LGWGVE+ + YWLVANSWN
Sbjct: 235 EIMTNGPVEATMDVYVDFAQYQSGIYQLTTDEYEGGHAVKILGWGVEDGVKYWLVANSWN 294
Query: 139 DHWGDHGTFKILRGENEADIE 159
+ WG++G F+I+RG +E IE
Sbjct: 295 ERWGENGLFRIIRGRDEVGIE 315
>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
Length = 330
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 58/140 (41%), Positives = 79/140 (56%), Gaps = 13/140 (9%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ K LP FDAR++W +C S++ I DQS CGSCWAVS A+ +SDR+CI S+ +ISA
Sbjct: 77 DGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRISA 136
Query: 243 QHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG 298
++ C +C GC+GG P + W +G V+GG+YNS GC Y L C
Sbjct: 137 ADMIECCESCTFSVDGCHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPRCN----- 191
Query: 299 PLQNCTLLGKLKTPECKQNC 318
+C L P CK+ C
Sbjct: 192 --PSCKTL--YDAPTCKKEC 207
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 73/103 (70%), Gaps = 7/103 (6%)
Query: 63 HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--IGLHAV 117
HY K+A+ + +I ++GP+VA F+VYADF+ Y SGVY+ + G+S +G HAV
Sbjct: 220 HYAKQAYRIMSKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFD-GESKLLGGHAV 278
Query: 118 RVLGWGVENDI-PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
R++GWG+EN PYWLV+NSWN+ WGD G FKI RG+NE IE
Sbjct: 279 RIIGWGIENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIE 321
>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
Length = 195
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 58/131 (44%), Positives = 79/131 (60%), Gaps = 7/131 (5%)
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWG--CNGGWPQLAWRFWGH 269
CGSCWA AISDR+CI +N + ++SA+ ++ C + G CNGG+P AW FW
Sbjct: 1 CGSCWAFGAVEAISDRICIHTN--VSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 58
Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRF 329
G+V+GG Y S GC+PY++ PCEHHV G CT G+ TP+C + C P Y TY+
Sbjct: 59 KGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKCSKIC-EPGYSPTYKQ 115
Query: 330 DLKKGKKAHMV 340
D G ++ V
Sbjct: 116 DKHYGYDSYSV 126
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 48/87 (55%), Positives = 60/87 (68%), Gaps = 2/87 (2%)
Query: 54 YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y PT HY ++ V + M +IY++GP+ FSVY+DFL YKSGVYQH G+
Sbjct: 109 YSPTYKQDKHYGYDSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 168
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWN 138
+G HA+R+LGWGVEN PYWLVANSWN
Sbjct: 169 MGGHAIRILGWGVENGTPYWLVANSWN 195
>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
Length = 205
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I HGP+ F+VY DF QY +GVY H G S+G HAV++LGWGV+N PYWLVANSWN
Sbjct: 110 EILAHGPIEVAFTVYEDFYQYTTGVYVHTAGKSLGGHAVKILGWGVDNGTPYWLVANSWN 169
Query: 139 DHWGDHGTFKILRGENEADIE 159
+WG+ G F+I+RG NE IE
Sbjct: 170 VNWGEKGYFRIIRGLNECGIE 190
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 50/92 (54%), Gaps = 1/92 (1%)
Query: 250 PNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKL 309
P+ C GG+P AW++W +G+VTGG Y SQ GC+PY++APC V G
Sbjct: 9 PSFSSCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTE 68
Query: 310 KTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
TP+C + C N +Y + Y D G A+ V
Sbjct: 69 PTPKCVEACTSNNTYPTGYLQDKHFGATAYAV 100
>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
Length = 348
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 51/115 (44%), Positives = 77/115 (66%), Gaps = 3/115 (2%)
Query: 48 KKKKRLYLPTSIPLSHYFKKAHMVPRCNA---MRQIYEHGPLVAIFSVYADFLQYKSGVY 104
K++ L P S P Y+ K+ + + + R+I ++GP+VA F+VY DF YKSG+Y
Sbjct: 222 KRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKNGPVVASFAVYEDFRHYKSGIY 281
Query: 105 QHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+H G+ G HAV+++GWG EN+ +WL+ANSW+ WG+ G F+I+RG+NE IE
Sbjct: 282 KHTAGELRGYHAVKIIGWGKENNTDFWLIANSWHQDWGEKGYFRIVRGKNECGIE 336
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 67/168 (39%), Positives = 87/168 (51%), Gaps = 13/168 (7%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
A +P +FD R W C SL I DQ+ CGSCWAVS A +SDR+C+ SN IS
Sbjct: 81 ALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKACISDT 139
Query: 244 HIVACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQ--- 297
I++C C +GCNGG+P AWR + G TGG + GC+PY P H++
Sbjct: 140 DILSCCGLYCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRND 199
Query: 298 -GPLQNCTL----LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
P N T +G TP CK+ C Y +Y D GK A++V
Sbjct: 200 YAPCPNDTYYGECVGMADTPRCKRRCL-LGYPKSYPSDRYYGKSAYIV 246
>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 337
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 69/174 (39%), Positives = 96/174 (55%), Gaps = 9/174 (5%)
Query: 171 SEDDDLETMGCQ-NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
SE D L T + + LP ++D + W EC S+ I DQSNCGSCWA+S A+A SDRLC
Sbjct: 69 SEKDILLTYDVSIDLESLPESYDITQTWSECKSVVSIRDQSNCGSCWALSTASAFSDRLC 128
Query: 230 IASNGYFTGQISAQHIVACTPNCWGCNGGW--PQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
I SN +S ++I +C G P+ AW++ NG+ TGG+Y S EGCQPY
Sbjct: 129 ITSNMGVNKVLSGEYINSCCNGKCGNGCNGGHPEKAWKYIKKNGLCTGGEYGSNEGCQPY 188
Query: 288 TLAPCEHHVQGPLQNCTLLGKLKTPEC-KQNCYNPSYESTYRFDLKKGKKAHMV 340
++ PC + +C+ + TP+C K C N +YE+ DL K + V
Sbjct: 189 SIVPCPRNA----NSCSKENE-DTPQCYKDQCTNNNYETPLVSDLYYAYKVYSV 237
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 42/83 (50%), Positives = 57/83 (68%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +++++GP+VA VY DFL YK G+YQ+ G G HAV+++GWG ++ I YWL AN+
Sbjct: 245 MSEVFKNGPVVAAMKVYDDFLCYKGGIYQYTTGGLKGDHAVKIMGWGEDDGIDYWLCANT 304
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W + WG G FKI RG NE IE
Sbjct: 305 WGNSWGMGGMFKIRRGRNECGIE 327
>gi|161343847|tpg|DAA06104.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 187
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 68/101 (67%), Gaps = 1/101 (0%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDAR KW EC S+ HI +Q NC + WA+SV +AI+DR+CI S T S Q ++
Sbjct: 87 MPETFDARNKWFECVSIAHIWNQGNCAADWAISVTSAINDRICIKSKKNITAFYSPQKML 146
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQP 286
+C +C GCNGG+ AW++W G+VTGGDY S EGCQP
Sbjct: 147 SCCDDCGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQP 187
>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 77/139 (55%), Gaps = 13/139 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP CP++R IADQS C + WAVS A+AISDR C G +ISA H++
Sbjct: 91 LPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGK-QLRISAAHLL 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG+P AWR++ G+ + CQPY CEH QG N T
Sbjct: 150 SCCKDCGDGCKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEH--QGAQGNKTP 200
Query: 306 LGK--LKTPECKQNCYNPS 322
K TP+C C + S
Sbjct: 201 CSKYNFDTPKCNATCTDKS 219
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 61/93 (65%), Gaps = 1/93 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP VA+F VY D YKSGVY++ GD +G AV+V+GWG N PYW VANSW
Sbjct: 242 RELYFNGPFVAVFYVYTDLFAYKSGVYRNVDGDFLGGTAVKVVGWGKLNGTPYWKVANSW 301
Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
+ WG G ILRG NE +IE +GF E +
Sbjct: 302 DTDWGMDGYLLILRGNNECNIEHLGFAGTPETS 334
>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
Length = 253
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 52/120 (43%), Positives = 72/120 (60%), Gaps = 2/120 (1%)
Query: 200 CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NCWGCNGG 258
CPSL+ I DQ+NCGSCWA A++DR+CIASNG T +SAQ + +C GCNGG
Sbjct: 1 CPSLKEIRDQANCGSCWAFGSTEAMTDRMCIASNGTVTTHLSAQDVTSCDKLGDMGCNGG 60
Query: 259 WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
P + +W +G+V GG+Y + GC Y L PC HHV + +++ P+C + C
Sbjct: 61 IPSSVYSYWALSGIVDGGNYGDKSGCWSYQLEPCAHHVNSS-KYPACPDEVRAPKCARKC 119
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 42/82 (51%), Positives = 57/82 (69%), Gaps = 1/82 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNF-GDSIGLHAVRVLGWGVENDIPYWLVANSW 137
IY++GP+ +F V DFL YKSGVY+ +G HA++++G+G E+ YWLVANSW
Sbjct: 156 DIYQNGPITGMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIMGFGTEDGKDYWLVANSW 215
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WGD G FKI+RG+N IE
Sbjct: 216 NEDWGDDGYFKIIRGKNACQIE 237
>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
Length = 181
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 61/82 (74%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 90 KEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIANSW 149
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG +E IE
Sbjct: 150 NEDWGEKGLFRIVRGRDECSIE 171
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 2/71 (2%)
Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
G+VTGG + GCQPY CEH +G C KTP+CKQ C Y++ Y D
Sbjct: 14 GIVTGGSKENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQKC-QKGYKTPYEQD 71
Query: 331 LKKGKKAHMVL 341
G + + V+
Sbjct: 72 KNYGDQRYNVI 82
>gi|325303156|tpg|DAA34330.1| TPA_inf: cysteine proteinase cathepsin L [Amblyomma variegatum]
Length = 207
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 67/105 (63%), Gaps = 4/105 (3%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG---YFTGQIS 241
LP NFDARE+WP+CP++ I DQ +CGSCWA A+SDR CI S ++
Sbjct: 103 TALPENFDAREQWPDCPTIGEIRDQGSCGSCWAFGAVEAMSDRTCIHSPARKPRVNVHLA 162
Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQ 285
A +++C +C GCNGG+P AW +W H+G+V GG Y++ EGC
Sbjct: 163 ADDVLSCCKDCGAGCNGGFPGAAWSYWVHHGIVDGGHYDTDEGCM 207
>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
Length = 353
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 83/156 (53%), Gaps = 11/156 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR+KWP+CPSLR I +Q CGSCWA+S A A +DR CI S + T + ++
Sbjct: 98 LPEQFDARDKWPQCPSLREIRNQGCCGSCWAISAAEAFTDRWCIHSPEHTTFSFGSFDLI 157
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG AW +W GV +GG YNS++GC Y C P ++
Sbjct: 158 SCCHSCGDGCQGGVLGPAWDYWVQKGVSSGGPYNSKQGCHSYPFDTC----HSPDED--- 210
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
P+C + C + D + G+ A+ V+
Sbjct: 211 ---DDAPKCSRKCQSSYSVQDVSKDRRFGRVAYSVV 243
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/83 (57%), Positives = 59/83 (71%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I+ +GP+ A F VY DF YKSGVY+H G G HA+++LGWGVEN YWL +NS
Sbjct: 250 MEEIFVNGPVQAAFQVYLDFKTYKSGVYRHVTGPLEGGHAIKILGWGVENGTKYWLCSNS 309
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W + WGDHG FKI+RGEN IE
Sbjct: 310 WGEDWGDHGFFKIVRGENHLGIE 332
>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 217
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/117 (47%), Positives = 72/117 (61%), Gaps = 4/117 (3%)
Query: 225 SDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
SDR+CI + G ISA+ ++ C +C GCNGG+P AW+F+ G+VTGG Y +++G
Sbjct: 1 SDRICIHTKGKVQVNISAEDLLTCCDSCGSGCNGGYPSAAWQFYKDEGIVTGGLYGTEDG 60
Query: 284 CQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
CQPY PCEHH GPL NCT G TPEC + C YE +Y D GKK + +
Sbjct: 61 CQPYYFPPCEHHTVGPLPNCT--GIKPTPECAKTC-REGYEKSYTRDKHFGKKVYSI 114
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 52/106 (49%), Positives = 70/106 (66%), Gaps = 2/106 (1%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ KK + + +I ++GP+ A F+VYADF YKSGVYQ + + +G HA+R+L
Sbjct: 106 HFGKKVYSISSDETQIKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRIL 165
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
GWG E+ +PYWLVANSWN+ WGD G FKI RG +E IE N +
Sbjct: 166 GWGTEDGVPYWLVANSWNEDWGDKGYFKIRRGNDECGIENDINAGI 211
>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
kowalevskii]
Length = 93
Score = 110 bits (275), Expect = 1e-21, Method: Composition-based stats.
Identities = 49/83 (59%), Positives = 63/83 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I ++GP+ F+VYADF YKSGVYQH G+++G HA+++LGWG E+ YWLVANS
Sbjct: 1 MAEIQKYGPVEGAFTVYADFPSYKSGVYQHETGEALGGHAIKILGWGNEDGHDYWLVANS 60
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN+ WGD G FKILRG +E IE
Sbjct: 61 WNEDWGDQGFFKILRGVDECGIE 83
>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
Length = 355
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 97/179 (54%), Gaps = 10/179 (5%)
Query: 170 SSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
S ED + E + +P +FD+R++WPEC + + DQS+CGS + SDR C
Sbjct: 75 SHEDQETEN-SAEVLINIPASFDSRQQWPECTQIGAVRDQSDCGSAAHLVAVEMASDRTC 133
Query: 230 IASNGYFTGQISAQHIVACTP-------NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
I+SNG F +SAQ ++C + WGC+G WP+ ++W +G+ TGG+Y+ Q
Sbjct: 134 ISSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQF 193
Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
GC+PY++ PC+ + + G TP C+ +C N ++ Y+ D GK + V
Sbjct: 194 GCKPYSIYPCDKNYPNGTTSVPCPG-YHTPPCEDHCTSNITWPIAYKQDKHFGKAHYNV 251
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 62/107 (57%), Gaps = 3/107 (2%)
Query: 56 PTSIPLSHYFKKAHMVP---RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI 112
P + +F KAH + +I +GP++A F +Y DF YKSG+Y H GD
Sbjct: 235 PIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFIIYEDFWDYKSGIYVHTAGDQE 294
Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G +++GWGV+N +PYWL + W +G++G +ILRG NE +IE
Sbjct: 295 GGMDTKIIGWGVDNGVPYWLCVHQWGTDFGENGFVRILRGVNEVNIE 341
>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/135 (44%), Positives = 75/135 (55%), Gaps = 13/135 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP CP++R IADQS C + WAVS A+AISDR C G +ISA H++
Sbjct: 91 LPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGK-QLRISAAHLL 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG+P AWR++ G+ + CQPY CEH QG N T
Sbjct: 150 SCCKDCGDGCKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEH--QGAQGNKTP 200
Query: 306 LGK--LKTPECKQNC 318
K TP+C C
Sbjct: 201 CSKYNFDTPKCNATC 215
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/93 (52%), Positives = 62/93 (66%), Gaps = 1/93 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP VA+F VY D YKSGVY+H GD +G AV+V+GWG N PYW +ANSW
Sbjct: 242 RELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSW 301
Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
+ WG G ILRG NE +IE +GF EA+
Sbjct: 302 DTDWGMGGYLLILRGNNECNIEHLGFAGTPEAS 334
>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
Length = 279
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 86/156 (55%), Gaps = 7/156 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR+FDAR W C ++R I D+S C + WA++ ++ISDR+CI SNG + Q+SA+ +
Sbjct: 28 IPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRISVQLSARDAI 87
Query: 247 AC--TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C +P GC G +W G+VTGG Y Q GCQPY L C +H + +C
Sbjct: 88 SCGFSP---GCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFLDCN 144
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ P+C C + Y TY D G++ + V
Sbjct: 145 -NNTFEFPQCTNECQD-GYNKTYDDDKFYGERIYNV 178
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/86 (46%), Positives = 53/86 (61%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGVENDIPYWLV 133
+ ++I +GP++A SV DFL YKSGVY ++G +R++GWG E IPYWL
Sbjct: 184 DIQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIPYWLC 243
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
ANSWN+ WG +G KI RG IE
Sbjct: 244 ANSWNEEWGANGYVKIQRGVQAGYIE 269
>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/135 (44%), Positives = 75/135 (55%), Gaps = 13/135 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP CP++R IADQS C + WAVS A+AISDR C G +ISA H++
Sbjct: 91 LPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGK-QLRISAAHLL 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG+P AWR++ G+ + CQPY CEH QG N T
Sbjct: 150 SCCKDCGDGCKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEH--QGAQGNKTP 200
Query: 306 LGK--LKTPECKQNC 318
K TP+C C
Sbjct: 201 CSKYNFDTPKCNATC 215
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/93 (52%), Positives = 62/93 (66%), Gaps = 1/93 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP VA+F VY D YKSGVY+H GD +G AV+V+GWG N PYW +ANSW
Sbjct: 242 RELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSW 301
Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
+ WG G ILRG NE +IE +GF EA+
Sbjct: 302 DTDWGMGGYLLILRGNNECNIEHLGFAGTPEAS 334
>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
Length = 381
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 73/108 (67%), Gaps = 2/108 (1%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
Y T + H+ + A+ VPR + +++ GP+ A F+VY DF+QYKSGVY+H +G
Sbjct: 263 YNTTYLEDKHFGRVAYSVPRDEDRILYELFYFGPVQASFTVYTDFIQYKSGVYRHTYGVR 322
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G H+V+++GWGVEN +WL ANSW WG++G FKI+RGE+ +E
Sbjct: 323 VGDHSVKIVGWGVENGTKFWLCANSWGAEWGENGFFKIIRGEDHLSVE 370
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/156 (35%), Positives = 78/156 (50%), Gaps = 12/156 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
P +FDAR+KW CPS+ I +Q C S +AV+ I+DR CI S G A ++
Sbjct: 135 FPESFDARQKWSFCPSVGTIRNQGCCASSYAVAAVATITDRWCIHSEGKSQFSFGAYDVL 194
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE-HHVQGPLQNCT 304
+C C +GC+GG P W +W NG+ +GG Y S EGCQ Y C+ + P +
Sbjct: 195 SCCHRCGFGCDGGVPSAVWHYWVENGITSGGAYESHEGCQSYPFGVCKPQEIFAPHVDLI 254
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C + C P Y +TY D G+ A+ V
Sbjct: 255 ---------CLRQC-QPGYNTTYLEDKHFGRVAYSV 280
>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/135 (44%), Positives = 75/135 (55%), Gaps = 13/135 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP CP++R IADQS C + WAVS A+AISDR C G +ISA H++
Sbjct: 91 LPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGK-QLRISAAHLL 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG+P AWR++ G+ + CQPY CEH QG N T
Sbjct: 150 SCCKDCGDGCKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEH--QGAQGNKTP 200
Query: 306 LGK--LKTPECKQNC 318
K TP+C C
Sbjct: 201 CSKYNFDTPKCNATC 215
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/93 (52%), Positives = 62/93 (66%), Gaps = 1/93 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP VA+F VY D YKSGVY+H GD +G AV+V+GWG N PYW +ANSW
Sbjct: 242 RELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSW 301
Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
+ WG G ILRG NE +IE +GF EA+
Sbjct: 302 DTDWGMGGYLLILRGNNECNIEHLGFAGTPEAS 334
>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 79/130 (60%), Gaps = 12/130 (9%)
Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
FDA E WP+CP++ I DQS+CGSCWAV+ A+A+SDR C G +ISA +++C
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTL-GGVRDLRISAGDLMSCCD 59
Query: 251 NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCTLLGK 308
C +GCNGG+P++AW ++ +G+V+ E CQPY C HHV L C+ G+
Sbjct: 60 VCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS--GE 110
Query: 309 LKTPECKQNC 318
TP C C
Sbjct: 111 YDTPTCNSTC 120
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 33/61 (54%), Positives = 42/61 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++ +GP FSVYADF+ Y GVY+H G +G HAVR++GWG N PYW +ANSW
Sbjct: 146 RELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWGELNGEPYWKIANSW 205
Query: 138 N 138
N
Sbjct: 206 N 206
>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
Length = 356
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 84/166 (50%), Gaps = 13/166 (7%)
Query: 172 EDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIA 231
EDD + +P FDAR WP+C S++ + DQSNCGSCWA A ISDR+CI
Sbjct: 55 EDDSYVLRNQRILPSIPTTFDARTNWPKCNSIKMVRDQSNCGSCWAFGAAEVISDRICIH 114
Query: 232 SNGYFTGQISAQHIVACTPNCWGCNGGWPQL--AWRFWGHNGVVTGGDYNSQEGCQPYTL 289
SNG ISA+ I+ C G Q A +FW G VTGGDY +GC+PY+
Sbjct: 115 SNGKEQPVISAEDILTCCGKSCGNGCQGGQGLEAMKFWTTYGAVTGGDYKG-DGCKPYSF 173
Query: 290 APCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
APC + V+ TP C+ C + + Y+ D GK
Sbjct: 174 APCSNCVESK----------TTPSCQSKCQSTYTVTNYKGDKHYGK 209
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 43/81 (53%), Positives = 54/81 (66%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+IY++GP+ ++VY DF YKSGVY H G G HAV+++GWG E + YWLV NSW
Sbjct: 242 EIYQNGPVEVAYTVYDDFYHYKSGVYHHVTGKDTGGHAVKIIGWGTEKGVDYWLVTNSWG 301
Query: 139 DHWGDHGTFKILRGENEADIE 159
+GD G FKI RG NE IE
Sbjct: 302 TSFGDKGFFKIRRGTNECGIE 322
>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 271
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/86 (58%), Positives = 63/86 (73%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLV 133
+ M++I +GP+ F VY DFL YKSGVY+H G +G HA+R++GWG+ +N IPYWL
Sbjct: 174 SIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGIQQNHIPYWLC 233
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
ANSWN+ WGD G FKILRG NE IE
Sbjct: 234 ANSWNNQWGDQGYFKILRGTNECGIE 259
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 64/125 (51%), Gaps = 2/125 (1%)
Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTG 275
A ++SDR+CI S + ++SA ++++C C +GC GG P +AW +W + G+VTG
Sbjct: 45 AFGAVESMSDRICIHSKNKISVELSAINLLSCCTRCGFGCRGGIPGMAWDYWKYEGIVTG 104
Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
G + GCQPY C HH TPEC + C + Y Y+ D GK
Sbjct: 105 GSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQD-DYGKPYKKDKFYGK 163
Query: 336 KAHMV 340
++ V
Sbjct: 164 SSYNV 168
>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 80/134 (59%), Gaps = 11/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHCTV-GGVQQLRISAAHLM 148
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+C +C +GC+GG+P +W ++ +G+ + CQPY C HH +G C+
Sbjct: 149 SCCEDCGYGCDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201
Query: 305 LLGKLKTPECKQNC 318
TP+C C
Sbjct: 202 KY-HFHTPKCNTTC 214
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP V +F VY+DFL YK+GVY+H GD +G HAVR++GWG N PYW +ANSW
Sbjct: 240 RELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIANSW 299
Query: 138 NDHWGDHGTFKILRGENEADIE 159
+ WG +G LRG NE IE
Sbjct: 300 DTDWGMNGHLLFLRGNNECGIE 321
>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
Length = 216
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 61/82 (74%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGVE PYWL+ANSW
Sbjct: 125 KEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIANSW 184
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG +E IE
Sbjct: 185 NEDWGEKGLFRIVRGRDECSIE 206
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 68/119 (57%), Gaps = 3/119 (2%)
Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
++DR+CI S G + ++SA +++C +C GC GG+P AW +W G+VTGG +
Sbjct: 1 MTDRICIQSGGQQSAELSALDLISCCEDCGDGCQGGFPGQAWDYWVTQGIVTGGSKENHT 60
Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
GCQPY CEHH +G C KTP+CKQ C Y++ Y D G +++ V+
Sbjct: 61 GCQPYPFPKCEHHTKGKYPACGTK-IYKTPQCKQTC-QKGYKTPYEQDKHYGDESYNVI 117
>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
Length = 217
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 47/81 (58%), Positives = 65/81 (80%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+++++GP+ A F+VYAD L YKSGVY+H GD++G HA++++GWGVEN YWL+ANSWN
Sbjct: 124 ELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGVENGNKYWLIANSWN 183
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG++G FKILRGE+ IE
Sbjct: 184 TDWGNNGFFKILRGEDHCGIE 204
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 71/117 (60%), Gaps = 4/117 (3%)
Query: 225 SDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
+DR+C SNG SA+ +++C P C GCNGG P LAW +W H G+V+GG+YNS +G
Sbjct: 1 TDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCNGGMPTLAWEYWKHMGLVSGGNYNSSQG 60
Query: 284 CQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C PY + PCEHHV G C G KTP+C + C N Y Y+ D + GK + V
Sbjct: 61 CSPYVIPPCEHHVPGNRLPCN--GDTKTPKCSKTCEN-GYNVLYKKDKRYGKHVYAV 114
>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 79/134 (58%), Gaps = 11/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHCTV-GGVQQLRISAAHLL 148
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+C +C GC+GG+P AW ++ +G+ + CQPY C HH +G C+
Sbjct: 149 SCCKDCGDGCDGGYPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201
Query: 305 LLGKLKTPECKQNC 318
TP+C C
Sbjct: 202 KY-DFHTPKCNTTC 214
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 47/82 (57%), Positives = 59/82 (71%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP V F VY+DFL YK+GVY+H GD +G HAVR++GWG N PYW +ANSW
Sbjct: 241 RELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIANSW 300
Query: 138 NDHWGDHGTFKILRGENEADIE 159
+ WG +G F ILRG NE IE
Sbjct: 301 DTDWGMNGHFLILRGNNECGIE 322
>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
Length = 330
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 85/156 (54%), Gaps = 13/156 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R W EC S++ I +Q+ CGSCWA A ISDR CI + G IS ++
Sbjct: 86 IPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 145
Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+C +C GC GG+P A R+W GVVTGGDY+ GC+PY +APC +C
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA-GCKPYPIAPCTS------GSCP 198
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ KTP C +C Y + Y D G A+ V
Sbjct: 199 ---ESKTPACSLSC-QSGYTTAYAKDKHFGTSAYAV 230
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 67/99 (67%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+ V + + +I +GP+ A F+VY DF +YKSGVY+H G ++G HA++++
Sbjct: 222 HFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKII 281
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E+ PYWLVANSW WG+ G FKI RG+++ IE
Sbjct: 282 GWGTESGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIE 320
>gi|149436731|ref|XP_001513125.1| PREDICTED: cathepsin B-like [Ornithorhynchus anatinus]
Length = 211
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDAR++WP CP+++ I DQ +CGSCWA AISDR+C+ +NG + ++SA+ ++
Sbjct: 81 LPENFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRVCVHTNGQVSVEVSAEDLL 140
Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
C C GCNGG+P AW +W G+V+GG Y+S G
Sbjct: 141 TCCGLECGMGCNGGYPTGAWTYWTKKGLVSGGLYDSHVG 179
>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
Length = 194
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/123 (46%), Positives = 74/123 (60%), Gaps = 7/123 (5%)
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGH 269
CGSCWA AISDR CI +NG ++SA+ ++ C C GCNGG+P AW FW
Sbjct: 1 CGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTK 60
Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYEST 326
G+V+GG Y+S GC PYT+ PCEHHV G + G+ TP C ++C Y+PSY+
Sbjct: 61 KGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRP--PMHGEGDTPRCNKSCEAGYSPSYKED 118
Query: 327 YRF 329
F
Sbjct: 119 KHF 121
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 36/59 (61%), Positives = 48/59 (81%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
M +IY++GP+ F+V++DFL YKSGVY+H GD +G HA+R+LGWGVEN +PYWL AN
Sbjct: 136 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAAN 194
>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 325
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 51/135 (37%), Positives = 78/135 (57%), Gaps = 13/135 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP DAR++WP+C + + DQ+NCGSCWAVS A+ ++DR+CI S +S + +V
Sbjct: 84 LPFEMDARKRWPQCKYIGFVRDQANCGSCWAVSSASVMTDRICIESIAAKQPLLSEEELV 143
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C +GC+GG+P A+ +W G+ TGG Y S +GC+PY++
Sbjct: 144 SCCKICGYGCDGGYPDKAFIYWATRGIPTGGPYGSTKGCKPYSIGSNSED---------- 193
Query: 306 LGKLKTPECKQNCYN 320
+ +TP C + C N
Sbjct: 194 --EAETPLCTRQCIN 206
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 44/83 (53%), Positives = 64/83 (77%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M+++Y++GP+V F+VY DF+ Y GVY+H FG +G HAV+++GWG+EN YWL++NS
Sbjct: 233 MQELYKNGPVVVAFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGIENSKKYWLISNS 292
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WG++G FKI+RG+N IE
Sbjct: 293 WNTTWGENGFFKIIRGKNCCAIE 315
>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
Length = 334
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 55/135 (40%), Positives = 79/135 (58%), Gaps = 9/135 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDA WP+CP+++ IADQS+CGSCWAV+ A A+SDR C+ + G ISA ++
Sbjct: 91 LPESFDAATAWPDCPTIKRIADQSSCGSCWAVAAATAMSDRFCV-TGGVRDLGISAGDLL 149
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC+GG+P AW ++ +G+V+ DY CQPY PC+H
Sbjct: 150 SCCTSCGDGCDGGYPDEAWLYFTESGLVS--DY-----CQPYPFPPCKHSGGRSKNPSCH 202
Query: 306 LGKLKTPECKQNCYN 320
TP+C C +
Sbjct: 203 DMHFHTPKCNATCTD 217
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 47/103 (45%), Positives = 64/103 (62%), Gaps = 2/103 (1%)
Query: 59 IPLSHYF--KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHA 116
IP+ YF + + + R++Y GP F+VY DFL Y+SGVY+H G +G HA
Sbjct: 220 IPVVRYFASESYSLQGEEDYKRELYLRGPFEVAFTVYEDFLAYESGVYKHVSGGPVGGHA 279
Query: 117 VRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
VRV+GWG N +PYW +ANSWN WG++G RG++E IE
Sbjct: 280 VRVVGWGERNGVPYWKIANSWNTDWGENGYLYFYRGKDECGIE 322
>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 393
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 85/160 (53%), Gaps = 18/160 (11%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYF-TGQISAQH 244
LP FDARE + C + + H+ DQS CGSCWA + + A SDRLCI S+G F +SA H
Sbjct: 127 LPDRFDAREHFKNCATVIGHVRDQSTCGSCWAFATSEAFSDRLCIRSSGEFDLVPLSAGH 186
Query: 245 IVACTPNC-----WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
AC +GC+GG P AWR++ +GVV+ D GC PY C HHV+
Sbjct: 187 TAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVSELD----SGCWPYNFPECSHHVETK 242
Query: 300 -LQNCTLLGKLKTPECKQNCYN----PSYESTYRFDLKKG 334
++ C G +P C C N PS+ES F +G
Sbjct: 243 GMEPCK--GNSPSPVCSTTCRNHHFKPSFESDRHFTEDEG 280
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/83 (53%), Positives = 59/83 (71%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I ++GP+ A F+VY DFL YKSGVY+H G +G HAV+++GWG + + YWLV NSW
Sbjct: 291 KEIIDNGPVAAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGWGTDQNEQYWLVMNSW 350
Query: 138 NDHWGDHGTFKILRGENEADIEM 160
N +WGD G FKI GE D E+
Sbjct: 351 NVNWGDQGIFKIAIGECGIDSEV 373
>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
Length = 341
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 81/155 (52%), Gaps = 11/155 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR++WPEC SL+ I +Q CGSCWA+S A +DR CI S A ++
Sbjct: 89 LPERFDARDRWPECTSLKQIRNQGCCGSCWAISAAETFTDRWCIHSEDKDQFSFGAYDLL 148
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C +C GC GG AW+FW GV +GG YNS++GC PY + C +
Sbjct: 149 SCCHSCGDGCQGGNLGPAWQFWVQRGVSSGGPYNSRQGCHPYPVDVCHSADE-------- 200
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
TP+C + C + + D + G+ A+ V
Sbjct: 201 --DADTPKCTRKCQSMYNVTNVSDDRRFGRVAYSV 233
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 45/81 (55%), Positives = 58/81 (71%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I+ +GP+ A F VY DF YK+GVY+H FG G HAV+++GWGVEN YWL +NSW
Sbjct: 243 EIFRNGPVQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGWGVENGTKYWLCSNSWG 302
Query: 139 DHWGDHGTFKILRGENEADIE 159
+ WG+ G FKI+RGEN IE
Sbjct: 303 EDWGERGFFKIVRGENHCGIE 323
>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
Length = 358
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 88/162 (54%), Gaps = 9/162 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R+KWPEC + + DQS+CGS + SDR CI SNG F +SAQ +
Sbjct: 94 IPTYFDSRQKWPECTQIGAVRDQSDCGSAAHLVAVELASDRTCIFSNGTFNWPLSAQDPL 153
Query: 247 ACTP-------NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
+C + WGC+G WP+ ++W +G+ TGG+Y Q GC+PY++ PC+
Sbjct: 154 SCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYPNG 213
Query: 300 LQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
+ G TP C+++C N ++ Y+ D GK + V
Sbjct: 214 TTSVPCPG-YHTPTCEEHCTSNITWPIAYKQDKHFGKAHYNV 254
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 61/107 (57%), Gaps = 3/107 (2%)
Query: 56 PTSIPLSHYFKKAHMVP---RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI 112
P + +F KAH + +I +GP++A F +Y DF YKSG+Y H GD
Sbjct: 238 PIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFVIYDDFWDYKSGIYVHTAGDQE 297
Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G +++GWGV++ +PYWL + W +G++G + LRG NE +IE
Sbjct: 298 GGMDTKIIGWGVDSGVPYWLCVHQWGTDFGENGFVRFLRGVNEVNIE 344
>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
Length = 216
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 61/82 (74%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP+ A F VY DFL YKSG+Y+H G +G HA+R++GWGV+ PYWL+ANSW
Sbjct: 125 KEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVKKRTPYWLIANSW 184
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG+ G F+I+RG +E IE
Sbjct: 185 NEDWGEKGLFRIVRGRDECSIE 206
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 70/119 (58%), Gaps = 3/119 (2%)
Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
++DR+CI S G + ++SA +++C +C GC GG+P +AW +W G+VTGG +
Sbjct: 1 MTDRICIQSGGGQSAELSALDLISCCEDCGQGCQGGFPGVAWDYWVTQGIVTGGSKENHT 60
Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
GCQPY CEHH +G C KTP+CKQ C Y++ Y+ D G +++ V+
Sbjct: 61 GCQPYPFPKCEHHTKGKYPACGTK-IYKTPQCKQKC-QKGYKTPYKQDKHYGDESYNVI 117
>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 79/134 (58%), Gaps = 11/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHCTV-GGVQQLRISAAHLM 148
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+C +C GC+GG+P +W ++ +G+ + CQPY C HH +G C+
Sbjct: 149 SCCEDCGDGCDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201
Query: 305 LLGKLKTPECKQNC 318
TP+C C
Sbjct: 202 KY-HFHTPKCNTTC 214
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP V +F VY+DFL YK+GVY+H GD +G HAVR++GWG N PYW +ANSW
Sbjct: 240 RELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIANSW 299
Query: 138 NDHWGDHGTFKILRGENEADIE 159
+ WG +G LRG NE IE
Sbjct: 300 DTDWGMNGHLLFLRGNNECGIE 321
>gi|402583630|gb|EJW77574.1| hypothetical protein WUBG_11516 [Wuchereria bancrofti]
Length = 168
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 58/147 (39%), Positives = 78/147 (53%), Gaps = 7/147 (4%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
A LP FDAR KWP CPS+ ++ +Q CGSC+AV+VA SDR+CIA+NG +S+
Sbjct: 13 ASELPDEFDARRKWPLCPSIHNVPNQGGCGSCYAVAVAGVASDRICIATNGTVQVILSSD 72
Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
I++C +C C GG A +W + G+VTGG ++GCQPY P + P
Sbjct: 73 DIISCCISCGACTGGDSLKAMIYWVNEGIVTGG----RDGCQPY---PYDIKCGIPCPLL 125
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFD 330
K C C N Y + Y D
Sbjct: 126 EFAKNAKMQRCHHKCQNIYYRNDYFND 152
>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
Length = 325
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 78/139 (56%), Gaps = 12/139 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ E WP CP++ IADQS CGSCWAV+ A+A+SDR C G ISA ++
Sbjct: 72 LPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTM-GGVQDVHISAGDLL 130
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG--PLQNC 303
AC +C GCNGG P AW ++ G+V+ DY CQPY C HH + C
Sbjct: 131 ACCSDCGDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSKSKNGYPPC 183
Query: 304 TLLGKLKTPECKQNCYNPS 322
+ TP+C C +P+
Sbjct: 184 SQF-NFDTPKCDYTCDDPT 201
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 54/85 (63%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
MR+++ GP F VY DF+ Y SGVY H G +G HAVR++GWG N +PYW +ANS
Sbjct: 222 MRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANS 281
Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
WN WG G F I RG +E IE G
Sbjct: 282 WNTEWGMDGYFLIRRGSSECGIEDG 306
>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
Length = 220
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 48/98 (48%), Positives = 67/98 (68%), Gaps = 4/98 (4%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+V +F++Y D +YKSGVY+H G +G HA++++GWG +N IPYWL+ANSW
Sbjct: 127 EIMTNGPVVGVFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGTQNGIPYWLIANSWG 186
Query: 139 DHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDL 176
WG++G FKI RG NE IE N V A ++ D L
Sbjct: 187 TKWGENGFFKIRRGVNECGIE----NNVVAGKADVDTL 220
Score = 37.7 bits (86), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV-ACTPNC-WGCNG 257
N SCWA A ISDR+CIA+ G IS +V C C +GC+G
Sbjct: 63 NVKSCWAFGAAEVISDRICIATKGARQPIISPMDMVDCCGKYCGYGCDG 111
>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 365
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 46/80 (57%), Positives = 58/80 (72%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP A FSVY DFL YKSGVY+H G +G HAV ++GWG E + YWLV NSW
Sbjct: 275 KEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSW 334
Query: 138 NDHWGDHGTFKILRGENEAD 157
N+ WGDHGTFKI++G+ D
Sbjct: 335 NEEWGDHGTFKIVQGDCGID 354
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 83/174 (47%), Gaps = 12/174 (6%)
Query: 169 NSSEDDDLETMGCQNAKGLPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDR 227
N +E+ + + + +P +FDAR+ + EC + H+ DQS CGSCWA A + R
Sbjct: 82 NGTEELEEKVYPAEELVDIPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNAR 141
Query: 228 LCIASNGYFTGQISAQHIVACTPN-----CWGCNGGWPQLAWRFWGHNGVVTGGDY---- 278
+CI S G +SA ++AC +GC+GG P +W F NG+V+GG +
Sbjct: 142 VCIKSGGKLNQLLSAADMLACCNIGHFCLSFGCSGGNPITSWTFLHTNGIVSGGGFVPEK 201
Query: 279 --NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
+ +GC PY C HH + TP C +C N Y + + D
Sbjct: 202 NMKAADGCWPYNFPKCAHHQKESDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKD 255
>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
Length = 317
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 78/139 (56%), Gaps = 12/139 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ E WP CP++ IADQS CGSCWAV+ A+A+SDR C G ISA ++
Sbjct: 71 LPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTM-GGVQDVHISAGDLL 129
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG--PLQNC 303
AC +C GCNGG P AW ++ G+V+ DY CQPY C HH + C
Sbjct: 130 ACCSDCGDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSKSKNGYPPC 182
Query: 304 TLLGKLKTPECKQNCYNPS 322
+ TP+C C +P+
Sbjct: 183 SQF-NFDTPKCNYTCDDPT 200
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 54/85 (63%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
MR+++ GP F VY DF+ Y SGVY H G +G HAVR++GWG N +PYW +ANS
Sbjct: 221 MRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANS 280
Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
WN WG G F I RG +E IE G
Sbjct: 281 WNTEWGMDGYFLIRRGSSECGIEDG 305
>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 77/134 (57%), Gaps = 11/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRYCTV-GGVQQLRISAAHLM 148
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+C +C GC GG P AW ++ +G+ + CQPY C HH +G C+
Sbjct: 149 SCCEDCGDGCKGGAPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201
Query: 305 LLGKLKTPECKQNC 318
TP+C C
Sbjct: 202 KY-HFHTPKCNTTC 214
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 47/82 (57%), Positives = 59/82 (71%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP V F VY+DFL YK+GVY+H GD +G HAVR++GWG N PYW +ANSW
Sbjct: 241 RELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSW 300
Query: 138 NDHWGDHGTFKILRGENEADIE 159
+ WG +G F ILRG NE IE
Sbjct: 301 DTDWGMNGHFLILRGNNECGIE 322
>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
putative [Trypanosoma brucei gambiense DAL972]
Length = 340
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 78/139 (56%), Gaps = 12/139 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ E WP CP++ IADQS CGSCWAV+ A+A+SDR C G ISA ++
Sbjct: 94 LPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTM-GGVQDVHISAGDLL 152
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG--PLQNC 303
AC +C GCNGG P AW ++ G+V+ DY CQPY C HH + C
Sbjct: 153 ACCSDCGDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSKSKNGYPPC 205
Query: 304 TLLGKLKTPECKQNCYNPS 322
+ TP+C C +P+
Sbjct: 206 SQF-NFDTPKCNYTCDDPT 223
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 54/85 (63%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
MR+++ GP F VY DF+ Y SGVY H G +G HAVR++GWG N +PYW +ANS
Sbjct: 244 MRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANS 303
Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
WN WG G F I RG +E IE G
Sbjct: 304 WNTEWGMDGYFLIRRGSSECGIEDG 328
>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
Length = 247
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 55/148 (37%), Positives = 82/148 (55%), Gaps = 2/148 (1%)
Query: 194 REKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC- 252
R +WP+C ++ I DQ++CGSCWA + A+A+SDR+CI SNG +++A ++C C
Sbjct: 1 RSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTYCG 60
Query: 253 WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTP 312
GC GG+P AW +W G+VTGG + ++ GCQP+ C+H + TP
Sbjct: 61 QGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTP 120
Query: 313 ECKQNCYNPSYESTYRFDLKKGKKAHMV 340
C + C Y TY D G ++ V
Sbjct: 121 PCARAC-QTGYNKTYEQDKFYGNSSYNV 147
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 63/83 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M++I ++GP+ F+++ DF Y+SG+Y H G IG HAVR++GWGVEN + YWL+ANS
Sbjct: 155 MQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANS 214
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN+ WG++G F+++RG NE IE
Sbjct: 215 WNEEWGENGYFRMVRGRNECGIE 237
>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
Free-electron Laser Pulse Data By Serial Femtosecond
X-ray Crystallography
gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 340
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 78/139 (56%), Gaps = 12/139 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ E WP CP++ IADQS CGSCWAV+ A+A+SDR C G ISA ++
Sbjct: 94 LPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTM-GGVQDVHISAGDLL 152
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG--PLQNC 303
AC +C GCNGG P AW ++ G+V+ DY CQPY C HH + C
Sbjct: 153 ACCSDCGDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSKSKNGYPPC 205
Query: 304 TLLGKLKTPECKQNCYNPS 322
+ TP+C C +P+
Sbjct: 206 SQF-NFDTPKCNYTCDDPT 223
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 54/85 (63%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
MR+++ GP F VY DF+ Y SGVY H G +G HAVR++GWG N +PYW +ANS
Sbjct: 244 MRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANS 303
Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
WN WG G F I RG +E IE G
Sbjct: 304 WNTEWGMDGYFLIRRGSSECGIEDG 328
>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 244
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 46/80 (57%), Positives = 58/80 (72%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP A FSVY DFL YKSGVY+H G +G HAV ++GWG E + YWLV NSW
Sbjct: 154 KEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSW 213
Query: 138 NDHWGDHGTFKILRGENEAD 157
N+ WGDHGTFKI++G+ D
Sbjct: 214 NEEWGDHGTFKIVQGDCGID 233
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 64/134 (47%), Gaps = 11/134 (8%)
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN-----CWGCNGGWPQL 262
DQS CGSCWA A + R+CI S G +SA +++AC +GC+GG P
Sbjct: 1 DQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAANMLACCNIGHFCLSFGCSGGNPIT 60
Query: 263 AWRFWGHNGVVTGGDY------NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQ 316
+W F NG+V+GG + + +GC PY+ C HH G TP C
Sbjct: 61 SWTFLHTNGIVSGGGFVPEKNMKAADGCWPYSFPKCAHHQDGSDYKPCAKEIYDTPSCSS 120
Query: 317 NCYNPSYESTYRFD 330
+C N Y + + D
Sbjct: 121 SCPNAKYGTAFDKD 134
>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 68/102 (66%)
Query: 61 LSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
+ + ++KA N ++ ++GP+ F+VY+DF+ YKSGVYQH G G HAV ++
Sbjct: 173 IRYKYEKAETYTVQNIQEELMKNGPVYFRFTVYSDFMNYKSGVYQHKSGYQEGGHAVLLI 232
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
GWGVE+ +PYWL+ NSW WG+ G FKI+RG+NE E GF
Sbjct: 233 GWGVEDGVPYWLLQNSWGPAWGEKGHFKIIRGKNECGCEQGF 274
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 68/144 (47%), Gaps = 28/144 (19%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFDARE+WP + + DQ++CGSCWA + + AI +R I G G +S Q +V
Sbjct: 63 VPENFDAREQWPG--KIYPVRDQASCGSCWAHAASEAIGNRFSIKGCG--KGMLSVQDLV 118
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
+C GCNGG L+ ++ NGV T E C PY +
Sbjct: 119 SCDKGDSGCNGGSGPLSSKWLVSNGVTT-------EECLPY-----------------VS 154
Query: 307 GKLKTPECKQNCYNPSYESTYRFD 330
G + P C C N S Y+++
Sbjct: 155 GNGRVPACAAKCSNGSQIIRYKYE 178
>gi|412985820|emb|CCO17020.1| cathepsin B-like cysteine proteinase [Bathycoccus prasinos]
Length = 541
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/137 (43%), Positives = 75/137 (54%), Gaps = 11/137 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIA-DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP +FDAREKWPEC A DQ CGSCWA++ +SDRLCIAS G +++A I
Sbjct: 276 LPESFDAREKWPECSEFIGEAWDQGECGSCWAIAPTKVMSDRLCIASGGKVQERLAASEI 335
Query: 246 VACTP-----NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH--HVQG 298
++C + C GG P A+ F GV +GG Y ++GC Y PC H HVQ
Sbjct: 336 LSCGQLVSEFSFGSCEGGMPDDAYEFAKEFGVASGGKYGDEKGCAAYPFPPCHHPCHVQ- 394
Query: 299 PLQNCTLLGKLKTPECK 315
P C L K T +C+
Sbjct: 395 PTPACPL--KSDTAQCQ 409
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 8/83 (9%)
Query: 78 RQIYEHGPLVAIF-SVYADFLQYKSGVYQHNF-----GDSIGLHAVRVLGWGVENDIPY- 130
R+IY GP+ + ++Y +F YK G Y+ + G S G H + V+GW E+D Y
Sbjct: 440 REIYNSGPVSSYAGTIYDEFYAYKDGAYRTSADSETRGRSHGGHVIEVIGWHKESDGTYS 499
Query: 131 WLVANSWNDHWGDHGTFKILRGE 153
W + NSW + WG G +I GE
Sbjct: 500 WKIINSWLN-WGKKGHGRIAVGE 521
>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 340
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 84/166 (50%), Gaps = 22/166 (13%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDA EKWP C ++ I DQSNCGSCWA++ A+SDR C S G +IS +++
Sbjct: 98 LPESFDASEKWPMCVTIGEIRDQSNCGSCWAIAAVEAMSDRYCTMS-GIPDRRISTTNLL 156
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
+C C +GC GG P +AW +W GV T E CQPY PC HH P
Sbjct: 157 SCCFICGFGCYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCP 209
Query: 302 NCTLLGKLKTPECKQNCYNPS-----YESTYRFDLKKGKKAHMVLM 342
N TP+C C N Y+ + +K ++ + LM
Sbjct: 210 NTI----YNTPKCNTTCDNVEMELVKYKGVSSYSIKGERELMVELM 251
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 46/83 (55%), Positives = 59/83 (71%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M ++ +GPL VYADF+ YKSGVY+H GD +G HAV+++GWGV++ IPYW +ANS
Sbjct: 247 MVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIANS 306
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD G F I RG +E IE
Sbjct: 307 WNTDWGDKGYFLIQRGNDECGIE 329
>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 340
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/83 (54%), Positives = 58/83 (69%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M ++ +GP F VYADF+ YKSGVY H G+ +G HAV+++GWGV+N PYW +ANS
Sbjct: 247 MIELMTYGPFEVAFDVYADFVSYKSGVYSHTTGERLGGHAVKLVGWGVQNGTPYWKIANS 306
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G F I RG +E IE
Sbjct: 307 WNSDWGDNGYFLIRRGTDECGIE 329
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 53/136 (38%), Positives = 73/136 (53%), Gaps = 9/136 (6%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
A+ LP +FD+ +KWP+C ++ I DQSNCGSCWA++ A+SDR C + G ++S
Sbjct: 95 AQELPTSFDSSDKWPKCRTISEIRDQSNCGSCWAIAAVEAMSDRYCTVA-GITDLRVSTG 153
Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
H+++C C GC GG P +AW +W G+ + E CQPY PC HH G
Sbjct: 154 HLLSCCFVCGMGCQGGIPTMAWLWWVWVGL-------TSEVCQPYPFPPCGHHTDGGKYP 206
Query: 303 CTLLGKLKTPECKQNC 318
TP C C
Sbjct: 207 ACPSTIYDTPTCNSTC 222
>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/138 (43%), Positives = 79/138 (57%), Gaps = 11/138 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDA E WP CP++R IADQS C + WAV+ A+AISDR C G +ISA ++
Sbjct: 91 LPESFDAAEHWPHCPTIREIADQSACRASWAVATASAISDRYCTVGKGK-QLRISAADLM 149
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
AC +C GC GG+P AW ++ +G+ + SQ CQPY CEH QG C+
Sbjct: 150 ACCKDCGGGCEGGYPDAAWEYYVSHGITS-----SQ--CQPYPFPRCEHRGAQGKKPPCS 202
Query: 305 LLGKLKTPECKQNCYNPS 322
K TP+C C + S
Sbjct: 203 KY-KFVTPQCNATCTDKS 219
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/86 (56%), Positives = 63/86 (73%), Gaps = 1/86 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP V F V++DFL YKSGVYQH G+ +G AVR++GWG N PYW VANSW
Sbjct: 241 RELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKLNGTPYWKVANSW 300
Query: 138 NDHWGDHGTFKILRGENEADIE-MGF 162
+ WG +G F ILRG+NE +IE +GF
Sbjct: 301 DTDWGMNGYFLILRGDNECNIEHLGF 326
>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
gi|1586011|prf||2202319A cathepsin B-like Cys protease
Length = 340
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 75/143 (52%), Gaps = 17/143 (11%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDA EKWP C ++ I DQSNCGSCWA++ A+SDR C S G +IS +++
Sbjct: 98 LPESFDASEKWPMCVTIGEIRDQSNCGSCWAIAAVEAMSDRYCTMS-GIPDRRISTTNLL 156
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
+C C +GC GG P +AW +W GV T E CQPY PC HH P
Sbjct: 157 SCCFICGFGCYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCP 209
Query: 302 NCTLLGKLKTPECKQNCYNPSYE 324
N TP+C C N E
Sbjct: 210 NTI----YNTPKCNTTCDNVEME 228
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/81 (55%), Positives = 58/81 (71%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
++ +GPL VYADF+ YKSGVY+H GD +G HAV+++GWGV++ IPYW +ANSWN
Sbjct: 249 ELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIANSWN 308
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD G F I RG +E IE
Sbjct: 309 TDWGDKGYFLIQRGNDECGIE 329
>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 56/81 (69%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
++Y GP A FSVY DF YKSGVY H G +G HAV V+GWGVE+ PYWL+ NSW
Sbjct: 191 ELYSRGPFEAAFSVYEDFKSYKSGVYHHITGKMLGGHAVMVVGWGVEDGTPYWLIQNSWG 250
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG+ G FKILRG+NE IE
Sbjct: 251 TTWGEQGFFKILRGKNECGIE 271
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 64/136 (47%), Gaps = 28/136 (20%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDARE+WP + + +Q CGSCWA +VA +RL I G G +S Q +V
Sbjct: 63 LPDNFDAREQWPG--KILPVRNQEQCGSCWAFAVAETTGNRLNILGCG--RGDMSPQDLV 118
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
+C GCNGG P +W + H+G+ T E C PY +
Sbjct: 119 SCDKVDHGCNGGSPLFSWEWVKHSGITT-------EECIPY-----------------VS 154
Query: 307 GKLKTPECKQNCYNPS 322
G + P C + C N S
Sbjct: 155 GGGRVPSCPKKCTNGS 170
>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
Length = 260
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/104 (53%), Positives = 74/104 (71%), Gaps = 9/104 (8%)
Query: 63 HYFKKAHMVPRCNAMRQI----YEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--IGLHA 116
HY K+A+ + RQI ++GP+VA F+VYADF+ Y SGVY+ + G+S +G HA
Sbjct: 150 HYAKQAYRI-MSKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFD-GESKLLGGHA 207
Query: 117 VRVLGWGVENDI-PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
VR++GWG+EN PYWLV+NSWN+ WGD G FKI RG+NE IE
Sbjct: 208 VRIIGWGIENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIE 251
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 25/35 (71%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWA 217
+ K LP FDAR++W +C S++ I DQS CGSCW
Sbjct: 77 DGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWG 111
>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/138 (42%), Positives = 75/138 (54%), Gaps = 11/138 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP CP++R IADQS C + WAVS A+AISDR C G +ISA ++
Sbjct: 90 LPETFDAAEHWPHCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK-QLRISAADLL 148
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+C C GC GG+P AW ++ G+ + GCQPY CEH QG C+
Sbjct: 149 SCCKQCGDGCKGGFPGFAWLYYVEYGI-------ASSGCQPYPFPHCEHRGAQGNKTPCS 201
Query: 305 LLGKLKTPECKQNCYNPS 322
K TP+C C + S
Sbjct: 202 KY-KFDTPKCNATCTDKS 218
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 68/110 (61%), Gaps = 4/110 (3%)
Query: 58 SIPLSHYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
SIPL Y A + + R++Y +GP VA+F VY D YKSGVY++ GD +G
Sbjct: 218 SIPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG 277
Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE-MGFN 163
AVR++GWG N PYW VANSW+ WG +G ILRG NE +IE +GF
Sbjct: 278 QAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFT 327
>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 45/83 (54%), Positives = 59/83 (71%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
N ++I +GP A FSVY DF+ YKSGVY+H G +G+H+V ++GWG E + YWLV
Sbjct: 224 NIKKEIMTNGPTSATFSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGTEKGVDYWLVM 283
Query: 135 NSWNDHWGDHGTFKILRGENEAD 157
NSWN+ WGDHGTFKI +G+ D
Sbjct: 284 NSWNEGWGDHGTFKIAQGDCGID 306
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 72/143 (50%), Gaps = 6/143 (4%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FDAR+ + EC + H+ DQS C SCWA++ A + RLCI S G F +SA +
Sbjct: 59 IPNSFDARDAFKECKDVIGHVWDQSACASCWAIAPVEAFNARLCIKSGGKFNQLLSAGEM 118
Query: 246 VAC--TPNCW---GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
+AC + + W GC GG AW F +G+ T G ++ +GC PY C HH +
Sbjct: 119 IACCNSTHSWQPRGCKGGMILNAWSFLKTHGIATEGSMSAADGCWPYNFPKCAHHQKKSK 178
Query: 301 QNCTLLGKLKTPECKQNCYNPSY 323
TP C C N Y
Sbjct: 179 YEPCSKKLYDTPSCLDRCPNEKY 201
>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 77/134 (57%), Gaps = 11/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDA E WP CP++R IADQS C + WAV+ A+AISDR C G +ISA ++
Sbjct: 91 LPESFDAAEHWPHCPTIREIADQSACRASWAVATASAISDRYCTVGKGKQL-RISAADLM 149
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
AC +C GC GG+P AW ++ +G+ + SQ CQPY CEH QG C+
Sbjct: 150 ACCKDCGGGCEGGYPDAAWEYYVSHGIAS-----SQ--CQPYPFPRCEHRGAQGKKTPCS 202
Query: 305 LLGKLKTPECKQNC 318
K TP+C C
Sbjct: 203 KY-KFVTPQCNATC 215
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 48/86 (55%), Positives = 63/86 (73%), Gaps = 1/86 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP V F V++DFL YK+GVYQH G+ +G AVR++GWG N PYW VANSW
Sbjct: 241 RELYFNGPFVVRFQVHSDFLAYKNGVYQHVAGNFLGGKAVRIVGWGKLNGTPYWKVANSW 300
Query: 138 NDHWGDHGTFKILRGENEADIE-MGF 162
+ WG +G F ILRG+NE +IE +GF
Sbjct: 301 DTDWGMNGYFLILRGDNECNIEHLGF 326
>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/83 (57%), Positives = 58/83 (69%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M+ + E+GPL F VY+DF+ Y+SGVYQH G G HAV + GWGVEN +PYWLV NS
Sbjct: 189 MQALMEYGPLSCGFMVYSDFMNYRSGVYQHKSGYFEGGHAVLLCGWGVENGLPYWLVQNS 248
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W WG+ G FKILRG N +IE
Sbjct: 249 WGPAWGEKGFFKILRGSNHCEIE 271
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 57/112 (50%), Gaps = 6/112 (5%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P FDARE+WP + + DQ++CGSCWA SVA A+ D IA G G +S Q +V+
Sbjct: 64 PTEFDAREQWPG--KILPVRDQASCGSCWAHSVAEAMGDAQNIA--GCPRGAMSVQDLVS 119
Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGG--DYNSQEGCQPYTLAPCEHHVQ 297
C CNGG + A + G+ T Y S G P + C++ Q
Sbjct: 120 CDKTDSACNGGDMKKAQEYLVKTGITTEACVKYVSGSGRVPACPSKCDNGSQ 171
>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
Length = 134
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 45/76 (59%), Positives = 57/76 (75%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +GP A FSVY DFL YKSGVY+H G +G HAV ++GWG E + YWLV NSW
Sbjct: 44 KEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSW 103
Query: 138 NDHWGDHGTFKILRGE 153
N+ WGDHGTFKI++G+
Sbjct: 104 NEEWGDHGTFKIVQGD 119
>gi|294936554|ref|XP_002781799.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
gi|239892784|gb|EER13594.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
Length = 88
Score = 106 bits (264), Expect = 2e-20, Method: Composition-based stats.
Identities = 44/71 (61%), Positives = 54/71 (76%)
Query: 83 HGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWG 142
+GP A FSVY DFL YKSGVY+H G +G HAV ++GWG E + YWLV NSWN+ WG
Sbjct: 3 NGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSWNEEWG 62
Query: 143 DHGTFKILRGE 153
DHGTFKI++G+
Sbjct: 63 DHGTFKIVQGD 73
>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
Length = 350
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/85 (55%), Positives = 60/85 (70%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M ++Y +GP+ FSVY DF YKSGVY++ GD +G HAV+++GWG E+ YWLVA
Sbjct: 239 DIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGTEDGTDYWLVA 298
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSWN WG+ G FKI RG NE IE
Sbjct: 299 NSWNTAWGEDGYFKIARGSNECGIE 323
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/135 (40%), Positives = 70/135 (51%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDARE WP+C S++ I DQ +CGSCWA A+SDR CI T +S +V
Sbjct: 96 LPKQFDAREAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVT--LSENDLV 153
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AW+++ GVVT C PY A C+H PL
Sbjct: 154 ACCGFMCGDGCDGGYPISAWQYFISTGVVTA-------ECDPYFDDAGCQHPGCEPL--- 203
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 204 -----YPTPQCVKQC 213
>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
Length = 335
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 84/158 (53%), Gaps = 7/158 (4%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
Q L +FDARE+WPEC S+ I D S C + WA + A ++SDRLCI S G+ +S
Sbjct: 71 QANSDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRLCINSGGFKNTILS 130
Query: 242 AQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ 297
A+ +++C + GC GG P AW++ +G+ TGG Y SQ GC+PY++ PC V
Sbjct: 131 AEELLSCCTGMFSCGEGCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVG 190
Query: 298 GPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
TP C++ C + Y D+ K +
Sbjct: 191 NVTYPACTNTTSPTPSCEKKC---TSRIGYPIDIDKDR 225
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 46 KKKKKKRLYLPTSIPLS-HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSG 102
+KK R+ P I HY +P Q + +GP+ A F VY DFLQY +G
Sbjct: 208 EKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTG 267
Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+Y H G+ G +VR++GWGV +PYWL ANSW WG++GTF++LRG NE +E
Sbjct: 268 IYVHLTGNKQGHLSVRIIGWGVWQGVPYWLCANSWGRQWGENGTFRVLRGTNECGLE 324
>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/83 (56%), Positives = 61/83 (73%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I +GP+ + FSVY DF+ YKSGVY H G +G HA++++GWGVEN++ YWLVANS
Sbjct: 140 MNEIATNGPVQSGFSVYQDFMSYKSGVYTHQTGSFLGGHAIKIVGWGVENNVKYWLVANS 199
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W WG +G FKI RG+NE IE
Sbjct: 200 WGPDWGLNGLFKIKRGDNECGIE 222
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 58/107 (54%), Gaps = 14/107 (13%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWA-----VSVANAISDRLCIASNGYFTGQIS 241
LP +FD+REKWP C + I +Q CGSCWA + + +SDR CIAS G +S
Sbjct: 2 LPESFDSREKWPTC--IHPIRNQEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLS 59
Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
Q +V+C GC+GG AW + H G+VT + C PY+
Sbjct: 60 PQDLVSCNWYNAGCDGGILWAAWIYLKHTGIVT-------DQCLPYS 99
>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
Length = 350
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/85 (55%), Positives = 60/85 (70%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M ++Y +GP+ FSVY DF YKSGVY++ GD +G HAV+++GWG E+ YWLVA
Sbjct: 239 DIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGTEDGTDYWLVA 298
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSWN WG+ G FKI RG NE IE
Sbjct: 299 NSWNTAWGEDGYFKIARGSNECGIE 323
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/135 (40%), Positives = 70/135 (51%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDARE WP+C S++ I DQ +CGSCWA A+SDR CI T +S +V
Sbjct: 96 LPKQFDAREAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVT--LSENDLV 153
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AW+++ GVVT C PY A C+H PL
Sbjct: 154 ACCGFMCGDGCDGGYPISAWQYFISTGVVTA-------ECDPYFDDAGCQHPGCEPL--- 203
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 204 -----YPTPQCVKQC 213
>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 210
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 74/122 (60%), Gaps = 7/122 (5%)
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHN 270
CGSCWA S A+ SDRLCIA+ G +SA+ + C C GC+GG P+ AW F+ +
Sbjct: 1 CGSCWAASAASVFSDRLCIATGGAVARNLSAEQLNTCCYRCGNGCDGGSPEAAWYFFMRH 60
Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECK-QNCYNPSYESTYRF 329
G+VTGGDY S +GCQPY++ P +G +N + + TP+C + C N +Y YR
Sbjct: 61 GIVTGGDYESGDGCQPYSIYP-----RGKGRNTCIDDDIDTPDCSIRTCTNSNYTKGYRA 115
Query: 330 DL 331
DL
Sbjct: 116 DL 117
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 62/92 (67%), Gaps = 2/92 (2%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY + + R + M IY++GP+ A F VY DF+ YKSGVY + G G HA+++L
Sbjct: 118 HYVDTVYSLSRSEEDIMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKIL 177
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRG 152
GWGV+++ YWL ANSW+ WG++G F+ILRG
Sbjct: 178 GWGVDDNTKYWLCANSWSRSWGENGLFRILRG 209
>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 96
Score = 105 bits (263), Expect = 3e-20, Method: Composition-based stats.
Identities = 44/82 (53%), Positives = 61/82 (74%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I ++GP+ A F VY DFL YKSG+Y+H G HA+R++GWG EN+ PYWL+ NSW
Sbjct: 5 KEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLFSWHAIRIIGWGEENNTPYWLIPNSW 64
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+ILRG +E IE
Sbjct: 65 NEDWGENGNFRILRGRHECSIE 86
>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
Length = 340
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/81 (55%), Positives = 58/81 (71%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+IY++GP+VA F VY DF Y+ G+Y H +G G HAV+V+GWG EN YWL+ANSWN
Sbjct: 250 EIYKNGPVVAAFKVYQDFSYYRGGIYVHKWGGQTGAHAVKVVGWGRENGTDYWLIANSWN 309
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG++G F+I RG NE IE
Sbjct: 310 TDWGENGYFRIARGSNECGIE 330
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 80/141 (56%), Gaps = 6/141 (4%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P +FDAR WPEC S+ I DQS CGSCWAVS A A+SD++C+ SN IS I++
Sbjct: 88 PDSFDARAHWPECRSIGTIRDQSACGSCWAVSSAEAMSDQICVQSNRTTRVMISDTDILS 147
Query: 248 CT-PNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
C +C +GC P A+R+ + VVTGG Y ++ C+PY PC +H
Sbjct: 148 CCGISCGYGCE-VLPIEAYRWMQRSVVVTGGKYRQKDVCKPYAFYPCGNHTNERYYGPCP 206
Query: 306 LGKLKTPECKQNC---YNPSY 323
G TP+C++ C YN SY
Sbjct: 207 RGLWPTPKCRKACQRKYNKSY 227
>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/138 (44%), Positives = 76/138 (55%), Gaps = 11/138 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP CP++R IADQS C + WAVS A+AISDR C G +ISA ++
Sbjct: 90 LPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGGGK-QLRISAADLM 148
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
AC C GC GG+P AW ++ G+ + SQ CQPY CEH QG C+
Sbjct: 149 ACCKQCGDGCKGGFPGFAWLYYVEYGITS-----SQ--CQPYPFPHCEHRGAQGNKTPCS 201
Query: 305 LLGKLKTPECKQNCYNPS 322
K TP+C C + S
Sbjct: 202 KY-KFDTPKCNATCTDKS 218
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 68/110 (61%), Gaps = 4/110 (3%)
Query: 58 SIPLSHYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
SIPL Y A + + R++Y +GP VA+F VY D YKSGVY++ GD +G
Sbjct: 218 SIPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG 277
Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE-MGFN 163
AVR++GWG N PYW VANSW+ WG +G ILRG NE +IE +GF
Sbjct: 278 QAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFT 327
>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
Length = 557
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 70/198 (35%), Positives = 93/198 (46%), Gaps = 36/198 (18%)
Query: 168 ANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISD 226
A SS D+D+ P NFDARE +PEC S+ + DQS+CGSCWA + A +D
Sbjct: 272 AQSSSDEDI-----------PANFDAREAFPECASIIGRVRDQSDCGSCWAFASTEAFND 320
Query: 227 RLCIASNGYFTGQ-------------ISAQHIVACTPN-----CWGCNGGWPQLAWRFWG 268
R CIA G +SA+ AC GCNGG P AW+++
Sbjct: 321 RRCIAGIGKEDAAGAEGEATADQLLVLSAEDTTACCHGFHCGLSMGCNGGQPGSAWKWFT 380
Query: 269 HNGVVTGGDY---NSQEGCQPYTLAPCEHHVQGPLQNCTLL--GKLKTPECKQNCYNPSY 323
GVVTGGDY + C+PY PC HHV G+ TPEC C ++
Sbjct: 381 KTGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEYPTPECLSECSETNF 440
Query: 324 E-STYRFDLKKGKKAHMV 340
+Y D K ++A+ +
Sbjct: 441 SGGSYGEDKKMAREAYSL 458
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/87 (49%), Positives = 58/87 (66%), Gaps = 2/87 (2%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWL 132
N R + ++G + A FSV++DFL Y GVY H G +G HAV+++GWG + + YWL
Sbjct: 463 NIQRDMMKYGSVTAAFSVFSDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYWL 522
Query: 133 VANSWNDHWGDHGTFKILRGENEADIE 159
+ANSWN WG+ G F+ILRG NE IE
Sbjct: 523 IANSWNPSWGEGGLFRILRGVNECGIE 549
>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 78/138 (56%), Gaps = 11/138 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA EKWP CP++ I+DQS+CGSCWAV+ A +++DR C +G +ISA ++
Sbjct: 90 LPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAATSMTDRYCTI-HGVRGLRISAADLL 148
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL-QNCT 304
AC +C +GC GG P +AW ++ G+ +G CQPY C H+ C+
Sbjct: 149 ACCGDCGYGCLGGDPDMAWAYFSSEGIASG-------RCQPYPFPRCSHYTNSTTYPQCS 201
Query: 305 LLGKLKTPECKQNCYNPS 322
L L TP C C + +
Sbjct: 202 AL-HLWTPTCNPACTDST 218
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y GP A+F V++D YK GVY+H G IG HAVR++GWG ++ +PYW +ANSW
Sbjct: 240 RELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWGNQSGVPYWKIANSW 299
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WGD G F +LRG+NE IE
Sbjct: 300 NAEWGDRGYFFMLRGDNECGIE 321
>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 55/134 (41%), Positives = 76/134 (56%), Gaps = 11/134 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA EKWP CP++ I+DQS+CGSCWAV+ A +++DR C +G +ISA ++
Sbjct: 90 LPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAATSMTDRYCTI-HGVRGLRISAADLL 148
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL-QNCT 304
AC +C +GC GG P +AW ++ G+ +G CQPY C H+ C+
Sbjct: 149 ACCGDCGYGCLGGDPDMAWAYFSSEGIASG-------RCQPYPFPRCSHYTNSTTYPQCS 201
Query: 305 LLGKLKTPECKQNC 318
L L TP C C
Sbjct: 202 AL-HLWTPTCNPAC 214
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y GP A+F V++D YK GVY+H G IG HAVR++GWG ++ +PYW +ANSW
Sbjct: 240 RELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWGNQSGVPYWKIANSW 299
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WGD G F +LRG+NE IE
Sbjct: 300 NAEWGDRGYFFMLRGDNECGIE 321
>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
Length = 112
Score = 105 bits (262), Expect = 3e-20, Method: Composition-based stats.
Identities = 45/87 (51%), Positives = 62/87 (71%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +I ++GP+ IF ++ DFL YKSG+Y + G +G HA+RV+GWGVEN + YWL+ANS
Sbjct: 22 MMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIANS 81
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
WN+ WG+ G F++ RG NE IE N
Sbjct: 82 WNEGWGEKGYFRMRRGNNECGIEARIN 108
>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 49/86 (56%), Positives = 62/86 (72%), Gaps = 1/86 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP V F V++DFL YKSGVYQH G+ +G AVR++GWG N PYW VANSW
Sbjct: 241 RELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKMNGTPYWKVANSW 300
Query: 138 NDHWGDHGTFKILRGENEADIE-MGF 162
+ WG +G F ILRG NE +IE +GF
Sbjct: 301 DTDWGMNGYFLILRGNNECNIEHLGF 326
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 73/137 (53%), Gaps = 9/137 (6%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDA EKWP CP++R I DQS C + WAV+ A+AISDR C NG +A +
Sbjct: 91 LPESFDAAEKWPHCPTIREIPDQSACRASWAVATASAISDRYCTVGNGKQLRISAADLMA 150
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCTL 305
CT GC GG+P AW ++ NG+ + SQ CQPY CEH QG C+
Sbjct: 151 CCTGCGGGCEGGYPDAAWEYYVSNGITS-----SQ--CQPYPFPRCEHRGAQGKKPPCSK 203
Query: 306 LGKLKTPECKQNCYNPS 322
TP C C + S
Sbjct: 204 Y-NFDTPTCNATCTDKS 219
>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
Length = 350
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 47/85 (55%), Positives = 58/85 (68%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M ++Y GP+ F VY DF YKSGVY++ GD +G HAV+++GWG EN YWLVA
Sbjct: 239 DIMAEVYTKGPVEVDFLVYEDFAHYKSGVYKYITGDFLGGHAVKLIGWGTENGTDYWLVA 298
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSWN WG+ G FKI RG NE IE
Sbjct: 299 NSWNTAWGEDGYFKIARGSNECSIE 323
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/135 (40%), Positives = 71/135 (52%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR+ WP+C S+R I DQ +CGSCWA A+SDR CI T +S +V
Sbjct: 96 LPKQFDARKAWPQCTSVRTILDQGHCGSCWAFGAVEALSDRFCIHYKVNVT--LSENDLV 153
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC C GC+GG+P AW+++ GVVT C PY A C+H PL
Sbjct: 154 ACCGFRCGDGCDGGYPLSAWQYFISTGVVTA-------ECDPYFDEAGCQHPGCEPL--- 203
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 204 -----YPTPQCVKQC 213
>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 1308
Score = 105 bits (261), Expect = 4e-20, Method: Composition-based stats.
Identities = 55/136 (40%), Positives = 72/136 (52%), Gaps = 15/136 (11%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDA ++WP+CP++ I +Q+ CGSCWA +ISDR CI N + Q+S Q ++
Sbjct: 70 LPTNFDAAQQWPQCPTIGAIQNQAECGSCWAFGAIESISDRFCIHKNE--SVQLSFQDLI 127
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
C GC GG P A+++ NGVVT CQPYT+ C Q P N
Sbjct: 128 TCDNQDNGCEGGDPYTAYKYVQKNGVVT-------SNCQPYTIPTCP-PAQQPCMNF--- 176
Query: 307 GKLKTPECKQNCYNPS 322
+ TP C C N S
Sbjct: 177 --VNTPPCSAKCANSS 190
Score = 92.0 bits (227), Expect = 3e-16, Method: Composition-based stats.
Identities = 43/95 (45%), Positives = 60/95 (63%), Gaps = 2/95 (2%)
Query: 63 HYFKKAHMV-PRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ K + V P A++ +I +GP+ A F VY DFL YKSGVY H G +G H ++++
Sbjct: 198 HHLKTVYAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSGVYTHKSGKDLGGHCIKIV 257
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
G+GV N PYW+ NSW WG++G F I G+NE
Sbjct: 258 GFGVSNGTPYWICNNSWTTSWGNNGIFWIEAGKNE 292
>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
Length = 340
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 59/85 (69%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M ++ +GPL VY+DF+ YKSGVY+H GD +G HAV+++GWG ++ +PYW VANS
Sbjct: 247 MIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGDFLGGHAVKLVGWGTQDGVPYWKVANS 306
Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
WN WGD G F I RG NE IE G
Sbjct: 307 WNTDWGDKGYFLIQRGNNECKIESG 331
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 79/162 (48%), Gaps = 14/162 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP C ++ I DQSNCGSCWA++ AISDR C G ++S +++
Sbjct: 98 LPEFFDAAEHWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYC-TFGGVPDRRMSTSNLL 156
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG P +AW +W G+ T E CQPY PC HH
Sbjct: 157 SCCFICGLGCHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCP 209
Query: 306 LGKLKTPECKQNCYN-----PSYESTYRFDLKKGKKAHMVLM 342
TP+C C Y+ + + +K K+ + LM
Sbjct: 210 STIYDTPKCNTTCERNEMDLVKYKGSTSYSVKGEKELMIELM 251
>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
Length = 249
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 57/85 (67%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I GP+ A F VY DFL Y G+Y+H G G HAV+VLGWG++ +PYWL ANSW
Sbjct: 151 KEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVPYWLAANSW 210
Query: 138 NDHWGDHGTFKILRGENEADIEMGF 162
N WG+ G F+ILRG NE IE G
Sbjct: 211 NTDWGEDGYFRILRGVNECGIESGI 235
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 74/133 (55%), Gaps = 2/133 (1%)
Query: 209 QSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFW 267
+S+ GSCWAV+ A+SDR+CI S G +SA +++C C +GC GG P AW++W
Sbjct: 11 KSSSGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLLSCCKTCGFGCFGGEPMAAWKYW 70
Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
G+VTG +Y + GC+PY PCEHH TP+C + C + +Y +Y
Sbjct: 71 VLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKC-DKNYGKSY 129
Query: 328 RFDLKKGKKAHMV 340
+ D G+ + V
Sbjct: 130 KADKYYGQSVYNV 142
>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 105
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 46/95 (48%), Positives = 66/95 (69%), Gaps = 4/95 (4%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+ I ++GP+VA ++VY DF Y+SG+Y+H G GLHAV+V+GWG E PYW+VANSW
Sbjct: 15 KDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEEKGTPYWIVANSW 74
Query: 138 NDHWGDHGTFKILRGENEADIEMGFNNRVEANSSE 172
+D WG++G F++ RG N+ GF R+ A S +
Sbjct: 75 HDDWGENGFFRMHRGSNDC----GFEERMAAGSVQ 105
>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 232
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 65/101 (64%), Gaps = 2/101 (1%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY + V + + ++I +GP+ F VY DF Y SG+Y+H GD +G HAV++L
Sbjct: 125 HYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKML 184
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMG 161
GWG EN YW+ ANSWN WG++G F+ILRG +E +IE G
Sbjct: 185 GWGTENGTDYWICANSWNSDWGENGFFRILRGVDECEIESG 225
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 60/89 (67%), Gaps = 1/89 (1%)
Query: 209 QSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFW 267
QS+CGSCWAV A++DR+CIAS G ISA +++C C +GC+G P AW +W
Sbjct: 2 QSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCDECGFGCDGRDPYAAWSYW 61
Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
NG+VTG +Y S+ GC+PY PCEHH+
Sbjct: 62 VSNGIVTGSNYTSKSGCKPYPYPPCEHHI 90
>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 174
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 53/120 (44%), Positives = 73/120 (60%), Gaps = 2/120 (1%)
Query: 42 KKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQY 99
K K +K +R YL H+ K A+ +P R I ++GP+VA F VY DF Y
Sbjct: 47 KTPKCQKTCQRGYLKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHY 106
Query: 100 KSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
KSG+Y+H G G HAV+++GWG E PYWL+ANSW+D WG+ G ++++RG N IE
Sbjct: 107 KSGIYKHTAGRMTGGHAVKIIGWGKEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIE 166
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 2/78 (2%)
Query: 263 AWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPS 322
AW+++ GVVTGG+Y Q C+PY PC H + P KTP+C++ C
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYG-ECYDTAKTPKCQKTC-QRG 58
Query: 323 YESTYRFDLKKGKKAHMV 340
Y Y+ D GK A+ +
Sbjct: 59 YLKAYKEDKHFGKSAYRL 76
>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
Length = 312
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 79/153 (51%), Gaps = 20/153 (13%)
Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
D+ T+ N LP FD+R WP C + I DQ +CGSCWA+S + DR CI S G
Sbjct: 67 DVSTVPVAN---LPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEG 123
Query: 235 YFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
T ++S QH+ +CTP C GCNGGW A+ F NG++ E C PY + C+H
Sbjct: 124 KQTPELSPQHLTSCTPGCSGCNGGWMSTAFGFMQSNGILG-------EDCIPYQMGKCKH 176
Query: 295 HVQGPLQNCTLLGKLKTPEC-KQNCYNPSYEST 326
C+ TP+C K CY +ST
Sbjct: 177 ------PGCS---TWPTPKCNKTKCYPNDTKST 200
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 56/85 (65%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ ++IYE+GP+ A F+VY D Y+SGVYQH G GLHA++V+GWG+ + + YW +
Sbjct: 217 DIQKEIYENGPVTASFAVYEDLSVYQSGVYQHVTGGFEGLHAIKVVGWGILDGVKYWTIV 276
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSW + WG G I RG +E IE
Sbjct: 277 NSWAEDWGFDGLLLIRRGVDECGIE 301
>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 46/83 (55%), Positives = 56/83 (67%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M ++YE+GP+ F+VY DF+ YKSGVY H G G HAV +GWGVE++ PYWL NS
Sbjct: 189 MEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGVEDNTPYWLCQNS 248
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W WG+ G FKILRG N IE
Sbjct: 249 WGPAWGEKGHFKILRGSNHCGIE 271
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 64/137 (46%), Gaps = 28/137 (20%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP NFD+RE+WP + + DQ++CGSCWA SVA + DRL I + G +S Q +
Sbjct: 62 ALPENFDSREQWPG--KILPVRDQASCGSCWAFSVAETMGDRLSIKGCDF--GDMSPQDL 117
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
V+C GCNGG+ AW + +G+ T E C PY
Sbjct: 118 VSCDTTDMGCNGGYMDHAWAWTKSHGITT-------EKCMPYQ----------------- 153
Query: 306 LGKLKTPECKQNCYNPS 322
G + P C C N S
Sbjct: 154 SGSGRVPACPAKCVNGS 170
>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 46/83 (55%), Positives = 56/83 (67%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M ++YE+GP+ F+VY DF+ YKSGVY H G G HAV +GWGVE++ PYWL NS
Sbjct: 189 MEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGVEDNTPYWLCQNS 248
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W WG+ G FKILRG N IE
Sbjct: 249 WGPAWGEKGHFKILRGSNHCGIE 271
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 49/137 (35%), Positives = 64/137 (46%), Gaps = 28/137 (20%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP NFD+RE+WP + + DQ++CGSCWA SVA + DRL I Y G ++ Q +
Sbjct: 62 ALPENFDSREQWPG--KILPVRDQASCGSCWAFSVAETMGDRLSIKGCDY--GDMAPQDL 117
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
V+C GCNGG+ AW + +GV T E C PY
Sbjct: 118 VSCDTTDMGCNGGYMDHAWAWTKSHGVTT-------EKCMPYQ----------------- 153
Query: 306 LGKLKTPECKQNCYNPS 322
G + P C C N S
Sbjct: 154 SGSGRVPACPAKCVNGS 170
>gi|256052325|ref|XP_002569723.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228438|emb|CCD74609.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 198
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 59/151 (39%), Positives = 79/151 (52%), Gaps = 9/151 (5%)
Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
S DD MG A+ + ++KWP C S+ I DQS CGS WA A+SDR CI
Sbjct: 50 SLDDARIQMG---ARREESDLRRKKKWPGCKSIATIRDQSRCGSSWAFGAVEAMSDRSCI 106
Query: 231 ASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289
S G ++SA +++C +C G GG+P LAW +W G+VTG + CQPY
Sbjct: 107 QSGGKQNVELSAVDLLSCCEHCGDGFEGGFPALAWDYWVKEGIVTGSSKENHTVCQPYPF 166
Query: 290 APCEHHVQGPLQNCTLLGK--LKTPECKQNC 318
CEHH +G C G+ +TP C+ C
Sbjct: 167 PKCEHHTKGKYPAC---GEEIYRTPNCENTC 194
>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
Length = 272
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/81 (58%), Positives = 58/81 (71%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP+ A F+VY+D + YKSGVY H G +G HAV+VLGWGVE++ YWLVANSW
Sbjct: 179 EIMTNGPVEAAFTVYSDIVHYKSGVYHHTSGGKLGGHAVKVLGWGVEDEEEYWLVANSWG 238
Query: 139 DHWGDHGTFKILRGENEADIE 159
WGD G FKI RG +E IE
Sbjct: 239 PDWGDQGFFKIKRGSDECGIE 259
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 56/103 (54%), Gaps = 8/103 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P++FDAR +W C I DQ +CGSCWA + +SDRLCI + G +S++ ++
Sbjct: 43 IPKSFDARMEWSTCVRSHKIHDQGHCGSCWAFASTEVLSDRLCIQTRGSTNIILSSEDLL 102
Query: 247 ACTPNCWGC-NGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+C GC +GG AWR+ GVV C+PYT
Sbjct: 103 SCDKAGRGCSDGGRLSEAWRYMQKKGVVA-------NRCKPYT 138
>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
Length = 345
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 58/85 (68%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M ++ +GPL VY+DF+ YKSGVY+H GD +G HAV+++GWG + +PYW +ANS
Sbjct: 252 MIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIANS 311
Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
WN WGD G F I RG NE IE G
Sbjct: 312 WNTDWGDKGYFLIQRGSNECGIESG 336
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 70/137 (51%), Gaps = 17/137 (12%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP C ++ I DQSNCGSCWA++ AISDR C G +IS +++
Sbjct: 103 LPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTL-GGVPDRRISTSNLL 161
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
+C C +GC GG P +AW +W G+ T E CQPY PC HH P
Sbjct: 162 SCCFICGFGCYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCP 214
Query: 302 NCTLLGKLKTPECKQNC 318
N TP+C C
Sbjct: 215 NTI----YDTPKCNTTC 227
>gi|300121514|emb|CBK22033.2| unnamed protein product [Blastocystis hominis]
Length = 476
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/208 (32%), Positives = 95/208 (45%), Gaps = 37/208 (17%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
N M++IY HGP+ V D L+YK G+Y+ G + H + V+GWG EN IPYW+V
Sbjct: 95 NIMKEIYAHGPVTCSIDVPDDLLEYKGGIYEDKTGIAGDGHDISVVGWGEENGIPYWIVR 154
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMG------------FNNRVEANSSEDDDLETMGCQ 182
NSW +WG+ G F+I+RG+N IE G N V + GC
Sbjct: 155 NSWGTYWGEEGFFRIVRGKNNLGIEEGCTYGIPRIPEEKITNPVSLGVKHRINYFPQGCV 214
Query: 183 NA----------KGLPRNFDAREKWPECPSLRHIADQSN-------------CGSCWAVS 219
LP + E P +R+I D N CGSCWA +
Sbjct: 215 LESRKEMEEVIKSPLPHTYIKTEDLPTSYDIRNI-DGYNYATWDKNQHIPHYCGSCWAQA 273
Query: 220 VANAISDRLCIASNG-YFTGQISAQHIV 246
+A+SDR+ + G + T +S Q ++
Sbjct: 274 PTSALSDRINLMRKGKWPTINLSEQEVI 301
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 37/83 (44%), Positives = 49/83 (59%), Gaps = 2/83 (2%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVANS 136
+IY GP+ + V FL Y GV+ G +G HAV V GWGV E PYW+V NS
Sbjct: 382 EIYARGPISCVMDVTQTFLDYTGGVFTSREGKWLGKHAVEVTGWGVDEETRTPYWIVRNS 441
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W +WG++G F+I G+N +IE
Sbjct: 442 WGTYWGENGWFRIAMGQNLLNIE 464
>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 382
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 88/183 (48%), Gaps = 30/183 (16%)
Query: 149 ILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECP-SLRHIA 207
++RG N+ ++ G+ + + LP +FDAR +P C + HI
Sbjct: 121 LMRGSNDKAVKKGY-----------------AIEELQDLPTDFDARTAFPNCSKVIGHIR 163
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFW 267
DQS CGSCWA V A +DRLCI SNG FT +SA + ACT +GC GG P AW +
Sbjct: 164 DQSACGSCWAFGVTEAFNDRLCIKSNGAFTELLSAGEMNACTL-FFGCGGGDPYSAWSWV 222
Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
G+ TG EG +P ++ E Q+ TP C + C NP Y +T
Sbjct: 223 HDKGIATG------EGSRPKRVSESEAIPVIAYQDI-----YPTPNCVEQCRNPKYTTTL 271
Query: 328 RFD 330
R D
Sbjct: 272 RDD 274
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 55/86 (63%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+A I GP+ A F+VY DFL YKSGVY+H G +G HAV+++GWG ++ YWL
Sbjct: 290 DAKNAIRTDGPVSASFTVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAV 349
Query: 135 NSWNDHWGDHGTFKILRGENEADIEM 160
NSWN+ WGD G FKI G D ++
Sbjct: 350 NSWNEDWGDKGLFKIALGNCGIDDDL 375
>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
Length = 340
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 58/85 (68%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M ++ +GPL VY+DF+ YKSGVY+H GD +G HAV+++GWG + +PYW +ANS
Sbjct: 247 MIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIANS 306
Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
WN WGD G F I RG NE IE G
Sbjct: 307 WNTDWGDKGYFLIQRGSNECGIESG 331
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 70/137 (51%), Gaps = 17/137 (12%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP C ++ I DQSNCGSCWA++ AISDR C G +IS +++
Sbjct: 98 LPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTL-GGVPDRRISTSNLL 156
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
+C C +GC GG P +AW +W G+ T E CQPY PC HH P
Sbjct: 157 SCCFICGFGCYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCP 209
Query: 302 NCTLLGKLKTPECKQNC 318
N TP+C C
Sbjct: 210 NTI----YDTPKCNTTC 222
>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 282
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/82 (56%), Positives = 56/82 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+++YE+GPL F+VY DF+ YKSGVY H G G HAV +GWGVE++ PYWL NSW
Sbjct: 190 QELYENGPLSVAFTVYYDFMNYKSGVYVHKTGGVAGGHAVLCIGWGVEDNTPYWLCQNSW 249
Query: 138 NDHWGDHGTFKILRGENEADIE 159
WG+ G FKILRG N IE
Sbjct: 250 GPAWGEKGHFKILRGSNHCGIE 271
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/106 (42%), Positives = 60/106 (56%), Gaps = 11/106 (10%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
++ LP NFDARE+WPE + + DQ++CGSCWA SVA + DRL I G G +S
Sbjct: 58 ESDNALPENFDAREQWPE--QILPVRDQASCGSCWAFSVAETMGDRLSIIGCG--RGHMS 113
Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
Q +V+C GCNGG+ AW + +GV + E C PY
Sbjct: 114 PQDLVSCDTTDMGCNGGYMDKAWAWTKSHGV-------TNEECMPY 152
>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
Length = 340
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 58/85 (68%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M ++ +GPL VY+DF+ YKSGVY+H GD +G HAV+++GWG + +PYW +ANS
Sbjct: 247 MIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIANS 306
Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
WN WGD G F I RG NE IE G
Sbjct: 307 WNTDWGDKGYFLIQRGSNECGIESG 331
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 70/137 (51%), Gaps = 17/137 (12%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP C ++ I DQSNCGSCWA++ AISDR C G +IS +++
Sbjct: 98 LPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTL-GGVPDRRISTSNLL 156
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
+C C +GC GG P +AW +W G+ T E CQPY PC HH P
Sbjct: 157 SCCFICGFGCYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCP 209
Query: 302 NCTLLGKLKTPECKQNC 318
N TP+C C
Sbjct: 210 NTI----YDTPKCNTTC 222
>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
Length = 343
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/131 (44%), Positives = 75/131 (57%), Gaps = 11/131 (8%)
Query: 45 KKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY 104
K+K K R Y S+Y + P R+I +HGP+VA ++ FL YKSGVY
Sbjct: 221 KRKLDKDRYYGE-----SYYLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLYYKSGVY 275
Query: 105 ---QHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMG 161
+ N S+GLHAV+++GWG + IPYWLV NSWN +G+ G FKI RG NE IE
Sbjct: 276 SANKRNDDPSLGLHAVKLIGWGEQKRIPYWLVVNSWNTTFGEQGLFKIRRGTNECGIE-- 333
Query: 162 FNNRVEANSSE 172
N V A +E
Sbjct: 334 -NLHVTAGLAE 343
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 81/167 (48%), Gaps = 45/167 (26%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCW------------------------------ 216
L +FDAREKWPEC + I DQS C CW
Sbjct: 60 LEEHFDAREKWPECKYIGFIKDQSTCSCCWVSGDFLYHYDQWKIILLFDFSSSSSHWLFI 119
Query: 217 ----AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNG 271
A+S A+ ++DR CIA G +S + + +C +C +GCNGG+P LA+++W G
Sbjct: 120 STFKAMSSASVMTDRTCIAYKGEQQPFLSDEELTSCCTSCGYGCNGGFPLLAFKYWNEIG 179
Query: 272 VVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
V TGG Y S+ GC+P+++AP P + T +TP C+ C
Sbjct: 180 VPTGGPYGSKSGCKPFSIAP-------PTSSST---AAQTPLCQLKC 216
>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
Length = 340
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 59/85 (69%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M ++ +GPL VY+DF+ YKSGVY+H G+ +G HAV+++GWG ++ +PYW VANS
Sbjct: 247 MIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGGHAVKLVGWGTQDGVPYWKVANS 306
Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
WN WGD G F I RG NE IE G
Sbjct: 307 WNTDWGDKGYFLIQRGNNECKIESG 331
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 79/162 (48%), Gaps = 14/162 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP C ++ I DQSNCGSCWA++ AISDR C G ++S +++
Sbjct: 98 LPEFFDAAEHWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYC-TFGGVPDRRMSTSNLL 156
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C C GC+GG P +AW +W G+ T E CQPY PC HH
Sbjct: 157 SCCFICGLGCHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCP 209
Query: 306 LGKLKTPECKQNCYNPS-----YESTYRFDLKKGKKAHMVLM 342
TP+C C Y+ + + +K K+ + LM
Sbjct: 210 STIYDTPKCNTTCERSEMDLVKYKGSTSYSVKGEKELMIELM 251
>gi|268566081|ref|XP_002647468.1| Hypothetical protein CBG06540 [Caenorhabditis briggsae]
Length = 188
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/106 (49%), Positives = 68/106 (64%), Gaps = 3/106 (2%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
AK +P FDAR+KW C S++ I +Q+NCGSCWA A ISDR+CI + G IS
Sbjct: 73 AKKIPDTFDARQKWKNCTSIKMIRNQANCGSCWAFGAAEVISDRICIVTKGARQPIISPT 132
Query: 244 HIVACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
++ C C +GC+GG+ A R+W NGVVTGGDY +GC+PY
Sbjct: 133 DMLDCCGEYCGYGCDGGYSIQALRWWVSNGVVTGGDYQG-DGCKPY 177
>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
Length = 293
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 69/110 (62%), Gaps = 8/110 (7%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +IG HAV+++GWG +D YWL+
Sbjct: 180 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLL 239
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGF-------NNRVEANSSEDDDL 176
AN WN WGD G FKI RG NE IE G N V+ ++ DD L
Sbjct: 240 ANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVKGITTSDDLL 289
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 67/135 (49%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W +C S+ I DQ +CGSCWA ++SDR CI N +S ++
Sbjct: 37 LPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYN--MNVSLSVNDLL 94
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR++ H+GVVT E C PY C H P
Sbjct: 95 ACCGFLCGQGCNGGYPIAAWRYFKHHGVVT-------EECDPYFDNTGCSHPGCEP---- 143
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 144 ----AYPTPKCARKC 154
>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
Length = 369
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/156 (36%), Positives = 77/156 (49%), Gaps = 17/156 (10%)
Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P FDARE WPEC + +I +Q C S WA + A +SDRLCIA+NG Q+S + +
Sbjct: 72 IPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDL 131
Query: 246 VACTPNCWG-CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+ C C C GG+ AW ++ G+V+GGDYN+ GCQPY+
Sbjct: 132 IDCCHYCGNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS---------------E 176
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
L TP C C N Y Y D G + +
Sbjct: 177 LNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYI 212
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 54/110 (49%), Gaps = 13/110 (11%)
Query: 63 HYFKKAHMVPRCNAMRQ---IYEHGPLVAIFSVYADFLQYKSG---------VYQHNFGD 110
H+ + +P+ Q + GP+VA F VY DF Y+ G VY + G
Sbjct: 204 HFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGEQHDTILEGVYIYTSGA 263
Query: 111 SIGLHAVRVLGWGVENDIPYWLVANSWNDHWGD-HGTFKILRGENEADIE 159
G AV+++GWG EN YWL ANSW WG G FKI RG NE E
Sbjct: 264 LFGRTAVKIIGWGTENGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFE 313
>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 362
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 69/110 (62%), Gaps = 8/110 (7%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +IG HAV+++GWG +D YWL+
Sbjct: 249 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLL 308
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGF-------NNRVEANSSEDDDL 176
AN WN WGD G FKI RG NE IE G N V+ ++ DD L
Sbjct: 309 ANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVKGITTSDDLL 358
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 67/135 (49%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W +C S+ I DQ +CGSCWA ++SDR CI N +S ++
Sbjct: 106 LPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYN--MNVSLSVNDLL 163
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR++ H+GVVT E C PY C H P
Sbjct: 164 ACCGFLCGQGCNGGYPIAAWRYFKHHGVVT-------EECDPYFDNTGCSHPGCEP---- 212
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 213 ----AYPTPKCARKC 223
>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
Length = 342
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 42/82 (51%), Positives = 58/82 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R I +GP+ A F VY DFL KSG+ +H G +G H +R++GWGVE PYWL+ANSW
Sbjct: 251 RDIMMYGPVEAAFDVYEDFLNSKSGISRHVTGSIVGGHPIRIIGWGVEKGNPYWLIANSW 310
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WG++G F+++RG +E IE
Sbjct: 311 NEDWGENGLFRMVRGRDECSIE 332
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 66/144 (45%), Gaps = 24/144 (16%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R+KWP C S+ I DQS CGSCWA A++DR+CI S G + ++SA ++
Sbjct: 90 IPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLI 149
Query: 247 A------------CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
+ W G WRF N GCQPY CEH
Sbjct: 150 SCCEDCGGGCKGGFPGQAWDM-GKTRDSHWRFRKKN----------HTGCQPYPFPKCEH 198
Query: 295 HVQGPLQNCTLLGKLKTPECKQNC 318
+G C KTP+CKQ C
Sbjct: 199 LTKGKYPACG-TKIYKTPQCKQTC 221
>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
Length = 360
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/156 (36%), Positives = 77/156 (49%), Gaps = 17/156 (10%)
Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P FDARE WPEC + +I +Q C S WA + A +SDRLCIA+NG Q+S + +
Sbjct: 72 IPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDL 131
Query: 246 VACTPNCWG-CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+ C C C GG+ AW ++ G+V+GGDYN+ GCQPY+
Sbjct: 132 IDCCHYCGNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS---------------E 176
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
L TP C C N Y Y D G + +
Sbjct: 177 LNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYI 212
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 54/101 (53%), Gaps = 4/101 (3%)
Query: 63 HYFKKAHMVPRCNAMRQ---IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
H+ + +P+ Q + GP+VA F VY DF Y+ GVY + G G AV++
Sbjct: 204 HFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGVYIYTSGALFGRTAVKI 263
Query: 120 LGWGVENDIPYWLVANSWNDHWGD-HGTFKILRGENEADIE 159
+GWG EN YWL ANSW WG G FKI RG NE E
Sbjct: 264 IGWGTENGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFE 304
>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
Length = 283
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/81 (55%), Positives = 59/81 (72%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+IYE+GP+ F VY+DF+ YKSGVY H G G HAV ++GWGVE+++PYWLV NSW
Sbjct: 191 EIYEYGPVSMGFIVYSDFMSYKSGVYVHQAGYIEGGHAVLIVGWGVEDEVPYWLVQNSWG 250
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG++G FKILRG + + E
Sbjct: 251 TDWGENGFFKILRGSDHCECE 271
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 60/106 (56%), Gaps = 11/106 (10%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
+++ +P FDAREKWP+ ++ + DQ CGSCWA S+A I DRL + G G I+
Sbjct: 58 RDSNKVPDTFDAREKWPD--AILPVRDQGECGSCWAFSIAETIGDRLGVL--GCSRGDIA 113
Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+ +V+C GC+GG+ +AW + NG+ T E C PY
Sbjct: 114 PEDLVSCDIFDDGCDGGFIDMAWDWCQENGLTT-------EECIPY 152
>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
Length = 198
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/127 (44%), Positives = 77/127 (60%), Gaps = 5/127 (3%)
Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGV 272
SCWAVS A A+SDR+CIAS G ISAQ IV+C C GC GGWP AW++ GV
Sbjct: 1 SCWAVSTAAAMSDRICIASKGATQVLISAQDIVSCCTWCGAGCEGGWPIEAWKYGVTEGV 60
Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQ-NCTLLGKLKTPECKQNCYNPSYESTYRFDL 331
VTGG++ +E C+ Y + PC +H P +C + +TP CK+ C P Y+++Y D
Sbjct: 61 VTGGNFGRKECCRSYEIHPCGYHGNEPFYGHCHSMA--RTPPCKKRC-RPGYKNSYMMDK 117
Query: 332 KKGKKAH 338
+ G A+
Sbjct: 118 RYGTSAY 124
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 36/64 (56%), Positives = 44/64 (68%), Gaps = 4/64 (6%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE----NDIPYWLV 133
R I E+GP+VA F VY DF YKSG+Y+H G G HAV+V+GWG E IPYW++
Sbjct: 135 RDIMENGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGGHAVKVIGWGEEXTENGTIPYWII 194
Query: 134 ANSW 137
ANSW
Sbjct: 195 ANSW 198
>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 61/93 (65%), Gaps = 1/93 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP VA+F VY D YKSGVY+H GD +G AV+V+GWG N PYW VAN+W
Sbjct: 241 RELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKVANTW 300
Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
+ WG G ILRG NE +IE +GF E +
Sbjct: 301 DTDWGMDGYLLILRGNNECNIEHLGFAGTPETS 333
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 57/138 (41%), Positives = 75/138 (54%), Gaps = 11/138 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS C + WAVS A+ ISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACRASWAVSTASVISDRYCTV-GGVQQLRISAAHLL 148
Query: 247 A-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+ C GC GG+P AWR++ G+ + CQPY CEH QG C+
Sbjct: 149 SCCKQCGGGCKGGFPGFAWRYYVEYGI-------ASSYCQPYPFPHCEHRGAQGNKTPCS 201
Query: 305 LLGKLKTPECKQNCYNPS 322
TP+C C + S
Sbjct: 202 KY-NFDTPKCNATCTDKS 218
>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 122
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 47/84 (55%), Positives = 59/84 (70%), Gaps = 1/84 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
M ++Y++GP+ F+VY DF YKSGVY+H GD +G HAV+++GWG D YWL+AN
Sbjct: 11 MTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSEDGEDYWLLAN 70
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
WN WGD G FKI RG NE DIE
Sbjct: 71 QWNRGWGDDGYFKIRRGTNECDIE 94
>gi|300121294|emb|CBK21674.2| unnamed protein product [Blastocystis hominis]
Length = 561
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 106/229 (46%), Gaps = 34/229 (14%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M++IY GP+ + + YK G+++ G + HA+ V+GWG E+ YW+V NS
Sbjct: 185 MKEIYARGPITCALDATDELVAYKGGIFEDKTGTTSLNHAISVVGWGEEDGKKYWIVRNS 244
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN---NRVEANSSEDDDLETM---------GCQNA 184
W +WG++G F+I+RG N IE RV +D + ++ C
Sbjct: 245 WGTYWGENGWFRIVRGTNNLGIESECTWAVPRVPEKMRLNDKMRSLHNRARYFPHSCAIR 304
Query: 185 K--------GLPRNFDAREKWPECPSLRHIADQS------------NCGSCWAVSVANAI 224
K LP + E P+ +R+I ++ CGSCWA +AI
Sbjct: 305 KQEPAVVTEPLPHFYLKSEDIPKSYDIRNIDGRNYATWDKNQHIPQYCGSCWAQGSTSAI 364
Query: 225 SDRLCIASNG-YFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGV 272
+DR+ I G + T ++S Q ++ C N CNGGW +R+ G+
Sbjct: 365 ADRINIMRKGKWPTVELSVQEVINCG-NTGSCNGGWDSGVYRYAHEEGI 412
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%), Gaps = 1/82 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVANSW 137
+I+ GP+ SV +FL Y GV+ + +G H + V GWGV E+ YW+ NSW
Sbjct: 468 EIFARGPISCYVSVSQEFLDYTGGVFVEHDHSMLGGHIIEVAGWGVTEDGQEYWIGRNSW 527
Query: 138 NDHWGDHGTFKILRGENEADIE 159
++WG++G F+I ++ +IE
Sbjct: 528 GEYWGENGWFRIQTDKDNLEIE 549
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 52/106 (49%), Gaps = 14/106 (13%)
Query: 187 LPRNFDARE----KWPECPSLRHIADQSNCGSCWAVSVANAISDRL-CIASNGYFTGQIS 241
LP+++D R + +HI CGSCWA S A+A++DRL + N + T ++S
Sbjct: 43 LPKSYDPRNIDGVSYVSVSRNQHIPQY--CGSCWAFSAASAVADRLRLMTKNAWPTAELS 100
Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
Q IV C GC+GG A++ GV T EGC Y
Sbjct: 101 PQMIVNCATTAMGCHGGSMTSAYKLMKERGVPT-------EGCMRY 139
>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
Length = 340
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 70/137 (51%), Gaps = 17/137 (12%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E WP C ++ I DQSNCGSCWA++ AISDR C G +IS +++
Sbjct: 98 LPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTL-GGVPDRRISTSNLL 156
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
+C C +GC GG P +AW +W G+ T E CQPY PC HH P
Sbjct: 157 SCCFICGFGCYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCP 209
Query: 302 NCTLLGKLKTPECKQNC 318
N TP+C C
Sbjct: 210 NTI----YDTPKCNTTC 222
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 57/85 (67%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M ++ +GPL VY+DF+ YKSG Y+H GD +G HAV+++GWG + +PYW +ANS
Sbjct: 247 MIELMTNGPLEVTMQVYSDFVGYKSGGYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIANS 306
Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
WN WGD G F I RG NE IE G
Sbjct: 307 WNTDWGDKGYFLIQRGSNECGIESG 331
>gi|303289014|ref|XP_003063795.1| cathepsin B-like cysteine proteinase [Micromonas pusilla CCMP1545]
gi|226454863|gb|EEH52168.1| cathepsin B-like cysteine proteinase [Micromonas pusilla CCMP1545]
Length = 390
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 76/145 (52%), Gaps = 18/145 (12%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIA-DQSNCGSCWAVSVANAISDRLCIASNGYFTGQ--- 239
A GLP FDARE+WP C + A DQ CGSCWAV+ A ++DR CIA+NG G
Sbjct: 113 ADGLPELFDARERWPRCARVVGTALDQGKCGSCWAVATAAVLTDRACIATNGALGGGGGG 172
Query: 240 ---ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
+SA +++C GC GG + A+ + +GVVTGG Y + C PY C+H
Sbjct: 173 GEFLSASQLLSCGAAD-GCEGGDERDAFEYAKTHGVVTGGAYGDESTCAPYLFDACQHPC 231
Query: 297 QGPLQNCTLLGKLKTPECKQNCYNP 321
+ K TPEC +C P
Sbjct: 232 E----------KSPTPECPLSCVRP 246
>gi|145356617|ref|XP_001422524.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582767|gb|ABP00841.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 245
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 71/120 (59%), Gaps = 12/120 (10%)
Query: 187 LPRNFDAREKWPECPSLRHIA-DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP++FD REKWP+C +L A DQ CGSCWAV+ A ++DRLCIA+NG +SA +
Sbjct: 2 LPKDFDVREKWPKCAALVSEALDQGECGSCWAVAPAKVMADRLCIATNGAVASHLSAMQL 61
Query: 246 VAC-----------TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
++C + C+GG+P A+ +G+V+GG + + C PY APC+H
Sbjct: 62 LSCGKLENGTFDAGSTYSGSCDGGFPNEAYEKARTSGIVSGGLFGDDKTCMPYAFAPCQH 121
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 9/86 (10%)
Query: 74 CNAMRQIYEHGPLVA-IFSVYADFLQYKSGVYQHN-----FGDSIGLHAVRVLGWG-VEN 126
C A+ Y HGP+ + + V+ +F +YKSGVY + G++ G H + V+GWG E+
Sbjct: 162 CMALELFY-HGPVSSYVGDVFDEFYKYKSGVYSLSKDVAARGENHGGHVMEVIGWGTTES 220
Query: 127 DIPYWLVANSWNDHWGDHGTFKILRG 152
YW V NSW + WGD G KI G
Sbjct: 221 GTRYWKVYNSWLN-WGDQGYGKIAVG 245
>gi|294876288|ref|XP_002767632.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239869318|gb|EER00350.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 97
Score = 101 bits (251), Expect = 5e-19, Method: Composition-based stats.
Identities = 41/79 (51%), Positives = 58/79 (73%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
N ++I +GP A S+Y DFL Y+SGVY+H G +G+H+V ++GWG+E + YWLV
Sbjct: 4 NIKKEIMTNGPTSATLSMYNDFLSYESGVYKHTSGTFMGVHSVEIIGWGIEKGVDYWLVM 63
Query: 135 NSWNDHWGDHGTFKILRGE 153
NSWN+ WGD+GTFKI +G+
Sbjct: 64 NSWNEDWGDNGTFKIAQGD 82
>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 349
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 51/100 (51%), Positives = 66/100 (66%), Gaps = 3/100 (3%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY A+ V R + M ++Y++GP+ F+VY DF YKSGVY+H GD +G HAV+++
Sbjct: 231 HYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 290
Query: 121 GWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG +D YWL+AN WN WGD G FKI RG NE IE
Sbjct: 291 GWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIE 330
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/137 (37%), Positives = 71/137 (51%), Gaps = 20/137 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDARE WP+C S+ I DQ +CGSCWA ++SDR CI + T +S ++
Sbjct: 102 LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNIT--LSVNDLL 159
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AWR++ +GVVT E C PY C H P
Sbjct: 160 ACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSHPGCEP---- 208
Query: 304 TLLGKLKTPECKQNCYN 320
TP C ++C +
Sbjct: 209 ----AYPTPRCVRHCVD 221
>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 344
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 44/83 (53%), Positives = 56/83 (67%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
N ++I +GP A FS Y DF YKSGVY+H G +G H+V ++GWG E + YWLV
Sbjct: 251 NIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGTEKGVDYWLVM 310
Query: 135 NSWNDHWGDHGTFKILRGENEAD 157
NSWN+ WGDHGTFKI +G+ D
Sbjct: 311 NSWNEGWGDHGTFKIAQGDCGID 333
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 70/120 (58%), Gaps = 12/120 (10%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FDAR+ + EC + H+ DQS CGSCWA++ A + RLCI S G F +SA +
Sbjct: 59 IPSSFDARDAFKECKDVIGHVWDQSACGSCWAIAPVEAFNARLCIKSGGKFNQLLSAGEM 118
Query: 246 VAC-----TPNCWGCNGGWPQLAWRFWGHNGVVTGGDY------NSQEGCQPYTLAPCEH 294
+AC + N GC GG + AW F +G+VTGGD+ ++ +GC PY+ C H
Sbjct: 119 LACCNSVHSCNSHGCQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFPKCAH 178
>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 348
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 51/100 (51%), Positives = 66/100 (66%), Gaps = 3/100 (3%)
Query: 63 HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY A+ V R + M ++Y++GP+ F+VY DF YKSGVY+H GD +G HAV+++
Sbjct: 230 HYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 289
Query: 121 GWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG +D YWL+AN WN WGD G FKI RG NE IE
Sbjct: 290 GWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIE 329
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/137 (37%), Positives = 71/137 (51%), Gaps = 20/137 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDARE WP+C S+ I DQ +CGSCWA ++SDR CI + T +S ++
Sbjct: 101 LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNIT--LSVNDLL 158
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AWR++ +GVVT E C PY C H P
Sbjct: 159 ACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSHPGCEP---- 207
Query: 304 TLLGKLKTPECKQNCYN 320
TP C ++C +
Sbjct: 208 ----AYPTPRCVRHCVD 220
>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 382
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 52/132 (39%), Positives = 68/132 (51%), Gaps = 16/132 (12%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR WP CP++ HI DQ +CGSCWA+ + DR CI SNG +S Q I
Sbjct: 70 IPESFDARTNWPNCPTIGHIYDQGHCGSCWAMCSFEVLQDRFCIHSNGSEKPWLSGQDIT 129
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
+C GCNGGW + A+ + GV T E C PY + C H C+
Sbjct: 130 SCDSRSHGCNGGWTETAFEYAKKAGVPT-------EECVPYLMGKCHH------PGCS-- 174
Query: 307 GKLKTPECKQNC 318
+TP CK+ C
Sbjct: 175 -SWQTPTCKKEC 185
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 40/64 (62%), Gaps = 2/64 (3%)
Query: 63 HYFKKAHMVPR-CNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
+Y K++ + R A++ ++ +GP+ A+F+ Y D Y GVY H G GLHA++++
Sbjct: 198 YYASKSYSIQRNVEAIQLELMRNGPVTAVFTTYDDLAVYWRGVYNHVMGSEQGLHAIKIV 257
Query: 121 GWGV 124
GWGV
Sbjct: 258 GWGV 261
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 21/35 (60%)
Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
E IPYW++ NSW + +G G I RG NE IE
Sbjct: 321 EEGIPYWIIVNSWGEDFGMDGILLIKRGVNECGIE 355
>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 67/105 (63%), Gaps = 1/105 (0%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G IG HAV+++GWG +D YWL+
Sbjct: 246 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTKIGGHAVKLIGWGTSDDGEDYWLL 305
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLET 178
AN WN WGD G FKI RG NE IE G + ++ + D+ T
Sbjct: 306 ANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFKDVTT 350
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 57/103 (55%), Gaps = 11/103 (10%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W +C S+ I DQ +CGSCWA ++SDR CI N +SA +V
Sbjct: 103 LPKEFDARTAWSQCTSIPRILDQGHCGSCWAFGAVESLSDRFCIKYN--LNVSLSANDVV 160
Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
A GCNGG+P AW ++ ++GVVT E C PY
Sbjct: 161 ACCGLLCGLGCNGGFPMGAWLYFKYHGVVT-------EECDPY 196
>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 47/88 (53%), Positives = 61/88 (69%), Gaps = 1/88 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +IG HAV+++GWG +D YWL+
Sbjct: 247 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLL 306
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMG 161
AN WN WGD G FKI RG NE IE G
Sbjct: 307 ANQWNRSWGDDGYFKIRRGTNECGIEHG 334
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 67/135 (49%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W +C S+ I DQ +CGSCWA ++SDR CI N +S ++
Sbjct: 104 LPKEFDARTAWSQCTSVGRILDQGHCGSCWAFGAVESLSDRFCIKYN--MNISLSVNDLL 161
Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR++ H+GVVT E C PY C H P
Sbjct: 162 ACCGFLCGQGCNGGYPIAAWRYFKHHGVVT-------EECDPYFDNTGCSHPGCEP---- 210
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 211 ----AYPTPKCARKC 221
>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 1/93 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R++Y +GP VA+F VY D YKSGVY++ GD +G AVR++GWG N PYW VAN+W
Sbjct: 241 RELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGGQAVRIVGWGKLNGTPYWKVANTW 300
Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
+ WG G ILRG NE +IE +GF E +
Sbjct: 301 DTDWGMDGYLLILRGNNECNIEHLGFAGTPETS 333
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 57/138 (41%), Positives = 75/138 (54%), Gaps = 11/138 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS C + WAVS A+ ISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACRASWAVSTASVISDRYCTV-GGVQQLRISAAHLL 148
Query: 247 A-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+ C GC GG+P AWR++ G+ + CQPY CEH QG C+
Sbjct: 149 SCCKQCGGGCKGGFPGFAWRYYVEYGI-------ASSYCQPYPFPHCEHRGAQGNKTPCS 201
Query: 305 LLGKLKTPECKQNCYNPS 322
TP+C C + S
Sbjct: 202 KY-NFDTPKCNATCTDKS 218
>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 49/100 (49%), Positives = 66/100 (66%), Gaps = 3/100 (3%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+M+ + M ++Y++GP+ F+VY DF YKSGVY+H GD +G HAV+++
Sbjct: 229 HFGVNAYMISSDPHSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHAVKLI 288
Query: 121 GWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG D YWL+AN WN WGD G FKI RG NE +IE
Sbjct: 289 GWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTNECEIE 328
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 6/103 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W C ++ I DQ +CGSCWA ++SDR CI +SA +
Sbjct: 100 LPQEFDARVAWSNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYG--LNISLSANDLY 157
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
AC GC+GG+P AW+++ GVVT Y EGC
Sbjct: 158 ACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCS 200
>gi|300121755|emb|CBK22330.2| unnamed protein product [Blastocystis hominis]
Length = 562
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 69/234 (29%), Positives = 109/234 (46%), Gaps = 38/234 (16%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
N M++IY GP+ + + ++YK G+Y+ G H++ V+GWG E+ YW+
Sbjct: 183 NMMKEIYARGPITCTIADPEELMEYKGGIYRDTTGAKSLDHSISVVGWGEEDGQKYWIAR 242
Query: 135 NSWNDHWGDHGTFKILRGEN----EADI---------EMGFNNRVEAN---------SSE 172
NSW WG+ G F+I+RGEN EAD EM N+++ + S
Sbjct: 243 NSWGTFWGEKGWFRIVRGENNLGIEADCQWAVPRVPEEMILNDQMRSQRNRARYFPRSCA 302
Query: 173 DDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSN-------------CGSCWAVS 219
D + M P + E P+ +R+I D N CGSCWA +
Sbjct: 303 RPDTKEMKEHVVSPRPHTYIKSEDIPKNYDIRNI-DGVNYATWDKNQHIPQYCGSCWAQA 361
Query: 220 VANAISDRLCIASNG-YFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGV 272
+A+SDR+ + G + T ++S Q I+ C+ C GGW +++ H G+
Sbjct: 362 PTSALSDRINLMRKGKWPTVELSVQEIINCSGKG-SCEGGWQSGVYQYAYHQGI 414
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 47/76 (61%), Gaps = 1/76 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG-VENDIPYWLVANSW 137
+I+ GP+ V +FL Y+ G+++ N + +G H+V V GWG E+ YW+ NSW
Sbjct: 470 EIFARGPVSCDIWVTQEFLDYQGGIFKENGSEYLGRHSVEVAGWGETEDGTKYWIGRNSW 529
Query: 138 NDHWGDHGTFKILRGE 153
+WG+HG F+I+ GE
Sbjct: 530 GTYWGEHGWFRIIIGE 545
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 54/106 (50%), Gaps = 14/106 (13%)
Query: 187 LPRNFDARE----KWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG-YFTGQIS 241
LP+++D R+ + +HI CGSCW+ + +++SDRL + + G + +S
Sbjct: 43 LPKSYDPRDIDGRNYVTVTKNQHIPQY--CGSCWSFASVSSVSDRLKLMTKGKWPVHDLS 100
Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
Q I+ C N GC GG P A+++ +GV +EGC Y
Sbjct: 101 PQVILNCDHNSNGCQGGHPLTAFKYMHDHGV-------PEEGCMRY 139
>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
Length = 280
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 46/95 (48%), Positives = 63/95 (66%), Gaps = 2/95 (2%)
Query: 67 KAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
A+ V R A Q I +GP+V F++Y D +YKSGVY+H G +G HA++++GWG
Sbjct: 176 SAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT 235
Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+N IPYWL+ANSW WG++G K+ RG NE IE
Sbjct: 236 QNGIPYWLIANSWGADWGENGFLKMRRGVNECGIE 270
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 55/133 (41%), Positives = 72/133 (54%), Gaps = 13/133 (9%)
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NCW-GCNGGWPQLAWRFW 267
+ CGSCWA S A ISDR+CIA+ G IS ++AC +C GC GG+P A+R+W
Sbjct: 59 AQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGRSCGDGCEGGYPIQAFRWW 118
Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
GVVTGGD+ GC+PY APC + C + KTP C +C Y + Y
Sbjct: 119 NSRGVVTGGDFRG-SGCRPYPFAPCNSY------KCP---EEKTPTCSLSC-QFGYSTAY 167
Query: 328 RFDLKKGKKAHMV 340
D + G A+ V
Sbjct: 168 AKDKRFGVSAYAV 180
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 43/66 (65%)
Query: 83 HGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWG 142
+GP+ A F+VY DF YK GVYQ+ G +G+HA++++GWG E+ YWL+ANSW G
Sbjct: 3 NGPVEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGTEHGTDYWLIANSWGAQCG 62
Query: 143 DHGTFK 148
F
Sbjct: 63 SCWAFS 68
>gi|324514184|gb|ADY45787.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 476
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 79/163 (48%), Gaps = 17/163 (10%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
A LP FDAR KW C SL ++ +Q CG+C+AV+ SDR CIASNG S +
Sbjct: 190 ADSLPSEFDARRKWSYCSSLHNVPNQGGCGACYAVAAVGVASDRACIASNGTLQSMFSEE 249
Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL-----APCEHHVQG 298
++ C C C GG P A +W G+VTGG ++GC+PY++ PC V
Sbjct: 250 DVLGCCAVCGNCYGGDPLKALVYWVDEGLVTGG----RDGCRPYSVDLSCGVPCSPAVY- 304
Query: 299 PLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
PL +C + C + ++ Y D G A+ +
Sbjct: 305 PLAE-------YRRKCYRQCQDIYFQYNYESDKHYGSMAYSMF 340
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 57/114 (50%), Gaps = 15/114 (13%)
Query: 48 KKKKRLYLPTSIPLSHYFKKAHMVP------RCNAMRQIYEHGPLVAIFSVYADFLQYKS 101
K +R+ LPT I Y + P R M+++Y GP+ F V +FL Y S
Sbjct: 349 KGSERVKLPTVI---GYLNETSDEPLTDKEIRQIIMKELYLWGPMTMAFPVTEEFLHYSS 405
Query: 102 GVYQ----HNFGDSIGL-HAVRVLGWG-VENDIPYWLVANSWNDHWGDHGTFKI 149
GV+ NF D I H R++GWG + D YWL NS+ HWGD G F+I
Sbjct: 406 GVFSPFPAANFSDRIVYWHVARLIGWGKYDGDNHYWLAVNSFGRHWGDDGVFRI 459
>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
Length = 298
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 45/83 (54%), Positives = 54/83 (65%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M I E GP+ F+VY DF Y G+Y H G+ G HAV+ +GWGVEN YW VANS
Sbjct: 197 MAMIAEGGPVETAFTVYEDFENYAGGIYHHVTGEEAGGHAVKFVGWGVENGTKYWKVANS 256
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN +WG+ G F+ILRG NE IE
Sbjct: 257 WNPYWGEAGYFRILRGSNEGGIE 279
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 59/117 (50%), Gaps = 13/117 (11%)
Query: 188 PRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
P FD+ +WPEC L I DQSNCG CWA + A A SDR CIA+ G +SAQ V
Sbjct: 25 PEAFDSAARWPECAKLIGDIRDQSNCGCCWAFAGAEAASDRQCIATGGAVAVPLSAQD-V 83
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT-------LAP-CEHH 295
N GC+GG W + G VTGG YN G P+ AP C HH
Sbjct: 84 CFNANVDGCDGGQIITPWTYVAKAGAVTGGQYN---GTGPFGAGLCADWFAPHCHHH 137
>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
Length = 199
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/131 (43%), Positives = 76/131 (58%), Gaps = 5/131 (3%)
Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGV 272
SCWAVS A+A+SDR+CIA+ G IS Q IV+C C +GC GGW AW ++ GV
Sbjct: 1 SCWAVSSASAMSDRVCIATQGAKQVLISDQDIVSCCTWCGYGCQGGWSIRAWYYFAEQGV 60
Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQN-CTLLGKLKTPECKQNCYNPSYESTYRFDL 331
VTGG+YN++ C+PY + PC +H P C L TP CK+ C Y +Y D
Sbjct: 61 VTGGNYNTKGSCRPYEIHPCGYHKDEPYYGECDDLA--DTPRCKRRC-QLGYPKSYPSDK 117
Query: 332 KKGKKAHMVLM 342
G+ A+ + M
Sbjct: 118 HYGRTAYQLPM 128
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 54/91 (59%), Gaps = 7/91 (7%)
Query: 48 KKKKRLYLPTSIPLS-HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVY 104
K++ +L P S P HY + A+ +P + R+I +GP+VA F+VY DF YK G+Y
Sbjct: 102 KRRCQLGYPKSYPSDKHYGRTAYQLPMSVESIQREIMRNGPVVAGFTVYEDFAHYKGGIY 161
Query: 105 QHNFGDSIGLHAVRVLGWGVEN----DIPYW 131
+H G G HAV+V+GWG E IPYW
Sbjct: 162 KHTSGKKTGGHAVKVIGWGSEQKGSEKIPYW 192
>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
Length = 527
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 56/86 (65%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
NA I GP+ A + VY DFL YKSGVY+H G +G HAV+++GWG EN YWLV
Sbjct: 435 NAKNAIRTDGPISASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEENGEAYWLVV 494
Query: 135 NSWNDHWGDHGTFKILRGENEADIEM 160
NSWN+ WGD G FKI G E D ++
Sbjct: 495 NSWNEDWGDQGLFKIALGNCEIDDDL 520
Score = 43.9 bits (102), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 19/58 (32%), Positives = 28/58 (48%)
Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
V G+ +GC PY PC HH+ G +TP C + C+NP Y ++ + D
Sbjct: 362 VARGNLTKGDGCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKND 419
>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
Length = 563
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 70/220 (31%), Positives = 105/220 (47%), Gaps = 38/220 (17%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
N M++IY GP+ +V D ++YK G+Y+ G HA+ V+GWG E+ YW+
Sbjct: 183 NMMKEIYARGPITCSIAVPDDLMEYKGGIYRDTTGAKTLDHAISVVGWGEEDGQKYWIAR 242
Query: 135 NSWNDHWGDHGTFKILRGEN----EADI---------EMGFNNRVEAN---------SSE 172
NSW WG+ G F+I+RGEN EAD EM N+++ + S
Sbjct: 243 NSWGTFWGEKGWFRIVRGENNLGIEADCQWAVPRVPEEMILNDQMRSQRNRARYFPRSCL 302
Query: 173 DDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSN-------------CGSCWAVS 219
D M P + E P+ +R+I D N CGSCWA +
Sbjct: 303 LKDANRMKEHVVSPRPHTYIKSEDIPKNYDIRNI-DGVNYATWDKNQHIPQYCGSCWAQA 361
Query: 220 VANAISDRLCIASNG-YFTGQISAQHIVACTPNCWGCNGG 258
+A+SDR+ + G + T ++SAQ ++ C+ N C+GG
Sbjct: 362 PTSALSDRINLMRKGKWPTVELSAQEVINCS-NAGTCDGG 400
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 48/81 (59%), Gaps = 1/81 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG-VENDIPYWLVANSW 137
+I+ GP+ V +FL Y+ G++ + G +G HAV V GWG E+ YW+ NSW
Sbjct: 470 EIFARGPVSCSMIVTEEFLAYQGGIFVDDRGHIVGYHAVEVAGWGETEDGTKYWIARNSW 529
Query: 138 NDHWGDHGTFKILRGENEADI 158
+WG+HG F+++ G ++ I
Sbjct: 530 GPYWGEHGWFRMIVGVSKGLI 550
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 54/106 (50%), Gaps = 14/106 (13%)
Query: 187 LPRNFDARE----KWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG-YFTGQIS 241
LP+++D R+ + +HI CGSCW+ + +++SDRL + + G + +S
Sbjct: 43 LPKSYDPRDIDGRNYVTVTKNQHIPQY--CGSCWSFASVSSVSDRLKLMTKGKWPVHDLS 100
Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
Q I+ C N GC GG P A+++ +GV +EGC Y
Sbjct: 101 PQVILNCDHNSNGCQGGHPLTAFKYMHDHGV-------PEEGCMRY 139
>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 275
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 43/88 (48%), Positives = 57/88 (64%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
M ++ +GP+ A F V+ DFL YKSG+YQH G S G H V ++GWG EN +PYWL+
Sbjct: 181 TVMDEVANNGPVYACFEVFEDFLNYKSGIYQHKTGKSKGWHHVMLMGWGTENGVPYWLLQ 240
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGF 162
NSW WG+ G F+I RG N+ I+ F
Sbjct: 241 NSWGSGWGEKGFFRIRRGTNDCHIDEIF 268
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 59/135 (43%), Gaps = 28/135 (20%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P +FD R+KWP + +Q++CGSCWA + + + R+ I G + G +S Q +V+
Sbjct: 58 PASFDCRQKWPG--KAEPVRNQASCGSCWAHAASETMGFRMGI--RGCYKGVMSPQDLVS 113
Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
C N GC GG+ W + G+ T E C PY + G
Sbjct: 114 CESNNMGCEGGYADRVWNWIQKKGITT-------EQCLPY-----------------VSG 149
Query: 308 KLKTPECKQNCYNPS 322
+ P C C N S
Sbjct: 150 SGRVPTCPSKCKNGS 164
>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
Length = 374
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 86/185 (46%), Gaps = 45/185 (24%)
Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
FDARE+WPEC S+ I D S+C S WA S A ++SDRLCI S G +SAQ +++C
Sbjct: 85 FDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCCT 144
Query: 251 NCWGCN----------------------------------------GGWPQLAWRFWGHN 270
+ C GG AW++W +
Sbjct: 145 GVFSCGEGDSEHWQFRNSKFRKPRCQKFNKEILEARRNLETREKCAGGNVFKAWQYWQKH 204
Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
G+ TGG Y SQ GC+PY+++PC+ + L ++TP C++ C +S Y +
Sbjct: 205 GLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSCEKKC-----KSGYPVE 259
Query: 331 LKKGK 335
L K +
Sbjct: 260 LDKDR 264
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 61/99 (61%), Gaps = 2/99 (2%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY +P + + +GP+ A VY DFLQY +G+Y H G+ G +VR+L
Sbjct: 265 HYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGIYVHLTGNKQGHLSVRIL 324
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG+ +PYWL+ANSW WG++GTF++LRG NE +E
Sbjct: 325 GWGMYEGVPYWLLANSWGKQWGENGTFRVLRGVNECGLE 363
>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 339
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y +GP+ F V+ DF YK+GVY+H +G IG HAV+++GWG +D + YW +
Sbjct: 237 DLMAELYTNGPIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVDYWTI 296
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
NSWN +WG+HG F+I RG NE IE
Sbjct: 297 VNSWNTNWGEHGLFRIARGGNECGIE 322
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 59/102 (57%), Gaps = 6/102 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR+ W C ++ I DQ +CGSCWA A +++DR CI N + +S ++
Sbjct: 95 LPKEFDARKHWGHCSTIGAILDQGHCGSCWAFGAAESLTDRFCIHMNESVS--LSENDLL 152
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGC 284
AC C GC+GG+P AWR++ GVVT Y Q GC
Sbjct: 153 ACCGFECGDGCDGGYPIRAWRYFKRTGVVTSKCDPYFDQIGC 194
>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/110 (45%), Positives = 66/110 (60%), Gaps = 8/110 (7%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y +GP F+VY DF YKSGVY+H G +G HAV+++GWG D YWL+
Sbjct: 239 SIMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLL 298
Query: 134 ANSWNDHWGDHGTFKILRGENEA---DIEMGF----NNRVEANSSEDDDL 176
AN WN WGD G FKI+RG NE D+ G N +E+ +DD L
Sbjct: 299 ANQWNRSWGDDGYFKIIRGTNECGIEDVTAGMPSTKNLDIESGVRDDDSL 348
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 57/103 (55%), Gaps = 6/103 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR WP+C S+ I DQ +CGSCWA +++DR CI T +S ++
Sbjct: 96 LPKTFDARTAWPQCLSIADILDQGHCGSCWAFGAVESLTDRFCIHYGTNVT--LSVNDLL 153
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
AC GC+GG+P AW+++ GVVT Y Q GC
Sbjct: 154 ACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCS 196
>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
Length = 359
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 69/101 (68%), Gaps = 5/101 (4%)
Query: 63 HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
HY KA+ V P+ + M ++Y++GP+ F+V+ DF YKSGVY+H G ++G HAV++
Sbjct: 232 HYSVKAYRVKSDPQ-DIMTEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKL 290
Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+GWG ++ YWL+AN WN +WGD G FKI RG NE IE
Sbjct: 291 IGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 331
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 65/135 (48%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W +C ++ I DQ +CGSCWA ++ DR C S+ +S ++
Sbjct: 103 LPKEFDARAAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFC--SHFDMNISLSVNDLL 160
Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG P AWR+ H+GVVT E C PY C H P
Sbjct: 161 ACCGFLCGAGCDGGTPIYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 209
Query: 304 TLLGKLKTPECKQNC 318
+TP+C + C
Sbjct: 210 ----AYQTPKCVRKC 220
>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 388
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/83 (53%), Positives = 59/83 (71%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+I ++GP+ A F+VY DF YKSGVY+H G +G HAV+++GWG++ + YWLV NSW
Sbjct: 286 REIIDNGPVAAAFTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGWGIDQNEQYWLVMNSW 345
Query: 138 NDHWGDHGTFKILRGENEADIEM 160
N +WGD G FKI GE D E+
Sbjct: 346 NVNWGDQGIFKIAIGECGIDSEV 368
>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 350
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 61/86 (70%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI-PYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G+ +G HAV+++GWG D YWL+
Sbjct: 241 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYEHITGEMMGGHAVKLIGWGTSADGKDYWLL 300
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G FKI+RG+NE IE
Sbjct: 301 ANQWNRGWGDDGYFKIIRGKNECGIE 326
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 64/135 (47%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR KW C ++ I DQ +CGSCWA + DR CI N +S +V
Sbjct: 98 LPKEFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHLN--MNISLSVNDLV 155
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AW++ NGVVT + C PY C+H P
Sbjct: 156 ACCGFMCGDGCDGGYPISAWQYLVENGVVT-------DECDPYFDQVGCKHPGCEP---- 204
Query: 304 TLLGKLKTPECKQNC 318
TP C++ C
Sbjct: 205 ----AYPTPACEKKC 215
>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
Length = 359
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 69/101 (68%), Gaps = 5/101 (4%)
Query: 63 HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
HY KA+ V P+ + M ++Y++GP+ F+V+ DF YKSGVY+H G ++G HAV++
Sbjct: 232 HYSVKAYRVKSDPQ-DIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKL 290
Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+GWG ++ YWL+AN WN +WGD G FKI RG NE IE
Sbjct: 291 IGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 331
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 66/135 (48%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W +C ++ I DQ +CGSCWA ++ DR CI + + +S ++
Sbjct: 103 LPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNIS--LSVNDLL 160
Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG P AWR+ H+GVVT E C PY C H P
Sbjct: 161 ACCGFLCGAGCDGGTPIYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 209
Query: 304 TLLGKLKTPECKQNC 318
+TP+C + C
Sbjct: 210 ----AYQTPKCVRKC 220
>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
Length = 357
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 69/101 (68%), Gaps = 5/101 (4%)
Query: 63 HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
HY KA+ V P+ + M ++Y++GP+ F+V+ DF YKSGVY+H G ++G HAV++
Sbjct: 230 HYSVKAYRVKSDPQ-DIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKL 288
Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+GWG ++ YWL+AN WN +WGD G FKI RG NE IE
Sbjct: 289 IGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 329
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 66/135 (48%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W +C ++ I DQ +CGSCWA ++ DR CI + + +S ++
Sbjct: 101 LPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNIS--LSVNDLL 158
Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG P AWR+ H+GVVT E C PY C H P
Sbjct: 159 ACCGFLCGAGCDGGTPIYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 207
Query: 304 TLLGKLKTPECKQNC 318
+TP+C + C
Sbjct: 208 ----AYQTPKCVRKC 218
>gi|161343857|tpg|DAA06109.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 163
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 58/94 (61%)
Query: 66 KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 125
KK C+A + + +HGP V VY DFL YKSGVY H GD +GL +VR++GWG+E
Sbjct: 60 KKVFTFDGCSARKNLRKHGPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSVRMIGWGLE 119
Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+WL ANSW WGD G FKI R NE IE
Sbjct: 120 GGQAFWLFANSWGTSWGDKGFFKIRRFVNERWIE 153
>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
Length = 207
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/132 (42%), Positives = 72/132 (54%), Gaps = 12/132 (9%)
Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
FDA E WP CP++ I DQS CGSCWAV+ +A+SDR C G +ISA +++C
Sbjct: 1 FDAGEAWPNCPTITEIRDQSGCGSCWAVAARSAMSDRYC-TRGGVRDLRISAGDLLSCCN 59
Query: 251 NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCTLLGK 308
C GCNGG P AW ++ G+V+ E CQPY PC HHV C++ +
Sbjct: 60 ACGLGCNGGDPDWAWLYYVETGIVS-------EFCQPYPFPPCAHHVNSTHYTPCSV--E 110
Query: 309 LKTPECKQNCYN 320
TP C C N
Sbjct: 111 YDTPFCNITCTN 122
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 31/61 (50%), Positives = 44/61 (72%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
R+++ +GP F+VY DF+ Y GVY+H G+++G HAVR++GWG N PYW +ANSW
Sbjct: 145 RELFLYGPFEVAFTVYEDFVAYSDGVYKHFSGNALGGHAVRLVGWGNLNGTPYWKIANSW 204
Query: 138 N 138
N
Sbjct: 205 N 205
>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
Length = 356
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 66/100 (66%), Gaps = 3/100 (3%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ A+M+ + M ++Y++GP+ F+VY DF YKSGVY+H GD +G HAV+++
Sbjct: 229 HFGVNAYMISSDPHSIMTELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLI 288
Query: 121 GWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG D YWL+AN WN WGD G FKI RG +E +IE
Sbjct: 289 GWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTDECEIE 328
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 56/103 (54%), Gaps = 6/103 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR WP C ++ I DQ +CGSCWA ++SDR CI +SA ++
Sbjct: 100 LPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYG--LNISLSANDLL 157
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
AC GC+GG+P AW+++ GVVT Y EGC
Sbjct: 158 ACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCS 200
>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
Length = 351
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 69/101 (68%), Gaps = 5/101 (4%)
Query: 63 HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
HY KA+ V P+ + M ++Y++GP+ F+VY DF YKSGVY+H G ++G HAV++
Sbjct: 224 HYSVKAYTVNSDPQ-DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKL 282
Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+GWG ++ YWL+AN WN +WGD G FKI RG NE IE
Sbjct: 283 VGWGTSHEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 323
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 66/137 (48%), Gaps = 20/137 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDAR W +C ++ I DQ +CGSCWA ++SDR CI + +S I+
Sbjct: 95 LPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFD--MNVSLSVNDIL 152
Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC GG P AW + H+GVVT E C PY C H P
Sbjct: 153 ACCGLLCGAGCAGGTPFSAWIYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 201
Query: 304 TLLGKLKTPECKQNCYN 320
+TP+C + C N
Sbjct: 202 ----TYRTPKCVKKCVN 214
>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
Length = 350
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
N M +++ +GP+ FSVY DF Y++GVY+H G +G HAV+++GWG +D I YWL+
Sbjct: 237 NIMAEVFNNGPVEVSFSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGTTDDGIDYWLI 296
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
ANSWN WG+ G FKI RG NE IE
Sbjct: 297 ANSWNTAWGEGGYFKIARGVNECGIE 322
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/135 (40%), Positives = 66/135 (48%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR+ WP C S R I DQ +CGSCWA + A+SDR CI +S +V
Sbjct: 95 LPSKFDARKAWPHCTSTRSILDQGHCGSCWAFAAVEALSDRFCIHFQ--VNATLSENDLV 152
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNC 303
AC C GCNGG+P AWR++ GVVT + C PY C H P
Sbjct: 153 ACCGFRCGSGCNGGFPLSAWRYFSRRGVVT-------DECDPYFDNDGCNHPGCEP---- 201
Query: 304 TLLGKLKTPECKQNC 318
TP C +NC
Sbjct: 202 ----SYPTPRCVKNC 212
>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
Length = 356
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 69/101 (68%), Gaps = 5/101 (4%)
Query: 63 HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
HY KA+ V P+ + M ++Y++GP+ F+VY DF YKSGVY+H G ++G HAV++
Sbjct: 229 HYSVKAYTVNSDPQ-DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKL 287
Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+GWG ++ YWL+AN WN +WGD G FKI RG NE IE
Sbjct: 288 VGWGTSHEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 328
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 66/137 (48%), Gaps = 20/137 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDAR W +C ++ I DQ +CGSCWA ++SDR CI + +S I+
Sbjct: 100 LPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFD--MNVSLSVNDIL 157
Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC GG P AW + H+GVVT E C PY C H P
Sbjct: 158 ACCGLLCGAGCAGGTPFSAWIYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 206
Query: 304 TLLGKLKTPECKQNCYN 320
+TP+C + C N
Sbjct: 207 ----TYRTPKCVKKCVN 219
>gi|60598652|gb|AAX25875.1| unknown [Schistosoma japonicum]
Length = 195
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 57/148 (38%), Positives = 79/148 (53%), Gaps = 11/148 (7%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM R + D ++E +P FD+R+KWP C S+ I
Sbjct: 28 RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 77
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-AWRF 266
DQS CGSCWA A++DR+CI S G + ++SA +++C +C G G AW +
Sbjct: 78 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDY 137
Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
W G+VTGG + GCQPY CEH
Sbjct: 138 WVKRGIVTGGSKENHTGCQPYPFPKCEH 165
>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
Length = 343
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 66/101 (65%), Gaps = 5/101 (4%)
Query: 63 HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
HY ++V P+ + M +IY++GP+ F+VY DF YKSGVY+H G +IG HAV++
Sbjct: 233 HYSINTYVVESNPQ-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKL 291
Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+GWG +D YWL+AN WN WGD G F I RG NE IE
Sbjct: 292 IGWGTTDDGEDYWLLANQWNRSWGDDGYFMIRRGTNECGIE 332
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 71/135 (52%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDAR WP+C S+ I DQ +CGSCWA ++SDR CI T +S ++
Sbjct: 104 LPKSFDARTHWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFGMNIT--LSVNDLL 161
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC C GC+GG+P AW+++ ++GVVT E C PY C H P N
Sbjct: 162 ACCGFRCGDGCDGGYPISAWQYFSYSGVVT-------EECDPYFDQTGCSHPGCEPAYN- 213
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 214 -------TPQCLRKC 221
>gi|294890224|ref|XP_002773108.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878009|gb|EER04924.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 109
Score = 99.0 bits (245), Expect = 3e-18, Method: Composition-based stats.
Identities = 50/109 (45%), Positives = 65/109 (59%), Gaps = 8/109 (7%)
Query: 52 RLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
R +L S+P + A NA+R GP+ A F VY DFL Y+SGVY+H G
Sbjct: 2 RHFLVESVPYEYSVNDAK-----NAIRT---DGPVSASFIVYEDFLAYRSGVYKHTSGKE 53
Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
+G HAV+++GWG E YWLV NSWN+ WGD+G FKI G E D ++
Sbjct: 54 LGGHAVKIIGWGEETGQAYWLVVNSWNEDWGDNGLFKIALGNCEIDDDL 102
>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
Length = 313
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/98 (48%), Positives = 61/98 (62%), Gaps = 1/98 (1%)
Query: 63 HYFKKAHMVPRCNA-MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
H K + + A M++I +GP+ A FSVY DFL YKSGVYQH G +G H V++ G
Sbjct: 207 HKMAKIYSINSVEAIMQEISTNGPVEACFSVYEDFLGYKSGVYQHTTGKFLGGHCVKIFG 266
Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+G N + YW VANSW WGD+G F I RG +E IE
Sbjct: 267 YGTLNGVNYWSVANSWTTSWGDNGIFLIKRGSDECGIE 304
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 61/131 (46%), Gaps = 15/131 (11%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P +FD+R W C ++ +I +Q+ CGSCWA + DR+CI Q+S +V
Sbjct: 79 PASFDSRTAWSNCTTIGYIENQARCGSCWAFGAVESAQDRICIHKG--LDVQLSFLDLVT 136
Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
C + GC GG AW F GVVT + C+PYT+ C P L
Sbjct: 137 CDQSDDGCEGGDDVSAWNFLKKQGVVT-------QECKPYTIPTC------PPAQQPCLN 183
Query: 308 KLKTPECKQNC 318
+ TP C + C
Sbjct: 184 FVNTPNCVKQC 194
>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 67/110 (60%), Gaps = 4/110 (3%)
Query: 58 SIPLSHYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
SIPL Y A + + R++Y +GP VA+F VY D YKSGVY++ GD +G
Sbjct: 218 SIPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG 277
Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE-MGFN 163
AVR++GWG N PYW VANSW+ WG +G IL G NE +IE +GF
Sbjct: 278 QAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYMLILGGNNECNIEHLGFT 327
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/138 (42%), Positives = 77/138 (55%), Gaps = 11/138 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS C + WAVS A+AISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK-QLRISAAHLL 148
Query: 247 A-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+ C GC GG+P AW ++ G+ + GCQPY CEH QG C+
Sbjct: 149 SCCKQCGGGCKGGFPGFAWLYYVEYGI-------ASSGCQPYPFPHCEHRGAQGNKTPCS 201
Query: 305 LLGKLKTPECKQNCYNPS 322
K TP+C C + S
Sbjct: 202 KY-KFDTPKCNATCTDKS 218
>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
Length = 407
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 50/128 (39%), Positives = 73/128 (57%), Gaps = 2/128 (1%)
Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGV 272
SCWAV+ A+SDR+CI S G +SA +++C C +GC GG P AW++W +G+
Sbjct: 163 SCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCKTCGFGCFGGEPMAAWKYWVLSGI 222
Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLK 332
VTG DY + GC+PY PCEHH TP+C + C + +Y+ Y+ D
Sbjct: 223 VTGSDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCDRQC-DKNYKKPYKADKY 281
Query: 333 KGKKAHMV 340
G++A+ V
Sbjct: 282 YGEQAYNV 289
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/88 (48%), Positives = 56/88 (63%), Gaps = 3/88 (3%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I GP+ A F VY DFL Y G+Y+H G G HAV++LGWG++ + YWL ANSW
Sbjct: 298 KEIMTLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAANSW 357
Query: 138 NDHWGD---HGTFKILRGENEADIEMGF 162
N WG+ G F+ILRG +E IE G
Sbjct: 358 NTDWGEDVFSGYFRILRGVDECGIESGI 385
>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 65/157 (41%), Positives = 86/157 (54%), Gaps = 15/157 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS C + WAVS A+AISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK-QLRISAAHLL 148
Query: 247 A-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+ C GC GG+P AWR++ G+ + CQPY CEHH QG C+
Sbjct: 149 SCCKQCGGGCKGGFPGFAWRYYVEYGI-------ASSYCQPYPFPQCEHHGAQGNKTPCS 201
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
K TP+C C + T +GK A+M+L
Sbjct: 202 NY-KFVTPQCNTTC----TDKTIPLIKYRGKDAYMLL 233
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 69/109 (63%), Gaps = 4/109 (3%)
Query: 58 SIPLSHYF-KKAHMV-PRCNAM-RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
+IPL Y K A+M+ P R++Y +GP VAI VY D YKSGVY++ G +G+
Sbjct: 218 TIPLIKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGV 277
Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE-MGF 162
AV+V+GWG N PYW VAN+W+ WG G ILRG NE +IE +GF
Sbjct: 278 TAVKVVGWGKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGF 326
>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y +GP+ F VY DF YK+GVY+H FG +G HAV+++GWG +D + YW +
Sbjct: 245 DLMAELYTNGPVEVAFEVYEDFAHYKTGVYKHLFGGFMGGHAVKLIGWGTTDDGVDYWTI 304
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
NSWN +WG+ G F+I+RG +E IE
Sbjct: 305 VNSWNTNWGEDGLFRIVRGNDECGIE 330
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 73/141 (51%), Gaps = 22/141 (15%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR++W CP++ I Q +CGSCWA +++DR CI N + +S ++
Sbjct: 103 LPKEFDARKQWSHCPTIGDILGQGHCGSCWAFGAVESLTDRFCIHLNESVS--LSENDLL 160
Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQPYTLAPCEHHVQGPLQN 302
AC C +GC GG+P AW+++ H+GVVT Y Q+GC P
Sbjct: 161 ACCGFECGYGCEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGCAHPGCYP----------- 209
Query: 303 CTLLGKLKTPECKQNCYNPSY 323
+TP+C++ C + +
Sbjct: 210 -----TYETPKCEKQCVDDEF 225
>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 273
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 42/87 (48%), Positives = 56/87 (64%)
Query: 76 AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
M ++ +GP+ A F V+ DF Y+SGVYQH G S G H V ++GWG EN +PYWL+ N
Sbjct: 180 VMDEVANNGPVYACFEVFEDFYNYRSGVYQHKTGRSQGWHHVMLMGWGTENGVPYWLLQN 239
Query: 136 SWNDHWGDHGTFKILRGENEADIEMGF 162
SW WG+ G F+I RG N+ I+ F
Sbjct: 240 SWGSGWGEKGFFRIRRGTNDCHIDEIF 266
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 57/135 (42%), Gaps = 28/135 (20%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P +FD R+KWP + +Q +CGSCWA + + + R+ I G +S Q +V+
Sbjct: 56 PASFDCRQKWPG--KAEPVRNQGSCGSCWAHAASETMGFRMGIRRCS--KGVMSPQDLVS 111
Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
C N GCNGG+ W + G+ T E C PY + G
Sbjct: 112 CESNNMGCNGGYADRVWNWIQKKGITT-------EQCIPY-----------------VSG 147
Query: 308 KLKTPECKQNCYNPS 322
+ P C C N S
Sbjct: 148 SGRVPTCPSKCKNGS 162
>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
Length = 174
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
N M ++Y++GP+ FSVY DF YKSGVY+H G ++G HAV++ GWG ++ YWL+
Sbjct: 76 NIMEEVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKLNGWGTSDEGEDYWLL 135
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN +WGD G FKI RG NE IE
Sbjct: 136 ANQWNTNWGDDGYFKIKRGTNECGIE 161
>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 288
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 92/195 (47%), Gaps = 27/195 (13%)
Query: 157 DIEMGFNNRVEANSSEDDDLETMGCQ--NAKGLPRNFDAREKWPECPS-LRHIADQSNCG 213
D+ G N + +S+ DD+ +G K LP NFDAR+K+ C + H+ DQS C
Sbjct: 5 DVPTGCPNGPKPSSTSDDETRLLGPTKPELKDLPSNFDARQKFASCAGVIGHVRDQSACH 64
Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC------TPNCWGCNGGWPQLAWRFW 267
+CW VS ++DR+CI S G F +S + +C P GC GG F
Sbjct: 65 NCWTVSSTGMLNDRVCIKSGGTFRDILSVGYFTSCCNPANGCPKAKGCQGGNLLEGLNFL 124
Query: 268 GHNGVVTG------GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNP 321
++G+VTG G +S +GC PY C+H +P C+ C N
Sbjct: 125 KNHGIVTGDEFKPAGQLSSADGCWPYPFPKCKH------------AGYSSPACQTKCTNK 172
Query: 322 SYESTYRFDLKKGKK 336
+Y+++ + DL + K
Sbjct: 173 AYKTSLQQDLHRAKS 187
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 57/90 (63%), Gaps = 1/90 (1%)
Query: 65 FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
F + +P+ N ++I+ +GP++ + S+Y D YK+GVY H G G+H ++++GWGV
Sbjct: 188 FGRLPAIPQ-NIKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV 246
Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
E+ YWL NSWN+ WGDHG K+ G
Sbjct: 247 ESGQDYWLAVNSWNEEWGDHGMIKLAVGRT 276
>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
Length = 197
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 51/128 (39%), Positives = 73/128 (57%), Gaps = 1/128 (0%)
Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGV 272
SCWA AISDR+CIAS G +SA +++C +C +GCNGG P AW+FW G+
Sbjct: 1 SCWAFGAVEAISDRICIASKGKTQVTLSAADLLSCCRSCGFGCNGGDPLSAWKFWVKEGI 60
Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLK 332
VTG ++++ GC+PY CEHH + TP+C+++C E TY+ D
Sbjct: 61 VTGSNHSTNAGCKPYPFPACEHHSNKTHYDPCKHDLFPTPKCEKSCQATFGERTYKEDKY 120
Query: 333 KGKKAHMV 340
G+ A+ V
Sbjct: 121 FGRSAYGV 128
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 36/54 (66%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
++I +GP+ F VY DFL Y G+Y H G G HAV+++GWG++N +PYW
Sbjct: 137 KEIITYGPVEVAFEVYEDFLNYAGGIYVHQGGALGGGHAVKMIGWGIDNGVPYW 190
>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
Length = 334
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 76/133 (57%), Gaps = 3/133 (2%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV- 246
P+ FD+R W C + HI DQ NCGSCW+ S A +DRLC+++ G F +S + +
Sbjct: 86 PQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAF 145
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
C GC GG+P AW+++ GV TGGDY ++EGC PY + PC ++ QG C
Sbjct: 146 CCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPC-YNKQGK-NTCGGQ 203
Query: 307 GKLKTPECKQNCY 319
+ +C + CY
Sbjct: 204 PMERNHQCPKTCY 216
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 70/122 (57%), Gaps = 2/122 (1%)
Query: 51 KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNF- 108
K Y T++ + K +++ + Q + +GP+ A F VY DF YKSG+Y+
Sbjct: 213 KTCYGKTTVQNRYKTKSEYVMNSIKTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPK 272
Query: 109 GDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
G H+++++GWG +N PYWL NSW+ WG+HGTFKI++G NE IE + +
Sbjct: 273 AKYQGGHSIKIIGWGQQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPS 332
Query: 169 NS 170
+S
Sbjct: 333 SS 334
>gi|294931810|ref|XP_002780018.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239889821|gb|EER11813.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 131
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 43/78 (55%), Positives = 54/78 (69%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
N ++I +GP A FS Y DF YKSGVY+H G +G H+V ++GWG E + YWLV
Sbjct: 30 NIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGTEKGVDYWLVM 89
Query: 135 NSWNDHWGDHGTFKILRG 152
NSWN+ WGDHGTFKI +G
Sbjct: 90 NSWNEGWGDHGTFKIAQG 107
>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 345
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 61/84 (72%), Gaps = 1/84 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
M +++ +GP+ F V+ DF YK+GVY+H +G IG HAV+++GWG +D + YW + N
Sbjct: 245 MAELFTNGPIEVAFDVFEDFAHYKTGVYKHLYGGYIGGHAVKLVGWGTTDDGVDYWSMVN 304
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SWN +WG+ GTF+ILRG++E IE
Sbjct: 305 SWNTNWGEDGTFRILRGKDECGIE 328
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR+ W C ++ I DQ +CGSCWA +++DR CI N + +S ++
Sbjct: 101 LPTEFDARKHWSHCSTIGDILDQGHCGSCWAFGAVESLTDRFCIHLNESVS--LSENDLL 158
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGC 284
AC C GC GG+P AW+++ GVVT Y Q+GC
Sbjct: 159 ACCGFECGDGCEGGYPIRAWQYFKRTGVVTSKCDPYFDQKGC 200
>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 58/84 (69%), Gaps = 2/84 (2%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQ--HNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
R I +HGP++A + V+ DF +Y SGVY + DSIG HAV ++GWGVE++ PYWLV N
Sbjct: 252 RDIMQHGPVLASYEVFEDFGEYDSGVYTCPDDGSDSIGWHAVIIVGWGVEDNTPYWLVQN 311
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SW +G G FKI RG NE +IE
Sbjct: 312 SWGTGFGIDGYFKIARGTNECNIE 335
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 64/140 (45%), Gaps = 28/140 (20%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTG-QISAQHI 245
+P ++++ E + +C I Q +CGSCWA + ++ R+CI S G +++ Q +
Sbjct: 94 IPDSYNSHEAYSKCKP--DILQQGSCGSCWAFATTGVLAQRMCIKSEQIGQGYELAPQAL 151
Query: 246 VACT----------------PNCW---GCNGGWPQLAWRFWGHNGVV--TGGDYNSQEGC 284
V+CT C+ GC+GG+P A+RF G+ Y S++G
Sbjct: 152 VSCTDQICYTKAGDRCSSPSSTCYCSLGCDGGYPDGAFRFMQDEGITPELCVKYVSKDGT 211
Query: 285 QPYTLAPCEHHVQGPLQNCT 304
P + VQ + CT
Sbjct: 212 DPLECS----DVQTMVSECT 227
>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 357
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 5/101 (4%)
Query: 63 HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
HY A+ V P + M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV++
Sbjct: 230 HYSVSAYRVNSDPH-DIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKL 288
Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+GWG +D YWL+AN WN WGD G FKI RG NE IE
Sbjct: 289 IGWGTTDDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIE 329
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 69/135 (51%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+NFDAR W +C ++ I DQ +CGSCWA ++SDR CI + + +S ++
Sbjct: 101 LPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNIS--LSVNDLL 158
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AWR+ H+GVVT E C PY C H P
Sbjct: 159 ACCGFLCGSGCDGGYPLYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 207
Query: 304 TLLGKLKTPECKQNC 318
+TP+C + C
Sbjct: 208 ----AYRTPKCVKKC 218
>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
Length = 334
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 76/133 (57%), Gaps = 3/133 (2%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV- 246
P+ FD+R W C + HI DQ NCGSCW+ S A +DRLC+++ G F +S + +
Sbjct: 86 PQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAF 145
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
C GC GG+P AW+++ GV TGGDY ++EGC PY + PC ++ QG C
Sbjct: 146 CCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPC-YNKQGK-NTCGGQ 203
Query: 307 GKLKTPECKQNCY 319
+ +C + CY
Sbjct: 204 PMERNHQCPKTCY 216
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 58/94 (61%), Gaps = 1/94 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNF-GDSIGLHAVRVLGWGVENDIPYWLVANS 136
R I +GP+ A F VY D YKSG+Y+ G H+++++GWG +N PYWL NS
Sbjct: 241 RDIMTYGPVEASFDVYDDLSAYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAVNS 300
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEANS 170
W+ WG+HGTFKI++G NE IE + ++S
Sbjct: 301 WSKFWGEHGTFKIIKGRNECGIERAVTAGIPSSS 334
>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 356
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 49/100 (49%), Positives = 64/100 (64%), Gaps = 3/100 (3%)
Query: 63 HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY A+ V + M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++
Sbjct: 229 HYSVNAYRVSSDPHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLI 288
Query: 121 GWG-VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG E+ YWL+AN WN WGD G FKI RG NE IE
Sbjct: 289 GWGTTEDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIE 328
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 69/135 (51%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+NFDAR W +C ++ I DQ +CGSCWA ++SDR CI + + +S ++
Sbjct: 100 LPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNIS--LSVNDLL 157
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AW++ H+GVVT E C PY C H P
Sbjct: 158 ACCGFLCGSGCDGGYPLYAWQYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 206
Query: 304 TLLGKLKTPECKQNC 318
+TP+C + C
Sbjct: 207 ----AYRTPKCVKKC 217
>gi|38048307|gb|AAR10056.1| similar to Drosophila melanogaster CG10992, partial [Drosophila
yakuba]
Length = 174
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 41/88 (46%), Positives = 56/88 (63%), Gaps = 1/88 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD+R++WP CP++ I DQ +CGSCWA A+SDR+CI S G SA +V
Sbjct: 87 IPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVV 273
+C C +GCNGG+P AW +W G+V
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIV 174
>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +IG HAV+++GWG N+ YWL+
Sbjct: 246 DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSNEGEDYWLM 305
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G F I RG NE IE
Sbjct: 306 ANQWNRGWGDDGYFMIRRGTNECGIE 331
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 59/103 (57%), Gaps = 11/103 (10%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR WP+C S+ I DQ +CGSCWA ++SDR CI +S ++
Sbjct: 103 LPKAFDARTAWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFG--MNISLSVNDLL 160
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
AC C GC+GG+P AW+++ ++GVVT E C PY
Sbjct: 161 ACCGFRCGDGCDGGYPIAAWQYFSYSGVVT-------EECDPY 196
>gi|294899385|ref|XP_002776615.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239883670|gb|EER08431.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 233
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 62/101 (61%), Gaps = 8/101 (7%)
Query: 187 LPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP +FDAR +P C + HI DQS CGSCWA V A +DRLCI SNG FT +SA +
Sbjct: 117 LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGEM 176
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQP 286
AC P+ +GC+GG+P AW + G+ TG EG +P
Sbjct: 177 NACAPS-YGCDGGYPDSAWSWVHDEGIATG------EGSRP 210
>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
Length = 194
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 55/128 (42%), Positives = 73/128 (57%), Gaps = 3/128 (2%)
Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGV 272
SCWAVS A A+SDR+CIAS G +S Q ++AC C +GC GGWP AW+++ GV
Sbjct: 1 SCWAVSSAAAMSDRVCIASXGAKQVLLSDQDMLACCSWCGYGCEGGWPMKAWQYFXLEGV 60
Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLK 332
VTGG+Y Q C+PY PC H + P KTP+C++ C Y Y+ D
Sbjct: 61 VTGGNYRKQGCCRPYEFPPCGRHGKEPYYG-ECYDSAKTPKCQKTC-QRGYLKPYKEDKH 118
Query: 333 KGKKAHMV 340
GK A+ +
Sbjct: 119 FGKSAYRL 126
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 58/98 (59%), Gaps = 2/98 (2%)
Query: 42 KKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQY 99
K K +K +R YL H+ K A+ +P R I ++GP+VA F VY DF Y
Sbjct: 97 KTPKCQKTCQRGYLKPYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHY 156
Query: 100 KSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
KSG+Y+H G G HAV+++GWG E PYWL+ANSW
Sbjct: 157 KSGIYKHTAGRMTGGHAVKIIGWGKEXGTPYWLIANSW 194
>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
Length = 352
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 62/99 (62%), Gaps = 2/99 (2%)
Query: 63 HYFKKAH-MVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
H+ + M P NA++Q I +GP+ A F VY DFL YKSGVYQH G +G H V+++
Sbjct: 198 HFIDGVYSMNPTVNAIQQEIMTNGPVEACFEVYEDFLGYKSGVYQHTTGKDLGGHCVKMI 257
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG +N+ YW+ NSW +WG+ G F I G NE IE
Sbjct: 258 GWGTQNNELYWICNNSWTTYWGNQGVFWIKAGVNECGIE 296
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 70/147 (47%), Gaps = 17/147 (11%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
+ +P NF++ ++W C + I +Q+ CGSCWA ++SDR CI +S Q
Sbjct: 68 QAVPANFNSAQQWSNCSYISAIQNQARCGSCWAFGAVESVSDRFCIHKGEDVL--LSFQD 125
Query: 245 IVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
+V C + GC GG A +F G+V+ C PYT+ C P Q
Sbjct: 126 LVTCDQSDNGCQGGDAYTAMKFIQKKGIVS-------NDCLPYTIPTC-----APAQQ-P 172
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDL 331
L + TP+C + C N SY TY DL
Sbjct: 173 CLNFVDTPQCVEKCSNASY--TYAQDL 197
>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
Length = 376
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G+ +G HAV+++GWG +N YWL+
Sbjct: 261 DVMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWLL 320
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G FKI RG NE IE
Sbjct: 321 ANQWNRGWGDDGYFKIRRGTNECGIE 346
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 66/154 (42%), Gaps = 37/154 (24%)
Query: 187 LPRNFDAREKWPECPSLRHIADQ-----------------SNCGSCWAVSVANAISDRLC 229
LP+ FDAR WP C ++ I Q +CGSCWA ++SDR C
Sbjct: 101 LPKEFDARTAWPHCSTIGKILGQLLSFYNIFSIFFFLFLEGHCGSCWAFGAVESLSDRFC 160
Query: 230 IASNGYFTGQISAQHIVACTPNCWG--CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
I +S ++AC G C+GG+P AWR++ H+GVVT E C PY
Sbjct: 161 IHFG--MNISLSVNDLLACCGFLCGDGCDGGYPMYAWRYFVHHGVVT-------EECDPY 211
Query: 288 -TLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYN 320
C H P TP+C + C +
Sbjct: 212 FDNIGCSHPGCEP--------GFPTPKCVRKCID 237
>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 58/86 (67%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M +IY++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG D YWL+
Sbjct: 244 SIMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLL 303
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G FKI RG NE IE
Sbjct: 304 ANQWNRGWGDDGYFKIRRGTNECGIE 329
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 52/137 (37%), Positives = 67/137 (48%), Gaps = 20/137 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR WP+C ++ I DQ +CGSCWA ++SDR CI +S ++
Sbjct: 101 LPEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYG--MNISLSVNDLL 158
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR++ H+GVVT E C PY C H P
Sbjct: 159 ACCGFLCGSGCNGGYPISAWRYFVHHGVVT-------EECDPYFDDIGCSHPGCEP---- 207
Query: 304 TLLGKLKTPECKQNCYN 320
TP+C + C N
Sbjct: 208 ----GYPTPKCARKCVN 220
>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
Length = 342
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 54/136 (39%), Positives = 77/136 (56%), Gaps = 19/136 (13%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDARE WP+C S+++I DQ +CGSCWA A++DR CI +N + +S +V
Sbjct: 99 LPKHFDAREAWPQCSSIKNILDQGHCGSCWAFGAVEALTDRFCILNNENVS--LSENDLV 156
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNCT 304
AC +C +GC+GG+P AW ++ GVVT SQ C PY C+H P
Sbjct: 157 ACCSSCGFGCDGGYPYAAWEYFAQTGVVT-----SQ--CDPYFDGKGCKHPGCEP----- 204
Query: 305 LLGKLKTPECKQNCYN 320
+ TP C + C +
Sbjct: 205 ---EYDTPVCVKQCVD 217
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 59/82 (71%), Gaps = 1/82 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANSW 137
+IY++GP+ ++VY DF YKSGVY+H FG+ +G HAV+ +GWG +D YW+VANSW
Sbjct: 244 EIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGEVLGGHAVKFIGWGTTDDGKDYWIVANSW 303
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WG+ G F+I RG NE IE
Sbjct: 304 NRSWGEDGFFQISRGSNECGIE 325
>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
Length = 392
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 63/102 (61%), Gaps = 3/102 (2%)
Query: 61 LSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
+ +F + N ++I +GP A FS+Y DFL Y+SGVY+H G +G H V ++
Sbjct: 237 FTAHFSPYQLKGTDNIKKEIMTNGPTSAAFSMYDDFLSYESGVYKHTSGTLMGEHGVEII 296
Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGE---NEADIE 159
GWG + + YWLV NSWN+ WG HGTFKI +G+ N+ IE
Sbjct: 297 GWGTKQGVDYWLVMNSWNEGWGVHGTFKIAQGDCGINDMAIE 338
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 30/72 (41%)
Query: 252 CWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKT 311
C GC G P AW F G+ T G ++ +GC PY C HH Q T
Sbjct: 156 CDGCTKGRPDAAWSFLNVYGIATEGSMSAADGCWPYNFPKCGHHQQDSKYQPCPEKNYDT 215
Query: 312 PECKQNCYNPSY 323
P C C N +Y
Sbjct: 216 PPCLDRCPNKNY 227
>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 69/109 (63%), Gaps = 4/109 (3%)
Query: 58 SIPLSHYF-KKAHMV-PRCNAM-RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
+IPL Y K A+M+ P R++Y +GP VAI VY D YKSGVY++ G +G+
Sbjct: 218 TIPLIKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGV 277
Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE-MGF 162
AV+V+GWG N PYW VAN+W+ WG G ILRG NE +IE +GF
Sbjct: 278 TAVKVVGWGKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGF 326
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/157 (40%), Positives = 85/157 (54%), Gaps = 15/157 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+ EKWP CP++R IADQS C + WAVS A+AISDR C G +ISA H++
Sbjct: 90 LPESFDSAEKWPNCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK-QLRISAAHLL 148
Query: 247 A-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
+ C GC GG+P AWR++ G+ + CQPY CEH QG C+
Sbjct: 149 SCCKQCGGGCKGGFPGFAWRYYVEYGI-------ASSYCQPYPFPQCEHQGAQGNKTPCS 201
Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
K TP+C C + T +GK A+M+L
Sbjct: 202 NY-KFVTPQCNTTC----TDKTIPLIKYRGKDAYMLL 233
>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
Length = 357
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY++ G IG HAV+++GWG +D YWL+
Sbjct: 244 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLL 303
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G FKI RG NE IE
Sbjct: 304 ANQWNRSWGDDGYFKIRRGTNECGIE 329
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/135 (37%), Positives = 68/135 (50%), Gaps = 22/135 (16%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W C S+R I +CGSCWA ++SDR CI N +SA ++
Sbjct: 103 LPKEFDARTAWSHCTSIRRIL--GHCGSCWAFGAVESLSDRFCIKYN--LNVSLSANDVI 158
Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC +GCNGG+P AW ++ ++GVVT QE C PY C H P
Sbjct: 159 ACCGLLCGFGCNGGFPMGAWLYFKYHGVVT------QE-CDPYFDNTGCSHPGCEP---- 207
Query: 304 TLLGKLKTPECKQNC 318
TP+C++ C
Sbjct: 208 ----TYPTPKCERKC 218
>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
Length = 347
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/88 (51%), Positives = 61/88 (69%), Gaps = 1/88 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG + YWL+
Sbjct: 236 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLL 295
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMG 161
AN WN WGD G FKI+RG+NE IE G
Sbjct: 296 ANQWNRGWGDDGYFKIIRGKNECGIEEG 323
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W C ++ +I +Q +CGSCWA + DR CI N + +S ++
Sbjct: 93 LPKEFDARSAWSRCSTIGNILEQGHCGSCWAFGAVECLQDRFCIHLN--MSILLSVNDLL 150
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AWR++ NGVVT + C PY C+H P
Sbjct: 151 ACCGFMCGDGCDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKHPGCEP---- 199
Query: 304 TLLGKLKTPECKQNC 318
TP+C++ C
Sbjct: 200 ----AYPTPKCEKKC 210
>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
Length = 347
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/88 (51%), Positives = 61/88 (69%), Gaps = 1/88 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG + YWL+
Sbjct: 236 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLL 295
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMG 161
AN WN WGD G FKI+RG+NE IE G
Sbjct: 296 ANQWNRGWGDDGYFKIIRGKNECGIEEG 323
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 67/135 (49%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W C ++ +I DQ +CGSCWA + DR CI N + +S ++
Sbjct: 93 LPKEFDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHLN--MSILLSVNDLL 150
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AWR++ NGVVT + C PY C+H P
Sbjct: 151 ACCGFMCGDGCDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKHPGCEP---- 199
Query: 304 TLLGKLKTPECKQNC 318
TP+C++ C
Sbjct: 200 ----AYPTPKCEKKC 210
>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 57/84 (67%), Gaps = 1/84 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
M ++ +GP+ F+VY DF YKSGVY+H GD++G HAV+++GWG D YWL+AN
Sbjct: 228 MAEVSSNGPVEVAFTVYEDFAHYKSGVYKHITGDAMGGHAVKLIGWGTSEDGEDYWLLAN 287
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
WN WGD G FKI RG NE IE
Sbjct: 288 QWNRGWGDDGYFKIKRGTNECGIE 311
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 67/139 (48%), Gaps = 24/139 (17%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR WP C ++ I DQ +CGSCWA ++SDR CI + +S ++
Sbjct: 83 LPIEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGMNLS--LSVNDLL 140
Query: 247 ACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQ 301
AC W GC+GG P AWR++ +GVVT E C PY C H P
Sbjct: 141 ACCG--WMCGAGCDGGSPIDAWRYFVQSGVVT-------EECDPYFDDIGCSHPGCEP-- 189
Query: 302 NCTLLGKLKTPECKQNCYN 320
TP+C++ C +
Sbjct: 190 ------GFPTPKCERKCAD 202
>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
Length = 348
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG + YWL+
Sbjct: 237 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGGVMGGHAVKLIGWGTSDAGEDYWLL 296
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G FKI+RG+NE IE
Sbjct: 297 ANQWNRGWGDDGYFKIIRGKNECGIE 322
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 68/135 (50%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR KW C ++ I DQ +CGSCWA + DR CI N +SA +V
Sbjct: 94 LPKQFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHQN--INISLSANDLV 151
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AW+++ +GVVT E C PY C+H P +
Sbjct: 152 ACCGFMCGDGCDGGYPIKAWQYFVQSGVVT-------EECDPYFDQVGCKHPGCEPAYD- 203
Query: 304 TLLGKLKTPECKQNC 318
TP+C++ C
Sbjct: 204 -------TPKCEKKC 211
>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
Length = 208
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/88 (51%), Positives = 61/88 (69%), Gaps = 1/88 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG + YWL+
Sbjct: 97 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLL 156
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMG 161
AN WN WGD G FKI+RG+NE IE G
Sbjct: 157 ANQWNRGWGDDGYFKIIRGKNECGIEEG 184
>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 351
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/127 (41%), Positives = 72/127 (56%), Gaps = 17/127 (13%)
Query: 67 KAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN 126
+ H P + M ++Y +GP+ F+VY DF YKSGVY+H G +G HAV+++GWG +
Sbjct: 233 RVHSNPH-DIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD 291
Query: 127 -DIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAK 185
YWL+AN WN WGD G FKI+RG+NE IE ED G + K
Sbjct: 292 AGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIE------------ED---VVAGMPSTK 336
Query: 186 GLPRNFD 192
+ RN+D
Sbjct: 337 NMARNYD 343
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 65/135 (48%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDAR +W C ++ I DQ +CGSCWA + DR CI N +S ++
Sbjct: 97 LPTEFDARSQWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHLN--MNISLSVNDLL 154
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR++ GVVT + C PY C+H P
Sbjct: 155 ACCGFLCGSGCNGGYPISAWRYFRRKGVVT-------DECDPYFDQVGCKHPGCEP---- 203
Query: 304 TLLGKLKTPECKQNC 318
+TP+C++ C
Sbjct: 204 ----AYRTPKCEKKC 214
>gi|255076333|ref|XP_002501841.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226517105|gb|ACO63099.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 359
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 67/113 (59%), Gaps = 2/113 (1%)
Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP NFDAR+KWP+C ++ + DQ CGSCWAV+ A ++DRLCIAS G ++S Q+
Sbjct: 105 LPLNFDARQKWPQCRAIIGTVRDQGKCGSCWAVATAEVMNDRLCIASGGAEQRELSPQYP 164
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYN-SQEGCQPYTLAPCEHHVQ 297
++C GC GG +A G+V GG N S+ C PY PCEH Q
Sbjct: 165 LSCYDGGSGCQGGDVAVAMHEATTKGMVFGGMLNRSKTACLPYEFEPCEHPCQ 217
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 54/92 (58%), Gaps = 10/92 (10%)
Query: 78 RQIYEHGPLVAIF-SVYADFLQYKSGVYQHNFGD----SIGLHAVRVLGWGVENDI--PY 130
++I +GP+ F +V++DF Y +GVY D +G+HA +++GWG + PY
Sbjct: 266 QEIMTYGPVAVTFGTVHSDFYGYHAGVYTVREEDKNEEGLGMHATKLIGWGFDEATGHPY 325
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
WL+ NSW D+WG HG ++ G E ++E G
Sbjct: 326 WLMMNSW-DNWGIHGLGRV--GVGEMNMEQGI 354
>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 403
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/120 (42%), Positives = 70/120 (58%), Gaps = 16/120 (13%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG + YWL+
Sbjct: 290 DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLL 349
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDA 193
AN WN WGD G FKI+RG NE IE ED G + K + RN+D+
Sbjct: 350 ANQWNRGWGDDGYFKIIRGTNECGIE------------ED---VVAGMPSTKNMVRNYDS 394
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 57/103 (55%), Gaps = 6/103 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W +C ++ I DQ +CGSCWA + DR CI N +S +V
Sbjct: 147 LPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQDRFCIHFN--MNISLSVNDLV 204
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
AC GC+GG+P +AWR++ NGVVT Y Q GC+
Sbjct: 205 ACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK 247
>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
Length = 358
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/120 (42%), Positives = 70/120 (58%), Gaps = 16/120 (13%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG + YWL+
Sbjct: 245 DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLL 304
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDA 193
AN WN WGD G FKI+RG NE IE ED G + K + RN+D+
Sbjct: 305 ANQWNRGWGDDGYFKIIRGTNECGIE------------ED---VVAGMPSTKNMVRNYDS 349
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 57/103 (55%), Gaps = 6/103 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR W +C ++ I DQ +CGSCWA + DR CI N +S +V
Sbjct: 102 LPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQDRFCIHFN--MNISLSVNDLV 159
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
AC GC+GG+P +AWR++ NGVVT Y Q GC+
Sbjct: 160 ACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK 202
>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
thaliana]
Length = 183
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY++ G IG HAV+++GWG +D YWL+
Sbjct: 70 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLL 129
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G FKI RG NE IE
Sbjct: 130 ANQWNRSWGDDGYFKIRRGTNECGIE 155
>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
Length = 362
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +IG HAV+++GWG ++ YWL+
Sbjct: 249 DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEGEDYWLL 308
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G F I RG NE IE
Sbjct: 309 ANQWNRSWGDDGYFMIRRGTNECGIE 334
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 71/137 (51%), Gaps = 20/137 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR WP+C S+ +I DQ +CGSCWA ++SDR CI + +S ++
Sbjct: 106 LPKEFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIEFGMNIS--LSVNDLL 163
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC C GC+GG+P AW+++ ++GVVT E C PY C H P
Sbjct: 164 ACCGFRCGDGCDGGYPIAAWQYFSYSGVVT-------EECDPYFDDTGCSHPGCEP---- 212
Query: 304 TLLGKLKTPECKQNCYN 320
TP+C + C +
Sbjct: 213 ----AYPTPKCMRKCVS 225
>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG + YWL+
Sbjct: 239 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLL 298
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G FKI+RG+NE IE
Sbjct: 299 ANQWNRGWGDDGYFKIIRGKNECGIE 324
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/135 (37%), Positives = 67/135 (49%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR KW C ++ I DQ +CGSCWA + DR CI N + +SA +V
Sbjct: 96 LPKEFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHHNMNIS--LSANDLV 153
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AW+++ NGVVT E C PY C+H P
Sbjct: 154 ACCGFMCGDGCDGGYPISAWQYFVQNGVVT-------EECDPYFDQVGCKHPGCEP---- 202
Query: 304 TLLGKLKTPECKQNC 318
TP C++ C
Sbjct: 203 ----AYPTPVCEKKC 213
>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 379
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY++ G IG HAV+++GWG +D YWL+
Sbjct: 266 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLL 325
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G FKI RG NE IE
Sbjct: 326 ANQWNRSWGDDGYFKIRRGTNECGIE 351
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 68/155 (43%), Gaps = 40/155 (25%)
Query: 187 LPRNFDAREKWPECPSLRHIAD--------------------QSNCGSCWAVSVANAISD 226
LP+ FDAR W C S+R I +CGSCWA ++SD
Sbjct: 103 LPKEFDARTAWSHCTSIRRILVGYILNNVLLWSTITLWFWFLLGHCGSCWAFGAVESLSD 162
Query: 227 RLCIASNGYFTGQISAQHIVACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGC 284
R CI N +SA ++AC +GCNGG+P AW ++ ++GVVT QE C
Sbjct: 163 RFCIKYN--LNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVT------QE-C 213
Query: 285 QPY-TLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
PY C H P TP+C++ C
Sbjct: 214 DPYFDNTGCSHPGCEP--------TYPTPKCERKC 240
>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
Length = 331
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 58/82 (70%), Gaps = 1/82 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANSW 137
+IY++GP+ ++VY DF YKSGVY+H FG +G HAV+ +GWG +D YW+VANSW
Sbjct: 233 EIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGQVLGGHAVKFIGWGTTDDGKDYWIVANSW 292
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N WG+ G F+I RG NE IE
Sbjct: 293 NRSWGEDGFFQISRGSNECGIE 314
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/136 (39%), Positives = 75/136 (55%), Gaps = 19/136 (13%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDARE WP+C S++ I DQ +CGSCWA A++DR CI +N + +S +V
Sbjct: 88 LPKHFDAREAWPQCASIKTILDQGHCGSCWAFGAVEALTDRFCILNNENVS--LSENDLV 145
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNCT 304
AC +C +GC GG+P AW ++ GVVT SQ C PY C+H P
Sbjct: 146 ACCSSCGFGCEGGYPYAAWEYFAQTGVVT-----SQ--CDPYFDGKGCKHPGCEP----- 193
Query: 305 LLGKLKTPECKQNCYN 320
+ TP C + C +
Sbjct: 194 ---EYDTPVCVKQCVD 206
>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 63/94 (67%), Gaps = 2/94 (2%)
Query: 67 KAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN 126
+ H P + M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG +
Sbjct: 237 RVHSNPH-DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD 295
Query: 127 -DIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
YWL+AN WN WGD G FKI+RG+NE IE
Sbjct: 296 AGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIE 329
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 69/135 (51%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR +W C ++ +I DQ +CG+CWA + ++ DR CI N + +S ++
Sbjct: 101 LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESLQDRFCIHLN--MSVSLSVNDLL 158
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR++ +GVVT E C PY C+H P
Sbjct: 159 ACCGFLCGSGCNGGYPISAWRYFRRSGVVT-------EECDPYFDQTGCQHPGCEP---- 207
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 208 ----AYPTPKCHRKC 218
>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 64/110 (58%), Gaps = 8/110 (7%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y +GP F+VY DF YKSGVY+H G +G HAV+++GWG D YWL+
Sbjct: 239 SIMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLL 298
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE-------MGFNNRVEANSSEDDDL 176
AN WN WG G FKI+RG NE IE N +E+ +DD L
Sbjct: 299 ANQWNRSWGGDGYFKIIRGTNECGIEDVTAGTPSTKNLDIESGVRDDDSL 348
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 57/103 (55%), Gaps = 6/103 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR WP+C S+ I DQ +CGSCWA +++DR CI T +S ++
Sbjct: 96 LPKTFDARTAWPQCLSIADILDQGHCGSCWAFGAVESLTDRFCIHYGTNVT--LSVNDLL 153
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
AC GC+GG+P AW+++ GVVT Y Q GC
Sbjct: 154 ACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCS 196
>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
Length = 142
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/120 (42%), Positives = 70/120 (58%), Gaps = 16/120 (13%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG + YWL+
Sbjct: 29 DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLL 88
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDA 193
AN WN WGD G FKI+RG NE IE ED G + K + RN+D+
Sbjct: 89 ANQWNRGWGDDGYFKIIRGTNECGIE------------ED---VVAGMPSTKNMVRNYDS 133
>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
Length = 357
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/86 (50%), Positives = 58/86 (67%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG ++ YWL+
Sbjct: 244 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLI 303
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G F I RG NE IE
Sbjct: 304 ANQWNRSWGDDGYFMIRRGTNECGIE 329
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 68/135 (50%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDAR W +C ++ I DQ +CGSCWA ++SDR CI + +S ++
Sbjct: 101 LPKSFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHLD--VNVSLSVNDLL 158
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AWR+ H+GVVT E C PY C H P
Sbjct: 159 ACCGFLCGSGCDGGYPLYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 207
Query: 304 TLLGKLKTPECKQNC 318
+TP+C + C
Sbjct: 208 ----AYQTPKCVRKC 218
>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
Length = 305
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
+ M ++Y +GP+ F+VY DF YKSGVY+H G +G HAV+++GWG + YWL+
Sbjct: 200 DIMAEVYNNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLL 259
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G FKI+RG+NE IE
Sbjct: 260 ANQWNRGWGDDGYFKIIRGKNECGIE 285
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 58/103 (56%), Gaps = 6/103 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR KW C ++ I DQ +CGSCWA + DR CI N T +SA +V
Sbjct: 57 LPKVFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHHNMNIT--LSANDLV 114
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
AC GC+GG+P AW+++ NGVVT Y Q GC+
Sbjct: 115 ACCGFMCGDGCDGGYPISAWQYFVQNGVVTDECDPYFDQVGCK 157
>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 306
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 58/99 (58%), Gaps = 2/99 (2%)
Query: 66 KKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
K A+ V A Q I +GP+ A FSVY DF Y SGVY H G G HAV+++GWG
Sbjct: 201 KTAYQVANNMAAIQSEILANGPVEAAFSVYDDFFSYTSGVYSHQSGALDGGHAVKIVGWG 260
Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
V+ PYW+VANSW WG G F I RG +E IE G
Sbjct: 261 VDGTTPYWIVANSWGTSWGQAGFFWIKRGNDECGIEDGI 299
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 68/133 (51%), Gaps = 16/133 (12%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR +WP S+ I DQ CGSCWA A+SDRL IASN +S Q +V
Sbjct: 81 IPTSFDARTQWPA--SIHPIRDQQQCGSCWAFGATEALSDRLAIASNNSINVVLSPQDLV 138
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
+C +GC+GG+P AW + GVVT + C PYT G C +
Sbjct: 139 SCDSTDYGCDGGYPINAWHYMQSLGVVT-------DTCYPYTSG------NGDSGTCQIT 185
Query: 307 GKLKTPECKQNCY 319
GK KTP C +
Sbjct: 186 GK-KTPACATATF 197
>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
Length = 429
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 63/102 (61%), Gaps = 7/102 (6%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQ---HNFGDSIGLHAVRVLGWGVENDIPYW 131
+ M I E GP+ A+ +VY DF Y+ GVY+ H + G H+VR++GWG + YW
Sbjct: 326 DIMYDIMESGPVQAVMTVYQDFFHYRDGVYRRSYHGNNELKGFHSVRIIGWGEDRGDRYW 385
Query: 132 LVANSWNDHWGDHGTFKILRGENEADIE----MGFNNRVEAN 169
+VANSW WG++G F+I RG NEADIE G ++ EAN
Sbjct: 386 VVANSWGRQWGENGYFRIARGSNEADIESFVVTGLSDVTEAN 427
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/129 (37%), Positives = 63/129 (48%), Gaps = 14/129 (10%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P FDAR +WP + I DQ CGS WAVS+A SDR I SNG +S Q +++
Sbjct: 191 PTQFDARTRWPG--FISPIVDQGWCGSDWAVSLAGVASDRFAIQSNGAENMVLSPQTLLS 248
Query: 248 CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY--TLAPCEHHVQGPL--QN 302
C GC+GG +AW F +G+V E C PY ++ C +G L
Sbjct: 249 CNVRAQQGCHGGHIDVAWNFARGHGLV-------DEKCFPYKASVTRCPFRPRGNLIQDG 301
Query: 303 CTLLGKLKT 311
C L K +T
Sbjct: 302 CMPLVKRRT 310
>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
Length = 358
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/86 (50%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
M ++Y++GP+ F+VY DF Y+SGVY++ GD +G HAV+++GWG +D YW++AN
Sbjct: 246 MAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILAN 305
Query: 136 SWNDHWGDHGTFKILRGENEADIEMG 161
WN +WGD G F I RG NE IE G
Sbjct: 306 QWNRNWGDDGYFMIRRGVNECGIEEG 331
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 51/135 (37%), Positives = 69/135 (51%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDAR WP+C ++ I DQ +CGSCWA ++SDR CI +S ++
Sbjct: 101 LPKHFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHFG--MNISLSVNDLL 158
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNC 303
AC GC+GG+P AWR++ H+GVVT E C PY A C H P
Sbjct: 159 ACCGFLCGSGCDGGYPLYAWRYFIHHGVVT-------EECDPYFDATGCSHPGCEP---- 207
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 208 ----GYPTPKCVRKC 218
>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 64/103 (62%), Gaps = 3/103 (2%)
Query: 63 HYFKKAHMVPRCNA---MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
++ K A+ +P N I +GP+ A F+++ DF Y+SG+Y H G +G HA+++
Sbjct: 202 YHAKSAYKLPAKNVEAIQTDIMNNGPVEADFTIFQDFYAYRSGIYVHATGKQLGGHAIKI 261
Query: 120 LGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
LGWG E+++ YWL ANSW +WG G FKI RG +E IE G
Sbjct: 262 LGWGTEDNVDYWLCANSWGANWGIQGYFKIRRGTDECGIEDGL 304
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 54/91 (59%), Gaps = 2/91 (2%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
A+ +P NFDAR++WP S+ I +Q CGSCWA + +SDR IAS +SAQ
Sbjct: 80 AENIPENFDARKQWPG--SIHPIRNQGQCGSCWAFGASEVLSDRFAIASKNQIYVTLSAQ 137
Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
+V C + GC+GGWP AW + G++T
Sbjct: 138 QLVDCDLDNSGCSGGWPINAWNYMVKTGLLT 168
>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +IG HAV+++GWG ++ YWL+
Sbjct: 246 DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLM 305
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G F I RG NE IE
Sbjct: 306 ANQWNRGWGDDGYFMIRRGTNECGIE 331
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 69/135 (51%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR WP+C S+ +I DQ +CGSCWA ++SDR CI +S ++
Sbjct: 103 LPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFG--MNISLSVNDLL 160
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC C GC+GG+P AW+++ ++GVVT E C PY C H P
Sbjct: 161 ACCGFRCGDGCDGGYPIAAWQYFSYSGVVT-------EECDPYFDNTGCSHPGCEP---- 209
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 210 ----AYPTPKCSRKC 220
>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112, E=1.3e-79, N=1) [Arabidopsis thaliana]
gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +IG HAV+++GWG ++ YWL+
Sbjct: 246 DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLM 305
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G F I RG NE IE
Sbjct: 306 ANQWNRGWGDDGYFMIRRGTNECGIE 331
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 67/135 (49%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR WP+C S+ +I +CGSCWA ++SDR CI +S ++
Sbjct: 103 LPKAFDARTAWPQCTSIGNILGLGHCGSCWAFGAVESLSDRFCIQFG--MNISLSVNDLL 160
Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC C GC+GG+P AW+++ ++GVVT E C PY C H P
Sbjct: 161 ACCGFRCGDGCDGGYPIAAWQYFSYSGVVT-------EECDPYFDNTGCSHPGCEP---- 209
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 210 ----AYPTPKCSRKC 220
>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
Length = 236
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 59/85 (69%), Gaps = 2/85 (2%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI--PYWLVA 134
M + ++GP+ A FSVY DF+ YKSGVY H G +G HA++++GWGV++ PYW++A
Sbjct: 142 MEDMQQNGPVQAAFSVYRDFMSYKSGVYHHVSGSLLGGHAIKMVGWGVDSATNKPYWIIA 201
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSW WG +G F ILRG +E IE
Sbjct: 202 NSWGPSWGLNGFFWILRGSDECGIE 226
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/98 (40%), Positives = 55/98 (56%), Gaps = 9/98 (9%)
Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
FD+R KWP C + I +Q CGSCWA S + +SDR CIAS G +S Q++V+C
Sbjct: 17 FDSRTKWPHC--VHPIRNQEQCGSCWAFSASEVLSDRFCIASGGKVDVVLSPQYMVSCDS 74
Query: 251 NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+GC+GG+ AW F G+ + + C PYT
Sbjct: 75 TDYGCDGGYLNNAWAFLAGTGIPS-------DKCAPYT 105
>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
Length = 432
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 58/89 (65%), Gaps = 4/89 (4%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVE-NDIPY 130
+ M +IY GP+ A +VY DF Y SGVYQH N G + G H+V+++GWG E N + Y
Sbjct: 325 DIMAEIYHSGPVQATMTVYRDFFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKY 384
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
W+ ANSW WG+ G F+ILRG NE IE
Sbjct: 385 WIAANSWGPWWGERGYFRILRGSNECGIE 413
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 52/92 (56%), Gaps = 2/92 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LPR+F+A EKW + + DQ CG+ W +S + SDR I S G Q+SAQ+I+
Sbjct: 187 LPRSFNAVEKWST--FISEVPDQGWCGASWVLSTTSVASDRFAIQSQGKEVVQLSAQNIL 244
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
+CT GC+GG AWR+ NGV+ Y
Sbjct: 245 SCTRRQQGCDGGHLDAAWRYMHKNGVLDANCY 276
>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 43/86 (50%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
M ++Y++GP+ F+VY DF Y+SGVY++ GD +G HAV+++GWG +D YW++AN
Sbjct: 280 MAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILAN 339
Query: 136 SWNDHWGDHGTFKILRGENEADIEMG 161
WN +WGD G F I RG NE IE G
Sbjct: 340 QWNRNWGDDGYFMIRRGVNECGIEEG 365
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 70/171 (40%), Gaps = 56/171 (32%)
Query: 187 LPRNFDAREKWPECPSL------------------------------------RHIADQS 210
LP++FDAR WP+C ++ +I DQ
Sbjct: 99 LPKHFDARTAWPQCSTIGKILGRLLDSFSSYFDDFFCFGCTDALYFSYHLLVPFYIKDQG 158
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFWG 268
+CGSCWA ++SDR CI +S ++AC GC+GG+P AWR++
Sbjct: 159 HCGSCWAFGAVESLSDRFCIHFG--MNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYFI 216
Query: 269 HNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNCTLLGKLKTPECKQNC 318
H+GVVT E C PY A C H P TP+C + C
Sbjct: 217 HHGVVT-------EECDPYFDATGCSHPGCEP--------GYPTPKCVRKC 252
>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
Length = 234
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG + YWL+
Sbjct: 121 DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLL 180
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G FKI+RG NE IE
Sbjct: 181 ANQWNRGWGDDGYFKIIRGTNECGIE 206
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 51/112 (45%), Gaps = 20/112 (17%)
Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWG--CNGGWPQLAWRFW 267
+CGSCWA + DR CI N +S +VAC G C+GG+P +AWR++
Sbjct: 1 GHCGSCWAFGAVECLQDRFCIHFN--MNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYF 58
Query: 268 GHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
NGVVT + C PY C+H P TP C++ C
Sbjct: 59 VRNGVVT-------DECDPYFDQVGCKHPGCEP--------AYPTPVCEKKC 95
>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 58/86 (67%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++ +GP+ F+VY DF YKSGVY+H GD +G HAV+++GWG +D YWL+
Sbjct: 212 SIMAEVSMNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWLL 271
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G FKI RG NE IE
Sbjct: 272 ANQWNRGWGDDGYFKIRRGTNECGIE 297
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 53/113 (46%), Gaps = 24/113 (21%)
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW----GCNGGWPQLAWRF 266
+CGSCWA ++SDR CI + +S ++AC W GC+GG+P AWR+
Sbjct: 93 HCGSCWAFGAVESLSDRFCIHYGMNLS--LSVNDLLACCG--WMCGDGCDGGYPIDAWRY 148
Query: 267 WGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
+ +GVVT E C PY C H P TP+C++ C
Sbjct: 149 FVQSGVVT-------EECDPYFDDIGCSHPGCEP--------GFPTPKCERKC 186
>gi|308163070|gb|EFO65432.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 97
Score = 95.5 bits (236), Expect = 3e-17, Method: Composition-based stats.
Identities = 41/86 (47%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND---IPYWLV 133
M+ + GP+ A+ SVY DFL Y+ GVY+H +G I HAV ++G+G +D +PYW+V
Sbjct: 1 MQALANDGPVQAVMSVYRDFLYYRGGVYRHVYGVQISSHAVEIIGYGTTDDEDRVPYWIV 60
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
NS +WG+ G F I+RG NE DIE
Sbjct: 61 KNSLGPNWGEDGYFNIVRGSNECDIE 86
>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
Length = 576
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 61/96 (63%), Gaps = 14/96 (14%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQH---------NFGDSIGLHAVRVLGWGVEND 127
M +I +GP+ A F V+ DF YKSGVYQH + S G H+VR+LGWGV++
Sbjct: 448 MTEIMANGPVQATFLVHEDFFMYKSGVYQHLPYANDKGPAYARS-GYHSVRILGWGVDHS 506
Query: 128 ----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
I YWL ANSW + WG++G F+ILRGEN DIE
Sbjct: 507 TGVPIKYWLCANSWGEEWGENGLFRILRGENHCDIE 542
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 57/110 (51%), Gaps = 5/110 (4%)
Query: 158 IEMGFNNRVEANSSEDD--DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSC 215
++ GF+ R+ E ++ + + + LP +FDARE+WP + + DQ +C S
Sbjct: 280 LDEGFSYRLGTLLPEKSVKNMNEILIEMSNFLPESFDARERWPS--FIHPVRDQGDCASS 337
Query: 216 WAVSVANAISDRLCIASNGYFTGQISAQHIVACT-PNCWGCNGGWPQLAW 264
WA S +DRL I S G F +S Q +++C GCNGG+ AW
Sbjct: 338 WAFSTTAVSADRLAIQSGGKFYNPLSVQQLLSCNQARQRGCNGGYLDRAW 387
>gi|328702238|ref|XP_001943280.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like [Acyrthosiphon pisum]
Length = 328
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 86/162 (53%), Gaps = 19/162 (11%)
Query: 165 RVEANSSEDDDLETM--GCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVAN 222
RVE + + +T+ G + + FDAR++WP+C ++ ++ N WA + A
Sbjct: 60 RVETTTKSKELNKTLDSGVVKDNRIHKEFDARKRWPQCKTIGEFRNEGNFALSWAYAAAG 119
Query: 223 AISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-----AWRFWGHNGVVTGGD 277
++DR+CIA+NG + IS + +++C+ G +GG+ + W + +G+V+GG
Sbjct: 120 VLADRMCIATNGSYNQLISTEELISCS----GVSGGYHGIVSEREVWEYLKSHGLVSGGK 175
Query: 278 YNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY 319
YN+ +GCQP + P E +++ ++K C +CY
Sbjct: 176 YNTSDGCQPSKIPPIEEYME--------YSEIKNYTCNDHCY 209
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 117 VRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILR 151
V+++GWGVEN YWL+ +SW G +G FK+ R
Sbjct: 274 VKLIGWGVENGEDYWLLVDSWGYERGQNGVFKVER 308
>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
Length = 313
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 55/83 (66%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M I+ +GP+ A+F Y D + Y GVY+H G G HAV+++GWGVE+ YWLVANS
Sbjct: 218 MEDIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVKLIGWGVEDGTKYWLVANS 277
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W WGD G FK++RGEN IE
Sbjct: 278 WGRVWGDDGFFKMVRGENHCGIE 300
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 77/158 (48%), Gaps = 13/158 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP++FDAR++WP+C SL I Q CGSC VS A+A++DR CI S G A ++
Sbjct: 62 LPKSFDARQQWPQCSSLNEIRTQGCCGSCAYVSGASAMTDRWCIHSKGKKQFTFGAFDLL 121
Query: 247 ACTPNCWGCNGGWPQLA--WRFWGHNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNC 303
+C C G G W +W GV +GG Y S +GC PY + P C +G +
Sbjct: 122 SCCYECGGGCTGGGIPGPIWSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPSEGDYPD- 180
Query: 304 TLLGKLKTPECKQNCYNPSYESTYRF-DLKKGKKAHMV 340
P C C N Y T D + G+ A+ +
Sbjct: 181 -------EPNCSTRC-NAGYNVTEDLRDRRFGRVAYSI 210
>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 5/101 (4%)
Query: 63 HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
HY A+ V P+ + M ++Y++GP+ F+VY DF YKSGVY+H G ++G HAV++
Sbjct: 229 HYGVNAYRVSHDPQ-SIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVKL 287
Query: 120 LGWGV-ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+GWG E YWL+ NSWN WG+ G FKI RG NE IE
Sbjct: 288 IGWGTSEQGEDYWLIVNSWNRGWGEDGYFKIRRGTNECGIE 328
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 79/163 (48%), Gaps = 23/163 (14%)
Query: 162 FNNRVEANSSEDDDLETMGCQN---AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAV 218
F + + + DLE + K LP+ FDAR+ WP+C ++ I DQ +CGSCWA
Sbjct: 72 FKRLLGVKPAREGDLEGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILDQGHCGSCWAF 131
Query: 219 SVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFWGHNGVVTGG 276
++SDR CI N + +S ++AC GC+GG+P AWR++ +GVVT
Sbjct: 132 GAVESLSDRFCIHYN--LSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVT-- 187
Query: 277 DYNSQEGCQPY-TLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
E C PY C H PL TP+C + C
Sbjct: 188 -----EECDPYFDTTGCSHPGCEPLY--------PTPKCHRKC 217
>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon pisum]
Length = 169
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 57/83 (68%), Gaps = 1/83 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
+ + +GP+ A F VY DF YKSGVYQ + +G HAV+++GWGVE IPYWL+ NS
Sbjct: 76 KDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPYWLMVNS 135
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W+ WGD+G FKI RG +E I+
Sbjct: 136 WSAQWGDNGLFKIRRGTDECGID 158
>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
Length = 283
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 55/81 (67%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
++Y +GP+ + VY DF Y G+Y+H G+ +G HAV ++GWG+E+ + YWLV NSW
Sbjct: 192 ELYNNGPIQVTYVVYEDFFYYSKGIYKHLSGNKVGGHAVVLMGWGIEDGVKYWLVQNSWG 251
Query: 139 DHWGDHGTFKILRGENEADIE 159
WG+ G F+ILRG NE IE
Sbjct: 252 YEWGEQGYFRILRGSNECGIE 272
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 78/154 (50%), Gaps = 20/154 (12%)
Query: 176 LETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGY 235
+E +++ +P +FDAR++WP ++ + DQ CGSCWA S+A ++ DR I G
Sbjct: 53 VEKFTIEDSFYVPESFDARDEWPN--AILPVRDQEKCGSCWAFSIAESLGDRFGILGCG- 109
Query: 236 FTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-----TLA 290
G +S Q +++C N GCNGG+ + +W + G+ T E C PY +
Sbjct: 110 -KGHLSPQDLISCDSNDLGCNGGYQENSWTWVLTTGITT-------ESCWPYRSGSGRIP 161
Query: 291 PCEHH-VQGP-LQNCTL--LGKLKTPECKQNCYN 320
C H V G LQ T+ +L + E + YN
Sbjct: 162 SCPHRCVNGSVLQRNTINNYRRLDSSELQDELYN 195
>gi|300121248|emb|CBK21629.2| unnamed protein product [Blastocystis hominis]
Length = 559
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 75/262 (28%), Positives = 109/262 (41%), Gaps = 41/262 (15%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
N M++IY GP+ +V DF+ YK G+Y+ G +HA+ V+GWG EN YW+
Sbjct: 184 NMMKEIYARGPITCGIAVPQDFVDYKGGIYKDESGAVEKVHAISVVGWGEENGEKYWIGR 243
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRV----EANSSEDDDLETM------GCQN- 183
NSW ++WG+ G F+I RG N IE V EA S + + GC +
Sbjct: 244 NSWGNYWGEEGWFRIARGINNLAIESECQWAVPKVPEARKSREFRRRELLLHVREGCVDK 303
Query: 184 ---------AKGLPRNFDAREKWPECPSLRHIADQSN-------------CGSCWAVSVA 221
LP + P +R++ D N CGSCWA
Sbjct: 304 SRAVNKEHVVSPLPHTYLKANDLPASYDIRNV-DGVNYATWNRNQHIPVWCGSCWAQGST 362
Query: 222 NAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ 281
A+SDR+ I G + A +V + C+GGW + + H +
Sbjct: 363 AALSDRINIMRKGAWPAVNLAVQVVLNCGDAGSCHGGWDDGVYAY-AHEVDI------PD 415
Query: 282 EGCQPYTLAPCEHHVQGPLQNC 303
+ CQPY E + +NC
Sbjct: 416 QTCQPYEAVDHECSPENICRNC 437
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 42/71 (59%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I+ GP+ +V FL Y GVY+ + + H V + GWGVEN PYW+ NSW
Sbjct: 468 EIFARGPVSCSMTVRESFLDYHGGVYESDSSPMVAGHIVEIAGWGVENGRPYWIGRNSWG 527
Query: 139 DHWGDHGTFKI 149
++WG+ G F+I
Sbjct: 528 EYWGEEGWFRI 538
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 49/108 (45%), Gaps = 18/108 (16%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSN------CGSCWAVSVANAISDRLCIASNGYFTGQ- 239
LP+N+D R L ++ N CGSCWA S +A+SDRL + + G +
Sbjct: 44 LPKNYDPRN----INGLNMVSVNKNQHIPVWCGSCWAFSATSAVSDRLKLMTKGAWPEHD 99
Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+S Q ++ C N GC GG P +R GV EGC Y
Sbjct: 100 LSVQVVINCADNAEGCGGGHPTDVYRLMNEMGV-------PAEGCMRY 140
>gi|294916338|ref|XP_002778359.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239886683|gb|EER10154.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 105
Score = 94.7 bits (234), Expect = 6e-17, Method: Composition-based stats.
Identities = 41/78 (52%), Positives = 53/78 (67%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+A I GP+ A F+VY DFL Y+SGVY+H G +G HAV+++GWG ++ YWL
Sbjct: 13 DAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAV 72
Query: 135 NSWNDHWGDHGTFKILRG 152
NSWN+ WGDHG FKI G
Sbjct: 73 NSWNEDWGDHGLFKIALG 90
>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
Length = 349
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 43/81 (53%), Positives = 60/81 (74%), Gaps = 2/81 (2%)
Query: 80 IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVANSWN 138
I +GP+++ F VY DF Y+SG Y+H G +G HA++V+GWGV ++++PYW+VANSW+
Sbjct: 260 ILANGPVISGFKVYRDFYNYRSG-YKHVAGGLVGGHAIKVVGWGVTQSNVPYWIVANSWS 318
Query: 139 DHWGDHGTFKILRGENEADIE 159
D WG +G F ILRG NE IE
Sbjct: 319 DEWGMNGYFWILRGTNECSIE 339
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 9/104 (8%)
Query: 186 GLPRNFDAR--EKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
LP +++A + C L I +Q CGSCWA S++ ++DR CI + G +S Q
Sbjct: 120 ALPDSYNAANDSNYYMCQQLHRIRNQEQCGSCWAFSISEMVADRFCIGTRGKINTIMSPQ 179
Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+V+C GCNGG A++F G+V+ +GC PY
Sbjct: 180 WMVSCDTADNGCNGGEFPTAFQFVETTGLVS-------DGCVPY 216
>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
Length = 327
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 43/82 (52%), Positives = 57/82 (69%), Gaps = 1/82 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG +D YWL+
Sbjct: 244 DIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGEDYWLL 303
Query: 134 ANSWNDHWGDHGTFKILRGENE 155
AN WN WGD G FKI RG NE
Sbjct: 304 ANQWNREWGDDGYFKIRRGTNE 325
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 68/135 (50%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+NFDAR W +C ++ I DQ +CGSCWA ++SDR CI + +S ++
Sbjct: 101 LPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFD--VNISLSVNDLL 158
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GC+GG+P AWR+ H+GVVT E C PY C H P
Sbjct: 159 ACCGFLCGSGCDGGYPLYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 207
Query: 304 TLLGKLKTPECKQNC 318
+TP+C + C
Sbjct: 208 ----AYRTPKCVKKC 218
>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
Length = 180
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 47/103 (45%), Positives = 59/103 (57%), Gaps = 2/103 (1%)
Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
A+SDRLCI SNG F +SA +++C NC +GC GG+P +AW +W +G+VTGG
Sbjct: 2 AVEAMSDRLCIHSNGAFNKSLSAVDLLSCCENCGFGCRGGYPAVAWDYWKTHGIVTGGSK 61
Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNP 321
GC+ Y CEHHVQG C TPEC Q C P
Sbjct: 62 EDPSGCRSYPFPKCEHHVQGHYPPCP-RELYPTPECVQQCDTP 103
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIP 129
+ M++I GP+ AIF++Y DFL+Y SGVY H G + HAVR+LGWG ++P
Sbjct: 126 SIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGNVP 180
>gi|294950069|ref|XP_002786445.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239900737|gb|EER18241.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 149
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 40/79 (50%), Positives = 53/79 (67%), Gaps = 1/79 (1%)
Query: 72 PRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPY 130
P ++Q I+EHGP+ F +Y DF YKSGVY H GD +G H ++++GWGVE+ Y
Sbjct: 45 PAVQQIKQEIFEHGPVFCAFDMYKDFGLYKSGVYVHTTGDLVGSHTLKIIGWGVESGQEY 104
Query: 131 WLVANSWNDHWGDHGTFKI 149
WL NSWN+ WGDHG K+
Sbjct: 105 WLAMNSWNEEWGDHGLIKM 123
>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
Length = 209
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 43/86 (50%), Positives = 58/86 (67%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+ M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG ++ YWL+
Sbjct: 96 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLI 155
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
AN WN WGD G F I RG NE IE
Sbjct: 156 ANQWNRSWGDDGYFMIRRGTNECGIE 181
>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
Length = 231
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 41/75 (54%), Positives = 54/75 (72%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I HGP+ A F VY+DF YKSGVY+H G G+HAV+++GWG EN + YWL+ANSW
Sbjct: 145 KEILTHGPVNADFMVYSDFTVYKSGVYRHQTGSFEGIHAVKIIGWGTENGVDYWLIANSW 204
Query: 138 NDHWGDHGTFKILRG 152
+G G FKI+RG
Sbjct: 205 GTTFGLQGFFKIVRG 219
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 61/109 (55%), Gaps = 9/109 (8%)
Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
FD+R+KWP C + I DQ NCGSC++ + + +SDR CI SNG +S Q +V C+
Sbjct: 6 FDSRQKWPNC--VHPIRDQGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTCSW 63
Query: 251 NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
+GCNGG P L + + +G+V+ + C PY HV+ P
Sbjct: 64 YSFGCNGGIPGLVFDYIHKDGLVS-------DACFPYLSYDGNTHVKCP 105
>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 62/94 (65%), Gaps = 2/94 (2%)
Query: 67 KAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN 126
+ H P + M ++Y++GP+ F+VY DF YKSGVY+H G +G HAV+++GWG +
Sbjct: 237 RVHSNPH-DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD 295
Query: 127 -DIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
YWL+AN WN WG G FKI+RG+NE IE
Sbjct: 296 AGEDYWLLANQWNRGWGGDGYFKIIRGKNECGIE 329
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 69/135 (51%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR +W C ++ +I DQ +CG+CWA + ++ DR CI N + +S ++
Sbjct: 101 LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESLQDRFCIHLN--MSVSLSVNDLL 158
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR++ +GVVT E C PY C+H P
Sbjct: 159 ACCGFLCGSGCNGGYPISAWRYFRRSGVVT-------EECDPYFDQTGCQHPGCEP---- 207
Query: 304 TLLGKLKTPECKQNC 318
TP+C + C
Sbjct: 208 ----AYPTPKCHRKC 218
>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 332
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 57/86 (66%), Gaps = 3/86 (3%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND---IPYWLV 133
M+ + GP+ A+ SVY DFL Y+ GVY+H +G I HAV ++G+G +D IPYW+V
Sbjct: 235 MQTLANDGPVQAVMSVYRDFLYYRGGVYKHVYGIQISSHAVEIIGYGTTDDEERIPYWIV 294
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
NS +WG+ G F I+RG NE DIE
Sbjct: 295 KNSLGPNWGEEGYFNIVRGSNECDIE 320
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 42/88 (47%), Gaps = 2/88 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD RE++P+C + + DQ CG+CWA S A DR C+ S Q+ V
Sbjct: 104 IPDAFDLREEYPQC--ITPVYDQGYCGACWAFSATGAFGDRRCMQWLDPVGVPYSQQYTV 161
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
+C GC GG W F +G T
Sbjct: 162 SCDDLDLGCAGGTSFNVWTFLTEHGTTT 189
>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
Length = 226
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 69/116 (59%), Gaps = 5/116 (4%)
Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTG 275
AVS A+SDR+CI S G + ++SA +++C NC GC+GG+P AW +W +G+VTG
Sbjct: 42 AVSAVGAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDYWVSHGIVTG 101
Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKL-KTPECKQNCYNPSYESTYRFD 330
G + GCQPY CEHH G +C K+ KTP+CK+ C Y + Y D
Sbjct: 102 GSKENHTGCQPYPFPKCEHHSIGKYPSCG--DKIYKTPQCKRKC-QKGYTTPYEHD 154
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 36/50 (72%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND 127
++I +GP+ A ++ DFL YKSG+Y++ G +G H VR++GWG+EN+
Sbjct: 173 KEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENE 222
>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
Length = 196
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/134 (41%), Positives = 72/134 (53%), Gaps = 13/134 (9%)
Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN-CW-GCNGGWPQLAWRFWGHNG 271
SCWA A A+SDR+CIAS G ISA +++C C GC GG+P AW++W G
Sbjct: 1 SCWAFGAAEAMSDRICIASQGKTQVTISADDVLSCCGKKCGNGCEGGYPIEAWKYWVKTG 60
Query: 272 VVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG-----KLKTPECKQNCYNPSYEST 326
+ TGG Y SQ GC+PY + PC HH +N T G + TP C C +Y++
Sbjct: 61 ICTGGSYESQSGCKPYPIPPCGHH-----KNQTYFGPCPTDEYDTPVCTNKCIA-AYKTP 114
Query: 327 YRFDLKKGKKAHMV 340
Y D G A+ V
Sbjct: 115 YSDDKHYGTSAYNV 128
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 41/69 (59%), Gaps = 2/69 (2%)
Query: 63 HYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
HY A+ V + A ++I +GP+ A ++VY DF QY GVY H G +G HAVR+L
Sbjct: 120 HYGTSAYNVAKTVAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVYTHTGGAEVGGHAVRIL 179
Query: 121 GWGVENDIP 129
GWGV P
Sbjct: 180 GWGVRQQDP 188
>gi|23344736|gb|AAN28681.1| cathepsin B [Theromyzon tessulatum]
Length = 65
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 39/65 (60%), Positives = 51/65 (78%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+++ +HGP+ A +VY+DFLQYKSGVY H GD +G HAV+++GWGVEN +PYWLV NSW
Sbjct: 1 KELMKHGPVEAALTVYSDFLQYKSGVYHHVAGDELGGHAVKLIGWGVENKVPYWLVVNSW 60
Query: 138 NDHWG 142
WG
Sbjct: 61 GTTWG 65
>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 156
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 55/77 (71%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I+++G ++ + S+Y DF YKSGVY H G +G+H+++++GWGVE+ YWL NSW
Sbjct: 68 QEIFDNGTVLGVISMYEDFRLYKSGVYVHTTGGLVGVHSLKIIGWGVESGQDYWLAVNSW 127
Query: 138 NDHWGDHGTFKILRGEN 154
N+ WGDHG K+ GE
Sbjct: 128 NEEWGDHGMIKLAVGET 144
>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
Length = 442
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 84/158 (53%), Gaps = 15/158 (9%)
Query: 9 TKKKKKKKKKKEEKKKKKKKKKKKEEEKKKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKA 68
+ + + +K K ++ K + ++K + K +K P + ++ +
Sbjct: 279 SGRTGQVEKCKVPRRGNLATMKCQLVNAAERKSDRSDKPPRKGLFRSPPAYRIAPFED-- 336
Query: 69 HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS---IGLHAVRVLGWGVE 125
+ M +I +HGP+ A V+ DF Y+ GVY+++ +S G H+VR++GWGV+
Sbjct: 337 ------DIMNEILQHGPVQATMRVHPDFFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVD 390
Query: 126 ----NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
N YWLVANSW WG+ G F+I+RGENE+DIE
Sbjct: 391 SSKRNPTKYWLVANSWGRLWGEDGYFRIVRGENESDIE 428
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 53/103 (51%), Gaps = 10/103 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD R +W + +L+ + DQ CG+ WA S A +DRL I S G+ +S Q+++
Sbjct: 185 LPMSFDGRIEWRD--TLQDVRDQGWCGASWAFSTAAVAADRLAIQSRGHEVYPLSMQNLL 242
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
AC GCNGG AW + GVV E C PY
Sbjct: 243 ACNNRGQQGCNGGHLDRAWNYMRRFGVVN-------EECYPYI 278
>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
Length = 350
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/160 (36%), Positives = 83/160 (51%), Gaps = 15/160 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASN-----GYFTGQIS 241
LP +FD E+WP+ P R I DQ + G CWA+ AISD +CI N G ++S
Sbjct: 93 LPESFDPXEQWPDXPX-REIRDQGSYGFCWALGALEAISDWICIHPNVGGAQGGNHVEVS 151
Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPL 300
A+ + C GCNGG P W FW G+V+GG Y+S GC+ + +L PC+HH+ G
Sbjct: 152 AEDKLTCLCGD-GCNGGXPNEGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKHHIHGX- 209
Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
+ +P+C C P TY+ D G ++ +
Sbjct: 210 ---PYVXTGDSPKCSMTC-EPG--QTYKXDKHYGCSSYSI 243
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 53/85 (62%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+ M IY++ + FSVY DFL YK YQ G+ G HA+ +LG VEN YWLVA
Sbjct: 249 DIMTNIYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEMXGGHAICILGCKVENSTSYWLVA 308
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
N WN WGD+G FKILRG++ IE
Sbjct: 309 NXWNRDWGDNGFFKILRGQDHYGIE 333
>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
Length = 432
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/89 (48%), Positives = 56/89 (62%), Gaps = 4/89 (4%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVEND-IPY 130
+ M +IY GP+ A VY DF Y GVY+ N G G H+V+++GWG E+D + Y
Sbjct: 324 DIMAEIYHSGPVQATMRVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKY 383
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
W+ ANSW WG+HG F+ILRG NE IE
Sbjct: 384 WIAANSWGPWWGEHGYFRILRGSNECGIE 412
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/105 (40%), Positives = 54/105 (51%), Gaps = 9/105 (8%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
+ GLPR F+A E+W + + DQ CGS W +S + SDR I S G Q+S Q
Sbjct: 184 SSGLPRKFNAVERWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQ 241
Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+I++CT GC GG AWR+ GVV E C PYT
Sbjct: 242 NILSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DETCYPYT 279
>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 348
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 74/149 (49%), Gaps = 12/149 (8%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FDAR+ + EC + H+ DQS C SCWA++ A S RLCI S G F +SA +
Sbjct: 83 IPSSFDARDAFKECKDVIGHVWDQSACASCWAIAPVQAFSARLCIKSGGKFNQLLSAGEL 142
Query: 246 VAC-----TPNCWGCNGGWPQLAWRFWGHNGVVTGGDY------NSQEGCQPYTLAPCEH 294
+AC + GC GG + AW F +G+ TGGD+ + +GC PY C H
Sbjct: 143 LACCNLAHSCEARGCKGGVARDAWVFLNKHGIATGGDFVPKSSMEAVDGCWPYNFPRCAH 202
Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSY 323
+ + +TP C C N Y
Sbjct: 203 YQKKSKYGPCPKKSYETPSCLDRCPNEKY 231
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 48/76 (63%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I +HGP A F Y DF YKSGVY++ G + H V ++GWG E + YWL N W
Sbjct: 258 KEIMKHGPTSASFFTYEDFFSYKSGVYKYTSGAYVEFHTVELIGWGTEKGVDYWLAKNDW 317
Query: 138 NDHWGDHGTFKILRGE 153
N+ W D GTFKI +G+
Sbjct: 318 NEEWADLGTFKIAQGD 333
>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 157
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 56/86 (65%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+A I GP+ A F+VY DFL Y+SGVY+H G +G HAV+++GWG ++ YWL
Sbjct: 65 DAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAV 124
Query: 135 NSWNDHWGDHGTFKILRGENEADIEM 160
NSWN+ WGDHG FKI G D ++
Sbjct: 125 NSWNEDWGDHGLFKIALGNCGIDDDL 150
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 24/49 (48%)
Query: 282 EGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
+GC PY PC HH+ G TP C + C+NP Y +T R D
Sbjct: 1 DGCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDD 49
>gi|118398308|ref|XP_001031483.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89285812|gb|EAR83820.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 591
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 79/285 (27%), Positives = 121/285 (42%), Gaps = 57/285 (20%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG-VENDIPYWLVAN 135
M++IY+ GP+ +V L Y G++ GD H + V+G+G ++N YW+V N
Sbjct: 195 MQEIYQRGPITCGIAVPDALLNYTGGIFYDRTGDLEIEHDISVVGYGTLKNGTKYWMVRN 254
Query: 136 SWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDD--DLETM------------GC 181
SW +WG++G F+I+RG N +IE V ++ +D +L T+ GC
Sbjct: 255 SWGTYWGENGFFRIIRGVNNLNIESACAWAVPRDTWSNDVRNLTTVNEKPVSNFQKSSGC 314
Query: 182 QN--------------------AKGLPRNFDAREKWPECPSLRHIADQSN------CGSC 215
+ A LP++F W +++ N CGSC
Sbjct: 315 KRESIFNLPEKIKSSRPHEYLKAADLPKSF----TWQNAYGKNYLSITRNQHIPVYCGSC 370
Query: 216 WAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
WA ++I+DR+ IA NG F +S Q I+ C C+GG + F NG+
Sbjct: 371 WAHGATSSIADRINIARNGTFPQVALSPQVIINCKAGG-SCSGGNAMGVYEFGHTNGI-- 427
Query: 275 GGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY 319
+E CQ Y E +Q C E QNC+
Sbjct: 428 -----PEESCQQYVAKNPEKFTCSDIQQCM---NCAPSEKGQNCW 464
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 2/83 (2%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
+I+ GP+ Y G+++ N + H V V+GWGV+ + YW+ NS
Sbjct: 489 EIFARGPIGCGIEATLKLENYSGGIFEQNLLFTSLNHEVAVVGWGVDEATGVEYWIARNS 548
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W +WG++G F+I +N IE
Sbjct: 549 WGSYWGENGYFRIRMHKNNNGIE 571
Score = 42.0 bits (97), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 4/66 (6%)
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNC--WGCNGGWPQLAWRFWG 268
CGSCWA + +A+SDR+ IA N F +S Q +++C + GCNGG + A+ W
Sbjct: 74 CGSCWAFAATSALSDRIKIARNATFPDINLSPQFLLSCQQDQEDLGCNGGDARNAFA-WI 132
Query: 269 HNGVVT 274
H+ +T
Sbjct: 133 HSNNIT 138
>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
Length = 350
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 71/135 (52%), Gaps = 9/135 (6%)
Query: 154 NEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCG 213
NE D++ ++ + ++ D + + P FDARE+WP+C +R I +Q NCG
Sbjct: 92 NENDLKGEVMDKDNSTNTPLSDSRYLTILRLRDFPTQFDAREQWPQC--IRSIKNQKNCG 149
Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVV 273
SCWA S ++ ++DR CI S G +S Q +V+C+ GCNGG+ WRF G V
Sbjct: 150 SCWAFSASSVLADRFCIKSGGKVNVDLSPQFMVSCSGQNNGCNGGFFDATWRFLVSVGTV 209
Query: 274 TGGDYNSQEGCQPYT 288
+ E C PY
Sbjct: 210 S-------EACVPYV 217
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 39/86 (45%), Positives = 52/86 (60%), Gaps = 2/86 (2%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND--IPYWL 132
+ M + +GP+ VY DF YKSGVY H G +G HAV+++GWG ++ +PYW+
Sbjct: 254 DIMADLKANGPIQVAMGVYRDFYSYKSGVYHHVSGRYVGGHAVKIVGWGYDSASKLPYWI 313
Query: 133 VANSWNDHWGDHGTFKILRGENEADI 158
ANSW + WG G F ILRG E I
Sbjct: 314 CANSWGEDWGIKGYFWILRGRGECGI 339
>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 196
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 63/97 (64%), Gaps = 1/97 (1%)
Query: 64 YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
Y + + + + + + +GP+ A F VY+DF YKSG+Y+ + +G HAV+++GW
Sbjct: 89 YTRDYYYLTYGSIQKDVMTYGPIEASFDVYSDFPSYKSGIYERTENATYLGGHAVKLIGW 148
Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G + IPYWL+ NSWN+ WGD+G FKI RG NE ++
Sbjct: 149 GEQYGIPYWLMVNSWNEDWGDNGLFKIRRGTNECGVD 185
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 47/78 (60%), Gaps = 6/78 (7%)
Query: 245 IVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+ C C +GC+GG+P AW+ + ++G+VTGGDY S EGC+PY + PC + QG N
Sbjct: 2 LTFCCHTCGFGCHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQG---NN 58
Query: 304 TLLGKL--KTPECKQNCY 319
T GK K C + CY
Sbjct: 59 TCAGKPMEKNHRCTRICY 76
>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
Length = 430
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 70/127 (55%), Gaps = 19/127 (14%)
Query: 49 KKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
+++RL+ + Y+ +H + M +IY++GPL F VY D YK GVY+H
Sbjct: 297 RQQRLHSSNYYFVGGYYGNSH---ELSMMHEIYQNGPLAIGFEVYPDLRNYKHGVYKHVT 353
Query: 109 GDSI---GL-------------HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRG 152
+ + GL HAV ++GWGVEN PYW + NSW+ WGD+G FKILRG
Sbjct: 354 AEELKAQGLSEDEMIPHFEVVNHAVLMVGWGVENGTPYWKIKNSWSTTWGDNGYFKILRG 413
Query: 153 ENEADIE 159
+E +E
Sbjct: 414 SDECGVE 420
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 39/82 (47%), Gaps = 7/82 (8%)
Query: 206 IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWR 265
+ +Q CGSC+A S ++ R+ I SN S Q IV C+ GC+GG+P L +
Sbjct: 207 VRNQEQCGSCYAFSSSDMFGSRVRIPSNLTQVPVYSPQDIVDCSAYSQGCDGGFPFLVGK 266
Query: 266 FWGHNGVVTGGDYNSQEGCQPY 287
+ G+ E C PY
Sbjct: 267 YAMDYGLTV-------ESCDPY 281
>gi|294890618|ref|XP_002773230.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878281|gb|EER05046.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 238
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 75/153 (49%), Gaps = 9/153 (5%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FDAR+ + EC + H+ DQS CGSCWA A + R+CI S G +SA +
Sbjct: 59 IPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAADM 118
Query: 246 VACTPN-----CWGCNGGWPQLAWRFWGHNGVVTG---GDYNSQEGCQPYTLAPCEHHVQ 297
+AC +GC+GG P +W F NG+V+G + + +GC PY C HH +
Sbjct: 119 LACCNIEHFCLSFGCSGGNPITSWTFLHTNGIVSGKLSKNMKAADGCWPYNFPKCAHHQK 178
Query: 298 GPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
TP C +C N Y + + D
Sbjct: 179 ESDYKPCAKELYDTPSCSSSCPNAKYGTAFDKD 211
>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 342
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 56/82 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I+++GP+ AI +++ DF YKSGVY++ G +G H ++++GWGVE YWL NSW
Sbjct: 212 QEIFDNGPVAAIMTIHEDFRLYKSGVYEYKTGAMVGAHTLKLIGWGVEAGQEYWLAVNSW 271
Query: 138 NDHWGDHGTFKILRGENEADIE 159
N+ WGD G K+ G+N D E
Sbjct: 272 NEEWGDQGKIKLAVGKNALDEE 293
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 60/121 (49%), Gaps = 13/121 (10%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP NF+A+ K+ C + HI DQ+ C +CWA + +DR+CI S G T +S ++
Sbjct: 39 LPSNFNAQIKFASCADVIGHIRDQAECHNCWASASVGMFNDRVCIQSGGRITDILSLAYL 98
Query: 246 VAC------TPNCWGCNGGWPQLAWRFWGHNGVVTGGDY------NSQEGCQPYTLAPCE 293
+C P GC G F ++G+VTGG+Y + +GC PY C
Sbjct: 99 TSCCNHANGCPKSDGCRRGSVAEGLIFMKNHGIVTGGEYKPPKKLGNDDGCWPYPFPKCN 158
Query: 294 H 294
H
Sbjct: 159 H 159
>gi|294893885|ref|XP_002774682.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880102|gb|EER06498.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 121
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/89 (49%), Positives = 57/89 (64%), Gaps = 2/89 (2%)
Query: 187 LPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP +FDAR +P C + HI DQS CGSCWA V A +DRLC+ SNG FT +SA +
Sbjct: 34 LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSAGEM 93
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
AC P+ +GC+GG+P AW + G+ T
Sbjct: 94 NACAPS-YGCDGGYPDSAWSWVHDEGIAT 121
>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 288
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 39/80 (48%), Positives = 52/80 (65%)
Query: 80 IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
I GP+ VY+D + YKSG+Y H G+ +G HAV ++GWG +N I YW+++NSWN
Sbjct: 200 IMTEGPVTTSLKVYSDLMYYKSGIYTHTKGEFLGHHAVEIIGWGTKNGIDYWIISNSWNT 259
Query: 140 HWGDHGTFKILRGENEADIE 159
WG +G F I RG NE IE
Sbjct: 260 TWGMNGLFLIKRGVNECHIE 279
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 11/101 (10%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +++ E++P+C + DQ CGSCW+ +V+ + S R C N S H+V
Sbjct: 68 IPMSYNFTERFPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRKYNKPVL--FSQSHLV 123
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
AC GC GG AWR+ G+ + CQPY
Sbjct: 124 ACDRRNSGCGGGIEVNAWRYIDLRGL-------PLDSCQPY 157
>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 296
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 63/91 (69%), Gaps = 2/91 (2%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLV 133
+A+ + HGP+VA F+V DF+ YKSGVYQH +G +G HAV V+G+GV ++ + YW V
Sbjct: 201 SAIDVLLSHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEVVGYGVTDSGLDYWTV 260
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEM-GFN 163
NSW WG+ G F+I+RG +E IE GF+
Sbjct: 261 RNSWGPDWGEDGYFRIVRGSDECGIEQEGFH 291
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 2/88 (2%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
G P ++D R+++P C + + DQ +CGSCWA S +D C + S Q++
Sbjct: 75 GAPESYDFRDEYPHC--ITEVVDQGSCGSCWAFSSIQTFADHRCRSGLDATGVSYSVQYV 132
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVV 273
+ C GCNGG P A+ F G V
Sbjct: 133 LDCDRKDHGCNGGEPTKAFDFLHSTGTV 160
>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
[Acyrthosiphon pisum]
Length = 129
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 56/83 (67%), Gaps = 1/83 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
+ + +GP+ A F VY DF YKSGVYQ + +G HAV+++GWGVE PYWL+ NS
Sbjct: 36 KDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNS 95
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
WN WGD+G FKI RG +E I+
Sbjct: 96 WNAQWGDNGLFKIRRGTDECRID 118
>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
Length = 297
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 40/76 (52%), Positives = 49/76 (64%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+I HGP+ F+VY DF Y+SGVY D G HA+++LG+GVEN PYWL ANSW
Sbjct: 208 SEIVAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGGHAIKILGFGVENGTPYWLCANSW 267
Query: 138 NDHWGDHGTFKILRGE 153
WG G FKI +GE
Sbjct: 268 GPSWGMQGFFKIKQGE 283
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 64/141 (45%), Gaps = 26/141 (18%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P NFDAR++W + I DQ CG+CWA A+SDR IASNG S + +V+
Sbjct: 77 PDNFDARQQWGS--KIHAIRDQQQCGACWAFGATEALSDRFTIASNGSVDVVFSPEDLVS 134
Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
C N +GCNGG+ +AW F +GVV + C PY+ G
Sbjct: 135 CDTNDYGCNGGYMDMAWEFLDQHGVVA-------DSCFPYS-----------------AG 170
Query: 308 KLKTPECKQNCYNPSYESTYR 328
P C C + S E Y
Sbjct: 171 SGFAPACASKCADGSAEKKYS 191
>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 298
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 64/190 (33%), Positives = 90/190 (47%), Gaps = 19/190 (10%)
Query: 171 SEDDDLETMGCQNAK----GLPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAIS 225
SE D E ++ K LP FDAR+K+ C + H+ DQ CG+CWAV ++
Sbjct: 13 SESSDEEIRLVESTKPVVENLPPEFDARQKFNYCRDVIGHVRDQGRCGNCWAVCPTEVLN 72
Query: 226 DRLCIASNGYFTGQISAQHIVAC---TPNCW---GCNGGWPQLAWRFWGHNGVVTGGDYN 279
DRLCI S+G +SA ++ +C C GCNGG A F +GVVTG D+
Sbjct: 73 DRLCIKSSGKIQEILSAGYVTSCCNPAHGCLHAKGCNGGRLVEAMSFLRDHGVVTGNDFK 132
Query: 280 SQ------EGCQPYTLAPCEH-HVQGP-LQNCTLLGKLKTPECKQNCYNPSYESTYRFDL 331
Q +GC PY C H +G C + + P C+ C N +Y+ + D+
Sbjct: 133 PQDQLREADGCWPYPFQKCNHVPTEGTGYPKCKDVVQQPVPPCRTTCTNKAYKKSLEKDV 192
Query: 332 KKGKKAHMVL 341
+ K VL
Sbjct: 193 HRAKSWRKVL 202
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 57/90 (63%), Gaps = 2/90 (2%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I+++GP+ + F +Y DF YKSGVY + LH ++++GWG ++ YWL N+W
Sbjct: 210 QEIFDNGPVFSAFEMYKDFRYYKSGVYVPTTKEVDCLHVIKIIGWGADSVREYWLAMNAW 269
Query: 138 NDHWGDHGTFKILRGENEADIEMGFNNRVE 167
N+ WGDHG K+ G+N +E G +R +
Sbjct: 270 NEEWGDHGLIKMAFGKNR--LENGTFHRAD 297
>gi|146163742|ref|XP_001012227.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145940|gb|EAR91982.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 581
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 116/273 (42%), Gaps = 58/273 (21%)
Query: 75 NAMRQIYEHGPL-VAIFSVYADFLQYK--SGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
N M++I+ GP+ I S D+L+Y G+Y + HA+ V+GWGVEN YW
Sbjct: 186 NMMQEIFNRGPIGCGIAS--NDYLRYNYTGGIYVNTTEVDYHNHAISVVGWGVENGTKYW 243
Query: 132 LVANSWNDHWGDHGTFKILRGENEADIEM-------------GFNNRVEANSSEDDDLET 178
+V NSW +WG+ G F+++RG N +IE N +N++ +
Sbjct: 244 IVRNSWGSYWGEKGYFRLVRGINSLNIESDCAWAVPKDTWTNDVRNTTASNTNSQSNFRQ 303
Query: 179 M-GCQ--------------------NAKGLPRNFDAREKWPECPSLRHIADQSN------ 211
+ C N LP+++D W + +++ N
Sbjct: 304 LHDCVRQENNQKDQVILSPLPHQYLNGAVLPKSWD----WRNISGVNYLSVTRNQHIPQY 359
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHN 270
CGSCWA ++I+DR+ IA N F ++S Q I+ C CNGG P + F
Sbjct: 360 CGSCWAHGTTSSIADRINIARNRTFPDIELSVQAIINCKAGG-SCNGGQPISVYSFAHKK 418
Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
GV +E CQ Y + +Q C
Sbjct: 419 GV-------PEESCQNYVAKNPQKFSCSDIQRC 444
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 7/90 (7%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
+I+ GP+ +V F Y GVY I H + V+GWGV+ + YW+ NS
Sbjct: 488 EIFARGPISCGIAVTNKFEAYTGGVYSEKSLTRIN-HEIAVVGWGVDETTNTEYWIGRNS 546
Query: 137 WNDHWGDHGTFKI-LRGEN---EADIEMGF 162
W +WG+ G F+I + EN E D G
Sbjct: 547 WGTYWGEDGFFRIKMHSENLKIETDCSWGV 576
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 49/108 (45%), Gaps = 18/108 (16%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSN------CGSCWAVSVANAISDRLCIASNGYFTG-Q 239
LP NF W + + ++ N CGSCWA + + +SDR+ IA F
Sbjct: 42 LPENF----FWGDVDGVNYLTVTKNQHIPQYCGSCWAFTATSTLSDRIKIARKAAFPDIL 97
Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
IS Q +++C GC+GG ++++ N + + E C PY
Sbjct: 98 ISPQVLISCDDFSNGCHGGNILTSYQWIAQNNI-------TDETCSPY 138
>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 198
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 59/110 (53%), Gaps = 14/110 (12%)
Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWG--CNGGWPQLAWRFWGHNG 271
SCWAVS A ISDR+CIASN ISA I AC G CNGG+P AWR + G
Sbjct: 1 SCWAVSAAETISDRICIASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKG 60
Query: 272 VVTGGDYNSQEGCQPYTLAPCEHHVQGPL------------QNCTLLGKL 309
VTGG Y + GC+PY PCEHHV G QN LGKL
Sbjct: 61 YVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQNANALGKL 110
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/60 (53%), Positives = 40/60 (66%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+ I HG L +V+ DF Y GVY H G S+G HAV++LGWGV+N PYWL+ANSW
Sbjct: 139 KGIKTHGQLRGGITVFEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLIANSW 198
>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
Length = 432
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 41/89 (46%), Positives = 56/89 (62%), Gaps = 4/89 (4%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVEND-IPY 130
+ M +IY GP+ A +Y DF Y G+Y+ N G G H+V+++GWG E+D + Y
Sbjct: 325 DIMAEIYHSGPVQATMRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKY 384
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
W+ ANSW WG+HG F+ILRG NE IE
Sbjct: 385 WIAANSWGPWWGEHGYFRILRGSNECGIE 413
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 53/102 (51%), Gaps = 9/102 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LPR F+A EKW + + DQ CGS W +S + SDR I S G Q+SAQ+I+
Sbjct: 187 LPRKFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSAQNIL 244
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+CT GC GG AWR+ GV+ E C PYT
Sbjct: 245 SCTRRQQGCEGGHLDAAWRYLHKKGVL-------DEKCYPYT 279
>gi|340508280|gb|EGR34021.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 620
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 108/265 (40%), Gaps = 63/265 (23%)
Query: 75 NAMRQIYEHGPLVA-IFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLV 133
N M++I+ GP+ I+S Y G+Y H V ++GWGVEN + YW+V
Sbjct: 183 NIMQEIFNRGPVACNIYSTEYLRYNYTGGIYNDTTAYPETNHVVSIVGWGVENGVKYWIV 242
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQN---------- 183
NSW +WG+ G ++ LRG N +IE V ++ +D+ + + N
Sbjct: 243 RNSWGSYWGEKGFYRQLRGVNMINIEQFCYWAVPKDTWTNDERDKIQTSNEQEKQESNQE 302
Query: 184 ---------------------------------AKGLPRNFDAREKWPECPSLRHIADQS 210
+ +P++FD W + +++
Sbjct: 303 KINNFFKFSNYTCRRESPKNQPQLIKGKQPYQIIQKVPKSFD----WRNVNGVNYLSHTR 358
Query: 211 N------CGSCWAVSVANAISDRLCIASNGYF-TGQISAQHIVACTPNCWGCNGGWPQLA 263
N CGSCWA +++SDR+ IA N + +S Q I+ C C GG PQ
Sbjct: 359 NQHIPQYCGSCWAHGTTSSLSDRINIARNKTWPDTSLSVQAIINCNAGG-SCEGGNPQTV 417
Query: 264 WRFWGHNGVVTGGDYNSQEGCQPYT 288
+ F + G+ +E CQ Y
Sbjct: 418 YEFANNKGI-------PEESCQNYV 435
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 2/83 (2%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
+IY GP+ V + F Y G+Y + H + V+GWG++ YW+ NS
Sbjct: 494 EIYMRGPISCGIHVSSKFEAYNGGIYSERSILPVINHEIAVVGWGIDEKTKTEYWIGRNS 553
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W +WG+ G F+I +N IE
Sbjct: 554 WGTYWGESGFFRIQMHKNNLGIE 576
Score = 41.2 bits (95), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 8/77 (10%)
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHN 270
CGSCWA + ++++SDR+ I N + I+ Q +V+C GC+GG ++++ N
Sbjct: 66 CGSCWAQAASSSLSDRIKIVRNAQWPDILIAPQVLVSCNKYSNGCHGGSAADSFQWIKEN 125
Query: 271 GVVTGGDYNSQEGCQPY 287
+ + E C PY
Sbjct: 126 NI-------TDESCSPY 135
>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
Length = 294
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 40/76 (52%), Positives = 49/76 (64%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+I HGP+ F+VY DF Y+SGVY D G HA+++LG+GVEN PYWL ANSW
Sbjct: 205 SEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGGHAIKILGYGVENGTPYWLCANSW 264
Query: 138 NDHWGDHGTFKILRGE 153
WG G FKI +GE
Sbjct: 265 GPAWGMSGFFKIKQGE 280
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 52/102 (50%), Gaps = 12/102 (11%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P NFDAR++W + I DQ CGSCWA A SDR I +S + +V
Sbjct: 76 VPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDRFAINGKDVI---LSPEDLV 130
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+C N +GCNGG+ +AW + +G T + C PY+
Sbjct: 131 SCDTNDYGCNGGYMDVAWEYLADHGAAT-------DSCFPYS 165
>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 451
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/89 (53%), Positives = 57/89 (64%), Gaps = 8/89 (8%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHN---FGDSIGLHA-----VRVLGWGVENDIPY 130
+I E+GP+ A F V DF Y SGVY+H D+ HA V++LGWGVEN I Y
Sbjct: 321 EIMENGPVQASFEVKEDFFMYGSGVYRHTPIASNDAEQYHASEWHSVKLLGWGVENGIKY 380
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
WL ANSW WG+ G FKILRGENE +IE
Sbjct: 381 WLGANSWGTKWGEDGYFKILRGENECNIE 409
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 55/105 (52%), Gaps = 10/105 (9%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
K +P++FDAR+KW + I DQ NC S WA S SDRL I S+G +S QH
Sbjct: 177 KKIPKSFDARDKWGS--MITGILDQGNCASSWAFSTVGVASDRLAIQSSGETGMTLSPQH 234
Query: 245 IVAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+++C T GC+GG AW F GVV+ C PYT
Sbjct: 235 LLSCNTRGQRGCSGGHIDRAWWFMRKRGVVS-------NDCYPYT 272
>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 360
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLV 133
+A+ + HGP+VA F+V DF+ YKSGVYQH +G +G HAV ++G+GV ++ + YW V
Sbjct: 265 SAIDVLLAHGPVVATFNVAQDFMYYKSGVYQHRWGLWLGGHAVEIIGYGVTDSGLDYWTV 324
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
NSW WG+ G F+I+RG +E IE
Sbjct: 325 RNSWGPDWGEDGYFRIVRGGDECGIE 350
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 2/88 (2%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
G P ++D R+++P C + + DQ NCGSCWA S +D C + S Q++
Sbjct: 139 GAPESYDFRDEYPHC--ITEVVDQGNCGSCWAFSSVQTFADHRCRSGLDATGVSYSVQYV 196
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVV 273
+ C GCNGG P A+ F + G V
Sbjct: 197 LDCDRKDHGCNGGEPVNAFNFLHNTGTV 224
>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 363
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLV 133
+A+ + HGP+VA F+V DF+ YKSGVYQH +G +G HAV ++G+GV ++ + YW V
Sbjct: 268 SAIDVLLAHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEIVGYGVTDSGLDYWTV 327
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
NSW WG+ G F+I+RG +E IE
Sbjct: 328 RNSWGPDWGEDGYFRIVRGGDECGIE 353
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 2/88 (2%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
G P ++D RE++P C + + DQ +CGSCWA S +D C + S Q++
Sbjct: 142 GAPESYDFREEYPHC--ITEVVDQGSCGSCWAFSSIQTFADHRCRSGLDATGVSYSVQYV 199
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVV 273
+ C GCNGG P A+ F + G V
Sbjct: 200 LDCDRKDHGCNGGEPVNAFNFLHNTGTV 227
>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 306
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 55/86 (63%), Gaps = 3/86 (3%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND---IPYWLV 133
M+ + GP+ A +VY DFL Y+SGVY+H +G I HAV ++G+G +D PYW+V
Sbjct: 209 MQALANDGPVQASMAVYRDFLYYRSGVYRHVYGSQISSHAVEIIGYGAADDEDSTPYWIV 268
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
NS WG+ G F I+RG NE DIE
Sbjct: 269 KNSLGSGWGEEGYFNIVRGSNECDIE 294
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 50/104 (48%), Gaps = 9/104 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD RE++P+C + + DQ +CGSCWA S +A DR C+ S Q+ +
Sbjct: 78 IPASFDFREEYPQC--ITPVYDQGHCGSCWAFSATSAFGDRRCMQGLDSAGVPYSQQYTI 135
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
+C GC GG W F +G T C PYT A
Sbjct: 136 SCDYLDLGCAGGLSFSVWTFLTEHGTTT-------LECVPYTDA 172
>gi|38639319|gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 218
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 79/163 (48%), Gaps = 23/163 (14%)
Query: 162 FNNRVEANSSEDDDLETMGCQN---AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAV 218
F + + + DLE + K LP+ FDAR+ WP+C ++ I DQ +CGSCWA
Sbjct: 70 FKRLLGVKPAREGDLEGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILDQGHCGSCWAF 129
Query: 219 SVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFWGHNGVVTGG 276
++SDR CI N + +S ++AC GC+GG+P AWR++ +GVVT
Sbjct: 130 GAVESLSDRFCIHYN--LSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVT-- 185
Query: 277 DYNSQEGCQPY-TLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
E C PY C H PL TP+C + C
Sbjct: 186 -----EECDPYFDTTGCSHPGCEPLY--------PTPKCHRKC 215
>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
guttata]
Length = 469
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 70/121 (57%), Gaps = 18/121 (14%)
Query: 57 TSIPLSHYFKKAHMVPRC-----------NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
T+ P + F+K++ + RC + M++I + GP+ AI VY DF YK G+YQ
Sbjct: 339 TNGPCPNAFEKSNRLYRCASHYRVSSKETDIMKEIKDRGPVQAIMKVYEDFFLYKEGIYQ 398
Query: 106 HN--FGDSIGLHAVRVLGWGVEND-----IPYWLVANSWNDHWGDHGTFKILRGENEADI 158
H+ G H+V++LGWG D +W+ ANSW WG++G F+ILRG+NE DI
Sbjct: 399 HSQKAGSKWKTHSVKLLGWGALPDKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDI 458
Query: 159 E 159
E
Sbjct: 459 E 459
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 54/95 (56%), Gaps = 3/95 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
P F A +WPE + DQ NCG+ WA S A+ +DR+ I S G T +SAQ+++
Sbjct: 222 FPAIFSAIYEWPE--WIHDPLDQRNCGASWAFSTASVAADRIAIHSKGQITDNLSAQNLI 279
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNS 280
+C T N GCNGG AWR+ +GVV+ Y S
Sbjct: 280 SCDTRNQHGCNGGSIDGAWRYLKTHGVVSYACYPS 314
>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
Length = 326
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/82 (50%), Positives = 57/82 (69%), Gaps = 1/82 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+I +GP++A ++V+ DF +KSGVY + G +G H+V+V+GWG E IPYWL+ANSW
Sbjct: 207 EILTNGPVMAYYNVFEDFACHKSGVYYYKSGKFVGRHSVKVIGWGTEEGIPYWLIANSWG 266
Query: 139 DHWGD-HGTFKILRGENEADIE 159
WG+ G FK+ RG NE IE
Sbjct: 267 SEWGELGGFFKMRRGTNECWIE 288
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 67/104 (64%), Gaps = 2/104 (1%)
Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
+P +FDAREKWPEC + I +Q NCGSCWA + ++DRLCI+S G S +++
Sbjct: 76 IPESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTEVMTDRLCISSKGKIKFVFSPENL 135
Query: 246 VACTPNCWGCNGG-WPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+ C +C G + + AW ++ + G+ +GGDYNS EGCQPY+
Sbjct: 136 LTCCKDCGCGCKGGYIKNAWDYYINEGIASGGDYNSSEGCQPYS 179
>gi|290973351|ref|XP_002669412.1| predicted protein [Naegleria gruberi]
gi|284082959|gb|EFC36668.1| predicted protein [Naegleria gruberi]
Length = 488
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 53/94 (56%), Gaps = 9/94 (9%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL---------HAVRVLGWGVE 125
N M ++Y GPL F VY DF YK GVY H+ + HAV ++GWG E
Sbjct: 384 NMMYELYHGGPLAIAFEVYDDFFNYKGGVYTHSTALKTKIAEPGWEETNHAVLLVGWGEE 443
Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
N +PYWLV NSW WG +G FKI RG +E D E
Sbjct: 444 NGVPYWLVKNSWGTSWGINGFFKIKRGTDECDCE 477
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 49/110 (44%), Gaps = 15/110 (13%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSN----CGSCWAVSVANAISDRLCIASNGYFT 237
++ LP+ F W + + N CGSC+A S ++ R+ + +NG T
Sbjct: 251 EDVNALPKEF----SWTNVNGMNLVVPVRNQGVFCGSCYAFSSSDMFGSRVRVITNGTKT 306
Query: 238 GQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
S Q IV C+ GC+GG+ L ++ G+ ++E C PY
Sbjct: 307 PVYSPQDIVECSAYSQGCDGGFMYLVSKYAEDYGL-------AEESCDPY 349
>gi|308811264|ref|XP_003082940.1| cysteine proteinase (ISS) [Ostreococcus tauri]
gi|116054818|emb|CAL56895.1| cysteine proteinase (ISS) [Ostreococcus tauri]
Length = 362
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 65/123 (52%), Gaps = 15/123 (12%)
Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP FD REKWP+C +L DQ CGSCWAV+ A A++DRLCIA+NG +SA +
Sbjct: 88 LPDTFDVREKWPKCAALVSEAVDQGACGSCWAVAPAKAMTDRLCIATNGAVNTHVSAIQL 147
Query: 246 VACTPNCWGC--------------NGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
++C + GG+P A+ GVV+GG Q+ C PY AP
Sbjct: 148 LSCNSHSNSAYTYDENLAGGSGGCMGGYPTEAYETAHRVGVVSGGLNGDQDTCMPYPFAP 207
Query: 292 CEH 294
C H
Sbjct: 208 CHH 210
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 52/97 (53%), Gaps = 14/97 (14%)
Query: 79 QIYEHGPLVA-IFSVYADFLQYKSGVYQHN-----FGDSIGLHAVRVLGWGVEND-IPYW 131
+I+E GP+ + VY +F QY+ GVY+ + G + G H + V+GWG + + YW
Sbjct: 256 EIFERGPVTTFVGDVYDEFYQYERGVYKLSKDPAARGKNHGGHVMEVIGWGKSAEGVRYW 315
Query: 132 LVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
V NSW + WG+ G +I G E+ + VEA
Sbjct: 316 KVYNSWLN-WGERGYGEIAVG------ELSIGDNVEA 345
>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
protease B3; Flags: Precursor
gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
Length = 299
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 57/84 (67%), Gaps = 1/84 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
M+ + GPL F+VY+DF+ Y+SGVYQH +G G HAV ++G+G ++D + YW++ N
Sbjct: 206 MKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDMVGYGTDDDGVDYWIIKN 265
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SW WG+ G F+I+R NE IE
Sbjct: 266 SWGPDWGEDGYFRIIRMTNECGIE 289
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 52/108 (48%), Gaps = 9/108 (8%)
Query: 180 GCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ 239
G +A P +FD RE++P C + + DQ CGSCWA S ++ DR C A +
Sbjct: 67 GTVSATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVK 124
Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
S Q++V+C C+GGW WRF G T + C PY
Sbjct: 125 YSPQYVVSCDRGDMACDGGWLPSVWRFLTKTGTTT-------DECVPY 165
>gi|239793607|dbj|BAH72912.1| ACYPI000019 [Acyrthosiphon pisum]
Length = 188
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/83 (49%), Positives = 58/83 (69%), Gaps = 5/83 (6%)
Query: 76 AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN----FGDSIGLHAVRVLGWGVENDIPYW 131
AM+ I+++GP+ F +Y D + YKSGVYQ++ F D +H+V++ GWG EN +PYW
Sbjct: 93 AMKDIFDNGPITTQFYMYRDLVDYKSGVYQYDEQSDF-DFFTVHSVKIFGWGEENGVPYW 151
Query: 132 LVANSWNDHWGDHGTFKILRGEN 154
LVANS+ WG +GTFKI RG +
Sbjct: 152 LVANSFGTDWGYNGTFKISRGND 174
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 39/55 (70%), Gaps = 1/55 (1%)
Query: 282 EGCQPYTLAPCE-HHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
+GCQPYT+ PC+ + + P +CT + +TP C++ CYNP+Y +++R D+ KGK
Sbjct: 30 QGCQPYTIPPCKLMNEKPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYKGK 84
>gi|290977636|ref|XP_002671543.1| predicted protein [Naegleria gruberi]
gi|284085113|gb|EFC38799.1| predicted protein [Naegleria gruberi]
Length = 268
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/109 (41%), Positives = 63/109 (57%), Gaps = 15/109 (13%)
Query: 185 KGLPRN------FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTG 238
K +P+N FDAREKW +C + I +Q CGSCWA S + A SDRLCIA+NG
Sbjct: 81 KKMPKNLKAASHFDAREKWEDC--IHEIRNQEECGSCWAFSASEAFSDRLCIATNGSVNI 138
Query: 239 QISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+S Q++V+C +GC+GG+ AW F + G+ + + C PY
Sbjct: 139 VLSPQYMVSCDATDYGCDGGYLNNAWNFLANTGIPS-------DECVPY 180
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 30/47 (63%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
+ + I E+G + + FSVY DF YKSGVY H G G HA++V+G
Sbjct: 221 DIQKDIQENGSIQSGFSVYKDFFSYKSGVYHHVTGSLAGGHAIKVIG 267
>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 483
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/111 (42%), Positives = 68/111 (61%), Gaps = 14/111 (12%)
Query: 63 HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSI------- 112
H+ + VP + M++IY +GP+ A+ V DF Y+SGVY+H +S+
Sbjct: 327 HFSTPPYRVPANEEDIMQEIYANGPVQALILVKEDFFLYRSGVYRHTRIAESLRPQYSRS 386
Query: 113 GLHAVRVLGWGVEND----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G H+VR+LGWGV+ I YWL ANSW WG++G F+I+RGE+E+ IE
Sbjct: 387 GWHSVRILGWGVDRSQYRPIKYWLCANSWGHGWGENGYFRIVRGEDESQIE 437
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 50/104 (48%), Gaps = 13/104 (12%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP FDAR +W L H + DQ +C + WA S A SDRL I S G ++S Q +
Sbjct: 200 LPEEFDARIRWS---GLVHGVRDQGDCANSWAFSTAAVASDRLSIQSRGVDKVELSPQDL 256
Query: 246 VACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
++C C GG P WRF + G V+ E C PY
Sbjct: 257 MSCLNGGRRVVCQGGHPDRGWRFLLNYGGVS-------EECYPY 293
>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/90 (46%), Positives = 57/90 (63%), Gaps = 1/90 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
+ M ++Y++GP+ F +Y DF YKSGVY+ G +G HA +++GWG + YWL+
Sbjct: 240 DIMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQITGRMVGGHAAKLIGWGTSDAGEDYWLL 299
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFN 163
AN WN WGD G FKI+RG NE IE N
Sbjct: 300 ANQWNRGWGDDGYFKIIRGTNECGIEGDVN 329
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 56/103 (54%), Gaps = 6/103 (5%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR KW C ++ I DQ +CG+CWA + DR CI + +S +V
Sbjct: 97 LPKEFDARSKWSGCSTIGKILDQGHCGACWAFGAVECLQDRFCIHHS--VNVSLSVNDLV 154
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
AC GC+GG+P AW+++ NGVVT + Q GCQ
Sbjct: 155 ACCGFLCGDGCDGGYPIFAWQYFVENGVVTDECDPFFDQVGCQ 197
>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
Length = 443
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 59/93 (63%), Gaps = 8/93 (8%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWGVEND---- 127
+ M++I GP+ A VY DF YKSG+Y+H+ + G H+VR++GWG E
Sbjct: 340 DIMQEILTSGPVQATMRVYQDFFIYKSGIYRHSRSAELHDSGYHSVRIIGWGEERSYRGP 399
Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ YWLVANSW +WGD+G FKI +G NE +IE
Sbjct: 400 PLKYWLVANSWGYNWGDNGLFKIQKGTNECEIE 432
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 49/92 (53%), Gaps = 3/92 (3%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ LPR FD+R +W + I DQ CG+ WAVS A+ SDR I S G ++SA
Sbjct: 199 DPDALPREFDSRTRWSR--DISGIHDQGWCGASWAVSTADVASDRYSIMSKGAEAPELSA 256
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVV 273
Q +++C GC GG+ AW F G+V
Sbjct: 257 QQLLSCNNRGQQGCRGGYLDRAWLFMRKFGLV 288
>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
Length = 387
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/89 (44%), Positives = 56/89 (62%), Gaps = 4/89 (4%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF---GDSIGLHAVRVLGWGVEND-IPY 130
+ M +I+ GP+ A +VY DF Y G+Y+H G +G H+V+++GWG E+D Y
Sbjct: 279 DIMAEIFMSGPVQATLTVYRDFFSYSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKY 338
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
W+ NSW WG+HG F+ILRG NE IE
Sbjct: 339 WIATNSWGTWWGEHGNFRILRGSNECGIE 367
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 54/102 (52%), Gaps = 9/102 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LPR+F++ +KW + + DQ CGS W +S A+ SDR I S G Q+S Q+I+
Sbjct: 142 LPRSFNSIDKWAS--YISDVLDQGWCGSSWVISTASVASDRFAIQSRGKEVIQLSPQNIL 199
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+CT GCNGG AWR+ GVV E C PY
Sbjct: 200 SCTRRQQGCNGGHLDAAWRYLHKQGVV-------DESCYPYV 234
>gi|294937366|ref|XP_002782055.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239893340|gb|EER13850.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 159
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 57/90 (63%), Gaps = 1/90 (1%)
Query: 65 FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
F + +P+ N ++I+ +GP++ + S+Y D YK+GVY H G G+H ++++GWGV
Sbjct: 59 FGRLPAIPQ-NIKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV 117
Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
E+ YWL NSWN+ WGDHG K+ G
Sbjct: 118 ESGQDYWLAVNSWNEEWGDHGMIKLAVGRT 147
>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
Length = 179
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 61/103 (59%), Gaps = 2/103 (1%)
Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
A+SDRLCI S+G F +SA +++C +C +GC+GG+P +AW FW +G+VTGG
Sbjct: 2 AVEAMSDRLCIHSSGAFNKSLSAVDLLSCCKDCGYGCDGGFPPMAWDFWKTHGIVTGGSK 61
Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNP 321
GC+PY C+HH QG C TP+C ++C P
Sbjct: 62 EEPAGCRPYPFPKCQHHSQGHYPPCPRR-IYPTPKCVKHCDTP 103
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/52 (51%), Positives = 39/52 (75%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI 128
M++I +GP+ A F V+ DF +YKSG+Y H +G S+G HA+R+LGWG EN +
Sbjct: 128 MKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEENGV 179
>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 300
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 57/84 (67%), Gaps = 1/84 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
M+ + GPL F VY+DF+ Y+SGVYQH +G G HAV ++G+G ++D + YW++ N
Sbjct: 207 MKALSTSGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIRN 266
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SW WG+ G F+++RG N+ IE
Sbjct: 267 SWGPDWGEDGYFRMIRGINDCSIE 290
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 49/101 (48%), Gaps = 9/101 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD RE++P C + + DQ CGSCWA S DR C+A + S Q++V
Sbjct: 75 VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+C CNGGW W+F G T + C PY
Sbjct: 133 SCDHGDMACNGGWLPNVWKFLTKTGTTT-------DECVPY 166
>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
Length = 469
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 69/121 (57%), Gaps = 18/121 (14%)
Query: 57 TSIPLSHYFKKAHMVPRC-----------NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
T+ P + +K++ + RC N M++I + GP+ AI VY DF YK G+Y+
Sbjct: 339 TNGPCPNALEKSNRLYRCASHYRVSSKETNIMKEIMDKGPVQAIMKVYEDFFLYKEGIYR 398
Query: 106 HN--FGDSIGLHAVRVLGWGVEND-----IPYWLVANSWNDHWGDHGTFKILRGENEADI 158
H+ G H+V++LGWG D +W+ ANSW WG++G F+ILRG+NE DI
Sbjct: 399 HSQKAGSKWKTHSVKLLGWGALADKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDI 458
Query: 159 E 159
E
Sbjct: 459 E 459
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 52/95 (54%), Gaps = 3/95 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
P F A WPE + DQ NCG+ WA S A+ +DR+ I S G T +S Q+++
Sbjct: 222 FPVFFAATYAWPE--WIHDPLDQRNCGASWAFSTASVAADRIAIHSEGQITDNLSVQNLI 279
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNS 280
+C T N GCNGG AWR+ +GVV+ Y S
Sbjct: 280 SCDTRNQHGCNGGNIDSAWRYLKTHGVVSYACYPS 314
>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 300
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 57/84 (67%), Gaps = 1/84 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
M+ + GPL F VY+DF+ Y+SGVYQH +G G HAV ++G+G ++D + YW++ N
Sbjct: 207 MKALSTTGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIRN 266
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SW WG+ G F+++RG N+ IE
Sbjct: 267 SWGPDWGEDGYFRMIRGINDCSIE 290
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 50/101 (49%), Gaps = 9/101 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD RE++P C + + DQ CGSCWA S DR CIA + S Q++V
Sbjct: 75 VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCIAGLDKKPVKYSPQYVV 132
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+C CNGGW AW+F G T + C PY
Sbjct: 133 SCDHGNMACNGGWLPNAWKFLTKTGTTT-------DECVPY 166
>gi|239788200|dbj|BAH70790.1| ACYPI000013 [Acyrthosiphon pisum]
Length = 165
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 36/75 (48%), Positives = 53/75 (70%), Gaps = 1/75 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+PR FDAR +W C ++ + DQ +CGSCWA++ ++A +DRLC+A+NG F +SA+ I
Sbjct: 88 IPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 147
Query: 247 ACTPNC-WGCNGGWP 260
C C +GCNGG+P
Sbjct: 148 FCCHTCGFGCNGGYP 162
>gi|294956046|ref|XP_002788796.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239904363|gb|EER20592.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 130
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 57/88 (64%), Gaps = 1/88 (1%)
Query: 65 FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
F + +P+ N ++I+ +GP++ + S+Y D YK+GVY H G G+H ++++GWGV
Sbjct: 30 FGRLPAIPQ-NIKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV 88
Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRG 152
E+ YWL NSWN+ WGDHG K+ G
Sbjct: 89 ESGQDYWLAVNSWNEEWGDHGMIKLAVG 116
>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
saltator]
Length = 443
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 59/93 (63%), Gaps = 8/93 (8%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWGVEND---- 127
+ M++I GP+ A VY DF YK+GVY+H+ + G H++R++GWG E
Sbjct: 340 DIMQEILTSGPVQATMRVYQDFFVYKNGVYRHSRSAELHDSGYHSMRIIGWGEEPSYRGP 399
Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ YWLVANSW HWG++G F+I RG NE +IE
Sbjct: 400 PLKYWLVANSWGRHWGENGLFRIQRGTNECEIE 432
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ LPR FDAR +WP + I DQ CG+ WAVS A+ SDR I S G ++SA
Sbjct: 199 DPDALPREFDARTRWPR--DISGIHDQGWCGASWAVSTADVASDRFAIMSKGAEDVELSA 256
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVV 273
QH+++C GC GG+ AW F G+V
Sbjct: 257 QHLLSCNNRGQQGCRGGYLDRAWLFMRKFGLV 288
>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 298
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 55/84 (65%), Gaps = 1/84 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVAN 135
M+ + GPL F+VY+DF+ Y+ GVYQH +G G HAV ++G+G E D+ YW++ N
Sbjct: 205 MKALATGGPLQTAFTVYSDFMYYEGGVYQHTYGRVEGGHAVEMVGYGTDEYDVDYWIIRN 264
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SW WG+ G F+I+R NE IE
Sbjct: 265 SWGPDWGEDGYFRIIRMTNECGIE 288
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 52/108 (48%), Gaps = 9/108 (8%)
Query: 180 GCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ 239
G +A P +FD RE++P C + + DQ CGSCWA S ++ DR C A +
Sbjct: 67 GTVSATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVK 124
Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
S Q++V+C C+GGW WRF G T + C PY
Sbjct: 125 YSPQYVVSCDRGDMACDGGWLPSVWRFLTKTGTTT-------DECVPY 165
>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
Length = 193
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 65/117 (55%), Gaps = 3/117 (2%)
Query: 46 KKKKKKRLYLPTSIPLSHYFKKAHMVP---RCNAMRQIYEHGPLVAIFSVYADFLQYKSG 102
+++ + P S +F KAH + +I +GP++A F +Y DF YKSG
Sbjct: 77 EERCTSNITWPISYKQVKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSG 136
Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+Y H GD G +++GWGV+N +PYWL + W +G++G +ILRG NE IE
Sbjct: 137 IYVHTAGDQEGGMDTKIIGWGVDNGVPYWLCVHQWGTDFGENGFMRILRGVNEVHIE 193
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 50/87 (57%), Gaps = 3/87 (3%)
Query: 253 WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTP 312
WGC+G WP+ ++W +G+ TGG+Y+ Q GC+PYT+ PC+ + G TP
Sbjct: 16 WGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKTYPNGTTSVPCPG-YHTP 74
Query: 313 ECKQNCY-NPSYESTYRFDLKKGKKAH 338
C++ C N ++ +Y+ +K KAH
Sbjct: 75 VCEERCTSNITWPISYK-QVKHFGKAH 100
>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 298
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 56/84 (66%), Gaps = 1/84 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVAN 135
M+ + GPL F+VY+DF+ Y+ GVYQH +G + G HAV ++G+G E D+ YW++ N
Sbjct: 205 MKALATGGPLQTAFTVYSDFMYYQGGVYQHVYGRAEGGHAVEMVGYGTDEYDVDYWIIRN 264
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SW WG+ G F+I+R NE IE
Sbjct: 265 SWGPDWGEDGYFRIIRMTNECGIE 288
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 54/108 (50%), Gaps = 9/108 (8%)
Query: 180 GCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ 239
G +A +P +FD RE++P C + + DQ CGSCWA S ++ DR C+A +
Sbjct: 67 GTVSATQVPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCVAGLDKKAVR 124
Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
S Q++V+C C+GGW WRF G T + C PY
Sbjct: 125 YSPQYVVSCDRGDMACDGGWLPSVWRFLVKTGTTT-------DECVPY 165
>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 455
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 51/72 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I+++GP+ A+ ++Y DF YKSGVY H G + H ++++GWGVE+ YWL N+W
Sbjct: 320 QEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGVESGQEYWLAVNAW 379
Query: 138 NDHWGDHGTFKI 149
N+ WGDHG K+
Sbjct: 380 NEEWGDHGMIKL 391
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 96/220 (43%), Gaps = 21/220 (9%)
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLET--MGCQNA--KGLPRN 190
NS W +G + D+ G +N +S+ D+ E +G N LP +
Sbjct: 89 NSMQQSWTASKDQPPFKGMSIKDLPAGCSNDTMFSSTLDEGGENRLLGPTNPVLTTLPSS 148
Query: 191 FDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC- 248
FDAR+K+ C + H+ +Q C +CWA + +DR+CI S G T +S ++ +C
Sbjct: 149 FDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILSLGYLTSCC 208
Query: 249 -----TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQE------GCQPYTLAPCEH--H 295
P GC G F ++G+VTGG+Y E GC PY C H
Sbjct: 209 NRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEYKPPEELGNDDGCWPYPFPKCNHVPG 268
Query: 296 VQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
++ C + L P C C N +Y ++ + D + K
Sbjct: 269 LESKYPRCAQVRDL--PACATTCPNKAYGTSMQKDTHRAK 306
>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
Length = 313
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/90 (48%), Positives = 53/90 (58%), Gaps = 5/90 (5%)
Query: 76 AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIG---LHAVRVLGWGVENDIPYWL 132
A IY +GP++A+F +Y D YKSGVY + DS HA RV+GWGVE+ + YWL
Sbjct: 164 AKADIYLNGPIIAVFDLYTDIYNYKSGVYIKS--DSATYKETHAGRVIGWGVEDGVQYWL 221
Query: 133 VANSWNDHWGDHGTFKILRGENEADIEMGF 162
ANSW WG G FKI G NE E F
Sbjct: 222 AANSWGTGWGQQGLFKIRSGTNEVGFEANF 251
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 57/107 (53%), Gaps = 11/107 (10%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSN-CGSCWAVSVANAISDRLCIASNGYFTGQIS 241
+A LP +FD+R+KW +C S + DQ C SCWA++ ++DRLC+AS G +S
Sbjct: 29 DASNLPASFDSRQKWSDCFS--PVRDQGQKCSSCWAMTATGVLADRLCVASGGKVKKVLS 86
Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
Q ++ C N GC GG ++ NGVVT E C+ Y
Sbjct: 87 PQELIDCDRNGNLGCGGGRLDTPLAYFRDNGVVT-------EKCESY 126
>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
Length = 257
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/103 (46%), Positives = 62/103 (60%), Gaps = 10/103 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P++FDAR +WP C + I +Q CGSCWA S + +SDRLCIASNG +S Q +V
Sbjct: 31 IPQSFDARTQWPNC--IHPILNQEQCGSCWAFSASEVLSDRLCIASNGKTGVVLSPQALV 88
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+C GCNGG PQLAW + +G+ T GC PYT
Sbjct: 89 SCDIFGNQGCNGGIPQLAWEYMELHGIPT-------YGCFPYT 124
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 32/75 (42%), Positives = 46/75 (61%), Gaps = 3/75 (4%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVE--NDIPYWLVA 134
+ I + GP+ VY+DF+ Y SGVY G S +G HA++++GWG + ++ YW+VA
Sbjct: 165 QDIMKFGPIQGTMEVYSDFMSYTSGVYTMTPGSSLLGGHAIKIVGWGFDQASNQNYWIVA 224
Query: 135 NSWNDHWGDHGTFKI 149
NSW WG G F I
Sbjct: 225 NSWGPSWGIDGFFWI 239
>gi|294895531|ref|XP_002775206.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239881224|gb|EER07022.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 130
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 57/88 (64%), Gaps = 1/88 (1%)
Query: 65 FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
F + +P+ N ++I+ +GP++ + S+Y D YK+GVY H G G+H ++++GWGV
Sbjct: 30 FGRLPAIPQ-NIKQEIFTNGPVIGMLSLYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV 88
Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRG 152
E+ YWL NSWN+ WGDHG K+ G
Sbjct: 89 ESGQDYWLAVNSWNEEWGDHGMIKLAVG 116
>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
Length = 353
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 45/96 (46%), Positives = 61/96 (63%), Gaps = 4/96 (4%)
Query: 67 KAHMVPRCNAMRQIYEHGPLVAIFSV--YADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
+ H P + M ++Y++GP+ F+ DF YKSGVY+H G +G HAV+++GWG
Sbjct: 233 RVHSNPH-DIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGT 291
Query: 125 EN-DIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ YWL+AN WN WGD G FKI+RGENE IE
Sbjct: 292 SDAGEDYWLLANQWNRGWGDDGYFKIIRGENECGIE 327
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 70/135 (51%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR +W C ++ +I DQ +CG+CWA + A+ DR CI N + +S ++
Sbjct: 97 LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLN--MSVSLSVNDLL 154
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR++ +GVVT E C PY C+H P
Sbjct: 155 ACCGFLCGSGCNGGYPISAWRYFRRSGVVT-------EECDPYFDQTGCQHPGCEP---- 203
Query: 304 TLLGKLKTPECKQNC 318
TP+C++ C
Sbjct: 204 ----AYPTPKCQRKC 214
>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 200
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 46/96 (47%), Positives = 57/96 (59%), Gaps = 7/96 (7%)
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFW 267
DQS CGSCWA V A +DRLCI S+G FT +SA + ACT +GC GG P AW +
Sbjct: 1 DQSACGSCWAFGVTEAFNDRLCIKSDGAFTELLSAGEMNACTLF-FGCGGGDPYSAWSWV 59
Query: 268 GHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQ 297
G+ TGGDY ++ +GC PY PC HH+
Sbjct: 60 HDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHIN 95
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 38/74 (51%), Positives = 51/74 (68%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
+A I GP+ A F+VY DFL Y+SGVY+H G +G HAV+++GWG ++ YWL
Sbjct: 127 DAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAV 186
Query: 135 NSWNDHWGDHGTFK 148
NSWN+ WGDHG F+
Sbjct: 187 NSWNEDWGDHGLFR 200
>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
Length = 463
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 68/117 (58%), Gaps = 12/117 (10%)
Query: 55 LPTSIPLSHYFKKAHMVPRCN---AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
LPT + ++ +K N M +I +HGP+ AI V+ DF YKSG+Y+H+ S
Sbjct: 299 LPTKVDRTNMYKMGPAFSLNNETDIMIEIKKHGPVQAILRVHRDFFSYKSGIYRHSAASS 358
Query: 112 IG-----LHAVRVLGWGVEND----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G H+VR++GWG E + YW+ NSW WG++G F+I+RG+NE +IE
Sbjct: 359 AGDERAGYHSVRLIGWGEERNGYETTKYWVAVNSWGRWWGENGRFRIVRGQNECEIE 415
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 50/104 (48%), Gaps = 9/104 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDA WP + + DQ CGS WA+S A+ SDR I S G Q++ Q I+
Sbjct: 186 LPTHFDATTYWPG--FIGEVKDQGWCGSSWALSTASVASDRFAILSKGREIVQLAPQQII 243
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
+C GC+GG AW + G V + C PY A
Sbjct: 244 SCVRRSQGCSGGHLDTAWNYVRKVGTV-------NDECYPYISA 280
>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Apis mellifera]
Length = 439
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 56/94 (59%), Gaps = 9/94 (9%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWG--VEND-- 127
+ MR+I GP+ A VY DF Y+SG+Y H + G H+VR++GWG + D
Sbjct: 334 DIMREILTSGPVQATMKVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGWGEDISTDSG 393
Query: 128 --IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
I YWLV NSW WG++G F+I RG NE DIE
Sbjct: 394 LPIKYWLVVNSWGQEWGENGLFRIRRGINECDIE 427
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ + LPR FDAR +W + + DQ CG+ WA+S A SDR + S G + +SA
Sbjct: 193 DPESLPREFDARTRWRR--QISGVDDQGWCGASWAISTAQVASDRFAVMSKGTDSVLLSA 250
Query: 243 QHIVACTPNCW-GCNGGWPQLAWRFWGHNGVV 273
QH+++C GC+GG+ AW F G+V
Sbjct: 251 QHLLSCNKKGQRGCDGGYLDRAWLFMRKFGLV 282
>gi|114153242|gb|ABI52787.1| cathepsin B-like protein [Argas monolakensis]
Length = 91
Score = 88.2 bits (217), Expect = 5e-15, Method: Composition-based stats.
Identities = 41/78 (52%), Positives = 51/78 (65%), Gaps = 1/78 (1%)
Query: 83 HGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVRVLGWGVENDIPYWLVANSWNDHW 141
H V + V+ DF + V + D + G HA+R++GWGVE D+PYWLVANSWN W
Sbjct: 4 HSAGVRLSPVFTDFGHLQGQVCTSDTVDVLMGGHAIRIIGWGVEEDVPYWLVANSWNREW 63
Query: 142 GDHGTFKILRGENEADIE 159
GD+G FKILRG NE IE
Sbjct: 64 GDNGYFKILRGSNECGIE 81
>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 228
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 53/77 (68%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I+++GP+ A+ ++Y DF YKSGVY H G + H ++++GWGVE+ YWL N+W
Sbjct: 139 QEIFDNGPVAAMMTLYEDFRYYKSGVYVHKTGQLLAAHTLKLIGWGVESGQEYWLAMNAW 198
Query: 138 NDHWGDHGTFKILRGEN 154
N+ WGDHG K+ G+
Sbjct: 199 NEEWGDHGMIKLAVGKT 215
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 54/126 (42%), Gaps = 16/126 (12%)
Query: 224 ISDRLCIASNGYFTGQISAQHIVAC------TPNCWGCNGGWPQLAWRFWGHNGVVTGGD 277
+DR+CI S G T +S ++ +C P GC G F ++G+VTGG+
Sbjct: 2 FNDRVCIKSGGKTTDILSLGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGE 61
Query: 278 YNSQE------GCQPYTLAPCEH--HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRF 329
Y E GC PY C H ++ C + L P C C N +Y ++ +
Sbjct: 62 YKPPEKLGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRDL--PACATTCPNKAYGTSMQK 119
Query: 330 DLKKGK 335
D + K
Sbjct: 120 DTHRAK 125
>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
Length = 351
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 61/101 (60%), Gaps = 9/101 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD R KWP+C LR I DQ+NCG+CWA + + ++DR+CI +NG ++S Q +V
Sbjct: 120 IPLEFDFRTKWPQC--LRKIRDQANCGACWAFTGSGMLADRICILTNGTINEELSPQDMV 177
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
C+ + +GC GG+ A + + GV ++E C PY
Sbjct: 178 DCSHDNFGCEGGYLMNALDYLMNEGV-------TKESCTPY 211
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 56/101 (55%), Gaps = 8/101 (7%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGW-GVENDIPYWLVANS 136
R + ++GPL+ +VY DF+ Y +G Y+ G+ +G HAV+++GW + WL+ N
Sbjct: 250 RDLMQNGPLMVGLTVYEDFINYATGDYKFVAGEIVGGHAVKLMGWRTTQKGQTSWLIQNQ 309
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLE 177
WND WG+ G IL ENE I+ + + D DLE
Sbjct: 310 WNDDWGEQGFGYIL--ENEVGID-----SIGVGCTPDIDLE 343
>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
Length = 289
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 40/75 (53%), Positives = 51/75 (68%), Gaps = 3/75 (4%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN---DIPYWLVA 134
+ I +GP+ A FSVY DF YKSGVY+H G G HA++++GWGV + D PYW+VA
Sbjct: 215 KDIQANGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIKIVGWGVTSDGKDTPYWIVA 274
Query: 135 NSWNDHWGDHGTFKI 149
NSWN +WG G F I
Sbjct: 275 NSWNTNWGQEGFFWI 289
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 55/99 (55%), Gaps = 9/99 (9%)
Query: 190 NFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT 249
+FDAR KW +C + I DQ CGSCWA S + +SDR CIASNG +S ++++ C
Sbjct: 86 SFDARTKWGKC--VHPIRDQQQCGSCWAFSASEVLSDRFCIASNGSVDVVLSPEYMLQCD 143
Query: 250 PNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+GC+GG+ AW F G+ + C PYT
Sbjct: 144 STDYGCDGGYLNNAWAFLAGTGI-------PSDKCDPYT 175
>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 38/78 (48%), Positives = 52/78 (66%)
Query: 80 IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
I + GP+ F+VYADF YKSG+Y H G + G HAV++LGWG + YW+VANSW +
Sbjct: 212 IQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGSENYWIVANSWGE 271
Query: 140 HWGDHGTFKILRGENEAD 157
WG+ G F I +G++ D
Sbjct: 272 SWGEKGFFNIRQGDSGID 289
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 10/117 (8%)
Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
S D+ NA +P +FD+R +W C + I DQ+ CGSCWA + + ++SDR CI
Sbjct: 63 SHSSDIPAFTQINA-AVPDSFDSRTQWQGC--VHPIRDQAQCGSCWAFAASESLSDRFCI 119
Query: 231 ASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
AS G +S Q +V+C N +GC+GG+ LAW++ GV + + C+PY
Sbjct: 120 ASQGKVNVVLSPQDMVSCDTNNYGCDGGYLNLAWQYLEKKGVAS-------DSCEPY 169
>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
Length = 433
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/89 (48%), Positives = 54/89 (60%), Gaps = 4/89 (4%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVE-NDIPY 130
+ M +IY GP+ A VY DF Y SGVY+ N G G H+V+++GWG E N Y
Sbjct: 326 DIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKY 385
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
W+ ANSW WG+ G F+ILRG NE IE
Sbjct: 386 WIAANSWGPWWGERGYFRILRGSNECGIE 414
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 53/103 (51%), Gaps = 9/103 (8%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
GLP F+A EKW + + DQ CGS W +S + SDR I S G Q+SAQ+I
Sbjct: 188 GLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQNI 245
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
++CT GC GG AWR+ GVV E C PYT
Sbjct: 246 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DESCYPYT 281
>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
Length = 433
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/89 (48%), Positives = 54/89 (60%), Gaps = 4/89 (4%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVE-NDIPY 130
+ M +IY GP+ A VY DF Y SGVY+ N G G H+V+++GWG E N Y
Sbjct: 326 DIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKY 385
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
W+ ANSW WG+ G F+ILRG NE IE
Sbjct: 386 WIAANSWGPWWGERGYFRILRGSNECGIE 414
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 53/103 (51%), Gaps = 9/103 (8%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
GLP F+A EKW + + DQ CGS W +S + SDR I S G Q+SAQ+I
Sbjct: 188 GLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQNI 245
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
++CT GC GG AWR+ GVV E C PYT
Sbjct: 246 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DESCYPYT 281
>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 38/78 (48%), Positives = 52/78 (66%)
Query: 80 IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
I + GP+ F+VYADF YKSG+Y H G + G HAV++LGWG + YW+VANSW +
Sbjct: 212 IQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGSENYWIVANSWGE 271
Query: 140 HWGDHGTFKILRGENEAD 157
WG+ G F I +G++ D
Sbjct: 272 SWGEKGFFNIRQGDSGID 289
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 10/117 (8%)
Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
S D+ NA +P +FD+R +W C + I DQ+ CGSCWA + + ++SDR CI
Sbjct: 63 SHSSDIPAFTQINA-AVPDSFDSRTQWQGC--VHPIRDQAQCGSCWAFAASESLSDRFCI 119
Query: 231 ASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
AS G +S Q +V+C N +GC+GG+ LAW++ GV + + C+PY
Sbjct: 120 ASQGKVNVVLSPQDMVSCDTNNYGCDGGYLNLAWQYLEKKGVAS-------DSCEPY 169
>gi|290987261|ref|XP_002676341.1| predicted protein [Naegleria gruberi]
gi|284089943|gb|EFC43597.1| predicted protein [Naegleria gruberi]
Length = 218
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 54/85 (63%), Gaps = 4/85 (4%)
Query: 80 IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV----ENDIPYWLVAN 135
I G L A +Y DF+QY+ GVY+H G+ + H+VR++GWG+ + IPYW+ N
Sbjct: 124 IMNGGSLAASLDIYRDFVQYRGGVYRHLVGNYMFTHSVRIVGWGITSPQQGSIPYWICGN 183
Query: 136 SWNDHWGDHGTFKILRGENEADIEM 160
+W + WG G F ILRG NE +IE+
Sbjct: 184 NWTEEWGMQGWFWILRGSNECNIEL 208
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 50/99 (50%), Gaps = 14/99 (14%)
Query: 200 CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGW 259
C L I D+ CG CWA VA +SDR C++S +S Q++++C N GC+ G+
Sbjct: 1 CKQLSLIRDEQQCG-CWAFVVAEVVSDRFCVSSKTKVNEVLSPQYLISCDSNNGGCSYGY 59
Query: 260 PQLAWRFWGHNGVVTGGDYNSQEGCQPYT------LAPC 292
A++F + G+VT E C P+ + PC
Sbjct: 60 FDTAFQFVENQGIVT-------ENCFPFVSGEGNYIPPC 91
>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 64/122 (52%), Gaps = 4/122 (3%)
Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
A++DRLCI SN IS+ +++C +C +GC+GG+P AW FW NG+VTGG
Sbjct: 2 AVEAMTDRLCIHSNATIKKHISSTDLLSCCESCGFGCHGGFPPRAWDFWMENGLVTGGSK 61
Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
+ GC+ Y C HH +GP C TP C + C P E Y D K K ++
Sbjct: 62 ENPSGCRSYPFPKCNHHGKGPDAPCPEK-IFPTPACNKTCDTP--EVNYILDKTKAKSSY 118
Query: 339 MV 340
V
Sbjct: 119 NV 120
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 31/52 (59%), Positives = 40/52 (76%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI 128
M++I ++GP+ A F VY DFL Y+SGVY H+FG IG HA+R+LGWG EN I
Sbjct: 128 MKEIMQNGPVEAAFEVYEDFLHYESGVYFHSFGRMIGGHAIRMLGWGEENGI 179
>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
Length = 426
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 59/88 (67%), Gaps = 3/88 (3%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSI--GLHAVRVLGWGVENDIPYW 131
+ M I E GP+ A+ +V+ DF Y G+Y+ + +GD+ GLH+VR++GWG + YW
Sbjct: 322 DIMYDIMESGPVHAVMTVHQDFFHYHDGIYRRSPYGDNTLQGLHSVRIVGWGEDRGDKYW 381
Query: 132 LVANSWNDHWGDHGTFKILRGENEADIE 159
+VANSW WG++G F+I RG NE+ IE
Sbjct: 382 VVANSWGCDWGENGYFRIARGSNESGIE 409
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 54/104 (51%), Gaps = 10/104 (9%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
PR+FDAR +WP + + DQ CGS WAV++A SDR I SNG +S Q +++
Sbjct: 187 PRDFDARRRWPN--FISPVLDQGWCGSDWAVTIATVASDRFAIQSNGAERMVLSPQVLLS 244
Query: 248 C-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
C GC GG +AW F +G+V E C PY A
Sbjct: 245 CNIRRQQGCRGGHIDVAWNFARGHGLV-------DEECFPYKAA 281
>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
Length = 526
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 12/93 (12%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHN------FGDSI--GLHAVRVLGWGVEND--- 127
++ +GP+ A F V+ DF Y GVYQH+ S+ G H+VRVLGWGV++
Sbjct: 404 ELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGR 463
Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
I YWL ANSW WG+ G FKILRGEN +IE
Sbjct: 464 PIKYWLCANSWGTQWGEDGYFKILRGENHCEIE 496
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 71/149 (47%), Gaps = 16/149 (10%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR+KW + IADQ +CGS WAVS SDRL I S G +S+Q ++
Sbjct: 258 LPEHFDARDKWGHL--IHPIADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLL 315
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH--HVQGPLQNC 303
+C + GC GG+ AW + GVV GD+ C PY H P ++
Sbjct: 316 SCNQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYPYVSGQSREPGHCLIPKRDY 368
Query: 304 TLLGKLKTPECKQNC----YNPSYESTYR 328
T L+ P Q+ P Y+ + R
Sbjct: 369 TNRQGLRCPSGSQDSTAFKMTPPYKVSSR 397
>gi|300176576|emb|CBK24241.2| unnamed protein product [Blastocystis hominis]
Length = 563
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 58/105 (55%), Gaps = 1/105 (0%)
Query: 56 PTSIPLSHYFKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
P S +Y V +AM QIY +GP+ SV D Y++G++ N S+
Sbjct: 138 PISSYHKYYISSFDAVDGISAMMDQIYYNGPITCKISVTNDLQNYRNGIFSRNTSSSLYD 197
Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
H V ++GWG EN+ PYW+V NSW WG+ G F+ILRG N IE
Sbjct: 198 HYVNIIGWGSENETPYWIVRNSWGSSWGEDGYFRILRGVNLLGIE 242
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 45/82 (54%), Gaps = 1/82 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG-VENDIPYWLVANSW 137
+IYE GP+ V F +Y GV+ +G H V V+GWG E + YW+ N+W
Sbjct: 471 EIYERGPITCFMVVTEQFQRYTGGVFVEEDHHYLGGHIVEVVGWGRTEEGVEYWIGRNNW 530
Query: 138 NDHWGDHGTFKILRGENEADIE 159
++WG+ G F+I+ G N IE
Sbjct: 531 GENWGEKGWFRIMMGGNNLLIE 552
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 47/108 (43%), Gaps = 12/108 (11%)
Query: 183 NAKGLPRNFDAR--EKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI 240
N LP +D R + +R C +CWA + +A+SDRL + S G + +
Sbjct: 325 NLSSLPTQYDIRSLDGVDYSTPIRTQRAPQFCNACWAQAAVSALSDRLQLQSRGAWPMVV 384
Query: 241 -SAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
S Q +V C C+GG P +RF + + S E CQ Y
Sbjct: 385 LSTQMVVNCATG--SCDGGDPGEVYRFAYMSSI-------SDESCQVY 423
>gi|294891885|ref|XP_002773787.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878991|gb|EER05603.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 234
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 51/72 (70%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I+++GP+ A+ ++Y DF YKSGVY H G + H ++++GWGVE+ YWL N+W
Sbjct: 107 QEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGVESGQEYWLAVNAW 166
Query: 138 NDHWGDHGTFKI 149
N+ WGDHG K+
Sbjct: 167 NEEWGDHGMIKL 178
>gi|253747738|gb|EET02294.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 305
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 39/83 (46%), Positives = 52/83 (62%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M + GP+ F V+ DFL Y G+Y +G SIG HAV ++G+G N+ YW+V NS
Sbjct: 214 MTSLLNEGPVQTGFYVHEDFLYYVGGIYHKTYGSSIGGHAVLIVGYGSMNNHDYWIVRNS 273
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W WG++G F+ILRG NE IE
Sbjct: 274 WGSDWGENGYFRILRGTNECGIE 296
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 44/101 (43%), Gaps = 9/101 (8%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P D R+ PEC DQS+C C+A + A+S R CIA +SAQH+V+
Sbjct: 82 PDRLDYRQTHPEC--FFEPEDQSDCSCCYAFATLGALSTRRCIAKLDASVVPLSAQHMVS 139
Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
C GC GG +W F G + C PY
Sbjct: 140 CDHGEAGCQGGGFNTSWAFLETEGAI-------MRDCLPYV 173
>gi|340382603|ref|XP_003389808.1| PREDICTED: hypothetical protein LOC100632176 [Amphimedon
queenslandica]
Length = 570
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 65/228 (28%), Positives = 96/228 (42%), Gaps = 34/228 (14%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
+IY GP+ QY G++ + H V V+GWGVEN + YW+V NSW
Sbjct: 194 EIYARGPIGCGIDATEKLEQYTGGIFSERKLLPMINHEVSVVGWGVENGVEYWIVRNSWG 253
Query: 139 DHWGDHGTFKILRGENEADIEM----GFNNRVEANSSEDDDL-----------------E 177
+WG++G F+I+ ++ IE G E N +
Sbjct: 254 TYWGENGFFRIMMHKDNLAIETECDWGVPLLKEPNKQHQVHKQQQQQQQEYKCSCVKKSD 313
Query: 178 TMGCQNAKGLPRNFDAREKWPECPSLRHIA--DQSN----------CGSCWAVSVANAIS 225
++ P + E P +R+I D S CGSCWA+ +A+S
Sbjct: 314 SVKTHVHTPEPHTYIKLEDIPAAYDIRNINGNDYSTVNRNQHIPQYCGSCWAMGTTSALS 373
Query: 226 DRLCIASNG-YFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGV 272
DR+ + G Y +S Q +V C N GC+GG P A+ + NGV
Sbjct: 374 DRIKLMRKGAYPVINLSPQVLVDCANNSHGCDGGDPTAAYSYIYENGV 421
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 46/84 (54%), Gaps = 1/84 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
+ +I+ GP+ +V + F QY GV+ G H + + GWGV + + YW+ N
Sbjct: 475 LSEIFARGPIACTIAVTSAFEQYTGGVFNDTTGAKSLDHEISIAGWGVTSGGVKYWIGRN 534
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SW +WG+ G F+++RG + +E
Sbjct: 535 SWGTYWGEAGWFRLIRGVDNLGVE 558
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 59/123 (47%), Gaps = 19/123 (15%)
Query: 183 NAKGLPRNFDAREK----WPECPSLRHIADQSNCGSCWAVSVANAISDRLCIA-SNGYFT 237
N LP F +K + P +HI CGSCWA+ +A+SDR+ I +N Y
Sbjct: 47 NLADLPSQFSWEDKDGQNYLTPPRNQHIPQY--CGSCWAMGTTSALSDRISIMRNNTYPM 104
Query: 238 GQISAQHIVACTPNCWG---CNGGWPQLAWRFWGHNGVV--TGGDYNSQEG-CQPYTLAP 291
Q++ Q I+ NC G C GG P + + +G+ T +Y ++ G C P +
Sbjct: 105 VQLATQVII----NCRGGGSCQGGNPGGVYEYIHRHGLPDETCQNYEARNGECTPIEI-- 158
Query: 292 CEH 294
CE+
Sbjct: 159 CEN 161
>gi|308159555|gb|EFO62082.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 305
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 61/106 (57%), Gaps = 3/106 (2%)
Query: 57 TSIPLSHYFKKAHMVPRCN---AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIG 113
T + + ++K A P N M + GP+ F V+ DFL Y G+Y +G S+G
Sbjct: 191 TLVEDAFHYKAASASPLNNYNEIMVSLLADGPVQTGFYVHEDFLYYVGGIYHKVYGSSLG 250
Query: 114 LHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
HAV ++G+G ND YW+V NSW WG++G F+ILRG NE IE
Sbjct: 251 GHAVLIVGYGSMNDHDYWIVRNSWGPDWGENGYFRILRGTNECGIE 296
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 51/121 (42%), Gaps = 15/121 (12%)
Query: 168 ANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDR 227
N +EDD G P D R+ PEC DQ C C+A + A+S R
Sbjct: 68 VNITEDD------LYPPDGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATIGALSTR 119
Query: 228 LCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
CIA +S QH+V+C GC GG + +W F GVV + C PY
Sbjct: 120 RCIAKLDSQAVSLSVQHMVSCDNGEAGCLGGEFESSWAFLETEGVV-------KSDCLPY 172
Query: 288 T 288
T
Sbjct: 173 T 173
>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
protease B2; Flags: Precursor
gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
Length = 300
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 57/84 (67%), Gaps = 1/84 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
M+ + GPL F V++DF+ Y+SGVYQH +G G HAV ++G+G ++D + YW++ N
Sbjct: 207 MKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIKN 266
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SW WG+ G F+++RG N+ IE
Sbjct: 267 SWGPDWGEDGYFRMIRGINDCSIE 290
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 49/101 (48%), Gaps = 9/101 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD RE++P C + + DQ CGSCWA S DR C+A + S Q++V
Sbjct: 75 VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+C CNGGW W+F G T + C PY
Sbjct: 133 SCDHGDMACNGGWLPNVWKFLTKTGTTT-------DECVPY 166
>gi|56757237|gb|AAW26790.1| unknown [Schistosoma japonicum]
Length = 170
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 55/81 (67%), Gaps = 1/81 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C S+ I DQS C S WAVS A+SDR+CI S G + ++SA ++
Sbjct: 90 IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLI 149
Query: 247 ACTPNCW-GCNGGWPQLAWRF 266
+C NC GC+GG+P AW +
Sbjct: 150 SCCENCGSGCDGGFPGPAWDY 170
>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
Length = 473
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 57/94 (60%), Gaps = 9/94 (9%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD----SIGLHAVRVLGWGVENDI-- 128
+ M +I + GP+ A VY DF YKSGVY + + + G H+V++LGWG E +I
Sbjct: 329 DIMEEIMQSGPVQATMKVYQDFFSYKSGVYTKSNTERESSNFGYHSVKILGWGEETNIYG 388
Query: 129 ---PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
YWL ANSW WG++G FKI RG NE +IE
Sbjct: 389 QPIKYWLAANSWGQQWGENGFFKIRRGTNECEIE 422
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 54/105 (51%), Gaps = 9/105 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR KWP + ADQ CG+ WAVS A+ SDR I S G +S QH++
Sbjct: 190 LPNSFDARNKWPG--WISGPADQGWCGASWAVSTASVASDRYAIMSKGLTKVDLSPQHLL 247
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
+C GC GG AW F G+V DY C P+T P
Sbjct: 248 SCNKGQRGCQGGHLSRAWTFIRKFGLVD--DY-----CYPWTGTP 285
>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
Length = 470
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 12/93 (12%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHN------FGDSI--GLHAVRVLGWGVEND--- 127
++ +GP+ A F V+ DF Y GVYQH+ S+ G H+VRVLGWGV++
Sbjct: 348 ELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGR 407
Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
I YWL ANSW WG+ G FKILRGEN +IE
Sbjct: 408 PIKYWLCANSWGTQWGEDGYFKILRGENHCEIE 440
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 56/103 (54%), Gaps = 10/103 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR+KW + +ADQ +CGS WAVS SDRL I S G +S+Q ++
Sbjct: 202 LPEHFDARDKWGHL--IHPVADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLL 259
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+C + GC GG+ AW + GVV GD+ C PY
Sbjct: 260 SCNQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYPYV 295
>gi|308488534|ref|XP_003106461.1| CRE-CPR-5 protein [Caenorhabditis remanei]
gi|308253811|gb|EFO97763.1| CRE-CPR-5 protein [Caenorhabditis remanei]
Length = 153
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 53/84 (63%)
Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
D + + + A +P +FDAR+KW C S+ +I DQS+CGSCWA + A AISDR CIASNG
Sbjct: 70 DEDIVATEVADAIPDSFDARDKWSSCVSINNIRDQSDCGSCWAFAAAEAISDRTCIASNG 129
Query: 235 YFTGQISAQHIVACTPNCWGCNGG 258
+S+Q +++C C G
Sbjct: 130 AVNTLLSSQDLLSCCVGVLSCGNG 153
>gi|294871893|ref|XP_002766082.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239866672|gb|EEQ98799.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 118
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 56/88 (63%), Gaps = 1/88 (1%)
Query: 65 FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
F + +P+ N ++I+ +GP++ ++Y D YK+GVY H G G+H ++++GWGV
Sbjct: 18 FGRLPAIPQ-NIKQEIFTNGPVIGALTIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV 76
Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRG 152
E+ YWL NSWN+ WGDHG K+ G
Sbjct: 77 ESGQDYWLAVNSWNEEWGDHGMIKLAVG 104
>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
echinatior]
Length = 501
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 58/93 (62%), Gaps = 8/93 (8%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWGVEND---- 127
+ M++I GP+ A VY DF YK+G+Y+H+ + G H+VR++GWG E
Sbjct: 398 DIMQEILTSGPVQATMRVYQDFFVYKNGIYRHSQSAELHDSGYHSVRIIGWGEERSYRGP 457
Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ YWLV NSW +WG++G FKI RG NE +IE
Sbjct: 458 PLKYWLVVNSWGYNWGENGLFKIQRGTNECEIE 490
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 56/107 (52%), Gaps = 10/107 (9%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ LPR FD+R +W + ++ DQ CG+ WA+S A+ +DR I S G ++SA
Sbjct: 257 DPDALPREFDSRTRWSR--DISNVHDQGWCGASWAISTADVATDRFSIMSKGAEDAELSA 314
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
QH+++C GC GG+ AW F G+V + C P+T
Sbjct: 315 QHLLSCNNRGQQGCRGGYLDRAWLFMRKFGLV-------DKDCYPWT 354
>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
impatiens]
Length = 445
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 54/96 (56%), Gaps = 11/96 (11%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD---SIGLHAVRVLGWGVEND---- 127
+ M +I GP+ A VY DF Y+SG+Y+H + G H+VR++GWG +
Sbjct: 339 DIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRY 398
Query: 128 ----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
I YWLV NSW WG+ G F+I RG NE DIE
Sbjct: 399 RNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIE 434
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 10/106 (9%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ + LPR FDAR +WP + I DQ CG+ WA+S SDR + S G + +SA
Sbjct: 198 DPESLPREFDARIRWPR--EISDIDDQGWCGASWAISTTRVASDRFALMSKGADSVLLSA 255
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
QH+++C C+GG+ AW + G+V E C P+
Sbjct: 256 QHLLSCNNRGQQACSGGYLDRAWLYMRKFGLV-------DEDCYPW 294
>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
[Tribolium castaneum]
Length = 453
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/115 (40%), Positives = 64/115 (55%), Gaps = 10/115 (8%)
Query: 55 LPTSIPLSHYFKKA---HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN---F 108
LPT++ +K A + + M +I GP+ A VY DF YK G+Y+H+
Sbjct: 319 LPTNVDRRSKYKVAPAYRVGNETDIMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPIST 378
Query: 109 GDSIGLHAVRVLGWGVENDIP----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
D G H+VR++GWG E YW VANSW WG++G F+ILRG NE +IE
Sbjct: 379 NDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEIE 433
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ LPR FD+ KWP + I DQ CGS WA++ A SDR I S G +SA
Sbjct: 201 DPNSLPREFDSEFKWPG--WMSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSA 258
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
QH+++C CNGG+ AW + G+V E C PY+
Sbjct: 259 QHLLSCDRRGQQSCNGGYLDRAWSYIRKIGLV-------DEQCFPYS 298
>gi|294891623|ref|XP_002773656.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
gi|239878860|gb|EER05472.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
Length = 815
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 64/107 (59%), Gaps = 8/107 (7%)
Query: 59 IPLSHYFKKAHMVP-RCNAMRQIYEHGPLVAIFSVYADFLQY----KSGVYQHNFGD-SI 112
+PL H+ + P MR + E G ++ F +A+F ++ + G+Y G I
Sbjct: 525 LPLYHFHPI--LAPNEAVMMRTVQETGSVIVSFRAHANFQEFFMFNRFGLYTTTAGSPEI 582
Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
G HAVR++G+GVE ++P+WL+ NSW D WG+HG F++LRG N IE
Sbjct: 583 GNHAVRIIGFGVEGNVPFWLLMNSWGDDWGEHGCFRMLRGRNLCGIE 629
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 31/44 (70%), Gaps = 3/44 (6%)
Query: 188 PR-NFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
PR +FDAR +WP+CP +A Q CGSC+A+ V+ +DR+C+
Sbjct: 386 PREHFDARIEWPQCPF--PVAMQGMCGSCFAIVVSTVGTDRVCV 427
>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
Length = 404
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 55/88 (62%), Gaps = 3/88 (3%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSI--GLHAVRVLGWGVENDIPYW 131
+ M I GP + I +VY DF Y+ G+Y+H GD + GLH+VR++GWG + + YW
Sbjct: 306 DIMYDIMTSGPALGIMTVYQDFFHYREGIYRHTRHGDQLMRGLHSVRIVGWGEDAEDKYW 365
Query: 132 LVANSWNDHWGDHGTFKILRGENEADIE 159
+VANSW WG+ G F+I RG + IE
Sbjct: 366 IVANSWGTSWGEKGYFRIARGHSGTGIE 393
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 55/101 (54%), Gaps = 10/101 (9%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P FDAR +W + IADQ CGS WAVS+A+ + DR I S G ++S+Q +++
Sbjct: 186 PDEFDARREWY--GYISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQTLLS 243
Query: 248 C-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
C GCNGG +A+ F +G+V+ E C PY
Sbjct: 244 CHLKGQRGCNGGNLDIAFDFVKTHGLVS-------EQCFPY 277
>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
terrestris]
Length = 445
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 54/96 (56%), Gaps = 11/96 (11%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD---SIGLHAVRVLGWGVEND---- 127
+ M +I GP+ A VY DF Y+SG+Y+H + G H+VR++GWG +
Sbjct: 339 DIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRH 398
Query: 128 ----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
I YWLV NSW WG+ G F+I RG NE DIE
Sbjct: 399 HNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIE 434
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 10/106 (9%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ + LPR FDAR +WP + I DQ CG+ WA+S SDR + S G + +SA
Sbjct: 198 DPESLPREFDARIRWPR--EISDIDDQGWCGASWAISATRVASDRFALMSKGADSVLLSA 255
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
QH+++C C+GG+ AW + G+V E C P+
Sbjct: 256 QHLLSCNNRGQQACSGGYLDRAWLYMRKFGLV-------DEDCYPW 294
>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
Length = 812
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 54/88 (61%), Gaps = 2/88 (2%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI--GLHAVRVLGWGVENDIPYWL 132
N ++I HGP+ F+VY F+ YKSGVY + + + G HAV+++GWG E YWL
Sbjct: 466 NMQKEIMTHGPIQVAFNVYKSFMSYKSGVYAKKWYELMPEGGHAVKIVGWGTEGGKDYWL 525
Query: 133 VANSWNDHWGDHGTFKILRGENEADIEM 160
VANSWN WGD G FKI G +++
Sbjct: 526 VANSWNTSWGDEGYFKIAVGAESISLDV 553
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 52/107 (48%), Gaps = 10/107 (9%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
N +P F+A +W ++ I DQ CGSCWA S A +SDR I N +S
Sbjct: 335 DNITDVPSEFNAVTQWKGL--VQPIRDQQQCGSCWAFSAAEVLSDRNAIQHNKA-EPVLS 391
Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+ +V+C GCNGG AW + + G+VT + C PYT
Sbjct: 392 PEDLVSCDRVDQGCNGGNLGTAWTYLKNTGIVT-------DACFPYT 431
>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 39/78 (50%), Positives = 51/78 (65%)
Query: 80 IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
I E GP+ F+VY DF Y SGVY H GD+ G HAV++LGWG + YW+VANSW +
Sbjct: 212 IQESGPVETGFTVYQDFYNYNSGVYHHVTGDAEGGHAVKILGWGKQGLENYWIVANSWGE 271
Query: 140 HWGDHGTFKILRGENEAD 157
WG+ G F I +G++ D
Sbjct: 272 DWGEKGYFNIRQGDSGID 289
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 60/101 (59%), Gaps = 9/101 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+R +W +C + I DQ+ CGSCWA + A ++SDR CIAS G +S Q +V
Sbjct: 78 LPDSFDSRTQWKDC--VHPIRDQAQCGSCWAFAAAESLSDRFCIASQGKVNLVLSPQDMV 135
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+C + +GC GG+ AW++ GV S + C+PY
Sbjct: 136 SCDTSNFGCFGGYLDQAWQYLEQQGV-------SSDSCEPY 169
>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
Length = 462
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 58/94 (61%), Gaps = 9/94 (9%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-----IGLHAVRVLGWGVEND-- 127
+ M +I +HGP+ AI V+ DF YKSG+Y+H+ + G H+VR++GWG E
Sbjct: 321 DIMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHSAASTSADQRAGYHSVRLIGWGEERHGY 380
Query: 128 --IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
YW+ NSW WG++G F+ILRG NE +IE
Sbjct: 381 EVTKYWIAVNSWGTWWGENGRFRILRGSNECEIE 414
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 51/104 (49%), Gaps = 9/104 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDA WP + + DQ CGS WAVS A+ SDR I S G T Q++ Q IV
Sbjct: 185 LPTHFDATNYWPG--FIGKVRDQGWCGSSWAVSTASVASDRFAILSKGRETVQLAPQQIV 242
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
+C GC+GG AW + G V E C PY A
Sbjct: 243 SCVRRSQGCSGGHLDTAWSYLRKVGTV-------NEECYPYISA 279
>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
Length = 323
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/103 (46%), Positives = 59/103 (57%), Gaps = 10/103 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FDARE+WP C + + +Q CGSCWA S + A+SDRLCIAS G +S Q +V
Sbjct: 95 IPSTFDAREQWPGC--VHAVLNQEQCGSCWAFSSSEALSDRLCIASKGQVNVTLSPQALV 152
Query: 247 ACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
AC GCNGG PQLAW + G+ T C PYT
Sbjct: 153 ACDDIGNQGCNGGVPQLAWEYMEWKGLPT-------FECYPYT 188
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 61/107 (57%), Gaps = 8/107 (7%)
Query: 61 LSHYFKKAHMVPRCNAM----RQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLH 115
+++Y K + CN++ +I +GP+V VY DF+ Y SGVY ++ + +G H
Sbjct: 207 MTYYRAKPFSMTTCNSVACIQNEIITYGPVVGTMMVYQDFMSYSSGVYVYDGTAELLGGH 266
Query: 116 AVRVLGWGVE--NDIPYWLVANSWNDHWGD-HGTFKILRGENEADIE 159
A+ ++GWG + + + YW+V NSW+ WG G F I RG N I+
Sbjct: 267 AIEIVGWGTDATSKLDYWIVKNSWSAAWGGLDGYFWIQRGTNMCGID 313
>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
floridanus]
Length = 443
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 59/93 (63%), Gaps = 8/93 (8%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWGVEND---- 127
+ M++I GP+ A VY DF Y+SGVY+H+ + G H+VR++GWG E
Sbjct: 340 DIMQEILTSGPVQATMRVYQDFFVYQSGVYRHSRSAELHDSGYHSVRIIGWGEEPSYRGP 399
Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ YWLVANSW +WG++G F+I +G NE +IE
Sbjct: 400 PLKYWLVANSWGHNWGENGLFRIQKGTNECEIE 432
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 57/107 (53%), Gaps = 10/107 (9%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ LPR F++R +WP + I DQ CG+ WAVS A+ SDR I S G T ++SA
Sbjct: 199 DPDALPREFNSRTRWPR--DISDIHDQGWCGASWAVSTADVASDRFAIMSKGAETVELSA 256
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
QH+++C GC GG+ AW F G+V E C P+T
Sbjct: 257 QHLLSCNNRGQQGCKGGYLDRAWLFMRKFGLV-------DEECYPWT 296
>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
Length = 327
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/115 (40%), Positives = 64/115 (55%), Gaps = 10/115 (8%)
Query: 55 LPTSIPLSHYFKKA---HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN---F 108
LPT++ +K A + + M +I GP+ A VY DF YK G+Y+H+
Sbjct: 193 LPTNVDRRSKYKVAPAYRVGNETDIMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPIST 252
Query: 109 GDSIGLHAVRVLGWGVENDIP----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
D G H+VR++GWG E YW VANSW WG++G F+ILRG NE +IE
Sbjct: 253 NDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEIE 307
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 52/103 (50%), Gaps = 10/103 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LPR FD+ KWP S I DQ CGS WA++ A SDR I S G +SAQH++
Sbjct: 79 LPREFDSEFKWPGWMS--EIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLL 136
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+C CNGG+ AW + G+V E C PY+
Sbjct: 137 SCDRRGQQSCNGGYLDRAWSYIRKIGLV-------DEQCFPYS 172
>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 298
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 54/84 (64%), Gaps = 1/84 (1%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVAN 135
M+ + GPL F+VY+DF+ Y+ GVYQH G G HAV ++G+G E D+ YW++ N
Sbjct: 205 MKALVTGGPLQTAFTVYSDFMYYEGGVYQHMSGRVEGGHAVEMVGYGTDEYDVDYWIIRN 264
Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
SW WG+ G F+I+R NE IE
Sbjct: 265 SWGPDWGEDGYFRIIRMTNECGIE 288
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 47/88 (53%), Gaps = 2/88 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD RE++P C + + DQ +CGSCWA S ++ DR C A S Q++V
Sbjct: 74 VPDSFDFREEYPHC--IPEVVDQGSCGSCWAFSSVASLGDRRCFAGLDKKAVTYSPQYVV 131
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
+C C+GGW Q WRF G T
Sbjct: 132 SCDHGDMACDGGWLQSVWRFLTKTGTTT 159
>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
Length = 331
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 72/134 (53%), Gaps = 14/134 (10%)
Query: 189 RNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC 248
+ FDAR++WP+C ++ + ++ N WA + +DR+CIA+NG + +S + +++C
Sbjct: 89 KEFDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISC 148
Query: 249 TPNCWGCNGGWPQ--LAWRFWGHNGVVTGGD-YNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+ N GW + LAW ++ +G+V+GG YN+ +GCQP + P C L
Sbjct: 149 SGIKASAN-GWVRDGLAWEYFKTHGLVSGGSIYNTNDGCQPSKIPPV----------CNL 197
Query: 306 LGKLKTPECKQNCY 319
K+ C CY
Sbjct: 198 PTKINKRTCVDYCY 211
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 55/92 (59%), Gaps = 2/92 (2%)
Query: 69 HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGVEND 127
H+ P+ + +++ +GP+ A ++Y D +KSGVY + L V+++GWGVEN
Sbjct: 230 HVKPK-DIQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENG 288
Query: 128 IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ YWL+ NSW + WG +G KI RG+ +E
Sbjct: 289 VDYWLLVNSWGNEWGQNGLLKIKRGKYGCAVE 320
>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
Flags: Precursor
gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
Length = 452
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 57/93 (61%), Gaps = 12/93 (12%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHN------FGDSI--GLHAVRVLGWGVEND--- 127
++ +GP+ A F V+ DF Y GVYQH+ S+ G H+VRVLGWGV++
Sbjct: 330 ELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGK 389
Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
I YWL ANSW WG+ G FK+LRGEN +IE
Sbjct: 390 PIKYWLCANSWGTQWGEDGYFKVLRGENHCEIE 422
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 57/103 (55%), Gaps = 10/103 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR+KW P + +ADQ +CGS W+VS SDRL I S G +S+Q ++
Sbjct: 184 LPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLL 241
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+C + GC GG+ AW + GVV GD+ C PY
Sbjct: 242 SCNQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYPYV 277
>gi|341891358|gb|EGT47293.1| hypothetical protein CAEBREN_29072 [Caenorhabditis brenneri]
Length = 349
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 57/93 (61%), Gaps = 12/93 (12%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHN------FGDSI--GLHAVRVLGWGVEND--- 127
++ +GP+ A F V+ DF Y GVYQH+ S+ G H+VRVLGWGV++
Sbjct: 227 ELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGR 286
Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
I YWL ANSW WG+ G FKILRG+N +IE
Sbjct: 287 PIKYWLCANSWGTQWGEDGYFKILRGDNHCEIE 319
>gi|290988628|ref|XP_002677000.1| predicted protein [Naegleria gruberi]
gi|284090605|gb|EFC44256.1| predicted protein [Naegleria gruberi]
Length = 158
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/86 (45%), Positives = 53/86 (61%), Gaps = 2/86 (2%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND--IPYWL 132
+ M + +GPL A VY DF YKSGVY H G +G HA++++GWGV++ +PYW+
Sbjct: 62 DMMADLKANGPLQATMIVYKDFFSYKSGVYHHVSGRMVGAHAIKIVGWGVDSASKLPYWI 121
Query: 133 VANSWNDHWGDHGTFKILRGENEADI 158
ANSW + WG G F I RG E +
Sbjct: 122 CANSWGEDWGLDGYFWIARGRGECGL 147
>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
Length = 466
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 57/93 (61%), Gaps = 12/93 (12%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHN------FGDSI--GLHAVRVLGWGVEND--- 127
++ +GP+ A F V+ DF Y GVYQH+ S+ G H+VRVLGWGV++
Sbjct: 344 ELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGR 403
Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
I YWL ANSW WG+ G FKILRG+N +IE
Sbjct: 404 PIKYWLCANSWGTQWGEDGYFKILRGDNHCEIE 436
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 55/103 (53%), Gaps = 10/103 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+R+KW + + DQ +CGS WAVS SDRL I S G +S+Q ++
Sbjct: 198 LPEHFDSRDKWGHL--INPVVDQGDCGSSWAVSTTGISSDRLAIISEGRINASLSSQQLL 255
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+C + GC GG+ AW + GVV GD+ C PY
Sbjct: 256 SCNQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYPYV 291
>gi|12958837|gb|AAK09441.1|AF339098_1 cathepsin b-like precursor protein [Ancylostoma ceylanicum]
Length = 180
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/81 (49%), Positives = 55/81 (67%), Gaps = 2/81 (2%)
Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
P +FDAR +WPEC ++ I DQS+CGSCWAV+ A+A+SD +C+ SN IS I++
Sbjct: 90 PESFDARTQWPECRAIGTIRDQSSCGSCWAVASASAMSDEMCVQSNSSIKLMISDTDILS 149
Query: 248 CT-PNC-WGCNGGWPQLAWRF 266
C C +GC GGWP A+R+
Sbjct: 150 CCGLECGYGCQGGWPIEAYRW 170
>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
rotundata]
Length = 442
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 57/95 (60%), Gaps = 10/95 (10%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWGVE------ 125
+ M++I GP+ A VY DF Y+SGVY+H+ + H+VR++GWG E
Sbjct: 337 DIMQEILTSGPVQATMRVYQDFFSYESGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSR 396
Query: 126 -NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ YWLVANSW WG++G F+I +G NE +IE
Sbjct: 397 NTPLKYWLVANSWGQQWGENGLFRIQKGTNECEIE 431
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 55/106 (51%), Gaps = 10/106 (9%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
+ + LPR FD+R +WP + I DQ CG+ WA+S A SDR I S G ++SA
Sbjct: 196 DPESLPREFDSRTRWPR--DISKITDQGWCGASWAISSAQVASDRFAIMSKGTDAVELSA 253
Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
QH+++C GC+GG AW F G+V E C P+
Sbjct: 254 QHLLSCNNRGQQGCSGGHLDRAWMFMRRFGLV-------DENCYPW 292
>gi|348690656|gb|EGZ30470.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 647
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 64/250 (25%), Positives = 105/250 (42%), Gaps = 57/250 (22%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY GP+ +V FL+Y G++ + HA+ ++GWG E+ +P+W++ NS
Sbjct: 212 MAEIYARGPIACSVAVTDGFLKYSGGIFDDKTNATETDHAISIVGWGEEDGVPFWVLRNS 271
Query: 137 WNDHWGDHGTFKILRGENEADIE---------------------------MGFNNRVEAN 169
W WG+ G +++RG N +E V N
Sbjct: 272 WGSFWGEDGWMRLVRGVNNVGVEGECAFGVPKDDGWPTPTKIEEEEPVQEEEEKKDVVEN 331
Query: 170 SSEDDDLETM--GCQ--------------------NAKGLPRNFDARE----KWPECPSL 203
+ ED +E+ GC+ + K LP+ +D R+ +
Sbjct: 332 TEEDTSVESKLGGCRQKLHFAGGERVISPLPHETIDVKDLPKAWDWRDVNGRNFVTWDKN 391
Query: 204 RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQL 262
+HI CGSCWA +A+SDR+ I N + +S Q ++ C CNGG P L
Sbjct: 392 QHIPQY--CGSCWAQGTTSALSDRISILRNASWPEIALSPQVLINCHAGG-TCNGGNPGL 448
Query: 263 AWRFWGHNGV 272
+ + +G+
Sbjct: 449 VYEYAHRHGI 458
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 8/137 (5%)
Query: 63 HYFKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
+Y + V + M+ +IY+ GP+ F Y G+Y + + H + V G
Sbjct: 503 YYVSEYGSVSGADRMKAEIYKRGPIGCGVHATEKFEAYTGGIYSEHVMFPLINHEISVAG 562
Query: 122 WGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVE-ANSSEDDD--- 175
WG E D YW+ NSW +WG++G F+I N IE + V + S+ DD
Sbjct: 563 WGYDEETDTEYWIGRNSWGTYWGENGWFRIQMHHNNLGIEQDCDWGVPLPDGSKPDDFVI 622
Query: 176 -LETMGCQNAKGLPRNF 191
++ G + + RNF
Sbjct: 623 TVDYEGNEEGQATARNF 639
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 52/117 (44%), Gaps = 27/117 (23%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSN------CGSCWAVSVANAISDRLCIASNGYFTGQ- 239
LP+NFD W ++ N CGSCW+ + +A++DR+ IA + +
Sbjct: 57 LPKNFD----WRNVNGTNYVTISRNQHIPHYCGSCWSFAATSALADRIMIAKERSPSNKP 112
Query: 240 ---------ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+S Q I+ C GC+GG A+R+ NGV +EGCQ Y
Sbjct: 113 SVEVHREVVLSPQVILNCDKKDNGCHGGDQLEAYRYIKKNGV-------PEEGCQRY 162
>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
Length = 323
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/89 (46%), Positives = 56/89 (62%), Gaps = 1/89 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
+A +I +GP++A F +Y+DF +K VY + + HAVRV+GWG +D + YW+
Sbjct: 183 DAQYEIMTNGPVIATFMLYSDFKPHKWDVYIKSSNTQVESHAVRVVGWGTTSDGVDYWIA 242
Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGF 162
ANSW WGD G FKI RG +EA E GF
Sbjct: 243 ANSWGTGWGDKGYFKIRRGSDEAAFEEGF 271
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 51/97 (52%), Gaps = 11/97 (11%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD R W +C + + +Q +CGSCWA + ++DR+CI S+ +S Q+++
Sbjct: 46 IPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLM 103
Query: 247 ACTPNCW---------GCNGGWPQLAWRFWGHNGVVT 274
C +C GC GG+ LA + G+V+
Sbjct: 104 DCDGSCVSDGVSGCNNGCKGGFVGLALTRLINEGIVS 140
>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
Length = 450
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 55/97 (56%), Gaps = 12/97 (12%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH--------NFGDSIGLHAVRVLGWGVEN 126
+ M +I +GP+ A F VY DF Y GVYQH G H+VR++GWG +
Sbjct: 326 DIMTEIITNGPVQATFLVYEDFFMYSGGVYQHLDLHEHKEEERKVQGYHSVRIIGWGEDY 385
Query: 127 D----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ YWL ANSW + WG+ G F+ILRGEN +IE
Sbjct: 386 STGPQVKYWLAANSWGNEWGEDGLFRILRGENHCEIE 422
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 55/104 (52%), Gaps = 10/104 (9%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
+ LP +FDAREKWP + + DQ +C S W+ S +DRL I ++G +SAQ
Sbjct: 182 RELPSSFDAREKWP--LYIHPVRDQGDCASSWSHSTTATSADRLSIITDGRVNIPLSAQQ 239
Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+++C + GC GG+ AW + GVV+ E C PY
Sbjct: 240 LLSCNQHRQRGCEGGYLDRAWWYIRKLGVVS-------ELCYPY 276
>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 308
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 64/105 (60%), Gaps = 3/105 (2%)
Query: 189 RNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC 248
+ FDAR++WP+C ++ + ++ N WA +VA ++DR CIA+NG + +S + +++C
Sbjct: 67 KEFDARKRWPKCKTIGEVHNEGNFALGWAYAVAGVLADRTCIATNGGYNKLLSTEELISC 126
Query: 249 TPNCWGCNGGWP--QLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
+ NG P + W + +GVV+GG YNS +GCQP+ P
Sbjct: 127 S-GIKENNGSVPSERSIWEYLKSHGVVSGGKYNSNDGCQPFKFPP 170
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 1/86 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGWGVENDIPYWLV 133
+ +++ +GP+V F V DF YKSGVY + + I +++GWGVEN + YWLV
Sbjct: 212 DIQKEVQTYGPVVVRFMVCDDFFLYKSGVYAKSDKAKGIRTQYAKLIGWGVENGVDYWLV 271
Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
NSW WG G FKI G N+ +E
Sbjct: 272 INSWGHEWGQKGLFKIKSGTNQCGVE 297
>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
Length = 311
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 75/151 (49%), Gaps = 32/151 (21%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R WP C + + +Q CGSCWA + + ++SDRLCIAS G +S Q +V
Sbjct: 81 VPNSFDSRTNWPGC--VHAVLNQGQCGSCWAFAASESLSDRLCIASQGAINVTLSPQALV 138
Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C GCNGG PQ+AW + +G+ T + C PYT
Sbjct: 139 SCDIEFNQGCNGGIPQMAWEYLELHGIPT-------DSCFPYT----------------- 174
Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKK 336
G P+C++ C + S ++ L KGK
Sbjct: 175 SGNGTAPDCQKECSDGS-----KYQLYKGKT 200
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 64 YFKKAHMVPRCNAMRQI----YEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVR 118
Y K + C+++ I + +GP+ VY DF+ Y SGVY G + G HA++
Sbjct: 196 YKGKTFTLKTCSSVAAIQANVFAYGPIEGTMDVYQDFMSYTSGVYVMTPGSKLLGGHAIK 255
Query: 119 VLGWGVEND--IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
++GWG ++ + YW+V NSW WG +G F I RG N I+
Sbjct: 256 IVGWGTDSTSGLDYWIVQNSWGSDWGMNGFFWIQRGTNMCGID 298
>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
Length = 467
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 59/95 (62%), Gaps = 13/95 (13%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF--GDSIGLHAVRVLGWGVENDIP--- 129
+ M +I GP+ AI VY DF YK G+Y+H++ G H+V++LGWG +P
Sbjct: 368 DIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWG---SLPGKN 424
Query: 130 -----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+ ANSW +WG++G F+ILRG+NE DIE
Sbjct: 425 GQKQKFWIAANSWGKYWGENGYFRILRGQNECDIE 459
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 60/117 (51%), Gaps = 13/117 (11%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
P F A WP+ + DQ NCG+ WA S A+ +DR+ I S+G T +S Q+++
Sbjct: 222 FPEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLI 279
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
+C T N GCNGG AWR+ +GVV+ C P +HH+ P +N
Sbjct: 280 SCDTGNQRGCNGGSIDGAWRYLTTHGVVS-------YACYPSFW---KHHLDSPSEN 326
>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
gallopavo]
Length = 467
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 59/95 (62%), Gaps = 13/95 (13%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF--GDSIGLHAVRVLGWGVENDIP--- 129
+ M +I GP+ AI VY DF YK G+Y+H++ G H+V++LGWG +P
Sbjct: 368 DIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWG---SLPGKN 424
Query: 130 -----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+ ANSW +WG++G F+ILRG+NE DIE
Sbjct: 425 GQKQKFWIAANSWGKYWGENGYFRILRGQNECDIE 459
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 53/95 (55%), Gaps = 3/95 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
P F A WP+ + DQ NCG+ WA S A+ +DR+ I S+G T +S Q+++
Sbjct: 222 FPEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRIAIHSDGQITDNLSVQNLI 279
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNS 280
+C T N GC GG + AWR+ +GVV+ Y S
Sbjct: 280 SCDTKNQHGCGGGNIEGAWRYLKTHGVVSYACYPS 314
>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
niloticus]
Length = 499
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 60/96 (62%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI--------GLHAVRVLGWGVENDI 128
M++I ++GP+ AI V+ DF YK+G+Y+H G H+VR+ GWG + ++
Sbjct: 377 MKEIMDNGPVQAIMEVHEDFFVYKTGIYKHTDVSFTKPPQYRKHGTHSVRITGWGEDRNV 436
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
YW+ ANSW +WG++G F+I+RGENE +IE
Sbjct: 437 DGTSRKYWIAANSWGKNWGENGYFRIVRGENECEIE 472
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 77/163 (47%), Gaps = 22/163 (13%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP F++ EKWP + DQ NC + WA S A SDR+ I S G+ T ++S Q+++
Sbjct: 227 LPPYFNSAEKWPG--KIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPRLSPQNLI 284
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C T N GC GG AW + GVVT E C PY H + C +
Sbjct: 285 SCDTRNQGGCAGGRIDGAWWYLRRRGVVT-------EDCYPYQPP---HQTPAEVGRCMM 334
Query: 306 ----LGKLK---TPEC--KQNCYNPSYESTYRFDLKKGKKAHM 339
+G+ K T C QN +N Y+ST + L +K M
Sbjct: 335 QSRSVGRGKRQATQRCPNTQNYHNDIYQSTPPYRLSSNEKEIM 377
>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 52/78 (66%)
Query: 80 IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
I + GP+ F++Y DFL Y SG+Y H G ++G HAV++LGWG + YW+VANSW +
Sbjct: 212 IQQSGPVETGFTIYEDFLNYNSGIYHHVTGGNMGGHAVKILGWGKQGLENYWIVANSWGE 271
Query: 140 HWGDHGTFKILRGENEAD 157
WG+ G F I +G++ D
Sbjct: 272 DWGEKGYFNIRQGDSGID 289
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 57/101 (56%), Gaps = 9/101 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FD+R +W +C + I DQ+ CGSCWA + ++SDR CIAS G +S Q ++
Sbjct: 78 LPDSFDSRTQWKDC--VHPIRDQAKCGSCWAFAAVESLSDRFCIASQGKVNLVLSPQDML 135
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+C + + C GG+ AW++ GV + C+PY
Sbjct: 136 SCDASNFCCFGGYLDTAWQYLEQQGV-------GSDSCEPY 169
>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
Length = 276
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 72/134 (53%), Gaps = 14/134 (10%)
Query: 189 RNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC 248
+ FDAR++WP+C ++ + ++ N WA + +DR+CIA+NG + +S + +++C
Sbjct: 34 KEFDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISC 93
Query: 249 TPNCWGCNGGWPQ--LAWRFWGHNGVVTGGD-YNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+ N GW + LAW ++ +G+V+GG YN+ +GCQP + P C L
Sbjct: 94 SGIKASAN-GWVRDGLAWEYFKTHGLVSGGSIYNTNDGCQPSKIPPV----------CNL 142
Query: 306 LGKLKTPECKQNCY 319
K+ C CY
Sbjct: 143 PTKINKRTCVDYCY 156
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 60/102 (58%), Gaps = 5/102 (4%)
Query: 59 IPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAV 117
+ + +Y+ H+ P+ + +++ +GP+ A ++Y D +KSGVY + L V
Sbjct: 168 VKVRYYY---HVKPK-DIQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYV 223
Query: 118 RVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+++GWGVEN + YWL+ NSW + WG +G KI RG+ +E
Sbjct: 224 KLIGWGVENGVDYWLLVNSWGNEWGQNGLLKIKRGKYGCAVE 265
>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
latipes]
Length = 474
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 58/96 (60%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI--------GLHAVRVLGWGVENDI 128
M++I E+GP+ AI V+ DF YK+G+Y+H S G H+VR+ GWG + D
Sbjct: 352 MKEIMENGPVQAIMEVHEDFFVYKNGIYKHTDVSSTKPPQYRKHGTHSVRITGWGEDKDY 411
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
YW+ ANSW +WG++G F+I RG NE +IE
Sbjct: 412 DGTPRKYWIAANSWGKNWGENGFFRIARGANECEIE 447
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 78/162 (48%), Gaps = 20/162 (12%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LPR F++ EKWP + DQ NC + WA S A SDR+ I S G+ T Q+S Q+++
Sbjct: 202 LPRYFNSSEKWPN--KIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLI 259
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY---TLAPCEHHVQGPLQN 302
+C T N GC GG AW + GVVT E C PY AP E V +
Sbjct: 260 SCDTRNQGGCAGGRIDGAWWYLRRRGVVT-------ENCYPYQPPQQAPAE--VGRCMMQ 310
Query: 303 CTLLGKLK---TPECKQ--NCYNPSYESTYRFDLKKGKKAHM 339
+G+ K T C N +N Y+ST + L +K M
Sbjct: 311 SRAVGRGKRQATQRCPNTYNYHNDIYQSTPPYKLSSNEKEIM 352
>gi|301119245|ref|XP_002907350.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262105862|gb|EEY63914.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 710
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 62/242 (25%), Positives = 104/242 (42%), Gaps = 49/242 (20%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY GP+ +V FL+Y G++ + HA+ ++GWG EN +P+W++ NS
Sbjct: 211 MAEIYARGPIACSVAVTDGFLKYSGGIFDDKTNATDVDHAISIVGWGEENGVPFWVLRNS 270
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEANSS-------------------EDDDLE 177
W WG+ G +++RG N +E V + E+ +E
Sbjct: 271 WGSFWGESGWMRLVRGVNNVGVEGECAFGVPRDDGWPTPTKIEEKEEDKVKEPQEETSVE 330
Query: 178 TM--GCQ--------------------NAKGLPRNFDARE----KWPECPSLRHIADQSN 211
+ GC+ + LP+++D R+ + +HI
Sbjct: 331 STLGGCRQKLHFAGGERVISPLPHETMDVTDLPKSWDWRDVNGKNYVTWDKNQHIPKY-- 388
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHN 270
CGSCWA +A+SDR+ I N + +S Q ++ C CNGG P L + + +
Sbjct: 389 CGSCWAQGTTSALSDRISILRNASWPEIALSPQVLINCHAGG-TCNGGNPGLVYEYAHRH 447
Query: 271 GV 272
G+
Sbjct: 448 GI 449
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 2/83 (2%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVANS 136
+IY+ GP+ + F Y G+Y + + H + V GWG E D YW+ NS
Sbjct: 511 EIYKRGPIGCGVHATSKFESYTGGIYSEHVMFPLINHEISVAGWGYDEETDTEYWIGRNS 570
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W +WG++G F+I N IE
Sbjct: 571 WGTYWGENGWFRIQMHHNNLGIE 593
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 53/118 (44%), Gaps = 27/118 (22%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSN------CGSCWAVSVANAISDRLCI---------- 230
LP+NFD W R+++ N CGSCW+ + +A++DR+ I
Sbjct: 56 LPKNFD----WRNVNGTRYVSISRNQHIPHYCGSCWSFAATSALADRILIFKERNPGNKP 111
Query: 231 ASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+ + +S Q I+ C GC+GG A+R+ +GV +EGCQ Y
Sbjct: 112 SVEVHRGVVLSPQVILNCDKKDNGCHGGDQLEAYRYIKEHGV-------PEEGCQRYA 162
>gi|308804940|ref|XP_003079782.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116058239|emb|CAL53428.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
Length = 498
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 62/114 (54%), Gaps = 2/114 (1%)
Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LPR+FDAR+++P+C L + DQ CGSCWAV+ ++DRLCI+S G ++S Q
Sbjct: 257 LPRHFDARDEYPKCARLIGTVRDQGKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQFA 316
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
++C + GC GG GV GG + + C PY PC+H P
Sbjct: 317 LSCYNSGAGCEGGDVVDTLTLALAKGVPHGGMLD-KGACLPYQFEPCDHPCMIP 369
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 47/80 (58%), Gaps = 5/80 (6%)
Query: 78 RQIYEHGPLVAIFS-VYADFLQYKSGVYQ--HNFGDSIGLHAVRVLGWGVENDIP-YWLV 133
++I G + F V+ DF +K GVY+ + G +G HA +++GWGV + YW++
Sbjct: 408 KEIKNRGSVAVTFGPVHEDFYGHKEGVYKVTESSGRELGNHATKLIGWGVTQEGDHYWIM 467
Query: 134 ANSWNDHWGDHGTFKILRGE 153
NSW +WG++G K+ GE
Sbjct: 468 VNSWR-NWGENGVGKVRMGE 486
>gi|159115721|ref|XP_001708083.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157436192|gb|EDO80409.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 305
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 52/83 (62%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M + GP+ F V+ DFL Y G+Y +G S+G HAV ++G+G N+ YW+V NS
Sbjct: 214 MVSLLADGPVQTGFYVHEDFLYYVGGIYHKVYGTSLGGHAVLIVGYGSMNNHDYWIVRNS 273
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W WG++G F+ILRG NE IE
Sbjct: 274 WGSDWGENGYFRILRGTNECGIE 296
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 50/121 (41%), Gaps = 15/121 (12%)
Query: 168 ANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDR 227
N +EDD G P D R+ PEC DQ C C+A + A+S R
Sbjct: 68 VNITEDD------LYPPAGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATLGALSTR 119
Query: 228 LCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
CIA +S QH+V+C GC GG + +W F G V + C PY
Sbjct: 120 RCIAKLDPQAVSLSVQHMVSCDSGEAGCQGGEFESSWAFLETEGAV-------KSDCLPY 172
Query: 288 T 288
T
Sbjct: 173 T 173
>gi|390367767|ref|XP_787947.3| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 146
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 44/68 (64%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
K LP NFDARE WP CP+++ + DQ +CGSCWA AISDR+CI S G ISA+
Sbjct: 76 KDLPENFDARENWPNCPTIKEVRDQGSCGSCWAFGAVEAISDRICIKSKGQTQVHISAED 135
Query: 245 IVACTPNC 252
++ C C
Sbjct: 136 LMTCCKTC 143
>gi|412992960|emb|CCO16493.1| cysteine proteinase, putative [Bathycoccus prasinos]
Length = 396
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 71/144 (49%), Gaps = 7/144 (4%)
Query: 186 GLPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
GLPR FDAR++W EC L + DQ CGSCWAV+ ++DR+CIA T ++S Q+
Sbjct: 145 GLPRQFDARKEWAECKGLIGTVRDQGKCGSCWAVAATEVMNDRVCIAHGK--TEELSPQY 202
Query: 245 IVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDY-NSQEGCQPYTLAPCEHHVQGP---L 300
++C GC GG + GV TGG + +S C PY C+H Q P
Sbjct: 203 ALSCYSAGAGCEGGNVIDTLQEAIEKGVPTGGMFGDSSSACLPYEFEACDHPCQVPGTIA 262
Query: 301 QNCTLLGKLKTPECKQNCYNPSYE 324
+ C TP + P+ E
Sbjct: 263 EECPTTCADGTPISETEMMRPTSE 286
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 52/92 (56%), Gaps = 11/92 (11%)
Query: 78 RQIYEHGPLVAIFS-VYADFLQYKSGVY-QHNFGDSIGLHAVRVLGWGVENDI------- 128
++++++G + F V DF +K GVY Q G +GLHA +++GWG E D
Sbjct: 300 QELHKYGSMAVTFGPVCDDFYGHKHGVYEQPEGGKPLGLHATKIIGWGFEGDDEETGKGG 359
Query: 129 -PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
PYW++ NSW +WG+HG +I GE + E
Sbjct: 360 KPYWIMINSWQ-NWGEHGVGRIGIGEMSIESE 390
>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
rubripes]
Length = 477
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 58/96 (60%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI--------GLHAVRVLGWGVENDI 128
M++I ++GP+ AI V+ DF YKSG+Y+H G H+V++ GWG E ++
Sbjct: 355 MKEIQDNGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVKITGWGEERNV 414
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
YW+ ANSW +WG+ G F+I RGENE +IE
Sbjct: 415 DGAKRKYWIAANSWGKNWGEEGYFRIARGENECEIE 450
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 54/102 (52%), Gaps = 10/102 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP F++ EKWP + DQ NC + WA S A SDR+ I S G+ T Q+S Q+++
Sbjct: 205 LPLYFNSAEKWPG--KIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLI 262
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+C T N GC GG AW F GVVT E C PY
Sbjct: 263 SCDTRNQGGCTGGRIDGAWWFLRRRGVVT-------EDCYPY 297
>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
Precursor
gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
Length = 311
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 51/77 (66%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M++I +GP+ A F+V+ DFL YKSGVY H G +G H V+++G+G N + Y+ N
Sbjct: 223 MQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVKLVGFGTLNGVDYYAANNQ 282
Query: 137 WNDHWGDHGTFKILRGE 153
W WGD+GTF I RG+
Sbjct: 283 WTTSWGDNGTFLIKRGD 299
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/136 (33%), Positives = 64/136 (47%), Gaps = 15/136 (11%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +F+A+ WP C ++ I +Q+ CGSCWA + +DRLCI +N Q+S +V
Sbjct: 79 IPTSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNNENV--QLSFMDMV 136
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
C GC GG AW + G V+ E C PYT+ C P L
Sbjct: 137 TCDETDNGCEGGDAFSAWNWLRKQGAVS-------EECLPYTIPTC------PPAQQPCL 183
Query: 307 GKLKTPECKQNCYNPS 322
+ TP C + C + S
Sbjct: 184 NFVNTPSCTKECQSNS 199
>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
garnettii]
Length = 464
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 60/96 (62%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDS-----IGLHAVRVLGWGVENDI 128
M++I ++GP+ AI V+ DF YKSG+Y+H G+S + HAV++LGWG
Sbjct: 353 MKEIMQNGPVQAIMQVHEDFFHYKSGIYRHVASTHGESENYRKLRTHAVKLLGWGTLRGA 412
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+ ANSW WG++G F+ILRG NE+DIE
Sbjct: 413 QGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 448
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 48/94 (51%), Gaps = 5/94 (5%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP F A KWP H DQ NC + WA S A+ +DR+ I S G +T +S Q++
Sbjct: 205 LPEFFVASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 261
Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
++C N GCN G AW + G+V+ Y
Sbjct: 262 ISCCAKNRHGCNSGSIDRAWWYLRKRGLVSHACY 295
>gi|294926967|ref|XP_002779086.1| Gut-specific cysteine proteinase precursor, putative [Perkinsus
marinus ATCC 50983]
gi|239888027|gb|EER10881.1| Gut-specific cysteine proteinase precursor, putative [Perkinsus
marinus ATCC 50983]
Length = 283
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 79/175 (45%), Gaps = 15/175 (8%)
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAK--GLPRNFD 192
NS + W +G + D+ G N + +S+ DD+ +G + LP +FD
Sbjct: 89 NSMQNSWTASKDQPPFKGMSIKDVPTGCPNGPKPSSTSDDETRLLGSTKPELTNLPSDFD 148
Query: 193 AREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC--- 248
AR+K+ C + H+ DQ C +CWA +DR+CI S G F +S + +C
Sbjct: 149 ARQKFASCAEVIGHVRDQGACHNCWATGSTGMFNDRVCIKSGGSFQNILSLGYFTSCCNP 208
Query: 249 ---TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYN------SQEGCQPYTLAPCEH 294
P GC GG F ++G+VTG ++ S +GC PY C H
Sbjct: 209 ANGCPKAKGCEGGNLLEGLNFLKNHGIVTGNEFKPASQLVSADGCWPYPFPKCNH 263
>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
Length = 431
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 41/89 (46%), Positives = 53/89 (59%), Gaps = 4/89 (4%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVE-NDIPY 130
+ M +IY GP+ A VY DF Y G+Y+ N G G H+V+++GWG E N Y
Sbjct: 323 DIMAEIYHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKY 382
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
W+ ANSW WG+ G F+ILRG NE IE
Sbjct: 383 WIAANSWGPWWGERGYFRILRGSNECGIE 411
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 42/105 (40%), Positives = 56/105 (53%), Gaps = 9/105 (8%)
Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
GLP +F+A E+WP + + DQ CGS W +S + SDR I S G ++SAQ
Sbjct: 183 TDGLPSSFNAVERWPS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVRLSAQ 240
Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+I++CT GC+GG AWRF GVV + C PYT
Sbjct: 241 NILSCTRRQQGCDGGHLDAAWRFLHKKGVV-------DDSCYPYT 278
>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
Length = 354
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 40/94 (42%), Positives = 56/94 (59%), Gaps = 6/94 (6%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
N I +G + + F++Y DF+ Y+SGVY+H ++G HAV ++GWGVE+ YWL
Sbjct: 262 NIKAAIMSYGSVQSGFTIYRDFMSYRSGVYKHVSTTTLGGHAVALIGWGVESGTNYWLAV 321
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
NSW +WG G FKI +G E G N+V A
Sbjct: 322 NSWGSNWGMSGYFKIAQG------ECGIENQVYA 349
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 57/118 (48%), Gaps = 11/118 (9%)
Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPEC-PSLRHIADQSNCGSCWAVSVANAISDRLC 229
S D D + + LP NFDAR +W C P++R DQ CG+CWA S ++ RLC
Sbjct: 116 STDPDTPRLDIEPRVDLPMNFDARTQWRGCIPAVR---DQQTCGACWAFSATYVLAHRLC 172
Query: 230 IASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
IA+NG +S ++ V C C GG+ + AW F G + C PY
Sbjct: 173 IATNGKTNVVLSPEYQVQCDTMNKACQGGYLKYAWSFLERTGTTV-------DSCIPY 223
>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 520
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 40/97 (41%), Positives = 59/97 (60%), Gaps = 15/97 (15%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
M+++ E+GP+ AI V+ DF Y++G+Y+H + G H+V++ GWG E +
Sbjct: 405 MKELMENGPVQAILEVHEDFFMYRTGIYRHTAVAAGKPEQYRRHGTHSVKITGWG-EEQM 463
Query: 129 P------YWLVANSWNDHWGDHGTFKILRGENEADIE 159
P YW+ ANSW WG+HG F+I RGENE +IE
Sbjct: 464 PDGSNQKYWIAANSWGKDWGEHGYFRITRGENECEIE 500
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 73/162 (45%), Gaps = 19/162 (11%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP F+A +KW + DQ NC WA S A SDR+ I S G+ T +S Q+++
Sbjct: 254 LPSYFNAADKWSG--MIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLL 311
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP--LQNC 303
+C T + GCNGG AW F GVVT + C P++ H P + +
Sbjct: 312 SCNTRHQQGCNGGRIDGAWWFLRRRGVVT-------DECYPFSNQETNHSPNAPACMMHS 364
Query: 304 TLLGKLKTPECKQNCYNPS------YESTYRFDLKKGKKAHM 339
G+ K + C NP Y+ST + L +K M
Sbjct: 365 RSTGRGKR-QAIARCPNPRSHANEIYQSTPAYRLSSNEKEIM 405
>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
domestica]
Length = 468
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 60/96 (62%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQH--NFGD------SIGLHAVRVLGWGVENDI 128
M++I ++GP+ AI V+ DF YKSG+Y+H N D ++ HAV++ GWGV
Sbjct: 357 MKEIMQNGPVQAIMQVHEDFFHYKSGIYRHINNLKDESEKYRNLRTHAVKLTGWGVLRGA 416
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+ ANSW WG++G F+ILRG NE+DIE
Sbjct: 417 QGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 452
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 50/102 (49%), Gaps = 3/102 (2%)
Query: 178 TMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFT 237
T+ + LP F + KWP DQ NC + WA S A+ +DR+ I S G +T
Sbjct: 200 TVTLPSQTDLPEFFISSYKWP--GWTHDPLDQKNCAASWAFSTASVAADRIAIQSKGRYT 257
Query: 238 GQISAQHIVA-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
+S Q++++ C N GC GG AW + G+V+ Y
Sbjct: 258 DNLSPQNLISCCVKNRHGCKGGSIDRAWWYLRKRGLVSHACY 299
>gi|66270083|gb|AAY43371.1| cathepsin-like cysteine protease [Phytophthora infestans]
Length = 635
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 101/236 (42%), Gaps = 49/236 (20%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M +IY GP+ +V FL+Y G++ + HA+ ++GWG EN +P+W++ NS
Sbjct: 211 MAEIYARGPIACSVAVTDGFLKYSGGIFDDKTNATDVDHAISIVGWGEENGVPFWVLRNS 270
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEANSS-------------------EDDDLE 177
W WG+ G +++RG N +E V + E+ +E
Sbjct: 271 WGSFWGESGWMRLVRGVNNVGVEGECAFGVPRDDGWPTPTKIEEKEEDKVKEPQEETSVE 330
Query: 178 TM--GCQ--------------------NAKGLPRNFDARE----KWPECPSLRHIADQSN 211
+ GC+ + LP+++D R+ + +HI
Sbjct: 331 STLGGCRQKLHFAGGERVISPLPHETMDVTDLPKSWDWRDVNGKNYVTWDKNQHIPKY-- 388
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRF 266
CGSCWA +A+SDR+ I N + +S Q ++ C CNGG P L + +
Sbjct: 389 CGSCWAQGTTSALSDRISILRNASWPEIALSPQVLINCHAGG-TCNGGNPGLVYEY 443
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 54/118 (45%), Gaps = 5/118 (4%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVANS 136
+IY+ GP+ + F Y G+Y + + H + V GWG E D YW+ NS
Sbjct: 511 EIYKRGPIGCGVHATSKFESYTGGIYSEHVMFPLINHEISVAGWGYDEETDTEYWIGRNS 570
Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRV---EANSSEDDDLETMGCQNAKGLPRNF 191
W +WG++G F+I N IE + V + + D ++ G + + RNF
Sbjct: 571 WGTYWGENGWFRIQMHHNNLGIEQDCDWGVPLPDGSKPNDFVVDYQGNEAGEATDRNF 628
Score = 41.2 bits (95), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 53/118 (44%), Gaps = 27/118 (22%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSN------CGSCWAVSVANAISDRLCI---------- 230
LP+NFD W R+++ N CGSCW+ + +A++DR+ I
Sbjct: 56 LPKNFD----WRNVNGTRYVSISRNQHIPHYCGSCWSFAATSALADRILIFKERNPGNKP 111
Query: 231 ASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+ + +S Q I+ C GC+GG A+R+ +GV +EGCQ Y
Sbjct: 112 SVEVHRGVVLSPQVILNCDKKDNGCHGGDQLEAYRYIKEHGV-------PEEGCQRYA 162
>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
Length = 339
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 62/92 (67%), Gaps = 9/92 (9%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH--NFGDSI-GLHAVRVLGWGVE--NDIP 129
+ M +I +GP+ A F V+ DF + +GVY+H G+ I G H+VR+LGWG + IP
Sbjct: 220 DIMSEILTNGPVQATFRVHGDF--FIAGVYKHLPTVGEEIEGYHSVRLLGWGEDYSTGIP 277
Query: 130 --YWLVANSWNDHWGDHGTFKILRGENEADIE 159
YW+ ANSW +WG++GTF+ILRGEN +IE
Sbjct: 278 VKYWIAANSWGTNWGENGTFRILRGENHCEIE 309
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 54/102 (52%), Gaps = 10/102 (9%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDAR+KWP+ + I DQ +C S WA S A +DRL + + G +SAQ +
Sbjct: 80 LPTSFDARQKWPDF--IHPIQDQGDCASSWAQSTAATSADRLALITEGRQNVALSAQQFL 137
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+C + GC GG+ AW + GVV+ E C PY
Sbjct: 138 SCNQHRQKGCEGGYLDRAWWYIRKFGVVS-------EECYPY 172
>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
glaber]
Length = 467
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 54/96 (56%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD--------SIGLHAVRVLGWGVE--- 125
M+++ E+GP+ A+ VY DF YKSG+Y H G H+V++ GWG E
Sbjct: 354 MKELMENGPVQALMEVYEDFFLYKSGIYSHTLVSMGRPEQYRRHGTHSVKITGWGEEMLP 413
Query: 126 --NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ YW ANSW WG+ G F+ILRG NE DIE
Sbjct: 414 DGRTLKYWTAANSWGPSWGERGYFRILRGSNECDIE 449
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 56/112 (50%), Gaps = 5/112 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ F+A +KWP + DQ NC WA S A SDR+ I S G+ T +S Q+++
Sbjct: 203 LPKAFEASKKWPN--MIHDPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPVLSPQNLL 260
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDY--NSQEGCQPYTLAPCEHH 295
+C T + GC GG AW F GVV+ Y + E + PC H
Sbjct: 261 SCDTHHQQGCQGGRLDGAWWFLRRRGVVSDHCYPFSGHEQAEAGPATPCMMH 312
>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 330
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 62/107 (57%), Gaps = 4/107 (3%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSI 112
Y + +SHY+ ++ + +++ +GP+ F VY DF YKSGVY + +
Sbjct: 216 YYHDHVKVSHYY---NIKSNEDIQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYV 272
Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
H +++GWGVEN + YWL+ NSW + WG +G FKI RG NE +E
Sbjct: 273 RRHFAKLIGWGVENGVDYWLLVNSWGNEWGQNGLFKIKRGTNEVHVE 319
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 68/137 (49%), Gaps = 17/137 (12%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+ FDAR+ WP+C ++ + D N WA + A ++DR+CIA+NG + +S + ++
Sbjct: 86 IHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEELI 145
Query: 247 AC----TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
C T G W + +G+V+GG YN+ +GCQP + P ++ L N
Sbjct: 146 FCGGIKTKQSGAVRG---DDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPIG-NIPTHLYN 201
Query: 303 CTLLGKLKTPECKQNCY 319
T C++ CY
Sbjct: 202 HT---------CEERCY 209
>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 450
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 60/99 (60%), Gaps = 14/99 (14%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---------NFGDSIGLHAVRVLGWGVE 125
+ M +IY++GP+ A F+V DF Y GVY++ + D G H+V+++GWG++
Sbjct: 337 DIMTEIYQNGPVQATFNVKNDFFVYNRGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGID 396
Query: 126 -----NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
N I YWL NSW +WG+ G F+I+RG NE +IE
Sbjct: 397 RSDWYNPIKYWLCTNSWGRNWGEQGMFRIVRGVNECEIE 435
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 60/128 (46%), Gaps = 10/128 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDARE WP + + DQ CGS WA+S A+ SDRL I S G ++S QH++
Sbjct: 197 LPETFDARENWPGL--IDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQHLL 254
Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C GC+GG+ AW G V+ C PY E + L+
Sbjct: 255 SCNIRGQRGCSGGYLDRAWYHLRRAGAVS-------RACYPYHSGLDEDTIMQKLRCRVA 307
Query: 306 LGKLKTPE 313
G + PE
Sbjct: 308 YGSSQCPE 315
>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
Length = 330
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 61/103 (59%), Gaps = 9/103 (8%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVRVLGWGVE-------N 126
+ M++IY +GP+ A F VY F+ YKSGVY H D + G HA++++GWGVE
Sbjct: 221 DIMKEIYLNGPVEAGFRVYTSFMSYKSGVYHHRILDIMEGGHAIKIVGWGVEPPKRFWQK 280
Query: 127 DIPYWLVANSWNDHWGDHGTFKILRGENE-ADIEMGFNNRVEA 168
YW+ ANSW WG +G FKI RG+N E G ++V A
Sbjct: 281 PTKYWICANSWTADWGMNGFFKIRRGKNRFGQSECGIEDQVFA 323
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 58/110 (52%), Gaps = 4/110 (3%)
Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
K LP +F+ E WP + I DQ+ CGSCWA + + +SDR IASNG +S +
Sbjct: 92 KDLPESFNCYENWPN--YMHPIRDQARCGSCWAFAASEVLSDRFAIASNGTVNKILSPED 149
Query: 245 IVACTPNCWGCNGGWPQLAWRFWGHNGVVTGG--DYNSQEGCQPYTLAPC 292
+V+C GC GG+ AW + NG+VT Y +Q+G P C
Sbjct: 150 LVSCDKGDMGCQGGYLDKAWDYLKTNGIVTESCFPYAAQKGVAPSCRISC 199
>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
Length = 197
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/95 (45%), Positives = 60/95 (63%), Gaps = 2/95 (2%)
Query: 45 KKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN-AMRQ-IYEHGPLVAIFSVYADFLQYKSG 102
K +K +R Y + H+ +A+ +P ++RQ IY++GP+VA F VY DF YK G
Sbjct: 103 KCRKTCQRKYYKSYQEDKHFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYKKG 162
Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
+Y H +G G HAV+V+GWG EN YWL+ANSW
Sbjct: 163 IYVHKWGGQTGAHAVKVVGWGRENATDYWLIANSW 197
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 61/130 (46%), Gaps = 4/130 (3%)
Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT-PNC-WGCNGGWPQLAWRFWGHNG 271
SCWAVS A A+SD +C+ SN IS I++C +C +GC GGW A+++
Sbjct: 1 SCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGYGCQGGWSIEAYKWMQRER 60
Query: 272 VVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
+ + C+P + +H P G TP+C++ C Y+S Y+ D
Sbjct: 61 CCYRWENTDRRVCKPVRPSIRVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYYKS-YQED 119
Query: 331 LKKGKKAHMV 340
+A+ +
Sbjct: 120 KHFATRAYYL 129
>gi|452268|emb|CAA80451.1| cathepsin B-like protease [Fasciola hepatica]
Length = 104
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 50/88 (56%), Gaps = 1/88 (1%)
Query: 209 QSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFW 267
Q CG+CWA A+SDR+CI S G +SA+ +++C C GC GG P LAW +W
Sbjct: 1 QGQCGTCWAFGAVGAMSDRVCIHSKGQMKPHLSARDLLSCCEFCGRGCRGGSPALAWDYW 60
Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHH 295
+G+VTGG GC PY C HH
Sbjct: 61 KSSGIVTGGSLEEPTGCAPYPFPKCAHH 88
>gi|294952611|ref|XP_002787376.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239902348|gb|EER19172.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 203
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 52/80 (65%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I+++GP+ + F +Y DF YKSGVY + + H V+++GWG ++ YWL NSW
Sbjct: 114 QEIFDNGPVFSAFKMYEDFRYYKSGVYVPTTKEVLSFHLVKIIGWGADSVQEYWLAMNSW 173
Query: 138 NDHWGDHGTFKILRGENEAD 157
N+ WGDHG K+ G+N +
Sbjct: 174 NEEWGDHGLIKMAFGKNRLE 193
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 42/99 (42%), Gaps = 14/99 (14%)
Query: 254 GCNGGWPQLAWRFWGHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQGPLQN----- 302
GCNGG A F GVVTG D+ Q +GC PY C H P +N
Sbjct: 11 GCNGGTFVEAMSFLEDYGVVTGNDFKPQGQLSEADGCWPYPFQKCNH---VPTENSEYPK 67
Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
C + P C+ C N +Y+ + + D+ + K V
Sbjct: 68 CKDVAHQPLPPCRTTCTNKAYKKSLKKDVHRAKSWRKVF 106
>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 476
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 60/98 (61%), Gaps = 13/98 (13%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG--------DSIGLHAVRVLGWGVEN 126
+ M++I E+GP+ A+ VY DF YKSG+Y+H + H+++++GWG
Sbjct: 369 DIMKEIKENGPVQAVMQVYDDFFLYKSGIYKHIWSLEGKTQNRHQKKPHSIKIVGWGTLR 428
Query: 127 DIP-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
D +W+ ANSW + WG++G F+ILRG+NE DIE
Sbjct: 429 DAEGQRQKFWIAANSWGNSWGENGYFRILRGQNECDIE 466
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 49/93 (52%), Gaps = 3/93 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
P F A +WP + DQ NC + WA S A+ +DR+ I S G FT +S QH++
Sbjct: 224 FPEFFVAWHEWPG--WIHDPLDQRNCAASWAFSTASVAADRIAIHSKGRFTDNLSPQHLI 281
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
+C T N +GC GG AW + G+V+ Y
Sbjct: 282 SCDTRNQYGCKGGSITGAWSYLKKYGLVSHACY 314
>gi|294893015|ref|XP_002774310.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239879603|gb|EER06126.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 81
Score = 83.6 bits (205), Expect = 1e-13, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 47/68 (69%)
Query: 86 LVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHG 145
++ + S+Y DF YKSGVY H G +G+H+++++GWGVE+ YWL NSWN+ GDHG
Sbjct: 1 VLGVISMYEDFRLYKSGVYVHTTGGLVGVHSLKIIGWGVESGQDYWLAVNSWNEESGDHG 60
Query: 146 TFKILRGE 153
K+ GE
Sbjct: 61 MIKLAVGE 68
>gi|149941230|emb|CAO02547.1| putative cathepsin B-like cysteine protease [Vigna unguiculata]
Length = 201
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 67/135 (49%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDAR W +C ++ I DQ +CGSCWA ++SDR CI + +S ++
Sbjct: 18 LPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFD--VNISLSVNDLL 75
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR+ ++GVVT E C PY C H P
Sbjct: 76 ACCGFLCGSGCNGGYPLSAWRYLSNHGVVT-------EECDPYFDQTGCSHPGCEP---- 124
Query: 304 TLLGKLKTPECKQNC 318
+TP+C + C
Sbjct: 125 ----AYRTPKCVKKC 135
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 25/35 (71%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG 109
+ M ++Y++GP+ F+VY DF YKSGVY+H G
Sbjct: 161 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTG 195
>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
Length = 463
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 57/94 (60%), Gaps = 9/94 (9%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG-----DSIGLHAVRVLGWGVE---- 125
+ M +I + G + AI VY DF Y+SG+Y+H+ + H+VR++GWG E
Sbjct: 323 DIMAEIKDRGTVQAIMRVYRDFFSYRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGY 382
Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ + YW+ NSW WG++G F+ILRG NE DIE
Sbjct: 383 DVVKYWIAINSWGQWWGENGRFRILRGSNECDIE 416
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 48/104 (46%), Gaps = 9/104 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FDA E W + DQ CGS WA S A SDR I S G Q++ Q ++
Sbjct: 187 LPTRFDASEHWTGL--VAEARDQGWCGSSWAFSTATMASDRFAILSKGREMVQLAPQQML 244
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
AC GC+GG AW++ GVV E C PY A
Sbjct: 245 ACVRRQQGCSGGHLDTAWQYLRRTGVV-------NEECYPYIAA 281
>gi|77744608|gb|ABB02268.1| cathepsin B [Ovis aries]
Length = 76
Score = 83.2 bits (204), Expect = 1e-13, Method: Composition-based stats.
Identities = 38/75 (50%), Positives = 51/75 (68%), Gaps = 2/75 (2%)
Query: 224 ISDRLCIASNGYFTGQISAQHIVACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQ 281
ISDR+CI S G ++SA+ ++ C C GCNGG+P AW FW G+V+GG Y+S
Sbjct: 1 ISDRICIHSKGRVNVEVSAEDMLTCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSH 60
Query: 282 EGCQPYTLAPCEHHV 296
GC+PY++ PCEHHV
Sbjct: 61 VGCRPYSIPPCEHHV 75
>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
Length = 310
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 70/135 (51%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR +W C ++ +I DQ +CG+CWA + A+ DR CI N + +S ++
Sbjct: 97 LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLN--MSVSLSVNDLL 154
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR++ +GVVT E C PY C+H P
Sbjct: 155 ACCGFLCGSGCNGGYPISAWRYFRRSGVVT-------EECDPYFDQTGCQHPGCEP---- 203
Query: 304 TLLGKLKTPECKQNC 318
TP+C++ C
Sbjct: 204 ----AYPTPKCQRKC 214
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 48/79 (60%), Gaps = 4/79 (5%)
Query: 67 KAHMVPRCNAMRQIYEHGPLVAIFSV--YADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
+ H P + M ++Y++GP+ F+ DF YKSGVY+H G +G HAV+++GWG
Sbjct: 233 RVHSNPH-DIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGT 291
Query: 125 EN-DIPYWLVANSWNDHWG 142
+ YWL+AN WN WG
Sbjct: 292 SDAGEDYWLLANQWNRGWG 310
>gi|403343435|gb|EJY71046.1| Papain family cysteine protease containing protein [Oxytricha
trifallax]
Length = 619
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 69/257 (26%), Positives = 104/257 (40%), Gaps = 64/257 (24%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
M++IY+ GP+ +V Y G+YQ GD +H V V+G+GVEN +W+V NS
Sbjct: 196 MQEIYQRGPIACGIAVPDSLETYTGGIYQDTTGDQNIVHDVSVVGFGVENGTKFWVVRNS 255
Query: 137 WNDHWGDHGTFKILRGENEADIEMG---------FNNRV-------EANSSEDD------ 174
W H+G++G +++RG N IE + NRV E N ++D
Sbjct: 256 WGSHYGENGFVRVIRGVNNIAIETDCAWATPVDTWTNRVPHKTTDAEKNDPKNDKYRKNG 315
Query: 175 -----------DLETMGCQ----------------------NAKGLPRNFDARE----KW 197
+ GC+ +A LP N D R +
Sbjct: 316 PYPSGMENEFLSTKNHGCRRVAKAAFKAGQVKTEVMPWEEIDAAALPANLDWRNVNNTNF 375
Query: 198 PECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI--SAQHIVACTPNCWGC 255
+HI CGSCWA ++++DR I + I +AQ I+ C C
Sbjct: 376 LSWSKNQHIPQY--CGSCWAQGTTSSLADRFNILLGDHNPTPIDLAAQTIINCQAGG-SC 432
Query: 256 NGGWPQLAWRFWGHNGV 272
NGG P + + G+
Sbjct: 433 NGGDPSGVYEYAFETGI 449
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 5/101 (4%)
Query: 63 HYFKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVL 120
+Y + + N M+ +I+++GP+ SV F Y +G+Y + +F I H + V+
Sbjct: 499 YYVSNYYGLSGANKMKAEIFKNGPISCGISVTDGFEAYSTGIYSESSFFPQIN-HEIAVV 557
Query: 121 GWGVE--NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
GWG++ YW+ NSW +WG+ G F+I + IE
Sbjct: 558 GWGLDEATKTEYWIGRNSWGTYWGEQGFFRIKMHSDNLAIE 598
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 13/120 (10%)
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHN 270
CGSCWA + ++ISDR+ IA + I+ Q +++C+ N GC+GG A++F H
Sbjct: 77 CGSCWAQAATSSISDRIKIARKAAWPDINIAPQVVISCSMNDDGCHGGEAISAYQFM-HQ 135
Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
VT E C Y ++ V+ C + K + Q+C+ P +TYR D
Sbjct: 136 SEVT------DETCSIYQARGHDNGVE-----CAPINVCKNCQPFQDCFVPDEYNTYRVD 184
>gi|115605092|gb|ABJ15785.1| cathepsin B [Bos taurus]
Length = 118
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/86 (46%), Positives = 53/86 (61%), Gaps = 3/86 (3%)
Query: 255 CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPEC 314
CNGG+P AW FW G+V+GG YNS GC+PY++ PCEHHV G CT G+ TP+C
Sbjct: 1 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKC 58
Query: 315 KQNCYNPSYESTYRFDLKKGKKAHMV 340
+ C P Y +Y+ D G ++ V
Sbjct: 59 SKTC-EPGYSPSYKEDKHFGCSSYSV 83
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 18/28 (64%), Positives = 23/28 (82%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVY 104
M +IY++GP+ FSVY+DFL YKSGVY
Sbjct: 91 MAEIYKNGPVEGAFSVYSDFLLYKSGVY 118
>gi|149941232|emb|CAO02548.1| putative cathepsin B-like cysteine protease,putative [Vigna
unguiculata]
Length = 195
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 67/135 (49%), Gaps = 20/135 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP NFDAR W +C ++ I DQ +CGSCWA ++SDR CI + +S ++
Sbjct: 18 LPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFD--VNISLSVNDLL 75
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
AC GCNGG+P AWR+ ++GVVT E C PY C H P
Sbjct: 76 ACCGFLCGSGCNGGYPLSAWRYLSNHGVVT-------EECDPYFDQTGCSHPGCEP---- 124
Query: 304 TLLGKLKTPECKQNC 318
+TP+C + C
Sbjct: 125 ----AYRTPKCVKKC 135
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 25/35 (71%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG 109
+ M ++Y++GP+ F+VY DF YKSGVY+H G
Sbjct: 161 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTG 195
>gi|403354695|gb|EJY76909.1| Cathepsin B [Oxytricha trifallax]
Length = 311
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 50/80 (62%)
Query: 80 IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
++ GP+VA+F V+ DF+ Y G+Y GD +G HAV++LG+GVEN Y++ N W
Sbjct: 224 LFNKGPMVAVFDVFEDFINYGGGIYNKVSGDKLGKHAVKLLGYGVENSTNYYIGVNQWGK 283
Query: 140 HWGDHGTFKILRGENEADIE 159
WG+ G F+I GE D E
Sbjct: 284 DWGEDGYFRIKAGEVLIDNE 303
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 57/107 (53%), Gaps = 10/107 (9%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
Q AK +P ++D R +P C + I DQ+ CGSCWA + N + R C+A+ G ++S
Sbjct: 82 QVAKQMPSSYDVRTVYPMCEN--RIKDQAQCGSCWAFATTNVLEYRYCMATKGKKYPELS 139
Query: 242 AQHIVAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
Q++++C WGC+GG+ + + GV T E C PY
Sbjct: 140 PQNLISCFNSASWGCDGGYIDQTFLYLEMMGVNT-------EQCMPY 179
>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
Length = 475
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 68/124 (54%), Gaps = 24/124 (19%)
Query: 60 PLSHYFKKAHMVPRCNA-----------MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
P + F+K++ + +C+ MR+I ++GP+ AI V+ DF YK+G+Y+H
Sbjct: 336 PCPNSFEKSNRIYQCSPPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVV 395
Query: 109 GDS--------IGLHAVRVLGWGVENDI-----PYWLVANSWNDHWGDHGTFKILRGENE 155
+ + HAV++ GWG +W+ ANSW WG++G F+ILRG NE
Sbjct: 396 STNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNE 455
Query: 156 ADIE 159
+DIE
Sbjct: 456 SDIE 459
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 48/94 (51%), Gaps = 5/94 (5%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP F A KWP H DQ NC + WA S A+ +DR+ I S G +T +S Q++
Sbjct: 216 LPEIFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 272
Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
++C N GCN G AW F G+V+ Y
Sbjct: 273 ISCCAKNRHGCNSGSIDRAWWFLRKRGLVSHACY 306
>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
Length = 475
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 68/124 (54%), Gaps = 24/124 (19%)
Query: 60 PLSHYFKKAHMVPRCN-----------AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
P + F+K++ + +C+ MR+I ++GP+ AI V+ DF YK+G+Y+H
Sbjct: 336 PCPNSFEKSNRIYQCSPPYRISSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVV 395
Query: 109 GDS--------IGLHAVRVLGWGVENDIP-----YWLVANSWNDHWGDHGTFKILRGENE 155
+ + HAV++ GWG +W+ ANSW WG++G F+ILRG NE
Sbjct: 396 STNEEPEKYRKLRTHAVKLTGWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNE 455
Query: 156 ADIE 159
+DIE
Sbjct: 456 SDIE 459
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 48/94 (51%), Gaps = 5/94 (5%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP F A KWP H DQ NC + WA S A+ +DR+ I S G +T +S Q++
Sbjct: 216 LPEVFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 272
Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
++C N GCN G AW F G+V+ Y
Sbjct: 273 ISCCAKNRHGCNSGSIDRAWWFLRKRGLVSHACY 306
>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
Length = 475
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 68/124 (54%), Gaps = 24/124 (19%)
Query: 60 PLSHYFKKAHMVPRCNA-----------MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
P + F+K++ + +C+ MR+I ++GP+ AI V+ DF YK+G+Y+H
Sbjct: 336 PCPNSFEKSNRIYQCSPPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVV 395
Query: 109 GDS--------IGLHAVRVLGWGVENDI-----PYWLVANSWNDHWGDHGTFKILRGENE 155
+ + HAV++ GWG +W+ ANSW WG++G F+ILRG NE
Sbjct: 396 STNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNE 455
Query: 156 ADIE 159
+DIE
Sbjct: 456 SDIE 459
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 48/94 (51%), Gaps = 5/94 (5%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP F A KWP H DQ NC + WA S A+ +DR+ I S G +T +S Q++
Sbjct: 216 LPEIFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 272
Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
++C N GCN G AW F G+V+ Y
Sbjct: 273 ISCCAKNRHGCNSGSIDRAWWFLRKRGLVSHACY 306
>gi|146163744|ref|XP_001471259.1| cathepsin z [Tetrahymena thermophila]
gi|146145941|gb|EDK31861.1| cathepsin z [Tetrahymena thermophila SB210]
Length = 585
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 111/260 (42%), Gaps = 59/260 (22%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYK--SGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
M++I+ GP +A + ++L+Y G+Y H + V+GWG EN+ YW++
Sbjct: 189 MQEIFNRGP-IACYIYATEYLRYNYTGGIYNDTSSYPGTNHVIEVVGWGEENNEKYWIIR 247
Query: 135 NSWNDHWGDHGTFKILRGENEADIEMG----------FNNRV-------EANSSEDDDLE 177
NSW +WG+ G ++ LRG N +IE + N V E +++ ++
Sbjct: 248 NSWGSYWGEKGFYRQLRGVNMLNIESSNCNWAVPLDTWTNDVRNTTKVTEVSNNHTNNFR 307
Query: 178 TMGC--------------------QNAKGLPRNFDAREKWPECPSLRHIADQSN------ 211
C NA LP N+D W + +++ N
Sbjct: 308 HTTCIRESNKNSTQLITGPLPHEYINAASLPANWD----WRNINGVNYLSFTRNQHIPQY 363
Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHN 270
CGSCWA ++++DR+ IA N + +S Q ++ C CNGG P ++F
Sbjct: 364 CGSCWAHGTTSSLADRINIARNRTWPDIALSVQVVLNCQAGG-SCNGGQPMGVYQFANKQ 422
Query: 271 GVVTGGDYNSQEGCQPYTLA 290
G+ +E CQ Y A
Sbjct: 423 GI-------PEESCQNYLAA 435
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 2/83 (2%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
+IY GP+ V F Y G+Y+ + + H + V+GWG + + YW+ NS
Sbjct: 492 EIYARGPISCGIYVTNKFEAYTGGIYKESTAFPMINHEIAVVGWGTDPQTGVEYWIGRNS 551
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W +WG++G F+I + IE
Sbjct: 552 WGTYWGENGFFRIQMHKQNLAIE 574
>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
Length = 425
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 58/96 (60%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
M++I +GP+ AI V+ DF YKSG+Y+H + + HAV++ GWG
Sbjct: 314 MKEIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKSEKYQKLQTHAVKLTGWGTLRGA 373
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+VANSW + WG++G F+ILRG NE+DIE
Sbjct: 374 QGRKEKFWIVANSWGNSWGENGYFRILRGVNESDIE 409
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 3/93 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP F A KWP DQ NC + WA S A+ +DR+ I S G +T +S Q+++
Sbjct: 166 LPEFFVAYYKWPGW--THGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLI 223
Query: 247 ACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
+C N GC+ G AW + G+V+ Y
Sbjct: 224 SCCAKNRHGCSSGSIDRAWWYLRKRGLVSHACY 256
>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 475
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 69/127 (54%), Gaps = 24/127 (18%)
Query: 57 TSIPLSHYFKKAHMVPRCN-----------AMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
+ P ++ +K++ + +C+ M++I ++GP+ AI V+ DF YK+G+Y+
Sbjct: 333 ATTPCPNHIEKSNRIYQCSPPYRVSSNETQIMKEIMQNGPVQAIMKVHEDFFSYKTGIYR 392
Query: 106 HNFGDS--------IGLHAVRVLGWGVENDI-----PYWLVANSWNDHWGDHGTFKILRG 152
H S + HAV++ GWG +W+ ANSW WG++G FKILRG
Sbjct: 393 HVTSTSEDSEKYQKLRTHAVKLTGWGTLKGARGKKEKFWIAANSWGKSWGENGYFKILRG 452
Query: 153 ENEADIE 159
NE+DIE
Sbjct: 453 VNESDIE 459
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 21/129 (16%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP F A KWP H DQ NC + WA S A+ +DR+ I S+G +T +S Q++
Sbjct: 217 LPEFFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSSGRYTANLSPQNL 273
Query: 246 VACTPN-CWGCNGGWPQLAWRFWGHNGVVTGG------DYNSQEGC----------QPYT 288
++C GC GG AW + G+V+ D N+ GC + +
Sbjct: 274 ISCCARKRHGCGGGSVDRAWWYLRKRGLVSHACYPLFKDQNATNGCAMASRSDGRGKRHA 333
Query: 289 LAPCEHHVQ 297
PC +H++
Sbjct: 334 TTPCPNHIE 342
>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 463
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 79/160 (49%), Gaps = 19/160 (11%)
Query: 28 KKKKKEEEKKKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLV 87
KKKK+ + + + + K RL+ + + M +I GP+
Sbjct: 298 KKKKETMAQCPSRVRSNNDRTTKTRLHRVGPV--------YRVATEEGIMHEILTSGPVQ 349
Query: 88 AIFSVYADFLQYKSGVYQHN---FGDSIGLHAVRVLGWGVEND----IPYWLVANSWNDH 140
A+ V DF YKSGVY+ + G G H+VR++GWG E + YW+ +NSW
Sbjct: 350 AVMKVSRDFFMYKSGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYWIASNSWGSW 409
Query: 141 WGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMG 180
WG++G F+IL+G +E +IE + V A ++ DD + G
Sbjct: 410 WGENGYFRILKGVDECEIE----DFVIAAWADIDDFDVTG 445
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 54/120 (45%), Gaps = 19/120 (15%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
L R++DARE W + DQ CG+ WA++ +DR I S + +S QH++
Sbjct: 195 LRRSYDAREVWGN--YISSPIDQGWCGASWAITTVQVTTDRFGIMSKRAISDVLSPQHLL 252
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C N GC GG AW + G++T E C P+ QG + C +
Sbjct: 253 SCNNLNQQGCQGGHLTRAWNWIRKFGLIT-------EECYPW---------QGRMSTCAV 296
>gi|56756124|gb|AAW26240.1| unknown [Schistosoma japonicum]
Length = 159
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 62/112 (55%), Gaps = 14/112 (12%)
Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
+IL G + D EM N R + D ++E +P FD+R+KWP C S+ I
Sbjct: 61 RILMGARKEDAEMKRNRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110
Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGW 259
DQS CGSCWA A++DR+CI S G + ++SA +++C +C GGW
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDC----GGW 158
>gi|312083604|ref|XP_003143931.1| hypothetical protein LOAG_08355 [Loa loa]
Length = 188
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 34/66 (51%), Positives = 47/66 (71%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FDAR+ WPEC SLR++ DQS+CGSCWAV+ A+SDR+CI S G +SA ++
Sbjct: 120 IPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLL 179
Query: 247 ACTPNC 252
+C C
Sbjct: 180 SCCKTC 185
>gi|290980376|ref|XP_002672908.1| predicted protein [Naegleria gruberi]
gi|284086488|gb|EFC40164.1| predicted protein [Naegleria gruberi]
Length = 261
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 57/104 (54%), Gaps = 4/104 (3%)
Query: 60 PLSHYFKKA--HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH--NFGDSIGLH 115
PL Y KA ++ + + I + G ++ +Y DFL Y SGVYQH N I
Sbjct: 149 PLVLYKTKAVQNLTGEHDMQQAILQGGSIMTELDMYQDFLYYSSGVYQHSANLRQPIAKF 208
Query: 116 AVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
VR++GWGVEN + YW+V N W WG G I RG NE++IE
Sbjct: 209 VVRIIGWGVENGVKYWIVPNIWGKTWGMQGYIWIRRGNNESNIE 252
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 13/88 (14%)
Query: 216 WAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTG 275
W + +SDR+C+ S+ F ++S Q+I+ C +GCNGG+ + F G+ T
Sbjct: 66 WGHVPSATVSDRMCVQSSAKFQERLSTQYILECDTRDFGCNGGYLNTEFEFELKRGIPT- 124
Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
E C PY+ V G L NC
Sbjct: 125 ------EKCVPYS------AVNGTLANC 140
>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 62/122 (50%), Gaps = 4/122 (3%)
Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
A++DRLCI SN ISA +++C +C +GC+GG+P AW FW NG+VTGG
Sbjct: 2 AVEAMTDRLCIHSNATIKKHISATDLLSCCESCGFGCHGGFPPRAWDFWMENGLVTGGSK 61
Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
+ GC+ Y C HH +G C TP C +C P + Y D K ++
Sbjct: 62 ENPSGCRSYPFPRCSHHGKGKYPPCPKT-IFDTPNCVDHCDKPDID--YAADKTHAKSSY 118
Query: 339 MV 340
V
Sbjct: 119 NV 120
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 38/52 (73%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI 128
M++I +GP+ A F VY DF++YKSG+Y H+ G +G HA+R+LGWG E +
Sbjct: 128 MKEIMRNGPVEAAFMVYEDFIEYKSGIYFHSHGKLLGGHAIRMLGWGEEKGV 179
>gi|66805843|ref|XP_636643.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
gi|60465035|gb|EAL63141.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
Length = 314
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 61/104 (58%), Gaps = 11/104 (10%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFT-GQISAQHI 245
+P +FD+R +WP+C + I +Q CGSCWA S + +SDRLCIASN G +S Q +
Sbjct: 88 IPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTL 145
Query: 246 VAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
VAC GC+GG PQLAW + G+ T + C PYT
Sbjct: 146 VACDVYGNDGCSGGIPQLAWEYMELKGLPT-------DSCVPYT 182
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 55/95 (57%), Gaps = 7/95 (7%)
Query: 62 SHYFKKAHMVPRCNAMRQIYE----HGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHA 116
S Y K + C++++ I E +GP+V VY DF+ Y SGVY G S +G HA
Sbjct: 202 SLYRAKPFTLKTCSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPGSSLLGGHA 261
Query: 117 VRVLGWGVE--NDIPYWLVANSWNDHWGDHGTFKI 149
++++GWG + + + YW+VANSW WG G F I
Sbjct: 262 IKIVGWGFDQTSQLNYWIVANSWGADWGQQGFFFI 296
>gi|111054118|gb|ABH04250.1| cathepsin B precursor [Sus scrofa]
Length = 61
Score = 82.4 bits (202), Expect = 2e-13, Method: Composition-based stats.
Identities = 35/55 (63%), Positives = 42/55 (76%)
Query: 105 QHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+H GD +G HA+R+LGWGVEN PYWLV NSWN WGD+G FKILRG++ IE
Sbjct: 4 KHVTGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIE 58
>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
Length = 573
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 57/94 (60%), Gaps = 9/94 (9%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG-----DSIGLHAVRVLGWGVE---- 125
+ M +I E G + AI VY DF Y++G+Y+H+ + H+VR++GWG E
Sbjct: 432 DIMTEIKERGTVQAILRVYRDFFSYQNGIYRHSAAATPAEERSAYHSVRLIGWGEERVGY 491
Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ + YW+ NSW WG++G F+ILRG NE +IE
Sbjct: 492 DMVKYWIAVNSWGTWWGENGRFRILRGTNECEIE 525
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 49/104 (47%), Gaps = 9/104 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP +FDA + WP + DQ CGS WA+S SDR I S G Q++ Q ++
Sbjct: 296 LPSHFDAADHWPRL--VGEARDQGWCGSSWALSTTTMASDRFAILSKGREQVQLAPQQLL 353
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
AC C+GG AW++ GVV + C PY A
Sbjct: 354 ACVRRQQACSGGHLDTAWQYLRRVGVV-------NDECYPYIAA 390
>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
scrofa]
Length = 368
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 68/124 (54%), Gaps = 24/124 (19%)
Query: 60 PLSHYFKKAHMVPRCN-----------AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
P + F+K++ + +C+ MR+I ++GP+ AI V+ DF YK+G+Y+H
Sbjct: 229 PCPNNFEKSNRIYQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFHYKTGIYRHVT 288
Query: 109 GDS--------IGLHAVRVLGWGVENDIP-----YWLVANSWNDHWGDHGTFKILRGENE 155
+ + HAV++ GWG +W+ ANSW WG++G F+ILRG NE
Sbjct: 289 STNEESDKYRKLRTHAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNE 348
Query: 156 ADIE 159
+DIE
Sbjct: 349 SDIE 352
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 47/93 (50%), Gaps = 3/93 (3%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP F A KWP DQ NC + WA S A+ +DR+ I S G +T +S Q+++
Sbjct: 109 LPEFFVASYKWPGW--THGPLDQKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNLI 166
Query: 247 ACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
+C N GCN G AW + G+V+ Y
Sbjct: 167 SCCAKNRHGCNSGSIDRAWWYLRKRGLVSHACY 199
>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
Length = 673
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 58/101 (57%), Gaps = 4/101 (3%)
Query: 72 PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVE-NDIP 129
P N +I +GP+ A F VY+DF YKSG+YQ G + +G HAV+VLGW + N P
Sbjct: 218 PITNYQTEIMTNGPVEADFDVYSDFYSYKSGIYQKTAGSTYVGGHAVKVLGWASDSNGTP 277
Query: 130 YWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS 170
YW+ N W WG G F I RG + + + F+N + A +
Sbjct: 278 YWIAQNQWGTSWGMGGYFYIYRGNSTLNCK--FDNYMIAGT 316
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 64/124 (51%), Gaps = 9/124 (7%)
Query: 165 RVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAI 224
R E +SSE+ T ++ +P FD+R KWP+C + I +Q CGSCWA +
Sbjct: 66 RGEESSSEEARYNTRDVKSTVAIPDTFDSRTKWPQC--IHGIRNQGQCGSCWAFATTGVF 123
Query: 225 SDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGC 284
SDRLCI +N IS + ++ C + C GG+ +W+F+ + G+ E C
Sbjct: 124 SDRLCITTNNVSNVVISPEFLIECDKTSFACQGGYGYYSWKFFMNTGI-------PLESC 176
Query: 285 QPYT 288
PYT
Sbjct: 177 VPYT 180
>gi|6562770|emb|CAB62589.1| putative cathepsin B-like protease [Pisum sativum]
Length = 206
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 58/103 (56%), Gaps = 11/103 (10%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR WP+C ++ I DQ +CGSCWA ++SDR CI +S ++
Sbjct: 102 LPKEFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFG--VDVPLSVNDLL 159
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
AC GC+GG+P AW+++ H+GVVT E C PY
Sbjct: 160 ACCGFLCGSGCDGGYPISAWKYFAHHGVVT-------EECDPY 195
>gi|361069783|gb|AEW09203.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153583|gb|AFG58928.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153585|gb|AFG58929.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153587|gb|AFG58930.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153589|gb|AFG58931.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153591|gb|AFG58932.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153593|gb|AFG58933.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153595|gb|AFG58934.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153597|gb|AFG58935.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153599|gb|AFG58936.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153601|gb|AFG58937.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153603|gb|AFG58938.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153605|gb|AFG58939.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153607|gb|AFG58940.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153609|gb|AFG58941.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
Length = 68
Score = 82.4 bits (202), Expect = 3e-13, Method: Composition-based stats.
Identities = 38/64 (59%), Positives = 45/64 (70%)
Query: 96 FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
F YKSGVY++ GD +G HAV+++GWG E YWLVANSWN WG+ G FKI RG NE
Sbjct: 1 FAHYKSGVYKYIKGDLMGGHAVKLVGWGTEGGTDYWLVANSWNTAWGEDGYFKIARGSNE 60
Query: 156 ADIE 159
IE
Sbjct: 61 CGIE 64
>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
Length = 271
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI--------GLHAVRVLGWGVENDI 128
M++I ++GP+ AI V+ DF Y SG+Y+H G H+V++ GWG E +
Sbjct: 158 MKEIQDNGPVQAIMEVHEDFFMYNSGIYKHTDVSFTKPPHYRKHGTHSVKITGWGEERNF 217
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
YW+ ANSW +WG++G F+I RGENE +IE
Sbjct: 218 DGTTRKYWIAANSWGKNWGENGYFRIARGENECEIE 253
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 74/163 (45%), Gaps = 22/163 (13%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP F++ EKWP + DQ NC + WA S A SDR+ I S G+ T Q+S Q+++
Sbjct: 8 LPLYFNSAEKWPG--KIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLI 65
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
+C T N GC GG AW + GVVT E C PY L C +
Sbjct: 66 SCDTRNQGGCAGGRLDGAWWYLRRRGVVT-------EDCYPYRPP---QQTPAELSRCMM 115
Query: 306 ----LGKLK---TPEC--KQNCYNPSYESTYRFDLKKGKKAHM 339
+G+ K T C N N Y+ST + L +K M
Sbjct: 116 QSRSVGRGKRQATQRCPNTNNYQNDIYQSTPPYRLSTSEKEIM 158
>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
protease B1; Flags: Precursor
Length = 303
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 55/85 (64%), Gaps = 2/85 (2%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
M + GPL + VYAD Y+SGVY+H +G ++G HA+ ++G+G +D YW++
Sbjct: 210 MGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIK 269
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSW WG++G F+I+RG NE IE
Sbjct: 270 NSWGPDWGENGYFRIVRGVNECRIE 294
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 45/89 (50%), Gaps = 2/89 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD R+++P+C ++ DQ +CGSCWA S DR C S QH++
Sbjct: 79 IPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLI 136
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTG 275
+C+ +GC+GG Q W F G T
Sbjct: 137 SCSLENFGCDGGDFQPTWSFLTFTGATTA 165
>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 303
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 55/85 (64%), Gaps = 2/85 (2%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
M + GPL + VYAD Y+SGVY+H +G ++G HA+ ++G+G +D YW++
Sbjct: 210 MGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIK 269
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSW WG++G F+I+RG NE IE
Sbjct: 270 NSWGPDWGENGYFRIVRGVNECRIE 294
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 2/89 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD R+++P+C ++ DQ +CG CWA S DR C S QH++
Sbjct: 79 IPPQFDFRDEYPQC--VKPALDQGSCGGCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLI 136
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTG 275
+C+ +GC+GG Q W F G T
Sbjct: 137 SCSLENFGCDGGDFQPTWSFLTFTGATTA 165
>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
Length = 303
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 55/85 (64%), Gaps = 2/85 (2%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
M + GPL + VYAD Y+SGVY+H +G ++G HA+ ++G+G +D YW++
Sbjct: 210 MGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIK 269
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSW WG++G F+I+RG NE IE
Sbjct: 270 NSWGPDWGENGYFRIVRGVNECRIE 294
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 2/89 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD R+++P+C ++ DQ +CG CWA S DR C S QH++
Sbjct: 79 IPPQFDFRDEYPQC--VKPALDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLI 136
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTG 275
+C+ +GC+GG Q W F G T
Sbjct: 137 SCSLENFGCDGGDFQPTWSFLTFTGATTA 165
>gi|48762489|dbj|BAD23814.1| cathepsin B-N [Tuberaphis takenouchii]
Length = 163
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 63/103 (61%), Gaps = 6/103 (5%)
Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
++A +DRLCIA++G F +SA+ + C C +GC+GG+P AW ++ +G+VTGGDY
Sbjct: 2 TSSAFADRLCIATDGEFNELLSAEELAFCCHKCGFGCHGGYPIKAWEWFKKHGLVTGGDY 61
Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKL--KTPECKQNCY 319
+S EGCQPY + PC G N T GK K C + CY
Sbjct: 62 DSGEGCQPYRVPPCPLDEYG---NNTCRGKPAEKNHRCTRMCY 101
Score = 37.7 bits (86), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 16/43 (37%), Positives = 24/43 (55%)
Query: 63 HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
H+ + A+ + + + +GP+ A F VY DF YKSGVY
Sbjct: 113 HWTRDAYYLTYTTIQKDVMAYGPIEASFDVYDDFPNYKSGVYM 155
>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
Length = 432
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 51/85 (60%), Gaps = 4/85 (4%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVE-NDIPYWLVA 134
+I+ GP+ A VY DF Y G+Y+ N G G H+V+++GWG E N YW+ A
Sbjct: 329 EIFHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAA 388
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSW WG+ G F+ILRG NE IE
Sbjct: 389 NSWGPWWGERGYFRILRGSNECGIE 413
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 53/103 (51%), Gaps = 9/103 (8%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
GLP +F+A +KW + + DQ CGS W +S + SDR I S G Q+S Q+I
Sbjct: 188 GLPASFNAVDKWSR--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQNI 245
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
++CT GC GG AWR+ GV+ E C PYT
Sbjct: 246 LSCTRRQQGCEGGHLDAAWRYLHKKGVL-------DESCYPYT 281
>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 324
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 60/105 (57%), Gaps = 2/105 (1%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+ + FDAR+ W +C ++ + + N WA + A +DR+C+A+NG + +S + ++
Sbjct: 85 ISKEFDARKHWSQCKTIGEVYNDGNSDLSWAYATTGAFADRMCVATNGSYNQLLSTEQLI 144
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
+C+ N AW+F+ G+V+GG YN+ +GCQP + P
Sbjct: 145 SCSG--IKSNAMADDQAWKFFKKQGLVSGGKYNTNDGCQPSKIPP 187
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 50/82 (60%), Gaps = 1/82 (1%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF-GDSIGLHAVRVLGWGVENDIPYWLV 133
N R++ +GP+ A FS+Y D Y SGVY + + +++GWGVEN + YWL+
Sbjct: 229 NIQREVQTYGPVSAYFSLYDDLFLYTSGVYARTEKSKFVRYQSAKLIGWGVENGVDYWLL 288
Query: 134 ANSWNDHWGDHGTFKILRGENE 155
NSW + WG +G FKI RG +E
Sbjct: 289 VNSWGNEWGQNGLFKIKRGTDE 310
>gi|63115212|gb|AAY33830.1| cathepsin B, partial [Siniperca chuatsi]
Length = 69
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 35/60 (58%), Positives = 45/60 (75%)
Query: 100 KSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
K GVYQH +G ++G HA+++LGWG E+ +PYWL ANSWN WGD+G FK LRG + IE
Sbjct: 1 KFGVYQHVYGSAVGGHAIKILGWGEEDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCRIE 60
>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
Length = 484
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 52/89 (58%), Gaps = 4/89 (4%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS---IGLHAVRVLGWGVE-NDIPY 130
+ M +I+ GP+ A V DF Y GVY+ + G H+V+++GWG E N Y
Sbjct: 324 DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKY 383
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
W+ ANSW WG+HG F+ILRG NE IE
Sbjct: 384 WIAANSWGSWWGEHGYFRILRGSNECGIE 412
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 9/103 (8%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
GLP +F+A +KW + + DQ CG+ W +S + SDR I S G Q+SAQ+I
Sbjct: 186 GLPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNI 243
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
++CT GC GG AWR+ GVV E C PYT
Sbjct: 244 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DENCYPYT 279
>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
domestica]
Length = 466
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 58/98 (59%), Gaps = 13/98 (13%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSI-----GLHAVRVLGWGVEN 126
+ M+++ E+GP+ A+ V+ DF YKSG+Y+H + G G H+V++ GWG E
Sbjct: 350 DIMKELMENGPVQALMEVHEDFFLYKSGIYKHTPASLGKPARYRQHGTHSVKITGWGEER 409
Query: 127 D-----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ YW ANSW WG+ G F+ILRG NE DIE
Sbjct: 410 QPDGQRLKYWTAANSWGPTWGEKGHFRILRGANECDIE 447
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 70/157 (44%), Gaps = 7/157 (4%)
Query: 142 GDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECP 201
G+H F + E +G + + ++ M Q LP F+A +KWP
Sbjct: 158 GNHSAFWGMTLEEGIQYRLGTVRPASSVMNMNEIQMVMAPQET--LPLAFNASDKWPGL- 214
Query: 202 SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC-TPNCWGCNGGWP 260
+ DQ NC WA S A SDR+ I S G+ T +S Q++++C T N GC GG
Sbjct: 215 -IHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLLSCDTHNQKGCRGGRL 273
Query: 261 QLAWRFWGHNGVVTGGDYNSQEGCQPYT--LAPCEHH 295
AW F G+V+ Y G + T APC H
Sbjct: 274 DGAWWFLRRRGLVSNHCYPFSAGNRDATAPAAPCMMH 310
>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 53/85 (62%), Gaps = 2/85 (2%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
M+ + GP+ + VYAD L Y GVY+H +G S GLHA+ ++G+G +D YW +
Sbjct: 210 MQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWTIK 269
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSW WG+ G F+I+RG NE IE
Sbjct: 270 NSWGSDWGEDGYFRIVRGVNECRIE 294
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD R+++P C S + DQ +CG CWA S R C S QH++
Sbjct: 79 LPAQFDFRDEYPHCVS--PVFDQGSCGGCWAFSAIGMFGSRRCAVGIDKAAVLYSQQHLI 136
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
+C+ +GC+GG W F G T
Sbjct: 137 SCSTENFGCSGGDFFPTWSFLTQTGATT 164
>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
Length = 269
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 55/85 (64%), Gaps = 2/85 (2%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
M + GPL + VYAD Y+SGVY+H +G ++G HA+ ++G+G +D YW++
Sbjct: 176 MGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIK 235
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSW WG++G F+I+RG NE IE
Sbjct: 236 NSWGPDWGENGYFRIVRGVNECRIE 260
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 2/88 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P FD R+++P+C ++ DQ +CG CWA S DR C S QH++
Sbjct: 45 IPPQFDFRDEYPQC--VKPALDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLI 102
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
+C+ +GC+GG Q W F G T
Sbjct: 103 SCSLENFGCDGGDFQPTWSFLTFTGATT 130
>gi|159117627|ref|XP_001709033.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157437148|gb|EDO81359.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 308
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 53/83 (63%), Gaps = 1/83 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
R + GP+ A+F+VY DF Y G+Y + +G+ +G +V ++G+G ++ YW+V N
Sbjct: 208 RAVALRGPMQAMFTVYEDFTYYLEGIYSYTYGNRVGFLSVEIVGYGTSDEGQDYWIVKNY 267
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W WG+ G F+I+RG+NE IE
Sbjct: 268 WGPGWGEDGYFRIVRGQNECQIE 290
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 48/93 (51%), Gaps = 5/93 (5%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
+N +P +FD RE++P+C + + D C S WA S +A S R C+ + S
Sbjct: 70 ENEDPVPDHFDFREEYPQC--ITEVIDIGLCSSSWAYSAVDAFSHRRCLTGLDQEATRYS 127
Query: 242 AQHIVAC--TPNCWGCNGGWPQLAWRFWGHNGV 272
AQ+I++C T C+G + +AW F G+
Sbjct: 128 AQYILSCSSTNGCFGFSTR-ESIAWDFIATTGI 159
>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
griseus]
Length = 475
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 67/124 (54%), Gaps = 24/124 (19%)
Query: 60 PLSHYFKKAHMVPRCN-----------AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
P + F+K++ + +C+ MR+I +GP+ AI V+ DF YK+G+Y+H
Sbjct: 336 PCPNSFEKSNRIYQCSPPYRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTGIYRHVI 395
Query: 109 GDS--------IGLHAVRVLGWGVENDI-----PYWLVANSWNDHWGDHGTFKILRGENE 155
+ + HAV++ GWG +W+ ANSW WG++G F+ILRG NE
Sbjct: 396 STNEESEKYRKLRSHAVKLTGWGTLRGAGGKKEKFWIAANSWGKSWGENGYFRILRGVNE 455
Query: 156 ADIE 159
+DIE
Sbjct: 456 SDIE 459
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 47/94 (50%), Gaps = 5/94 (5%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP F + KWP H DQ NC + WA S A+ +DR+ I S G +T +S Q++
Sbjct: 216 LPEVFISSYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSRGRYTANLSPQNL 272
Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
++C GCN G AW F G+V+ Y
Sbjct: 273 ISCCAKKRHGCNSGSIDRAWWFLRKRGLVSHACY 306
>gi|6562768|emb|CAB62588.1| putative cathepsin B-like protease [Pisum sativum]
Length = 166
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 58/103 (56%), Gaps = 11/103 (10%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP+ FDAR WP+C ++ I DQ +CGSCWA ++SDR CI +S ++
Sbjct: 62 LPKEFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFG--VDVPLSVNDLL 119
Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
AC GC+GG+P AW+++ H+GVVT E C PY
Sbjct: 120 ACCGFLCGSGCDGGYPISAWKYFAHHGVVT-------EECDPY 155
>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
Length = 476
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 56/96 (58%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
MR+I ++GP+ AI V+ DF YK+G+Y+H + HAV++ GWG
Sbjct: 365 MREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGA 424
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+ ANSW WG++G F+ILRG NE+DIE
Sbjct: 425 QGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 46/94 (48%), Gaps = 5/94 (5%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP F A KWP H DQ NC + WA S A+ +DR+ I S G +T +S Q++
Sbjct: 217 LPEFFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273
Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
++C GCN AW + G+V+ Y
Sbjct: 274 ISCCAKKRRGCNSESVDRAWWYLRKRGLVSHACY 307
>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
Length = 430
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 52/89 (58%), Gaps = 4/89 (4%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS---IGLHAVRVLGWGVE-NDIPY 130
+ M +I+ GP+ A V DF Y GVY+ + G H+V+++GWG E N Y
Sbjct: 323 DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKY 382
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
W+ ANSW WG+HG F+ILRG NE IE
Sbjct: 383 WIAANSWGSWWGEHGYFRILRGSNECGIE 411
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 9/103 (8%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
GLP +F+A +KW + + DQ CG+ W +S + SDR I S G Q+SAQ+I
Sbjct: 186 GLPNSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNI 243
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
++CT GC GG AWR+ GVV E C PYT
Sbjct: 244 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DENCYPYT 279
>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 53/85 (62%), Gaps = 2/85 (2%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
M+ + GP+ + VYAD L Y GVY+H +G S GLHA+ ++G+G +D YW +
Sbjct: 210 MQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWTIK 269
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSW WG+ G F+I+RG NE IE
Sbjct: 270 NSWGSDWGEDGYFRIVRGVNECRIE 294
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP FD R+++P C S + DQ +CG CWA S R C S QH++
Sbjct: 79 LPAQFDFRDEYPHCVS--PVFDQGSCGGCWAFSAIGMFGSRRCAVGIDKAAVLYSQQHLI 136
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
+C+ +GC+GG W F G T
Sbjct: 137 SCSTENFGCSGGDFFPTWSFLTQTGATT 164
>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
Length = 431
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 52/89 (58%), Gaps = 4/89 (4%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS---IGLHAVRVLGWGVE-NDIPY 130
+ M +I+ GP+ A V DF Y GVY+ + G H+V+++GWG E N Y
Sbjct: 324 DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKALTGFHSVKLVGWGEEHNGEKY 383
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
W+ ANSW WG+HG F+ILRG NE IE
Sbjct: 384 WIAANSWGSWWGEHGYFRILRGSNECGIE 412
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 9/103 (8%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
GLP +F+A +KW + + DQ CG+ W +S + SDR I S G Q+SAQ+I
Sbjct: 186 GLPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNI 243
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
++CT GC GG AWR+ GVV E C PYT
Sbjct: 244 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DENCYPYT 279
>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
Length = 431
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 52/89 (58%), Gaps = 4/89 (4%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS---IGLHAVRVLGWGVE-NDIPY 130
+ M +I+ GP+ A V DF Y GVY+ + G H+V+++GWG E N Y
Sbjct: 324 DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKY 383
Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
W+ ANSW WG+HG F+ILRG NE IE
Sbjct: 384 WIAANSWGSWWGEHGYFRILRGSNECGIE 412
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 9/103 (8%)
Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
GLP +F+A +KW + + DQ CG+ W +S + SDR I S G Q+SAQ+I
Sbjct: 186 GLPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNI 243
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
++CT GC GG AWR+ GVV E C PYT
Sbjct: 244 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DENCYPYT 279
>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 313
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 189 RNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC 248
+ FDAR++WP+C ++ + ++ N WA + A ++DR CIA+NG + +S + +++C
Sbjct: 74 KEFDARKRWPKCKTIGEVHNEGNFAFGWAYAAAGVLADRTCIATNGGYNKLLSTEELISC 133
Query: 249 TPNCWGCNGGWPQLA-WRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
+ NG + + W + +GVV+GG YNS +GCQP+ P + + C
Sbjct: 134 S-GIKETNGNVNERSIWEYLKSHGVVSGGKYNSNDGCQPFKFPPIANILTHLQHTC 188
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 59/110 (53%), Gaps = 4/110 (3%)
Query: 54 YLPTSIPLSH---YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFG 109
Y TSI +H + + + +++ +GP+ F V DFL YKSGVY + +
Sbjct: 193 YGNTSINYNHDHVRVRNYYTIRTGYIQKEVQTYGPVAVQFKVCDDFLLYKSGVYVKSDNA 252
Query: 110 DSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
I +++GWGVEN + YWLV NSW WG G FKI RG N+ +E
Sbjct: 253 KVIRTQYAKLIGWGVENGVDYWLVINSWGHEWGQKGLFKIKRGTNQCGVE 302
>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
Length = 330
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 61/107 (57%), Gaps = 4/107 (3%)
Query: 54 YLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSI 112
Y + +SHY+ ++ + +++ +GP+ F VY DF YKSGVY + +
Sbjct: 216 YYHDHVKVSHYY---NIKSNEDIQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYV 272
Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
H +++GWGVEN + YWL+ N W + WG +G FKI RG NE +E
Sbjct: 273 RRHFAKLIGWGVENGVDYWLLVNFWGNEWGQNGLFKIKRGTNEVHVE 319
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 68/137 (49%), Gaps = 17/137 (12%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+ FDAR+ WP+C ++ + D N WA + A ++DR+CIA+NG + +S + ++
Sbjct: 86 IHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEELI 145
Query: 247 AC----TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
C T G W + +G+V+GG YN+ +GCQP + P ++ L N
Sbjct: 146 FCGGIKTKQSGAVRG---DDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPIG-NIPTHLYN 201
Query: 303 CTLLGKLKTPECKQNCY 319
T C++ CY
Sbjct: 202 HT---------CEERCY 209
>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
Length = 476
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 56/96 (58%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
MR+I ++GP+ AI V+ DF YK+G+Y+H + HAV++ GWG
Sbjct: 365 MREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGA 424
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+ ANSW WG++G F+ILRG NE+DIE
Sbjct: 425 QGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 47/94 (50%), Gaps = 5/94 (5%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP F A KWP H DQ NC + WA S A+ +DR+ I S G +T +S Q++
Sbjct: 217 LPEFFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273
Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
++C GCN G AW + G+V+ Y
Sbjct: 274 ISCCAKKRHGCNSGSVDRAWWYLRKRGLVSHACY 307
>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
Length = 349
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 9/101 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD+R+KWP C + I DQ CGSCWA + + +SDR CI S G +S Q +V
Sbjct: 125 IPESFDSRDKWPNC--IHGIRDQQLCGSCWAFASSAFLSDRFCIHSEGQINEDLSPQDLV 182
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
+C+ +GC+GG + F + G+V+ E C+PY
Sbjct: 183 SCSYENFGCSGGQLTESVDFLIYEGIVS-------EKCKPY 216
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 44/80 (55%), Gaps = 1/80 (1%)
Query: 79 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG-VENDIPYWLVANSW 137
++ +GP++ SVY D + YK GVY++ G+ +G HA++++GWG E +W N W
Sbjct: 256 ELMTNGPMMVGLSVYEDLMNYKEGVYEYTTGNQVGGHAIKIIGWGHTEKGELFWKCQNQW 315
Query: 138 NDHWGDHGTFKILRGENEAD 157
WG G I GE D
Sbjct: 316 GKDWGMGGYINIKAGELGMD 335
>gi|294952605|ref|XP_002787373.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239902345|gb|EER19169.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 185
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 53/80 (66%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
++I+++GP+++ F +Y DF YKSGVY +S H+++++GWG + YWL NSW
Sbjct: 96 QEIFDNGPVLSSFKMYEDFRYYKSGVYVPTTKESSTSHSIKIIGWGGASGREYWLAVNSW 155
Query: 138 NDHWGDHGTFKILRGENEAD 157
N+ WGDHG K+ G+N +
Sbjct: 156 NEEWGDHGLIKMAFGKNRLE 175
>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
Length = 484
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 56/98 (57%), Gaps = 13/98 (13%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN--------FGDSIGLHAVRVLGWGVEN 126
+ M+++YE+GP+ AI V+ DF YKSG+Y+ G H+V++ GWG E
Sbjct: 368 DIMKELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHHRRHGTHSVKITGWGEER 427
Query: 127 DIP-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
YWL ANSW WG+ G F+I RGENE +IE
Sbjct: 428 GRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIE 465
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 74/163 (45%), Gaps = 15/163 (9%)
Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
N LP +F+A EKWP + DQ NC WA S A SDR+ I S G+ T +S
Sbjct: 217 NNDILPSHFNAAEKWPGL--VHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSP 274
Query: 243 QHIVAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
Q++++C T N GC GG AW + GVV+ E C P+T H +
Sbjct: 275 QNLLSCDTRNQHGCRGGRVDGAWWYLRRRGVVS-------EPCYPFTSLNTNGHSAPCMM 327
Query: 302 NCTLLGKLK---TPECKQNCY--NPSYESTYRFDLKKGKKAHM 339
+G+ K T C Y N Y+ST + L +K M
Sbjct: 328 QSRSMGRGKRQATNNCPNQYYSSNEIYQSTPAYRLASSEKDIM 370
>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 303
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 56/85 (65%), Gaps = 2/85 (2%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
M + GP+ + VY+D Y+SGVY+H +G S+GLHA+ ++G+G +D YW++
Sbjct: 210 MHMLATGGPVQTMIVVYSDLSYYESGVYKHTYGTISLGLHALEMVGYGTTDDGTDYWIIR 269
Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
NSW WG++G F+I+RG NE IE
Sbjct: 270 NSWGADWGENGYFRIVRGVNECRIE 294
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 6/95 (6%)
Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
+ A +P FD R+++P+C + + DQ +CG CWA S DR C+A S
Sbjct: 74 EPADPIPSQFDFRDEYPQC--VTPVMDQGSCGGCWAFSAIGVFGDRRCVAGIDKEGVPYS 131
Query: 242 AQHIVACTPNCWGCNGG--WPQLAWRFWGHNGVVT 274
Q++++C+ GC+GG WP W F G T
Sbjct: 132 QQYLISCSTENHGCDGGDFWP--TWSFLTLTGATT 164
>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
harrisii]
Length = 467
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 57/98 (58%), Gaps = 13/98 (13%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD--------SIGLHAVRVLGWGVE- 125
+ M+++ E+GP+ A+ V+ DF YKSG+Y+H G H+V++ GWG E
Sbjct: 351 DIMKELMENGPVQALLEVHEDFFLYKSGIYKHTPASLGKPERYRQHGTHSVKITGWGEEI 410
Query: 126 ----NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+ YW ANSW WG++G F+I+RG NE DIE
Sbjct: 411 QPDGQKVKYWTAANSWGPTWGENGYFRIVRGANECDIE 448
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 53/112 (47%), Gaps = 5/112 (4%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
LP F A KWP + DQ NC WA S A SDR+ I S G+ + +S Q+++
Sbjct: 202 LPSAFSASNKWPGL--IHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMSPALSPQNLL 259
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQ--PYTLAPCEHH 295
+C T N GC GG AW F G+V+ Y EG APC H
Sbjct: 260 SCNTHNQHGCRGGRLDGAWWFLRRRGLVSNNCYPFSEGDHNGAAPAAPCMMH 311
>gi|145347486|ref|XP_001418195.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578424|gb|ABO96488.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 330
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 61/109 (55%), Gaps = 2/109 (1%)
Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP +FDAR +P+C L + DQ CGSCWAV+ ++DRLC+A++G ++S Q+
Sbjct: 112 LPTSFDARVAYPKCSRLLGAVRDQGRCGSCWAVAATEVMNDRLCVATDGENADELSPQYA 171
Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
++C + GC+GG R G+ GG +S C PY C+H
Sbjct: 172 LSCFDSGSGCDGGDVLDTLRIAFTKGIPYGGMLDSN-ACLPYEFEACDH 219
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 6/72 (8%)
Query: 94 ADFLQYKSGVYQ--HNFGDSIGLHAVRVLGWGV-ENDIPYWLVANSWNDHWGDHGTFKIL 150
D SGVY ++ G+ +G HA +++GWGV E YW + NSW + WG++G K+
Sbjct: 256 GDVTHTGSGVYTVPNDAGEPLGQHATKLIGWGVSEEGEHYWWMVNSWRN-WGENGVSKVR 314
Query: 151 RGENEADIEMGF 162
G E +IE G
Sbjct: 315 MG--EMNIESGI 324
>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
africana]
Length = 476
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
M++I ++GP+ AI V+ DF YK+G+Y+H S + HAV++ GWG+
Sbjct: 365 MKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVIRTSEESEKYQKLRTHAVKLTGWGMMKGA 424
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+ ANSW WG+ G F+ILRG NE+DIE
Sbjct: 425 KGRKEKFWVAANSWGKSWGEDGYFRILRGVNESDIE 460
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 50/94 (53%), Gaps = 5/94 (5%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP F A KWP H DQ NC + WA S A+ +DR+ I SNG +T +S Q++
Sbjct: 217 LPEFFVASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNL 273
Query: 246 VA-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
++ CT N GCN G AW + G+V+ Y
Sbjct: 274 ISCCTKNRHGCNSGSVDRAWWYLRKRGLVSHACY 307
>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 309
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 53/83 (63%), Gaps = 1/83 (1%)
Query: 78 RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
R I GP+ A+F+VY DF Y G+Y H +G + G +V ++G+G ++ YW+V N
Sbjct: 209 RAIALRGPMQAMFTVYEDFAYYLEGIYSHVYGGTAGYLSVEIVGYGTSDEGQDYWIVKNY 268
Query: 137 WNDHWGDHGTFKILRGENEADIE 159
W +WG+ G F+I+RG+NE IE
Sbjct: 269 WGSNWGEDGYFRIVRGQNECQIE 291
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 8/106 (7%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +FD RE++P+C + + D C S WA S A R C+ + SAQ+I+
Sbjct: 75 IPDHFDFREEYPQC--ITEVIDMGTCSSSWAHSPVEAFGHRRCMNGVDQEATRYSAQYIL 132
Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGV-----VTGGDYNSQEGCQP 286
+C T N G ++W F G+ V DY+ E P
Sbjct: 133 SCATTNGCLAFPGQGVVSWDFIATTGIPLESCVKYTDYDKTESSYP 178
>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
Length = 474
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
M++I ++GP+ AI V+ DF YK+G+Y+H + + HAV++ GWG
Sbjct: 363 MKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEESEKYRKLQTHAVKLTGWGTLKGA 422
Query: 129 -----PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+ ANSW WG++G F+ILRG NE+DIE
Sbjct: 423 RGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 458
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 1/69 (1%)
Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NCWGCNGGWPQLAWRFWGH 269
NC + WA S A+ +DR+ I SNG +T +S Q++++C N GCN G AW +
Sbjct: 237 NCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWYLRK 296
Query: 270 NGVVTGGDY 278
G+V+ Y
Sbjct: 297 RGLVSHACY 305
>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
Length = 476
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 56/96 (58%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
MR+I ++GP+ AI V+ DF YK+G+Y+H + HAV++ GWG
Sbjct: 365 MREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGA 424
Query: 129 -----PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+ ANSW WG++G F+ILRG NE+DIE
Sbjct: 425 HGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 53/116 (45%), Gaps = 14/116 (12%)
Query: 164 NRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
N V A+ +E DL P F A KWP DQ NC + WA S A+
Sbjct: 205 NEVTASLAETTDL-----------PEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASV 251
Query: 224 ISDRLCIASNGYFTGQISAQHIVACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
+DR+ I S G +T +S Q++++C GCN G AW + G+V+ Y
Sbjct: 252 AADRIAIQSQGRYTANLSPQNLISCCAKKRHGCNSGSVDRAWWYLRKRGLVSHACY 307
>gi|145514872|ref|XP_001443341.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410719|emb|CAK75944.1| unnamed protein product [Paramecium tetraurelia]
Length = 358
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 59/100 (59%), Gaps = 2/100 (2%)
Query: 75 NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL--HAVRVLGWGVENDIPYWL 132
N R+I +GP+VA+ V+ DFL YK GVY+ G S HAV+V+GWG ++ + YW+
Sbjct: 255 NIKREILNNGPIVAVIQVFKDFLVYKGGVYEVVEGSSKFQYGHAVKVIGWGKQDGVNYWV 314
Query: 133 VANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSE 172
+ NSW D WG G + G+N+ +E + A S+E
Sbjct: 315 IENSWGDSWGLKGLAYVAVGQNQLQLEAYSVAPIVAASTE 354
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 9/102 (8%)
Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
+P +++ RE PEC + I Q NC S ++++ +A SDRLC + NG F Q+S Q +
Sbjct: 131 IPESYNFREAQPECA--QPIYFQGNCSSSYSIAAVSATSDRLCKSKNGEFQDQLSPQSPI 188
Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
+C + C GG G V+ C PY+
Sbjct: 189 SCDDKNYKCGGGSVTRVLEVGKKQGFVS-------TSCLPYS 223
>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
Length = 466
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 63/109 (57%), Gaps = 11/109 (10%)
Query: 62 SHYFKKAHMVPRCN---AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDS-----I 112
++Y+ CN MR++ ++GP+ F VY DF YK G+YQH GDS I
Sbjct: 348 TNYYYVGGFYGACNEYLMMRELVKNGPISISFEVYGDFKHYKGGIYQHTGLGDSYNPWQI 407
Query: 113 GLHAVRVLGWGVE--NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
HAV ++G+G + + YW+V NSW WG++G F+ILRG +E IE
Sbjct: 408 TNHAVLLVGYGTDQKSGKDYWIVKNSWGTKWGENGFFRILRGVDECSIE 456
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 49/105 (46%), Gaps = 15/105 (14%)
Query: 187 LPRNFDAREKWPECPSLRHIA---DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
P+ FD W ++ +++ +Q CGSC+A S RL + S +S Q
Sbjct: 235 FPKQFD----WRNVSNVNYVSPVRNQGACGSCYAFSSMAMYEARLRVLSKNSVKRVMSPQ 290
Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHN-GVVTGGDYNSQEGCQPY 287
+V+C+ GC GG+P L +G + G+V +E C PY
Sbjct: 291 DVVSCSEYAQGCAGGFPYLIAGKYGEDFGLV-------EESCFPY 328
>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
Length = 476
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
M++I ++GP+ AI V+ DF YK+G+Y+H + + HAV++ GWG
Sbjct: 365 MKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGA 424
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+ ANSW WG++G F+ILRG NE+DIE
Sbjct: 425 QGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 48/94 (51%), Gaps = 5/94 (5%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP F A KWP H DQ NC + WA S A+ +DR+ I S G +T +S Q++
Sbjct: 217 LPEFFVASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 273
Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
++C N GCN G AW + G+V+ Y
Sbjct: 274 ISCCAKNRHGCNSGSIDRAWWYLRKRGLVSHACY 307
>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
Length = 476
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 13/96 (13%)
Query: 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
M++I ++GP+ AI V+ DF YK+G+Y+H + + HAV++ GWG
Sbjct: 365 MKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGA 424
Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
+W+ ANSW WG++G F+ILRG NE+DIE
Sbjct: 425 QGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 48/94 (51%), Gaps = 5/94 (5%)
Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
LP F A KWP H DQ NC + WA S A+ +DR+ I S G +T +S Q++
Sbjct: 217 LPEFFVASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 273
Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
++C N GCN G AW + G+V+ Y
Sbjct: 274 ISCCAKNRHGCNSGSIDRAWWYLRKRGLVSHACY 307
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.431
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,978,831,710
Number of Sequences: 23463169
Number of extensions: 290740234
Number of successful extensions: 11118598
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 33926
Number of HSP's successfully gapped in prelim test: 4560
Number of HSP's that attempted gapping in prelim test: 8787244
Number of HSP's gapped (non-prelim): 1417178
length of query: 342
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 199
effective length of database: 9,003,962,200
effective search space: 1791788477800
effective search space used: 1791788477800
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)