BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy1664
(524 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
Length = 335
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 153/290 (52%), Positives = 198/290 (68%), Gaps = 11/290 (3%)
Query: 48 KLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQ-LSDPLEELPEGFDARIN 106
K + A +N ++LS + MGVH D+ + R P V LS +++LPE FD+R
Sbjct: 35 KASTWRAGRNFHPDVSLSYIRGLMGVHQDAY--KFREPEFVHDLSADVDDLPENFDSREQ 92
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGC 166
WP CPTI+EIRDQGSCGS WA GAVEAMSDRVCIAS GK H R S++DLVSCC CG GC
Sbjct: 93 WPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIASGGKIHFRFSAEDLVSCCHTCGFGC 152
Query: 167 QGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIR 225
GGF G AW YWV G+VSGG + S GC+PY I PCE ++NG+ SC+ TP+C++
Sbjct: 153 NGGFPGAAWSYWVHKGLVSGGPFGSNLGCQPYAIAPCEHHVNGTRPSCEGEGGKTPKCVK 212
Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 285
KCQ Y V Y D +G +YS+P +E+ I +EI +GPVEG+ T+Y D++ YK G+Y+H
Sbjct: 213 KCQDSYTVPYAKDKRYGSKSYSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEGVYQH 272
Query: 286 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
V G LG HAIRI+GWG E + KYWL+ANS+N++WG+NG F+I
Sbjct: 273 VTGKMLGGHAIRILGWGVE-------NNTKYWLIANSWNSDWGDNGFFKI 315
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 82/164 (50%), Positives = 117/164 (71%), Gaps = 8/164 (4%)
Query: 335 IGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
+GC+PY I PCE ++NG+R SC+ TP+C++KCQ Y V Y D +G +YS+P +
Sbjct: 179 LGCQPYAIAPCEHHVNGTRPSCEGEGGKTPKCVKKCQDSYTVPYAKDKRYGSKSYSIPRH 238
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E+ I +EI +GPVEG+ T+Y D++ YK G+Y+HV G LG HAIRI+GWG E
Sbjct: 239 EDQIRKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVE------- 291
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ KYWL+ANS+N++WG+NG F+I+RG++ GIE+ I AGLPK+
Sbjct: 292 NNTKYWLIANSWNSDWGDNGFFKILRGEDHLGIESSIAAGLPKL 335
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 312 bits (799), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 160/286 (55%), Positives = 187/286 (65%), Gaps = 18/286 (6%)
Query: 59 LSKLT-LSELEMRMGVHPDSK---LPQNRLPL----LVQLSDPLEELPEGFDARINWPYC 110
KLT +S MGVHPD+ LP R+ L LV L + + +P+ FD+R WP+C
Sbjct: 44 FHKLTPMSHYRQLMGVHPDAHNYALPDKRMVLREEELVGLGNNM--IPKDFDSRKQWPHC 101
Query: 111 PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF 170
PTI EIRDQGSCGS WA GAVEAMSDRVCI S G + S+DDLVSCC CG GC GGF
Sbjct: 102 PTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSADDLVSCCHTCGFGCNGGF 161
Query: 171 HGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQP 229
G AW YWV GIVSGG Y S QGCRPYEI PCE ++NG+ C+ TP C KCQ
Sbjct: 162 PGAAWSYWVRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQA 221
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
Y V Y+ D +FG AYS+ N I EI HGPVEG+ T+Y D+ILYK G+Y+HV G
Sbjct: 222 SYKVDYKTDKHFGSRAYSISKNVHDIQEEIMTHGPVEGAFTVYEDLILYKDGVYEHVHGK 281
Query: 290 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAIRIIGWG E + YWLVANS+NT+WG NG F+I
Sbjct: 282 ELGGHAIRIIGWGVE-------KDIPYWLVANSWNTDWGNNGFFKI 320
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 91/163 (55%), Positives = 112/163 (68%), Gaps = 8/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C+ TP C KCQ Y V Y+ D +FG AYS+ N
Sbjct: 185 GCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHFGSRAYSISKNV 244
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI HGPVEG+ T+Y D+ILYK G+Y+HV G LG HAIRIIGWG E
Sbjct: 245 HDIQEEIMTHGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVE-------K 297
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWLVANS+NT+WG NG F+I+RG++ CGIE+ I+AGLPKI
Sbjct: 298 DIPYWLVANSWNTDWGNNGFFKILRGKDHCGIESSISAGLPKI 340
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
Length = 337
Score = 311 bits (798), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 156/290 (53%), Positives = 200/290 (68%), Gaps = 12/290 (4%)
Query: 48 KLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLP-LLVQLSDPLEELPEGFDARIN 106
K + A +N +L+ + MGVHPD+ + R P +L LSD +ELPE FD+R
Sbjct: 38 KATTWRAGQNFHPDTSLTYIRGLMGVHPDAD--KFREPEILHDLSDG-DELPENFDSREQ 94
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGC 166
WP CPTI+EIRDQGSCGS WA GAVEAMSDRVC+AS GK H R S++DLVSCC CG GC
Sbjct: 95 WPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCVASGGKIHFRFSAEDLVSCCHTCGFGC 154
Query: 167 QGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIR 225
GGF G AW YWV G+VSGG + S GC+PY I PCE ++NG+ SC+ TP+C++
Sbjct: 155 NGGFPGAAWSYWVRKGLVSGGPFGSNLGCQPYAIAPCEHHVNGTRPSCEGEGGKTPKCVK 214
Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 285
KCQ Y+V Y+ D FG +YS+ +E I +EI +GPVEG+ T+Y D++ YK G+Y+H
Sbjct: 215 KCQESYNVPYQKDKRFGASSYSIARHEAQIQKEIMTNGPVEGAFTVYEDLLHYKEGVYQH 274
Query: 286 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
V G LG HAIRI+GWG E GT KYWL+ANS+N++WG+NG F+I
Sbjct: 275 VTGKMLGGHAIRILGWGVE---NGT----KYWLIANSWNSDWGDNGFFKI 317
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 119/164 (72%), Gaps = 8/164 (4%)
Query: 335 IGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
+GC+PY I PCE ++NG+R SC+ TP+C++KCQ Y+V Y+ D FG +YS+ +
Sbjct: 181 LGCQPYAIAPCEHHVNGTRPSCEGEGGKTPKCVKKCQESYNVPYQKDKRFGASSYSIARH 240
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E I +EI +GPVEG+ T+Y D++ YK G+Y+HV G LG HAIRI+GWG E GT
Sbjct: 241 EAQIQKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVE---NGT- 296
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
KYWL+ANS+N++WG+NG F+I+RG++ GIE+ I+AGLPK+
Sbjct: 297 ---KYWLIANSWNSDWGDNGFFKILRGEDHLGIESSISAGLPKL 337
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
Length = 334
Score = 311 bits (796), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 154/291 (52%), Positives = 200/291 (68%), Gaps = 15/291 (5%)
Query: 48 KLPFYGAEKNALSKLTLSELEMRMGVHPDSK--LPQNRLPLLVQLSDPLEELPEGFDARI 105
K + A N + +S + MGV P+SK +P + LL + E+P+ FDAR
Sbjct: 34 KQSTWKAGPNFAENVPMSYIRRLMGVPPNSKYHMPSVKRHLLDAM-----EIPDDFDARK 88
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNG 165
WP CPTI+EIRDQGSCGS WA GAVEAMSDRVCI S+G +VRLS+DDLVSCC CG G
Sbjct: 89 QWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKGAVNVRLSADDLVSCCYSCGMG 148
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECI 224
C GGF G AW YWV GIVSGG++ S QGCRPYEI PCE ++NG+ C ++ TP C
Sbjct: 149 CNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPYEIAPCEHHVNGTRPPCTGDDNKTPSCK 208
Query: 225 RKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK 284
++C+ GY+V Y+ D NFG+ AYS+ + + I +EI +GPVEG+ +Y D++ YK G+Y+
Sbjct: 209 QQCEKGYNVPYKKDKNFGKEAYSISSEVQQIQKEIMTNGPVEGAFEVYEDLLSYKKGVYQ 268
Query: 285 HVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HV G LG HAIRI+GWG E +GT YWL+ANS+N++WG+NG F+I
Sbjct: 269 HVKGEALGGHAIRILGWGTE---KGTP----YWLIANSWNSDWGDNGTFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 83/162 (51%), Positives = 118/162 (72%), Gaps = 8/162 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C ++ TP C ++C+ GY+V Y+ D NFG+ AYS+ +
Sbjct: 177 GCRPYEIAPCEHHVNGTRPPCTGDDNKTPSCKQQCEKGYNVPYKKDKNFGKEAYSISSEV 236
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +EI +GPVEG+ +Y D++ YK G+Y+HV G LG HAIRI+GWG E +GT
Sbjct: 237 QQIQKEIMTNGPVEGAFEVYEDLLSYKKGVYQHVKGEALGGHAIRILGWGTE---KGTP- 292
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL+ANS+N++WG+NG F+I+RG++ CGIE+ I AG+PK
Sbjct: 293 ---YWLIANSWNSDWGDNGTFKILRGEDHCGIESSIVAGIPK 331
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
Length = 340
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 157/284 (55%), Positives = 191/284 (67%), Gaps = 13/284 (4%)
Query: 56 KNALSKLTLSELEMRMGVHPDSK---LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPT 112
+N + ++ + MGVHPDS LP+ R L D ++LPE FD+ NWP CPT
Sbjct: 46 RNFDAAVSEHHIRALMGVHPDSHKFTLPEKRELLGADGED--KDLPEEFDSSKNWPNCPT 103
Query: 113 IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHG 172
I+EIRDQGSCGS WA GAVEAMSDRVCI S + S+DDLV+CC CG GC GGF G
Sbjct: 104 IREIRDQGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDLVTCCHTCGFGCNGGFPG 163
Query: 173 KAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
AW YW T GIVSGG+Y S +GCRPYE+ PCE +++G C +TP C +CQP Y
Sbjct: 164 AAWSYWTTRGIVSGGSYNSTEGCRPYEVEPCEHHVDGPRPPCHSG--STPHCKHQCQPNY 221
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 291
V YE D +FG +YS+ N I REI +GPVEG+ T+Y D+ILYKTG+Y+HV G L
Sbjct: 222 SVDYEKDKHFGASSYSINRNPRNIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGKQL 281
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HAIRIIGWG GE S V YWL+ANS+NT+WG+NG FRI
Sbjct: 282 GGHAIRIIGWGV--WGE---SKVPYWLIANSWNTDWGDNGFFRI 320
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 92/163 (56%), Positives = 118/163 (72%), Gaps = 8/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYE+ PCE +++G R C + +TP C +CQP Y V YE D +FG +YS+ N
Sbjct: 185 GCRPYEVEPCEHHVDGPRPPCHSG--STPHCKHQCQPNYSVDYEKDKHFGASSYSINRNP 242
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I REI +GPVEG+ T+Y D+ILYKTG+Y+HV G LG HAIRIIGWG GE S
Sbjct: 243 RNIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGKQLGGHAIRIIGWGV--WGE---S 297
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+NT+WG+NG FRI+RG++ CGIE+ I+AGLPK+
Sbjct: 298 KVPYWLIANSWNTDWGDNGFFRILRGKDHCGIESQISAGLPKL 340
>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
Length = 334
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 149/289 (51%), Positives = 200/289 (69%), Gaps = 11/289 (3%)
Query: 48 KLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW 107
K + A +N +S + MGVH D+ + P+++ D ++LPE FDAR W
Sbjct: 36 KATTWRAGRNFHPDTPMSYIRGLMGVHKDAD--KFMPPVMLHDLDEGDDLPENFDAREQW 93
Query: 108 PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQ 167
P CPTI+EIRDQGSCGS WA GAVEAMSDR+CI S+GK H R+S++DLVSCC CG GC
Sbjct: 94 PNCPTIREIRDQGSCGSCWAFGAVEAMSDRICIHSKGKVHFRVSAEDLVSCCHTCGFGCN 153
Query: 168 GGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRK 226
GGF G AW YWV G+VSGG Y S QGC+PY I PCE ++NG+ C + E TP+C++K
Sbjct: 154 GGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNGTRGPC-NGEGKTPKCVKK 212
Query: 227 CQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV 286
CQ Y+V Y D FG+ +YS+ ++E+ I +E+F +GPVEG+ T+Y D++ YK G+Y+H
Sbjct: 213 CQASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGPVEGAFTVYEDLLNYKEGVYQHT 272
Query: 287 AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
AG LG HAIRI+GWG E + K+WL+ANS+N++WG+NG F+I
Sbjct: 273 AGKMLGGHAIRILGWGVE-------NDTKFWLIANSWNSDWGDNGYFKI 314
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 81/163 (49%), Positives = 117/163 (71%), Gaps = 9/163 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I PCE ++NG+R C E TP+C++KCQ Y+V Y D FG+ +YS+ ++E
Sbjct: 180 GCQPYAISPCEHHVNGTRGPCNG-EGKTPKCVKKCQASYNVPYAKDKFFGKSSYSIASHE 238
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +E+F +GPVEG+ T+Y D++ YK G+Y+H AG LG HAIRI+GWG E +
Sbjct: 239 QQIQKELFTNGPVEGAFTVYEDLLNYKEGVYQHTAGKMLGGHAIRILGWGVE-------N 291
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
K+WL+ANS+N++WG+NG F+I+RG + GIE+ I AGLPK+
Sbjct: 292 DTKFWLIANSWNSDWGDNGYFKILRGSDHLGIESSIAAGLPKV 334
>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 328
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 148/285 (51%), Positives = 194/285 (68%), Gaps = 11/285 (3%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N ++ + MGV PD K N +P ++ E+P FDAR WP+CP
Sbjct: 35 WTAGRNFAQDKSMDYIIKLMGVLPDHK---NYMPPVLTHKLEALEIPADFDARQQWPHCP 91
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
TI+EIRDQGSCGS WA GAVEAMSDRVCI S G+ + SSDDLVSCC CG GC GG+
Sbjct: 92 TIREIRDQGSCGSCWAFGAVEAMSDRVCIHSNGESNFHFSSDDLVSCCWTCGMGCNGGYP 151
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
G AW YWV G+VSGG Y +KQGCRPYEI PCE + NGS +C +E NTP+C + C+
Sbjct: 152 GAAWHYWVRKGLVSGGQYGTKQGCRPYEIPPCEHHTNGSRPACDASEGNTPKCAKSCESN 211
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y ++Y +DL+FG AYS+ ++ + I EI ++GPVEG+ ++YAD + YKTG+Y+H+ G
Sbjct: 212 YKINYSNDLHFGSKAYSISSDVKQIQAEILQNGPVEGAFSVYADFVNYKTGVYQHIKGQF 271
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAIRI GWG E + YWL+ANS+NT+WG++G F+I
Sbjct: 272 LGGHAIRIFGWGVE-------NNTPYWLIANSWNTDWGDSGTFKI 309
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 85/164 (51%), Positives = 119/164 (72%), Gaps = 8/164 (4%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+ GCRPYEIP CE + NGSR +C A+E NTP+C + C+ Y ++Y +DL+FG AYS+ +
Sbjct: 172 KQGCRPYEIPPCEHHTNGSRPACDASEGNTPKCAKSCESNYKINYSNDLHFGSKAYSISS 231
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + I EI ++GPVEG+ ++YAD + YKTG+Y+H+ G LG HAIRI GWG E
Sbjct: 232 DVKQIQAEILQNGPVEGAFSVYADFVNYKTGVYQHIKGQFLGGHAIRIFGWGVE------ 285
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ YWL+ANS+NT+WG++G F+I+RG + CGIE+ I AGLPK
Sbjct: 286 -NNTPYWLIANSWNTDWGDSGTFKILRGSDHCGIESGIVAGLPK 328
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
Length = 338
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 160/281 (56%), Positives = 190/281 (67%), Gaps = 13/281 (4%)
Query: 61 KLTLSELEMR--MGVHPDSK---LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
K ++SE +R MGVHPD+ LP+ R+ L +D ++PE FDAR WP CPTI E
Sbjct: 45 KESVSEEYIRGLMGVHPDAHKFALPEKRIVLGDLYADDGVDIPEEFDARKAWPNCPTIGE 104
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
IRDQGSCGS WA GAVEAMSDRVCI S GK + LS+DDLVSCC CG GC GGF G AW
Sbjct: 105 IRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHLSADDLVSCCHICGFGCNGGFPGAAW 164
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
YW GIVSGG Y S QGCRPYEI PCE ++NG+ C +TP C KCQ Y V
Sbjct: 165 SYWTRKGIVSGGPYGSTQGCRPYEIAPCEHHVNGTRPPCSHG--STPSCQHKCQASYSVE 222
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
Y D NFG +YS+ N I +EI +GPVEG+ T+Y D+ILYK+G+Y+H G LG H
Sbjct: 223 YAKDKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGH 282
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
AIRI+GWG GE S V YWL+ NS+NT+WG+NG FRI
Sbjct: 283 AIRILGWGV--WGE---SKVPYWLIGNSWNTDWGDNGFFRI 318
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 90/163 (55%), Positives = 115/163 (70%), Gaps = 8/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C + +TP C KCQ Y V Y D NFG +YS+ N
Sbjct: 183 GCRPYEIAPCEHHVNGTRPPC--SHGSTPSCQHKCQASYSVEYAKDKNFGSKSYSVRRNV 240
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI +GPVEG+ T+Y D+ILYK+G+Y+H G LG HAIRI+GWG GE S
Sbjct: 241 AEIQQEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGV--WGE---S 295
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ NS+NT+WG+NG FRI+RGQ+ CGIE+ I+AGLPK+
Sbjct: 296 KVPYWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAGLPKL 338
>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
Length = 340
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 149/289 (51%), Positives = 196/289 (67%), Gaps = 10/289 (3%)
Query: 48 KLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW 107
K + A N + ++S + MGVH D+ + P+ + + ++ PE FD+R W
Sbjct: 41 KATTWKAGPNFSPETSMSFIRGLMGVHKDAD--KFMPPVYLHEMEADDDFPENFDSRTQW 98
Query: 108 PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQ 167
P CPTI EIRDQGSCGS WA GAVEAMSDR+CI S GK H R+SS+DLVSCC CG GC
Sbjct: 99 PNCPTIGEIRDQGSCGSCWAFGAVEAMSDRICIHSEGKVHFRVSSEDLVSCCHTCGFGCN 158
Query: 168 GGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRK 226
GGF G AW YWV G+VSGG + S QGC+PY I PCE ++NGS SC+ TP+C++K
Sbjct: 159 GGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAPCEHHVNGSRPSCEGEGGKTPKCVKK 218
Query: 227 CQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV 286
CQ Y+V Y D +G+ +YS+ +E+ I +EI +GPVEG+ T+Y D++ YK G+Y HV
Sbjct: 219 CQASYNVPYAKDKMYGKSSYSIANHEKQIQKEIMTNGPVEGAFTVYEDLLNYKEGVYHHV 278
Query: 287 AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G LG HAIRI+GWG E +GT KYWL+ANS+N++WG+NG F+I
Sbjct: 279 HGKMLGGHAIRILGWGVE---DGT----KYWLIANSWNSDWGDNGFFKI 320
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 84/163 (51%), Positives = 118/163 (72%), Gaps = 8/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I PCE ++NGSR SC+ TP+C++KCQ Y+V Y D +G+ +YS+ +E
Sbjct: 185 GCQPYAIAPCEHHVNGSRPSCEGEGGKTPKCVKKCQASYNVPYAKDKMYGKSSYSIANHE 244
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +EI +GPVEG+ T+Y D++ YK G+Y HV G LG HAIRI+GWG E +GT
Sbjct: 245 KQIQKEIMTNGPVEGAFTVYEDLLNYKEGVYHHVHGKMLGGHAIRILGWGVE---DGT-- 299
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
KYWL+ANS+N++WG+NG F+I+RG++ GIE+ I AGLPK+
Sbjct: 300 --KYWLIANSWNSDWGDNGFFKILRGEDHLGIESSIAAGLPKV 340
>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 150/286 (52%), Positives = 197/286 (68%), Gaps = 9/286 (3%)
Query: 51 FYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYC 110
++ A +N +S L+ MGVH +S +L LV +D +LPE FDAR +WP C
Sbjct: 44 YWSAGRNFHKNTPMSYLKGLMGVH-ESNAHYPKLEQLVSYTDTPTDLPENFDAREHWPNC 102
Query: 111 PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF 170
PTI+E+RDQGSCGS WA GAVEAMSDRVCI S+G ++ S+++LVSCC+ CG GC GGF
Sbjct: 103 PTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAENLVSCCRTCGFGCNGGF 162
Query: 171 HGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQP 229
G AW YW T GIVSGG Y SK GC PYEI PCE ++NG+ C++ TP C++KC+
Sbjct: 163 PGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGPCKEGG-KTPACVKKCED 221
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
GY V Y DL+ G+ AYSL + + I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG
Sbjct: 222 GYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGK 281
Query: 290 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAIRI+GWG + + + YWLVANS+N++WG +G F+I
Sbjct: 282 ALGGHAIRILGWGVQ------NGEIPYWLVANSWNSDWGSDGFFKI 321
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 84/163 (51%), Positives = 115/163 (70%), Gaps = 8/163 (4%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
++GC PYEI PCE ++NG+R C+ TP C++KC+ GY V Y DL+ G+ AYSL
Sbjct: 184 KMGCIPYEIAPCEHHVNGTRGPCKEGG-KTPACVKKCEDGYKVPYAQDLHRGKSAYSLGN 242
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG LG HAIRI+GWG +
Sbjct: 243 DVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ------ 296
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ + YWLVANS+N++WG +G F+I+RG +ECGIE I AGLP
Sbjct: 297 NGEIPYWLVANSWNSDWGSDGFFKILRGSDECGIEGQINAGLP 339
>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 305 bits (782), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 158/286 (55%), Positives = 186/286 (65%), Gaps = 18/286 (6%)
Query: 59 LSKLT-LSELEMRMGVHPDSK---LPQNRLPL----LVQLSDPLEELPEGFDARINWPYC 110
KLT +S MGVHPD+ LP R+ L LV L + + +P+ FD+R WP+C
Sbjct: 44 FHKLTPMSHYRQLMGVHPDAHYYALPDKRMVLREEELVGLGNDM--IPKEFDSRNQWPHC 101
Query: 111 PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF 170
PTI EIRDQGSCGS WA GAVEAMSDRVCI S G + S+DDLVSCC CG GC GGF
Sbjct: 102 PTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSADDLVSCCHTCGFGCNGGF 161
Query: 171 HGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQP 229
G AW YWV GIVSGG Y S QGCRPYEI PCE ++NG+ C+ TP C KCQ
Sbjct: 162 PGAAWGYWVRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQA 221
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
Y V Y+ D +FG AYS+ N I EI +GPVEG+ T+Y D+ILYK G+Y+HV G
Sbjct: 222 SYKVDYKTDKHFGSRAYSISKNVRDIQGEIMTNGPVEGAFTVYEDLILYKDGVYEHVHGK 281
Query: 290 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAIRIIGWG E YWL+ANS+NT+WG NG F+I
Sbjct: 282 ELGGHAIRIIGWGVE-------KDTPYWLIANSWNTDWGNNGFFKI 320
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 89/163 (54%), Positives = 111/163 (68%), Gaps = 8/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C+ TP C KCQ Y V Y+ D +FG AYS+ N
Sbjct: 185 GCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHFGSRAYSISKNV 244
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVEG+ T+Y D+ILYK G+Y+HV G LG HAIRIIGWG E
Sbjct: 245 RDIQGEIMTNGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVE-------K 297
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+NT+WG NG F+I+RG++ CGIE+ I+AGLPKI
Sbjct: 298 DTPYWLIANSWNTDWGNNGFFKILRGKDHCGIESSISAGLPKI 340
>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
Length = 344
Score = 303 bits (776), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 151/296 (51%), Positives = 187/296 (63%), Gaps = 11/296 (3%)
Query: 44 ILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSK---LPQNRLPLLVQLSDPLEELPEG 100
I+ K + +N + S MGVHPD+ L + L L ++ ++PE
Sbjct: 36 IVRSKAKTWTPGRNYDKSVPRSHFRRLMGVHPDAHKFTLHEKSLVLGEEVGLADSDVPEE 95
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FDAR WP CPTI EIRDQGSCGS WA GAVEAMSDR+CI S H S+DDLVSCC
Sbjct: 96 FDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRLCIHSNATIHFHFSADDLVSCCH 155
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPN 219
CG GC GGF G AW YW GIVSGG Y S QGCRPYEI PCE ++NG+ C
Sbjct: 156 TCGFGCNGGFPGAAWAYWTRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCDGEHGK 215
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
TP C +CQ YDV Y+ D +FG +YS+ N + I +EI ++GPVEG+ T+Y D+ILYK
Sbjct: 216 TPSCRHECQKSYDVDYKTDKHFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYK 275
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y+HV G LG HAIRI+GWG E + YWL+ANS+NT+WG NG F++
Sbjct: 276 DGVYQHVHGRELGGHAIRILGWGVE-------NKTPYWLIANSWNTDWGNNGFFKM 324
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 85/163 (52%), Positives = 114/163 (69%), Gaps = 8/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C TP C +CQ YDV Y+ D +FG +YS+ N
Sbjct: 189 GCRPYEIAPCEHHVNGTRPPCDGEHGKTPSCRHECQKSYDVDYKTDKHFGSKSYSVKRNV 248
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +EI ++GPVEG+ T+Y D+ILYK G+Y+HV G LG HAIRI+GWG E +
Sbjct: 249 KDIQKEIMQNGPVEGAFTVYEDLILYKDGVYQHVHGRELGGHAIRILGWGVE-------N 301
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+NT+WG NG F+++RG++ CGIE+ I AGLPK+
Sbjct: 302 KTPYWLIANSWNTDWGNNGFFKMLRGEDHCGIESAIAAGLPKV 344
>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
Length = 338
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/279 (55%), Positives = 186/279 (66%), Gaps = 13/279 (4%)
Query: 63 TLSELEMR--MGVHPDSK---LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIR 117
++SE +R MGVHPD+ LP+ L + +LPE FDAR WP CPTI EIR
Sbjct: 47 SVSEHHIRGLMGVHPDAHKFTLPEKSQVLGNLMEADGGDLPEEFDARTAWPDCPTIGEIR 106
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQGSCGS WA GAVEAMSDRVCI S + S+DDLVSCC CG GC GGF G AW Y
Sbjct: 107 DQGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDLVSCCHTCGFGCNGGFPGAAWSY 166
Query: 178 WVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
W GIVSGG+Y SK+GCRPYE+ PCE ++NG+ C +TP C+ KC+ GY V Y
Sbjct: 167 WTHKGIVSGGSYGSKEGCRPYEVEPCEHHVNGTRPPCHSG--STPRCMHKCESGYSVDYA 224
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
D +FG AYS+ N I REI +GPVEG+ T+Y D+ILYKTG+Y+HV G LG HAI
Sbjct: 225 KDKHFGAKAYSVNRNPLDIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGGHAI 284
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
RI+GWG + V YWL+ NS+NT+WG+NG FRI
Sbjct: 285 RILGWGVW-----GDNKVPYWLIGNSWNTDWGDNGFFRI 318
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 88/163 (53%), Positives = 116/163 (71%), Gaps = 8/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYE+ PCE ++NG+R C + +TP C+ KC+ GY V Y D +FG AYS+ N
Sbjct: 183 GCRPYEVEPCEHHVNGTRPPCHSG--STPRCMHKCESGYSVDYAKDKHFGAKAYSVNRNP 240
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I REI +GPVEG+ T+Y D+ILYKTG+Y+HV G LG HAIRI+GWG +
Sbjct: 241 LDIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGGHAIRILGWGVW-----GDN 295
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ NS+NT+WG+NG FRI+RG++ CGIE+ I+AGLPK+
Sbjct: 296 KVPYWLIGNSWNTDWGDNGFFRILRGEDHCGIESAISAGLPKL 338
>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
Length = 342
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 153/279 (54%), Positives = 187/279 (67%), Gaps = 13/279 (4%)
Query: 63 TLSELEMR--MGVHPDSK---LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIR 117
++SE +R MGVHPD+ LP+ L + D ++LPE FDAR WP CPTI EIR
Sbjct: 51 SVSEGHIRGLMGVHPDAHKFTLPEKSQVLGNLVGDDGDDLPESFDARTAWPNCPTIGEIR 110
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQGSCGS WA GAVEAMSDRVCI S G + S++DLVSCC CG GC GGF G AW Y
Sbjct: 111 DQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSAEDLVSCCHTCGFGCNGGFPGAAWSY 170
Query: 178 WVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
W GIVSGG+Y S +GCRPYEI PCE ++NG+ C++ TP C +C+ Y V Y
Sbjct: 171 WTHKGIVSGGSYNSNEGCRPYEIEPCEHHVNGTRPPCKNGR--TPSCKHQCESSYSVDYA 228
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
D +FG +YS+ N I REI +GPVEG+ T+Y D+ILYK+G+YKHV G LG HAI
Sbjct: 229 KDKHFGSKSYSIRRNPREIQREIMTNGPVEGAFTVYEDLILYKSGVYKHVHGKELGGHAI 288
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
RI+GWG S V YWL+ NS+NT+WG+NG FRI
Sbjct: 289 RILGWGVW-----GDSKVPYWLIGNSWNTDWGDNGFFRI 322
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 87/163 (53%), Positives = 112/163 (68%), Gaps = 8/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C+ TP C +C+ Y V Y D +FG +YS+ N
Sbjct: 187 GCRPYEIEPCEHHVNGTRPPCKNGR--TPSCKHQCESSYSVDYAKDKHFGSKSYSIRRNP 244
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I REI +GPVEG+ T+Y D+ILYK+G+YKHV G LG HAIRI+GWG S
Sbjct: 245 REIQREIMTNGPVEGAFTVYEDLILYKSGVYKHVHGKELGGHAIRILGWGVW-----GDS 299
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ NS+NT+WG+NG FRIVRG++ CGIE+ I+AGLP +
Sbjct: 300 KVPYWLIGNSWNTDWGDNGFFRIVRGEDHCGIESAISAGLPAL 342
>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
Length = 330
Score = 301 bits (770), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 137/254 (53%), Positives = 186/254 (73%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP++VQ ++ L+ LP+ FDAR WP CPT++EIRDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 KLPVMVQYTEGLK-LPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
K V +SS DL++CC CG GC GG+ AW +W T G+V+GG Y S GCRPY I P
Sbjct: 125 DAKVSVEISSQDLLTCCDSCGMGCNGGYPSAAWDFWATEGLVTGGLYNSHIGCRPYTIEP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS C +TP C KC+PGY SY+ D +FG+ +YS+P+N+ +IM E+F+
Sbjct: 185 CEHHVNGSRPPCSGEGGDTPNCDMKCEPGYSPSYKQDKHFGKTSYSVPSNQNSIMAELFK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ T+Y D +LYK+G+Y+H++G P+G HAI+I+GWG+E + V YWL AN
Sbjct: 245 NGPVEGAFTVYEDFLLYKSGVYQHMSGSPVGGHAIKILGWGEE-------NGVPYWLAAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+I
Sbjct: 298 SWNTDWGDNGYFKI 311
Score = 198 bits (503), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 86/163 (52%), Positives = 124/163 (76%), Gaps = 8/163 (4%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGCRPY I PCE ++NGSR C +TP C KC+PGY SY+ D +FG+ +YS+P+
Sbjct: 174 HIGCRPYTIEPCEHHVNGSRPPCSGEGGDTPNCDMKCEPGYSPSYKQDKHFGKTSYSVPS 233
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
N+ +IM E+F++GPVEG+ T+Y D +LYK+G+Y+H++G P+G HAI+I+GWG+E
Sbjct: 234 NQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSPVGGHAIKILGWGEE------ 287
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ V YWL ANS+NT+WG+NG F+I+RG++ CGIE++I AG+P
Sbjct: 288 -NGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329
>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
Length = 342
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 152/296 (51%), Positives = 196/296 (66%), Gaps = 11/296 (3%)
Query: 44 ILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDS---KLPQNRLPLLVQLSDPLEELPEG 100
++ K + A +N ++ + MGVHPD+ LP ++ +L LS ++++P+
Sbjct: 33 LVKTKTRTWQAGRNFDEGVSEEYIRGLMGVHPDAYKFALP-DKQEVLGYLSQKVDDIPKE 91
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FDAR WP CPTI EIRDQGSCGS WA GAVEAMSDRVCI S G + R S+DDLVSCC
Sbjct: 92 FDAREKWPNCPTINEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGNVNFRFSADDLVSCCH 151
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPN 219
CG GC GGF G AW YW GIVSGG Y SK GCRPYEI PCE ++NG+ + C +++
Sbjct: 152 TCGFGCNGGFPGAAWSYWTRKGIVSGGRYGSKTGCRPYEIAPCEHHVNGTRAPC-NHDSK 210
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
TP+C +C+ GY+V Y D +FG +YS+ N I EI +GPVEG+ T+Y D+ILYK
Sbjct: 211 TPKCQHQCEAGYNVEYSKDKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYK 270
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G+Y+H G LG HAIRI+GWG E V YWL+ANS+N +WG+ G FRI
Sbjct: 271 SGVYQHEHGKELGGHAIRILGWGVWGKEE-----VPYWLIANSWNDDWGDKGFFRI 321
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 84/165 (50%), Positives = 116/165 (70%), Gaps = 7/165 (4%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+ GCRPYEI PCE ++NG+R+ C ++ TP+C +C+ GY+V Y D +FG +YS+
Sbjct: 183 KTGCRPYEIAPCEHHVNGTRAPCN-HDSKTPKCQHQCEAGYNVEYSKDKHFGSKSYSVRR 241
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
N I EI +GPVEG+ T+Y D+ILYK+G+Y+H G LG HAIRI+GWG E
Sbjct: 242 NVRDIQEEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGKEE-- 299
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+N +WG+ G FRI+RG++ CGIE+ I+AGLPK+
Sbjct: 300 ---VPYWLIANSWNDDWGDKGFFRILRGEDHCGIESSISAGLPKL 341
>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
Length = 340
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 153/296 (51%), Positives = 187/296 (63%), Gaps = 10/296 (3%)
Query: 44 ILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSK---LPQNRLPLLVQLSDPLEELPEG 100
++ K + +N + +T + MGVHPD+ LP R L + L+ELPE
Sbjct: 31 VVRSKAKTWKVGRNFDASVTEGHIRRLMGVHPDAHKFALPDKREVLGDLYMNSLDELPEE 90
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FD+R WP CPTI EIRDQGSCGS WA GAVEAMSDRVCI S GK + S+DDLVSCC
Sbjct: 91 FDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCCH 150
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPN 219
CG GC GGF G AW YW GIVSGG Y S QGCRPYEI PCE ++NG+ C N
Sbjct: 151 TCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCA-NGSG 209
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
TP+C CQ Y V Y D +FG +YS+ N I EI +GPVEG+ T+Y D+ILYK
Sbjct: 210 TPKCSHVCQSSYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYK 269
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y+H G LG HAIRI+GWG + + YWL+ NS+NT+WG++G FRI
Sbjct: 270 DGVYQHEHGKELGGHAIRILGWGVW-----GNEKIPYWLIGNSWNTDWGDHGFFRI 320
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 85/163 (52%), Positives = 111/163 (68%), Gaps = 7/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C AN TP+C CQ Y V Y D +FG +YS+ N
Sbjct: 184 GCRPYEISPCEHHVNGTRPPC-ANGSGTPKCSHVCQSSYTVDYAKDKHFGSKSYSVKRNV 242
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVEG+ T+Y D+ILYK G+Y+H G LG HAIRI+GWG +
Sbjct: 243 REIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVW-----GNE 297
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL+ NS+NT+WG++G FRI+RGQ+ CGIE+ I+AGLPK+
Sbjct: 298 KIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPKL 340
>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
Length = 374
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 144/255 (56%), Positives = 184/255 (72%), Gaps = 10/255 (3%)
Query: 82 NRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIA 141
+RLP++ + + LP+ FDAR W CPTI+E+RDQGSCGS WA GAVEAMSDR+CIA
Sbjct: 108 SRLPIMRHKLEAVN-LPDDFDARKEWTGCPTIKEVRDQGSCGSCWAFGAVEAMSDRICIA 166
Query: 142 SRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI- 200
S+G H +SS+DL+SCC CG GC GGF AW+Y+ TG+VSGG Y + QGCRPY I
Sbjct: 167 SKGNVHAHISSEDLLSCCSSCGMGCNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIA 226
Query: 201 PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
PCE ++NG+ C P TP+C R C+ GY V YEDD NFG AYS+ +E+ IM EI
Sbjct: 227 PCEHHVNGTRLPCSGEGP-TPKCERTCEKGYKVKYEDDKNFGYTAYSVDNDEKQIMTEIM 285
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
+GPVEG+ T+YAD YK+G+Y+HV+GG LG HAIR++GWG E +GT YWLVA
Sbjct: 286 TNGPVEGAFTVYADFPTYKSGVYQHVSGGELGGHAIRVLGWGVE---DGTP----YWLVA 338
Query: 321 NSFNTNWGENGLFRI 335
NS+N++WG+NG F+I
Sbjct: 339 NSWNSDWGDNGFFKI 353
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 95/162 (58%), Positives = 120/162 (74%), Gaps = 9/162 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY I PCE ++NG+R C P TP+C R C+ GY V YEDD NFG AYS+ +E
Sbjct: 219 GCRPYSIAPCEHHVNGTRLPCSGEGP-TPKCERTCEKGYKVKYEDDKNFGYTAYSVDNDE 277
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ IM EI +GPVEG+ T+YAD YK+G+Y+HV+GG LG HAIR++GWG E +GT
Sbjct: 278 KQIMTEIMTNGPVEGAFTVYADFPTYKSGVYQHVSGGELGGHAIRVLGWGVE---DGTP- 333
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+N++WG+NG F+I+RGQNECGIE +I AGLPK
Sbjct: 334 ---YWLVANSWNSDWGDNGFFKILRGQNECGIEGEIVAGLPK 372
>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
Length = 340
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 154/296 (52%), Positives = 188/296 (63%), Gaps = 10/296 (3%)
Query: 44 ILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSK---LPQNRLPLLVQLSDPLEELPEG 100
++ K + +N + +T + MGVHPD+ LP R L + ++ELPE
Sbjct: 31 VVRSKAKTWTVGRNFDASVTEGHIRRLMGVHPDAHKFALPDKREVLGDLYVNSVDELPEE 90
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FD+R WP CPTI EIRDQGSCGS WA GAVEAMSDRVCI S GK + S+DDLVSCC
Sbjct: 91 FDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCCH 150
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPN 219
CG GC GGF G AW YW GIVSGG Y S QGCRPYEI PCE ++NG+ C
Sbjct: 151 TCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHG-GR 209
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
TP+C CQ GY V Y D +FG +YS+ N I EI +GPVEG+ T+Y D+ILYK
Sbjct: 210 TPKCSHVCQSGYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYK 269
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y+H G LG HAIRI+GWG GE + YWL+ NS+NT+WG++G FRI
Sbjct: 270 DGVYQHEHGKELGGHAIRILGWGV--WGE---EKIPYWLIGNSWNTDWGDHGFFRI 320
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 87/163 (53%), Positives = 113/163 (69%), Gaps = 7/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C A+ TP+C CQ GY V Y D +FG +YS+ N
Sbjct: 184 GCRPYEISPCEHHVNGTRPPC-AHGGRTPKCSHVCQSGYTVDYAKDKHFGSKSYSVRRNV 242
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVEG+ T+Y D+ILYK G+Y+H G LG HAIRI+GWG GE
Sbjct: 243 REIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGV--WGE---E 297
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL+ NS+NT+WG++G FRI+RGQ+ CGIE+ I+AGLPK+
Sbjct: 298 KIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPKL 340
>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 147/287 (51%), Positives = 195/287 (67%), Gaps = 11/287 (3%)
Query: 51 FYGAEKNALSKLTLSELEMRMGVH-PDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
++ A +N +S ++ MGVH +++ P +L L+ +D +LPE FDAR WP
Sbjct: 46 YWSAGRNFHKDTPISYIKGLMGVHEKNAEYP--KLEQLLTYNDASTDLPETFDARERWPN 103
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
CPTI+E+RDQGSCGS WA GAVEAMSDRVCI S G ++ S+++LVSCC CG GC GG
Sbjct: 104 CPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAENLVSCCWTCGFGCNGG 163
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQ 228
F G AW YW T GIVSGG Y S GC PYEI PCE ++NG+ C++ TP C++KC+
Sbjct: 164 FPGAAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGG-KTPTCVKKCE 222
Query: 229 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG 288
GY V Y DL+ G+ AYS+ + + I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG
Sbjct: 223 EGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAG 282
Query: 289 GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAIRI+GWG + + + YWLVANS+NT+WG +G F+I
Sbjct: 283 KALGGHAIRILGWGVQ------NGEIPYWLVANSWNTDWGSDGFFKI 323
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 84/162 (51%), Positives = 114/162 (70%), Gaps = 8/162 (4%)
Query: 335 IGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
+GC PYEI PCE ++NG+R C+ TP C++KC+ GY V Y DL+ G+ AYS+ +
Sbjct: 187 MGCIPYEIAPCEHHVNGTRGPCKEGG-KTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRND 245
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG LG HAIRI+GWG + +
Sbjct: 246 VDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ------N 299
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YWLVANS+NT+WG +G F+I+RG +ECGIE I AGLP
Sbjct: 300 GEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 341
>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
Length = 343
Score = 298 bits (764), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 150/299 (50%), Positives = 202/299 (67%), Gaps = 15/299 (5%)
Query: 38 DRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEEL 97
D +DH L + A +N + + L E++ MGV L RLP + D E+
Sbjct: 39 DFIDHINSLNTT--WKAHRNFGNDIPLREIKKLMGVR--RSLENFRLPE-KSMEDIDIEI 93
Query: 98 PEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVS 157
PE FD R WP CPT++EIRDQGSCGS WA GAVEAMSDRVCI S+GK H S++DL++
Sbjct: 94 PEEFDPREQWPECPTLKEIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAEDLLT 153
Query: 158 CCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDN 216
CC CG GC GG G AW YWV+TGIVSGG+Y S QGC+PY I PCE ++NG+ C
Sbjct: 154 CCSSCGFGCNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKPC--G 211
Query: 217 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 276
E +TP C+++C+ GYDV Y D +FG+ AY++P + + I +E+ +GP E ++T+Y D +
Sbjct: 212 EGDTPRCVKRCEEGYDVPYGKDRHFGKSAYAVPGSVKAIQKELLLNGPAEAALTVYDDFL 271
Query: 277 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
Y+TG+Y+HV+GG LG HA+R++GWG E +GT YWL+ANS+N +WG+NG FRI
Sbjct: 272 HYRTGVYQHVSGGALGGHAVRLLGWGVE---DGT----PYWLLANSWNYDWGDNGYFRI 323
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 83/163 (50%), Positives = 121/163 (74%), Gaps = 10/163 (6%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I PCE ++NG+R C E +TP C+++C+ GYDV Y D +FG+ AY++P +
Sbjct: 190 GCQPYAIEPCEHHVNGTRKPC--GEGDTPRCVKRCEEGYDVPYGKDRHFGKSAYAVPGSV 247
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +E+ +GP E ++T+Y D + Y+TG+Y+HV+GG LG HA+R++GWG E +GT
Sbjct: 248 KAIQKELLLNGPAEAALTVYDDFLHYRTGVYQHVSGGALGGHAVRLLGWGVE---DGT-- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+N +WG+NG FRI+RGQ+ECGIE+DI GLPK+
Sbjct: 303 --PYWLLANSWNYDWGDNGYFRILRGQDECGIESDINGGLPKV 343
>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
Length = 330
Score = 298 bits (764), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 154/289 (53%), Positives = 185/289 (64%), Gaps = 10/289 (3%)
Query: 51 FYGAEKNALSKLTLSELEMRMGVHPDSK---LPQNRLPLLVQLSDPLEELPEGFDARINW 107
F +N + +T + MGVHPD+ LP R L + ++ELPE FD+R W
Sbjct: 28 FIEVGRNFDASVTEGHIRRLMGVHPDAHKFALPDKREVLGDLYVNSVDELPEEFDSRKQW 87
Query: 108 PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQ 167
P CPTI EIRDQGSCGS WA GAVEAMSDRVCI S GK + S+DDLVSCC CG GC
Sbjct: 88 PNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCCHTCGFGCN 147
Query: 168 GGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRK 226
GGF G AW YW GIVSGG Y S QGCRPYEI PCE ++NG+ C TP+C
Sbjct: 148 GGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHG-GRTPKCSHV 206
Query: 227 CQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV 286
CQ GY V Y D +FG +YS+ N I EI +GPVEG+ T+Y D+ILYK G+Y+H
Sbjct: 207 CQSGYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHE 266
Query: 287 AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G LG HAIRI+GWG GE + YWL+ NS+NT+WG++G FRI
Sbjct: 267 HGKELGGHAIRILGWGV--WGE---EKIPYWLIGNSWNTDWGDHGFFRI 310
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 87/163 (53%), Positives = 113/163 (69%), Gaps = 7/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C A+ TP+C CQ GY V Y D +FG +YS+ N
Sbjct: 174 GCRPYEISPCEHHVNGTRPPC-AHGGRTPKCSHVCQSGYTVDYAKDKHFGSKSYSVRRNV 232
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVEG+ T+Y D+ILYK G+Y+H G LG HAIRI+GWG GE
Sbjct: 233 REIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGV--WGE---E 287
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL+ NS+NT+WG++G FRI+RGQ+ CGIE+ I+AGLPK+
Sbjct: 288 KIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPKL 330
>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
Length = 329
Score = 298 bits (762), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 156/276 (56%), Positives = 186/276 (67%), Gaps = 13/276 (4%)
Query: 61 KLTLSELEMR--MGVHPDSK---LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
K ++SE +R MGVHPD+ LP+ R+ L +D ++PE FDAR WP CPTI E
Sbjct: 45 KESVSEEYIRGLMGVHPDAHKFALPEKRIVLGDLYADDGIDIPEEFDARKAWPNCPTIGE 104
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
IRDQGSCGS WA GAVEAMSDRVCI S GK + LS+DDLVSCC CG GC GGF G AW
Sbjct: 105 IRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHLSADDLVSCCHICGFGCNGGFPGAAW 164
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
YW GIVSGG Y S QGCRPYEI PCE ++NG+ C +TP C KCQ Y V
Sbjct: 165 SYWTRKGIVSGGPYGSTQGCRPYEIAPCEHHVNGTRPPCSHG--STPSCQHKCQASYSVE 222
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
Y D NFG +YS+ N I +EI +GPVEG+ T+Y D+ILYK+G+Y+H G LG H
Sbjct: 223 YAKDKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGH 282
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
AIRI+GWG GE S V YWL+ NS+NT+WG+N
Sbjct: 283 AIRILGWGV--WGE---SKVPYWLIGNSWNTDWGDN 313
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 81/163 (49%), Positives = 105/163 (64%), Gaps = 17/163 (10%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C + +TP C KCQ Y V Y D NFG +YS+ N
Sbjct: 183 GCRPYEIAPCEHHVNGTRPPC--SHGSTPSCQHKCQASYSVEYAKDKNFGSKSYSVRRNV 240
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI +GPVEG+ T+Y D+ILYK+G+Y+H G LG HAIRI+GWG GE S
Sbjct: 241 AEIQQEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGV--WGE---S 295
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ NS+NT+WG+N + CGIE+ I+AGL +
Sbjct: 296 KVPYWLIGNSWNTDWGDN---------DHCGIESSISAGLSHL 329
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
Length = 340
Score = 297 bits (761), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 151/279 (54%), Positives = 185/279 (66%), Gaps = 13/279 (4%)
Query: 63 TLSELEMR--MGVHPDSK---LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIR 117
++SE +R MGVHPD+ LP L + D ++P FDAR W CPTI EIR
Sbjct: 49 SVSEKYIRGLMGVHPDADKFALPDKMEVLGKLVEDSDSDIPTEFDAREKWSNCPTIGEIR 108
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQGSCGS WA GAVEAMSDRVCI S+GK + LS+DDLVSCC CG GC GGF G AW Y
Sbjct: 109 DQGSCGSCWAFGAVEAMSDRVCIHSQGKVNFHLSADDLVSCCHTCGFGCNGGFPGAAWSY 168
Query: 178 WVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
W GIVSGG + S+QGCRPYEI PCE ++NG+ C +TP C C+ Y V Y+
Sbjct: 169 WTRKGIVSGGNFGSQQGCRPYEIEPCEHHVNGTRPPCSSG--STPRCQHVCESSYKVDYK 226
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
D NFG +YS+ N I +EI +GPVEG+ T+Y D+ILYK+G+Y+HV G LG HAI
Sbjct: 227 KDKNFGSKSYSIKNNVLDIQKEIMNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGGHAI 286
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
RI+GWG + YWL+ANS+NT+WG+NG FRI
Sbjct: 287 RILGWGV-----WGDEKIPYWLIANSWNTDWGDNGFFRI 320
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 86/163 (52%), Positives = 114/163 (69%), Gaps = 8/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C + +TP C C+ Y V Y+ D NFG +YS+ N
Sbjct: 185 GCRPYEIEPCEHHVNGTRPPCSSG--STPRCQHVCESSYKVDYKKDKNFGSKSYSIKNNV 242
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI +GPVEG+ T+Y D+ILYK+G+Y+HV G LG HAIRI+GWG
Sbjct: 243 LDIQKEIMNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGGHAIRILGWGV-----WGDE 297
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL+ANS+NT+WG+NG FRIVRG++ CGIE+ I+AGLPK+
Sbjct: 298 KIPYWLIANSWNTDWGDNGFFRIVRGKDHCGIESSISAGLPKL 340
>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
Length = 333
Score = 297 bits (760), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 147/289 (50%), Positives = 194/289 (67%), Gaps = 12/289 (4%)
Query: 48 KLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW 107
K + A +N LS MGVH D+ + P+++ D ++LPE FD+R W
Sbjct: 36 KATTWHAGRNFHPDTPLSYFRGLMGVHKDAD--KFMPPVMLHDLDEGDDLPENFDSREQW 93
Query: 108 PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQ 167
P CPTI+EIRDQGSCGS WA GAVEAMSDRVCI S+GK R+S++DL++CC +CG+GC
Sbjct: 94 PNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKVLFRVSAEDLLTCCTNCGHGCD 153
Query: 168 GGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRK 226
GG G WK+W+ G+VSGG + S QGCRPY I PC NG+ S C+D+ TP+CI+K
Sbjct: 154 GGAPGAGWKHWIEKGLVSGGPFGSDQGCRPYTIEPCVHVENGAQSPCKDSI--TPKCIKK 211
Query: 227 CQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV 286
C PGY+V Y D +FG+ YS+ +E I +EIF +GPVE + T++ D YK GIY+H
Sbjct: 212 CLPGYNVPYAKDKSFGKSTYSIANDERQIRKEIFTNGPVEATFTVFDDFASYKHGIYQHT 271
Query: 287 AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G GEHA+RI+GWG E GT KYWL ANS+N++WG+NG F+I
Sbjct: 272 SGNLAGEHAVRILGWGVE---NGT----KYWLAANSWNSDWGDNGYFKI 313
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 83/163 (50%), Positives = 111/163 (68%), Gaps = 10/163 (6%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY I PC NG++S C+ + TP+CI+KC PGY+V Y D +FG+ YS+ +E
Sbjct: 180 GCRPYTIEPCVHVENGAQSPCK--DSITPKCIKKCLPGYNVPYAKDKSFGKSTYSIANDE 237
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EIF +GPVE + T++ D YK GIY+H +G GEHA+RI+GWG E GT
Sbjct: 238 RQIRKEIFTNGPVEATFTVFDDFASYKHGIYQHTSGNLAGEHAVRILGWGVE---NGT-- 292
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
KYWL ANS+N++WG+NG F+I+RG N IE+ I AGLPK+
Sbjct: 293 --KYWLAANSWNSDWGDNGYFKILRGSNHVDIESAIVAGLPKV 333
>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
Length = 340
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 151/296 (51%), Positives = 185/296 (62%), Gaps = 10/296 (3%)
Query: 44 ILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSK---LPQNRLPLLVQLSDPLEELPEG 100
++ K + +N + +T + MGVHPD+ LP R L + ++ELPE
Sbjct: 31 VVRSKAKTWKVGRNFDASVTEGHIRRLMGVHPDAHKFALPDKREVLGDLYMNSVDELPEE 90
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FD+R WP CPTI EIRDQGSCGS WA GAVEAMSDRVCI S GK + S+DDLVSCC
Sbjct: 91 FDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCCH 150
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPN 219
CG GC GGF G AW YW GIVSGG Y S QGCRPYEI PCE ++NG+ C
Sbjct: 151 TCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHG-GG 209
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
TP+C CQ Y V Y D +FG +YS+ N I EI +GPVEG+ T+Y D+ILYK
Sbjct: 210 TPKCSHVCQSSYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYK 269
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y+H G LG HAIRI+GWG + YWL+ NS+NT+WG++G FRI
Sbjct: 270 DGVYQHEHGKELGGHAIRILGWGVW-----GDEKIPYWLIGNSWNTDWGDHGFFRI 320
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 84/163 (51%), Positives = 110/163 (67%), Gaps = 7/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C A+ TP+C CQ Y V Y D +FG +YS+ N
Sbjct: 184 GCRPYEISPCEHHVNGTRPPC-AHGGGTPKCSHVCQSSYTVDYAKDKHFGSKSYSVKRNV 242
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVEG+ T+Y D+ILYK G+Y+H G LG HAIRI+GWG
Sbjct: 243 REIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVW-----GDE 297
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL+ NS+NT+WG++G FRI+RGQ+ CGIE+ I+AGLPK+
Sbjct: 298 KIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPKL 340
>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
Length = 330
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 136/254 (53%), Positives = 187/254 (73%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP++VQ + ++ LP+ FDAR WP CPT++EIRDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 KLPVMVQYAGDVK-LPKEFDARQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
GK +V +SS+DL++CC CG GC GG+ AW +W + G+VSGG Y S GCRPY I P
Sbjct: 125 NGKVNVEISSEDLLTCCDSCGMGCNGGYPSAAWDFWASEGLVSGGLYESHIGCRPYTIAP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS C +TPEC+R+C+ GY SY D ++G+ +YS+P++E+ I EI++
Sbjct: 185 CEHHVNGSRPPCTGEGGDTPECVRQCESGYTPSYIQDKHYGKTSYSVPSDEQQIQTEIYK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ T+Y D +LYKTG+Y+HV+G +G HAI+++GWG+E GT YWL AN
Sbjct: 245 NGPVEGAFTVYEDFLLYKTGVYQHVSGSAVGGHAIKVLGWGEE---NGT----PYWLCAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+I
Sbjct: 298 SWNTDWGDNGYFKI 311
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 89/178 (50%), Positives = 126/178 (70%), Gaps = 15/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W GL IGCRPY I PCE ++NGSR C +TPEC+R+C+ GY SY
Sbjct: 160 WASEGLVSGGLYESHIGCRPYTIAPCEHHVNGSRPPCTGEGGDTPECVRQCESGYTPSYI 219
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D ++G+ +YS+P++E+ I EI+++GPVEG+ T+Y D +LYKTG+Y+HV+G +G HAI
Sbjct: 220 QDKHYGKTSYSVPSDEQQIQTEIYKNGPVEGAFTVYEDFLLYKTGVYQHVSGSAVGGHAI 279
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+++GWG+E GT YWL ANS+NT+WG+NG F+I+RG + CGIE++I AG+PK
Sbjct: 280 KVLGWGEE---NGT----PYWLCANSWNTDWGDNGYFKILRGSDHCGIESEIVAGIPK 330
>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
Length = 340
Score = 295 bits (755), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 156/325 (48%), Positives = 203/325 (62%), Gaps = 14/325 (4%)
Query: 12 FLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRM 71
F L SQ+ + SN F LS F ++H + + A +N + L M
Sbjct: 9 FAVVLVTSQAKKLKSNKYFNPLSDEF--INHINSMKST--WKAGRNFGKNFPMGALTQMM 64
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
GVHPDS L L + Q+ + +PE FDAR WP CPTIQEIRDQGSCGS WA GAV
Sbjct: 65 GVHPDSNLYMPPLKNVSQMYSN-QAIPEAFDAREQWPDCPTIQEIRDQGSCGSCWAFGAV 123
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
EAMSDR+CI S+G+ + LS+++LVSCC CG GC GGF G AW +WV GIV+GG + S
Sbjct: 124 EAMSDRICIHSKGEVNAHLSAENLVSCCYTCGFGCNGGFPGAAWSHWVKKGIVTGGNFNS 183
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
QGC+PY IP CE + G C + TP+C++ C+ GY V Y DL++G +YS+
Sbjct: 184 SQGCQPYIIPACEHHTTGDRPPCSEGG-GTPKCLKTCEDGYTVDYTQDLHYGASSYSVHK 242
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I EI +GPVEG++T+Y D YK+G+Y+HV G LG HAIRI+GWG E EG
Sbjct: 243 RMEDIQLEIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGGHAIRILGWGVE---EG- 298
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
V YWL+ANS+NT+WG+NG ++
Sbjct: 299 ---VPYWLIANSWNTDWGDNGYIKL 320
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 82/163 (50%), Positives = 111/163 (68%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY IP CE + G R C TP+C++ C+ GY V Y DL++G +YS+
Sbjct: 186 GCQPYIIPACEHHTTGDRPPCSEGG-GTPKCLKTCEDGYTVDYTQDLHYGASSYSVHKRM 244
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E I EI +GPVEG++T+Y D YK+G+Y+HV G LG HAIRI+GWG E EG
Sbjct: 245 EDIQLEIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGGHAIRILGWGVE---EG--- 298
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+NT+WG+NG +++RG++ CGIE+ ITAGLPK+
Sbjct: 299 -VPYWLIANSWNTDWGDNGYIKLLRGKDHCGIESQITAGLPKL 340
>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
Length = 330
Score = 295 bits (755), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 135/254 (53%), Positives = 183/254 (72%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP++VQ ++ L+ LP+ FDAR WP CPT++EIRDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 KLPVMVQYTEGLK-LPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
K V +SS DL++CC CG GC GG+ AW +W T G+V+GG Y S GCRPY I P
Sbjct: 125 NAKVSVEISSQDLLTCCDSCGMGCNGGYPSAAWDFWTTDGLVTGGLYNSHIGCRPYTIEP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS C +TP C KC+PGY Y++D +FG+ +YS+P+N+ IM E+F+
Sbjct: 185 CEHHVNGSRPPCTGEGGDTPNCDMKCEPGYSPLYKEDKHFGKTSYSVPSNQNGIMAELFK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVE + T+Y D +LYK+G+Y+H++G LG HAI+I+GWG+E + V YWL AN
Sbjct: 245 NGPVEAAFTVYEDFLLYKSGVYQHMSGSALGGHAIKILGWGEE-------NGVPYWLAAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+I
Sbjct: 298 SWNTDWGDNGYFKI 311
Score = 191 bits (485), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 84/163 (51%), Positives = 121/163 (74%), Gaps = 8/163 (4%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGCRPY I PCE ++NGSR C +TP C KC+PGY Y++D +FG+ +YS+P+
Sbjct: 174 HIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMKCEPGYSPLYKEDKHFGKTSYSVPS 233
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
N+ IM E+F++GPVE + T+Y D +LYK+G+Y+H++G LG HAI+I+GWG+E
Sbjct: 234 NQNGIMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGGHAIKILGWGEE------ 287
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ V YWL ANS+NT+WG+NG F+I+RG++ CGIE++I AG+P
Sbjct: 288 -NGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329
>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
Length = 342
Score = 294 bits (753), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 137/256 (53%), Positives = 182/256 (71%), Gaps = 9/256 (3%)
Query: 82 NRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIA 141
+LP V L+D +LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+C+
Sbjct: 67 KQLPQRVMLADDDMKLPENFDAREQWPKCPTIKEIRDQGSCGSCWAFGAVEAISDRICVH 126
Query: 142 SRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI 200
+ G + +S++DL+SCC CG GC GGF AWKYW+ G+VSGG Y S GCRPY I
Sbjct: 127 TNGYITIEVSAEDLLSCCGLQCGEGCNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPYSI 186
Query: 201 P-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 259
P CE ++NGS +C +TP+C +KC+ GY Y+DD ++G AY++P++E+ IM EI
Sbjct: 187 PPCEHHVNGSRPACTGEGGDTPKCNKKCEAGYSPDYKDDKHYGTTAYNVPSSEKEIMAEI 246
Query: 260 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 319
+++GPVEG+ +YAD + YK+G+Y+HV G LG HAIR++GWG E V YWL
Sbjct: 247 YKNGPVEGAFIVYADFLQYKSGVYQHVTGDMLGGHAIRVLGWGVE-------DGVPYWLA 299
Query: 320 ANSFNTNWGENGLFRI 335
ANS+NT+WG+NG F+I
Sbjct: 300 ANSWNTDWGDNGFFKI 315
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 87/185 (47%), Positives = 129/185 (69%), Gaps = 15/185 (8%)
Query: 315 KYWLVANSFNTNWGENGLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQP 371
KYW+ + GL+ +GCRPY IP CE ++NGSR +C +TP+C +KC+
Sbjct: 162 KYWIKKGLVS-----GGLYDSHVGCRPYSIPPCEHHVNGSRPACTGEGGDTPKCNKKCEA 216
Query: 372 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
GY Y+DD ++G AY++P++E+ IM EI+++GPVEG+ +YAD + YK+G+Y+HV G
Sbjct: 217 GYSPDYKDDKHYGTTAYNVPSSEKEIMAEIYKNGPVEGAFIVYADFLQYKSGVYQHVTGD 276
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
LG HAIR++GWG E V YWL ANS+NT+WG+NG F+I+RG++ CGIE+++
Sbjct: 277 MLGGHAIRVLGWGVE-------DGVPYWLAANSWNTDWGDNGFFKILRGKDHCGIESEMV 329
Query: 492 AGLPK 496
AG+P+
Sbjct: 330 AGIPR 334
>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
Length = 337
Score = 294 bits (752), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 136/254 (53%), Positives = 184/254 (72%), Gaps = 10/254 (3%)
Query: 84 LPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASR 143
LP + L+D ++ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+C+ S
Sbjct: 69 LPQRMILADNMK-LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHSN 127
Query: 144 GKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP- 201
G +V +S++DL+SCC +CG+GC GGF AW +W G+VSGG Y S GCRPY IP
Sbjct: 128 GNANVEVSAEDLLSCCGSECGDGCNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIPP 187
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS +C E +TP C +KC+ GY Y+DD N+G +YS+P++E+ IM EI++
Sbjct: 188 CEHHVNGSRPACTGEEGDTPTCRKKCEEGYSTQYKDDKNYGSTSYSVPSSEQEIMAEIYK 247
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ ++Y D + YK+G+Y+HVAG LG HAIRI+GWG E + ++YWL AN
Sbjct: 248 NGPVEGAFSVYEDFLHYKSGVYQHVAGEMLGGHAIRILGWGVE-------NGIRYWLAAN 300
Query: 322 SFNTNWGENGLFRI 335
S+N +WG+NG F+
Sbjct: 301 SWNIDWGDNGFFKF 314
Score = 194 bits (493), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 122/164 (74%), Gaps = 8/164 (4%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR +C E +TP C +KC+ GY Y+DD N+G +YS+P+
Sbjct: 177 HVGCRPYSIPPCEHHVNGSRPACTGEEGDTPTCRKKCEEGYSTQYKDDKNYGSTSYSVPS 236
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y D + YK+G+Y+HVAG LG HAIRI+GWG E
Sbjct: 237 SEQEIMAEIYKNGPVEGAFSVYEDFLHYKSGVYQHVAGEMLGGHAIRILGWGVE------ 290
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ ++YWL ANS+N +WG+NG F+ +RG+N CGIE++I AG+P+
Sbjct: 291 -NGIRYWLAANSWNIDWGDNGFFKFLRGKNHCGIESEIIAGIPR 333
>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
Length = 330
Score = 293 bits (751), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 135/254 (53%), Positives = 183/254 (72%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
RLP++VQ +D L+ LP FDAR WP CPT++EIRDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 RLPVMVQYADDLK-LPTNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
K V +S+ DL++CC CG GC GG+ AW +W + G+V+GG Y S GCRPY I P
Sbjct: 125 NAKVSVEISAQDLLTCCDGCGMGCNGGYPSAAWDFWSSDGLVTGGLYNSHIGCRPYTIEP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS C +TP C C+PGY SY+ D +FG+ +YS+P+N++ IM+E+++
Sbjct: 185 CEHHVNGSRPPCTGEGGDTPNCDMSCEPGYSPSYKQDKHFGKTSYSVPSNQKDIMKELYK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ T+Y D + YK+G+Y+HV+G LG HAI+I+GWG+E + V YWL AN
Sbjct: 245 NGPVEGAFTVYEDFLSYKSGVYQHVSGPALGGHAIKILGWGEE-------NGVPYWLAAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+I
Sbjct: 298 SWNTDWGDNGYFKI 311
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 87/178 (48%), Positives = 127/178 (71%), Gaps = 15/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W +GL IGCRPY I PCE ++NGSR C +TP C C+PGY SY+
Sbjct: 160 WSSDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMSCEPGYSPSYK 219
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D +FG+ +YS+P+N++ IM+E++++GPVEG+ T+Y D + YK+G+Y+HV+G LG HAI
Sbjct: 220 QDKHFGKTSYSVPSNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGVYQHVSGPALGGHAI 279
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+I+GWG+E + V YWL ANS+NT+WG+NG F+I+RG++ CGIE++I AG+P+
Sbjct: 280 KILGWGEE-------NGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIPQ 330
>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
Length = 330
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 139/254 (54%), Positives = 179/254 (70%), Gaps = 10/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
RLP LVQ SD LP+ FDAR+ WP CPTI+EIRDQGSCGS WA GA EA+SDR CI S
Sbjct: 66 RLPELVQ-SDEDMSLPDSFDARLQWPNCPTIKEIRDQGSCGSCWAFGAAEAISDRYCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
GK V +S++DL+SCC CG GC GGF AW YW +G+V+GG Y S GCRPY I P
Sbjct: 125 NGKVSVEISAEDLLSCCDACGMGCMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NG+ C E +TP+C+ +C GY SY+ D FG+ YS+P E+ IM E+++
Sbjct: 185 CEHHVNGTRPPCT-GEGDTPKCVSECNAGYTPSYKKDKRFGKQTYSVPPKEQQIMTELYK 243
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVE + ++Y D +LYKTG+Y+HV G LG HAI+I+GWG+E + YWLVAN
Sbjct: 244 NGPVEAAFSVYEDFLLYKTGVYQHVTGQMLGGHAIKILGWGKE-------NNTPYWLVAN 296
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+I
Sbjct: 297 SWNTDWGDNGFFKI 310
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 88/179 (49%), Positives = 126/179 (70%), Gaps = 16/179 (8%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E+GL IGCRPY I PCE ++NG+R C E +TP+C+ +C GY SY+
Sbjct: 160 WAESGLVTGGLYGSNIGCRPYSIAPCEHHVNGTRPPCTG-EGDTPKCVSECNAGYTPSYK 218
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D FG+ YS+P E+ IM E++++GPVE + ++Y D +LYKTG+Y+HV G LG HAI
Sbjct: 219 KDKRFGKQTYSVPPKEQQIMTELYKNGPVEAAFSVYEDFLLYKTGVYQHVTGQMLGGHAI 278
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+I+GWG+E + YWLVANS+NT+WG+NG F+I+RG++ECGIE++I AG+P++
Sbjct: 279 KILGWGKE-------NNTPYWLVANSWNTDWGDNGFFKILRGKDECGIESEIVAGIPRL 330
>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
Length = 340
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 151/296 (51%), Positives = 187/296 (63%), Gaps = 10/296 (3%)
Query: 44 ILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSK---LPQNRLPLLVQLSDPLEELPEG 100
++ K + +N S +T + MGVHPD+ L R L + ++++PE
Sbjct: 31 LVRSKAKTWTVGRNFDSSVTEGYIRRLMGVHPDAHKFALADKREVLGDLYMNTVDQIPEE 90
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FD+R WP CPTI EIRDQG CGS WA GAVEAMSDRVCI S GK + S+DDLVSCC
Sbjct: 91 FDSRKQWPNCPTIGEIRDQGECGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCCH 150
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPN 219
CG GC GGF G AW YW GIVSGG Y S QGCRPYEI PCE ++NG+ C +
Sbjct: 151 TCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEIAPCEHHVNGTRPPC-GHGGG 209
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
TP+C C+ GY V Y D +FG +YS+ N I EI +GPVEG+ T+Y D+ILYK
Sbjct: 210 TPKCSHVCESGYTVDYAKDKHFGSKSYSVKRNVRDIQEEIMTNGPVEGAFTVYEDLILYK 269
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y+H G LG HAIRI+GWG GE + YWL+ NS+NT+WG+NG FRI
Sbjct: 270 DGVYQHQHGKELGGHAIRILGWGV--WGE---EKIPYWLIGNSWNTDWGDNGFFRI 320
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 86/163 (52%), Positives = 112/163 (68%), Gaps = 7/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C + TP+C C+ GY V Y D +FG +YS+ N
Sbjct: 184 GCRPYEIAPCEHHVNGTRPPC-GHGGGTPKCSHVCESGYTVDYAKDKHFGSKSYSVKRNV 242
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVEG+ T+Y D+ILYK G+Y+H G LG HAIRI+GWG GE
Sbjct: 243 RDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHQHGKELGGHAIRILGWGV--WGE---E 297
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL+ NS+NT+WG+NG FRI+RGQ+ CGIE+ I+AGLPK+
Sbjct: 298 KIPYWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAGLPKL 340
>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
Length = 330
Score = 292 bits (747), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 136/254 (53%), Positives = 181/254 (71%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP++VQ + L+ LPE FDAR WP CPT++EIRDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 KLPVMVQYTGDLK-LPEEFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
K V +SS+DL++CC CG GC GG+ AW +W G+VSGG Y S GCRPY I P
Sbjct: 125 NAKVSVEISSEDLLTCCMSCGMGCNGGYPSAAWDFWTKEGLVSGGLYDSHIGCRPYTIAP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS SC +TP+CI KC+ GY SY++D +FG+ +Y++ ++EE I EIF+
Sbjct: 185 CEHHVNGSRPSCTGEGGDTPQCITKCEAGYTPSYKEDKHFGKTSYTVLSDEEQIQSEIFK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ +Y D +LYK+G+Y+HV+G +G HAI+I+GWG E V YWL AN
Sbjct: 245 NGPVEGAFIVYEDFVLYKSGVYQHVSGSAVGGHAIKILGWGVE-------DGVPYWLCAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+
Sbjct: 298 SWNTDWGDNGFFKF 311
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 120/164 (73%), Gaps = 8/164 (4%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGCRPY I PCE ++NGSR SC +TP+CI KC+ GY SY++D +FG+ +Y++ +
Sbjct: 174 HIGCRPYTIAPCEHHVNGSRPSCTGEGGDTPQCITKCEAGYTPSYKEDKHFGKTSYTVLS 233
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+EE I EIF++GPVEG+ +Y D +LYK+G+Y+HV+G +G HAI+I+GWG E
Sbjct: 234 DEEQIQSEIFKNGPVEGAFIVYEDFVLYKSGVYQHVSGSAVGGHAIKILGWGVE------ 287
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL ANS+NT+WG+NG F+ +RG + CGIE+++ AG+PK
Sbjct: 288 -DGVPYWLCANSWNTDWGDNGFFKFLRGSDHCGIESEVVAGIPK 330
>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 292 bits (747), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 130/254 (51%), Positives = 185/254 (72%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+L +VQ ++ +E LP+ FD R+ WP CPT++E+RDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 KLSTMVQYTEDME-LPKNFDPRLQWPNCPTLKEVRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
K V +SS+DL+SCC+ CG GC GG+ A +W G+VSGG Y S GCRPY I P
Sbjct: 125 NAKVSVEISSEDLLSCCESCGMGCNGGYPSAACDFWTKEGLVSGGLYDSHIGCRPYSIPP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NG+ C+ E +TP+C +C+PGY Y+ D +FG+ +YS+P++E+ IM+E+++
Sbjct: 185 CEHHVNGTRPPCKGEEGDTPQCTNQCEPGYTPGYKQDKHFGKRSYSVPSDEKEIMKELYK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ T+Y D +LYK+G+Y+HV+G +G HAI+++GWG+E + YWL AN
Sbjct: 245 NGPVEGAFTVYEDFLLYKSGVYRHVSGSAVGGHAIKVLGWGEE-------GGIPYWLAAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WGENG F+I
Sbjct: 298 SWNTDWGENGFFKI 311
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 82/163 (50%), Positives = 125/163 (76%), Gaps = 8/163 (4%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGCRPY IP CE ++NG+R C+ E +TP+C +C+PGY Y+ D +FG+ +YS+P+
Sbjct: 174 HIGCRPYSIPPCEHHVNGTRPPCKGEEGDTPQCTNQCEPGYTPGYKQDKHFGKRSYSVPS 233
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM+E++++GPVEG+ T+Y D +LYK+G+Y+HV+G +G HAI+++GWG+E
Sbjct: 234 DEKEIMKELYKNGPVEGAFTVYEDFLLYKSGVYRHVSGSAVGGHAIKVLGWGEE------ 287
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YWL ANS+NT+WGENG F+IVRG++ CGIE+++ AG+P
Sbjct: 288 -GGIPYWLAANSWNTDWGENGFFKIVRGEDHCGIESEMVAGIP 329
>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
Length = 332
Score = 291 bits (746), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 149/284 (52%), Positives = 192/284 (67%), Gaps = 13/284 (4%)
Query: 54 AEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPT 112
A +N L++ MGVHPDSK +P P ++P+ FD+R WP CPT
Sbjct: 39 AGRNFNRHLSIRYFRRLMGVHPDSKY---HMPGYEAHKIPENFDMPKEFDSRAAWPMCPT 95
Query: 113 IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHG 172
I EIRDQGSCGS WA GAVE MSDR CI S+GK + SS++LVSCC CG GC GGF G
Sbjct: 96 IGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSSENLVSCCHLCGFGCNGGFPG 155
Query: 173 KAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
A+KYWV +GIVSGG++ S QGC+PYEI PCE ++ G C + TP+C+++C+ GY
Sbjct: 156 AAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVPGPRPKCSEGG-GTPKCVKRCENGY 214
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 291
V YE DL+ G AYS+ +E+ I EI ++GPVEG+ T+Y D + YK+G+Y+H G PL
Sbjct: 215 TVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPL 274
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HAIRI+GWG+E GT YWL ANS+NT+WG+NGLF+I
Sbjct: 275 GGHAIRILGWGEE---NGT----PYWLCANSWNTDWGDNGLFKI 311
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/192 (47%), Positives = 127/192 (66%), Gaps = 22/192 (11%)
Query: 312 SVVKYWL-----VANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPEC 365
+ KYW+ SFN+ GC+PYEI PCE ++ G R C TP+C
Sbjct: 156 AAFKYWVHSGIVSGGSFNST--------QGCQPYEIAPCEHHVPGPRPKCSEGG-GTPKC 206
Query: 366 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 425
+++C+ GY V YE DL+ G AYS+ +E+ I EI ++GPVEG+ T+Y D + YK+G+Y
Sbjct: 207 VKRCENGYTVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVY 266
Query: 426 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECG 485
+H G PLG HAIRI+GWG+E GT YWL ANS+NT+WG+NGLF+I+RG + CG
Sbjct: 267 QHRHGLPLGGHAIRILGWGEE---NGT----PYWLCANSWNTDWGDNGLFKILRGSDHCG 319
Query: 486 IEADITAGLPKI 497
IE++I+AGLPK+
Sbjct: 320 IESEISAGLPKL 331
>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
Length = 340
Score = 291 bits (746), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 149/296 (50%), Positives = 183/296 (61%), Gaps = 10/296 (3%)
Query: 44 ILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSK---LPQNRLPLLVQLSDPLEELPEG 100
++ K + +N + +T + MGVHPD+ L R L + ++E+PE
Sbjct: 31 LVRSKAKTWTVGRNFDASVTEGHIRRLMGVHPDAHKFALADKREVLGDLYMNSVDEIPEE 90
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FD+R WP CPTI EIRDQGSCGS WA GAVEAMSDRVCI S GK + S+DDLVSCC
Sbjct: 91 FDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCCH 150
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPN 219
CG GC GGF G AW YW GIVSGG Y S QGCRPYEI PCE ++NG+ C
Sbjct: 151 TCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHGGA- 209
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
TP+C CQ Y V Y D +FG +YS+ N I EI +GPVEG+ T+Y D+ILYK
Sbjct: 210 TPKCSHVCQSSYTVDYAKDKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYK 269
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y+H G LG HAIRI+GWG + YWL+ NS+NT+WG+ G FRI
Sbjct: 270 DGVYQHEHGKELGGHAIRILGWGVW-----GDEKIPYWLIGNSWNTDWGDQGFFRI 320
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 84/163 (51%), Positives = 109/163 (66%), Gaps = 7/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEI PCE ++NG+R C A+ TP+C CQ Y V Y D +FG +YS+ N
Sbjct: 184 GCRPYEISPCEHHVNGTRPPC-AHGGATPKCSHVCQSSYTVDYAKDKHFGSKSYSVRRNV 242
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVEG+ T+Y D+ILYK G+Y+H G LG HAIRI+GWG
Sbjct: 243 RDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVW-----GDE 297
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL+ NS+NT+WG+ G FRI+RGQ+ CGIE+ I+AGLPK+
Sbjct: 298 KIPYWLIGNSWNTDWGDQGFFRILRGQDHCGIESSISAGLPKL 340
>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 291 bits (745), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 133/254 (52%), Positives = 179/254 (70%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP +VQ + +E LP+ FD R WP CPT++EIRDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 KLPTMVQYAGDVE-LPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
K V +SS+DL+SCC CG GC GG+ AW +W T G+V+GG Y S GCRPY I P
Sbjct: 125 NAKVSVEISSEDLLSCCDSCGMGCNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NG+ C E +TP+C +C+ GY Y+ D +FG+ +YSLP+ E+ IM E+ +
Sbjct: 185 CEHHVNGTRPPCTGEEGDTPQCSNQCETGYTPGYKQDKHFGKNSYSLPSEEQQIMAELLK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ T+Y D +LYK+G+Y+HV+G +G HAI+++GWG+E YWL AN
Sbjct: 245 NGPVEGAFTVYEDFLLYKSGVYQHVSGSAVGGHAIKVLGWGEE-------GGTPYWLAAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WGENG F+I
Sbjct: 298 SWNTDWGENGFFKI 311
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 80/163 (49%), Positives = 119/163 (73%), Gaps = 8/163 (4%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NG+R C E +TP+C +C+ GY Y+ D +FG+ +YSLP+
Sbjct: 174 HVGCRPYSIPPCEHHVNGTRPPCTGEEGDTPQCSNQCETGYTPGYKQDKHFGKNSYSLPS 233
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
E+ IM E+ ++GPVEG+ T+Y D +LYK+G+Y+HV+G +G HAI+++GWG+E
Sbjct: 234 EEQQIMAELLKNGPVEGAFTVYEDFLLYKSGVYQHVSGSAVGGHAIKVLGWGEE------ 287
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWL ANS+NT+WGENG F+I+RG++ CGIE+++ AG+P
Sbjct: 288 -GGTPYWLAANSWNTDWGENGFFKILRGKDHCGIESEMVAGVP 329
>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
Length = 340
Score = 291 bits (745), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 137/255 (53%), Positives = 187/255 (73%), Gaps = 10/255 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
RLP ++ ++ + +LPE FDAR WP CPTI+EIRDQGSCGS WA GAV AMSDRVCI +
Sbjct: 67 RLPQRIKFAE-IVDLPESFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVGAMSDRVCIHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G +V +S++DL+SCC +CG+GC GG+ AWKYW G+VSGG Y S GCRPY IP
Sbjct: 126 NGHVNVEVSAEDLLSCCGLECGDGCNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NG+ C +TP+C + C+PGY SY++D +FG +YS+ +NE+ IM EI+
Sbjct: 186 PCEHHVNGTRPQCTGEGGDTPKCSKTCEPGYSPSYKEDKHFGYDSYSVSSNEKEIMAEIY 245
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ T+++D ++YKTG+YKH+AG LG HAIRI+GWG+E + V YWLV
Sbjct: 246 KNGPVEGAFTVFSDFLMYKTGVYKHLAGEMLGGHAIRILGWGKE-------NGVPYWLVG 298
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG++G F+I
Sbjct: 299 NSWNVDWGDSGFFKI 313
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 133/189 (70%), Gaps = 15/189 (7%)
Query: 311 SSVVKYWLVANSFNTNWGENGLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIR 367
S+ KYW + GL+ +GCRPY IP CE ++NG+R C +TP+C +
Sbjct: 156 SAAWKYWTKKGLVS-----GGLYDSHVGCRPYSIPPCEHHVNGTRPQCTGEGGDTPKCSK 210
Query: 368 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 427
C+PGY SY++D +FG +YS+ +NE+ IM EI+++GPVEG+ T+++D ++YKTG+YKH
Sbjct: 211 TCEPGYSPSYKEDKHFGYDSYSVSSNEKEIMAEIYKNGPVEGAFTVFSDFLMYKTGVYKH 270
Query: 428 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIE 487
+AG LG HAIRI+GWG+E + V YWLV NS+N +WG++G F+IVRG++ CGIE
Sbjct: 271 LAGEMLGGHAIRILGWGKE-------NGVPYWLVGNSWNVDWGDSGFFKIVRGEDHCGIE 323
Query: 488 ADITAGLPK 496
++I AG+P+
Sbjct: 324 SEIVAGIPR 332
>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
Length = 331
Score = 291 bits (744), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 148/284 (52%), Positives = 190/284 (66%), Gaps = 13/284 (4%)
Query: 54 AEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPT 112
A +N L++ MGVHPDSK +P P E+P+ FD+R WP CPT
Sbjct: 38 AGRNFNKHLSIKYFRRLMGVHPDSKF---HMPKYEAHQIPENFEMPKEFDSRAAWPMCPT 94
Query: 113 IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHG 172
I EIRDQGSCGS WA GAVE MSDR CI S+GK + S+++LVSCC CG GC GGF G
Sbjct: 95 IGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLVSCCHLCGFGCNGGFPG 154
Query: 173 KAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
A+KYWV +GIVSGG++ S QGC+PYEI PCE +++G C + TP+C + C+ GY
Sbjct: 155 AAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVSGPRPKCSEG-GGTPKCAKTCEKGY 213
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 291
V YE DL+ G AYS+ +E+ I EI +GPVEG+ T+Y D + YK+G+Y+H G PL
Sbjct: 214 IVDYESDLHHGGKAYSIMKDEDQIKYEIMNNGPVEGAFTVYVDFLHYKSGVYQHRHGLPL 273
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HAIR++GWG+E GT YWL ANS+NT+WG+NGLF+I
Sbjct: 274 GGHAIRVLGWGEE---NGT----PYWLCANSWNTDWGDNGLFKI 310
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 91/192 (47%), Positives = 127/192 (66%), Gaps = 22/192 (11%)
Query: 312 SVVKYWLVAN-----SFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPEC 365
+ KYW+ + SFN+ GC+PYEI PCE +++G R C + TP+C
Sbjct: 155 AAFKYWVHSGIVSGGSFNST--------QGCQPYEIAPCEHHVSGPRPKC-SEGGGTPKC 205
Query: 366 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 425
+ C+ GY V YE DL+ G AYS+ +E+ I EI +GPVEG+ T+Y D + YK+G+Y
Sbjct: 206 AKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMNNGPVEGAFTVYVDFLHYKSGVY 265
Query: 426 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECG 485
+H G PLG HAIR++GWG+E GT YWL ANS+NT+WG+NGLF+I+RG + CG
Sbjct: 266 QHRHGLPLGGHAIRVLGWGEE---NGT----PYWLCANSWNTDWGDNGLFKILRGSDHCG 318
Query: 486 IEADITAGLPKI 497
IE++I+AGLPK+
Sbjct: 319 IESEISAGLPKV 330
>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
Length = 340
Score = 290 bits (743), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 138/255 (54%), Positives = 184/255 (72%), Gaps = 10/255 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
RLP VQ ++ L+ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEAMSDR+CI +
Sbjct: 67 RLPQRVQFAEDLD-LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAMSDRLCIHT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G +V +S++DL+SCC CG GC GG+ +AWKYW G+VSGG Y S GCRPY IP
Sbjct: 126 NGHVNVEVSAEDLLSCCGPLCGEGCNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NG+ C +TP+C + C+PGY SY++D +G +YS+P+ E+ IM EI+
Sbjct: 186 PCEHHVNGTRPKCTGEGGDTPKCSKTCEPGYSPSYKEDKYYGYSSYSVPSTEKEIMAEIY 245
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVE + ++++D + YK+G+YKHVAG LG HAIRI+GWG+E + V YWLV
Sbjct: 246 KNGPVEAAFSVFSDFLTYKSGVYKHVAGEVLGGHAIRILGWGKE-------NGVPYWLVG 298
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG+NG F+I
Sbjct: 299 NSWNVDWGDNGFFKI 313
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 80/164 (48%), Positives = 121/164 (73%), Gaps = 8/164 (4%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NG+R C +TP+C + C+PGY SY++D +G +YS+P+
Sbjct: 176 HVGCRPYSIPPCEHHVNGTRPKCTGEGGDTPKCSKTCEPGYSPSYKEDKYYGYSSYSVPS 235
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
E+ IM EI+++GPVE + ++++D + YK+G+YKHVAG LG HAIRI+GWG+E
Sbjct: 236 TEKEIMAEIYKNGPVEAAFSVFSDFLTYKSGVYKHVAGEVLGGHAIRILGWGKE------ 289
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWLV NS+N +WG+NG F+I+RG++ CGIE+++ AG+P+
Sbjct: 290 -NGVPYWLVGNSWNVDWGDNGFFKILRGEDHCGIESEVVAGIPR 332
>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
Length = 331
Score = 290 bits (742), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 148/283 (52%), Positives = 192/283 (67%), Gaps = 11/283 (3%)
Query: 54 AEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTI 113
A +N L++ MGVHPDSK + + Q+ + E LP+ FD+R WP CPTI
Sbjct: 38 AGRNFNKHLSIRYFRRLMGVHPDSKYHMPKYEVH-QIPENFE-LPKEFDSRAAWPMCPTI 95
Query: 114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGK 173
EIRDQGSCGS WA GAVE MSDR CI S+GK + S+++LVSCC CG GC GGF G
Sbjct: 96 GEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLVSCCHLCGFGCNGGFPGA 155
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
A+KYWV +GIVSGG++ S QGC+PYEI PCE ++ G C + TP+C + C+ GY
Sbjct: 156 AFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVPGPRPKCSEGG-GTPKCAKTCEKGYI 214
Query: 233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 292
V YE DL+ G AYS+ +E+ I EI ++GPVEG+ T+Y D + YK+G+Y+H G PLG
Sbjct: 215 VDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLG 274
Query: 293 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HAIR++GWG+E GT YWL ANS+NT+WG+NGLF+I
Sbjct: 275 GHAIRVLGWGEE---NGTP----YWLCANSWNTDWGDNGLFKI 310
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 91/192 (47%), Positives = 126/192 (65%), Gaps = 22/192 (11%)
Query: 312 SVVKYWLVAN-----SFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPEC 365
+ KYW+ + SFN+ GC+PYEI PCE ++ G R C TP+C
Sbjct: 155 AAFKYWVHSGIVSGGSFNST--------QGCQPYEIAPCEHHVPGPRPKCSEGG-GTPKC 205
Query: 366 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 425
+ C+ GY V YE DL+ G AYS+ +E+ I EI ++GPVEG+ T+Y D + YK+G+Y
Sbjct: 206 AKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVY 265
Query: 426 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECG 485
+H G PLG HAIR++GWG+E GT YWL ANS+NT+WG+NGLF+I+RG + CG
Sbjct: 266 QHRHGLPLGGHAIRVLGWGEE---NGTP----YWLCANSWNTDWGDNGLFKILRGSDHCG 318
Query: 486 IEADITAGLPKI 497
IE++I+AGLPK+
Sbjct: 319 IESEISAGLPKL 330
>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
Length = 340
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 136/255 (53%), Positives = 186/255 (72%), Gaps = 10/255 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V L++ ++ LPE FDAR NWP CPTI+EIRDQGSCGS WA GAVEA+SDRVCI +
Sbjct: 67 KLPQRVHLAEEMD-LPENFDARENWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRVCIHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G +V +S++DL++CC +CG+GC GGF AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NGNVNVEVSAEDLLTCCHMECGDGCNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C+ TP+C + C+PGY SY++D ++G +Y +P++E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCKGEGGETPKCSKTCEPGYSPSYKEDKHYGYSSYGVPSSEQEIMAEIY 245
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ ++Y D ++YK+G+Y+HV G +G HAIRI+GWG E GT YWL A
Sbjct: 246 KNGPVEGAFSVYTDFLVYKSGVYQHVTGEEVGGHAIRILGWGVE---NGT----PYWLAA 298
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 299 NSWNTDWGDNGFFKI 313
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 83/164 (50%), Positives = 122/164 (74%), Gaps = 8/164 (4%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C+ TP+C + C+PGY SY++D ++G +Y +P+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCKGEGGETPKCSKTCEPGYSPSYKEDKHYGYSSYGVPS 235
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y D ++YK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 236 SEQEIMAEIYKNGPVEGAFSVYTDFLVYKSGVYQHVTGEEVGGHAIRILGWGVE---NGT 292
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL ANS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P+
Sbjct: 293 ----PYWLAANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPR 332
>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 337
Score = 288 bits (737), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 145/304 (47%), Positives = 195/304 (64%), Gaps = 15/304 (4%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSD 92
LS AF R+ +S K + A +N + + + MG D + ++P + +D
Sbjct: 25 LSDAFIRLINS----KQNTWRAGRNFPTTTPFAHINKLMGALQDDNVA--KMPKVEHDAD 78
Query: 93 PLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
+ LPE FD R WP CPT+ EIRDQGSCGS WA GAVEAM+DR C S G +H SS
Sbjct: 79 LIASLPENFDPRDKWPDCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSS 138
Query: 153 DDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHS 211
+DL+SCC CG GC GG AW+YW GIVSGG Y S QGCRPYEI PCE ++ G+
Sbjct: 139 EDLLSCCPICGLGCNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPYEIPPCEHHVPGNRM 198
Query: 212 SCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
C + TP+C + C+ GY+V Y+ D +G+ YS+ A E+ I E++++GPVEG+ T+
Sbjct: 199 PCS-GDTKTPKCQKNCENGYNVMYKKDKRYGKHVYSVSAGEDHIRAELYKNGPVEGAFTV 257
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
YAD++ YK+G+YKH+ G LG HAI+I+GWG E + KYWLVANS+NT+WG+NG
Sbjct: 258 YADLLAYKSGVYKHIQGDALGGHAIKILGWGVE-------NDNKYWLVANSWNTDWGDNG 310
Query: 332 LFRI 335
F+I
Sbjct: 311 FFKI 314
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 82/161 (50%), Positives = 116/161 (72%), Gaps = 9/161 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEIP CE ++ G+R C + + TP+C + C+ GY+V Y+ D +G+ YS+ A E
Sbjct: 180 GCRPYEIPPCEHHVPGNRMPC-SGDTKTPKCQKNCENGYNVMYKKDKRYGKHVYSVSAGE 238
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E++++GPVEG+ T+YAD++ YK+G+YKH+ G LG HAI+I+GWG E +
Sbjct: 239 DHIRAELYKNGPVEGAFTVYADLLAYKSGVYKHIQGDALGGHAIKILGWGVE-------N 291
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
KYWLVANS+NT+WG+NG F+I+RG+N CGIE I AG P
Sbjct: 292 DNKYWLVANSWNTDWGDNGFFKILRGENHCGIEGSIIAGEP 332
>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
Length = 337
Score = 287 bits (735), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 136/285 (47%), Positives = 188/285 (65%), Gaps = 11/285 (3%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N + + L+ MGV D LP+ D + LPE FD R WP CP
Sbjct: 40 WKAGRNFPRDTSFAHLKKIMGVIEDEHFAT--LPIKTHKIDLIAGLPENFDPRDKWPDCP 97
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
T+ E+RDQGSCGS WA GAVEAM+DRVC S G +H S++DL+SCC CG GC GG
Sbjct: 98 TLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMP 157
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW+YW G+VSGG+Y S QGCRPYEI PCE ++ G+ C + TP+C +KC+ G
Sbjct: 158 RLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCS-GDTKTPKCTKKCESG 216
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
YDV+Y+ D +G+ Y++ +E+ I E+F++GPVEG+ T+Y+D++ YK+G+YKH G
Sbjct: 217 YDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDA 276
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HA++I+GWG E + KYWL+ANS+N++WG+NG F+I
Sbjct: 277 LGGHAVKILGWGVE-------NDNKYWLIANSWNSDWGDNGFFKI 314
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 77/161 (47%), Positives = 117/161 (72%), Gaps = 9/161 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEIP CE ++ G+R C + + TP+C +KC+ GYDV+Y+ D +G+ Y++ +E
Sbjct: 180 GCRPYEIPPCEHHVPGNRMPC-SGDTKTPKCTKKCESGYDVNYKQDKQYGKHVYTVSGDE 238
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E+F++GPVEG+ T+Y+D++ YK+G+YKH G LG HA++I+GWG E +
Sbjct: 239 DHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDALGGHAVKILGWGVE-------N 291
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
KYWL+ANS+N++WG+NG F+I+RG++ CGIE+ I G P
Sbjct: 292 DNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVTGEP 332
>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
Length = 252
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 142/259 (54%), Positives = 184/259 (71%), Gaps = 11/259 (4%)
Query: 71 MGVH-PDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALG 129
MGVH +++ P +L L+ +D +LPE FDAR +WP CPTI+E+RDQGSCGS WA G
Sbjct: 3 MGVHEKNAEYP--KLEQLLTYTDAPIDLPETFDAREHWPNCPTIREVRDQGSCGSCWAFG 60
Query: 130 AVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY 189
AVEAMSDRVCI S+G ++ S+++LVSCC CG GC GGF G AW YW T GIVSGG Y
Sbjct: 61 AVEAMSDRVCIHSKGTKNFHFSAENLVSCCWTCGFGCNGGFPGAAWHYWKTKGIVSGGPY 120
Query: 190 ASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 248
S GC PYEI PCE ++NG+ C++ TP+C++KC+ GY V YE DL+ G+ AYSL
Sbjct: 121 GSNMGCIPYEIAPCEHHVNGTRGPCKEGG-KTPKCVKKCEDGYKVPYEQDLHRGKSAYSL 179
Query: 249 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 308
+ + I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG LG HAIRI+GWG +
Sbjct: 180 SNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ---- 235
Query: 309 GTSSVVKYWLVANSFNTNW 327
+ + YWLVANS+NT+W
Sbjct: 236 --NGEIPYWLVANSWNTDW 252
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 70/136 (51%), Positives = 96/136 (70%), Gaps = 8/136 (5%)
Query: 335 IGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
+GC PYEI PCE ++NG+R C+ TP+C++KC+ GY V YE DL+ G+ AYSL +
Sbjct: 124 MGCIPYEIAPCEHHVNGTRGPCKEGG-KTPKCVKKCEDGYKVPYEQDLHRGKSAYSLSND 182
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG LG HAIRI+GWG + +
Sbjct: 183 VDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ------N 236
Query: 454 SVVKYWLVANSFNTNW 469
+ YWLVANS+NT+W
Sbjct: 237 GEIPYWLVANSWNTDW 252
>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
Length = 330
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 130/254 (51%), Positives = 180/254 (70%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP++VQ + ++ LP+ FDAR WP CPT++EIRDQGSCGS WA GA EA+SDR+CI +
Sbjct: 66 KLPIMVQSAGGMK-LPKQFDAREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRICIHT 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
+GK V +SS DL++CC CG GC GG+ AW++W G+V+GG Y S GCRPY I P
Sbjct: 125 KGKVSVEISSQDLLTCCDSCGMGCNGGYPANAWEFWTEQGLVTGGLYNSHIGCRPYTIEP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS C +TPEC+ +C+ GY SY+ D ++G+ +Y +P+ EE I EI++
Sbjct: 185 CEHHVNGSRPPCTGEGGDTPECVTQCEAGYTPSYQKDKHYGKTSYGVPSEEEQIQSEIYK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ +Y D YK+G+Y+HV G LG HAI++IGWG+E + V YWL AN
Sbjct: 245 NGPVEGAFIVYEDFPSYKSGVYQHVTGSALGGHAIKMIGWGEE-------NGVPYWLCAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+I
Sbjct: 298 SWNTDWGDNGFFKI 311
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 87/178 (48%), Positives = 121/178 (67%), Gaps = 15/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E GL IGCRPY I PCE ++NGSR C +TPEC+ +C+ GY SY+
Sbjct: 160 WTEQGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPECVTQCEAGYTPSYQ 219
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D ++G+ +Y +P+ EE I EI+++GPVEG+ +Y D YK+G+Y+HV G LG HAI
Sbjct: 220 KDKHYGKTSYGVPSEEEQIQSEIYKNGPVEGAFIVYEDFPSYKSGVYQHVTGSALGGHAI 279
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
++IGWG+E + V YWL ANS+NT+WG+NG F+I+RG N CGIE+++ AG+PK
Sbjct: 280 KMIGWGEE-------NGVPYWLCANSWNTDWGDNGFFKILRGSNHCGIESEVVAGIPK 330
>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
Length = 332
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 130/254 (51%), Positives = 182/254 (71%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP+ +Q + ++ LP FDAR+ WP CPT++E+RDQGSCGS WA GA EA+SDR+CI S
Sbjct: 66 KLPVKLQFTADVQ-LPVDFDARVQWPQCPTLKEVRDQGSCGSCWAFGAAEAISDRLCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
G +V +S++DL+SCC CG GC GG+ AW++W T G+VSGG Y S GCRPY I P
Sbjct: 125 NGLMNVEISAEDLLSCCDSCGMGCNGGYPSAAWEFWTTDGLVSGGLYDSHIGCRPYSIAP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS C +TP+C +KC+ GY Y D ++G+++YS+ +E+ I EI++
Sbjct: 185 CEHHVNGSRPPCTGEGGDTPQCTKKCEAGYTPGYTQDKHYGKLSYSVDDSEKEIQLEIYK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ T+Y D +LYKTG+Y+HV G +G HAI+++GWG+E GT YWL AN
Sbjct: 245 NGPVEGAFTVYEDFLLYKTGVYQHVTGSAVGGHAIKVLGWGEE---NGT----PYWLCAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+I
Sbjct: 298 SWNTDWGDNGFFKI 311
Score = 186 bits (471), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 83/164 (50%), Positives = 119/164 (72%), Gaps = 8/164 (4%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGCRPY I PCE ++NGSR C +TP+C +KC+ GY Y D ++G+++YS+
Sbjct: 174 HIGCRPYSIAPCEHHVNGSRPPCTGEGGDTPQCTKKCEAGYTPGYTQDKHYGKLSYSVDD 233
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ I EI+++GPVEG+ T+Y D +LYKTG+Y+HV G +G HAI+++GWG+E GT
Sbjct: 234 SEKEIQLEIYKNGPVEGAFTVYEDFLLYKTGVYQHVTGSAVGGHAIKVLGWGEE---NGT 290
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL ANS+NT+WG+NG F+I+RG + CGIE++I AG+PK
Sbjct: 291 ----PYWLCANSWNTDWGDNGFFKILRGSDHCGIESEIVAGIPK 330
>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
Length = 338
Score = 285 bits (730), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 135/294 (45%), Positives = 194/294 (65%), Gaps = 11/294 (3%)
Query: 43 SILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFD 102
+IL K + A +N + + ++M MG D + +LP + ++ + LPE FD
Sbjct: 32 NILNSKPKTWTAGRNFPANTPFAHIKMLMGALKDDNIL--KLPKMTHDAELIASLPENFD 89
Query: 103 ARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDC 162
R WP CPT+ EIRDQGSCGS WA GAVEAM+DRVC S G +H S++DL+SCC C
Sbjct: 90 PRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCTYSDGTKHFHFSAEDLLSCCPIC 149
Query: 163 GNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTP 221
G GC GG AW+YW GIVSGG+Y S QGC PYE+ PCE ++ G+ C + + TP
Sbjct: 150 GLGCNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPYEVPPCEHHVPGNRLPC-NGDTKTP 208
Query: 222 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 281
+C + C+ GY+V ++ D ++G+ YS+ NE+ I E+F++GPVEG+ T+Y+D++ YK+G
Sbjct: 209 KCQKTCEAGYNVPFKKDKHYGKHVYSVSGNEDNIKAELFKNGPVEGAFTVYSDLLSYKSG 268
Query: 282 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+Y+H G LG HA++I+GWG E + KYWL+ANS+N++WG+NG F+I
Sbjct: 269 VYQHTDGSALGGHAVKILGWGVE-------NGSKYWLIANSWNSDWGDNGFFKI 315
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 73/163 (44%), Positives = 115/163 (70%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PYE+P CE ++ G+R C + TP+C + C+ GY+V ++ D ++G+ YS+ NE
Sbjct: 181 GCIPYEVPPCEHHVPGNRLPCNGDT-KTPKCQKTCEAGYNVPFKKDKHYGKHVYSVSGNE 239
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E+F++GPVEG+ T+Y+D++ YK+G+Y+H G LG HA++I+GWG E +
Sbjct: 240 DNIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTDGSALGGHAVKILGWGVE-------N 292
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
KYWL+ANS+N++WG+NG F+I+RG++ CGIE+ I G P +
Sbjct: 293 GSKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVTGEPLL 335
>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
Length = 330
Score = 285 bits (729), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 133/254 (52%), Positives = 179/254 (70%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP++VQ + L+ LP FDAR WP CPT++EIRDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 KLPIMVQYAGGLK-LPAEFDAREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
GK V +SS+DL++CC CG GC GG+ AW +W G+VSGG Y S GCRPY I P
Sbjct: 125 GGKISVEISSEDLLTCCDSCGMGCNGGYPSSAWDFWTKEGLVSGGLYNSHIGCRPYTISP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS C +TPECI +C+ GY SY+ D ++G+ +YS+ + E I EI +
Sbjct: 185 CEHHVNGSRPPCTGEGGDTPECISRCEAGYSPSYKQDKHYGKSSYSVEGSVEQIQAEISK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ T+Y D ++YK+G+Y+HV+G LG HAI+++GWG+E + YWL AN
Sbjct: 245 NGPVEGAFTVYEDFVMYKSGVYQHVSGSVLGGHAIKVLGWGEE-------DGIPYWLCAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+I
Sbjct: 298 SWNTDWGDNGFFKI 311
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 83/164 (50%), Positives = 118/164 (71%), Gaps = 8/164 (4%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGCRPY I PCE ++NGSR C +TPECI +C+ GY SY+ D ++G+ +YS+
Sbjct: 174 HIGCRPYTISPCEHHVNGSRPPCTGEGGDTPECISRCEAGYSPSYKQDKHYGKSSYSVEG 233
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ E I EI ++GPVEG+ T+Y D ++YK+G+Y+HV+G LG HAI+++GWG+E
Sbjct: 234 SVEQIQAEISKNGPVEGAFTVYEDFVMYKSGVYQHVSGSVLGGHAIKVLGWGEE------ 287
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ YWL ANS+NT+WG+NG F+I+RG N CGIE++I AG+PK
Sbjct: 288 -DGIPYWLCANSWNTDWGDNGFFKILRGSNHCGIESEIVAGIPK 330
>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
Length = 334
Score = 285 bits (728), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 140/297 (47%), Positives = 194/297 (65%), Gaps = 17/297 (5%)
Query: 43 SILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLS---DPLEELPE 99
+++ K + A +N + ++ MG L +R LV L D + LPE
Sbjct: 30 NLINSKQDTWKAGRNFPVDTPVKHIQKLMGT-----LKDDRFTTLVTLQHEVDLIASLPE 84
Query: 100 GFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC 159
FD R WP CPT+ E+RDQGSCGS WA GAVEAM+DRVC S G +H S++DL+SCC
Sbjct: 85 NFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 144
Query: 160 KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEP 218
CG GC GG AW+YW G+VSGG+Y S QGCRPYEI PCE ++ G+ C +
Sbjct: 145 PICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPYEIPPCEHHVPGNRLPCS-GDT 203
Query: 219 NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILY 278
TP+CI+KC+ Y+V+Y+ D ++G+ YS+ E+ I E++++GPVEG+ T+YAD++ Y
Sbjct: 204 KTPKCIKKCEDNYNVAYKQDKHYGKHIYSVRGGEDHIKAELYKNGPVEGAFTVYADLLSY 263
Query: 279 KTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
K+G+YKHVAG LG HAI+I+GWG E + KYWL+ANS+N++WG+NG F+I
Sbjct: 264 KSGVYKHVAGDALGGHAIKIMGWGVE-------NGNKYWLIANSWNSDWGDNGFFKI 313
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 81/163 (49%), Positives = 121/163 (74%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEIP CE ++ G+R C + + TP+CI+KC+ Y+V+Y+ D ++G+ YS+ E
Sbjct: 179 GCRPYEIPPCEHHVPGNRLPC-SGDTKTPKCIKKCEDNYNVAYKQDKHYGKHIYSVRGGE 237
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E++++GPVEG+ T+YAD++ YK+G+YKHVAG LG HAI+I+GWG E +
Sbjct: 238 DHIKAELYKNGPVEGAFTVYADLLSYKSGVYKHVAGDALGGHAIKIMGWGVE-------N 290
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
KYWL+ANS+N++WG+NG F+I+RG++ CGIE+ I AG P +
Sbjct: 291 GNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEPLL 333
>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
Length = 333
Score = 284 bits (727), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 133/242 (54%), Positives = 178/242 (73%), Gaps = 9/242 (3%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
ELP+ FD+R WP CPTI+EIRDQGSCGS WA GAVEA+SDRVC+ + GK +V +S++DL
Sbjct: 79 ELPDSFDSRAAWPNCPTIREIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDL 138
Query: 156 VSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSC 213
+SCC D CG GC GG+ AW++W TG+VSGG Y S GCRPY IP CE ++NGS +C
Sbjct: 139 LSCCGDECGMGCNGGYPSGAWQFWTETGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPAC 198
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
+ E +TP+C+++C+ GY +Y D +FG +Y +P +E+ IM EI+++GPVEG+ +YA
Sbjct: 199 KGEEGDTPKCVKQCEEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYA 258
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D LYK+G+Y+H G LG HAI+I+GWG E GT YWL ANS+NT+WG+NG F
Sbjct: 259 DFPLYKSGVYQHETGEELGGHAIKILGWGVE---NGT----PYWLCANSWNTDWGDNGFF 311
Query: 334 RI 335
+I
Sbjct: 312 KI 313
Score = 188 bits (477), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 88/178 (49%), Positives = 125/178 (70%), Gaps = 15/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E GL +GCRPY IP CE ++NGSR +C+ E +TP+C+++C+ GY +Y
Sbjct: 162 WTETGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPACKGEEGDTPKCVKQCEEGYSPAYG 221
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D +FG +Y +P +E+ IM EI+++GPVEG+ +YAD LYK+G+Y+H G LG HAI
Sbjct: 222 TDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYADFPLYKSGVYQHETGEELGGHAI 281
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+I+GWG E GT YWL ANS+NT+WG+NG F+I+RG++ CGIE++I AG+PK
Sbjct: 282 KILGWGVE---NGT----PYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAGVPK 332
>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
Length = 330
Score = 284 bits (727), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 134/255 (52%), Positives = 181/255 (70%), Gaps = 10/255 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP +D +E LP+ FD+R WP CPTI EIRDQGSCGS WA GAVEA+SDRVC+ +
Sbjct: 57 KLPERFAFADDVE-LPDSFDSRKQWPSCPTINEIRDQGSCGSCWAFGAVEAISDRVCVHT 115
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
GK +V +S++DL+SCC +CG GC GG+ AWKYW G+VSGG Y S GCRPY IP
Sbjct: 116 NGKVNVEISAEDLLSCCGFECGMGCNGGYPSGAWKYWTEKGLVSGGLYDSHVGCRPYSIP 175
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE + NG+ C TPEC++KC+ GY +Y+ D ++G +Y +P +E+ IM EI+
Sbjct: 176 PCEHHTNGTRPPCSGEGGETPECVKKCEDGYTPAYKQDKHYGVTSYGIPRSEKEIMAEIY 235
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ +Y+D ++YK+G+Y+HV+G +G HAIRI+GWG + GT YWL A
Sbjct: 236 KNGPVEGAFVVYSDFLMYKSGVYQHVSGEEVGGHAIRILGWG---VDNGTP----YWLAA 288
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WGE+G FRI
Sbjct: 289 NSWNTDWGEDGFFRI 303
Score = 188 bits (478), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 87/178 (48%), Positives = 124/178 (69%), Gaps = 15/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E GL +GCRPY IP CE + NG+R C TPEC++KC+ GY +Y+
Sbjct: 152 WTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNGTRPPCSGEGGETPECVKKCEDGYTPAYK 211
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D ++G +Y +P +E+ IM EI+++GPVEG+ +Y+D ++YK+G+Y+HV+G +G HAI
Sbjct: 212 QDKHYGVTSYGIPRSEKEIMAEIYKNGPVEGAFVVYSDFLMYKSGVYQHVSGEEVGGHAI 271
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
RI+GWG + GT YWL ANS+NT+WGE+G FRI+RGQ+ CGIE++I AG+PK
Sbjct: 272 RILGWG---VDNGTP----YWLAANSWNTDWGEDGFFRILRGQDHCGIESEIVAGIPK 322
>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
Length = 340
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 148/312 (47%), Positives = 205/312 (65%), Gaps = 21/312 (6%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNALSK-------LTLSELEMRMGVHPDSKLPQNRLP 85
L+ A R++ L +L Y ++N K + LS ++ G +KL +LP
Sbjct: 14 LTTARSRLEFQPLSDELVNYVNKQNTTWKAGHNFYNVDLSYVKKLCG----TKLGGPKLP 69
Query: 86 LLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGK 145
+ L+ + LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI S G
Sbjct: 70 QRLSLAGDIA-LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGL 128
Query: 146 RHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CE 203
++V +S++DL++CC CG GC GGF AW +W G+VSGG Y S GCRPY IP CE
Sbjct: 129 QNVEVSAEDLLTCCGFQCGEGCNGGFPSGAWNFWKKQGLVSGGLYDSHVGCRPYSIPPCE 188
Query: 204 RYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
++NGS C +TP+C + C+PGY SY++D +FG YS+P++E+ IM EI+++G
Sbjct: 189 HHVNGSRPPCSGEGGDTPKCSKICEPGYSPSYKEDKHFGCDTYSVPSDEKEIMVEIYKNG 248
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
PVE + ++Y+D +LYK+G+Y+HV G +G HA+RI+GWG E GT YWLV NS+
Sbjct: 249 PVEAAFSVYSDFLLYKSGVYQHVTGEMVGGHAVRILGWGVE---NGT----PYWLVGNSW 301
Query: 324 NTNWGENGLFRI 335
NT+WG+NG F+I
Sbjct: 302 NTDWGDNGFFKI 313
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 84/166 (50%), Positives = 122/166 (73%), Gaps = 8/166 (4%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C +TP+C + C+PGY SY++D +FG YS+P+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCSGEGGDTPKCSKICEPGYSPSYKEDKHFGCDTYSVPS 235
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVE + ++Y+D +LYK+G+Y+HV G +G HA+RI+GWG E GT
Sbjct: 236 DEKEIMVEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMVGGHAVRILGWGVE---NGT 292
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
YWLV NS+NT+WG+NG F+I+RG++ CGIE++I AG+P G
Sbjct: 293 ----PYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIPCTG 334
>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
Length = 339
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 137/255 (53%), Positives = 185/255 (72%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP VQ + L LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KLPQRVQFAKNLI-LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G +V +S++D+++CC D CG+GC GGF +AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NGHVNVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+PGY SY++D ++G +YS+ NE+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSDNEKEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVE + T+Y+D +LYK+G+Y+HV G +G HA+RI+GWG E +GT YWLV
Sbjct: 245 KNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVE---DGT----PYWLVG 297
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 298 NSWNTDWGDNGFFKI 312
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 84/163 (51%), Positives = 122/163 (74%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSD 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE+ IM EI+++GPVE + T+Y+D +LYK+G+Y+HV G +G HA+RI+GWG E +GT
Sbjct: 235 NEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVE---DGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG+NG F+I+RG++ CGIE++I AG+P
Sbjct: 292 ----PYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIP 330
>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
Length = 344
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 130/269 (48%), Positives = 183/269 (68%), Gaps = 8/269 (2%)
Query: 68 EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWA 127
E+RM + + +++ P + ++ + ++P+ FDAR+ WP+CP+I IRDQ CGS WA
Sbjct: 65 ELRMKIMKSKFISRSKKPRVDEIGEEGFKIPDSFDARVQWPHCPSISYIRDQSQCGSCWA 124
Query: 128 LGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGG 187
G+ EAMSDRVCIAS G + V LS+DD++SCC DCG+GC GG+ AW+Y+V TG+V+GG
Sbjct: 125 FGSAEAMSDRVCIASHGNKTVELSADDILSCCYDCGDGCDGGYPISAWEYFVETGVVTGG 184
Query: 188 TYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
Y +K CRPYEI PC + N + +TP+C+ CQ GY +SY+DD FG+ +Y
Sbjct: 185 LYGTKDSCRPYEIPPCGHHRNETFYGNCTQIADTPDCVTTCQAGYPISYDDDKTFGKDSY 244
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 306
++ ++ I +EI +GPV + +Y D Y GIYKHV+GG G HA+RI+GWG+E
Sbjct: 245 TIESSVTAIQKEIMTYGPVTAAFIVYEDFFHYHRGIYKHVSGGEEGGHAVRILGWGEE-- 302
Query: 307 GEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GT+ YWLVANS+NT+WGENG FRI
Sbjct: 303 -KGTA----YWLVANSWNTDWGENGYFRI 326
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 78/158 (49%), Positives = 106/158 (67%), Gaps = 8/158 (5%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
CRPYEIP C + N + +TP+C+ CQ GY +SY+DD FG+ +Y++ ++
Sbjct: 192 CRPYEIPPCGHHRNETFYGNCTQIADTPDCVTTCQAGYPISYDDDKTFGKDSYTIESSVT 251
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I +EI +GPV + +Y D Y GIYKHV+GG G HA+RI+GWG+E +GT+
Sbjct: 252 AIQKEIMTYGPVTAAFIVYEDFFHYHRGIYKHVSGGEEGGHAVRILGWGEE---KGTA-- 306
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
YWLVANS+NT+WGENG FRI+RG NECGIE ++ AG
Sbjct: 307 --YWLVANSWNTDWGENGYFRILRGSNECGIEENVVAG 342
>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
Length = 335
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 131/241 (54%), Positives = 179/241 (74%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP+ FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI S+G+ +V +S++D++
Sbjct: 80 LPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDML 139
Query: 157 SCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC +CG+GC GGF AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 140 TCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+PGY SY+DD +FG +YS+ +NE+ IM EI+++GPVEG+ ++Y+D
Sbjct: 200 -GEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSD 258
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+LYK+G+Y+HV+G +G HAIRI+GWG E + YWLV NS+NT+WG+ G F+
Sbjct: 259 FLLYKSGVYQHVSGEMMGGHAIRILGWGVE-------NDTPYWLVGNSWNTDWGDKGFFK 311
Query: 335 I 335
I
Sbjct: 312 I 312
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 85/163 (52%), Positives = 122/163 (74%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY+DD +FG +YS+ +
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSS 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV+G +G HAIRI+GWG E
Sbjct: 235 NEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YWLV NS+NT+WG+ G F+I+RGQ+ CGIE++I AG+P
Sbjct: 289 -NDTPYWLVGNSWNTDWGDKGFFKILRGQDHCGIESEIVAGMP 330
>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
Length = 335
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 134/241 (55%), Positives = 179/241 (74%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI S G+ +V +S++D++
Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139
Query: 157 SCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC +CG+GC GGF AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 140 TCCDGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+PGY SY++D +FG +YS+ NE+ IM EI+++GPVEG+ ++Y+D
Sbjct: 200 -GEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSD 258
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+LYK+G+Y+HV+G +G HAIRI+GWG E GT YWLV NS+NT+WG+NG F+
Sbjct: 259 FLLYKSGVYQHVSGEIMGGHAIRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFK 311
Query: 335 I 335
I
Sbjct: 312 I 312
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 87/163 (53%), Positives = 123/163 (75%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D +FG +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVAN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV+G +G HAIRI+GWG E GT
Sbjct: 235 NEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P
Sbjct: 292 ----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
Length = 335
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 131/241 (54%), Positives = 179/241 (74%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP+ FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI S+G+ +V +S++D++
Sbjct: 80 LPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDML 139
Query: 157 SCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC +CG+GC GGF AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 140 TCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+PGY SY+DD +FG +YS+ +NE+ IM EI+++GPVEG+ ++Y+D
Sbjct: 200 -GEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSD 258
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+LYK+G+Y+HV+G +G HAIRI+GWG E + YWLV NS+NT+WG+ G F+
Sbjct: 259 FLLYKSGVYQHVSGEMMGGHAIRILGWGVE-------NDTPYWLVGNSWNTDWGDKGFFK 311
Query: 335 I 335
I
Sbjct: 312 I 312
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 85/163 (52%), Positives = 122/163 (74%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY+DD +FG +YS+ +
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSS 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV+G +G HAIRI+GWG E
Sbjct: 235 NEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YWLV NS+NT+WG+ G F+I+RGQ+ CGIE++I AG+P
Sbjct: 289 -NDTPYWLVGNSWNTDWGDKGFFKILRGQDHCGIESEIVAGMP 330
>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
Length = 335
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 135/241 (56%), Positives = 178/241 (73%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP+GFDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI S G+ +V +S++D++
Sbjct: 80 LPKGFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 139
Query: 157 SCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC D CG+GC GGF AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 140 TCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+PGY SY++D +FG +YS+ NE+ IM EI+++GPVEG+ T+Y+D
Sbjct: 200 -GEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSD 258
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+G+Y+HV G +G HAIRI+GWG E GT YWLV NS+NT+WG+NG F+
Sbjct: 259 FLQYKSGVYQHVTGDLMGGHAIRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFK 311
Query: 335 I 335
I
Sbjct: 312 I 312
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 87/163 (53%), Positives = 121/163 (74%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D +FG +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISR 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE+ IM EI+++GPVEG+ T+Y+D + YK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 NEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P
Sbjct: 292 ----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 330
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
Length = 351
Score = 282 bits (721), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 135/275 (49%), Positives = 185/275 (67%), Gaps = 30/275 (10%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LPL+++ + ++ LP+ FD+R WP CPT++EIRDQGSCGS WA GA EAMSDRVCI S
Sbjct: 66 KLPLMIRYAGDIK-LPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS----------- 191
K V LS+ DL++CC CG GC GG+ AW +WV+ G+VSGG Y S
Sbjct: 125 NAKVSVELSAQDLLTCCNSCGMGCNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCV 184
Query: 192 ----------KQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GCRPY I PCE ++NGS SC +TPECI +C+ GY SY+ D +
Sbjct: 185 LLLAVDRDFVSPGCRPYTIPPCEHHVNGSRPSCSGEGGDTPECIFRCEAGYSPSYKQDKH 244
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
FG+ +YS+ + E+ I +EI+++GPVEG+ T+Y D +LYK+G+Y+HV+G LG HAI+++G
Sbjct: 245 FGKTSYSVSSEEDEIKQEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSALGGHAIKMLG 304
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG+E + V YWL ANS+NT+WG+NG F+I
Sbjct: 305 WGEE-------NGVPYWLCANSWNTDWGDNGFFKI 332
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 85/162 (52%), Positives = 121/162 (74%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY IP CE ++NGSR SC +TPECI +C+ GY SY+ D +FG+ +YS+ + E
Sbjct: 197 GCRPYTIPPCEHHVNGSRPSCSGEGGDTPECIFRCEAGYSPSYKQDKHFGKTSYSVSSEE 256
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +EI+++GPVEG+ T+Y D +LYK+G+Y+HV+G LG HAI+++GWG+E +
Sbjct: 257 DEIKQEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSALGGHAIKMLGWGEE-------N 309
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL ANS+NT+WG+NG F+I+RG + CGIE++I AG PK
Sbjct: 310 GVPYWLCANSWNTDWGDNGFFKILRGADHCGIESEIVAGNPK 351
>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
[Rhipicephalus pulchellus]
Length = 346
Score = 282 bits (721), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 140/271 (51%), Positives = 178/271 (65%), Gaps = 18/271 (6%)
Query: 71 MGVHPDS---KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWA 127
+GV P++ +LP+ RL L L LPE FD+R NWP C TI EIRDQGSCGS WA
Sbjct: 68 LGVSPENHRYRLPERRLDL-----SSLGPLPENFDSRENWPECTTIGEIRDQGSCGSCWA 122
Query: 128 LGAVEAMSDRVCI--ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVS 185
GAVEAMSDR CI S G + V LS+DDL+SCC+ CGNGC GGF G AW +WV TGIV+
Sbjct: 123 FGAVEAMSDRTCIHSPSGGPKRVHLSADDLLSCCRTCGNGCNGGFPGSAWSFWVKTGIVT 182
Query: 186 GGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRI 244
GG Y S GC PY I C+ ++NG+ C P TP C+ C+ GYDV Y DD ++G+
Sbjct: 183 GGNYDSDDGCMPYPIKACDHHVNGTLGPCDKKIPPTPRCVHMCRKGYDVDYHDDKHYGKS 242
Query: 245 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
+YS+P+ E+ I EI +GPVE T+Y+D + YK+G+Y+ LG HAIR++GWG E
Sbjct: 243 SYSVPSEEKQIQAEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGGHAIRLLGWGVE 302
Query: 305 PLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ V YWL ANS+NT WG+ G F+I
Sbjct: 303 -------NGVPYWLAANSWNTEWGDKGFFKI 326
Score = 168 bits (426), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 77/162 (47%), Positives = 106/162 (65%), Gaps = 8/162 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY I C+ ++NG+ C P TP C+ C+ GYDV Y DD ++G+ +YS+P+ E
Sbjct: 191 GCMPYPIKACDHHVNGTLGPCDKKIPPTPRCVHMCRKGYDVDYHDDKHYGKSSYSVPSEE 250
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EI +GPVE T+Y+D + YK+G+Y+ LG HAIR++GWG E +
Sbjct: 251 KQIQAEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGGHAIRLLGWGVE-------N 303
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL ANS+NT WG+ G F+I+RG +ECGIE D+ AGLPK
Sbjct: 304 GVPYWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVAGLPK 345
>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
Length = 330
Score = 282 bits (721), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 126/254 (49%), Positives = 181/254 (71%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP++VQ + ++ LP+ FD+R WP CPT++EIRDQGSCGS WA GA EA+SDR+CI S
Sbjct: 66 KLPIMVQYAGDMK-LPKAFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAISDRLCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
K V +S++DL++CC CG GC GG+ AW +W G+VSGG Y S GCRPY I P
Sbjct: 125 NAKVSVEISAEDLLTCCDSCGMGCNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTIPP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS C +TP+C+ +C+ GY SY +D ++G+ +YS+ ++E I EI++
Sbjct: 185 CEHHVNGSRPPCTGEGGDTPQCLSQCEAGYTPSYREDKHYGKTSYSVLSDEAEIQYEIYK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ T+Y D +LYK+G+Y+HV+G +G HAI+++GWG+E + V YWL AN
Sbjct: 245 NGPVEGAFTVYEDFVLYKSGVYQHVSGSAVGGHAIKVLGWGEE-------NGVPYWLCAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+
Sbjct: 298 SWNTDWGDNGFFKF 311
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 81/169 (47%), Positives = 123/169 (72%), Gaps = 10/169 (5%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NGSR C +TP+C+ +C+ GY SY +D ++G+ +
Sbjct: 169 GLYDSHVGCRPYTIPPCEHHVNGSRPPCTGEGGDTPQCLSQCEAGYTPSYREDKHYGKTS 228
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ ++E I EI+++GPVEG+ T+Y D +LYK+G+Y+HV+G +G HAI+++GWG+E
Sbjct: 229 YSVLSDEAEIQYEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSAVGGHAIKVLGWGEE- 287
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWL ANS+NT+WG+NG F+ +RG + CGIE++I AG+PK
Sbjct: 288 ------NGVPYWLCANSWNTDWGDNGFFKFLRGSDHCGIESEIVAGIPK 330
>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
Length = 246
Score = 281 bits (720), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 137/244 (56%), Positives = 173/244 (70%), Gaps = 8/244 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+L LV +D +LPE FDAR +WP CPTI+E+RDQGSCGS WA GAVEAMSDRVCI S
Sbjct: 10 KLEQLVSYTDTPTDLPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHS 69
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
+G ++ S+++LVSCC CG GC GGF G AW YW T GIVSGG Y SK GC PYEI P
Sbjct: 70 KGAKNFHFSAENLVSCCWTCGFGCNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAP 129
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NG+ C++ TP C++KC+ GY V Y DL+ G+ AYSL + + I +EI+
Sbjct: 130 CEHHVNGTRGPCKEGG-KTPACVKKCEDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYT 188
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ T+Y D I Y+ G+YKHVAG LG HAIRI+GWG + + + YWLVAN
Sbjct: 189 NGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ------NGEIPYWLVAN 242
Query: 322 SFNT 325
S+NT
Sbjct: 243 SWNT 246
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 68/135 (50%), Positives = 93/135 (68%), Gaps = 8/135 (5%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
++GC PYEI PCE ++NG+R C+ TP C++KC+ GY V Y DL+ G+ AYSL
Sbjct: 119 KMGCIPYEIAPCEHHVNGTRGPCKEGG-KTPACVKKCEDGYKVPYAQDLHRGKSAYSLGN 177
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG LG HAIRI+GWG +
Sbjct: 178 DVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ------ 231
Query: 453 SSVVKYWLVANSFNT 467
+ + YWLVANS+NT
Sbjct: 232 NGEIPYWLVANSWNT 246
>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
Length = 334
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 136/288 (47%), Positives = 188/288 (65%), Gaps = 17/288 (5%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE---ELPEGFDARINWP 108
+ A +N S ++ MG L +R LV + +E LPE FD R WP
Sbjct: 39 WKAGRNFPSDTPFKHIKKLMGT-----LRDDRFTTLVTMQHEVELIASLPENFDPRDKWP 93
Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQG 168
CPT+ E+RDQGSCGS WA GAVEAM+DR+C S G +H S++DL+SCC CG GC G
Sbjct: 94 NCPTLNEVRDQGSCGSCWAFGAVEAMTDRICTYSNGTKHFHFSAEDLLSCCPICGLGCNG 153
Query: 169 GFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKC 227
G AW+YW G+VSGG+Y S QGCRPYEI PCE ++ G+ C + TP+C+++C
Sbjct: 154 GMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRLPCS-GDTKTPKCVKEC 212
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
+ GY V Y+ D ++G+ YS+ E+ I E++++GPVEG+ T+YAD++ YK+G+YKHV
Sbjct: 213 ESGYKVPYKQDKHYGKHVYSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGVYKHVT 272
Query: 288 GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G LG HAI+I+GWG E + KYWL+ANS+N++WG+NG F+I
Sbjct: 273 GDALGGHAIKIMGWGVE-------NGNKYWLIANSWNSDWGDNGFFKI 313
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 79/161 (49%), Positives = 118/161 (73%), Gaps = 9/161 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEIP CE ++ G+R C + + TP+C+++C+ GY V Y+ D ++G+ YS+ E
Sbjct: 179 GCRPYEIPPCEHHVPGNRLPC-SGDTKTPKCVKECESGYKVPYKQDKHYGKHVYSVRGGE 237
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E++++GPVEG+ T+YAD++ YK+G+YKHV G LG HAI+I+GWG E +
Sbjct: 238 DHIKAELYKNGPVEGAFTVYADLLSYKSGVYKHVTGDALGGHAIKIMGWGVE-------N 290
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
KYWL+ANS+N++WG+NG F+I+RG++ CGIE+ I AG P
Sbjct: 291 GNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331
>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
Length = 333
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 136/279 (48%), Positives = 188/279 (67%), Gaps = 14/279 (5%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+ L ++ G H + Q R + ELP+ FD+R WP CPTI+E+RD
Sbjct: 47 FANADLHYVKRLCGTHLNGPQLQKRFGFADGM-----ELPDSFDSRAAWPNCPTIREVRD 101
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKY 177
QGSCGS WA GAVEA+SDRVC+ + GK +V +S++DL+SCC +CG GC GG+ AWK+
Sbjct: 102 QGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFECGMGCNGGYPSGAWKF 161
Query: 178 WVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
W TG+VSGG Y S GCRPY IP CE ++NGS +C+ E +TP+C+++C+ GY Y
Sbjct: 162 WTETGLVSGGLYDSHLGCRPYSIPPCEHHVNGSRPACKGEEGDTPKCVKQCEDGYAPVYG 221
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
D +FG +Y +P++E+ IM EI+++GPVEG+ +YAD +YK+G+Y+H G LG HAI
Sbjct: 222 SDKHFGATSYGVPSSEKEIMAEIYKNGPVEGAFLVYADFPMYKSGVYQHETGEELGGHAI 281
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+I+GWG E GT YWL ANS+NT+WG+NG F+I
Sbjct: 282 KILGWGVE---NGT----PYWLCANSWNTDWGDNGFFKI 313
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 87/178 (48%), Positives = 125/178 (70%), Gaps = 15/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E GL +GCRPY IP CE ++NGSR +C+ E +TP+C+++C+ GY Y
Sbjct: 162 WTETGLVSGGLYDSHLGCRPYSIPPCEHHVNGSRPACKGEEGDTPKCVKQCEDGYAPVYG 221
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D +FG +Y +P++E+ IM EI+++GPVEG+ +YAD +YK+G+Y+H G LG HAI
Sbjct: 222 SDKHFGATSYGVPSSEKEIMAEIYKNGPVEGAFLVYADFPMYKSGVYQHETGEELGGHAI 281
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+I+GWG E GT YWL ANS+NT+WG+NG F+I+RG++ CGIE++I AG+PK
Sbjct: 282 KILGWGVE---NGT----PYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAGIPK 332
>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
Length = 329
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 133/257 (51%), Positives = 181/257 (70%), Gaps = 11/257 (4%)
Query: 81 QNRLPLLVQLSDPL-EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVC 139
QN+ P L +L P +LP+ FDAR WP CPTIQ+IRDQGSCGS WA GA EA+SDR+C
Sbjct: 62 QNK-PTLPELEHPAGVKLPDTFDARQQWPNCPTIQDIRDQGSCGSCWAFGAAEAISDRLC 120
Query: 140 IASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE 199
I S K V +S++DL+SCC++CG GC GG+ AW+YW +G+V+GG Y S +GCRPY
Sbjct: 121 IHSNAKITVEISAEDLLSCCEECGMGCFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPYS 180
Query: 200 I-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
I PCE ++NG+ CQ E +TP+C KC GY +YE D FG+ YS+P+ +E IM E
Sbjct: 181 IPPCEHHVNGTRPPCQ-GEGDTPKCQTKCIDGYTPAYEKDKYFGKKTYSVPSKQEQIMTE 239
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWL 318
++++GPVE + ++Y D +LYK+G+Y+H+ G LG HAI+I+GWG+E + YWL
Sbjct: 240 LYKNGPVEAAFSVYEDFLLYKSGVYQHLTGDMLGGHAIKILGWGKE-------NNTPYWL 292
Query: 319 VANSFNTNWGENGLFRI 335
ANS+NT+WG G F+I
Sbjct: 293 AANSWNTDWGNQGFFKI 309
Score = 178 bits (452), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 83/179 (46%), Positives = 122/179 (68%), Gaps = 16/179 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W ++GL GCRPY IP CE ++NG+R CQ E +TP+C KC GY +YE
Sbjct: 159 WAKSGLVTGGLYGSNKGCRPYSIPPCEHHVNGTRPPCQG-EGDTPKCQTKCIDGYTPAYE 217
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D FG+ YS+P+ +E IM E++++GPVE + ++Y D +LYK+G+Y+H+ G LG HAI
Sbjct: 218 KDKYFGKKTYSVPSKQEQIMTELYKNGPVEAAFSVYEDFLLYKSGVYQHLTGDMLGGHAI 277
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+I+GWG+E + YWL ANS+NT+WG G F+I+RG +ECGIE+++ AG+P++
Sbjct: 278 KILGWGKE-------NNTPYWLAANSWNTDWGNQGFFKILRGGDECGIESEVVAGIPQL 329
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 281 bits (719), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 138/285 (48%), Positives = 188/285 (65%), Gaps = 12/285 (4%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N +++S + MGVHP SK + RL V P ++LPE FDAR W +C
Sbjct: 43 WKAGRNFDKSISMSYIRGLMGVHPKSK--EYRLAEFVHDEIP-DDLPESFDAREKWSHCA 99
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
+I IRDQ +CGS WA GA EAMSDRVCI S+GK V +S++DL+ CC CG GC GG+
Sbjct: 100 SIHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVDISAEDLLDCCDSCGAGCNGGYP 159
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW+YW +G+V+GG Y + GC+PY + PCE + GS +C P TP+C+ C+ G
Sbjct: 160 AAAWEYWKESGLVTGGLYGTSDGCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKG 218
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y Y+DD +FGR YS+ ++E+ I EIF++GPVE T+YAD + YK+G+Y+H +G
Sbjct: 219 YGKDYQDDKHFGRKVYSISSDEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHQSGDV 278
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAIRI+GWG E GT YWLVANS+N +WG++G F+I
Sbjct: 279 LGGHAIRILGWGTE---NGTP----YWLVANSWNEDWGDHGYFKI 316
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 87/178 (48%), Positives = 119/178 (66%), Gaps = 16/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E+GL GC+PY + PCE + GS +C P TP+C+ C+ GY Y+
Sbjct: 166 WKESGLVTGGLYGTSDGCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKGYGKDYQ 224
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
DD +FGR YS+ ++E+ I EIF++GPVE T+YAD + YK+G+Y+H +G LG HAI
Sbjct: 225 DDKHFGRKVYSISSDEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHQSGDVLGGHAI 284
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
RI+GWG E GT YWLVANS+N +WG++G F+I+RG++ECGIE DI AG+PK
Sbjct: 285 RILGWGTE---NGTP----YWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGIPK 335
>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
Length = 341
Score = 281 bits (719), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 134/277 (48%), Positives = 187/277 (67%), Gaps = 20/277 (7%)
Query: 64 LSELEMRMGVH----PDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQ 119
L+ ++M G + P+ +LP+ ++ +PL++LP FD+R WP CPT++E+RDQ
Sbjct: 59 LAHVKMMCGTYLNTPPELRLPEKKM-------EPLKDLPASFDSRTQWPNCPTLKEVRDQ 111
Query: 120 GSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWV 179
G+CGS WA GAVEAMSDR+CI S+GK +V +S++DL SCC+ CGNGC+GGF AW Y+
Sbjct: 112 GACGSCWAFGAVEAMSDRICIKSQGKENVHISAEDLTSCCRTCGNGCEGGFPSAAWSYYK 171
Query: 180 TTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
G+V+GG Y S QGC+PY I C+ ++ G C + TP+C C+ GY+V+YE D
Sbjct: 172 RDGLVTGGQYNSHQGCQPYTIKACDHHVVGKLQPCSKDIGPTPKCKHTCEAGYNVTYEKD 231
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
++G AYS+ E+ IM EI +GPVEG+ T+YAD YK+G+YKH G PLG HAI+I
Sbjct: 232 KHYGMSAYSVHGVEK-IMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKI 290
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GWG E + YWLVANS+N +WG+ G F+I
Sbjct: 291 LGWGTENGDD-------YWLVANSWNPDWGDQGFFKI 320
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 80/163 (49%), Positives = 110/163 (67%), Gaps = 9/163 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I C+ ++ G C + TP+C C+ GY+V+YE D ++G AYS+ E
Sbjct: 186 GCQPYTIKACDHHVVGKLQPCSKDIGPTPKCKHTCEAGYNVTYEKDKHYGMSAYSVHGVE 245
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ IM EI +GPVEG+ T+YAD YK+G+YKH G PLG HAI+I+GWG E +
Sbjct: 246 K-IMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGTENGDD---- 300
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLVANS+N +WG+ G F+I+RGQ+ECGIE+ I+AG PK+
Sbjct: 301 ---YWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEPKL 340
>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
Length = 341
Score = 281 bits (718), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 133/285 (46%), Positives = 188/285 (65%), Gaps = 11/285 (3%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N L+ ++ GV D+ L ++LP + +D + +LPE FD R WP CP
Sbjct: 44 WKAGRNFPVNTPLTHIKKLTGVLVDTHL--SKLPKVEHDADLIADLPENFDPRDKWPNCP 101
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
T+ E+RDQGSCGS WA GAVEAM+DR C S G +H S++DL+SCC CG GC GG
Sbjct: 102 TLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCPVCGLGCNGGMP 161
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW+YW G+VSGG+Y S QGCRPYEI PCE ++ G+ C + + TP+C + C+
Sbjct: 162 TLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPC-NGDSKTPKCHKTCESS 220
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y+V Y D +G+ YS+ + E+ I E++++GPVEG+ T+Y+D++ YK G+YKH G
Sbjct: 221 YNVDYHKDKRYGKHVYSVSSKEDHIKAELYKNGPVEGAFTVYSDLLNYKNGVYKHTVGNA 280
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAI+I+GWG E + KYWL+ANS+N++WG+NG F+I
Sbjct: 281 LGGHAIKILGWGVE-------NGNKYWLIANSWNSDWGDNGFFKI 318
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 113/163 (69%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEIP CE ++ G+R C + TP+C + C+ Y+V Y D +G+ YS+ + E
Sbjct: 184 GCRPYEIPPCEHHVPGNRMPCNGDS-KTPKCHKTCESSYNVDYHKDKRYGKHVYSVSSKE 242
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E++++GPVEG+ T+Y+D++ YK G+YKH G LG HAI+I+GWG E +
Sbjct: 243 DHIKAELYKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVE-------N 295
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
KYWL+ANS+N++WG+NG F+I+RG++ CGIE+ I AG P +
Sbjct: 296 GNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEPLL 338
>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
Length = 333
Score = 280 bits (717), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 136/266 (51%), Positives = 183/266 (68%), Gaps = 14/266 (5%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G H + Q R L +LP+ FD+R WP CPTI+EIRDQGSCGS WA GAV
Sbjct: 60 GTHLNGPQLQKRFGFADDL-----DLPDSFDSRAAWPNCPTIREIRDQGSCGSCWAFGAV 114
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYA 190
EA+SDRVC+ + GK +V +S++DL+SCC CG GC GG+ AW++W TG+VSGG Y
Sbjct: 115 EAISDRVCVHTNGKVNVEVSAEDLLSCCGFKCGMGCNGGYPSGAWRFWTETGLVSGGLYD 174
Query: 191 SKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 249
S GCRPY IP CE ++NGS SC+ E +TP+C++ C+ GY +Y D +FG +Y +P
Sbjct: 175 SHVGCRPYSIPPCEHHVNGSRPSCKGEEGDTPKCMKTCEEGYTPAYGSDKHFGATSYGVP 234
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
++E+ IM +I+++GPVEG+ +YAD LYK+G+Y+H G LG HAI+I+GWG E G
Sbjct: 235 SSEKEIMADIYKNGPVEGAFVVYADFPLYKSGVYQHETGEELGGHAIKILGWGVE---NG 291
Query: 310 TSSVVKYWLVANSFNTNWGENGLFRI 335
T YWL ANS+NT+WG+NG F+I
Sbjct: 292 T----PYWLCANSWNTDWGDNGFFKI 313
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 87/178 (48%), Positives = 125/178 (70%), Gaps = 15/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E GL +GCRPY IP CE ++NGSR SC+ E +TP+C++ C+ GY +Y
Sbjct: 162 WTETGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPSCKGEEGDTPKCMKTCEEGYTPAYG 221
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D +FG +Y +P++E+ IM +I+++GPVEG+ +YAD LYK+G+Y+H G LG HAI
Sbjct: 222 SDKHFGATSYGVPSSEKEIMADIYKNGPVEGAFVVYADFPLYKSGVYQHETGEELGGHAI 281
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+I+GWG E GT YWL ANS+NT+WG+NG F+I+RG++ CGIE+++ AG+PK
Sbjct: 282 KILGWGVE---NGT----PYWLCANSWNTDWGDNGFFKILRGKDHCGIESEVVAGIPK 332
>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
Length = 335
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 134/241 (55%), Positives = 177/241 (73%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP+ FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI S G+ +V +S++D++
Sbjct: 80 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 139
Query: 157 SCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC D CG+GC GGF AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 140 TCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+PGY SY++D +FG +YS+ NE+ IM EI+++GPVEG+ T+Y+D
Sbjct: 200 -GEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSD 258
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+G+Y+HV G +G HAIRI+GWG E GT YWLV NS+NT+WG+NG F+
Sbjct: 259 FLQYKSGVYQHVTGDLMGGHAIRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFK 311
Query: 335 I 335
I
Sbjct: 312 I 312
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 87/163 (53%), Positives = 121/163 (74%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D +FG +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISR 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE+ IM EI+++GPVEG+ T+Y+D + YK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 NEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P
Sbjct: 292 ----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 330
>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
Length = 359
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 135/255 (52%), Positives = 182/255 (71%), Gaps = 10/255 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V L++ L LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 90 KLPQRVWLAEDLV-LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILT 148
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G +V +S++DL++CC CG GC GGF AW +W G+VSGG Y S GCRPY IP
Sbjct: 149 NGNVNVEVSAEDLLTCCGFQCGEGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIP 208
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C +TP+C R C+ GY SY++D +FG +YS+P++E IM EI+
Sbjct: 209 PCEHHVNGSRPPCTGEGGSTPKCSRICEAGYTPSYKEDKHFGCSSYSVPSSETEIMAEIY 268
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVE + ++Y+D +LYK+G+Y+HV G +G HA+RI+GWG E +GT YWLV
Sbjct: 269 KNGPVEAAFSVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVE---DGT----PYWLVG 321
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG++G F+I
Sbjct: 322 NSWNTDWGDSGFFKI 336
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 84/163 (51%), Positives = 121/163 (74%), Gaps = 8/163 (4%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C +TP+C R C+ GY SY++D +FG +YS+P+
Sbjct: 199 HVGCRPYSIPPCEHHVNGSRPPCTGEGGSTPKCSRICEAGYTPSYKEDKHFGCSSYSVPS 258
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E IM EI+++GPVE + ++Y+D +LYK+G+Y+HV G +G HA+RI+GWG E +GT
Sbjct: 259 SETEIMAEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVE---DGT 315
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG++G F+I+RGQ+ CGIE++I AGLP
Sbjct: 316 ----PYWLVGNSWNTDWGDSGFFKILRGQDHCGIESEIVAGLP 354
>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 137/274 (50%), Positives = 180/274 (65%), Gaps = 10/274 (3%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
T+S++ +G PD Q L L ELP+ FDAR W +CP+I EIRDQ SC
Sbjct: 62 TVSDIRRMLGALPDPNGEQLET-LCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSC 120
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVEAMSDR+CI S+GK LS+++LVSCC CG GC GGF AW YW G
Sbjct: 121 GSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKNQG 180
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IV+G Y + GC+PYE PCE + G C D + TP C R CQ GY+VSYE+D +
Sbjct: 181 IVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVC-DGDVETPPCKRTCQAGYNVSYENDKWY 239
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G++ Y + +N+E IM+E+ +HGPVE +YAD YK+G+Y+HV+G LG HA+R++GW
Sbjct: 240 GKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGW 299
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+E + V YWL+ANS+NT+WG+NG F+I
Sbjct: 300 GEE-------NNVPYWLIANSWNTDWGDNGYFKI 326
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 83/163 (50%), Positives = 117/163 (71%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYE P CE + G C + TP C R CQ GY+VSYE+D +G++ Y + +N+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDG-DVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E IM+E+ +HGPVE +YAD YK+G+Y+HV+G LG HA+R++GWG+E +
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEE-------N 303
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+NT+WG+NG F+I+RG+NECGIE+D+ AG+PKI
Sbjct: 304 NVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPKI 346
>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 137/274 (50%), Positives = 180/274 (65%), Gaps = 10/274 (3%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
T+S++ +G PD Q L L ELP+ FDAR W +CP+I EIRDQ SC
Sbjct: 62 TVSDIRRMLGALPDPNGEQLET-LCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSC 120
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVEAMSDR+CI S+GK LS+++LVSCC CG GC GGF AW YW G
Sbjct: 121 GSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKNQG 180
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IV+G Y + GC+PYE PCE + G C D + TP C R CQ GY+VSYE+D +
Sbjct: 181 IVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVC-DGDVETPPCKRTCQAGYNVSYENDKWY 239
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G++ Y + +N+E IM+E+ +HGPVE +YAD YK+G+Y+HV+G LG HA+R++GW
Sbjct: 240 GKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGW 299
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+E + V YWL+ANS+NT+WG+NG F+I
Sbjct: 300 GEE-------NNVPYWLIANSWNTDWGDNGYFKI 326
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 83/163 (50%), Positives = 117/163 (71%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYE P CE + G C + TP C R CQ GY+VSYE+D +G++ Y + +N+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDG-DVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E IM+E+ +HGPVE +YAD YK+G+Y+HV+G LG HA+R++GWG+E +
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEE-------N 303
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+NT+WG+NG F+I+RG+NECGIE+D+ AG+PKI
Sbjct: 304 NVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPKI 346
>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 129/254 (50%), Positives = 178/254 (70%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP++VQ + ++ LP FDAR WP CPT++EIRDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 KLPIMVQYAGDVK-LPTEFDARAQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
+ V +SS+DL++CC+ CG GC GG+ AW +W G+V+GG Y S GCRPY I P
Sbjct: 125 NARVSVEISSEDLLTCCESCGMGCNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIPP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NG+ C +TP+CI +C+ GY SY+ D ++G+ +YS+ ANE I EI++
Sbjct: 185 CEHHVNGTRPPCTGEGGDTPQCINQCESGYTPSYKKDKHYGKTSYSVEANENQIQTEIYK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ +Y D +YK+G+Y+HV+G +G HAI+I+GWG E V YWL AN
Sbjct: 245 NGPVEGAFMVYEDFPMYKSGVYQHVSGSLIGGHAIKILGWGVE-------DGVPYWLCAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+I
Sbjct: 298 SWNTDWGDNGYFKI 311
Score = 181 bits (458), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 82/169 (48%), Positives = 120/169 (71%), Gaps = 10/169 (5%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NG+R C +TP+CI +C+ GY SY+ D ++G+ +
Sbjct: 169 GLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCINQCESGYTPSYKKDKHYGKTS 228
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ ANE I EI+++GPVEG+ +Y D +YK+G+Y+HV+G +G HAI+I+GWG E
Sbjct: 229 YSVEANENQIQTEIYKNGPVEGAFMVYEDFPMYKSGVYQHVSGSLIGGHAIKILGWGVE- 287
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL ANS+NT+WG+NG F+I+RG + CGIE+++ AG+PK
Sbjct: 288 ------DGVPYWLCANSWNTDWGDNGYFKILRGSDHCGIESEVVAGIPK 330
>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
Length = 248
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 139/257 (54%), Positives = 178/257 (69%), Gaps = 11/257 (4%)
Query: 71 MGVH-PDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALG 129
MGVH +++ P +L L+ +D +LPE FDAR WP CPTI+E+RDQGSCGS WA G
Sbjct: 1 MGVHEKNAEYP--KLEQLLTYNDASTDLPETFDARERWPNCPTIREVRDQGSCGSCWAFG 58
Query: 130 AVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY 189
AVEAMSDRVCI S G ++ S+++LVSCC CG GC GGF G AW YW T GIVSGG Y
Sbjct: 59 AVEAMSDRVCIHSNGTKNFHFSAENLVSCCWTCGFGCNGGFPGAAWNYWKTKGIVSGGPY 118
Query: 190 ASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 248
S GC PYEI PCE ++NG+ C++ TP C++KC+ GY V Y DL+ G+ AYS+
Sbjct: 119 GSNMGCIPYEIAPCEHHVNGTRGPCKEGG-KTPTCVKKCEEGYKVPYAQDLHHGKSAYSI 177
Query: 249 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 308
+ + I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG LG HAIRI+GWG +
Sbjct: 178 RNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ---- 233
Query: 309 GTSSVVKYWLVANSFNT 325
+ + YWLVANS+NT
Sbjct: 234 --NGEIPYWLVANSWNT 248
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 67/134 (50%), Positives = 92/134 (68%), Gaps = 8/134 (5%)
Query: 335 IGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
+GC PYEI PCE ++NG+R C+ TP C++KC+ GY V Y DL+ G+ AYS+ +
Sbjct: 122 MGCIPYEIAPCEHHVNGTRGPCKEGG-KTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRND 180
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG LG HAIRI+GWG + +
Sbjct: 181 VDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ------N 234
Query: 454 SVVKYWLVANSFNT 467
+ YWLVANS+NT
Sbjct: 235 GEIPYWLVANSWNT 248
>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
Length = 342
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 139/294 (47%), Positives = 189/294 (64%), Gaps = 11/294 (3%)
Query: 45 LLPKLPFYGAEKNALSKL-TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFD 102
+ K P G + + + +L + + MG D+++ + R P V D E+P FD
Sbjct: 37 FINKHPDAGWKADKSDRFHSLDDARILMGARKEDAEMKRKRRPT-VDHHDLNVEIPSQFD 95
Query: 103 ARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDC 162
+R WP+C +I +IRDQ CGS WA GAVEAM+DR+CI S G++ LS+ DL+SCC+DC
Sbjct: 96 SRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDC 155
Query: 163 GNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTP 221
G+GCQGGF G AW YWV GIV+GG+ + GC+PY P CE + G + +C TP
Sbjct: 156 GDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTP 215
Query: 222 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 281
+C +KCQ GY YE D N+G Y++ +NE+ I REI +GPVE + +Y D + YK+G
Sbjct: 216 QCKQKCQKGYKTPYEQDKNYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSG 275
Query: 282 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IY+HVAG +G HAIRIIGWG E +G YWL+ANS+N +WGENGLFR+
Sbjct: 276 IYRHVAGSIVGGHAIRIIGWGVE---KGKP----YWLIANSWNEDWGENGLFRM 322
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 85/180 (47%), Positives = 112/180 (62%), Gaps = 11/180 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE + G +C TP+C +KCQ GY
Sbjct: 170 YWVKRGIVTGGSKEN---HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYK 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
YE D N+G Y++ +NE+ I REI +GPVE + +Y D + YK+GIY+HVAG +G
Sbjct: 227 TPYEQDKNYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVG 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HAIRIIGWG E +G YWL+ANS+N +WGENGLFR+VRG++EC IE+ + AGL
Sbjct: 287 GHAIRIIGWGVE---KGKP----YWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 279 bits (713), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 128/254 (50%), Positives = 183/254 (72%), Gaps = 11/254 (4%)
Query: 85 PLLVQLSDPLE--ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
P L +L+ +E +LP+ FD R WP CPT+++IRDQG+CGS WA GA EA+SDR+CI S
Sbjct: 65 PKLPELAHDVEGIKLPDSFDPREQWPNCPTLKQIRDQGNCGSCWAFGAAEAISDRICIQS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
GK + +S++DL++CC +CG GC GGF AW++W G+V+GG + SK GCRPY + P
Sbjct: 125 GGKISLEISAEDLLTCCDECGMGCFGGFPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS CQ E TP+C+ +C GY +SY D +FG+ +YS+P+ +E IM E+++
Sbjct: 185 CEHHVNGSRPPCQ-GEVETPKCVTQCNNGYSLSYPKDKHFGQRSYSIPSQQEQIMTELYK 243
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVE + ++YAD +LYK G+Y+HV G LG HA++I+GWG+E GT YWLVAN
Sbjct: 244 NGPVEAAFSVYADFLLYKNGVYQHVTGDMLGGHAVKILGWGEE---NGTP----YWLVAN 296
Query: 322 SFNTNWGENGLFRI 335
S+N++WG+ G F+I
Sbjct: 297 SWNSDWGDKGFFKI 310
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 90/182 (49%), Positives = 128/182 (70%), Gaps = 13/182 (7%)
Query: 319 VANSFNTNWG--ENGLF--RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGY 373
A F TN G GLF ++GCRPY + PCE ++NGSR CQ E TP+C+ +C GY
Sbjct: 155 AAWEFWTNKGLVTGGLFDSKVGCRPYTLAPCEHHVNGSRPPCQG-EVETPKCVTQCNNGY 213
Query: 374 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 433
+SY D +FG+ +YS+P+ +E IM E++++GPVE + ++YAD +LYK G+Y+HV G L
Sbjct: 214 SLSYPKDKHFGQRSYSIPSQQEQIMTELYKNGPVEAAFSVYADFLLYKNGVYQHVTGDML 273
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
G HA++I+GWG+E GT YWLVANS+N++WG+ G F+I RG +ECGIE+++ AG
Sbjct: 274 GGHAVKILGWGEE---NGTP----YWLVANSWNSDWGDKGFFKIKRGNDECGIESEMVAG 326
Query: 494 LP 495
P
Sbjct: 327 AP 328
>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 136/274 (49%), Positives = 180/274 (65%), Gaps = 10/274 (3%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
T+S++ +G PD Q L + ELP+ FDAR W +CP+I EIRDQ SC
Sbjct: 62 TVSDIRRMLGALPDPNGEQLET-LCTGYELTVNELPKSFDARKEWTHCPSISEIRDQSSC 120
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVEAMSDR+CI S+GK LS+++LVSCC CG GC GGF AW YW G
Sbjct: 121 GSYWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKNQG 180
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IV+G Y + GC+PYE PCE + G C D + TP C R CQ GY+VSYE+D +
Sbjct: 181 IVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVC-DGDVETPPCKRTCQAGYNVSYENDKWY 239
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G++ Y + +N+E IM+E+ +HGPVE +YAD YK+G+Y+HV+G LG HA+R++GW
Sbjct: 240 GKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGW 299
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+E + V YWL+ANS+NT+WG+NG F+I
Sbjct: 300 GEE-------NNVPYWLIANSWNTDWGDNGYFKI 326
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 83/163 (50%), Positives = 117/163 (71%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYE P CE + G C + TP C R CQ GY+VSYE+D +G++ Y + +N+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDG-DVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E IM+E+ +HGPVE +YAD YK+G+Y+HV+G LG HA+R++GWG+E +
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEE-------N 303
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+NT+WG+NG F+I+RG+NECGIE+D+ AG+PKI
Sbjct: 304 NVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPKI 346
>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 278 bits (711), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 137/274 (50%), Positives = 179/274 (65%), Gaps = 10/274 (3%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
T+S++ +G PD Q L L ELP+ FDAR W +CP+I EIRDQ SC
Sbjct: 62 TVSDIRRMLGALPDPNGEQLET-LCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSC 120
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVEAMSDR+CI S+GK LS+++LVSCC CG GC GGF AW YW G
Sbjct: 121 GSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKNQG 180
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IV+G Y + GC+PYE PCE G C D + TP C R CQ GY+VSYE+D +
Sbjct: 181 IVTGDLYNTTNGCQPYEFPPCEHNTLGPLPVC-DGDVETPPCKRTCQAGYNVSYENDKWY 239
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G++ Y + +N+E IM+E+ +HGPVE +YAD YK+G+Y+HV+G LG HA+R++GW
Sbjct: 240 GKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGW 299
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+E + V YWL+ANS+NT+WG+NG F+I
Sbjct: 300 GEE-------NNVPYWLIANSWNTDWGDNGYFKI 326
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 83/163 (50%), Positives = 116/163 (71%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYE P CE G C + TP C R CQ GY+VSYE+D +G++ Y + +N+
Sbjct: 192 GCQPYEFPPCEHNTLGPLPVCDG-DVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E IM+E+ +HGPVE +YAD YK+G+Y+HV+G LG HA+R++GWG+E +
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEE-------N 303
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+NT+WG+NG F+I+RG+NECGIE+D+ AG+PKI
Sbjct: 304 NVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPKI 346
>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 333
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 125/255 (49%), Positives = 175/255 (68%), Gaps = 9/255 (3%)
Query: 82 NRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIA 141
N L + + + +LP+ FD+R W CP+I+EIRDQGSCGS W+ GAVE+++DR+CI
Sbjct: 67 NHFQLPIHVHEDTSDLPKSFDSRDKWRMCPSIREIRDQGSCGSCWSFGAVESITDRICIH 126
Query: 142 SRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
S GK V +S++DL++CC CG GC GGF +AW YWV GIV+GG Y S +GC+PYEIP
Sbjct: 127 SNGKVKVHISAEDLMTCCTSCGMGCNGGFLPQAWHYWVNNGIVTGGQYHSHKGCQPYEIP 186
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++ G +C P TP+C +KCQPGY+ ++ D +FG+ +YS+ N + I +EI
Sbjct: 187 KCEHHVKGPFKACGKELP-TPKCSQKCQPGYNKTFNQDKHFGKKSYSITNNIQQIQKEIM 245
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
+GPVE + T+YAD YK+G+Y+H GGPLG HA++I+GWG E + YWL+A
Sbjct: 246 MNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGGHAVKILGWGTE-------NNTPYWLIA 298
Query: 321 NSFNTNWGENGLFRI 335
NS+N WG+ G F+I
Sbjct: 299 NSWNPTWGDKGYFKI 313
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 79/162 (48%), Positives = 113/162 (69%), Gaps = 9/162 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYEIP CE ++ G +C P TP+C +KCQPGY+ ++ D +FG+ +YS+ N
Sbjct: 179 GCQPYEIPKCEHHVKGPFKACGKELP-TPKCSQKCQPGYNKTFNQDKHFGKKSYSITNNI 237
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +EI +GPVE + T+YAD YK+G+Y+H GGPLG HA++I+GWG E +
Sbjct: 238 QQIQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGGHAVKILGWGTE-------N 290
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL+ANS+N WG+ G F+I+RG++ECGIE+ I AG+PK
Sbjct: 291 NTPYWLIANSWNPTWGDKGYFKIIRGKDECGIESSIVAGMPK 332
>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
Length = 341
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 133/277 (48%), Positives = 185/277 (66%), Gaps = 20/277 (7%)
Query: 64 LSELEMRMGVH----PDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQ 119
L+ ++M G + P+ +LP+ ++ +PL++LP FD+R WP CPT++E+RDQ
Sbjct: 59 LAHVKMMCGTYLNTPPELRLPEKKM-------EPLKDLPATFDSRTQWPNCPTLKEVRDQ 111
Query: 120 GSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWV 179
G+CGS WA GAVEAMSDR+CI S+GK + +S++DL SCC+ CGNGC+GGF AW Y+
Sbjct: 112 GACGSCWAFGAVEAMSDRICIKSQGKENTHISAEDLTSCCRTCGNGCEGGFPSAAWSYYK 171
Query: 180 TTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
G+V+GG Y S QGC PY I C+ ++ G C + TP+C C+ GY+V+YE D
Sbjct: 172 KDGLVTGGQYNSHQGCLPYTIKACDHHVVGKLQPCSKSIGPTPKCKHTCEAGYNVTYEKD 231
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
++G AYS+ E+ IM EI +GPVEG+ T+YAD YK+G+YKH G PLG HAI+I
Sbjct: 232 KHYGSSAYSVHGVEK-IMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKI 290
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GWG E + YWLVANS+N +WG+ G F+I
Sbjct: 291 LGWGTENGDD-------YWLVANSWNPDWGDQGFFKI 320
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 80/163 (49%), Positives = 109/163 (66%), Gaps = 9/163 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY I C+ ++ G C + TP+C C+ GY+V+YE D ++G AYS+ E
Sbjct: 186 GCLPYTIKACDHHVVGKLQPCSKSIGPTPKCKHTCEAGYNVTYEKDKHYGSSAYSVHGVE 245
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ IM EI +GPVEG+ T+YAD YK+G+YKH G PLG HAI+I+GWG E +
Sbjct: 246 K-IMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGTENGDD---- 300
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLVANS+N +WG+ G F+I+RGQ+ECGIE+ I+AG PK+
Sbjct: 301 ---YWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEPKL 340
>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
Length = 330
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 196/312 (62%), Gaps = 22/312 (7%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNA-------LSKLTLSELEMRMGVHPDSKLPQNRLP 85
L+ A+ R L +L Y ++N + LS L+ G P R+
Sbjct: 5 LADAWQRPSFHPLSDELVNYVNKQNTTWQAGHNFYNVDLSYLKRLCGTFLGGPKPPQRVK 64
Query: 86 LLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGK 145
L+ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 65 FAEDLN-----LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAH 119
Query: 146 RHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CE 203
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP CE
Sbjct: 120 VSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCE 179
Query: 204 RYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ NE IM EI+++G
Sbjct: 180 HHVNGSRPPCT-GEGDTPKCSKSCEPGYSPTYKQDKHYGYDSYSVSNNERDIMAEIYKNG 238
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
PVEG+ ++YAD +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLV NS+
Sbjct: 239 PVEGAFSVYADFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVGNSW 291
Query: 324 NTNWGENGLFRI 335
NT+WG+NG F+I
Sbjct: 292 NTDWGDNGFFKI 303
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 85/164 (51%), Positives = 121/164 (73%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 167 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKSCEPGYSPTYKQDKHYGYDSYSVSN 225
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE IM EI+++GPVEG+ ++YAD +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 226 NERDIMAEIYKNGPVEGAFSVYADFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 282
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLV NS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 283 ----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 322
>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
Length = 334
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 139/285 (48%), Positives = 184/285 (64%), Gaps = 13/285 (4%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N +T+ + +GVH D+ + RLP + +LPE FD+R WP CP
Sbjct: 42 WKAGRNFHEGVTMKYIRGLLGVHKDNH--KYRLPSIRHAVPG--DLPESFDSREQWPNCP 97
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
TI EIRDQGSCGS WA GA EAMSDR CI S GK +V +S++DL++CC CG GC GGF
Sbjct: 98 TISEIRDQGSCGSCWAFGAAEAMSDRHCIHSNGKVNVEISAEDLLTCCDSCGMGCNGGFP 157
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPG 230
G AW+YWV G+V+GG Y S GC+PY I CE + G C D +TP+C+ C+ G
Sbjct: 158 GSAWEYWVDKGLVTGGLYNSHVGCQPYTIASCEHHTKGKLPPCGDI-VDTPQCVHMCEKG 216
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y+VSY D FG+ +YS+ E+ I EI +GPVE + T+YAD + YK+G+Y+HV G
Sbjct: 217 YNVSYRADKYFGKKSYSIDEQEDQIKTEISTNGPVEAAFTVYADFVTYKSGVYRHVTGEE 276
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G HA+RI+GWG E S YWLVANS+NT+WG+ G F+I
Sbjct: 277 MGGHAVRILGWGTE-------SGTPYWLVANSWNTDWGDKGYFKI 314
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 81/170 (47%), Positives = 113/170 (66%), Gaps = 11/170 (6%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GC+PY I CE + G C + +TP+C+ C+ GY+VSY D FG+ +
Sbjct: 173 GLYNSHVGCQPYTIASCEHHTKGKLPPC-GDIVDTPQCVHMCEKGYNVSYRADKYFGKKS 231
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ E+ I EI +GPVE + T+YAD + YK+G+Y+HV G +G HA+RI+GWG E
Sbjct: 232 YSIDEQEDQIKTEISTNGPVEAAFTVYADFVTYKSGVYRHVTGEEMGGHAVRILGWGTE- 290
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
S YWLVANS+NT+WG+ G F+I+RG +ECGIE+ I AGLPK+
Sbjct: 291 ------SGTPYWLVANSWNTDWGDKGYFKILRGSDECGIESSIVAGLPKV 334
>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
Length = 334
Score = 278 bits (710), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 138/274 (50%), Positives = 179/274 (65%), Gaps = 11/274 (4%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
TL ++ MG D L ++L + D + LPE FD R WP CPT+ EIRDQGSC
Sbjct: 50 TLKSIKKLMGALEDKYL--HKLYTVEHDDDTINNLPENFDPRDKWPNCPTLNEIRDQGSC 107
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVEAM+DR C S G +H S++DL+SCC CG GC GG AW+YW G
Sbjct: 108 GSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCPVCGLGCNGGIPSFAWEYWKHFG 167
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IVSGG Y S QGC PYEI PCE ++ G+ C + E +TP+C R C+ Y SY+ D +
Sbjct: 168 IVSGGNYNSSQGCLPYEIPPCEHHVPGNRIPC-NGETSTPKCHRSCRKEYTNSYKSDKKY 226
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G+ YS+ EE I EIF++GPVEG+ T+YAD++ YK+G+YKH G LG HAI+I+GW
Sbjct: 227 GKHVYSVGGGEEHIKAEIFKNGPVEGAFTVYADLLTYKSGVYKHTEGEALGGHAIKIMGW 286
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E + KYWL+ANS+N++WG+NG F+I
Sbjct: 287 GVE-------NGNKYWLIANSWNSDWGDNGFFKI 313
Score = 168 bits (426), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 81/161 (50%), Positives = 112/161 (69%), Gaps = 9/161 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PYEIP CE ++ G+R C E +TP+C R C+ Y SY+ D +G+ YS+ E
Sbjct: 179 GCLPYEIPPCEHHVPGNRIPCNG-ETSTPKCHRSCRKEYTNSYKSDKKYGKHVYSVGGGE 237
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E I EIF++GPVEG+ T+YAD++ YK+G+YKH G LG HAI+I+GWG E +
Sbjct: 238 EHIKAEIFKNGPVEGAFTVYADLLTYKSGVYKHTEGEALGGHAIKIMGWGVE-------N 290
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
KYWL+ANS+N++WG+NG F+I+RG++ CGIE+ I AG P
Sbjct: 291 GNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331
>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
Length = 330
Score = 277 bits (709), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 129/254 (50%), Positives = 174/254 (68%), Gaps = 9/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP++VQ + L+ LP FD+R WP CPT++EIRDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 KLPIMVQYAGGLK-LPAQFDSREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
K V +SS+DL++CC CG GC GG+ AW +W G+VSGG Y S GCRPY I P
Sbjct: 125 GSKVSVEISSEDLLTCCDACGMGCNGGYPSAAWDFWTKEGLVSGGLYNSHIGCRPYTIPP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS C +TP+C+ C+ GY +Y D ++G+ +YS+ A+ E I EI +
Sbjct: 185 CEHHVNGSRPHCSGEGGDTPKCVHSCEAGYSPTYTKDKHYGKSSYSVEASVEQIQAEISQ 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ +Y D ++YK+G+Y+H G LG HAI+++GWG+E V YWL AN
Sbjct: 245 NGPVEGAFIVYEDFVMYKSGVYQHTTGSALGGHAIKVLGWGEE-------DGVPYWLCAN 297
Query: 322 SFNTNWGENGLFRI 335
S+NT+WGENG F+I
Sbjct: 298 SWNTDWGENGFFKI 311
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 82/169 (48%), Positives = 117/169 (69%), Gaps = 10/169 (5%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ IGCRPY IP CE ++NGSR C +TP+C+ C+ GY +Y D ++G+ +
Sbjct: 169 GLYNSHIGCRPYTIPPCEHHVNGSRPHCSGEGGDTPKCVHSCEAGYSPTYTKDKHYGKSS 228
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ A+ E I EI ++GPVEG+ +Y D ++YK+G+Y+H G LG HAI+++GWG+E
Sbjct: 229 YSVEASVEQIQAEISQNGPVEGAFIVYEDFVMYKSGVYQHTTGSALGGHAIKVLGWGEE- 287
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL ANS+NT+WGENG F+I+RG + CGIE++I AG+PK
Sbjct: 288 ------DGVPYWLCANSWNTDWGENGFFKILRGSDHCGIESEIVAGIPK 330
>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
Length = 340
Score = 277 bits (708), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/255 (51%), Positives = 177/255 (69%), Gaps = 10/255 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V + +E LP+ FD+R WP CPTI EIRDQGSCGS WA GAVEA+SDR+C+ +
Sbjct: 67 KLPERVDFAGDME-LPDSFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
K V +S++DL+SCC +CG GC GG+ AW+YW G+VSGG Y S GCRPY IP
Sbjct: 126 NAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C TP C R C+PGY SY++D ++G +Y +P +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIY 245
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ +Y D ++YK+G+Y+HV G +G HAIR++GWG + GT YWL A
Sbjct: 246 KNGPVEGAFIVYEDFLMYKSGVYQHVTGEQVGGHAIRLLGWGVD---NGT----PYWLAA 298
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 299 NSWNTDWGDNGFFKI 313
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 85/177 (48%), Positives = 121/177 (68%), Gaps = 15/177 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E GL +GCRPY IP CE ++NGSR C TP C R C+PGY SY+
Sbjct: 162 WTEKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYK 221
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
+D ++G +Y +P +E+ IM EI+++GPVEG+ +Y D ++YK+G+Y+HV G +G HAI
Sbjct: 222 EDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVTGEQVGGHAI 281
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
R++GWG + GT YWL ANS+NT+WG+NG F+I+RG++ CGIE++I AG+P
Sbjct: 282 RLLGWGVD---NGT----PYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAGIP 331
>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
Length = 344
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 134/275 (48%), Positives = 183/275 (66%), Gaps = 12/275 (4%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDP-LEELPEGFDARINWPYCPTIQEIRDQGS 121
++S++ +G PD LP L P L+ELP+ FDAR +WP+CP+I EIRDQ S
Sbjct: 59 SVSDIRRMLGALPDPN--GGYLPTLCTGYTPSLDELPKEFDARKHWPHCPSISEIRDQSS 116
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAMSDR+CI S+G LS+++LV+CC CG GC GGF AW YW +
Sbjct: 117 CGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCSSCGMGCNGGFPHSAWSYWKRS 176
Query: 182 GIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+G Y + GC+PYE PCE ++ G SC + TP+C CQPGY++ Y D
Sbjct: 177 GIVTGDLYNTTDGCQPYEFPPCEHHVVGPRPSC-GGDVETPKCKTTCQPGYNIPYNKDKW 235
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G+ Y + +N+E IM+E+ HGPVE +YAD YK+G+Y+HV+GG LG HA+R++G
Sbjct: 236 YGKTVYRVHSNQEAIMKEVMDHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLG 295
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG+E + V YWL+ANS+N++WG+NG F+I
Sbjct: 296 WGEE-------NGVPYWLIANSWNSDWGDNGYFKI 323
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 81/163 (49%), Positives = 117/163 (71%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYE P CE ++ G R SC + TP+C CQPGY++ Y D +G+ Y + +N+
Sbjct: 189 GCQPYEFPPCEHHVVGPRPSC-GGDVETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQ 247
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E IM+E+ HGPVE +YAD YK+G+Y+HV+GG LG HA+R++GWG+E +
Sbjct: 248 EAIMKEVMDHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWGEE-------N 300
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+N++WG+NG F+I+RG+NECGIE+D+ AG+PK+
Sbjct: 301 GVPYWLIANSWNSDWGDNGYFKIIRGRNECGIESDVNAGIPKL 343
>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
Length = 332
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 134/275 (48%), Positives = 185/275 (67%), Gaps = 14/275 (5%)
Query: 63 TLSELEMRMGVHPDSKLPQN-RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
T+S ++ GV D P N +LPL + + +++P+ FD+R W CPTI+E+RDQGS
Sbjct: 49 TVSYVKGLCGVIRD---PNNHKLPLKLHELN-AQDIPDTFDSRTQWANCPTIKEVRDQGS 104
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WAL AVEAMSDR+C+AS+G +S++DL SCCK CGNGC GGF AW+YW
Sbjct: 105 CGSCWALAAVEAMSDRICVASKGSTMAHISAEDLNSCCKSCGNGCNGGFPEAAWEYWKRD 164
Query: 182 GIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
G+V+GG Y S QGC+PYEI PCE ++NGS +C EP TP C + C+ GY+V++ D +
Sbjct: 165 GLVTGGPYGSHQGCQPYEIKPCEHHINGSRPACGKLEP-TPRCKKSCESGYNVTFAKDKH 223
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+ + AYS+ + + I EI +GPVE + T+YAD YK+G+Y+H +G LG HA+++IG
Sbjct: 224 YAKTAYSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMIG 283
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
W GT YWL+ANS+NT+WG G F+I
Sbjct: 284 W-------GTEGSTPYWLIANSWNTDWGNMGFFKI 311
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 80/163 (49%), Positives = 112/163 (68%), Gaps = 9/163 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYEI PCE ++NGSR +C EP TP C + C+ GY+V++ D ++ + AYS+ +
Sbjct: 177 GCQPYEIKPCEHHINGSRPACGKLEP-TPRCKKSCESGYNVTFAKDKHYAKTAYSVSSKV 235
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EI +GPVE + T+YAD YK+G+Y+H +G LG HA+++IGW GT
Sbjct: 236 QQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMIGW-------GTEG 288
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+NT+WG G F+I+RGQ+ECGIE DI AG PK+
Sbjct: 289 STPYWLIANSWNTDWGNMGFFKILRGQDECGIERDIVAGEPKL 331
>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
Length = 340
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 132/255 (51%), Positives = 178/255 (69%), Gaps = 10/255 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V + ++ LP+ FD+R WP CPTI EIRDQGSCGS WA GAVEA+SDR+C+ +
Sbjct: 67 KLPERVDFAADID-LPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
K V +S++DL+SCC +CG GC GG+ AW+YW G+VSGG Y S GCRPY IP
Sbjct: 126 NAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C TP C R C+PGY SY++D ++G +Y +P +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIY 245
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ +Y D ++YK+G+Y+HV+G +G HAIRI+GWG E GT YWL A
Sbjct: 246 KNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVE---NGT----PYWLAA 298
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 299 NSWNTDWGDNGFFKI 313
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 87/178 (48%), Positives = 123/178 (69%), Gaps = 15/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E GL +GCRPY IP CE ++NGSR C TP C R C+PGY SY+
Sbjct: 162 WTERGLVSGGLYDSHVGCRPYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYK 221
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
+D ++G +Y +P +E+ IM EI+++GPVEG+ +Y D ++YK+G+Y+HV+G +G HAI
Sbjct: 222 EDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAI 281
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
RI+GWG E GT YWL ANS+NT+WG+NG F+I+RG++ CGIE++I AG+P+
Sbjct: 282 RILGWGVE---NGT----PYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAGVPR 332
>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
Length = 344
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 134/275 (48%), Positives = 182/275 (66%), Gaps = 12/275 (4%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDP-LEELPEGFDARINWPYCPTIQEIRDQGS 121
++S++ +G PD LP L P L+ELP+ FDAR WP+CP+I EIRDQ S
Sbjct: 59 SVSDIRRMLGALPDPN--GGHLPTLCTGYTPSLDELPKEFDARKYWPHCPSISEIRDQSS 116
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAMSDR+CI S+G LS+++LV+CC CG GC GGF AW YW +
Sbjct: 117 CGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCSSCGMGCNGGFPHSAWSYWKRS 176
Query: 182 GIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+G Y GC+PYE PCE ++ G SC+ + TP+C CQPGY++ Y D
Sbjct: 177 GIVTGDLYNPTDGCQPYEFPPCEHHVVGPRPSCE-GDVETPKCKTTCQPGYNIPYNKDKW 235
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G+ Y + +N+E IM+E+ HGPVE +YAD YK+G+Y+HV+GG LG HA+R++G
Sbjct: 236 YGKTVYRVHSNQEAIMKEVKEHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLG 295
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG+E + V YWL+ANS+N++WG+NG F+I
Sbjct: 296 WGEE-------NGVPYWLIANSWNSDWGDNGYFKI 323
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 81/163 (49%), Positives = 118/163 (72%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYE P CE ++ G R SC+ + TP+C CQPGY++ Y D +G+ Y + +N+
Sbjct: 189 GCQPYEFPPCEHHVVGPRPSCEGDV-ETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQ 247
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E IM+E+ HGPVE +YAD YK+G+Y+HV+GG LG HA+R++GWG+E +
Sbjct: 248 EAIMKEVKEHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWGEE-------N 300
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+N++WG+NG F+I+RG+NECGIE+D+ AG+PK+
Sbjct: 301 GVPYWLIANSWNSDWGDNGYFKIIRGRNECGIESDVNAGIPKL 343
>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
Length = 338
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 131/285 (45%), Positives = 185/285 (64%), Gaps = 11/285 (3%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N + ++ GV PD L ++L + + + LPE FD R WP CP
Sbjct: 41 WKAGRNFPEHTPFAHIKKLAGVLPDYHL--SKLSKVEHEDELIASLPENFDPRDKWPNCP 98
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
T+ E+RDQGSCGS WA GAVEAM+DR C S G +H S++DL+SCC CG GC GG
Sbjct: 99 TLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLLSCCPICGLGCNGGMP 158
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW+YW G+VSGG+Y S QGCRPYEI PCE ++ G+ C + + TP+C + C+
Sbjct: 159 TLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPC-NGDSKTPKCEKTCESN 217
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y+V Y D +G+ +S+ + E+ I E+F++GPVEG+ T+Y+D++ YKTG+YKH G
Sbjct: 218 YNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDA 277
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HA++I+GWG E + KYWL+ANS+N++WG+NG F+I
Sbjct: 278 LGGHAVKILGWGVE-------NGNKYWLIANSWNSDWGDNGFFKI 315
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 76/161 (47%), Positives = 113/161 (70%), Gaps = 9/161 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEIP CE ++ G+R C + TP+C + C+ Y+V Y D +G+ +S+ + E
Sbjct: 181 GCRPYEIPPCEHHVPGNRMPCNGDS-KTPKCEKTCESNYNVDYRKDKRYGKHVFSVSSKE 239
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E+F++GPVEG+ T+Y+D++ YKTG+YKH G LG HA++I+GWG E +
Sbjct: 240 DHIRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVE-------N 292
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
KYWL+ANS+N++WG+NG F+I+RG++ CGIE+ I AG P
Sbjct: 293 GNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 333
>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
Length = 347
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 135/274 (49%), Positives = 178/274 (64%), Gaps = 12/274 (4%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
T+S++ +G PD Q +SD ELP+ FDAR+ WP+CP+I EIRDQ SC
Sbjct: 63 TVSDIRRMLGALPDPNGEQLETLCTGYISD---ELPKSFDARVEWPHCPSISEIRDQSSC 119
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVEAMSDR+CI S+GK LS+++LVSCC CG GC GGF AW YW G
Sbjct: 120 GSCWAFGAVEAMSDRICIKSKGKHKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKNQG 179
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IV+G Y + GC+PYE PCE ++ G SC D + TP C CQPGY++ YE D +
Sbjct: 180 IVTGDLYNTTNGCQPYEFPPCEHHVIGPLPSC-DGDVETPSCKTNCQPGYNIPYEKDKWY 238
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G Y + +N E IM E+ R+GPVE +YAD YK+G+Y+HV+G LG HA+R++GW
Sbjct: 239 GEKVYRIHSNPEAIMLELMRNGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGW 298
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+E + V YWL+ANS+N++WG+ G F+I
Sbjct: 299 GEE-------NNVPYWLIANSWNSDWGDKGYFKI 325
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 81/163 (49%), Positives = 112/163 (68%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYE P CE ++ G SC + TP C CQPGY++ YE D +G Y + +N
Sbjct: 191 GCQPYEFPPCEHHVIGPLPSCDG-DVETPSCKTNCQPGYNIPYEKDKWYGEKVYRIHSNP 249
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E IM E+ R+GPVE +YAD YK+G+Y+HV+G LG HA+R++GWG+E +
Sbjct: 250 EAIMLELMRNGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEE-------N 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+N++WG+ G F+IVRG+NECGIE+D+ AG+PKI
Sbjct: 303 NVPYWLIANSWNSDWGDKGYFKIVRGKNECGIESDVNAGIPKI 345
>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
Length = 373
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 138/269 (51%), Positives = 175/269 (65%), Gaps = 13/269 (4%)
Query: 71 MGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGA 130
+GV P++ + RLP L LPE FDAR +WP CPTI+EIRDQGSCGS WA GA
Sbjct: 94 LGVRPENS--RYRLPERTLDVSALRVLPENFDAREHWPDCPTIREIRDQGSCGSCWAFGA 151
Query: 131 VEAMSDRVCIAS-RGKRHV--RLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGG 187
VEA+SDR CI S GK V L++DD++SCC +CG GC GGF G AW YWV GIV+GG
Sbjct: 152 VEAISDRTCIHSPEGKPRVIAHLAADDVLSCCTECGAGCNGGFPGSAWSYWVHKGIVTGG 211
Query: 188 TYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
Y S +GC PY I C+ ++NG+ C P TP C+R C+ GYDV + DD ++GR AY
Sbjct: 212 NYDSDEGCMPYPIKACDHHVNGTLGPCDKTIPPTPRCVRMCRKGYDVDFMDDKHYGRHAY 271
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 306
S+PA + I EI +GPVE T+Y D + YK+G+Y+ LG HAIR++GWG E
Sbjct: 272 SVPAKAKQIQAEIMMNGPVEADFTVYEDFLHYKSGVYQRHTDSALGGHAIRLLGWGVE-- 329
Query: 307 GEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ V YWL ANS+NT WG+ G F+I
Sbjct: 330 -----NGVPYWLAANSWNTEWGDKGFFKI 353
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 80/162 (49%), Positives = 106/162 (65%), Gaps = 8/162 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY I C+ ++NG+ C P TP C+R C+ GYDV + DD ++GR AYS+PA
Sbjct: 218 GCMPYPIKACDHHVNGTLGPCDKTIPPTPRCVRMCRKGYDVDFMDDKHYGRHAYSVPAKA 277
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EI +GPVE T+Y D + YK+G+Y+ LG HAIR++GWG E +
Sbjct: 278 KQIQAEIMMNGPVEADFTVYEDFLHYKSGVYQRHTDSALGGHAIRLLGWGVE-------N 330
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL ANS+NT WG+ G F+I+RG +ECGIE+DI AGLPK
Sbjct: 331 GVPYWLAANSWNTEWGDKGFFKILRGSDECGIESDIVAGLPK 372
>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
Length = 332
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 140/274 (51%), Positives = 178/274 (64%), Gaps = 20/274 (7%)
Query: 67 LEMRMGVHPDSK----LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
L+ GVH D+ LP+ ++ L V L P+ FDAR +WP C +I EIRDQGSC
Sbjct: 54 LKSLAGVHKDANNAFTLPKRQVSLDVTL-------PKEFDARKHWPNCTSIAEIRDQGSC 106
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVEAMSDR+CI S GK V LS+++LVSCC CG GC GG+ AW YW G
Sbjct: 107 GSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLVSCCDSCGFGCDGGYPASAWDYWQNVG 166
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IVSGG Y SKQGC+PY I PCE ++ G +C E +TP+C +C +SY+ DL +
Sbjct: 167 IVSGGNYGSKQGCQPYSIAPCEHHVPGPRPACS-GEGSTPDCRNQCDKRSGISYDKDLYY 225
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G AYSL + I EI ++GPVE + T+Y D++ YK G+Y+HVAG LG HAI+I+GW
Sbjct: 226 GESAYSLEDEAKQIQAEILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGW 285
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E + YWLVANS+NT+WG NG F+I
Sbjct: 286 GVE-------NDTPYWLVANSWNTDWGNNGFFKI 312
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 80/165 (48%), Positives = 114/165 (69%), Gaps = 9/165 (5%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+ GC+PY I PCE ++ G R +C + E +TP+C +C +SY+ DL +G AYSL
Sbjct: 176 KQGCQPYSIAPCEHHVPGPRPAC-SGEGSTPDCRNQCDKRSGISYDKDLYYGESAYSLED 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ I EI ++GPVE + T+Y D++ YK G+Y+HVAG LG HAI+I+GWG E
Sbjct: 235 EAKQIQAEILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGWGVE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWLVANS+NT+WG NG F+I+RG++ECGIE D++AGLP++
Sbjct: 289 -NDTPYWLVANSWNTDWGNNGFFKILRGKDECGIEIDVSAGLPRL 332
>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
Length = 347
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 178/255 (69%), Gaps = 9/255 (3%)
Query: 82 NRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIA 141
N PL V+ +PL +LP FDAR WP CPT++E+RDQG CGS WA GAVEAMSDR+CIA
Sbjct: 80 NPNPLPVKNIEPLRDLPTNFDARTQWPNCPTVKEVRDQGDCGSCWAFGAVEAMSDRICIA 139
Query: 142 SRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
S GK + +S++DL++CC CG GCQGGF +AW+Y+ G+V+GG Y S QGC+PY IP
Sbjct: 140 SNGKVNAEISAEDLLACCSSCGEGCQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYMIP 199
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
C+ ++ G C E TP+C +KC+ Y+V+Y+DD ++G+ +YS+ + E+ IM EI
Sbjct: 200 ACDHHVVGHLQPCPKEEAKTPKCSKKCEANYNVTYKDDKHYGKNSYSVDSVEK-IMTEIM 258
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
+GPVE + T+Y D + YK+G+Y+H G LG HA++I+GWG++ GT YW+VA
Sbjct: 259 TNGPVEAAFTVYEDFLSYKSGVYQHRTGQELGGHAVKILGWGED---NGTP----YWIVA 311
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG G F I
Sbjct: 312 NSWNPDWGNQGFFNI 326
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 113/163 (69%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY IP C+ ++ G C E TP+C +KC+ Y+V+Y+DD ++G+ +YS+ + E
Sbjct: 192 GCQPYMIPACDHHVVGHLQPCPKEEAKTPKCSKKCEANYNVTYKDDKHYGKNSYSVDSVE 251
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ IM EI +GPVE + T+Y D + YK+G+Y+H G LG HA++I+GWG++ GT
Sbjct: 252 K-IMTEIMTNGPVEAAFTVYEDFLSYKSGVYQHRTGQELGGHAVKILGWGED---NGTP- 306
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YW+VANS+N +WG G F I+RG++ECGIE+ I AGLPK+
Sbjct: 307 ---YWIVANSWNPDWGNQGFFNILRGKDECGIESQIVAGLPKL 346
>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
Length = 339
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 137/274 (50%), Positives = 188/274 (68%), Gaps = 14/274 (5%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPTIQEIRDQGSC 122
L + ++GV D+ + RLP LV D LE ++P FD+R W CPTI+EIRDQG+C
Sbjct: 58 LETVRRKLGVSRDNH--KYRLPELVH--DTLEMDIPAQFDSRQQWQDCPTIREIRDQGAC 113
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVE+MSDR CI S K V L++DD++SCC CG+GC GGF G AW YWV G
Sbjct: 114 GSCWAFGAVESMSDRHCIHSGAKNIVHLAADDVLSCCWGCGSGCNGGFPGAAWSYWVEKG 173
Query: 183 IVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IV+GG Y + +GC PY +P C+ ++NG+ C +P TP+C+R C+ GY++ ++DD ++
Sbjct: 174 IVTGGNYDTDEGCMPYPVPSCDHHVNGTLGPC-GQDPPTPKCVRLCRKGYNIDFKDDKHY 232
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G+ +YS+ +NE I EI ++GPVEG+ T+YAD LYK+G+YK + LG HAIRI+GW
Sbjct: 233 GKSSYSVSSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGW 292
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E + V +WLVANS+NT WG+ G F+I
Sbjct: 293 GVE-------NGVPFWLVANSWNTEWGDKGYFKI 319
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 81/162 (50%), Positives = 115/162 (70%), Gaps = 9/162 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C+ ++NG+ C +P TP+C+R C+ GY++ ++DD ++G+ +YS+ +NE
Sbjct: 185 GCMPYPVPSCDHHVNGTLGPC-GQDPPTPKCVRLCRKGYNIDFKDDKHYGKSSYSVSSNE 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI ++GPVEG+ T+YAD LYK+G+YK + LG HAIRI+GWG E +
Sbjct: 244 TQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVE-------N 296
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V +WLVANS+NT WG+ G F+I+RG NECGIE DI AG+PK
Sbjct: 297 GVPFWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAGIPK 338
>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
Length = 287
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 132/283 (46%), Positives = 183/283 (64%), Gaps = 11/283 (3%)
Query: 54 AEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTI 113
A +N + ++ MG D + +LP + +D + LPE FD R WP CPT+
Sbjct: 3 AGRNFPIHTPFAHIKKLMGSLKDDNIL--KLPKVTHDADLIASLPENFDPRDKWPDCPTL 60
Query: 114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGK 173
EIRDQGSCGS WA GAVEAM+DRVCI S +H S++DLVSCC CG GC GG
Sbjct: 61 NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTL 120
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
AW+YW G+VSGG Y S QGCRPYEI PCE ++ G+ C + + TP+C + C+ Y
Sbjct: 121 AWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPC-NGDTKTPKCEKTCESSYT 179
Query: 233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 292
V ++ D +G+ YS+ +E+ I E+F++GPVEG+ T+Y+D++ YK+G+Y+H G LG
Sbjct: 180 VPFKKDKRYGKHVYSVSGHEDNIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTHGNALG 239
Query: 293 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HAI+I+GWG E + KYWL+ANS+N++WG+NG +I
Sbjct: 240 GHAIKILGWGVE-------NGSKYWLIANSWNSDWGDNGFLKI 275
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 70/154 (45%), Positives = 108/154 (70%), Gaps = 9/154 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEIP CE ++ G+R C + TP+C + C+ Y V ++ D +G+ YS+ +E
Sbjct: 141 GCRPYEIPPCEHHVPGNRMPCNGDT-KTPKCEKTCESSYTVPFKKDKRYGKHVYSVSGHE 199
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E+F++GPVEG+ T+Y+D++ YK+G+Y+H G LG HAI+I+GWG E +
Sbjct: 200 DNIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTHGNALGGHAIKILGWGVE-------N 252
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEA 488
KYWL+ANS+N++WG+NG +I+RG++ CGIE+
Sbjct: 253 GSKYWLIANSWNSDWGDNGFLKILRGEDHCGIES 286
>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
Length = 338
Score = 275 bits (704), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 131/285 (45%), Positives = 185/285 (64%), Gaps = 11/285 (3%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N + ++ GV PD L ++L + + + LPE FD R WP CP
Sbjct: 41 WKAGRNFPEHTPFAHIKRLAGVLPDYHL--SKLSKVEHEDELIASLPENFDPRDKWPNCP 98
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
T+ E+RDQGSCGS WA GAVEAM+DR C S G +H S++DL+SCC CG GC GG
Sbjct: 99 TLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLLSCCPICGLGCNGGMP 158
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW+YW G+VSGG+Y S QGCRPYEI PCE ++ G+ C + + TP+C + C+
Sbjct: 159 TLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPC-NGDSKTPKCEKTCESN 217
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y+V Y D +G+ +S+ + E+ I E+F++GPVEG+ T+Y+D++ YKTG+YKH G
Sbjct: 218 YNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDA 277
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HA++I+GWG E + KYWL+ANS+N++WG+NG F+I
Sbjct: 278 LGGHAVKILGWGVE-------NGNKYWLIANSWNSDWGDNGFFKI 315
Score = 169 bits (427), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 76/161 (47%), Positives = 113/161 (70%), Gaps = 9/161 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEIP CE ++ G+R C + TP+C + C+ Y+V Y D +G+ +S+ + E
Sbjct: 181 GCRPYEIPPCEHHVPGNRMPCNGDS-KTPKCEKTCESNYNVDYRKDKRYGKHVFSVSSKE 239
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E+F++GPVEG+ T+Y+D++ YKTG+YKH G LG HA++I+GWG E +
Sbjct: 240 DHIRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVE-------N 292
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
KYWL+ANS+N++WG+NG F+I+RG++ CGIE+ I AG P
Sbjct: 293 GNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 333
>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
Length = 339
Score = 275 bits (704), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 132/255 (51%), Positives = 181/255 (70%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V ++ +E LPE FDAR W CPTI++IRDQGSCGS WA GAV AMSDR+CI +
Sbjct: 67 KLPERVAFAEDME-LPENFDAREQWSNCPTIKQIRDQGSCGSCWAFGAVGAMSDRLCIHT 125
Query: 143 RGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G +V +S++DL++CC CG+GC GG+ AW +W+ G+VSGG Y S GC PY IP
Sbjct: 126 NGHVNVEVSAEDLLTCCGSQCGDGCNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+ GY SY++D ++G +YS+ NE+ IM EI+
Sbjct: 186 PCEHHVNGSRPQCT-GEGDTPKCTKSCEAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E + V YWLVA
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDIMGGHAIRILGWGVE-------NSVPYWLVA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG+NGLF+I
Sbjct: 298 NSWNVDWGDNGLFKI 312
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 84/167 (50%), Positives = 122/167 (73%), Gaps = 9/167 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GC PY IP CE ++NGSR C E +TP+C + C+ GY SY++D ++G +YS+
Sbjct: 176 HVGCLPYTIPPCEHHVNGSRPQCTG-EGDTPKCTKSCEAGYSPSYKEDKHYGYTSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE+ IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E
Sbjct: 235 NEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDIMGGHAIRILGWGVE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGL 499
+ V YWLVANS+N +WG+NGLF+I+RG++ CGIE++I AG+P+ L
Sbjct: 289 -NSVPYWLVANSWNVDWGDNGLFKILRGEDHCGIESEIVAGIPRTDL 334
>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 328
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 129/254 (50%), Positives = 178/254 (70%), Gaps = 10/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP ++ ++ + LP+ FDAR WP C TIQ+IRDQGSCGS WA GA EA+SDR+CI S
Sbjct: 64 KLPQVLHNTEGIR-LPDSFDARKQWPDCRTIQQIRDQGSCGSCWAFGAAEAISDRLCIHS 122
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
K + +S++DL+SCC +CG GC GG+ AW++W G+V+GG S+ GCRPY I P
Sbjct: 123 GSKISLEISAEDLLSCCDECGMGCSGGYPSSAWEFWTKKGLVTGGLCGSEVGCRPYSIAP 182
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NG+ CQ + TP+C +KC GY SY D +FG+ +YSLP+ +E IM E+++
Sbjct: 183 CEHHVNGTRPPCQGTQ-ETPKCEKKCIDGYLTSYLKDKHFGKRSYSLPSQQEQIMTELYK 241
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVE + T+YAD +LYKTG+Y+HV G LG HAI+I+GWG+E S YWL AN
Sbjct: 242 NGPVEAAFTVYADFLLYKTGVYQHVTGEVLGGHAIKILGWGEE-------SGTPYWLAAN 294
Query: 322 SFNTNWGENGLFRI 335
S+N +WG+ G F+I
Sbjct: 295 SWNGDWGDKGFFKI 308
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 87/177 (49%), Positives = 120/177 (67%), Gaps = 16/177 (9%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W + GL +GCRPY I PCE ++NG+R CQ + TP+C +KC GY SY
Sbjct: 158 WTKKGLVTGGLCGSEVGCRPYSIAPCEHHVNGTRPPCQGTQ-ETPKCEKKCIDGYLTSYL 216
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D +FG+ +YSLP+ +E IM E++++GPVE + T+YAD +LYKTG+Y+HV G LG HAI
Sbjct: 217 KDKHFGKRSYSLPSQQEQIMTELYKNGPVEAAFTVYADFLLYKTGVYQHVTGEVLGGHAI 276
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+I+GWG+E S YWL ANS+N +WG+ G F+I RG +ECGIE+++ AG P
Sbjct: 277 KILGWGEE-------SGTPYWLAANSWNGDWGDKGFFKIKRGNDECGIESEMVAGTP 326
>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
Length = 339
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 133/253 (52%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P V ++ L+ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 69 PQRVMFTEDLK-LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNA 127
Query: 145 KRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C 202
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP C
Sbjct: 128 HVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPC 187
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
E ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++
Sbjct: 188 EHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKN 246
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS
Sbjct: 247 GPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANS 299
Query: 323 FNTNWGENGLFRI 335
+NT+WG+NG F+I
Sbjct: 300 WNTDWGDNGFFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
Length = 339
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 133/253 (52%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P V ++ L+ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 69 PQRVMFTEDLK-LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNA 127
Query: 145 KRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C 202
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP C
Sbjct: 128 HVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPC 187
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
E ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++
Sbjct: 188 EHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKN 246
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS
Sbjct: 247 GPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANS 299
Query: 323 FNTNWGENGLFRI 335
+NT+WG+NG F+I
Sbjct: 300 WNTDWGDNGFFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
EGFP fusion protein [synthetic construct]
Length = 578
Score = 274 bits (701), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 131/255 (51%), Positives = 179/255 (70%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V S+ + LPE FDAR W CPTI +IRDQGSCGS WA GAVEAMSDR+CI +
Sbjct: 67 KLPERVGFSEDIN-LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G+ +V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP
Sbjct: 126 NGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+ GY SY++D ++G +YS+ +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E + V YWLVA
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE-------NGVPYWLVA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG+NG F+I
Sbjct: 298 NSWNVDWGDNGFFKI 312
Score = 181 bits (460), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 83/164 (50%), Positives = 120/164 (73%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGC PY IP CE ++NGSR C E +TP+C + C+ GY SY++D ++G +YS+
Sbjct: 176 HIGCLPYTIPPCEHHVNGSRPPCTG-EGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSD 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E
Sbjct: 235 SEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWLVANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 289 -NGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sj31; Flags: Precursor
gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
Length = 342
Score = 274 bits (701), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 132/275 (48%), Positives = 177/275 (64%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ +NR P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 56 SLDDARILMGARKEDAEMKRNRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G + LS+ DL+SCCKDCG+GCQGGF G AW YWV
Sbjct: 115 CGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCQGGFPGVAWDYWVKR 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+GG+ + GC+PY P CE + G + +C TP+C + CQ GY YE D +
Sbjct: 175 GIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKH 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G +Y++ NE+ I R+I +GPVE + +Y D + YK+GIY+HV G +G HAIRIIG
Sbjct: 235 YGDESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E YWL+ANS+N +WGE GLFR+
Sbjct: 295 WGVEKR-------TPYWLIANSWNEDWGEKGLFRM 322
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 81/182 (44%), Positives = 109/182 (59%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE + G +C TP+C + CQ GY
Sbjct: 170 YWVKRGIVTGGSKEN---HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYK 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
YE D ++G +Y++ NE+ I R+I +GPVE + +Y D + YK+GIY+HV G +G
Sbjct: 227 TPYEQDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVG 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HAIRIIGWG E YWL+ANS+N +WGE GLFR+VRG++EC IE+D+ AGL
Sbjct: 287 GHAIRIIGWGVEKR-------TPYWLIANSWNEDWGEKGLFRMVRGRDECSIESDVVAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
Length = 335
Score = 274 bits (701), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 136/285 (47%), Positives = 185/285 (64%), Gaps = 12/285 (4%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N +S L+ MGVHPDSK RLPL P ++LPE FDAR W +C
Sbjct: 42 WKAGRNFDKNTPVSYLKGLMGVHPDSK--NYRLPLFYHEDIP-KDLPESFDAREKWSHCN 98
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
+I IRDQ +CGS WA GA EAMSDRVCI S+GK V +S++DL++CC CG GC GG+
Sbjct: 99 SIHVIRDQSTCGSCWAFGATEAMSDRVCIHSKGKVQVNISAEDLLTCCDSCGAGCNGGYP 158
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW+++ T GIV+GG Y + GC+PY PCE + G +C +P TP+C+R C+ G
Sbjct: 159 AAAWEFYKTDGIVTGGLYGTDDGCQPYYFPPCEHHTVGPLPNCTGIKP-TPQCVRDCRKG 217
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y+ SY +D ++ + Y+L A+E I EIF++GPVE T+YAD + YK+G+Y+ +
Sbjct: 218 YEKSYSEDKHYAKKVYTLSADETQIKTEIFKNGPVEADFTVYADFVSYKSGVYQRHSDDA 277
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAIRI+GW GT + V YWLVANS+N +WG+ G F+I
Sbjct: 278 LGGHAIRILGW-------GTENGVPYWLVANSWNEDWGDKGYFKI 315
Score = 165 bits (417), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 78/162 (48%), Positives = 109/162 (67%), Gaps = 9/162 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE + G +C +P TP+C+R C+ GY+ SY +D ++ + Y+L A+E
Sbjct: 181 GCQPYYFPPCEHHTVGPLPNCTGIKP-TPQCVRDCRKGYEKSYSEDKHYAKKVYTLSADE 239
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EIF++GPVE T+YAD + YK+G+Y+ + LG HAIRI+GW GT +
Sbjct: 240 TQIKTEIFKNGPVEADFTVYADFVSYKSGVYQRHSDDALGGHAIRILGW-------GTEN 292
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWLVANS+N +WG+ G F+I+RG +ECGIE DI AG+PK
Sbjct: 293 GVPYWLVANSWNEDWGDKGYFKILRGNDECGIEDDINAGIPK 334
>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
Length = 339
Score = 274 bits (701), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 133/255 (52%), Positives = 181/255 (70%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+ P V ++ L+ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KPPQRVMFTEDLK-LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSERDIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVA
Sbjct: 245 KNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 298 NSWNTDWGDNGFFKI 312
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 122/164 (74%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
Length = 339
Score = 274 bits (701), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 132/253 (52%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P V ++ L+ LPE FDAR WP CPT++EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 69 PQRVMFTEDLK-LPESFDAREQWPQCPTVKEIRDQGSCGSCWAFGAVEAISDRICIHTNA 127
Query: 145 KRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C 202
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP C
Sbjct: 128 HVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPC 187
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
E ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++
Sbjct: 188 EHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKN 246
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS
Sbjct: 247 GPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANS 299
Query: 323 FNTNWGENGLFRI 335
+NT+WG+NG F+I
Sbjct: 300 WNTDWGDNGFFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
Length = 331
Score = 274 bits (700), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 138/285 (48%), Positives = 186/285 (65%), Gaps = 11/285 (3%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N L + L+ MGVH DSK + P+ ++P+ FD+R W CP
Sbjct: 36 WKAGRNFNKNLPMRYLKSLMGVHADSKFHMS--PVHKHKIPEGFKIPKEFDSRTAWSMCP 93
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
TI EIRDQGSCGS WA GAVE M+DR CI S G ++ S+++LVSCC CG GC GGF
Sbjct: 94 TISEIRDQGSCGSCWAFGAVEVMTDRDCIHSNGTKNFHYSAENLVSCCHLCGFGCNGGFP 153
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
G A++YWV +GIVSGG + S QGC+PYEI PCE +++G C + +TP+C + C+
Sbjct: 154 GAAFQYWVHSGIVSGGAFNSTQGCQPYEIAPCEHHVSGPRPKCAEGG-STPKCHKNCESN 212
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y V YE DL+ G YS+ +E I +I +GPVEG+ T+Y D + YK+G+Y+H G P
Sbjct: 213 YVVDYESDLHHGSKHYSVDKDETQIKYDIMTNGPVEGAFTVYVDFLHYKSGVYQHTHGLP 272
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAIR++GWG+E +GT YWL ANS+NT+WG+NG F+I
Sbjct: 273 LGGHAIRVLGWGEE---DGT----PYWLCANSWNTDWGDNGYFKI 310
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 82/163 (50%), Positives = 115/163 (70%), Gaps = 9/163 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYEI PCE +++G R C A +TP+C + C+ Y V YE DL+ G YS+ +E
Sbjct: 176 GCQPYEIAPCEHHVSGPRPKC-AEGGSTPKCHKNCESNYVVDYESDLHHGSKHYSVDKDE 234
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +I +GPVEG+ T+Y D + YK+G+Y+H G PLG HAIR++GWG+E +GT
Sbjct: 235 TQIKYDIMTNGPVEGAFTVYVDFLHYKSGVYQHTHGLPLGGHAIRVLGWGEE---DGT-- 289
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL ANS+NT+WG+NG F+I+RG + CGIE++I+AGLPK+
Sbjct: 290 --PYWLCANSWNTDWGDNGYFKILRGSDHCGIESEISAGLPKV 330
>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
Length = 331
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 139/279 (49%), Positives = 187/279 (67%), Gaps = 23/279 (8%)
Query: 64 LSELEMR--MGVHP----DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIR 117
LSE+++R MGV D KLP+ + PL+++P+ FDAR+ WP CPTI+EIR
Sbjct: 46 LSEVDIRRQMGVLQGGPLDIKLPEKDIT-------PLKDVPDMFDARMQWPDCPTIKEIR 98
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQG+CGS WA GAVE+MSDR CI H+ S++DL++CC+ CG GC GG+ G AW+Y
Sbjct: 99 DQGACGSCWAFGAVESMSDRFCIHFNQSAHI--SAEDLMACCETCGMGCNGGYLGAAWRY 156
Query: 178 WVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
+ TG+V+GG Y SK+GC+PY I C+ ++ G C E +TP C + C+ GYDVS+E
Sbjct: 157 FEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQPCASKEEHTPRCSKTCEAGYDVSFE 216
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
D +FG AYS+ ++ E I EI +GPVEG+ T+YAD YK+G+Y+H +G LG HAI
Sbjct: 217 KDKHFGASAYSVRSSVEAIQTEIMTNGPVEGAFTVYADFPTYKSGVYQHTSGAMLGGHAI 276
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
RI+GWG E GT YWLVANS+N +WG G F+I
Sbjct: 277 RILGWGTE---NGTP----YWLVANSWNEDWGAMGYFKI 308
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 82/162 (50%), Positives = 113/162 (69%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I C+ ++ G + C + E +TP C + C+ GYDVS+E D +FG AYS+ ++
Sbjct: 173 GCQPYLIASCDHHVVGKKQPCASKEEHTPRCSKTCEAGYDVSFEKDKHFGASAYSVRSSV 232
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E I EI +GPVEG+ T+YAD YK+G+Y+H +G LG HAIRI+GWG E GT
Sbjct: 233 EAIQTEIMTNGPVEGAFTVYADFPTYKSGVYQHTSGAMLGGHAIRILGWGTE---NGTP- 288
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+N +WG G F+I+RG+++CGIE+ ITAG+PK
Sbjct: 289 ---YWLVANSWNEDWGAMGYFKIIRGKDDCGIESQITAGMPK 327
>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
Length = 341
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 133/285 (46%), Positives = 183/285 (64%), Gaps = 11/285 (3%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N L+ ++ GV D+ L ++LP D + LPE FD R WP CP
Sbjct: 44 WKAGRNFPVNTPLTHIKKLTGVLVDTHL--SKLPKAEHDMDLIASLPENFDPRDKWPNCP 101
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
T+ E+RDQGSCGS WA GAVEAM+DR C S G +H S++DL+SCC CG GC GG
Sbjct: 102 TLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCPVCGLGCNGGMP 161
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW+YW G+VSGG+Y S QGCRPYEI PCE ++ G+ C + + TP+C + C+
Sbjct: 162 TLAWEYWKHFGLVSGGSYNSGQGCRPYEIPPCEHHVPGNRVPC-NGDSKTPKCHKTCEAS 220
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y V Y D +G+ YS+ + E+ I E+F++GPVEG+ T+Y+D++ YK G+YKH G
Sbjct: 221 YSVDYHKDKRYGKHVYSVSSKEDHIKAELFKNGPVEGAFTVYSDLLNYKNGVYKHTVGNA 280
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAI+I+GWG E + KY L+ANS+N++WG+NG F+I
Sbjct: 281 LGGHAIKILGWGVE-------NGNKYRLIANSWNSDWGDNGFFKI 318
Score = 165 bits (417), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 76/161 (47%), Positives = 110/161 (68%), Gaps = 9/161 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEIP CE ++ G+R C + TP+C + C+ Y V Y D +G+ YS+ + E
Sbjct: 184 GCRPYEIPPCEHHVPGNRVPCNGDS-KTPKCHKTCEASYSVDYHKDKRYGKHVYSVSSKE 242
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E+F++GPVEG+ T+Y+D++ YK G+YKH G LG HAI+I+GWG E +
Sbjct: 243 DHIKAELFKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVE-------N 295
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
KY L+ANS+N++WG+NG F+I+RG++ CGIE+ I AG P
Sbjct: 296 GNKYRLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 336
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
Length = 337
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 136/285 (47%), Positives = 185/285 (64%), Gaps = 12/285 (4%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N +++S + MGV+P SK + RLP V P ++LPE FDAR W +C
Sbjct: 43 WKAGRNFDKSISMSYIRGLMGVNPKSK--EYRLPEFVHEEIP-DDLPESFDAREKWSHCA 99
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
+I IRDQ +CGS WA GA EAMSDRVCI S G V +S++DL+ CC CG GC GG+
Sbjct: 100 SINLIRDQSTCGSCWAFGAAEAMSDRVCIHSEGGIQVNISAEDLLDCCDSCGAGCDGGYP 159
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW+YW +G+VS G Y + GC+PY + PCE + GS +C P TP+C+ C+ G
Sbjct: 160 AAAWEYWKESGLVSDGLYGTPDGCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKG 218
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y Y+ D +FG+ YS+ +NE+ I EIF++GPVE T+YAD + YK+G+Y+H +G
Sbjct: 219 YGKDYQHDKHFGKKVYSISSNEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHHSGDV 278
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAIRI+GWG E GT YWLVANS+N +WG++G F+I
Sbjct: 279 LGGHAIRILGWGTE---NGTP----YWLVANSWNEDWGDHGYFKI 316
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 86/178 (48%), Positives = 118/178 (66%), Gaps = 16/178 (8%)
Query: 327 WGENGLFRIG-------CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E+GL G C+PY + PCE + GS +C P TP+C+ C+ GY Y+
Sbjct: 166 WKESGLVSDGLYGTPDGCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKGYGKDYQ 224
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D +FG+ YS+ +NE+ I EIF++GPVE T+YAD + YK+G+Y+H +G LG HAI
Sbjct: 225 HDKHFGKKVYSISSNEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHHSGDVLGGHAI 284
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
RI+GWG E GT YWLVANS+N +WG++G F+I+RG++ECGIE DI AG+PK
Sbjct: 285 RILGWGTE---NGTP----YWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGIPK 335
>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
Length = 340
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 134/255 (52%), Positives = 183/255 (71%), Gaps = 10/255 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V ++ + LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KLPQRVWFAEDVV-LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G V +S++D+++CC D CG+GC GGF +AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NGHVSVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C +TP+C + C+PGY SY++D ++G +YS+ ++E+ IM EIF
Sbjct: 186 PCEHHVNGSRPPCTGEGGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSSSEKEIMAEIF 245
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVE + T+Y+D + YK+G+Y+HVAG +G HA+RI+GWG E GT YWLV
Sbjct: 246 KNGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHAVRILGWGVE---NGT----PYWLVG 298
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 299 NSWNTDWGDNGFFKI 313
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 84/163 (51%), Positives = 121/163 (74%), Gaps = 8/163 (4%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C +TP+C + C+PGY SY++D ++G +YS+ +
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSS 235
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EIF++GPVE + T+Y+D + YK+G+Y+HVAG +G HA+RI+GWG E GT
Sbjct: 236 SEKEIMAEIFKNGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHAVRILGWGVE---NGT 292
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P
Sbjct: 293 ----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 331
>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
Length = 339
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 131/275 (47%), Positives = 181/275 (65%), Gaps = 12/275 (4%)
Query: 63 TLSELEMRMGVHPDSK-LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
T+S++ +G PD + LL + + +ELPE FDAR WPYC +I EIRDQ +
Sbjct: 55 TISDVRRVLGAVPDPNGFGLEKRCLLSTIRE--QELPESFDAREKWPYCSSIAEIRDQSN 112
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GA A+SDR+CIAS GK R+S +DLV CC DCG GCQGG+ +AW+YWV
Sbjct: 113 CGSCWAFGAAGAISDRICIASGGKHQPRISPEDLVDCCADCGMGCQGGYPAQAWEYWVRN 172
Query: 182 GIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
G+V+G Y + CRPY PCE ++ G C +P TP+C++KCQP Y +YE+D
Sbjct: 173 GLVTGDLYNTTDTCRPYSFPPCEHHVVGPRKPCT-GDPTTPQCVKKCQPEYPKTYENDKW 231
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G AYS+ +++E IMR++ +GP+E +YAD Y +G+Y+HVAGG LG HA+R++G
Sbjct: 232 YGLKAYSIHSDQEAIMRDLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGGHAVRLVG 291
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E + YWL+ANS+NT+WG+ G F+I
Sbjct: 292 WGVEDGAD-------YWLIANSWNTDWGDGGYFKI 319
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 79/162 (48%), Positives = 111/162 (68%), Gaps = 9/162 (5%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
CRPY P CE ++ G R C +P TP+C++KCQP Y +YE+D +G AYS+ +++E
Sbjct: 186 CRPYSFPPCEHHVVGPRKPC-TGDPTTPQCVKKCQPEYPKTYENDKWYGLKAYSIHSDQE 244
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
IMR++ +GP+E +YAD Y +G+Y+HVAGG LG HA+R++GWG E +
Sbjct: 245 AIMRDLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGGHAVRLVGWGVEDGAD----- 299
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+NT+WG+ G F+I RG NECGIE+D AG PK+
Sbjct: 300 --YWLIANSWNTDWGDGGYFKIRRGVNECGIESDANAGHPKL 339
>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
3.2 Angstrom Resolution
gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
Resolution
gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
Angstrom Resolution
Length = 317
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 132/255 (51%), Positives = 181/255 (70%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+ P V ++ L+ LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 51 KPPQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHT 109
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP
Sbjct: 110 NAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIP 169
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+
Sbjct: 170 PCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIY 228
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVA
Sbjct: 229 KNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVA 281
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 282 NSWNTDWGDNGFFKI 296
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 160 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 218
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 219 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 275
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 276 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 315
>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
Length = 351
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 136/277 (49%), Positives = 184/277 (66%), Gaps = 15/277 (5%)
Query: 61 KLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+ LS L+ G P R+ L+ LPE FDAR WP CPTI+EIRDQG
Sbjct: 61 NVDLSYLKRLCGTFLGGPKPPQRVKFAEDLN-----LPESFDAREQWPQCPTIKEIRDQG 115
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWV 179
SCGS WA GAVEA+SDR+CI + V +S++DL++CC CG+GC GG+ +AW +W
Sbjct: 116 SCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWT 175
Query: 180 TTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
G+VSGG Y S GCRPY IP CE ++NGS C E +TP+C + C+PGY +Y+ D
Sbjct: 176 RKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT-GEGDTPKCSKSCEPGYTPTYKQD 234
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
++G +YS+ +E IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI
Sbjct: 235 KHYGYNSYSVSNSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRI 294
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GWG E GT YWLV NS+NT+WG+NG F+I
Sbjct: 295 LGWGVE---NGT----PYWLVGNSWNTDWGDNGFFKI 324
Score = 181 bits (459), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 83/164 (50%), Positives = 121/164 (73%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 188 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKSCEPGYTPTYKQDKHYGYNSYSVSN 246
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 247 SERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 303
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLV NS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 304 ----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 343
>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
Length = 341
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 133/266 (50%), Positives = 177/266 (66%), Gaps = 11/266 (4%)
Query: 71 MGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGA 130
+GVHP++ + RLP +++ ++P+ FD+R W CPTI+EIRDQGSCGS WA GA
Sbjct: 66 LGVHPNNH--KYRLPE-IEIDTSNVQIPDHFDSRHRWHDCPTIREIRDQGSCGSCWAFGA 122
Query: 131 VEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYA 190
VEAMSDR CI S K V L++DD++SCC CG+GC GGF G AW YWV GIV+GG Y
Sbjct: 123 VEAMSDRHCIHSGAKNIVHLAADDVLSCCMSCGSGCNGGFPGAAWSYWVHKGIVTGGNYD 182
Query: 191 SKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 249
S +GC PY I C+ ++NG+ C + P TP C+R C+ GY+V + DD ++G+ +YS+P
Sbjct: 183 SDEGCMPYPIKACDHHVNGTLGPCDKSIPPTPRCVRMCRKGYNVDFADDKHYGKKSYSVP 242
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
+N I EI +GPVE T+YAD LYK+G+Y+ LG HAIR++GWG E
Sbjct: 243 SNVTQIQVEIMTNGPVEADFTVYADFPLYKSGVYQRHTDQALGGHAIRLLGWGVE----- 297
Query: 310 TSSVVKYWLVANSFNTNWGENGLFRI 335
V YWL ANS+NT WG+ G F+I
Sbjct: 298 --KGVPYWLAANSWNTEWGDKGFFKI 321
Score = 165 bits (417), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 76/162 (46%), Positives = 106/162 (65%), Gaps = 8/162 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY I C+ ++NG+ C + P TP C+R C+ GY+V + DD ++G+ +YS+P+N
Sbjct: 186 GCMPYPIKACDHHVNGTLGPCDKSIPPTPRCVRMCRKGYNVDFADDKHYGKKSYSVPSNV 245
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVE T+YAD LYK+G+Y+ LG HAIR++GWG E
Sbjct: 246 TQIQVEIMTNGPVEADFTVYADFPLYKSGVYQRHTDQALGGHAIRLLGWGVE-------K 298
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL ANS+NT WG+ G F+I+RG +ECGIE D+ AG+P+
Sbjct: 299 GVPYWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVAGIPR 340
>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 132/255 (51%), Positives = 181/255 (70%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+ P V ++ L+ LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KPPQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVA
Sbjct: 245 KNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 298 NSWNTDWGDNGFFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
Length = 339
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 132/253 (52%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P V ++ L+ LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 69 PQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNA 127
Query: 145 KRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C 202
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP C
Sbjct: 128 HVSVEVSAEDLLTCCGSRCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPC 187
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
E ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++
Sbjct: 188 EHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKN 246
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS
Sbjct: 247 GPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANS 299
Query: 323 FNTNWGENGLFRI 335
+NT+WG+NG F+I
Sbjct: 300 WNTDWGDNGFFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
AltName: Full=Cathepsin B1; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 132/253 (52%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P V ++ L+ LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 69 PQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNA 127
Query: 145 KRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C 202
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP C
Sbjct: 128 HVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPC 187
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
E ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++
Sbjct: 188 EHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKN 246
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS
Sbjct: 247 GPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANS 299
Query: 323 FNTNWGENGLFRI 335
+NT+WG+NG F+I
Sbjct: 300 WNTDWGDNGFFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
Length = 339
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 132/255 (51%), Positives = 181/255 (70%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+ P V ++ L+ LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KPPQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVA
Sbjct: 245 KNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 298 NSWNTDWGDNGFFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
Length = 340
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 132/253 (52%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P V ++ L+ LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 69 PQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNA 127
Query: 145 KRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C 202
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP C
Sbjct: 128 HVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPC 187
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
E ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++
Sbjct: 188 EHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKN 246
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS
Sbjct: 247 GPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANS 299
Query: 323 FNTNWGENGLFRI 335
+NT+WG+NG F+I
Sbjct: 300 WNTDWGDNGFFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
Length = 339
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 133/255 (52%), Positives = 181/255 (70%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+ P V ++ L+ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KPPQRVMFTEDLK-LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVA
Sbjct: 245 KNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 298 NSWNTDWGDNGFFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
Length = 339
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 133/253 (52%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P V ++ L+ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 69 PQRVMFTEDLK-LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNA 127
Query: 145 KRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C 202
V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GCRPY IP C
Sbjct: 128 HVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPC 187
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
E ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++
Sbjct: 188 EHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKN 246
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS
Sbjct: 247 GPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANS 299
Query: 323 FNTNWGENGLFRI 335
+NT+WG+NG F+I
Sbjct: 300 WNTDWGDNGFFKI 312
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
Length = 339
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 183/255 (71%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V ++ + LPE FDAR +WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KLPQRVSFAEDMV-LPENFDAREHWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G +V +S++D+++CC D CG+GC GGF +AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NGHVNVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+PGY SY++D ++G +YS+ +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCSKICEPGYTPSYKEDKHYGCNSYSVSNSEKEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVE + ++++D + YK+G+Y+HV G +G HA+RI+GWG E + YWLV
Sbjct: 245 KNGPVEAAFSVFSDFLQYKSGVYQHVTGEMMGGHAVRILGWGVE-------NDTPYWLVG 297
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG++G F+I
Sbjct: 298 NSWNTDWGDHGFFKI 312
Score = 171 bits (433), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 79/177 (44%), Positives = 123/177 (69%), Gaps = 16/177 (9%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W + GL +GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY+
Sbjct: 162 WTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYTPSYK 220
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
+D ++G +YS+ +E+ IM EI+++GPVE + ++++D + YK+G+Y+HV G +G HA+
Sbjct: 221 EDKHYGCNSYSVSNSEKEIMAEIYKNGPVEAAFSVFSDFLQYKSGVYQHVTGEMMGGHAV 280
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
RI+GWG E + YWLV NS+NT+WG++G F+I+RG++ CGIE+++ AG+P
Sbjct: 281 RILGWGVE-------NDTPYWLVGNSWNTDWGDHGFFKILRGRDHCGIESEVVAGIP 330
>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
Length = 366
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 133/273 (48%), Positives = 182/273 (66%), Gaps = 12/273 (4%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
L +++ G + D P +LP V+ +P ++LP+ FDAR W CPTI+EIRDQGSCG
Sbjct: 84 LDHVKIMCGTYLDVP-PHLQLP--VRDIEPRKDLPDTFDARTQWSNCPTIKEIRDQGSCG 140
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA GAVE+MSDR+CI S G+++ +S++DL SCC+ CGNGC GGF AW+Y+ G+
Sbjct: 141 SCWAFGAVESMSDRICIKSNGQQNAHISAEDLTSCCRSCGNGCNGGFLSGAWEYYKRDGL 200
Query: 184 VSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
V+GG Y S QGC+PY + C+ ++ G C E +TP C +C+ GY+VSY D ++G
Sbjct: 201 VTGGQYNSHQGCQPYTVKACDHHVVGKLQPCSKKEEHTPVCKHECESGYNVSYTKDKHYG 260
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 302
AYS+ ++ IM EI +GPVEG+ T+YAD YK+G+YKH G PLG HAI+I+GWG
Sbjct: 261 ATAYSVRGVQQ-IMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGSPLGGHAIKIMGWG 319
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E + YWLVANS+N +WG G F+I
Sbjct: 320 TEGGDD-------YWLVANSWNPDWGNQGTFKI 345
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 78/163 (47%), Positives = 108/163 (66%), Gaps = 9/163 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY + C+ ++ G C E +TP C +C+ GY+VSY D ++G AYS+ +
Sbjct: 211 GCQPYTVKACDHHVVGKLQPCSKKEEHTPVCKHECESGYNVSYTKDKHYGATAYSVRGVQ 270
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ IM EI +GPVEG+ T+YAD YK+G+YKH G PLG HAI+I+GWG E +
Sbjct: 271 Q-IMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGSPLGGHAIKIMGWGTEGGDD---- 325
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLVANS+N +WG G F+I+RG++ECGIE+ I AG PK+
Sbjct: 326 ---YWLVANSWNPDWGNQGTFKILRGRDECGIESQIAAGEPKL 365
>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
Length = 339
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 133/255 (52%), Positives = 181/255 (70%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+ P V ++ L+ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KPPQRVMFTEDLK-LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVA
Sbjct: 245 KNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 298 NSWNTDWGDNGFFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 304
Score = 272 bits (695), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 132/275 (48%), Positives = 176/275 (64%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ + R P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 18 SLDDARILMGARKEDAEMKRKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 76
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G + LS+ DL+SCCKDCG+GC+GGF G+AW YWV
Sbjct: 77 CGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCKGGFPGQAWDYWVKR 136
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+GG+ + GC+PY P CE G + +C TP+C + CQ GY YE D +
Sbjct: 137 GIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKH 196
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G Y++ +NE+ I REI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIG
Sbjct: 197 YGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIG 256
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E YWL+ANS+N +WGE GLFRI
Sbjct: 257 WGVEKR-------TPYWLIANSWNEDWGEKGLFRI 284
Score = 161 bits (407), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 81/180 (45%), Positives = 106/180 (58%), Gaps = 11/180 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE G +C TP+C + CQ GY
Sbjct: 132 YWVKRGIVTGGSEEN---HTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYK 188
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
YE D ++G Y++ +NE+ I REI +GPVE + +Y D + YK+GIY+HV G +G
Sbjct: 189 TPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVG 248
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HAIRIIGWG E YWL+ANS+N +WGE GLFRIVRG++EC IE+ + AGL
Sbjct: 249 GHAIRIIGWGVEKR-------TPYWLIANSWNEDWGEKGLFRIVRGRDECSIESHVVAGL 301
>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 316
Score = 271 bits (694), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 131/269 (48%), Positives = 171/269 (63%), Gaps = 8/269 (2%)
Query: 68 EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWA 127
+R V + +N P ++P+ FDAR+ WP+CP+I IRDQ CGS WA
Sbjct: 37 HLRRKVMKSKFINRNNKPREDDTEIDGSKIPDSFDARVTWPHCPSISYIRDQSQCGSCWA 96
Query: 128 LGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGG 187
+ E MSDRVCIAS G + V LS+DD++SCC D G GC GG+ AW+Y+V TG+V+GG
Sbjct: 97 FSSAEVMSDRVCIASHGHKKVELSADDILSCCTDGGYGCDGGWPVSAWQYFVETGVVTGG 156
Query: 188 TYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
Y +K CRPYEI PC + N + S E +TP+C CQ GY +SY+DD +G+ AY
Sbjct: 157 LYGTKDACRPYEIPPCGIHKNETFYSNCTQEIDTPDCKTTCQAGYPISYDDDKTYGKTAY 216
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 306
S+ + I +EI +GPV + T+Y D YKTGIYKHV+G G HA+RI+GWGQ+
Sbjct: 217 SVSNSVHAIQKEIMTYGPVVAAFTVYDDFFHYKTGIYKHVSGAEAGGHAVRILGWGQQ-- 274
Query: 307 GEGTSSVVKYWLVANSFNTNWGENGLFRI 335
V YWLVANS+NT+WGENG FRI
Sbjct: 275 -----GGVPYWLVANSWNTDWGENGYFRI 298
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 83/166 (50%), Positives = 108/166 (65%), Gaps = 10/166 (6%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ + CRPYEIP C + N + S E +TP+C CQ GY +SY+DD +G+ A
Sbjct: 156 GLYGTKDACRPYEIPPCGIHKNETFYSNCTQEIDTPDCKTTCQAGYPISYDDDKTYGKTA 215
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ + I +EI +GPV + T+Y D YKTGIYKHV+G G HA+RI+GWGQ+
Sbjct: 216 YSVSNSVHAIQKEIMTYGPVVAAFTVYDDFFHYKTGIYKHVSGAEAGGHAVRILGWGQQ- 274
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
V YWLVANS+NT+WGENG FRI+RG +ECGIE + AG
Sbjct: 275 ------GGVPYWLVANSWNTDWGENGYFRILRGSDECGIEDGVVAG 314
>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 351
Score = 271 bits (694), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 136/276 (49%), Positives = 177/276 (64%), Gaps = 15/276 (5%)
Query: 65 SELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGS 124
S++ ++G PD RLP+L LS+ + LP FD R WP C T+ EIRDQGSCGS
Sbjct: 66 SQVRQQLGALPDPM--GRRLPVLYSLSENYKSLPASFDPRKKWPNCKTLFEIRDQGSCGS 123
Query: 125 GWALGAVEAMSDRVCI----ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
WA GA EAMSDR+CI S VRLS+DDL+SCC+DCG GC GGF +AW +W
Sbjct: 124 CWAFGAAEAMSDRLCIQQQTVSGRAVMVRLSADDLLSCCRDCGMGCNGGFPSQAWNFWKH 183
Query: 181 TGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
G+VSGG Y +K CR YEIP CE ++NG+ C+ + P TP+C CQ Y V Y+ D
Sbjct: 184 EGLVSGGLYGTKGVCRAYEIPPCEHHVNGTRPPCEGDAP-TPKCKNVCQEEYKVPYKKDK 242
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
++ YS+ +NE+ I E+ HGPVE +YAD YK+G+Y+HV+G LG HAI+++
Sbjct: 243 HYAVKVYSVHSNEDAIKHELITHGPVEADFEVYADFPTYKSGVYQHVSGALLGGHAIKLM 302
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GWG+E V YWL ANS+NT+WGE G F+I
Sbjct: 303 GWGEE-------DGVPYWLCANSWNTDWGEGGFFKI 331
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 83/178 (46%), Positives = 113/178 (63%), Gaps = 16/178 (8%)
Query: 327 WGENGLFRIG-------CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W GL G CR YEIP CE ++NG+R C+ + P TP+C CQ Y V Y+
Sbjct: 181 WKHEGLVSGGLYGTKGVCRAYEIPPCEHHVNGTRPPCEGDAP-TPKCKNVCQEEYKVPYK 239
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D ++ YS+ +NE+ I E+ HGPVE +YAD YK+G+Y+HV+G LG HAI
Sbjct: 240 KDKHYAVKVYSVHSNEDAIKHELITHGPVEADFEVYADFPTYKSGVYQHVSGALLGGHAI 299
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+++GWG+E V YWL ANS+NT+WGE G F+I+RG+N CGIE+DI AG+P+
Sbjct: 300 KLMGWGEE-------DGVPYWLCANSWNTDWGEGGFFKILRGKNHCGIESDIVAGIPQ 350
>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 271 bits (693), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 131/253 (51%), Positives = 179/253 (70%), Gaps = 11/253 (4%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P V ++ L+ LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 69 PQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNA 127
Query: 145 KRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C 202
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP C
Sbjct: 128 HVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPC 187
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
E ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++
Sbjct: 188 EHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKN 246
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GP EG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS
Sbjct: 247 GPAEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANS 299
Query: 323 FNTNWGENGLFRI 335
+NT+WG+NG F+I
Sbjct: 300 WNTDWGDNGFFKI 312
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 83/164 (50%), Positives = 122/164 (74%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GP EG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
Length = 266
Score = 271 bits (693), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 128/242 (52%), Positives = 175/242 (72%), Gaps = 10/242 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI + V +S++DL
Sbjct: 6 KLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDL 65
Query: 156 VSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSC 213
++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP CE ++NG+ C
Sbjct: 66 LTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPPC 125
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GPVEG+ ++Y+
Sbjct: 126 T-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYS 184
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS+NT+WG+NG F
Sbjct: 185 DFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANSWNTDWGDNGFF 237
Query: 334 RI 335
+I
Sbjct: 238 KI 239
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 85/169 (50%), Positives = 126/169 (74%), Gaps = 11/169 (6%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NG+R C E +TP+C + C+PGY +Y+ D ++G +
Sbjct: 98 GLYESHVGCRPYSIPPCEAHVNGARPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNS 156
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E
Sbjct: 157 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE- 215
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
GT YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 216 --NGT----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 258
>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
Length = 322
Score = 271 bits (693), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 130/255 (50%), Positives = 179/255 (70%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V S+ + LPE FDAR W CPTI +IRDQGSCGS WA GAVEAMSDR+CI +
Sbjct: 50 KLPERVGFSEDIN-LPESFDAREQWSNCPTIAQIRDQGSCGSSWAFGAVEAMSDRICIHT 108
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G+ +V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP
Sbjct: 109 NGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIP 168
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NG+ C E +TP+C + C+ GY SY++D ++G +YS+ +E+ IM EI+
Sbjct: 169 PCEHHVNGARPPCT-GEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIY 227
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E + V YWLVA
Sbjct: 228 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE-------NGVPYWLVA 280
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG+NG F+I
Sbjct: 281 NSWNADWGDNGFFKI 295
Score = 179 bits (453), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 82/164 (50%), Positives = 120/164 (73%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGC PY IP CE ++NG+R C E +TP+C + C+ GY SY++D ++G +YS+
Sbjct: 159 HIGCLPYTIPPCEHHVNGARPPCTG-EGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSD 217
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E
Sbjct: 218 SEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE------ 271
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWLVANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 272 -NGVPYWLVANSWNADWGDNGFFKILRGENHCGIESEIVAGIPR 314
>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
Length = 256
Score = 271 bits (693), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 129/242 (53%), Positives = 175/242 (72%), Gaps = 10/242 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI + V +S++DL
Sbjct: 2 KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDL 61
Query: 156 VSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSC 213
++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 62 LTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPC 121
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GPVEG+ ++Y+
Sbjct: 122 T-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYS 180
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS+NT+WG+NG F
Sbjct: 181 DFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANSWNTDWGDNGFF 233
Query: 334 RI 335
+I
Sbjct: 234 KI 235
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 86/169 (50%), Positives = 126/169 (74%), Gaps = 11/169 (6%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +
Sbjct: 94 GLYESHVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNS 152
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E
Sbjct: 153 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE- 211
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
GT YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 212 --NGT----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 254
>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
Length = 339
Score = 271 bits (693), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 134/255 (52%), Positives = 185/255 (72%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V ++ + LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KLPQRVWFAENMV-LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G +V +S++D+++CC D CG+GC GGF +AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NGHVNVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYESHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+PGY SY++D ++G +YS+ ++E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCSKFCEPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVE + T+Y+D +LYK+G+Y+HV G +G HA+RI+GWG E GT YWLV
Sbjct: 245 KNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVE---NGTP----YWLVG 297
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 298 NSWNTDWGDNGFFKI 312
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 83/163 (50%), Positives = 122/163 (74%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D ++G +YS+ +
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKFCEPGYTPSYKEDKHYGCSSYSVSS 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVE + T+Y+D +LYK+G+Y+HV G +G HA+RI+GWG E GT
Sbjct: 235 SEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG+NG F+I+RG++ CGIE++I AG+P
Sbjct: 292 P----YWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIP 330
>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 271 bits (692), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 125/243 (51%), Positives = 177/243 (72%), Gaps = 9/243 (3%)
Query: 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
++++P+ FD+R WP+CPTI+E+RDQG+CGS WA GAVEAMSDR CI S GK +S++
Sbjct: 1 MDDVPDHFDSREQWPHCPTIKEVRDQGACGSCWAFGAVEAMSDRYCIKSEGKVMPHISAE 60
Query: 154 DLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSS 212
DL+SCC+ CG GC GG+ AW +W + G+V+GG Y S +GC+PY+I C+ ++ G
Sbjct: 61 DLLSCCETCGMGCNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPYKIAACDHHVVGKLKP 120
Query: 213 CQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
C+ + P TP+C RKC+ GY+VSY DD +FG+ AYS+ ++ I +EI +GPVEG+ T+Y
Sbjct: 121 CKGDSP-TPKCERKCEAGYNVSYSDDKHFGQSAYSVRSDPAEIQKEIMTNGPVEGAFTVY 179
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
AD YK+G+Y+H +G LG HAI+I+GWG+E GT YWLVANS+N++WG+ G
Sbjct: 180 ADFPTYKSGVYQHTSGSALGGHAIKILGWGEE---NGTP----YWLVANSWNSDWGDEGF 232
Query: 333 FRI 335
F+I
Sbjct: 233 FKI 235
Score = 174 bits (442), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 83/166 (50%), Positives = 117/166 (70%), Gaps = 9/166 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY+I C+ ++ G C+ + P TP+C RKC+ GY+VSY DD +FG+ AYS+ ++
Sbjct: 101 GCQPYKIAACDHHVVGKLKPCKGDSP-TPKCERKCEAGYNVSYSDDKHFGQSAYSVRSDP 159
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI +GPVEG+ T+YAD YK+G+Y+H +G LG HAI+I+GWG+E GT
Sbjct: 160 AEIQKEIMTNGPVEGAFTVYADFPTYKSGVYQHTSGSALGGHAIKILGWGEE---NGTP- 215
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
YWLVANS+N++WG+ G F+I RG +ECGIE+ I GLPK +E
Sbjct: 216 ---YWLVANSWNSDWGDEGFFKIKRGNDECGIESGIVGGLPKFSVE 258
>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
Length = 339
Score = 271 bits (692), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 131/255 (51%), Positives = 179/255 (70%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V S+ + LPE FDAR W CPTI +IRDQGSCGS WA GAVEAMSDR+CI +
Sbjct: 67 KLPERVGFSEDIN-LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G+ +V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP
Sbjct: 126 NGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+ GY SY++D ++G +YS+ +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E + V YWLVA
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE-------NGVPYWLVA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG+NG F+I
Sbjct: 298 NSWNVDWGDNGFFKI 312
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 83/164 (50%), Positives = 120/164 (73%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGC PY IP CE ++NGSR C E +TP+C + C+ GY SY++D ++G +YS+
Sbjct: 176 HIGCLPYTIPPCEHHVNGSRPPCTG-EGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSD 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E
Sbjct: 235 SEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWLVANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 289 -NGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
Length = 261
Score = 271 bits (692), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 129/242 (53%), Positives = 175/242 (72%), Gaps = 10/242 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI + V +S++DL
Sbjct: 1 KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDL 60
Query: 156 VSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSC 213
++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 61 LTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPC 120
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GPVEG+ ++Y+
Sbjct: 121 T-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYS 179
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS+NT+WG+NG F
Sbjct: 180 DFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANSWNTDWGDNGFF 232
Query: 334 RI 335
+I
Sbjct: 233 KI 234
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 86/169 (50%), Positives = 126/169 (74%), Gaps = 11/169 (6%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +
Sbjct: 93 GLYESHVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNS 151
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E
Sbjct: 152 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE- 210
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
GT YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 211 --NGT----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 253
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 271 bits (692), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 133/285 (46%), Positives = 184/285 (64%), Gaps = 12/285 (4%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N +++S + MGVHP SK + RL V P ++LPE FDAR WP+C
Sbjct: 43 WKAGRNFDKSISMSYIRGLMGVHPKSK--EYRLAEFVHDEIP-DDLPESFDAREKWPHCN 99
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
+I IRDQ +CGS WA GA EAMSDRVCI S+GK V +S++DL+ CC CG GC GG
Sbjct: 100 SIHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVNISAEDLLDCCDSCGAGCNGGTP 159
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW+YW +G+V+GG Y + GC+PY + PCE + GS +C P TP+C+ C+ G
Sbjct: 160 AAAWEYWKESGLVTGGLYGTNDGCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKG 218
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y Y+DD +FG+ YS+ ++E+ I EIF++GPVE + AD + YK+G+Y+H +
Sbjct: 219 YGKDYQDDKHFGKKVYSISSDEKQIQTEIFKNGPVEADFIVLADFLSYKSGVYQHHSDDV 278
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G HAIRI+GWG E GT YWL ANS+N +WG++G F+I
Sbjct: 279 IGGHAIRILGWGTE---NGTP----YWLAANSWNEDWGDHGYFKI 316
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 81/178 (45%), Positives = 115/178 (64%), Gaps = 16/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E+GL GC+PY + PCE + GS +C P TP+C+ C+ GY Y+
Sbjct: 166 WKESGLVTGGLYGTNDGCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKGYGKDYQ 224
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
DD +FG+ YS+ ++E+ I EIF++GPVE + AD + YK+G+Y+H + +G HAI
Sbjct: 225 DDKHFGKKVYSISSDEKQIQTEIFKNGPVEADFIVLADFLSYKSGVYQHHSDDVIGGHAI 284
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
RI+GWG E GT YWL ANS+N +WG++G F+I+RG++ECGIE DI AG+PK
Sbjct: 285 RILGWGTE---NGTP----YWLAANSWNEDWGDHGYFKILRGKDECGIEEDINAGIPK 335
>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
Length = 326
Score = 271 bits (692), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 127/254 (50%), Positives = 178/254 (70%), Gaps = 10/254 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
RLP V+ S ++ LP+ FD R WP C T+ +IRDQGSCGS WA GAVE++SDR+CI S
Sbjct: 62 RLPHTVKHSTNVK-LPDSFDLRDQWPNCKTLSQIRDQGSCGSCWAFGAVESISDRICIHS 120
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
+GK+ +S++DL+SCC CG GC GGF +AW YW +G+V+GG Y S GCRPY I P
Sbjct: 121 KGKQSPEISAEDLLSCCDQCGFGCSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPYSIAP 180
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NG+ C E +TP+C C P Y V Y+ D +FG Y++P++++ IM E++
Sbjct: 181 CEHHVNGTRPPCS-GEQDTPKCTGVCIPKYSVPYKQDKHFGSKVYNVPSDQQQIMTELYT 239
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVE + T+Y D LYK+G+Y+H+ G LG HA++I+GWG+E GT +WLVAN
Sbjct: 240 NGPVEAAFTVYEDFPLYKSGVYQHLTGSALGGHAVKILGWGEE---NGT----PFWLVAN 292
Query: 322 SFNTNWGENGLFRI 335
S+N++WG+NG F+I
Sbjct: 293 SWNSDWGDNGYFKI 306
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 80/164 (48%), Positives = 120/164 (73%), Gaps = 9/164 (5%)
Query: 335 IGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
+GCRPY I PCE ++NG+R C + E +TP+C C P Y V Y+ D +FG Y++P++
Sbjct: 171 VGCRPYSIAPCEHHVNGTRPPC-SGEQDTPKCTGVCIPKYSVPYKQDKHFGSKVYNVPSD 229
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
++ IM E++ +GPVE + T+Y D LYK+G+Y+H+ G LG HA++I+GWG+E GT
Sbjct: 230 QQQIMTELYTNGPVEAAFTVYEDFPLYKSGVYQHLTGSALGGHAVKILGWGEE---NGT- 285
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+WLVANS+N++WG+NG F+I+RG +ECGIE+++ AGLPK+
Sbjct: 286 ---PFWLVANSWNSDWGDNGYFKILRGHDECGIESEMVAGLPKL 326
>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
Length = 331
Score = 271 bits (692), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 137/274 (50%), Positives = 178/274 (64%), Gaps = 21/274 (7%)
Query: 67 LEMRMGVHPDSK----LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
L+ GVH ++ LP+ ++ L V + P+ FDAR WP CP+I +IRDQGSC
Sbjct: 54 LKSLAGVHKNANNAFTLPKRKVSLDVTI-------PDEFDARKQWPNCPSITDIRDQGSC 106
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVEAMSDR+CI S GK V LS+++LVSCC CG GC GGF AW YW G
Sbjct: 107 GSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLVSCCDSCGYGCDGGFPASAWDYWQNEG 166
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IVSGG Y SKQGC+PY I PCE ++ GS +C +TP+C +C G +SY+ D +
Sbjct: 167 IVSGGNYGSKQGCQPYSIAPCEHHVPGSRPACSGGG-DTPDCRNQCDEGSGISYDQDHYY 225
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G Y+L ++ I EI ++GPVE + T+Y D++ YK G+Y+HVAG LG HAI+I+GW
Sbjct: 226 GETVYTLDEAKQ-IQAEILKNGPVEAAFTVYEDLLNYKEGVYQHVAGEALGGHAIKILGW 284
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E + YWLVANS+NT+WG NG F+I
Sbjct: 285 GVE-------NDTPYWLVANSWNTDWGNNGFFKI 311
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 78/165 (47%), Positives = 110/165 (66%), Gaps = 10/165 (6%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+ GC+PY I PCE ++ GSR +C +TP+C +C G +SY+ D +G Y+L
Sbjct: 176 KQGCQPYSIAPCEHHVPGSRPACSGGG-DTPDCRNQCDEGSGISYDQDHYYGETVYTLDE 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
++ I EI ++GPVE + T+Y D++ YK G+Y+HVAG LG HAI+I+GWG E
Sbjct: 235 AKQ-IQAEILKNGPVEAAFTVYEDLLNYKEGVYQHVAGEALGGHAIKILGWGVE------ 287
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWLVANS+NT+WG NG F+I+RG +ECGIE I AGLP++
Sbjct: 288 -NDTPYWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAGLPRV 331
>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
Length = 311
Score = 270 bits (691), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 132/254 (51%), Positives = 180/254 (70%), Gaps = 11/254 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+ P V ++ L+ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KPPQRVMFTEDLK-LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVA
Sbjct: 245 KNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVA 297
Query: 321 NSFNTNWGENGLFR 334
NS+NT+WG+NG F+
Sbjct: 298 NSWNTDWGDNGFFK 311
Score = 154 bits (390), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 73/144 (50%), Positives = 105/144 (72%), Gaps = 9/144 (6%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFR 476
YWLVANS+NT+WG+NG F+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFK 311
>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
Length = 254
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 129/241 (53%), Positives = 174/241 (72%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI + V +S++DL+
Sbjct: 1 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 60
Query: 157 SCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 61 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 120
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GPVEG+ ++Y+D
Sbjct: 121 -GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSD 179
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS+NT+WG+NG F+
Sbjct: 180 FLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANSWNTDWGDNGFFK 232
Query: 335 I 335
I
Sbjct: 233 I 233
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 86/169 (50%), Positives = 126/169 (74%), Gaps = 11/169 (6%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +
Sbjct: 92 GLYESHVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNS 150
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E
Sbjct: 151 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE- 209
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
GT YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 210 --NGT----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 252
>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
Length = 237
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 129/235 (54%), Positives = 167/235 (71%), Gaps = 3/235 (1%)
Query: 71 MGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGA 130
MGVH ++ +L L+ +D +LPE FDAR +WP CPTI+E+RDQGSCGS WA GA
Sbjct: 3 MGVH-ETNAEYPKLEQLLTYTDAPTDLPETFDAREHWPNCPTIREVRDQGSCGSCWAFGA 61
Query: 131 VEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYA 190
VEAMSDRVCI S+G ++ S+++LVSCC CG GC GGF G AW YW T GIVSGG Y
Sbjct: 62 VEAMSDRVCIHSKGTKNFHFSAENLVSCCWTCGFGCNGGFPGAAWNYWKTKGIVSGGPYG 121
Query: 191 SKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 249
S GC PYE+ PCE ++NG+ C++ TP+C++KC+ GY V Y DL+ G+ AYSL
Sbjct: 122 SNMGCIPYEVAPCEHHVNGTRGPCKEGG-KTPKCVKKCEDGYKVPYAQDLHHGKSAYSLS 180
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
+ + I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG LG HAIRI+GWG +
Sbjct: 181 NDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ 235
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/113 (51%), Positives = 81/113 (71%), Gaps = 2/113 (1%)
Query: 335 IGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
+GC PYE+ PCE ++NG+R C+ TP+C++KC+ GY V Y DL+ G+ AYSL +
Sbjct: 124 MGCIPYEVAPCEHHVNGTRGPCKEGG-KTPKCVKKCEDGYKVPYAQDLHHGKSAYSLSND 182
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+ I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG LG HAIRI+GWG +
Sbjct: 183 VDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ 235
>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
Length = 342
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 131/275 (47%), Positives = 179/275 (65%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ + R P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 56 SLDDARILMGARKEDAEMKRKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G++ LS+ DL+SCC+DCG+GC+GGF G+AW YWV
Sbjct: 115 CGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGDGCKGGFPGQAWDYWVKR 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+GG+ + GC+PY P CE G + +C TP+C + CQ GY YE D +
Sbjct: 175 GIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKH 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G Y++ +NE+ I REI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIG
Sbjct: 235 YGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E +G YWL+ANS+N +WGE GLFR+
Sbjct: 295 WGVE---KGKP----YWLIANSWNEDWGEKGLFRM 322
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 78/162 (48%), Positives = 104/162 (64%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE G +C TP+C + CQ GY YE D ++G Y++ +NE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I REI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIGWG E +G
Sbjct: 247 KAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVE---KGKP- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL+ANS+N +WGE GLFR+VRG++EC IE+ + AGL K
Sbjct: 303 ---YWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGLIK 341
>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
Length = 342
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 129/275 (46%), Positives = 177/275 (64%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + +G +H D +L + R P + + LE +P FD+R W C +I IRDQ
Sbjct: 56 SLEDARILLGAMHEDEELRKKRRPTVDHQNVSLE-IPSSFDSRKKWHQCKSISNIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA AVEAMSDR+CI S+GK+ V LS+ DL+SCC +CG GCQGGF G AW YWV
Sbjct: 115 CGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCTECGLGCQGGFPGAAWDYWVED 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+G + + GC+PY P CE + G + C + TP+C +KCQ GY Y+ D
Sbjct: 175 GIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYKKDKY 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+GR++Y++ NE I +EI HGPVE + T+++D + YK+GIYK++ G +GEHA+RIIG
Sbjct: 235 YGRMSYNVLNNENAIKKEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHAVRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E YWL+ANS+N +WGE G FR+
Sbjct: 295 WGVE-------KKTPYWLIANSWNEDWGEKGYFRM 322
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 79/178 (44%), Positives = 113/178 (63%), Gaps = 15/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E+G+ GC+PY P CE + G C TP+C +KCQ GY Y+
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYK 230
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D +GR++Y++ NE I +EI HGPVE + T+++D + YK+GIYK++ G +GEHA+
Sbjct: 231 KDKYYGRMSYNVLNNENAIKKEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHAV 290
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
RIIGWG E YWL+ANS+N +WGE G FR++RG++ECGIE+ +T+GLP+
Sbjct: 291 RIIGWGVE-------KKTPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGLPR 341
>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
Full=RSG-2; Contains: RecName: Full=Cathepsin B light
chain; Contains: RecName: Full=Cathepsin B heavy chain;
Flags: Precursor
gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
Length = 339
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 131/254 (51%), Positives = 178/254 (70%), Gaps = 11/254 (4%)
Query: 84 LPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASR 143
LP V S+ + LPE FDAR W CPTI +IRDQGSCGS WA GAVEAMSDR+CI +
Sbjct: 68 LPERVGFSEDIN-LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTN 126
Query: 144 GKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP- 201
G+ +V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP
Sbjct: 127 GRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP 186
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS C E +TP+C + C+ GY SY++D ++G +YS+ +E+ IM EI++
Sbjct: 187 CEHHVNGSRPPCT-GEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYK 245
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E + V YWLVAN
Sbjct: 246 NGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE-------NGVPYWLVAN 298
Query: 322 SFNTNWGENGLFRI 335
S+N +WG+NG F+I
Sbjct: 299 SWNVDWGDNGFFKI 312
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 83/164 (50%), Positives = 120/164 (73%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGC PY IP CE ++NGSR C E +TP+C + C+ GY SY++D ++G +YS+
Sbjct: 176 HIGCLPYTIPPCEHHVNGSRPPCTG-EGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSD 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E
Sbjct: 235 SEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWLVANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 289 -NGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
Length = 342
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 129/275 (46%), Positives = 176/275 (64%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + +G +H D +L + R P + + LE +P FD+R W C +I IRDQ
Sbjct: 56 SLEDARILLGAMHEDEELRKKRRPTVDHQNVSLE-IPSSFDSRKKWHQCKSISNIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA AVEAMSDR+CI S+GK+ V LS+ DL+SCC +CG GCQGGF G AW YWV
Sbjct: 115 CGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCTECGLGCQGGFPGAAWDYWVED 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+G + + GC+PY P CE + G + C + TP+C +KCQ GY Y+ D
Sbjct: 175 GIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYKKDKY 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+GR++Y++ NE I +EI HGPVE + T+++D + YK+GIYK++ G +G HA+RIIG
Sbjct: 235 YGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E YWL+ANS+N +WGE G FRI
Sbjct: 295 WGVE-------KKTPYWLIANSWNEDWGEKGYFRI 322
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 79/177 (44%), Positives = 111/177 (62%), Gaps = 15/177 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E+G+ GC+PY P CE + G C TP+C +KCQ GY Y+
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYK 230
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D +GR++Y++ NE I +EI HGPVE + T+++D + YK+GIYK++ G +G HA+
Sbjct: 231 KDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAV 290
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
RIIGWG E YWL+ANS+N +WGE G FRI+RG++ECGIE+++T GLP
Sbjct: 291 RIIGWGVE-------KKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340
>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
Length = 332
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 131/275 (47%), Positives = 184/275 (66%), Gaps = 14/275 (5%)
Query: 63 TLSELEMRMGVHPDSKLPQN-RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
T+S ++ GV D P N +LPL + + +++P+ FD+R W CPTI+E+RDQGS
Sbjct: 49 TVSYVKGLCGVIRD---PNNHKLPLKLHELN-AQDIPDTFDSRTQWANCPTIKEVRDQGS 104
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA A EAMSDR C+AS GK V LSS++L++CC+ CG GC GGF AW+YW
Sbjct: 105 CGSCWAEAAAEAMSDRTCVASNGKVQVHLSSENLMACCETCGMGCHGGFPEAAWEYWKQD 164
Query: 182 GIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
G+V+GG Y S QGC+PYEI PCE ++NGS +C EP TP C + C+ GY+V++ D +
Sbjct: 165 GLVTGGPYGSMQGCQPYEIAPCEHHINGSRPACGKIEP-TPRCKKTCESGYNVTFNKDKH 223
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+ + AYS+ + + I EI +GPVE + T+YAD YK+G+Y+H +G LG HA+++IG
Sbjct: 224 YAKSAYSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMIG 283
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E YWL+ANS+N++WG+ G F+I
Sbjct: 284 WGME-------GSTPYWLIANSWNSDWGDMGFFKI 311
Score = 171 bits (433), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 81/179 (45%), Positives = 118/179 (65%), Gaps = 16/179 (8%)
Query: 327 WGENGLFR-------IGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W ++GL GC+PYEI PCE ++NGSR +C EP TP C + C+ GY+V++
Sbjct: 161 WKQDGLVTGGPYGSMQGCQPYEIAPCEHHINGSRPACGKIEP-TPRCKKTCESGYNVTFN 219
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D ++ + AYS+ + + I EI +GPVE + T+YAD YK+G+Y+H +G LG HA+
Sbjct: 220 KDKHYAKSAYSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAV 279
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
++IGWG E YWL+ANS+N++WG+ G F+I+RGQ+ECGIE DI AG P++
Sbjct: 280 KMIGWGME-------GSTPYWLIANSWNSDWGDMGFFKILRGQDECGIERDIVAGEPRM 331
>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
Length = 340
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 129/255 (50%), Positives = 174/255 (68%), Gaps = 10/255 (3%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+ P V ++ ++ LP+ FD R WP CPTI EIRDQGSCGS WA GAVEA+SDR+C+ +
Sbjct: 67 KAPERVDFAEDMD-LPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
K V +S++DL+SCC +CG GC GG+ AW+YW G+VSGG Y S GCR Y IP
Sbjct: 126 NAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C TP C R C+PGY SY++D ++G +Y +P +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIY 245
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ +Y D ++YK+G+Y+HV+G +G HAIRI+GWG E GT YWL A
Sbjct: 246 KNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVE---NGT----PYWLAA 298
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG G F+I
Sbjct: 299 NSWNTDWGITGFFKI 313
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 85/179 (47%), Positives = 121/179 (67%), Gaps = 15/179 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E GL +GCR Y IP CE ++NGSR C TP C R C+PGY SY+
Sbjct: 162 WTERGLVSGGLYDSHVGCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYK 221
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
+D ++G +Y +P +E+ IM EI+++GPVEG+ +Y D ++YK+G+Y+HV+G +G HAI
Sbjct: 222 EDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAI 281
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
RI+GWG E GT YWL ANS+NT+WG G F+I+RG++ CGIE++I AG+P++
Sbjct: 282 RILGWGVE---NGT----PYWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAGVPRM 333
>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
Length = 279
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 127/261 (48%), Positives = 170/261 (65%), Gaps = 9/261 (3%)
Query: 76 DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMS 135
D+++ + R P V + E+P FD+R WP+C +I +IRDQ CGS WA GAVEAM+
Sbjct: 7 DAEMKRKRRPT-VDHHNLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMT 65
Query: 136 DRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGC 195
DR+CI S G++ LS+ DL+SCC+DCG GCQGGF G AW YWV GIV+GG+ + GC
Sbjct: 66 DRICIQSGGQQSAELSALDLISCCEDCGQGCQGGFPGVAWDYWVKRGIVTGGSKENHTGC 125
Query: 196 RPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEET 254
+PY P CE + G + +C TP+C + CQ GY YE D ++G +Y++ NE+
Sbjct: 126 QPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGEESYNVQNNEKV 185
Query: 255 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 314
I R+I +GPVE + +Y D + YK+GIY+HV G +G HAIRIIGWG E
Sbjct: 186 IQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKR-------T 238
Query: 315 KYWLVANSFNTNWGENGLFRI 335
YWL+ANS+N +WGE GLFRI
Sbjct: 239 PYWLIANSWNEDWGEKGLFRI 259
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/182 (44%), Positives = 109/182 (59%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE + G +C TP+C + CQ GY
Sbjct: 107 YWVKRGIVTGGSKEN---HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYK 163
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
YE D ++G +Y++ NE+ I R+I +GPVE + +Y D + YK+GIY+HV G +G
Sbjct: 164 TPYEQDKHYGEESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVG 223
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HAIRIIGWG E YWL+ANS+N +WGE GLFRIVRG++EC IE+++ AGL
Sbjct: 224 GHAIRIIGWGVEKR-------TPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 276
Query: 495 PK 496
K
Sbjct: 277 IK 278
>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
Length = 342
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 129/275 (46%), Positives = 175/275 (63%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + +G +H D +L + R P + + LE +P FD+R W C +I IRDQ
Sbjct: 56 SLEDARILLGAMHEDEELRKKRRPTVDHQNVSLE-IPSSFDSRKKWRQCKSISNIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA AVEAMSDR+CI S+GK+ V LS+ DL+SCC +CG GCQGGF G AW YWV
Sbjct: 115 CGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCTECGLGCQGGFPGAAWDYWVED 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+G + + GC+PY P CE + G + C + TP+C +KCQ GY Y D
Sbjct: 175 GIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYGKDKY 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+GR++Y++ NE I +EI HGPVE + T+++D + YK+GIYK++ G +G HA+RIIG
Sbjct: 235 YGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E YWL+ANS+N +WGE G FRI
Sbjct: 295 WGVE-------KKTPYWLIANSWNEDWGEKGYFRI 322
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 79/177 (44%), Positives = 110/177 (62%), Gaps = 15/177 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E+G+ GC+PY P CE + G C TP+C +KCQ GY Y
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYG 230
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D +GR++Y++ NE I +EI HGPVE + T+++D + YK+GIYK++ G +G HA+
Sbjct: 231 KDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAV 290
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
RIIGWG E YWL+ANS+N +WGE G FRI+RG++ECGIE+++T GLP
Sbjct: 291 RIIGWGVE-------KKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340
>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 338
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 131/241 (54%), Positives = 164/241 (68%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP FDAR WP CPTI EIRDQG+CGS WA GA EAMSDR+CI S GK VR+S+DDL+
Sbjct: 84 LPSEFDARKAWPDCPTIGEIRDQGTCGSCWAFGATEAMSDRICIHSEGKEVVRISADDLL 143
Query: 157 SCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQ 214
SCC CG GC GG AW+YW GIVSGG Y S GCRPYEI PCE + +G+ C+
Sbjct: 144 SCCGLFCGFGCNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPYEIPPCEHHTSGNRPDCK 203
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
N TP+C R+C +D Y+ D +F Y++ A+EE IM EI +GPVE +YAD
Sbjct: 204 GNS-KTPKCQRQCVESFDGKYQADKHFASNVYNVRASEEDIMNEILVYGPVEADFIVYAD 262
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+G+Y+HV GG LG HA++I+GWG+E + V YWL ANS+NT+WG+ G F+
Sbjct: 263 FLTYKSGVYQHVKGGFLGGHAVKILGWGEE-------NGVPYWLCANSWNTDWGDGGFFK 315
Query: 335 I 335
I
Sbjct: 316 I 316
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 83/165 (50%), Positives = 113/165 (68%), Gaps = 9/165 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPYEIP CE + +G+R C+ N TP+C R+C +D Y+ D +F Y++ A
Sbjct: 180 HVGCRPYEIPPCEHHTSGNRPDCKGNS-KTPKCQRQCVESFDGKYQADKHFASNVYNVRA 238
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+EE IM EI +GPVE +YAD + YK+G+Y+HV GG LG HA++I+GWG+E
Sbjct: 239 SEEDIMNEILVYGPVEADFIVYADFLTYKSGVYQHVKGGFLGGHAVKILGWGEE------ 292
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ V YWL ANS+NT+WG+ G F+I+RG N C IEADI AG+PKI
Sbjct: 293 -NGVPYWLCANSWNTDWGDGGFFKILRGYNHCKIEADINAGIPKI 336
>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
Length = 364
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 128/242 (52%), Positives = 166/242 (68%), Gaps = 10/242 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++P FD+R WP+CP+I IRDQGSCGS WA GAVEAMSDR CI S GK V +S++DL
Sbjct: 111 KIPNQFDSRKQWPHCPSISYIRDQGSCGSCWAFGAVEAMSDRYCIRSNGKIQVEISAEDL 170
Query: 156 VSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC 213
+SCC +CG+GC GGF G AWKYW + G+V+GG Y SK GC PY+I PCE ++ G C
Sbjct: 171 LSCCGFECGDGCNGGFPGSAWKYWNSDGLVTGGLYGSKTGCLPYQIKPCEHHVPGDRPKC 230
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
+ TP C+ KC+ + Y D ++G +Y++ ++ I EI HGPVEG+ T+YA
Sbjct: 231 SEG-GGTPSCVSKCKGNTTIHYNQDKHYGLSSYAVGSDPTQIQTEIMTHGPVEGAFTVYA 289
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D YK+G+YKHV GG LG HAIRI+GWG E + V YWLVANS+NT+WG+ G F
Sbjct: 290 DFPTYKSGVYKHVTGGVLGGHAIRILGWGSE-------NGVAYWLVANSWNTDWGDKGYF 342
Query: 334 RI 335
+I
Sbjct: 343 KI 344
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 82/179 (45%), Positives = 114/179 (63%), Gaps = 16/179 (8%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W +GL + GC PY+I PCE ++ G R C + TP C+ KC+ + Y
Sbjct: 194 WNSDGLVTGGLYGSKTGCLPYQIKPCEHHVPGDRPKC-SEGGGTPSCVSKCKGNTTIHYN 252
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D ++G +Y++ ++ I EI HGPVEG+ T+YAD YK+G+YKHV GG LG HAI
Sbjct: 253 QDKHYGLSSYAVGSDPTQIQTEIMTHGPVEGAFTVYADFPTYKSGVYKHVTGGVLGGHAI 312
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
RI+GWG E + V YWLVANS+NT+WG+ G F+I+RG +ECGIE+ + AG+P+I
Sbjct: 313 RILGWGSE-------NGVAYWLVANSWNTDWGDKGYFKILRGSDECGIESSVVAGIPQI 364
>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
Length = 356
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 128/271 (47%), Positives = 175/271 (64%), Gaps = 9/271 (3%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPTIQEIRDQGSCGSG 125
++ MGV SKL N +PL V + ++ E+P FD+R WPYCPTI EIRDQ +CGS
Sbjct: 71 IQGMMGVRRSSKLETNSIPLPVISYEHIDMEIPVEFDSRKQWPYCPTIGEIRDQSNCGSC 130
Query: 126 WALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVS 185
WA GAVEA+SDR+CIA+ G++ +SS DL+SCCK CG GCQGG +AW +WV G+V+
Sbjct: 131 WAFGAVEAISDRICIATDGRQKPHISSTDLLSCCKICGFGCQGGDPHQAWSFWVKYGLVT 190
Query: 186 GGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRI 244
GG Y + GCRPY PC + NG++ C + TP C + CQ Y + Y D +G
Sbjct: 191 GGNYTTHDGCRPYPFAPCNHHSNGTYGPCSHDLEPTPVCKKACQSTYKIQYNKDKYYGLK 250
Query: 245 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
AYSL + +E+ +GP+E + +Y D +LYKTG+Y+H G LG HA+R++GWG+E
Sbjct: 251 AYSLHNKASDLQKELMMNGPMEVAFEVYEDFLLYKTGVYQHHTGSVLGGHAVRLLGWGEE 310
Query: 305 PLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ V YWL+ANS+NT WG+ G F+I
Sbjct: 311 -------NGVPYWLLANSWNTEWGDKGFFKI 334
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 80/190 (42%), Positives = 109/190 (57%), Gaps = 15/190 (7%)
Query: 308 EGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECI 366
+ S VKY LV G N GCRPY PC + NG+ C + TP C
Sbjct: 178 QAWSFWVKYGLVT-------GGNYTTHDGCRPYPFAPCNHHSNGTYGPCSHDLEPTPVCK 230
Query: 367 RKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK 426
+ CQ Y + Y D +G AYSL + +E+ +GP+E + +Y D +LYKTG+Y+
Sbjct: 231 KACQSTYKIQYNKDKYYGLKAYSLHNKASDLQKELMMNGPMEVAFEVYEDFLLYKTGVYQ 290
Query: 427 HVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
H G LG HA+R++GWG+E + V YWL+ANS+NT WG+ G F+I RG+NECGI
Sbjct: 291 HHTGSVLGGHAVRLLGWGEE-------NGVPYWLLANSWNTEWGDKGFFKIYRGRNECGI 343
Query: 487 EADITAGLPK 496
E++ AGL K
Sbjct: 344 ESEAVAGLYK 353
>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
Length = 356
Score = 268 bits (684), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 137/292 (46%), Positives = 180/292 (61%), Gaps = 23/292 (7%)
Query: 59 LSKLTLSELEMRMGVHPDSK-------------LPQNRLPLLVQLSDPLEELPEGFDARI 105
++K+ ++L + G + +S + +RLP+ L+D ELP FD+R
Sbjct: 44 IAKVNSADLSWKAGANFNSNYAPKHVAGLCGTIMGDDRLPVNHLLNDADLELPANFDSRE 103
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGN 164
WP CP+I E+RDQGSCGS WA GA EA+SDR CI S LSS+DL+SCC CGN
Sbjct: 104 AWPDCPSISEVRDQGSCGSCWAFGASEAISDRTCIHSNAAFTFDLSSEDLLSCCGYVCGN 163
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPEC 223
GC GGF AW+YWV G+VSGG Y GC+PY I PCE + G C E TP+C
Sbjct: 164 GCNGGFPQAAWEYWVQNGLVSGGLYHG-TGCQPYAIEPCEHHTEGDRPPCTGEEGTTPKC 222
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
KC GY ++ D ++G +AY +PANE+ IM EI+++GPVEG+ +Y D YK+G+Y
Sbjct: 223 SHKCVDGYTGNFAQDKHYGSVAYRIPANEKAIMNEIYKNGPVEGAFIVYEDFPTYKSGVY 282
Query: 284 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H G LG HAIR++GWG+E GE KYWL NS+NT+WG NG F+I
Sbjct: 283 SHHTGSALGGHAIRVLGWGEEN-GE------KYWLCGNSWNTDWGNNGFFKI 327
Score = 174 bits (442), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 84/176 (47%), Positives = 112/176 (63%), Gaps = 14/176 (7%)
Query: 327 WGENGLFR------IGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYED 379
W +NGL GC+PY I PCE + G R C E TP+C KC GY ++
Sbjct: 177 WVQNGLVSGGLYHGTGCQPYAIEPCEHHTEGDRPPCTGEEGTTPKCSHKCVDGYTGNFAQ 236
Query: 380 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 439
D ++G +AY +PANE+ IM EI+++GPVEG+ +Y D YK+G+Y H G LG HAIR
Sbjct: 237 DKHYGSVAYRIPANEKAIMNEIYKNGPVEGAFIVYEDFPTYKSGVYSHHTGSALGGHAIR 296
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
++GWG+E GE KYWL NS+NT+WG NG F+I RG NECGIE+++ G+P
Sbjct: 297 VLGWGEEN-GE------KYWLCGNSWNTDWGNNGFFKIKRGVNECGIESEMVGGIP 345
>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
Length = 342
Score = 268 bits (684), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 125/261 (47%), Positives = 169/261 (64%), Gaps = 9/261 (3%)
Query: 76 DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMS 135
D +L + R P + + LE +P FD+R W C +I IRDQ CG WA AVEAMS
Sbjct: 70 DEELRKKRRPTVDHQNVSLE-IPSSFDSRKKWRQCKSISNIRDQSRCGPCWAFAAVEAMS 128
Query: 136 DRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGC 195
DR+CI S+GK+ V LS+ DL+SCC +CG GCQGGF G AW YWV GIV+G + + GC
Sbjct: 129 DRICIQSKGKKSVELSAVDLLSCCTECGLGCQGGFPGAAWDYWVEEGIVTGSSKENHTGC 188
Query: 196 RPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEET 254
+PY P CE + G + +C + TP+C +KCQ GY Y+ D +G+++Y++ + E+
Sbjct: 189 QPYPFPKCEHHTKGKYPACGEKIYKTPKCQQKCQKGYKTPYKKDKYYGKLSYNVLSKEDA 248
Query: 255 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 314
I +EI HGPVE + T+Y+D + YK+GIYKH+ G +G HA+RIIGWG E
Sbjct: 249 IKKEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIGGHAVRIIGWGVE-------KKT 301
Query: 315 KYWLVANSFNTNWGENGLFRI 335
YWL+ANS+N +WGE G FRI
Sbjct: 302 PYWLIANSWNEDWGEKGYFRI 322
Score = 169 bits (427), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 80/181 (44%), Positives = 113/181 (62%), Gaps = 11/181 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ + EN GC+PY P CE + G +C TP+C +KCQ GY
Sbjct: 170 YWVEEGIVTGSSKEN---HTGCQPYPFPKCEHHTKGKYPACGEKIYKTPKCQQKCQKGYK 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
Y+ D +G+++Y++ + E+ I +EI HGPVE + T+Y+D + YK+GIYKH+ G +G
Sbjct: 227 TPYKKDKYYGKLSYNVLSKEDAIKKEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIG 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+RIIGWG E YWL+ANS+N +WGE G FRI+RG++ CGIE+ +TAGL
Sbjct: 287 GHAVRIIGWGVE-------KKTPYWLIANSWNEDWGEKGYFRILRGKDVCGIESAVTAGL 339
Query: 495 P 495
P
Sbjct: 340 P 340
>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
Length = 339
Score = 268 bits (684), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 132/253 (52%), Positives = 179/253 (70%), Gaps = 11/253 (4%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P V ++ L+ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 69 PQRVMFTEDLK-LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNA 127
Query: 145 KRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C 202
V +S++DL++CC CG+GC GG+ AW + G+VSGG Y S GCRPY IP C
Sbjct: 128 HVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFLTRKGLVSGGLYDSHVGCRPYSIPPC 187
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
E ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++
Sbjct: 188 EHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKN 246
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS
Sbjct: 247 GPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANS 299
Query: 323 FNTNWGENGLFRI 335
+NT+WG+NG F+I
Sbjct: 300 WNTDWGDNGFFKI 312
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
Length = 271
Score = 267 bits (683), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 130/253 (51%), Positives = 177/253 (69%), Gaps = 11/253 (4%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P V S+ + LPE FDAR W CPTI +IRDQGSCGS WA GAVEAMSDR+CI + G
Sbjct: 1 PERVGFSEDIN-LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNG 59
Query: 145 KRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C 202
+ +V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP C
Sbjct: 60 RVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPC 119
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
E ++NGS C E +TP+C + C+ GY SY++D ++G +YS+ +E+ IM EI+++
Sbjct: 120 EHHVNGSRPPCT-GEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKN 178
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E + V YWLVANS
Sbjct: 179 GPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE-------NGVPYWLVANS 231
Query: 323 FNTNWGENGLFRI 335
+N +WG+NG F+I
Sbjct: 232 WNVDWGDNGFFKI 244
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 83/164 (50%), Positives = 120/164 (73%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGC PY IP CE ++NGSR C E +TP+C + C+ GY SY++D ++G +YS+
Sbjct: 108 HIGCLPYTIPPCEHHVNGSRPPCTG-EGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSD 166
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E
Sbjct: 167 SEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE------ 220
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWLVANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 221 -NGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPR 263
>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
Length = 260
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 126/241 (52%), Positives = 172/241 (71%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LPE FDAR W CPTI +IRDQGSCGS WA GAVEAMSDR+CI + G+ +V +S++DL+
Sbjct: 7 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 66
Query: 157 SCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP CE ++NG+ C
Sbjct: 67 TCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGARPPCT 126
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+ GY SY++D ++G +YS+ +E+ IM EI+++GPVEG+ T+++D
Sbjct: 127 -GEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSD 185
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+G+YKH AG +G HAIRI+GWG E + V YWLVANS+N +WG+NG F+
Sbjct: 186 FLTYKSGVYKHEAGDVMGGHAIRILGWGIE-------NGVPYWLVANSWNADWGDNGFFK 238
Query: 335 I 335
I
Sbjct: 239 I 239
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 82/164 (50%), Positives = 120/164 (73%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGC PY IP CE ++NG+R C E +TP+C + C+ GY SY++D ++G +YS+
Sbjct: 103 HIGCLPYTIPPCEHHVNGARPPCTG-EGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSD 161
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E
Sbjct: 162 SEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE------ 215
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWLVANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 216 -NGVPYWLVANSWNADWGDNGFFKILRGENHCGIESEIVAGIPR 258
>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
Length = 254
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 126/241 (52%), Positives = 172/241 (71%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LPE FDAR W CPTI +IRDQGSCGS WA GAVEAMSDR+CI + G+ +V +S++DL+
Sbjct: 1 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 60
Query: 157 SCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP CE ++NG+ C
Sbjct: 61 TCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGARPPCT 120
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+ GY SY++D ++G +YS+ +E+ IM EI+++GPVEG+ T+++D
Sbjct: 121 -GEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSD 179
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+G+YKH AG +G HAIRI+GWG E + V YWLVANS+N +WG+NG F+
Sbjct: 180 FLTYKSGVYKHEAGDVMGGHAIRILGWGIE-------NGVPYWLVANSWNADWGDNGFFK 232
Query: 335 I 335
I
Sbjct: 233 I 233
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 82/164 (50%), Positives = 120/164 (73%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGC PY IP CE ++NG+R C E +TP+C + C+ GY SY++D ++G +YS+
Sbjct: 97 HIGCLPYTIPPCEHHVNGARPPCTG-EGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSD 155
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E
Sbjct: 156 SEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE------ 209
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWLVANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 210 -NGVPYWLVANSWNADWGDNGFFKILRGENHCGIESEIVAGIPR 252
>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
Length = 338
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 128/241 (53%), Positives = 177/241 (73%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LPE FD+R WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI + G V +S++D++
Sbjct: 80 LPESFDSREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDML 139
Query: 157 SCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC D CG+GC GGF +AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 140 TCCGDQCGDGCNGGFPAEAWNFWTXXGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+PGY SY++D ++G +YS+ ++E+ IM EI+++GPVE + ++Y+D
Sbjct: 200 -GEGDTPKCSKICEPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGPVEAAFSVYSD 258
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
++YK+G+Y+HV G +G HA+RI+GWG E GT YWLV NS+NT+WG+NG F+
Sbjct: 259 FLMYKSGVYQHVTGEMMGGHAVRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFK 311
Query: 335 I 335
I
Sbjct: 312 I 312
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 82/163 (50%), Positives = 122/163 (74%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D ++G +YS+ +
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYTPSYKEDKHYGCSSYSVSS 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVE + ++Y+D ++YK+G+Y+HV G +G HA+RI+GWG E GT
Sbjct: 235 SEKEIMAEIYKNGPVEAAFSVYSDFLMYKSGVYQHVTGEMMGGHAVRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P
Sbjct: 292 ----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 330
>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
Length = 342
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 128/275 (46%), Positives = 175/275 (63%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + +G + D +L + R P V D E+P FD+R WP C +I IRDQ
Sbjct: 56 SLKDARILLGAMREDEELRKKRRPT-VDHQDVSLEIPTSFDSRKEWPQCKSISNIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CG+GWA AV+AMSDR+CI S+GK+ V LS+ DL+SCC +CG GCQ GF G AW YWV
Sbjct: 115 CGAGWAFAAVQAMSDRICIESKGKKSVELSAVDLLSCCIECGLGCQMGFPGIAWDYWVQE 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+GG+ + GC+PY P CE + G + C + P+C +KCQ GY YE D
Sbjct: 175 GIVTGGSKENHTGCQPYPFPKCEHHTKGRYPECGEIIYMKPKCHQKCQKGYKTPYEKDKY 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G+++Y+L NE++I +EI HGPVE S +++D + YK+GIYKH+ G +G H +RIIG
Sbjct: 235 YGKVSYNLLKNEDSIKKEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIGSHVVRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E YWL+ANS+N +WGE G FR+
Sbjct: 295 WGVE-------KETPYWLIANSWNEDWGEKGYFRM 322
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE + G C P+C +KCQ GY
Sbjct: 170 YWVQEGIVTGGSKEN---HTGCQPYPFPKCEHHTKGRYPECGEIIYMKPKCHQKCQKGYK 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
YE D +G+++Y+L NE++I +EI HGPVE S +++D + YK+GIYKH+ G +G
Sbjct: 227 TPYEKDKYYGKVSYNLLKNEDSIKKEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIG 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
H +RIIGWG E YWL+ANS+N +WGE G FR++RG++ECGIE+ +T+GL
Sbjct: 287 SHVVRIIGWGVE-------KETPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGL 339
Query: 495 PK 496
P+
Sbjct: 340 PR 341
>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
Length = 283
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 125/243 (51%), Positives = 166/243 (68%), Gaps = 9/243 (3%)
Query: 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
+ LPE FD R WP C T+ EIRDQGSCGS WA GAVEAM+DRVCI S +H S++
Sbjct: 40 IATLPEIFDPRDKWPECLTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 99
Query: 154 DLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSS 212
DLVSCC CG GC GG AW+YW G+VSGG Y S QGCRPYEI PCE ++ G+
Sbjct: 100 DLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP 159
Query: 213 CQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
C + + TP+C + C+ Y+V ++ D +G+ YS+ +E+ I E+F++GPVE + T+Y
Sbjct: 160 C-NGDTKTPKCQKNCESSYNVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVY 218
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
+D++ YK G+YKH G LG HAI+IIGWG E + KYWL+ANS+N++WG+NG
Sbjct: 219 SDLLSYKNGVYKHTEGNALGGHAIKIIGWGVE-------NNNKYWLIANSWNSDWGDNGF 271
Query: 333 FRI 335
F+I
Sbjct: 272 FKI 274
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 71/152 (46%), Positives = 106/152 (69%), Gaps = 9/152 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPYEIP CE ++ G+R C + TP+C + C+ Y+V ++ D +G+ YS+ +E
Sbjct: 140 GCRPYEIPPCEHHVPGNRMPCNG-DTKTPKCQKNCESSYNVPFKKDKRYGKHVYSVSGHE 198
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E+F++GPVE + T+Y+D++ YK G+YKH G LG HAI+IIGWG E +
Sbjct: 199 DHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVE-------N 251
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
KYWL+ANS+N++WG+NG F+I+RG++ CGI
Sbjct: 252 NNKYWLIANSWNSDWGDNGFFKILRGEDHCGI 283
>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
Length = 342
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 128/275 (46%), Positives = 175/275 (63%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + +G + D +L + R P + + LE +P FD+R W C +I IRDQ
Sbjct: 56 SLEDARILLGAMREDEELRKKRRPTVDHQNVSLE-IPSSFDSRKKWHQCKSISNIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA AVEAMSDR+CI S+GK+ V LS+ DL+SCC +CG GCQGGF G AW YWV
Sbjct: 115 CGSCWAFTAVEAMSDRICIESKGKKSVELSAVDLLSCCTECGLGCQGGFPGAAWDYWVED 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+G + + GC+PY P CE + G + C + TP+C +KCQ GY Y+ D
Sbjct: 175 GIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYKKDKY 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+GR++Y++ NE I +EI HGPVE + T+++D + YK+GIYK++ G +G HA+RIIG
Sbjct: 235 YGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E YWL+ANS+N +WGE G FRI
Sbjct: 295 WGVE-------KKTPYWLIANSWNEDWGEKGYFRI 322
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 79/177 (44%), Positives = 111/177 (62%), Gaps = 15/177 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E+G+ GC+PY P CE + G C TP+C +KCQ GY Y+
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYK 230
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D +GR++Y++ NE I +EI HGPVE + T+++D + YK+GIYK++ G +G HA+
Sbjct: 231 KDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAV 290
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
RIIGWG E YWL+ANS+N +WGE G FRI+RG++ECGIE+++T GLP
Sbjct: 291 RIIGWGVE-------KKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340
>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
Length = 332
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 133/274 (48%), Positives = 179/274 (65%), Gaps = 20/274 (7%)
Query: 67 LEMRMGVHPDSK----LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
L+ GVH D+ LP+ ++ + V + P+ FDAR +WP C +I EIRDQGSC
Sbjct: 54 LKSLAGVHKDANNAFTLPKRQVSVDVTV-------PDEFDARKHWPNCSSITEIRDQGSC 106
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVEAMSDR+CI S GK V LS+++L+SCC CG GC GG AW+YW G
Sbjct: 107 GSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLLSCCDSCGYGCLGGSAENAWEYWHKFG 166
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IVSGG Y SKQGC+PY I PCE + GS +C+ +TP+C ++C+ GY + Y DDL +
Sbjct: 167 IVSGGNYGSKQGCQPYSIAPCEHSIPGSRPACEGVR-DTPKCKKQCEKGYGIPYGDDLCY 225
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G+ Y++ + + I EI ++GP+ S+ +Y D+ YK G+Y+HVAG LG H I+I+GW
Sbjct: 226 GQPGYTIENDAQKIQAEILKNGPIVASILVYEDLFSYKAGVYQHVAGEVLGGHVIKILGW 285
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E + YWLVANS+NT+WG NG F+I
Sbjct: 286 GVE-------NDTPYWLVANSWNTDWGNNGFFKI 312
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 82/193 (42%), Positives = 121/193 (62%), Gaps = 12/193 (6%)
Query: 306 LGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPE 364
LG + +YW + F G N + GC+PY I PCE + GSR +C+ +TP+
Sbjct: 151 LGGSAENAWEYW---HKFGIVSGGNYGSKQGCQPYSIAPCEHSIPGSRPACEGVR-DTPK 206
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C ++C+ GY + Y DDL +G+ Y++ + + I EI ++GP+ S+ +Y D+ YK G+
Sbjct: 207 CKKQCEKGYGIPYGDDLCYGQPGYTIENDAQKIQAEILKNGPIVASILVYEDLFSYKAGV 266
Query: 425 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNEC 484
Y+HVAG LG H I+I+GWG E + YWLVANS+NT+WG NG F+I+RG +EC
Sbjct: 267 YQHVAGEVLGGHVIKILGWGVE-------NDTPYWLVANSWNTDWGNNGFFKILRGSDEC 319
Query: 485 GIEADITAGLPKI 497
GIE I AG+P++
Sbjct: 320 GIEDQIVAGIPRV 332
>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
Length = 342
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 127/275 (46%), Positives = 176/275 (64%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGV-HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
++ + + +GV D KL + R P + + LE +P FD+R W C +I I DQ
Sbjct: 56 SVEDARILLGVMREDEKLRKKRRPTVDHQNVSLE-IPSTFDSRKKWSQCKSISSIHDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGSGWA AVE MSDR+CI S+G++ V LS+ DL+SCC++CG GC GGF G AW YWV
Sbjct: 115 CGSGWAFAAVEVMSDRICIQSKGEKSVELSAVDLLSCCRECGLGCLGGFPGSAWDYWVEE 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
G+V+G + + GC+PY P CE G + +C TP+C +KCQ GY Y+ D +
Sbjct: 175 GVVTGSSGENHTGCQPYPFPKCEHNTTGKYPACGQKIYETPKCQKKCQKGYKTPYKKDKH 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G++AY++P NE++I +EI HGPV T+Y+D + YK+GIYKH+ G +G H +RI+G
Sbjct: 235 YGKVAYNVPNNEDSIKKEIMMHGPVGSFFTVYSDFLNYKSGIYKHMKGTEIGVHTVRIVG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E +GT YWL+ANS+N WGE G FRI
Sbjct: 295 WGVE---KGTP----YWLIANSWNEGWGEKGYFRI 322
Score = 169 bits (427), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 84/192 (43%), Positives = 117/192 (60%), Gaps = 11/192 (5%)
Query: 306 LGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPE 364
LG S YW+ + GEN GC+PY P CE G +C TP+
Sbjct: 160 LGGFPGSAWDYWVEEGVVTGSSGEN---HTGCQPYPFPKCEHNTTGKYPACGQKIYETPK 216
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C +KCQ GY Y+ D ++G++AY++P NE++I +EI HGPV T+Y+D + YK+GI
Sbjct: 217 CQKKCQKGYKTPYKKDKHYGKVAYNVPNNEDSIKKEIMMHGPVGSFFTVYSDFLNYKSGI 276
Query: 425 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNEC 484
YKH+ G +G H +RI+GWG E +GT YWL+ANS+N WGE G FRI+RG++EC
Sbjct: 277 YKHMKGTEIGVHTVRIVGWGVE---KGTP----YWLIANSWNEGWGEKGYFRILRGKDEC 329
Query: 485 GIEADITAGLPK 496
IE+ + GLP+
Sbjct: 330 DIESLVIGGLPR 341
>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
Length = 339
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 175/255 (68%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V + ++ LPE FDAR W CPTI +IRDQGSCGS WA GAVEA+SDR CI +
Sbjct: 67 KLPGRVAFGEDID-LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G+ +V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP
Sbjct: 126 NGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP C + C+ GY SY++D +FG +YS+ + + IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E + V YWL A
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVE-------NGVPYWLAA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG+NG F+I
Sbjct: 298 NSWNLDWGDNGFFKI 312
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 81/164 (49%), Positives = 117/164 (71%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GC PY IP CE ++NGSR C E +TP C + C+ GY SY++D +FG +YS+
Sbjct: 176 HVGCLPYTIPPCEHHVNGSRPPCTG-EGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E
Sbjct: 235 SVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWL ANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 289 -NGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 129/265 (48%), Positives = 172/265 (64%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D+++ R P V D E+P FD+R WP+C +I +IRDQ CGS WA+ AV
Sbjct: 66 GGKEDAEMKWKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCC++CG+GC GGF G AW YWV+ GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDYWVSHGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GC+PY P CE + G + SC D TP+C RKCQ GY YE D ++G I+ ++
Sbjct: 185 HTGCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIK 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
NE I +EI +GPVE + I+ D + YK+GIY++ G +GEH +RIIGWG E GT
Sbjct: 245 NESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/162 (47%), Positives = 102/162 (62%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE + G SC TP+C RKCQ GY YE D ++G I+ ++ NE
Sbjct: 187 GCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI +GPVE + I+ D + YK+GIY++ G +GEH +RIIGWG E GT+
Sbjct: 247 SAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIE---NGTA- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL AN++N +WGE G FRIVRG+NEC IE+ + AG K
Sbjct: 303 ---YWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAGRLK 341
>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
(Schistosoma japonicum)
Length = 316
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 130/265 (49%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L Q R P V D E+P FD+R WP C +I +IRDQ C S WA+ AV
Sbjct: 40 GRREDPNLRQKRRPT-VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAV 98
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCC++CG+GC GGF G AW YWV+ GIV+GG+ +
Sbjct: 99 GAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDYWVSHGIVTGGSKEN 158
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GC+PY P CE + G + SC D TP+C RKCQ GY YE D ++G I+ ++
Sbjct: 159 HTGCQPYPFPKCEHHSKGKYPSCGDKMYKTPQCKRKCQKGYKTPYEHDKHYGGISINVIK 218
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
NE I +EI +GPVE + I+ D + YK+GIY++ G +GEH +RIIGWG E GT
Sbjct: 219 NESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIE---NGT 275
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 276 A----YWLAANTWNEDWGEKGYFRI 296
Score = 154 bits (390), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 80/182 (43%), Positives = 107/182 (58%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE + G SC TP+C RKCQ GY
Sbjct: 144 YWVSHGIVTGGSKEN---HTGCQPYPFPKCEHHSKGKYPSCGDKMYKTPQCKRKCQKGYK 200
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
YE D ++G I+ ++ NE I +EI +GPVE + I+ D + YK+GIY++ G +G
Sbjct: 201 TPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVG 260
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
EH +RIIGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC +E+ + AG
Sbjct: 261 EHYVRIIGWGIE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSVESVVVAGR 313
Query: 495 PK 496
K
Sbjct: 314 LK 315
>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
Length = 333
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 133/265 (50%), Positives = 173/265 (65%), Gaps = 12/265 (4%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
GVH ++K N L ++ LP+ FDAR WP C TI EIRDQGSCGS WA GAV
Sbjct: 60 GVHKNTK---NGFTLPIRDVSLDITLPDEFDARKQWPNCSTIGEIRDQGSCGSCWAFGAV 116
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
EAMSDR+CI S GK V LS+++L+SCC CG+GC GG AW+YW GIVSGG Y S
Sbjct: 117 EAMSDRLCIHSNGKLQVHLSAENLLSCCDSCGDGCLGGSPESAWEYWHKFGIVSGGNYGS 176
Query: 192 KQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
KQGC+PY I PCE ++GS +C +TP+C ++C+ GY + Y+ +G+ Y++P
Sbjct: 177 KQGCQPYSIAPCEHSIHGSSPAC-GGVTDTPKCKKQCEKGYSIPYDKAFYYGQPGYAIPN 235
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
+ + I EI ++GP+ S +Y D+ YK G+Y+HVAG LG H I+I GWG E GT
Sbjct: 236 DAQKIQAEILKNGPIVASFLVYEDLFSYKEGVYQHVAGEFLGGHVIKIFGWGIE---NGT 292
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
YWLVANS+NT+WG NG F+I
Sbjct: 293 ----PYWLVANSWNTDWGNNGFFKI 313
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/193 (43%), Positives = 120/193 (62%), Gaps = 12/193 (6%)
Query: 306 LGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPE 364
LG S +YW + F G N + GC+PY I PCE ++GS +C +TP+
Sbjct: 152 LGGSPESAWEYW---HKFGIVSGGNYGSKQGCQPYSIAPCEHSIHGSSPAC-GGVTDTPK 207
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C ++C+ GY + Y+ +G+ Y++P + + I EI ++GP+ S +Y D+ YK G+
Sbjct: 208 CKKQCEKGYSIPYDKAFYYGQPGYAIPNDAQKIQAEILKNGPIVASFLVYEDLFSYKEGV 267
Query: 425 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNEC 484
Y+HVAG LG H I+I GWG E GT YWLVANS+NT+WG NG F+I RG++EC
Sbjct: 268 YQHVAGEFLGGHVIKIFGWGIE---NGT----PYWLVANSWNTDWGNNGFFKIPRGKDEC 320
Query: 485 GIEADITAGLPKI 497
GIE D++AGLP++
Sbjct: 321 GIEIDVSAGLPRL 333
>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
Length = 342
Score = 262 bits (670), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 127/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L Q R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRKEDPNLRQKRRPT-VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 262 bits (670), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 131/265 (49%), Positives = 170/265 (64%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
A+SDR+CI S GK+ V LS+ DL+SCC++CG+GC GGF G AW YWV+ GIV+GG+ +
Sbjct: 125 GAISDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDYWVSHGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GC+PY P CE + G + SC D TP+C RKCQ GY YE D ++G IA ++
Sbjct: 185 HTGCQPYPFPKCEHHSIGKYPSCGDKMYKTPQCKRKCQKGYTTPYEHDKHYGGIAINVIK 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
NE I +EI +GPVE + I+ D + YK+GIYK+ G +GEH +RIIGWG E GT
Sbjct: 245 NELAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHYVRIIGWGIE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 83/182 (45%), Positives = 107/182 (58%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE + G SC TP+C RKCQ GY
Sbjct: 170 YWVSHGIVTGGSKEN---HTGCQPYPFPKCEHHSIGKYPSCGDKMYKTPQCKRKCQKGYT 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
YE D ++G IA ++ NE I +EI +GPVE + I+ D + YK+GIYK+ G +G
Sbjct: 227 TPYEHDKHYGGIAINVIKNELAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYKYTTGSFVG 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
EH +RIIGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE+ + AG
Sbjct: 287 EHYVRIIGWGIE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAGR 339
Query: 495 PK 496
K
Sbjct: 340 LK 341
>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
Length = 333
Score = 262 bits (670), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 137/284 (48%), Positives = 184/284 (64%), Gaps = 15/284 (5%)
Query: 54 AEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTI 113
A N +++ S + MGV P+ K + LP + E++PE FD+R WP+CPTI
Sbjct: 44 AGSNFDEEISTSYIRGLMGVLPNHK---DYLPPALPTLLGTEQIPENFDSRQKWPHCPTI 100
Query: 114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGK 173
IRDQGSCGS WA GAVEAMSDR+CI S + V +S+++L+SCC CG GC GGF G
Sbjct: 101 SLIRDQGSCGSCWAFGAVEAMSDRLCIHSN--KIVNVSAENLLSCCYSCGFGCNGGFPGA 158
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQ-PGY 231
AW +W G+VSGG Y S +GC+PY I PCE + NG+ C TP+C C+ Y
Sbjct: 159 AWSFWKKKGLVSGGLYGSHKGCQPYAIAPCEHHANGTRPPCSGGG-RTPKCHTFCENEDY 217
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 291
+ YE D +FGR +YS+ ++ + I EI +GPVE + ++Y+D + YK+G+Y+HV G L
Sbjct: 218 SLPYEKDKSFGRSSYSVKSDPKQIQLEIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLL 277
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HAIRI+GWG E GT YWLVANS+NT+WG+NG F+I
Sbjct: 278 GGHAIRILGWGVE---NGT----PYWLVANSWNTDWGDNGTFKI 314
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 79/163 (48%), Positives = 109/163 (66%), Gaps = 10/163 (6%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPAN 393
GC+PY I PCE + NG+R C TP+C C+ Y + YE D +FGR +YS+ ++
Sbjct: 179 GCQPYAIAPCEHHANGTRPPCSGGG-RTPKCHTFCENEDYSLPYEKDKSFGRSSYSVKSD 237
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ I EI +GPVE + ++Y+D + YK+G+Y+HV G LG HAIRI+GWG E GT
Sbjct: 238 PKQIQLEIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLLGGHAIRILGWGVE---NGT- 293
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I++G + CGIE I AGLP+
Sbjct: 294 ---PYWLVANSWNTDWGDNGTFKILKGSDHCGIEGSIVAGLPQ 333
>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
Length = 342
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 127/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L Q R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLRQKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 261 bits (668), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 129/265 (48%), Positives = 171/265 (64%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D+++ R P V D E+P FD+R WP+C +I +IRDQ CGS WA+ AV
Sbjct: 66 GGKEDAEMKWKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCC++CG+GC GGF G AW YWV+ GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDYWVSHGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GC+PY P CE + G + SC D TP+C RKCQ GY YE D ++G I+ ++
Sbjct: 185 HTGCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIK 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
NE I EI +GPVE + I+ D + YK+GIY++ G +GEH +RIIGWG E GT
Sbjct: 245 NESAIQNEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 77/162 (47%), Positives = 101/162 (62%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE + G SC TP+C RKCQ GY YE D ++G I+ ++ NE
Sbjct: 187 GCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVE + I+ D + YK+GIY++ G +GEH +RIIGWG E GT+
Sbjct: 247 SAIQNEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIE---NGTA- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL AN++N +WGE G FRIVRG+NEC IE+ + AG K
Sbjct: 303 ---YWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAGRLK 341
>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
Length = 342
Score = 261 bits (668), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 127/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L Q R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRKEDPNLRQKRRPT-VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
Length = 342
Score = 261 bits (668), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 125/265 (47%), Positives = 171/265 (64%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L Q R P V D E+P FD+R WP C +I +IRDQ CGS WA+ A+
Sbjct: 66 GRKEDPNLRQRRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAI 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V+LS+ DL+SCC++CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVKLSAVDLISCCENCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 261 bits (667), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 127/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L Q R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLRQKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
Length = 225
Score = 261 bits (667), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 122/223 (54%), Positives = 163/223 (73%), Gaps = 3/223 (1%)
Query: 84 LPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASR 143
LP+ + ++ ++ LPE FD+R WP CPTIQEIRDQGSCGS WA GAVEA+SDRVCI S+
Sbjct: 1 LPMKLGMATDVK-LPENFDSRTQWPKCPTIQEIRDQGSCGSCWAFGAVEAISDRVCIHSK 59
Query: 144 GKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP- 201
GK +V +S++DL+SCC +CG GC GG+ AW +W TG+VSGG + S GCRPY IP
Sbjct: 60 GKVNVEISAEDLLSCCGMECGFGCNGGYPSGAWNFWTETGLVSGGLFKSHIGCRPYTIPP 119
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS SC E +TP+C+ +C+ GY SY D +FG +Y++ +NE I EI++
Sbjct: 120 CEHHVNGSRPSCTGEEGDTPKCVMQCEAGYTPSYFKDKHFGSTSYAVSSNEADIQIEIYK 179
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
+GPVEG+ T+Y D + YK+G+YKHV G +G HAIRI+GWG E
Sbjct: 180 NGPVEGAFTVYEDFLQYKSGVYKHVTGDAVGGHAIRILGWGVE 222
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 63/119 (52%), Positives = 86/119 (72%), Gaps = 3/119 (2%)
Query: 331 GLFR--IGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GLF+ IGCRPY IP CE ++NGSR SC E +TP+C+ +C+ GY SY D +FG +
Sbjct: 104 GLFKSHIGCRPYTIPPCEHHVNGSRPSCTGEEGDTPKCVMQCEAGYTPSYFKDKHFGSTS 163
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
Y++ +NE I EI+++GPVEG+ T+Y D + YK+G+YKHV G +G HAIRI+GWG E
Sbjct: 164 YAVSSNEADIQIEIYKNGPVEGAFTVYEDFLQYKSGVYKHVTGDAVGGHAIRILGWGVE 222
>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
Length = 346
Score = 261 bits (667), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 123/241 (51%), Positives = 158/241 (65%), Gaps = 9/241 (3%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FD+R WP CP+I+ IRDQ SCGS WA GA EAM+DR+CIAS+G +S+DDL+
Sbjct: 93 IPSSFDSRTQWPNCPSIKSIRDQSSCGSCWAFGAAEAMTDRICIASKGAIQFTVSADDLL 152
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQ 214
SCC +CG GC GGF AW YWV GIVSGG+Y SK GC+PY PCE + NG+H C
Sbjct: 153 SCCDECGFGCDGGFPYAAWNYWVEKGIVSGGSYTSKSGCKPYPFPPCEHHTNGTHYHPCP 212
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+ T C KCQ GY +Y +D +G AY++ A + I +EI HGPVE + +Y D
Sbjct: 213 KDLYPTNTCEHKCQSGYATAYTNDKRYGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYED 272
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
Y GIYKH AG LG HA+++IGWG E + + YW+ +NS+N++WGENG FR
Sbjct: 273 FEHYLKGIYKHTAGSYLGGHAVKMIGWGTE-------NGIPYWICSNSWNSDWGENGFFR 325
Query: 335 I 335
I
Sbjct: 326 I 326
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 79/180 (43%), Positives = 108/180 (60%), Gaps = 16/180 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSY 377
W E G+ + GC+PY P CE + NG+ C + T C KCQ GY +Y
Sbjct: 174 WVEKGIVSGGSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTCEHKCQSGYATAY 233
Query: 378 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 437
+D +G AY++ A + I +EI HGPVE + +Y D Y GIYKH AG LG HA
Sbjct: 234 TNDKRYGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKGIYKHTAGSYLGGHA 293
Query: 438 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+++IGWG E + + YW+ +NS+N++WGENG FRI+RG +ECGIE+ + AGLPKI
Sbjct: 294 VKMIGWGTE-------NGIPYWICSNSWNSDWGENGFFRILRGTDECGIESGVVAGLPKI 346
>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
Length = 332
Score = 261 bits (666), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 137/287 (47%), Positives = 189/287 (65%), Gaps = 15/287 (5%)
Query: 51 FYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYC 110
+ A +N + + + L MGV P+ K + LP + E LP FDAR +WP C
Sbjct: 40 IWKAGRNFHPETSSNYLRSLMGVLPNHK---DHLPPPLPSLLGTEALPSDFDAREHWPNC 96
Query: 111 PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF 170
P+I+ IRDQGSCGS WA GA EAMSDR+CI + ++V +S+++L+SCC CG GC GGF
Sbjct: 97 PSIRLIRDQGSCGSCWAFGAAEAMSDRICIHT--NKNVNISAENLLSCCYSCGFGCNGGF 154
Query: 171 HGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQ- 228
G AWKYW + G+VSGG Y S GC+PY+I PCE ++NG+ C + TP+C R C+
Sbjct: 155 PGAAWKYWTSKGLVSGGLYGSHSGCQPYDIEPCEHHVNGTRQPCAEGG-RTPKCHRTCEN 213
Query: 229 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG 288
Y V Y+ DL+FGR +YS+ ++ + I EI +GPVE + ++Y+D + K+G+Y+HV G
Sbjct: 214 ENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDNGPVEAAFSVYSDFMNDKSGVYRHVKG 273
Query: 289 GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HAIRI+GWG E +GT YWLVANS+NT+WG+ G F+I
Sbjct: 274 SLLGGHAIRILGWGVE---KGT----PYWLVANSWNTDWGDKGTFKI 313
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/163 (48%), Positives = 112/163 (68%), Gaps = 10/163 (6%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPAN 393
GC+PY+I PCE ++NG+R C A TP+C R C+ Y V Y+ DL+FGR +YS+ ++
Sbjct: 178 GCQPYDIEPCEHHVNGTRQPC-AEGGRTPKCHRTCENENYSVPYDKDLSFGRSSYSIRSD 236
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ I EI +GPVE + ++Y+D + K+G+Y+HV G LG HAIRI+GWG E +GT
Sbjct: 237 PKQIQLEIMDNGPVEAAFSVYSDFMNDKSGVYRHVKGSLLGGHAIRILGWGVE---KGT- 292
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+ G F+I+RG + CGIE + GLP+
Sbjct: 293 ---PYWLVANSWNTDWGDKGTFKILRGSDHCGIEGSVVTGLPR 332
>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
Length = 342
Score = 261 bits (666), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 127/265 (47%), Positives = 167/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L Q R P V D E+P FD+R WP C +I +IRDQ C S WA+ AV
Sbjct: 66 GRREDPNLRQKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK+CG+GC GG G +W YWV GIV+GG+ +
Sbjct: 125 AAMSDRICIQSGGKQSVELSAIDLISCCKNCGSGCDGGVTGYSWDYWVKHGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +YS+
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIG 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I +EI +GPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESAIQKEIMMYGPVEAYLQIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
S YWL AN++N +WGE G FRI
Sbjct: 302 S----YWLAANTWNEDWGEKGYFRI 322
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 75/162 (46%), Positives = 102/162 (62%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY P C+ ++ G +C TP+C + CQ GY+ SYE D ++G +YS+ E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI +GPVE + IY D + YK+GIY++ G + HA+R+IGWG E GTS
Sbjct: 247 SAIQKEIMMYGPVEAYLQIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGTS- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL AN++N +WGE G FRIVRG++EC IE+ I AG K
Sbjct: 303 ---YWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAGQIK 341
>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
Length = 342
Score = 261 bits (666), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
Length = 247
Score = 261 bits (666), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 124/235 (52%), Positives = 164/235 (69%), Gaps = 9/235 (3%)
Query: 102 DARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD 161
D+R WP CP+I EIRDQGSCGS WA GAVEAMSDR CI S GK + +S +DL+SCC
Sbjct: 1 DSREQWPDCPSISEIRDQGSCGSCWAFGAVEAMSDRHCIHSNGKVKIEVSPEDLLSCCSS 60
Query: 162 CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNT 220
CG GC GGF AW++WV GI +GG + S GC+PYEIP CE + G C D +T
Sbjct: 61 CGMGCDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPYEIPACEHHTTGDRPPCSDI-VDT 119
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
P+C+ C+ GY+ SY DD +FG+ +YS+ + E+ I EIF++GPVEG+ ++Y+D I YK+
Sbjct: 120 PKCVHLCEKGYNTSYRDDKHFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSDFINYKS 179
Query: 281 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y+H +G LG HAIR++GWG E + V YWL ANS+NT+WG+ G F+I
Sbjct: 180 GVYQHHSGESLGGHAIRVLGWGYE-------NDVPYWLCANSWNTDWGDKGYFKI 227
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 82/164 (50%), Positives = 118/164 (71%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGC+PYEIP CE + G R C ++ +TP+C+ C+ GY+ SY DD +FG+ +YS+ +
Sbjct: 91 HIGCQPYEIPACEHHTTGDRPPC-SDIVDTPKCVHLCEKGYNTSYRDDKHFGKKSYSIES 149
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
E+ I EIF++GPVEG+ ++Y+D I YK+G+Y+H +G LG HAIR++GWG E
Sbjct: 150 LEQQIQTEIFKNGPVEGAFSVYSDFINYKSGVYQHHSGESLGGHAIRVLGWGYE------ 203
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWL ANS+NT+WG+ G F+I+RG +ECGIE+ I AG+PK
Sbjct: 204 -NDVPYWLCANSWNTDWGDKGYFKILRGSDECGIESSIVAGIPK 246
>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 261 bits (666), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 122/260 (46%), Positives = 180/260 (69%), Gaps = 10/260 (3%)
Query: 78 KLPQNRLPLLVQLSDPLEELPEGFDARINW-PYCPTIQEIRDQGSCGSGWALGAVEAMSD 136
K P ++ + LS + +LP FDAR W CP++ E+RDQG CGS WA GA EAM+D
Sbjct: 59 KTPLSKKLPIKDLSKEVHDLPIEFDARKEWGSICPSLLEVRDQGECGSCWAFGAAEAMTD 118
Query: 137 RVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCR 196
R+CIA++GK VR+S++DL++CC CG GC GG+ AW+++ T GIV+GG Y S +GC+
Sbjct: 119 RICIATKGKNQVRISTEDLLTCCDSCGFGCNGGYPQSAWEFFKTKGIVTGGPYNSHKGCQ 178
Query: 197 PYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETI 255
PY IP C+ ++ S + C + P TP+C + C+ GY+++Y++D ++G +YS+ ++ I
Sbjct: 179 PYAIPACDHHVPHSKNPCNGSLP-TPKCEKVCEKGYNITYKNDKHYGVTSYSINNDQNEI 237
Query: 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 315
MREI +GPVE + T++AD YK+G+Y+HV+G LG HAI+I+GWG E +
Sbjct: 238 MREIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGGHAIKILGWGVE-------NNTP 290
Query: 316 YWLVANSFNTNWGENGLFRI 335
YWLVANS+N +WG+NG F+I
Sbjct: 291 YWLVANSWNPSWGDNGFFKI 310
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 117/163 (71%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY IP C+ ++ S++ C + P TP+C + C+ GY+++Y++D ++G +YS+ ++
Sbjct: 176 GCQPYAIPACDHHVPHSKNPCNGSLP-TPKCEKVCEKGYNITYKNDKHYGVTSYSINNDQ 234
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
IMREI +GPVE + T++AD YK+G+Y+HV+G LG HAI+I+GWG E +
Sbjct: 235 NEIMREIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGGHAIKILGWGVE-------N 287
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLVANS+N +WG+NG F+I+RG +ECGIE ++ AGLPK+
Sbjct: 288 NTPYWLVANSWNPSWGDNGFFKILRGSDECGIEDEVVAGLPKV 330
>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
Length = 383
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 124/241 (51%), Positives = 163/241 (67%), Gaps = 9/241 (3%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+PE FDAR +WP C +++ +RDQ SCGS WA+ AVEAMSDR+CI S+GK+ V LS+DDL+
Sbjct: 123 IPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLL 182
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQ 214
SCCK CG GC GG AWKYWV GIV+G Y + GCRPY PCE + N +H C+
Sbjct: 183 SCCKTCGFGCFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCK 242
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+ TP+C++KC Y SY+ D +G Y++ +N E+I +EI GPVE S +Y D
Sbjct: 243 HDLYPTPKCVKKCDKNYGKSYKADKYYGEQVYNVESNVESIQKEIMTLGPVEASFEVYTD 302
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ Y GIYKHVAG G HA++++GWG + +G V YWL ANS+NT+WGE+G FR
Sbjct: 303 FLYYTGGIYKHVAGSMGGGHAVKVLGWG---IDQG----VPYWLAANSWNTDWGEDGYFR 355
Query: 335 I 335
I
Sbjct: 356 I 356
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 85/198 (42%), Positives = 118/198 (59%), Gaps = 17/198 (8%)
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQAN 358
+G EP+ + KYW++ G GCRPY P CE + N + C+ +
Sbjct: 193 FGGEPM-----AAWKYWVLRGIVT---GSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHD 244
Query: 359 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 418
TP+C++KC Y SY+ D +G Y++ +N E+I +EI GPVE S +Y D +
Sbjct: 245 LYPTPKCVKKCDKNYGKSYKADKYYGEQVYNVESNVESIQKEIMTLGPVEASFEVYTDFL 304
Query: 419 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIV 478
Y GIYKHVAG G HA++++GWG + +G V YWL ANS+NT+WGE+G FRI+
Sbjct: 305 YYTGGIYKHVAGSMGGGHAVKVLGWG---IDQG----VPYWLAANSWNTDWGEDGYFRIL 357
Query: 479 RGQNECGIEADITAGLPK 496
RG NECGIE+ I AG+PK
Sbjct: 358 RGVNECGIESGIIAGIPK 375
>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
Length = 331
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 126/243 (51%), Positives = 173/243 (71%), Gaps = 12/243 (4%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
E +P+ FDAR +WP CP+I+ IRDQGSCGS WA GA EAMSDRVCI + ++V +S+++
Sbjct: 80 ESIPDTFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRVCIHTH--KNVNISAEN 137
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC 213
L+SCC CG GC GGF G AW++W G+VSGG Y S +GC+PY I PCE ++NG+ C
Sbjct: 138 LLSCCYTCGFGCNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNGTRKPC 197
Query: 214 QDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
+ TP+C + C Y +SYE DL+FGR +YS+ ++ + I +I +GPVE + ++Y
Sbjct: 198 AEGG-RTPKCHKTCDNKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTNGPVEAAFSVY 256
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
+D + YK+G+Y+HV G LG HAIRI+GWG E +GT YWLVANS+NT+WG+NG
Sbjct: 257 SDFMSYKSGVYRHVKGSLLGGHAIRILGWGME---KGT----PYWLVANSWNTDWGDNGT 309
Query: 333 FRI 335
F+I
Sbjct: 310 FKI 312
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 81/163 (49%), Positives = 114/163 (69%), Gaps = 10/163 (6%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPAN 393
GC+PY I PCE ++NG+R C A TP+C + C Y +SYE DL+FGR +YS+ ++
Sbjct: 177 GCQPYLIEPCEHHVNGTRKPC-AEGGRTPKCHKTCDNKNYPISYEKDLSFGRSSYSIRSD 235
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ I +I +GPVE + ++Y+D + YK+G+Y+HV G LG HAIRI+GWG E +GT
Sbjct: 236 PKQIQMDIMTNGPVEAAFSVYSDFMSYKSGVYRHVKGSLLGGHAIRILGWGME---KGT- 291
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RG + CGIE + AGLP+
Sbjct: 292 ---PYWLVANSWNTDWGDNGTFKILRGSDHCGIEDSVVAGLPR 331
>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRKEDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 78/182 (42%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC I+++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIDSEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
Length = 342
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
Length = 339
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 175/255 (68%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V + ++ LPE FDAR W CPTI +IRDQGSCGS WA GAVEA+SDR CI +
Sbjct: 67 KLPGRVAFGEDID-LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G+ +V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP
Sbjct: 126 NGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP C + C+ GY SY++D +FG +YS+ + + IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++ PVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG +G G V YWL A
Sbjct: 245 KNDPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWG---VGNG----VPYWLAA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG+NG F+I
Sbjct: 298 NSWNLDWGDNGFFKI 312
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 81/164 (49%), Positives = 117/164 (71%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GC PY IP CE ++NGSR C E +TP C + C+ GY SY++D +FG +YS+
Sbjct: 176 HVGCLPYTIPPCEHHVNGSRPPCTG-EGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + IM EI+++ PVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG +G G
Sbjct: 235 SVKEIMAEIYKNDPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWG---VGNG- 290
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL ANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 291 ---VPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
Length = 342
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
Length = 342
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 125/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P + D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-IDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRKEDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAIDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRKEDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRKEDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMVHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYIEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
Length = 339
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 127/255 (49%), Positives = 174/255 (68%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V + ++ LPE FDAR W CPTI +IRDQGSCGS WA GAVEA+SDR CI +
Sbjct: 67 KLPGRVAFGEDID-LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G+ +V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP
Sbjct: 126 NGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +T C + C+ GY SY++D +FG +YS+ + + IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTHRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E + V YWL A
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVE-------NGVPYWLAA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG+NG F+I
Sbjct: 298 NSWNLDWGDNGFFKI 312
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 80/164 (48%), Positives = 116/164 (70%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GC PY IP CE ++NGSR C E +T C + C+ GY SY++D +FG +YS+
Sbjct: 176 HVGCLPYTIPPCEHHVNGSRPPCTG-EGDTHRCNKSCEAGYSPSYKEDKHFGYTSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E
Sbjct: 235 SVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWL ANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 289 -NGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 78/182 (42%), Positives = 110/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AG
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGR 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
Length = 342
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 124/265 (46%), Positives = 170/265 (64%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G DS L Q R P V D E+P FD+R WP C +I +IRDQ C S WA+ +V
Sbjct: 66 GRKEDSNLRQKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSSV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK+CG+GC GG+ +W YWV+ GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAIDLISCCKNCGSGCDGGYFLPSWDYWVSHGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 75/162 (46%), Positives = 105/162 (64%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY P C+ ++ G +C TP+C + CQ GY+ SYE D ++G +Y++ + E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT+
Sbjct: 247 SVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGTA- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL AN++N +WGE G FRIVRG+NEC IE++I AGL K
Sbjct: 303 ---YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIK 341
>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
Length = 339
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 127/255 (49%), Positives = 174/255 (68%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V + ++ LPE FDAR W CPTI +IRDQGSCGS WA GAVEA+SDR CI +
Sbjct: 67 KLPGRVAFGEDID-LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G+ +V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP
Sbjct: 126 NGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP C + C+ GY SY++D +FG +YS+ + + IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+ WG E + V YWL A
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVE-------NGVPYWLAA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG+NG F+I
Sbjct: 298 NSWNLDWGDNGFFKI 312
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 81/164 (49%), Positives = 116/164 (70%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGC PY IP CE ++NGSR C E +TP C + C+ GY SY++D +FG +YS+
Sbjct: 176 HIGCLPYTIPPCEHHVNGSRPPCTG-EGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+ WG E
Sbjct: 235 SVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWL ANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 289 -NGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
Length = 342
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 168/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLG 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 IESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 110/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
Length = 309
Score = 259 bits (662), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 169/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 33 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 91
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 92 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 151
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 152 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 211
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 212 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 268
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 269 A----YWLAANTWNEDWGEKGYFRI 289
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 137 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 193
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 194 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 253
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 254 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 306
Query: 495 PK 496
K
Sbjct: 307 IK 308
>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
Length = 342
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 125/265 (47%), Positives = 168/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 GESVFQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 78/182 (42%), Positives = 110/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSGESVFQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
Length = 339
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 179/255 (70%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V+ + + LP+ FDAR W +CPTI+EIRDQGSCGS WA GAVE++SDR+CI +
Sbjct: 67 KLPQRVKFAKDMN-LPKSFDAREQWSHCPTIKEIRDQGSCGSCWAFGAVESISDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGG-FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G V +S++DL++CC G + +AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NGHVSVEVSAEDLLTCCGGQCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS +C E +TP+C + C+PGY +Y++D +FG +YSLP NE IM EI+
Sbjct: 186 PCEHHVNGSRPACT-GEGDTPKCSKTCEPGYSPTYKEDKHFGYTSYSLPTNEWEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ ++Y+D +LYK+G+Y+H+ G +G HAIRI+GWG+E + V YWLVA
Sbjct: 245 KNGPVEGAFSVYSDFLLYKSGVYQHLTGDMMGGHAIRILGWGEE-------NGVPYWLVA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+ G FRI
Sbjct: 298 NSWNTDWGDGGFFRI 312
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 86/164 (52%), Positives = 125/164 (76%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR +C E +TP+C + C+PGY +Y++D +FG +YSLP
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPACTG-EGDTPKCSKTCEPGYSPTYKEDKHFGYTSYSLPT 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE IM EI+++GPVEG+ ++Y+D +LYK+G+Y+H+ G +G HAIRI+GWG+E
Sbjct: 235 NEWEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDMMGGHAIRILGWGEE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWLVANS+NT+WG+ G FRI+RGQ+ CGIE+++ AG+P+
Sbjct: 289 -NGVPYWLVANSWNTDWGDGGFFRILRGQDHCGIESEVVAGIPR 331
>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
Length = 342
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 125/265 (47%), Positives = 168/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P + D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-IDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLG 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 IESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 110/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
Length = 319
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 125/241 (51%), Positives = 163/241 (67%), Gaps = 9/241 (3%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+PE FDAR NWP C +++ IRDQ SCGS WA+ AVEAMSDR+CI S+GK+ V LS+DDL+
Sbjct: 77 IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLL 136
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQ 214
SCCK CG GC GG AWKYWV +GIV+G Y + GCRPY PCE + N +H C+
Sbjct: 137 SCCKTCGFGCFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEPCK 196
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+ TP+C ++C Y SY+ D +G AY++ + E+I +EI GPVE S +Y D
Sbjct: 197 HDLYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTD 256
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ Y +GIYKHVAG G HA++I+GWG + +G V YWL ANS+N +WGE+G FR
Sbjct: 257 FLHYTSGIYKHVAGSVGGGHAVKILGWG---IDQG----VSYWLAANSWNNDWGEDGYFR 309
Query: 335 I 335
I
Sbjct: 310 I 310
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 76/188 (40%), Positives = 111/188 (59%), Gaps = 17/188 (9%)
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQAN 358
+G EP+ + KYW+++ G + GCRPY P CE + N + C+ +
Sbjct: 147 FGGEPM-----AAWKYWVLSGIVT---GSDYTNHSGCRPYPFPPCEHHSNKTHYEPCKHD 198
Query: 359 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 418
TP+C ++C Y SY+ D +G AY++ + E+I +EI GPVE S +Y D +
Sbjct: 199 LYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFL 258
Query: 419 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIV 478
Y +GIYKHVAG G HA++I+GWG + +G V YWL ANS+N +WGE+G FRI+
Sbjct: 259 HYTSGIYKHVAGSVGGGHAVKILGWG---IDQG----VSYWLAANSWNNDWGEDGYFRIL 311
Query: 479 RGQNECGI 486
RG +ECG+
Sbjct: 312 RGADECGM 319
>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 258 bits (660), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 125/265 (47%), Positives = 168/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRKEDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGP E + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPAEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 78/182 (42%), Positives = 110/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGP E + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 258 bits (659), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 125/265 (47%), Positives = 168/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
CRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTSCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 78/182 (42%), Positives = 110/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN CRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTSCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
Length = 225
Score = 258 bits (659), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 122/223 (54%), Positives = 162/223 (72%), Gaps = 3/223 (1%)
Query: 84 LPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASR 143
LPL S + LP+ FD+R WP CPTI+EIRDQGSCGS WA GAVE+MSDRVC+ S
Sbjct: 1 LPLKTSFSGNWK-LPDNFDSRTQWPNCPTIREIRDQGSCGSCWAFGAVESMSDRVCVHSG 59
Query: 144 GKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP- 201
GK++V +S++DL+SCC +CG GC GG+ AW+YW G+VSGG Y S GCRPY IP
Sbjct: 60 GKQNVEVSAEDLLSCCGFECGMGCNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPP 119
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS SC +TP+C++KC GY +YE D +G+ AYS+P++ E+IM EI++
Sbjct: 120 CEHHVNGSRPSCSGEGGDTPKCVQKCDSGYTPAYEKDKIYGQSAYSVPSSPESIMEEIYK 179
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
GPVEG+ T+Y D +LYK+G+Y+H G +G HAI+I+GWG E
Sbjct: 180 DGPVEGAFTVYEDFLLYKSGVYQHHTGEAVGGHAIKILGWGIE 222
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 64/128 (50%), Positives = 88/128 (68%), Gaps = 8/128 (6%)
Query: 327 WGENGLFR-------IGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E GL IGCRPY IP CE ++NGSR SC +TP+C++KC GY +YE
Sbjct: 95 WTEKGLVSGGLYGSGIGCRPYTIPPCEHHVNGSRPSCSGEGGDTPKCVQKCDSGYTPAYE 154
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D +G+ AYS+P++ E+IM EI++ GPVEG+ T+Y D +LYK+G+Y+H G +G HAI
Sbjct: 155 KDKIYGQSAYSVPSSPESIMEEIYKDGPVEGAFTVYEDFLLYKSGVYQHHTGEAVGGHAI 214
Query: 439 RIIGWGQE 446
+I+GWG E
Sbjct: 215 KILGWGIE 222
>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 333
Score = 258 bits (659), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 126/256 (49%), Positives = 175/256 (68%), Gaps = 13/256 (5%)
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
++ LP ++S + +P+ FDAR W CP+I +IRDQGSCGS WALGAVEAMSDR C+
Sbjct: 68 KHTLPYKEKVS--VGAIPDTFDARQKWSDCPSISDIRDQGSCGSCWALGAVEAMSDRYCV 125
Query: 141 ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI 200
+ + +V +S+++L++CCK CGNGC GGF +AW+YWV G+V+GG Y S +GC+PY I
Sbjct: 126 SFQ--ENVHISAENLMTCCKFCGNGCAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLI 183
Query: 201 P-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 259
P C + G + +C E TP+C R C+ GY SYE DL++G AY++ E I EI
Sbjct: 184 PKCNHHEPGPYENCT-GEGKTPQCERTCRSGYTTSYEADLHYGEKAYAVHREVEAIQTEI 242
Query: 260 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 319
+GPVEG+ T+Y+D YK+G+Y+HV G LG HAIRI+GWG E + V YWL+
Sbjct: 243 MTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRILGWGTE-------NGVPYWLI 295
Query: 320 ANSFNTNWGENGLFRI 335
ANS+N +WG+ G F++
Sbjct: 296 ANSWNPSWGDKGYFKM 311
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 78/162 (48%), Positives = 108/162 (66%), Gaps = 9/162 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY IP C + G +C E TP+C R C+ GY SYE DL++G AY++
Sbjct: 177 GCQPYLIPKCNHHEPGPYENC-TGEGKTPQCERTCRSGYTTSYEADLHYGEKAYAVHREV 235
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E I EI +GPVEG+ T+Y+D YK+G+Y+HV G LG HAIRI+GWG E +
Sbjct: 236 EAIQTEIMTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRILGWGTE-------N 288
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL+ANS+N +WG+ G F+++RG+++CGIE++I AG PK
Sbjct: 289 GVPYWLIANSWNPSWGDKGYFKMIRGKDDCGIESNIVAGTPK 330
>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
Length = 342
Score = 258 bits (659), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 125/265 (47%), Positives = 167/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L Q R P V D E+P FD+R WP C +I +IRDQ C S WA+ AV
Sbjct: 66 GRKEDPNLRQKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCC++CG+GC GG G +W YWV GIV+GG+ +
Sbjct: 125 AAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGVTGYSWDYWVKHGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +YS+
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIG 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I +EI +GPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESAIQKEIMMYGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 74/162 (45%), Positives = 102/162 (62%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY P C+ ++ G +C TP+C + CQ GY+ SYE D ++G +YS+ E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI +GPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT+
Sbjct: 247 SAIQKEIMMYGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVE---NGTA- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL AN++N +WGE G FRIVRG++EC IE+ I AG K
Sbjct: 303 ---YWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAGQIK 341
>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
Length = 309
Score = 258 bits (658), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 136/279 (48%), Positives = 177/279 (63%), Gaps = 15/279 (5%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
L + ++ E++ V PD + P +S E LP+ FD+R+ WP CPTI+EIRD
Sbjct: 20 LGMMGINYSELKPNVTPDLEPP-------FVVSKISENLPDEFDSRVRWPNCPTIREIRD 72
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
QGSCG+ WA A EAMSDRVCI S +H S+ +L+SCC C GC G H AW +W
Sbjct: 73 QGSCGACWAFAAAEAMSDRVCIHSSQTKHFHFSALNLLSCCDSCEKGCLGCDHHLAWDHW 132
Query: 179 VTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
V GIVSGG+Y SK+GC+PY + PCE + G +C P TP C R CQP Y +SYED
Sbjct: 133 VKHGIVSGGSYGSKEGCQPYHLPPCEHHRAGPRRNCTKYGP-TPSCARVCQPDYKISYED 191
Query: 238 DLNFGRIAYSL-PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
DL+FG+ Y+L P NE+ I EIF +GPVE +M Y D Y++GIY H+ G + +HA+
Sbjct: 192 DLHFGKQWYALAPHNEKIIRTEIFHNGPVEATMAAYEDFYTYESGIYHHIEGTFVCDHAV 251
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+IIGWG + + YWLVANSFNT+WGE G F+I
Sbjct: 252 KIIGWGTD-----KKTNTPYWLVANSFNTDWGEYGFFKI 285
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 85/162 (52%), Positives = 108/162 (66%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL-PAN 393
GC+PY +P CE + G R +C P TP C R CQP Y +SYEDDL+FG+ Y+L P N
Sbjct: 148 GCQPYHLPPCEHHRAGPRRNCTKYGP-TPSCARVCQPDYKISYEDDLHFGKQWYALAPHN 206
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E+ I EIF +GPVE +M Y D Y++GIY H+ G + +HA++IIGWG +
Sbjct: 207 EKIIRTEIFHNGPVEATMAAYEDFYTYESGIYHHIEGTFVCDHAVKIIGWGTD-----KK 261
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YWLVANSFNT+WGE G F+I RG NECGIE ITAG+P
Sbjct: 262 TNTPYWLVANSFNTDWGEYGFFKIKRGVNECGIENKITAGIP 303
>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
Length = 342
Score = 258 bits (658), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 125/265 (47%), Positives = 167/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ C S WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK+CG+GC GG G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAIDLISCCKNCGSGCDGGVTGYSWDYWVKHGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGEFSYNVIG 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I +EI +GPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 245 VESVIQKEIMMYGPVEAYLHIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
S YWL AN++N +WGE G FRI
Sbjct: 302 S----YWLAANTWNEDWGEKGYFRI 322
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 74/162 (45%), Positives = 102/162 (62%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY P C+ ++ G +C TP+C + CQ GY+ SYE D ++G +Y++ E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGEFSYNVIGVE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI +GPVE + IY D + YK+GIY++ G + HA+R+IGWG E GTS
Sbjct: 247 SVIQKEIMMYGPVEAYLHIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGTS- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL AN++N +WGE G FRIVRG++EC IE+ I AG K
Sbjct: 303 ---YWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAGQIK 341
>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 340
Score = 258 bits (658), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 129/275 (46%), Positives = 169/275 (61%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGVHPDS-KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + ++MG + L + R P V +D E+P FD+R WP C +I IRDQ
Sbjct: 55 SLDDARIQMGARREEPDLRRTRRPT-VDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSR 113
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAMSDR CI S GK++V LS+ DL+SCC+ CG GC+GG G AW YWV
Sbjct: 114 CGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCESCGLGCEGGILGPAWDYWVKE 173
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+G + + GC PY P CE + G + C TP C + CQ Y Y D +
Sbjct: 174 GIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKH 233
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
G+ +Y++ +E+ I +EI ++GPVE T+Y D + YK+GIYKH+ G LG HAIRIIG
Sbjct: 234 RGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIG 293
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E + YWL+ANS+N +WGENG FRI
Sbjct: 294 WGVE-------NKTPYWLIANSWNEDWGENGYFRI 321
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 79/179 (44%), Positives = 107/179 (59%), Gaps = 11/179 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ + EN GC PY P CE + G C + TP C + CQ Y
Sbjct: 169 YWVKEGIVTGSSKEN---HTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYK 225
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
Y D + G+ +Y++ +E+ I +EI ++GPVE T+Y D + YK+GIYKH+ G LG
Sbjct: 226 TPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLG 285
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
HAIRIIGWG E + YWL+ANS+N +WGENG FRIVRG++EC IE+++TAG
Sbjct: 286 GHAIRIIGWGVE-------NKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 337
>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 132/298 (44%), Positives = 183/298 (61%), Gaps = 12/298 (4%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A N +++S + +GVHP S+ + RL V P ++LPE FDAR W +C
Sbjct: 43 WKAGSNFDKCISMSYIRGLLGVHPKSE--EYRLAEFVHEEIP-DDLPESFDARAKWSHCD 99
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
+I IRDQ +CGS WA GA EAMSDR+CI S+GK V +S++DL+ CC CG+GC+GGF
Sbjct: 100 SIHLIRDQSTCGSCWAFGATEAMSDRICIHSKGKMQVNISAEDLLDCCDTCGHGCKGGFP 159
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW++W GIVSGG Y + GC+PY + PCE + +C +TPEC+ C+ G
Sbjct: 160 AAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEYHTKCRIPNCIPI-VHTPECVHHCRKG 218
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
YD Y++D +FG+ YS+ +E+ I EIF +GPVE +Y D + YK+G+Y+ +
Sbjct: 219 YDKDYQEDKHFGQKVYSISRDEKQIQTEIFTNGPVEADFHVYGDFLCYKSGVYQRHSNDG 278
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYM 348
G HAIRI+GWG E GT YWL ANS+N NWG+ G F+I R E E ++
Sbjct: 279 RGMHAIRILGWGTE---NGTP----YWLAANSWNENWGDKGYFKILRRTNECGIEEHI 329
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 102/163 (62%), Gaps = 9/163 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY + PCE + +C +TPEC+ C+ GYD Y++D +FG+ YS+ +E
Sbjct: 182 GCKPYSLAPCEYHTKCRIPNC-IPIVHTPECVHHCRKGYDKDYQEDKHFGQKVYSISRDE 240
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EIF +GPVE +Y D + YK+G+Y+ + G HAIRI+GWG E GT
Sbjct: 241 KQIQTEIFTNGPVEADFHVYGDFLCYKSGVYQRHSNDGRGMHAIRILGWGTE---NGTP- 296
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL ANS+N NWG+ G F+I+R NECGIE I AG+PKI
Sbjct: 297 ---YWLAANSWNENWGDKGYFKILRRTNECGIEEHIYAGIPKI 336
>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
Length = 340
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 137/329 (41%), Positives = 187/329 (56%), Gaps = 17/329 (5%)
Query: 9 VATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELE 68
+A+ + LD S +N D D + + P + + N +L +
Sbjct: 8 IASLITHLDAHISIKNEKFKPLSD-----DIISYINEHPNAGWRAEKSNRFH--SLDDAR 60
Query: 69 MRMGVHPDS-KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWA 127
++MG + L + R P V ++ E+P FD+R WP C +I IRDQ CGS WA
Sbjct: 61 IQMGARREEPDLRRKRRPT-VDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWA 119
Query: 128 LGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGG 187
GAVEAMSDR CI S GK++V LS+ DL+SCC+ CG GC+GG G AW +WV GIV+G
Sbjct: 120 FGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCESCGLGCEGGILGPAWDFWVKEGIVTGS 179
Query: 188 TYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
+ + GC PY P CE + G + C TP C + CQ Y Y D + G+ +Y
Sbjct: 180 SKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSY 239
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 306
++ +E+ I +EI ++GPVE S T+Y D + YK+GIYKH+ G LG HAIRIIGWG E
Sbjct: 240 NVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVE-- 297
Query: 307 GEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL+ANS+N +WGENG FRI
Sbjct: 298 -----NKTPYWLIANSWNEDWGENGYFRI 321
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 74/158 (46%), Positives = 100/158 (63%), Gaps = 8/158 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY P CE + G C + TP C + CQ Y Y D + G+ +Y++ +E
Sbjct: 186 GCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDE 245
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +EI ++GPVE S T+Y D + YK+GIYKH+ G LG HAIRIIGWG E +
Sbjct: 246 KAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVE-------N 298
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITA 492
YWL+ANS+N +WGENG FRIVRG++EC IE+++ A
Sbjct: 299 KTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIA 336
>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sm31; Flags: Precursor
gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
Length = 340
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 128/275 (46%), Positives = 170/275 (61%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGVHPDS-KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + ++MG + L + R P V +D E+P FD+R WP C +I IRDQ
Sbjct: 55 SLDDARIQMGARREEPDLRRKRRPT-VDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSR 113
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS W+ GAVEAMSDR CI S GK++V LS+ DL++CC+ CG GC+GG G AW YWV
Sbjct: 114 CGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCESCGLGCEGGILGPAWDYWVKE 173
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+ + + GC PY P CE + G + C NTP C + CQ Y Y D +
Sbjct: 174 GIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKH 233
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
G+ +Y++ +E+ I +EI ++GPVE S T+Y D + YK+GIYKH+ G LG HAIRIIG
Sbjct: 234 RGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIG 293
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E + YWL+ANS+N +WGENG FRI
Sbjct: 294 WGVE-------NKTPYWLIANSWNEDWGENGYFRI 321
Score = 161 bits (407), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 80/179 (44%), Positives = 108/179 (60%), Gaps = 11/179 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ + EN GC PY P CE + G C + NTP C + CQ Y
Sbjct: 169 YWVKEGIVTASSKEN---HTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYK 225
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
Y D + G+ +Y++ +E+ I +EI ++GPVE S T+Y D + YK+GIYKH+ G LG
Sbjct: 226 TPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALG 285
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
HAIRIIGWG E + YWL+ANS+N +WGENG FRIVRG++EC IE+++ AG
Sbjct: 286 GHAIRIIGWGVE-------NKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVIAG 337
>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
Length = 341
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 125/281 (44%), Positives = 181/281 (64%), Gaps = 19/281 (6%)
Query: 59 LSKLTLSELEMRMGV-HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW-PYCPTIQEI 116
++ T ++ +MGV +LP+ + +L +LP FD+R W CP+ +EI
Sbjct: 56 FARATDDFIKSQMGVLEGGPQLPEKDIAVLA-------DLPTAFDSREQWGSTCPSTKEI 108
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAW 175
RDQ +CGS WA GAVE+M+DR+CIAS+G +S+ DL++CC CG+GC GG+ AW
Sbjct: 109 RDQAACGSCWAFGAVESMTDRICIASKGSLRPHISAQDLMTCCLFTCGSGCSGGYPSAAW 168
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
++ TTGIV+GG Y S QGC+PY +P C+ +++G + +C P TP C + C+ GY+ +
Sbjct: 169 SWFKTTGIVTGGNYNSSQGCQPYSLPNCDHHVSGQYPACSGEGP-TPACKKSCEAGYNNT 227
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
Y +D +FG AYS+ + I EI +GPVEG+ T+Y D++ YK+G+Y+H G LG H
Sbjct: 228 YSNDKHFGATAYSVAGEADKIATEIMTNGPVEGAFTVYEDLLTYKSGVYQHTTGQVLGGH 287
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
AI+IIGWG E S V YW VANS+N +WG+NG F+I
Sbjct: 288 AIKIIGWGVE-------SGVDYWWVANSWNNDWGDNGFFKI 321
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 109/163 (66%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C+ +++G +C P TP C + C+ GY+ +Y +D +FG AYS+
Sbjct: 187 GCQPYSLPNCDHHVSGQYPACSGEGP-TPACKKSCEAGYNNTYSNDKHFGATAYSVAGEA 245
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EI +GPVEG+ T+Y D++ YK+G+Y+H G LG HAI+IIGWG E S
Sbjct: 246 DKIATEIMTNGPVEGAFTVYEDLLTYKSGVYQHTTGQVLGGHAIKIIGWGVE-------S 298
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YW VANS+N +WG+NG F+I +G +ECGIE+ I AG+PK+
Sbjct: 299 GVDYWWVANSWNNDWGDNGFFKIKKGVDECGIESQIVAGMPKL 341
>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
Length = 335
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 129/241 (53%), Positives = 172/241 (71%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI S G+ +V +S++D++
Sbjct: 80 LPESFDARKQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139
Query: 157 SCCKDCGNGCQGG-FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC G F AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 140 TCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+PGY SY++D +FG +YS+ NE+ IM EI+++GPVEG+ ++Y+D
Sbjct: 200 -GEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSD 258
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+LYK+G+Y+HV+G +G HAIRI+GWG E GT YWLV NS+NT+WG+NG F+
Sbjct: 259 FLLYKSGVYQHVSGEIMGGHAIRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFK 311
Query: 335 I 335
I
Sbjct: 312 I 312
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 87/163 (53%), Positives = 123/163 (75%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D +FG +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVAN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV+G +G HAIRI+GWG E GT
Sbjct: 235 NEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P
Sbjct: 292 ----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 345
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 137/329 (41%), Positives = 187/329 (56%), Gaps = 17/329 (5%)
Query: 9 VATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELE 68
+A+ + LD S +N D D + + P + + N +L +
Sbjct: 13 IASLITHLDAHISIKNEKFKPLSD-----DIISYINEHPNAGWRAEKSNRFH--SLDDAR 65
Query: 69 MRMGVHPDS-KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWA 127
++MG + L + R P V ++ E+P FD+R WP C +I IRDQ CGS WA
Sbjct: 66 IQMGARREEPDLRRKRRPT-VDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWA 124
Query: 128 LGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGG 187
GAVEAMSDR CI S GK++V LS+ DL+SCC+ CG GC+GG G AW +WV GIV+G
Sbjct: 125 FGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCESCGLGCEGGILGPAWDFWVKEGIVTGS 184
Query: 188 TYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
+ + GC PY P CE + G + C TP C + CQ Y Y D + G+ +Y
Sbjct: 185 SKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSY 244
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 306
++ +E+ I +EI ++GPVE S T+Y D + YK+GIYKH+ G LG HAIRIIGWG E
Sbjct: 245 NVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVE-- 302
Query: 307 GEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL+ANS+N +WGENG FRI
Sbjct: 303 -----NKTPYWLIANSWNEDWGENGYFRI 326
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 101/159 (63%), Gaps = 8/159 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY P CE + G C + TP C + CQ Y Y D + G+ +Y++ +E
Sbjct: 191 GCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDE 250
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +EI ++GPVE S T+Y D + YK+GIYKH+ G LG HAIRIIGWG E +
Sbjct: 251 KAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVE-------N 303
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
YWL+ANS+N +WGENG FRIVRG++EC IE+++ AG
Sbjct: 304 KTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIAG 342
>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
Length = 335
Score = 256 bits (653), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 129/241 (53%), Positives = 172/241 (71%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI S G+ +V +S++D++
Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139
Query: 157 SCCKDCGNGCQGG-FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC G F AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 140 TCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 199
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+PGY SY++D +FG +YS+ NE+ IM EI+++GPVEG+ ++Y+D
Sbjct: 200 -GEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSD 258
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+LYK+G+Y+HV+G +G HAIRI+GWG E GT YWLV NS+NT+WG+NG F+
Sbjct: 259 FLLYKSGVYQHVSGEIMGGHAIRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFK 311
Query: 335 I 335
I
Sbjct: 312 I 312
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 87/163 (53%), Positives = 123/163 (75%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D +FG +YS+
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVAN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV+G +G HAIRI+GWG E GT
Sbjct: 235 NEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P
Sbjct: 292 ----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
Length = 325
Score = 256 bits (653), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 127/259 (49%), Positives = 166/259 (64%), Gaps = 10/259 (3%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
T+S++ +G PD Q L L ELP+ FDAR W +CP+I EIRDQ SC
Sbjct: 62 TVSDIRRMLGALPDPNGEQLET-LCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSC 120
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVEAMSDR+CI S+GK LS+++LVSCC CG GC GGF AW YW G
Sbjct: 121 GSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKNQG 180
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IV+G Y + GC+PYE PCE + G C D + TP C R CQ GY+VSYE+D +
Sbjct: 181 IVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVC-DGDVETPPCKRTCQAGYNVSYENDKWY 239
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G++ Y + +N+E IM+E+ +HGPVE +YAD YK+G+Y+HV+G LG HA+R++GW
Sbjct: 240 GKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGW 299
Query: 302 GQEPLGEGTSSVVKYWLVA 320
G+E + V YWL+A
Sbjct: 300 GEE-------NNVPYWLIA 311
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 59/128 (46%), Positives = 84/128 (65%), Gaps = 9/128 (7%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYE P CE + G C + TP C R CQ GY+VSYE+D +G++ Y + +N+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDG-DVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E IM+E+ +HGPVE +YAD YK+G+Y+HV+G LG HA+R++GWG+E +
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEE-------N 303
Query: 455 VVKYWLVA 462
V YWL+A
Sbjct: 304 NVPYWLIA 311
>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 125/265 (47%), Positives = 168/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLS 244
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IG G E GT
Sbjct: 245 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGCGVE---NGT 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 302 A----YWLAANTWNEDWGEKGYFRI 322
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 78/182 (42%), Positives = 110/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 170 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYN 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 227 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYIS 286
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IG G E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 287 GHAVRLIGCGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGL 339
Query: 495 PK 496
K
Sbjct: 340 IK 341
>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
Length = 309
Score = 255 bits (651), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 123/265 (46%), Positives = 168/265 (63%), Gaps = 9/265 (3%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ C S WA+ AV
Sbjct: 33 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAV 91
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK+CG+GC GG G +W YWV+ GIV+GG+ +
Sbjct: 92 GAMSDRICIQSGGKQSVELSAIDLISCCKNCGSGCDGGVTGYSWDYWVSHGIVTGGSKEN 151
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 152 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLS 211
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HG VE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 212 VESVIQKDIMMHGTVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 268
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 269 A----YWLAANTWNEDWGEKGYFRI 289
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 74/162 (45%), Positives = 104/162 (64%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY P C+ ++ G +C TP+C + CQ GY+ SYE D ++G +Y++ + E
Sbjct: 154 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVE 213
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I ++I HG VE + IY D + YK+GIY++ G + HA+R+IGWG E GT+
Sbjct: 214 SVIQKDIMMHGTVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGTA- 269
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL AN++N +WGE G FRIVRG+NEC IE++I AGL K
Sbjct: 270 ---YWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIK 308
>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
Length = 384
Score = 254 bits (650), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 125/244 (51%), Positives = 163/244 (66%), Gaps = 12/244 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+PE FDAR NWP C +++ IRDQ SCGS WA+ AVEAMSDR+CI S+GK+ V LS+DDL+
Sbjct: 121 IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLL 180
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQ 214
SCCK CG GC GG AWKYWV +GIV+G Y + GCRPY PCE + N +H C+
Sbjct: 181 SCCKTCGFGCFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEPCK 240
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+ TP+C ++C Y SY+ D +G AY++ + E+I +EI GPVE S +Y D
Sbjct: 241 HDLYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTD 300
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN---G 331
+ Y +GIYKHVAG G HA++I+GWG + +G S YWL ANS+N +WGE+ G
Sbjct: 301 FLHYTSGIYKHVAGSVGGGHAVKILGWG---IDQGVS----YWLAANSWNNDWGEDVFSG 353
Query: 332 LFRI 335
FRI
Sbjct: 354 YFRI 357
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 82/201 (40%), Positives = 119/201 (59%), Gaps = 20/201 (9%)
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQAN 358
+G EP+ + KYW+++ G + GCRPY P CE + N + C+ +
Sbjct: 191 FGGEPM-----AAWKYWVLSGIVT---GSDYTNHSGCRPYPFPPCEHHSNKTHYEPCKHD 242
Query: 359 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 418
TP+C ++C Y SY+ D +G AY++ + E+I +EI GPVE S +Y D +
Sbjct: 243 LYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFL 302
Query: 419 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN---GLF 475
Y +GIYKHVAG G HA++I+GWG + +G S YWL ANS+N +WGE+ G F
Sbjct: 303 HYTSGIYKHVAGSVGGGHAVKILGWG---IDQGVS----YWLAANSWNNDWGEDVFSGYF 355
Query: 476 RIVRGQNECGIEADITAGLPK 496
RI+RG +ECGIE+ I AG+P+
Sbjct: 356 RILRGADECGIESGIVAGIPR 376
>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
Length = 333
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 123/248 (49%), Positives = 170/248 (68%), Gaps = 12/248 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP+LVQ S ++ LP+ FD+R WP CPT++EIRDQGSCGS WA GA EA+SDR+CI S
Sbjct: 66 KLPVLVQYSGDMK-LPKNFDSREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRLCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
GK V +SS+DL++CC CG GC GG+ AW +W G+VSGG Y S GCRPY I P
Sbjct: 125 NGKVSVEISSEDLLTCCDSCGMGCNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIPP 184
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NG+ C +TP+CI +C+ GY SY+ D ++G+ +YS+P++EE I EI++
Sbjct: 185 CEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHYGKSSYSVPSDEEQIQSEIYK 244
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ T+Y D +LYKTG+Y+H+ G +G HAI+ W LGE S+ L
Sbjct: 245 NGPVEGAFTVYEDFLLYKTGVYQHMTGSAVGGHAIK--SW----LGEEVCSL----LALC 294
Query: 322 SFNTNWGE 329
+T+WG+
Sbjct: 295 HSDTDWGD 302
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/168 (42%), Positives = 109/168 (64%), Gaps = 14/168 (8%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NG+R C +TP+CI +C+ GY SY+ D ++G+ +
Sbjct: 169 GLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHYGKSS 228
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+P++EE I EI+++GPVEG+ T+Y D +LYKTG+Y+H+ G +G HAI+ W
Sbjct: 229 YSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVGGHAIK--SW---- 282
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
LGE S+ L +T+WG+ + G + CGIE++I AG+P
Sbjct: 283 LGEEVCSL----LALCHSDTDWGDM-VSLSSAGSDHCGIESEIVAGIP 325
>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
E64c Complex
gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca073 Complex
gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca042 Complex
gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca059 Complex
gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca074me Complex
gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca075 Complex
gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca076 Complex
gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca077 Complex
gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca078 Complex
Length = 256
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 129/241 (53%), Positives = 172/241 (71%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI S G+ +V +S++D++
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60
Query: 157 SCCKDCGNGCQGG-FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC G F AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 61 TCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 120
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+PGY SY++D +FG +YS+ NE+ IM EI+++GPVEG+ ++Y+D
Sbjct: 121 -GEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSD 179
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+LYK+G+Y+HV+G +G HAIRI+GWG E GT YWLV NS+NT+WG+NG F+
Sbjct: 180 FLLYKSGVYQHVSGEIMGGHAIRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFK 232
Query: 335 I 335
I
Sbjct: 233 I 233
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 87/163 (53%), Positives = 123/163 (75%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D +FG +YS+
Sbjct: 97 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVAN 155
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV+G +G HAIRI+GWG E GT
Sbjct: 156 NEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVE---NGT 212
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P
Sbjct: 213 ----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 251
>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
Length = 346
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 117/241 (48%), Positives = 161/241 (66%), Gaps = 9/241 (3%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FDAR NWP C +++ +RDQ +CGSGWA+ AV A+ DR+CIAS GK+ V LS+DD++
Sbjct: 94 IPKSFDARTNWPKCASLRTVRDQSACGSGWAVAAVGAIMDRICIASEGKQQVILSADDIL 153
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMN-GSHSSCQ 214
SCC +CG GC+GG KAW YW T GIV+G Y +K GC+PY PCE Y++ G + C
Sbjct: 154 SCCTECGYGCEGGDTYKAWNYWTTDGIVTGSNYTTKSGCKPYPYPPCEHYIDAGRYKKCP 213
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+ T C KCQ Y +SY++D ++G Y L + I +EI HGPVE + +Y D
Sbjct: 214 KDLYPTNTCEYKCQDNYTISYDEDKHYGAYPYVLVGDASFIQQEIMNHGPVEVTFDVYED 273
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
Y +GIYKH+AG +G HA++++GWG E + V YW+ ANS+N++WGENG FR
Sbjct: 274 FEHYSSGIYKHMAGEYVGVHAVKMLGWGTE-------NGVDYWICANSWNSDWGENGFFR 326
Query: 335 I 335
I
Sbjct: 327 I 327
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 76/171 (44%), Positives = 107/171 (62%), Gaps = 9/171 (5%)
Query: 328 GENGLFRIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGR 385
G N + GC+PY P CE Y++ R C + T C KCQ Y +SY++D ++G
Sbjct: 183 GSNYTTKSGCKPYPYPPCEHYIDAGRYKKCPKDLYPTNTCEYKCQDNYTISYDEDKHYGA 242
Query: 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 445
Y L + I +EI HGPVE + +Y D Y +GIYKH+AG +G HA++++GWG
Sbjct: 243 YPYVLVGDASFIQQEIMNHGPVEVTFDVYEDFEHYSSGIYKHMAGEYVGVHAVKMLGWGT 302
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
E + V YW+ ANS+N++WGENG FRI+RG+NECGIE+++ AG PK
Sbjct: 303 E-------NGVDYWICANSWNSDWGENGFFRILRGENECGIESNVVAGKPK 346
>gi|227293|prf||1701299A cathepsin B
Length = 339
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 125/255 (49%), Positives = 172/255 (67%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V + ++ LPE FDAR W CPTI +IRDQGSCGS WA GAVEA+SDR CI +
Sbjct: 67 KLPGRVAFGEDID-LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G+ +V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GC PY IP
Sbjct: 126 NGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGYYDSHIGCLPYTIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +T C + C+ GY SY++D +FG +YS+ + + IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTRRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKKIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+ WG E + V YW A
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVE-------NGVPYWAAA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG+NG F+I
Sbjct: 298 NSWNLDWGDNGFFKI 312
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 79/164 (48%), Positives = 114/164 (69%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGC PY IP CE ++NGSR C E +T C + C+ GY SY++D +FG +YS+
Sbjct: 176 HIGCLPYTIPPCEHHVNGSRPPCTG-EGDTRRCNKSCEAGYSPSYKEDKHFGYTSYSVSN 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+ WG E
Sbjct: 235 SVKKIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YW ANS+N +WG+NG F+I+RG+N CGIE++I AG+P+
Sbjct: 289 -NGVPYWAAANSWNLDWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
Complex
Length = 253
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 129/241 (53%), Positives = 172/241 (71%), Gaps = 10/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI S G+ +V +S++D++
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60
Query: 157 SCCKDCGNGCQGGFH-GKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+CC G AW +W G+VSGG Y S GCRPY IP CE ++NGS C
Sbjct: 61 TCCGGECGDGCNGGEPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 120
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E +TP+C + C+PGY SY++D +FG +YS+ NE+ IM EI+++GPVEG+ ++Y+D
Sbjct: 121 -GEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSD 179
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+LYK+G+Y+HV+G +G HAIRI+GWG E GT YWLVANS+NT+WG+NG F+
Sbjct: 180 FLLYKSGVYQHVSGEIMGGHAIRILGWGVE---NGT----PYWLVANSWNTDWGDNGFFK 232
Query: 335 I 335
I
Sbjct: 233 I 233
Score = 187 bits (476), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 90/168 (53%), Positives = 127/168 (75%), Gaps = 11/168 (6%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D +FG +
Sbjct: 92 GLYNSHVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKTCEPGYSPSYKEDKHFGCSS 150
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV+G +G HAIRI+GWG E
Sbjct: 151 YSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVE- 209
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT YWLVANS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P
Sbjct: 210 --NGT----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 251
>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
Length = 339
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 127/255 (49%), Positives = 178/255 (69%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP ++ ++ + LPE FDAR W CPTI+EIRDQGSCGS WA GAVE++SDR+CI +
Sbjct: 67 KLPHRIKFAEDMN-LPESFDAREQWSSCPTIKEIRDQGSCGSCWAFGAVESISDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGG-FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G +V +S++D+++CC G + AW +W G+VSGG Y S GCRPY IP
Sbjct: 126 NGHVNVEVSAEDMLTCCGGQCGEGCNGGYPSAAWNFWTKKGLVSGGLYDSHVGCRPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS C E +TP+C + C+PGY SY++D ++G +YS+P E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCSKSCEPGYSSSYKEDKHYGYSSYSVPGIEKEIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVA
Sbjct: 245 KNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGTE---NGT----PYWLVA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 298 NSWNTDWGDNGFFKI 312
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/164 (53%), Positives = 124/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D ++G +YS+P
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKSCEPGYSSSYKEDKHYGYSSYSVPG 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 235 IEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGTE---NGT 291
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P+
Sbjct: 292 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPR 331
>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 118/241 (48%), Positives = 161/241 (66%), Gaps = 9/241 (3%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LPE FD WP CP+++EIRDQ CGS WA GA EA +DR+CIAS+GK RLS DL
Sbjct: 68 DLPESFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSDQDL 127
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
++CC+ CG GC GG+ AW ++ +TG+ +GG Y SK C YE P C+ ++ G + C
Sbjct: 128 LTCCESCGFGCNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAYEFPKCDHHVEGKYPPCG 187
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+ +P TPEC+ KCQ GY V Y+ D +F AY +P+N E I E+ +GP+E ++Y D
Sbjct: 188 ETQP-TPECVEKCQEGYPVEYKKDKHFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYED 246
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+GIY+HVAG LG HA++++GWG E V+YW +ANS+N +WGENG FR
Sbjct: 247 FMTYKSGIYQHVAGKYLGGHAVKLVGWGVE-------DGVEYWKIANSWNEDWGENGYFR 299
Query: 335 I 335
I
Sbjct: 300 I 300
Score = 161 bits (408), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 83/190 (43%), Positives = 117/190 (61%), Gaps = 11/190 (5%)
Query: 309 GTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIR 367
G S+ W + T GE G + C YE P C+ ++ G C +P TPEC+
Sbjct: 141 GWPSMAWSWFHSTGVTTG-GEYGS-KDWCNAYEFPKCDHHVEGKYPPCGETQP-TPECVE 197
Query: 368 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 427
KCQ GY V Y+ D +F AY +P+N E I E+ +GP+E ++Y D + YK+GIY+H
Sbjct: 198 KCQEGYPVEYKKDKHFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYEDFMTYKSGIYQH 257
Query: 428 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIE 487
VAG LG HA++++GWG E V+YW +ANS+N +WGENG FRI+ G+NECGIE
Sbjct: 258 VAGKYLGGHAVKLVGWGVE-------DGVEYWKIANSWNEDWGENGYFRIIAGKNECGIE 310
Query: 488 ADITAGLPKI 497
+D AG+P++
Sbjct: 311 SDGVAGIPEL 320
>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With Ca074 Inhibitor
gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11017 Inhibitor
gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
Length = 254
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 121/241 (50%), Positives = 154/241 (63%), Gaps = 8/241 (3%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
E+P FD+R WP C +I IRDQ CGS WA GAVEAMSDR CI S GK++V LS+ DL
Sbjct: 2 EIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDL 61
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+SCC+ CG GC+GG G AW YWV GIV+G + + GC PY P CE + G + C
Sbjct: 62 LSCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCG 121
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
TP C + CQ Y Y D + G+ +Y++ +E+ I +EI ++GPVE T+Y D
Sbjct: 122 SKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYED 181
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+GIYKH+ G LG HAIRIIGWG E + YWL+ANS+N +WGENG FR
Sbjct: 182 FLNYKSGIYKHITGETLGGHAIRIIGWGVE-------NKAPYWLIANSWNEDWGENGYFR 234
Query: 335 I 335
I
Sbjct: 235 I 235
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 79/179 (44%), Positives = 107/179 (59%), Gaps = 11/179 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ + EN GC PY P CE + G C + TP C + CQ Y
Sbjct: 83 YWVKEGIVTGSSKEN---HAGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYK 139
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
Y D + G+ +Y++ +E+ I +EI ++GPVE T+Y D + YK+GIYKH+ G LG
Sbjct: 140 TPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLG 199
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
HAIRIIGWG E + YWL+ANS+N +WGENG FRIVRG++EC IE+++TAG
Sbjct: 200 GHAIRIIGWGVE-------NKAPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 251
>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 347
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 119/241 (49%), Positives = 159/241 (65%), Gaps = 9/241 (3%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FD+R NWP CP++ IRDQ SCGS WA+GAVEAM+DR+CIAS+G + V +S+DDL+
Sbjct: 95 IPKSFDSRTNWPECPSLYSIRDQSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLL 154
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQ 214
SCC +CG GC GG AW YWV+ GIV+G Y SK GC+PY PCE ++ H C
Sbjct: 155 SCCDECGFGCDGGDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCP 214
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+ T C KCQ GY +SY D ++G Y++ + +I +EI +GPVE + +Y D
Sbjct: 215 KDIYPTNTCEYKCQDGYSISYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYED 274
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
Y +GIYKH G LG HA++++GWG E GT YW+ ANS+N++WGENG FR
Sbjct: 275 FEHYSSGIYKHTTGDYLGGHAVKMLGWGTE---NGTD----YWICANSWNSDWGENGFFR 327
Query: 335 I 335
I
Sbjct: 328 I 328
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/183 (42%), Positives = 107/183 (58%), Gaps = 12/183 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYM-NGSRSSCQANEPNTPECIRKCQPGY 373
YW V+N T G N + GC+PY P CE ++ C + T C KCQ GY
Sbjct: 175 YW-VSNGIVT--GSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGY 231
Query: 374 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 433
+SY D ++G Y++ + +I +EI +GPVE + +Y D Y +GIYKH G L
Sbjct: 232 SISYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYL 291
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
G HA++++GWG E GT YW+ ANS+N++WGENG FRI+RG +EC IE+ + AG
Sbjct: 292 GGHAVKMLGWGTE---NGTD----YWICANSWNSDWGENGFFRILRGVDECQIESSVVAG 344
Query: 494 LPK 496
PK
Sbjct: 345 EPK 347
>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
Length = 337
Score = 251 bits (641), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 133/303 (43%), Positives = 178/303 (58%), Gaps = 19/303 (6%)
Query: 37 FDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEE 96
F ++H + L G K A K + + H S ++ L V LE
Sbjct: 26 FHEINHINSIQSLWTAGPSKFAFQKF---QRRLMRSEHVKSHKSEDILDRKV-----LET 77
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+PE +D R +W C ++ IRDQ CGS WA+ A E +SDR+CIAS G + +S++DL+
Sbjct: 78 IPESYDVRDHWSKCISVDNIRDQSDCGSCWAVAAAETISDRLCIASNGSINTFVSAEDLL 137
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNG-SHSSCQ 214
SCC CG+GC GG+ +AW+YWV G+VSGG+Y S+ GC+PY I PC + +NG + C
Sbjct: 138 SCCTSCGDGCDGGYPLQAWRYWVKQGLVSGGSYESQYGCKPYSIAPCGQTVNGVTWPKCP 197
Query: 215 DNEPNTPECIRKC--QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
E TPEC C + Y V+YE D ++G AY + E I EI +HGPVE +Y
Sbjct: 198 AQEEATPECASHCTSKSSYSVAYEKDKHYGLSAYPVGRKEAQIQTEILQHGPVEAGFLVY 257
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
+D YK+GIY HV+G LG HA++I+GWG E GT KYWLVANS+N NWGE G
Sbjct: 258 SDFYRYKSGIYTHVSGQELGGHAVKILGWGVE---NGT----KYWLVANSWNINWGEKGY 310
Query: 333 FRI 335
FRI
Sbjct: 311 FRI 313
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/168 (49%), Positives = 108/168 (64%), Gaps = 11/168 (6%)
Query: 334 RIGCRPYEI-PCERYMNG-SRSSCQANEPNTPECIRKC--QPGYDVSYEDDLNFGRIAYS 389
+ GC+PY I PC + +NG + C A E TPEC C + Y V+YE D ++G AY
Sbjct: 173 QYGCKPYSIAPCGQTVNGVTWPKCPAQEEATPECASHCTSKSSYSVAYEKDKHYGLSAYP 232
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
+ E I EI +HGPVE +Y+D YK+GIY HV+G LG HA++I+GWG E
Sbjct: 233 VGRKEAQIQTEILQHGPVEAGFLVYSDFYRYKSGIYTHVSGQELGGHAVKILGWGVE--- 289
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT KYWLVANS+N NWGE G FRI+RG+NECGIE+ + AG+P +
Sbjct: 290 NGT----KYWLVANSWNINWGEKGYFRILRGRNECGIESAVVAGIPDL 333
>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
Length = 346
Score = 251 bits (640), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 120/256 (46%), Positives = 160/256 (62%), Gaps = 8/256 (3%)
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
QNR P++ SD +++PE FDAR WP C +I+ IRDQ +CGS WA+ +SDR+CI
Sbjct: 78 QNRKPVVEDASDKGDDIPESFDARTKWPNCTSIKHIRDQANCGSCWAVSTASVLSDRICI 137
Query: 141 ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI 200
AS+ K+ V +SS D VSCC CG GC+GG+ A++Y+ G+V+GG Y SK GCRPY
Sbjct: 138 ASKQKKQVHISSIDFVSCCDSCGFGCEGGWPIDAFEYYSYQGVVTGGDYGSKTGCRPYPF 197
Query: 201 -PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 259
PC + N ++ E +TPEC+++CQ GY SY D +G Y + + + I REI
Sbjct: 198 HPCGHHGNETYYGECPKEESTPECVKQCQKGYKNSYRRDKTWGEDYYEVENSVKAIQREI 257
Query: 260 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 319
R GPV S T+Y D Y GIYKH AG G HAI+IIGW GT V YW++
Sbjct: 258 MRSGPVVSSFTVYDDFSYYVKGIYKHTAGKARGSHAIKIIGW-------GTEKNVPYWII 310
Query: 320 ANSFNTNWGENGLFRI 335
ANS++ +WGE G FR+
Sbjct: 311 ANSWHNDWGEKGFFRM 326
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 75/161 (46%), Positives = 95/161 (59%), Gaps = 8/161 (4%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+ GCRPY PC + N + E +TPEC+++CQ GY SY D +G Y +
Sbjct: 189 KTGCRPYPFHPCGHHGNETYYGECPKEESTPECVKQCQKGYKNSYRRDKTWGEDYYEVEN 248
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + I REI R GPV S T+Y D Y GIYKH AG G HAI+IIGW GT
Sbjct: 249 SVKAIQREIMRSGPVVSSFTVYDDFSYYVKGIYKHTAGKARGSHAIKIIGW-------GT 301
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
V YW++ANS++ +WGE G FR+VRG N CGIE D+ AG
Sbjct: 302 EKNVPYWIIANSWHNDWGEKGFFRMVRGTNHCGIEEDVVAG 342
>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
Length = 337
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 125/281 (44%), Positives = 180/281 (64%), Gaps = 14/281 (4%)
Query: 56 KNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
KN K + ++ G +P P+ + P V+ + ++LP+ FDAR WP CP+++E
Sbjct: 49 KNVPYKGRMDYVKSLCGANPAP--PEMKFP--VKEIEVPKDLPDTFDARTQWPDCPSLKE 104
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
+RDQG+CGS WA G VEA +DR+CI S+G + LS++DL SCC+ CGNGC GGF AW
Sbjct: 105 VRDQGACGSCWAFGCVEAATDRLCIQSKGIVNAHLSAEDLTSCCRTCGNGCNGGFLEGAW 164
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y GIV+GG Y S QGC PYEI C+ ++ G C+ + P TP C ++C+ GY+ +
Sbjct: 165 NYLKRDGIVTGGPYNSHQGCLPYEIKACDHHVVGKLQPCKGDGP-TPRCKKECESGYNNT 223
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
Y D + + +++ E+ IM EI +GPVE + T+Y+D YK+G+Y+H +GGPLG H
Sbjct: 224 YSKDEHHAKTVHAVEGVEQ-IMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGH 282
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
AI+ +GWG E + YWLVANS+N +WG+NG F+I
Sbjct: 283 AIKTLGWGNEDGKD-------YWLVANSWNPDWGDNGFFKI 316
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 108/160 (67%), Gaps = 10/160 (6%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PYEI C+ ++ G C+ + P TP C ++C+ GY+ +Y D + + +++ E
Sbjct: 183 GCLPYEIKACDHHVVGKLQPCKGDGP-TPRCKKECESGYNNTYSKDEHHAKTVHAVEGVE 241
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ IM EI +GPVE + T+Y+D YK+G+Y+H +GGPLG HAI+ +GWG E +
Sbjct: 242 Q-IMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGHAIKTLGWGNEDGKD---- 296
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
YWLVANS+N +WG+NG F+I+RG++ECGIE++I AG+
Sbjct: 297 ---YWLVANSWNPDWGDNGFFKILRGRDECGIESNIVAGM 333
>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 352
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 123/242 (50%), Positives = 157/242 (64%), Gaps = 10/242 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+PE FDAR W C +++ IRDQ SCGS WA GAVEAMSDR+CIAS GK V LS+DDL+
Sbjct: 80 IPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLL 139
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQ 214
SCCK CG GC GG AWKYWV GIV+G + KQGC+PY PCE + N +H C+
Sbjct: 140 SCCKSCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCK 199
Query: 215 DNEPNTPECIRKCQPGY-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
+ TP+C +KC Y + +Y +D FG AY + + +I +EI HGPVE + +Y
Sbjct: 200 HDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYE 259
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D ++Y GIY H G G HA++++GWG E V YWLVANS+NT+WGE+G F
Sbjct: 260 DFLMYDGGIYVHTGGKIGGGHAVKMLGWGVE-------QGVPYWLVANSWNTDWGEDGFF 312
Query: 334 RI 335
RI
Sbjct: 313 RI 314
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 77/186 (41%), Positives = 109/186 (58%), Gaps = 13/186 (6%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPG 372
KYW+ G N + GC+PY P CE + N + C+ + TP+C +KC
Sbjct: 159 KYWVKEGIVT---GSNFTMKQGCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDI 215
Query: 373 Y-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
Y + +Y +D FG AY + + +I +EI HGPVE + +Y D ++Y GIY H G
Sbjct: 216 YTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGK 275
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
G HA++++GWG E V YWLVANS+NT+WGE+G FRI+RG +ECGIE+ +
Sbjct: 276 IGGGHAVKMLGWGVE-------QGVPYWLVANSWNTDWGEDGFFRIIRGIDECGIESSVV 328
Query: 492 AGLPKI 497
GLPK+
Sbjct: 329 GGLPKL 334
>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 398
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 123/242 (50%), Positives = 157/242 (64%), Gaps = 10/242 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+PE FDAR W C +++ IRDQ SCGS WA GAVEAMSDR+CIAS GK V LS+DDL+
Sbjct: 121 IPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLL 180
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQ 214
SCCK CG GC GG AWKYWV GIV+G + KQGC+PY PCE + N +H C+
Sbjct: 181 SCCKSCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCK 240
Query: 215 DNEPNTPECIRKCQPGY-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
+ TP+C +KC Y + +Y +D FG AY + + +I +EI HGPVE + +Y
Sbjct: 241 HDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYE 300
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D ++Y GIY H G G HA++++GWG E V YWLVANS+NT+WGE+G F
Sbjct: 301 DFLMYDGGIYVHTGGKIGGGHAVKMLGWGVE-------QGVPYWLVANSWNTDWGEDGFF 353
Query: 334 RI 335
RI
Sbjct: 354 RI 355
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/186 (41%), Positives = 109/186 (58%), Gaps = 13/186 (6%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPG 372
KYW+ G N + GC+PY P CE + N + C+ + TP+C +KC
Sbjct: 200 KYWVKEGIVT---GSNFTMKQGCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDI 256
Query: 373 Y-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
Y + +Y +D FG AY + + +I +EI HGPVE + +Y D ++Y GIY H G
Sbjct: 257 YTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGK 316
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
G HA++++GWG E V YWLVANS+NT+WGE+G FRI+RG +ECGIE+ +
Sbjct: 317 IGGGHAVKMLGWGVE-------QGVPYWLVANSWNTDWGEDGFFRIIRGIDECGIESSVV 369
Query: 492 AGLPKI 497
GLPK+
Sbjct: 370 GGLPKL 375
>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
Length = 351
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 117/242 (48%), Positives = 164/242 (67%), Gaps = 10/242 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FD+R WP CP+I +IRDQ SCGS WA+ A E +SDR+CIAS+G+ V +S+DD+
Sbjct: 97 IPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASKGQTQVSISADDIN 156
Query: 157 SCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSH-SSC 213
+CC CGNGC GG+ +AW+++V G V+GG+Y K GC+PY P CE ++NG+H C
Sbjct: 157 ACCGMACGNGCNGGYPIEAWRHYVKNGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHYKPC 216
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
+ T +C R CQ GY ++Y+ DL+FG+ AY++ I +EI +GPVE + T+YA
Sbjct: 217 PSDMYPTDKCERSCQAGYSLTYKQDLHFGQSAYAVSKKATEIQKEIMTNGPVEVAFTVYA 276
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D +Y G+Y H AG LG HA++++GWG + GT YWL ANS+N +WGENG F
Sbjct: 277 DFEVYSGGVYVHTAGASLGGHAVKMLGWG---VDNGTP----YWLCANSWNEDWGENGYF 329
Query: 334 RI 335
RI
Sbjct: 330 RI 331
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 76/166 (45%), Positives = 108/166 (65%), Gaps = 9/166 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
+ GC+PY P CE ++NG+ C ++ T +C R CQ GY ++Y+ DL+FG+ AY++
Sbjct: 193 KTGCKPYPYPPCEHHVNGTHYKPCPSDMYPTDKCERSCQAGYSLTYKQDLHFGQSAYAVS 252
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
I +EI +GPVE + T+YAD +Y G+Y H AG LG HA++++GWG + G
Sbjct: 253 KKATEIQKEIMTNGPVEVAFTVYADFEVYSGGVYVHTAGASLGGHAVKMLGWG---VDNG 309
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
T YWL ANS+N +WGENG FRI+RG NECGIE + G+PK+
Sbjct: 310 TP----YWLCANSWNEDWGENGYFRIIRGVNECGIEHGVVGGIPKL 351
>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
Length = 342
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 119/267 (44%), Positives = 170/267 (63%), Gaps = 14/267 (5%)
Query: 71 MGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGA 130
M H D ++ + P L + + +PE FDAR WP+CP+I IRDQ CGS WA
Sbjct: 70 MKPHYDRRIGK---PQLQENEEDTAGIPESFDARTQWPHCPSISLIRDQADCGSCWAFAV 126
Query: 131 VEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYA 190
E++SDRVCIA+ + S +D+++CC +CG GC GGF AW+Y+V+TG+V+GG Y
Sbjct: 127 GESISDRVCIATDANKTAEFSVEDILTCCDECGFGCDGGFPDAAWEYFVSTGVVTGGLYG 186
Query: 191 SKQGCRPYEI-PCERYMNGS-HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 248
+K CRPYEI PC + N + + +C +TP C CQ GY VSY+DD GR +Y+L
Sbjct: 187 TKNACRPYEISPCGNHPNETFYRNCTG--VSTPSCKTSCQKGYPVSYKDDKTRGRKSYNL 244
Query: 249 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 308
+ I ++I +HGP+ + ++Y D + YK GIY++ GG G HA+RI+GWG E
Sbjct: 245 ANSVSAIQKDILKHGPLVATFSVYEDFMYYKKGIYRYTHGGYEGGHAVRILGWGVE---- 300
Query: 309 GTSSVVKYWLVANSFNTNWGENGLFRI 335
+ VKYW++ANS+NT+WGE+G FR+
Sbjct: 301 ---NNVKYWIIANSWNTDWGEDGFFRM 324
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 75/161 (46%), Positives = 105/161 (65%), Gaps = 11/161 (6%)
Query: 336 GCRPYEI-PCERYMNGS-RSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
CRPYEI PC + N + +C +TP C CQ GY VSY+DD GR +Y+L +
Sbjct: 190 ACRPYEISPCGNHPNETFYRNCTG--VSTPSCKTSCQKGYPVSYKDDKTRGRKSYNLANS 247
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
I ++I +HGP+ + ++Y D + YK GIY++ GG G HA+RI+GWG E
Sbjct: 248 VSAIQKDILKHGPLVATFSVYEDFMYYKKGIYRYTHGGYEGGHAVRILGWGVE------- 300
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
+ VKYW++ANS+NT+WGE+G FR+VRG N+CGIE ++AGL
Sbjct: 301 NNVKYWIIANSWNTDWGEDGFFRMVRGINDCGIEESVSAGL 341
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
Length = 346
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 122/275 (44%), Positives = 171/275 (62%), Gaps = 7/275 (2%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
++ + MGV +S ++ +D +LP+ FD+R W C +I+ IRDQ SC
Sbjct: 58 SIHHAKSMMGVLLNSVDQHKLHHPIIHHNDINIKLPKYFDSRKYWKNCSSIRTIRDQSSC 117
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVE+MSDR+CI S+G+ + LS+ +L+SCC CG GC GG G AW YW G
Sbjct: 118 GSCWAFGAVESMSDRICIHSKGRISIELSAVNLLSCCSRCGFGCNGGIPGMAWDYWKDEG 177
Query: 183 IVSGGTYASKQGCRPYEIP-CERYMNG-SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
IV+GG+ + GC+PY P C + +HSSC+ +TPEC + CQP Y + YE+D
Sbjct: 178 IVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKY 237
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G+ +Y + ++E +IM+EI +GPVE + ++ D + YKTG+YK+V G LG HAIRIIG
Sbjct: 238 YGKSSYYVTSDEVSIMKEILLNGPVEATFYVFDDFLNYKTGVYKYVTGSLLGGHAIRIIG 297
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG L YWL ANS+N WG+ G F+I
Sbjct: 298 WGVSTLNH-----TPYWLCANSWNKQWGDKGYFKI 327
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 78/163 (47%), Positives = 107/163 (65%), Gaps = 7/163 (4%)
Query: 336 GCRPYEIP-CERYMNG-SRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC+PY P C + + SSC+ +TPEC + CQP Y + YE+D +G+ +Y + ++
Sbjct: 189 GCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSYYVTSD 248
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E +IM+EI +GPVE + ++ D + YKTG+YK+V G LG HAIRIIGWG L
Sbjct: 249 EVSIMKEILLNGPVEATFYVFDDFLNYKTGVYKYVTGSLLGGHAIRIIGWGVSTLNH--- 305
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL ANS+N WG+ G F+I+RG NECGIE+ +TAGLPK
Sbjct: 306 --TPYWLCANSWNKQWGDKGYFKILRGSNECGIESMVTAGLPK 346
>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
Length = 341
Score = 248 bits (633), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 123/277 (44%), Positives = 181/277 (65%), Gaps = 16/277 (5%)
Query: 62 LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW-PYCPTIQEIRDQG 120
+T+ + +MG ++L + + L V+ + +LP FD+R W CP+++E+RDQ
Sbjct: 58 VTMDYIRKQMG----TRLEGSPVTLDVKHVEVPADLPTSFDSRTQWGSMCPSVKEVRDQA 113
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWV 179
+CGS WA GAVEAM+DR CIAS+G + +S++DL++CC CG+GC GG+ AW+YW
Sbjct: 114 NCGSCWAFGAVEAMTDRTCIASKGAQTPHISAEDLLTCCTFTCGDGCNGGYPAAAWEYWK 173
Query: 180 TTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
GIV+GG Y S QGC+PY + CE + G + C D P TP C R C+ GY+V+Y +D
Sbjct: 174 NQGIVTGGQYDSNQGCQPYSLAKCEHHTTGPYKPCGDIVP-TPACKRSCRQGYNVTYPND 232
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
+FG +Y + ++ I EI +GPVE + T+Y+D + YK+G+Y+H +G PLG HAI+I
Sbjct: 233 KHFGASSYGVRGVDQ-IATEIMTNGPVEAAFTVYSDFLSYKSGVYQHTSGQPLGGHAIKI 291
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IGWG + +GT YW+VANS+N +WG +G F I
Sbjct: 292 IGWGVQ---DGTD----YWIVANSWNDSWGNDGFFWI 321
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 74/163 (45%), Positives = 107/163 (65%), Gaps = 10/163 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY + CE + G C P TP C R C+ GY+V+Y +D +FG +Y + +
Sbjct: 188 GCQPYSLAKCEHHTTGPYKPCGDIVP-TPACKRSCRQGYNVTYPNDKHFGASSYGVRGVD 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EI +GPVE + T+Y+D + YK+G+Y+H +G PLG HAI+IIGWG + +GT
Sbjct: 247 Q-IATEIMTNGPVEAAFTVYSDFLSYKSGVYQHTSGQPLGGHAIKIIGWGVQ---DGTD- 301
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YW+VANS+N +WG +G F I +G +ECGIE+ + AGLPK+
Sbjct: 302 ---YWIVANSWNDSWGNDGFFWIKKGTDECGIESQVVAGLPKV 341
>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
Length = 337
Score = 248 bits (633), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 126/266 (47%), Positives = 173/266 (65%), Gaps = 17/266 (6%)
Query: 80 PQNRLPLLVQLSDPL---EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSD 136
P++ P +P+ E +P+ FDAR WP C +I IRDQ CGS WA+ A E +SD
Sbjct: 55 PEHVAPHKFYEVEPISVAENIPDHFDAREQWPNCVSIDNIRDQSDCGSCWAVAAAETISD 114
Query: 137 RVCIASRGKRHVRLSSDDLVSCCK---DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQ 193
R CIAS G+ +V +S++DL+SCC +CG+GC+GG+ +AW+YWV G+V+GG+Y S+
Sbjct: 115 RTCIASNGEVNVLISAEDLLSCCTGGYNCGDGCEGGYPIQAWRYWVHNGLVTGGSYESQY 174
Query: 194 GCRPYEI-PCERYMNG-SHSSCQDNEPNTPECIRKC--QPGYDVSYEDDLNFGRIAYSLP 249
GC+PY I PC + +NG + C +E TPEC+++C + Y V Y+ D ++G AY++
Sbjct: 175 GCKPYSIAPCGQTVNGVTWPKCAADEVATPECVKQCTSKSDYAVPYDQDKHYGSSAYAIR 234
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
N I EI R+GPVE +Y+D YK+GIYKHVAG LG HA++I+GWG E G
Sbjct: 235 QNVAQIQTEIMRNGPVEVGFLVYSDFYQYKSGIYKHVAGRELGGHAVKILGWGVE---NG 291
Query: 310 TSSVVKYWLVANSFNTNWGENGLFRI 335
T YWL ANS+N NWGE G FRI
Sbjct: 292 T----PYWLAANSWNVNWGEKGYFRI 313
Score = 162 bits (409), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 86/182 (47%), Positives = 113/182 (62%), Gaps = 18/182 (9%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNG-SRSSCQANEPNTPECIRKC--QPGYDV 375
W NGL + GC+PY I PC + +NG + C A+E TPEC+++C + Y V
Sbjct: 159 WVHNGLVTGGSYESQYGCKPYSIAPCGQTVNGVTWPKCAADEVATPECVKQCTSKSDYAV 218
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
Y+ D ++G AY++ N I EI R+GPVE +Y+D YK+GIYKHVAG LG
Sbjct: 219 PYDQDKHYGSSAYAIRQNVAQIQTEIMRNGPVEVGFLVYSDFYQYKSGIYKHVAGRELGG 278
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
HA++I+GWG E GT YWL ANS+N NWGE G FRI RG NECGIE+ + AG+P
Sbjct: 279 HAVKILGWGVE---NGT----PYWLAANSWNVNWGEKGYFRIRRGTNECGIESSVVAGIP 331
Query: 496 KI 497
+
Sbjct: 332 DL 333
>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 332
Score = 248 bits (632), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 124/290 (42%), Positives = 177/290 (61%), Gaps = 20/290 (6%)
Query: 57 NALSKLTLSELEMRMGVHPDS-------KLPQN-RLPLLVQLSDPLEELPEGFDARINWP 108
N+L ++E R G + + P++ RLPL +++ E +P+ FD+R NWP
Sbjct: 33 NSLKTTWVAERPTRFGSFDEVARLCGALETPEDQRLPL--KVAPIAEAIPDTFDSRTNWP 90
Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQG 168
CPTI+E+RDQ +CGS WA GAVE+MSDR+CIAS + VRLS+ DL+SCC CG+GC G
Sbjct: 91 ACPTIKEVRDQSACGSCWAFGAVESMSDRICIASNATKIVRLSASDLLSCCTSCGDGCDG 150
Query: 169 GFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGS--HSSCQDNEPNTPECIRK 226
G G +W Y+ GIV+G Y + C+PY+ P + S + C + +TP+C +
Sbjct: 151 GQLGPSWDYYKNKGIVTGYLYNTTGYCKPYDFPACAHHEASPDYPDCPSTDYSTPKCTKS 210
Query: 227 CQPGYDV-SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 285
C GY +Y DL++G+ +YS+ + I EI HGPVE + T+Y+D Y++G+YKH
Sbjct: 211 CVAGYTANTYTADLHYGQSSYSVGRTDAAIQTEILNHGPVEAAFTVYSDFPTYRSGVYKH 270
Query: 286 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G LG HAI I+GWG E S YWLV NS+N +WG+ G F+I
Sbjct: 271 TSGSVLGGHAISIVGWGTE-------SGSPYWLVKNSWNPSWGDGGFFKI 313
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 68/165 (41%), Positives = 99/165 (60%), Gaps = 12/165 (7%)
Query: 337 CRPYEIPCERYMNGSRS--SCQANEPNTPECIRKCQPGYDV-SYEDDLNFGRIAYSLPAN 393
C+PY+ P + S C + + +TP+C + C GY +Y DL++G+ +YS+
Sbjct: 177 CKPYDFPACAHHEASPDYPDCPSTDYSTPKCTKSCVAGYTANTYTADLHYGQSSYSVGRT 236
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ I EI HGPVE + T+Y+D Y++G+YKH +G LG HAI I+GWG E
Sbjct: 237 DAAIQTEILNHGPVEAAFTVYSDFPTYRSGVYKHTSGSVLGGHAISIVGWGTE------- 289
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
S YWLV NS+N +WG+ G F+I+RG +CGI D+ GLPK+
Sbjct: 290 SGSPYWLVKNSWNPSWGDGGFFKILRG--DCGINNDVVGGLPKLA 332
>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
Length = 342
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 122/242 (50%), Positives = 156/242 (64%), Gaps = 9/242 (3%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ LPE FDAR NWP+CP+I EIRDQ SCGS WA GAVEAMSDR+CI S+G + LS+ D
Sbjct: 84 QHLPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVD 143
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSC 213
LVSCC +CG GC+GG+ AW W T GIV+GG+ GCR Y P CE G + C
Sbjct: 144 LVSCCTECGCGCRGGYSPIAWDLWKTHGIVTGGSKEKPTGCRSYPFPSCEHRGKGQYPPC 203
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
TPECI++C ++ YE D I+Y++ E+ +M+EI GPV + +Y
Sbjct: 204 PHQLYPTPECIKRCDTK-EIDYEKDKTRANISYNVYPAEQAVMKEIMLRGPVGAILHVYE 262
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D++ YK+G+Y HV GG LGEH IRI+GWG+E V YWLVANS+N +WGE G
Sbjct: 263 DLLDYKSGVYFHVWGGHLGEHGIRILGWGEE-------DGVPYWLVANSWNEDWGEKGYM 315
Query: 334 RI 335
R+
Sbjct: 316 RV 317
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 73/163 (44%), Positives = 98/163 (60%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCR Y P CE G C TPECI++C ++ YE D I+Y++ E
Sbjct: 183 GCRSYPFPSCEHRGKGQYPPCPHQLYPTPECIKRCDTK-EIDYEKDKTRANISYNVYPAE 241
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ +M+EI GPV + +Y D++ YK+G+Y HV GG LGEH IRI+GWG+E
Sbjct: 242 QAVMKEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWGEE-------D 294
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWLVANS+N +WGE G R++R +NECGI +TAGLP +
Sbjct: 295 GVPYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAGLPDL 337
>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
Length = 341
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 123/261 (47%), Positives = 168/261 (64%), Gaps = 9/261 (3%)
Query: 77 SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSD 136
S + QNR P+ +D E++PE FDAR WP C +++ IRDQ +CGS WA+ A+SD
Sbjct: 70 SFINQNRKPVFDDKNDKGEDIPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSD 129
Query: 137 RVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGC 195
R+CIAS G++ V +S+ D++SCC CG GC GG+ +A+ Y+ G V+GG Y + GC
Sbjct: 130 RICIASNGRKQVHVSATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGC 189
Query: 196 RPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEET 254
RPY PC + ++ NE TP+C+RKCQ Y SY+ D + G+ AY +P +E+
Sbjct: 190 RPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKA 249
Query: 255 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 314
I REI ++GPV G+ T+Y D YK GIYKH AG G HAI+IIGWG+E V
Sbjct: 250 IQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE-------GGV 302
Query: 315 KYWLVANSFNTNWGENGLFRI 335
YWL+ANS++ +WGENG FRI
Sbjct: 303 PYWLIANSWHNDWGENGYFRI 323
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 79/159 (49%), Positives = 102/159 (64%), Gaps = 8/159 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY PC + + NE TP+C+RKCQ Y SY+ D + G+ AY +P +E
Sbjct: 188 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSE 247
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I REI ++GPV G+ T+Y D YK GIYKH AG G HAI+IIGWG+E
Sbjct: 248 KAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE-------G 300
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
V YWL+ANS++ +WGENG FRI+RG N CGIE ++ AG
Sbjct: 301 GVPYWLIANSWHNDWGENGYFRILRGSNHCGIEENVVAG 339
>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
Length = 351
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 117/242 (48%), Positives = 159/242 (65%), Gaps = 10/242 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FD+R WP CP+I +IRDQ SCGS WA+ A E +SDR+CIAS GK + +S+DD+
Sbjct: 97 VPDSFDSRTQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQISISADDIN 156
Query: 157 SCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSH-SSC 213
+CC CGNGC GG+ +AW+++V G V+GG+Y K GC+PY P CE ++NG+H C
Sbjct: 157 ACCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQEKSGCKPYPYPPCEHHVNGTHYKPC 216
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
N T +C CQ GY ++Y DL+FG+ AY++ I +EI HGPVE + T+Y
Sbjct: 217 PSNMYPTDKCEHSCQAGYPLTYTQDLHFGQSAYAVSKKPAEIQKEIMTHGPVEVAFTVYE 276
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D Y G+Y H AG LG HA++++GWG + GT YWL ANS+N +WGENG F
Sbjct: 277 DFEHYSGGVYVHTAGASLGGHAVKMLGWG---VDNGT----PYWLCANSWNEDWGENGYF 329
Query: 334 RI 335
RI
Sbjct: 330 RI 331
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 76/166 (45%), Positives = 104/166 (62%), Gaps = 9/166 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
+ GC+PY P CE ++NG+ C +N T +C CQ GY ++Y DL+FG+ AY++
Sbjct: 193 KSGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCEHSCQAGYPLTYTQDLHFGQSAYAVS 252
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
I +EI HGPVE + T+Y D Y G+Y H AG LG HA++++GWG + G
Sbjct: 253 KKPAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWG---VDNG 309
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
T YWL ANS+N +WGENG FRI+RG NECGIE+ + G PK+
Sbjct: 310 T----PYWLCANSWNEDWGENGYFRIIRGVNECGIESGVVGGTPKL 351
>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
Length = 332
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 126/274 (45%), Positives = 172/274 (62%), Gaps = 20/274 (7%)
Query: 67 LEMRMGVHPDSK----LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
L+ GVH ++ LP+ ++ L V + P+ FDAR WP CP+I +IRDQGSC
Sbjct: 54 LKSLAGVHKNANNAFTLPKRKVSLDVTI-------PDEFDARKQWPNCPSITDIRDQGSC 106
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WAL + S GK V LS+++LV+CC CG GC GG G AW+YW G
Sbjct: 107 GSCWALELLRLCLIVFVSHSNGKLQVHLSAENLVTCCGSCGAGCFGGDPGSAWEYWRDVG 166
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
IVSGG Y SK+GC+PY I PCE ++ GS C+ E +T +C ++C+ GY + Y+ DL++
Sbjct: 167 IVSGGNYGSKEGCQPYSIAPCEHHIPGSRPPCR-GEGHTADCRKQCEKGYSIPYDKDLHY 225
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
YS + + I EI ++GPVE + +Y D++ YK G+YKHVAG P+G HAI+I+GW
Sbjct: 226 AEFVYSTERDVKEIQTEILKNGPVEAAFFVYEDLLTYKEGVYKHVAGAPVGGHAIKILGW 285
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E GT YWL+ANS+NT+WG NG F+I
Sbjct: 286 GVE---NGT----PYWLIANSWNTDWGNNGFFKI 312
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 80/163 (49%), Positives = 114/163 (69%), Gaps = 9/163 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I PCE ++ GSR C+ E +T +C ++C+ GY + Y+ DL++ YS +
Sbjct: 178 GCQPYSIAPCEHHIPGSRPPCRG-EGHTADCRKQCEKGYSIPYDKDLHYAEFVYSTERDV 236
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EI ++GPVE + +Y D++ YK G+YKHVAG P+G HAI+I+GWG E GT
Sbjct: 237 KEIQTEILKNGPVEAAFFVYEDLLTYKEGVYKHVAGAPVGGHAIKILGWGVE---NGT-- 291
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+NT+WG NG F+I+RG +ECGIE D++AGLP+I
Sbjct: 292 --PYWLIANSWNTDWGNNGFFKILRGSDECGIEIDVSAGLPRI 332
>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
Length = 339
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 122/274 (44%), Positives = 170/274 (62%), Gaps = 9/274 (3%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
+ ++ +G ++ +N L ++ +LPE FDAR WP C TI EIRDQ SCG
Sbjct: 53 VDHFKLHLGALSETPEERNALRPTIKHDISKNDLPESFDARSQWPQCWTISEIRDQASCG 112
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA A AMSDRVCI S G+ RL++ D +SCC CG GC+GG+ KAW YW+ GI
Sbjct: 113 SCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTYCGQGCRGGYPPKAWDYWMREGI 172
Query: 184 VSGGTYASKQGCRPYEIPCERYMNGS--HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
V+GGT+ ++ GC+P+ ++ S +S C TP C R CQ GY+ +YE D +
Sbjct: 173 VTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPCARACQTGYNKTYEQDKFY 232
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G +Y++ +E IM+EI ++GPVE + I+ D +Y++GIY HVAG +G HA+R+IGW
Sbjct: 233 GNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGW 292
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E + V YWL+ANS+N WGENG FR+
Sbjct: 293 GVE-------NGVNYWLMANSWNEEWGENGYFRM 319
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 76/166 (45%), Positives = 109/166 (65%), Gaps = 9/166 (5%)
Query: 334 RIGCRPYEIPCERYMNGSR--SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
R GC+P+ ++ SR S C TP C R CQ GY+ +YE D +G +Y++
Sbjct: 181 RTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPCARACQTGYNKTYEQDKFYGNSSYNVG 240
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+E IM+EI ++GPVE + I+ D +Y++GIY HVAG +G HA+R+IGWG E
Sbjct: 241 EHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVE----- 295
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ V YWL+ANS+N WGENG FR+VRG+NECGIE+++ AG+P++
Sbjct: 296 --NGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAGMPRL 339
>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
Length = 351
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 117/242 (48%), Positives = 160/242 (66%), Gaps = 10/242 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FD+R WP CP+I +IRDQ SCGS WA+ A E +SDR+CIAS GK + +S+DD+
Sbjct: 97 IPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQLSISADDIN 156
Query: 157 SCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSH-SSC 213
+CC CGNGC GG+ +AW+++V G V+GG+Y K GC+PY P CE ++NG+H C
Sbjct: 157 ACCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHYKPC 216
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
N T +C R CQ GY ++Y DL+FG+ AY++ I +EI HGPVE + ++Y
Sbjct: 217 PSNMYPTDKCERSCQAGYALTYTQDLHFGQSAYAVSKKVTEIQKEIMTHGPVEVAFSVYE 276
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D Y G+Y H AG LG HA++++GWG + GT YWL ANS+N +WGENG F
Sbjct: 277 DFEHYSGGVYVHTAGASLGGHAVKMLGWG---VDNGT----PYWLCANSWNEDWGENGYF 329
Query: 334 RI 335
RI
Sbjct: 330 RI 331
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 76/166 (45%), Positives = 106/166 (63%), Gaps = 9/166 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
+ GC+PY P CE ++NG+ C +N T +C R CQ GY ++Y DL+FG+ AY++
Sbjct: 193 KTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYTQDLHFGQSAYAVS 252
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
I +EI HGPVE + ++Y D Y G+Y H AG LG HA++++GWG + G
Sbjct: 253 KKVTEIQKEIMTHGPVEVAFSVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWG---VDNG 309
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
T YWL ANS+N +WGENG FRI+RG NECGIE+ + G+PK+
Sbjct: 310 T----PYWLCANSWNEDWGENGYFRIIRGVNECGIESGVVGGIPKL 351
>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 319
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 121/261 (46%), Positives = 162/261 (62%), Gaps = 9/261 (3%)
Query: 76 DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMS 135
+S L + R P+ V +D E+P FD+R WP C +I IRDQ CGS WA GAVEAMS
Sbjct: 47 ESDLRRKRRPI-VDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSSWAFGAVEAMS 105
Query: 136 DRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGC 195
DR CI S GK++V LS+ DL+SCC+ CG+G +GGF AW YWV GIV+G + + C
Sbjct: 106 DRSCIQSGGKQNVELSAVDLLSCCEHCGDGFEGGFPALAWDYWVKEGIVTGSSKENHTSC 165
Query: 196 RPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEET 254
+PY P CE + G + +C + TP C CQ Y Y D + G+ Y++ +E+
Sbjct: 166 QPYPFPKCEHHTKGKYPACFEEIYKTPNCENTCQKSYKTPYAQDKHRGKSRYNVKNDEKA 225
Query: 255 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 314
I +EI ++GPVE + +Y D + YK+GIYKH+ G + HAIRIIGWG E +
Sbjct: 226 IQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLVSWHAIRIIGWGVE-------NNT 278
Query: 315 KYWLVANSFNTNWGENGLFRI 335
YWL+ NS+N +WGENG FRI
Sbjct: 279 PYWLIPNSWNEDWGENGNFRI 299
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 69/158 (43%), Positives = 97/158 (61%), Gaps = 8/158 (5%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
C+PY P CE + G +C TP C CQ Y Y D + G+ Y++ +E+
Sbjct: 165 CQPYPFPKCEHHTKGKYPACFEEIYKTPNCENTCQKSYKTPYAQDKHRGKSRYNVKNDEK 224
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I +EI ++GPVE + +Y D + YK+GIYKH+ G + HAIRIIGWG E +
Sbjct: 225 AIQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLVSWHAIRIIGWGVE-------NN 277
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
YWL+ NS+N +WGENG FRI+RG++EC IE+++TAG
Sbjct: 278 TPYWLIPNSWNEDWGENGNFRILRGRHECSIESEVTAG 315
>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
Length = 337
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 133/259 (51%), Positives = 160/259 (61%), Gaps = 14/259 (5%)
Query: 81 QNRLPLL-VQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVC 139
Q R P + +SD +LPE FDAR WP C +I++I DQ SCGS WA+ V AMSDRVC
Sbjct: 61 QTRRPTVRYNVSD--NDLPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVC 118
Query: 140 IASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE 199
I S G LS+ DLVSCC CGNGCQGG AW YW GIV+GGT + GC PY
Sbjct: 119 IHSNGMMQPELSAIDLVSCCSYCGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYP 178
Query: 200 IPCERYMNGSHSS---CQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 256
P R+ GS S C TP C CQ GYD +YE D +G+ +Y++ +E TIM
Sbjct: 179 FPQCRH-PGSRSQLNPCPRYTYPTPSCYPYCQAGYDKTYEKDKVYGKTSYNVDRHEYTIM 237
Query: 257 REIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKY 316
EI ++GPVE +Y D +YK+GIY HV+G G+HAIRIIGWG E + VKY
Sbjct: 238 EEIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGVE-------NGVKY 290
Query: 317 WLVANSFNTNWGENGLFRI 335
WL ANS+N WGENG FRI
Sbjct: 291 WLTANSWNVGWGENGYFRI 309
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/189 (44%), Positives = 112/189 (59%), Gaps = 18/189 (9%)
Query: 327 WGENGLFR-------IGCRPYEIPCERYMNGSRSS---CQANEPNTPECIRKCQPGYDVS 376
W NG+ GC PY P R+ GSRS C TP C CQ GYD +
Sbjct: 157 WWRNGIVTGGTLENPTGCLPYPFPQCRHP-GSRSQLNPCPRYTYPTPSCYPYCQAGYDKT 215
Query: 377 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 436
YE D +G+ +Y++ +E TIM EI ++GPVE +Y D +YK+GIY HV+G G+H
Sbjct: 216 YEKDKVYGKTSYNVDRHEYTIMEEIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKH 275
Query: 437 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
AIRIIGWG E + VKYWL ANS+N WGENG FRI+RG +EC IE+ + AG+P+
Sbjct: 276 AIRIIGWGVE-------NGVKYWLTANSWNVGWGENGYFRILRGTDECRIESIVVAGMPR 328
Query: 497 IGLEIDSNE 505
+ I ++
Sbjct: 329 LQKNITNHH 337
>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
Length = 351
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 117/242 (48%), Positives = 160/242 (66%), Gaps = 10/242 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FD+R WP CP+I +IRDQ SCGS WA+ A E +SDR+CIAS K + +S+DD+
Sbjct: 97 VPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISADDIN 156
Query: 157 SCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSH-SSC 213
+CC CGNGC GG+ +AW+++V G V+GG+Y K GC+PY P CE ++NG+H C
Sbjct: 157 ACCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPC 216
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
N T +C R CQ GY ++Y+ DL+FG+ AY++ I +EI HGPVE + T+Y
Sbjct: 217 PSNMYPTDKCERSCQAGYALTYQQDLHFGQSAYAVSKKAAEIQKEIMTHGPVEVAFTVYE 276
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D Y G+Y H AG LG HA++++GWG + GT YWL ANS+N +WGENG F
Sbjct: 277 DFEHYSGGVYVHTAGASLGGHAVKMLGWG---VDNGT----PYWLCANSWNEDWGENGYF 329
Query: 334 RI 335
RI
Sbjct: 330 RI 331
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 77/166 (46%), Positives = 106/166 (63%), Gaps = 9/166 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
+ GC+PY P CE ++NG+ C +N T +C R CQ GY ++Y+ DL+FG+ AY++
Sbjct: 193 KTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYQQDLHFGQSAYAVS 252
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
I +EI HGPVE + T+Y D Y G+Y H AG LG HA++++GWG + G
Sbjct: 253 KKAAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWG---VDNG 309
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
T YWL ANS+N +WGENG FRI+RG NECGIE + G+PK+
Sbjct: 310 T----PYWLCANSWNEDWGENGYFRIIRGVNECGIEGGVVGGIPKL 351
>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
Length = 344
Score = 245 bits (625), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 113/259 (43%), Positives = 166/259 (64%), Gaps = 9/259 (3%)
Query: 79 LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRV 138
+ NR P++ ++D +++PE FDAR +WP C ++ IRDQ CGS WA+ A+SDR+
Sbjct: 73 IKHNRKPIVEDVNDDGDDIPESFDARTHWPNCSSLTHIRDQADCGSCWAVSTASALSDRI 132
Query: 139 CIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY 198
CIAS+G + V +S+ D++SCC CG+GC GG+ A+K++ G V+GG Y +K CRPY
Sbjct: 133 CIASKGAKQVYVSATDILSCCHSCGDGCDGGYVIDAFKFFAEQGAVTGGDYGAKDCCRPY 192
Query: 199 EI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP-ANEETIM 256
PC + N ++ + +TPEC+RKCQ GY+ Y +D G AY LP + + I
Sbjct: 193 PFHPCGHHGNETYYGECPEDGSTPECVRKCQEGYETEYHEDRVRGEDAYRLPIGSVKAIQ 252
Query: 257 REIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKY 316
+EI R+GPV + ++ D Y+ GIY HVAG P G HA++IIGW GT V Y
Sbjct: 253 KEIMRNGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGGHAVKIIGW-------GTEHGVPY 305
Query: 317 WLVANSFNTNWGENGLFRI 335
W++ANS++++WGE+G FR+
Sbjct: 306 WIIANSWHSDWGEDGYFRM 324
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 101/162 (62%), Gaps = 9/162 (5%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP-ANE 394
CRPY PC + N + + +TPEC+RKCQ GY+ Y +D G AY LP +
Sbjct: 189 CRPYPFHPCGHHGNETYYGECPEDGSTPECVRKCQEGYETEYHEDRVRGEDAYRLPIGSV 248
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +EI R+GPV + ++ D Y+ GIY HVAG P G HA++IIGWG T
Sbjct: 249 KAIQKEIMRNGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGGHAVKIIGWG-------TEH 301
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YW++ANS++++WGE+G FR+VRG N+CGIE ++ AG K
Sbjct: 302 GVPYWIIANSWHSDWGEDGYFRMVRGINDCGIETNVVAGKFK 343
>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
Length = 332
Score = 245 bits (625), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 120/260 (46%), Positives = 159/260 (61%), Gaps = 20/260 (7%)
Query: 78 KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDR 137
+LP R + + ++PE FDAR WPYC +I I++QG CG+ WA+ AV MSDR
Sbjct: 71 RLPTKRHDVAYNM-----DIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAAVSVMSDR 125
Query: 138 VCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF-HGKAWKYWVTTGIVSGGTYASKQGCR 196
+CI S GK V L+++DL+ CCKDCGNGC GGF G +++YWV G+VSG Y S GC+
Sbjct: 126 LCIHSEGKFDVELAAEDLMGCCKDCGNGCNGGFLDGTSFQYWVDVGLVSGAAYNSTDGCK 185
Query: 197 PYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETI 255
PY PC G H TP C C GYD +Y D +G AY LP +E I
Sbjct: 186 PYPFKPCLYPFVGCHPE------KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDERMI 239
Query: 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 315
EI +GPVE ++Y D+ LYKTG+Y+HV G +G+HA+R+IGWG+E V
Sbjct: 240 QLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKE-------RGVP 292
Query: 316 YWLVANSFNTNWGENGLFRI 335
YWL+ANS+ +WGE+G F+
Sbjct: 293 YWLIANSYGEDWGEHGYFKF 312
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 72/163 (44%), Positives = 97/163 (59%), Gaps = 14/163 (8%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY PC G + TP C C GYD +Y D +G AY LP +E
Sbjct: 183 GCKPYPFKPCLYPFVG------CHPEKTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDE 236
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVE ++Y D+ LYKTG+Y+HV G +G+HA+R+IGWG+E
Sbjct: 237 RMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKE-------R 289
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+ +WGE+G F+ +RG N GIE+ + AGLPK+
Sbjct: 290 GVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLPKV 332
>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
cantonensis]
Length = 394
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 127/266 (47%), Positives = 167/266 (62%), Gaps = 14/266 (5%)
Query: 73 VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVE 132
VH K Q+ L D ++PE FDAR +W C +I+ IRDQ SCGS WA GAVE
Sbjct: 101 VHLSVKAKQH----LSSTKDLDIDIPETFDARQHWSNCQSIKNIRDQSSCGSCWAFGAVE 156
Query: 133 AMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASK 192
AMSDR+CIAS K V LS+DDL+SCC+ CG GC+GG AW+YWV GIV+G + +
Sbjct: 157 AMSDRICIASNEKIQVTLSADDLLSCCRTCGFGCEGGDPMFAWQYWVDHGIVTGSNFTAN 216
Query: 193 QGCRPYEI-PCERYMNGSH-SSCQDNEPNTPECIRKCQPGY-DVSYEDDLNFGRIAYSLP 249
QGC+PY PCE + N + C+ + TP+C +KC P Y + +Y+DD +GR AY +
Sbjct: 217 QGCKPYPFPPCEHHSNKTRFDPCRHDLYPTPKCSKKCVPSYKEKNYDDDRFYGRTAYGVK 276
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
+ I +EI HGPVE + +Y D + Y GIY H G G HA+++IGWG + +G
Sbjct: 277 NDVAAIQKEILTHGPVEVAFEVYEDFLHYAGGIYVHTGGKLGGGHAVKLIGWG---IDQG 333
Query: 310 TSSVVKYWLVANSFNTNWGENGLFRI 335
T YWL+ANS+NT+WGE G FRI
Sbjct: 334 TP----YWLIANSWNTDWGEEGFFRI 355
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 76/172 (44%), Positives = 106/172 (61%), Gaps = 10/172 (5%)
Query: 328 GENGLFRIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGY-DVSYEDDLNFG 384
G N GC+PY P CE + N +R C+ + TP+C +KC P Y + +Y+DD +G
Sbjct: 210 GSNFTANQGCKPYPFPPCEHHSNKTRFDPCRHDLYPTPKCSKKCVPSYKEKNYDDDRFYG 269
Query: 385 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 444
R AY + + I +EI HGPVE + +Y D + Y GIY H G G HA+++IGWG
Sbjct: 270 RTAYGVKNDVAAIQKEILTHGPVEVAFEVYEDFLHYAGGIYVHTGGKLGGGHAVKLIGWG 329
Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ +GT YWL+ANS+NT+WGE G FRI+RG +ECGIE+ + G+PK
Sbjct: 330 ---IDQGTP----YWLIANSWNTDWGEEGFFRILRGVDECGIESGVVGGIPK 374
>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 244 bits (624), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 135/337 (40%), Positives = 196/337 (58%), Gaps = 26/337 (7%)
Query: 7 DAVATFLKDLDLSQSSRNH--SNGVFCDL-SKAFDRVDHSILLPKLPFYGAEKNALSKLT 63
+ +A + D+ LS S++ H G+ D +++ D VD L G K + T
Sbjct: 128 NTIACYNLDVTLSSSAKKHPKKTGLGLDAPAQSRDIVDFVNALGTTWTAGHNKR-FTYNT 186
Query: 64 LSELEMRMGVHPDS-KLPQNRLPLLVQLSDPLEELPEGFDAR--INWPYCP-TIQEIRDQ 119
L ++ G KLP R+P + LP FD R WP C ++ +RDQ
Sbjct: 187 LRHVKNLCGAKKGGPKLPVKRIPKKM-------ALPTSFDPRDGSKWPACKDSLNHVRDQ 239
Query: 120 GSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWV 179
GSCGS WA GA EAM+DR+CIAS G+ + LS++DL SCC CG GC+GG+ AW Y+
Sbjct: 240 GSCGSCWAFGAAEAMTDRICIASNGQNNFYLSAEDLTSCCDSCGMGCEGGYPSAAWDYFQ 299
Query: 180 TTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
+TG+V+GG + S QGC PY++ C+ ++ G + C D +P TP C CQ + ++ D
Sbjct: 300 STGLVTGGDWNSNQGCYPYQLQACDHHVTGKYQPCGDIQP-TPACANSCQ--NNATWSSD 356
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
+FG +YS+ ++++IM EI+ +GPVE S +YAD + YK+G+Y+HV G LG HA++I
Sbjct: 357 KHFGASSYSVGTDQQSIMTEIYTNGPVEASYDVYADFVSYKSGVYQHVTGDYLGGHAVKI 416
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IGWG + YW+VANS+N +WG NG F I
Sbjct: 417 IGWGVD-------GSTPYWIVANSWNNDWGNNGFFNI 446
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 104/163 (63%), Gaps = 11/163 (6%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY++ C+ ++ G C +P TP C CQ + ++ D +FG +YS+ ++
Sbjct: 314 GCYPYQLQACDHHVTGKYQPCGDIQP-TPACANSCQ--NNATWSSDKHFGASSYSVGTDQ 370
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
++IM EI+ +GPVE S +YAD + YK+G+Y+HV G LG HA++IIGWG +
Sbjct: 371 QSIMTEIYTNGPVEASYDVYADFVSYKSGVYQHVTGDYLGGHAVKIIGWGVD-------G 423
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YW+VANS+N +WG NG F I+RG +ECGIE I AG+PK+
Sbjct: 424 STPYWIVANSWNNDWGNNGFFNILRGSDECGIEDGIVAGIPKV 466
>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
Length = 341
Score = 244 bits (624), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 130/280 (46%), Positives = 169/280 (60%), Gaps = 14/280 (5%)
Query: 63 TLSELEMR--MGVHPDS-KLPQNRLPLLVQLSD---PLEELPEGFDARINWPYCPTIQEI 116
++SE +R MGVH +S K P ++ SD L +LP FDAR+ W CPTI EI
Sbjct: 49 SISEKYLRGLMGVHEESYKYPLPDKQEVLGESDDEISLADLPVDFDARLRWTSCPTISEI 108
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
R+QGSCGS WA+ MSDR+CI S G + RLS D++SCC CG CQGG+ G AW
Sbjct: 109 REQGSCGSCWAIATTSVMSDRLCIGSNGVMNFRLSGLDMLSCCAICGFACQGGYPGAAWA 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
YW G+VSGG Y S+QGC+PY I PC+ NGS C C C+P Y V +
Sbjct: 169 YWARKGLVSGGDYGSQQGCQPYTIEPCDHSGNGSRPVCTVG--GGVRCQHLCEPSYKVDF 226
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
+ D NF YS+ + I +EI +GPV+ +T+Y D + YKTG+Y H+ G +G HA
Sbjct: 227 QRDKNFASKVYSISNDVLEIQKEIMTNGPVQAILTVYEDFLSYKTGVYYHLEGEKVGPHA 286
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+RI+GWG GT V YWLVANS+ ++WG+NG F I
Sbjct: 287 VRILGWGV----WGTKK-VPYWLVANSWGSDWGDNGFFHI 321
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 74/163 (45%), Positives = 98/163 (60%), Gaps = 8/163 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I PC+ NGSR C C C+P Y V ++ D NF YS+ +
Sbjct: 186 GCQPYTIEPCDHSGNGSRPVCTVG--GGVRCQHLCEPSYKVDFQRDKNFASKVYSISNDV 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI +GPV+ +T+Y D + YKTG+Y H+ G +G HA+RI+GWG GT
Sbjct: 244 LEIQKEIMTNGPVQAILTVYEDFLSYKTGVYYHLEGEKVGPHAVRILGWGV----WGTKK 299
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWLVANS+ ++WG+NG F I RG+N C IE I AGLPK+
Sbjct: 300 -VPYWLVANSWGSDWGDNGFFHIFRGENHCDIEGYIMAGLPKL 341
>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
Length = 341
Score = 244 bits (623), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 123/258 (47%), Positives = 164/258 (63%), Gaps = 16/258 (6%)
Query: 79 LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRV 138
L ++RLPL + + LPE FDAR WP C ++++IR+QG CGS WA+ A E +DR
Sbjct: 71 LEKHRLPLGILVVKDHIVLPERFDARDRWPECTSLKQIRNQGCCGSCWAISAAETFTDRW 130
Query: 139 CIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY 198
CI S K + DL+SCC CG+GCQGG G AW++WV G+ SGG Y S+QGC PY
Sbjct: 131 CIHSEDKDQFSFGAYDLLSCCHSCGDGCQGGNLGPAWQFWVQRGVSSGGPYNSRQGCHPY 190
Query: 199 EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETIMR 257
+ + HS+ D + +TP+C RKCQ Y+V+ DD FGR+AYS+ +EE I
Sbjct: 191 PV------DVCHSA--DEDADTPKCTRKCQSMYNVTNVSDDRRFGRVAYSVSQDEERIKE 242
Query: 258 EIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYW 317
EIFR+GPV+ S +Y D YKTG+Y+HV G G HA+++IGWG E GT KYW
Sbjct: 243 EIFRNGPVQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGWGVE---NGT----KYW 295
Query: 318 LVANSFNTNWGENGLFRI 335
L +NS+ +WGE G F+I
Sbjct: 296 LCSNSWGEDWGERGFFKI 313
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/165 (47%), Positives = 104/165 (63%), Gaps = 20/165 (12%)
Query: 334 RIGCRPYEIPCERYMNGSRSSCQA--NEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSL 390
R GC PY + C + + +TP+C RKCQ Y+V+ DD FGR+AYS+
Sbjct: 184 RQGCHPYPV----------DVCHSADEDADTPKCTRKCQSMYNVTNVSDDRRFGRVAYSV 233
Query: 391 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 450
+EE I EIFR+GPV+ S +Y D YKTG+Y+HV G G HA+++IGWG E
Sbjct: 234 SQDEERIKEEIFRNGPVQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGWGVE---N 290
Query: 451 GTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT KYWL +NS+ +WGE G F+IVRG+N CGIE+D+ AGLP
Sbjct: 291 GT----KYWLCSNSWGEDWGERGFFKIVRGENHCGIESDVHAGLP 331
>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
Length = 325
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 121/269 (44%), Positives = 166/269 (61%), Gaps = 21/269 (7%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
GV LP + LP+L E++P+ FD+R WP C TI I DQ +CGS WA GA
Sbjct: 62 GVKGSIPLPLSDLPVL-------EDIPDMFDSRTQWPDCKTIGLIEDQSNCGSCWAFGAT 114
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY-- 189
E+MSDR CI K H+ +S+ +L+ CC++CGNGC+GGF G AW YW G+V+GG Y
Sbjct: 115 ESMSDRYCI--HMKMHLLISAANLMECCRNCGNGCEGGFLGAAWNYWKQEGLVTGGLYNP 172
Query: 190 --ASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
C+PY +P CE ++NGS +C TPEC+ C GY SYE DL++G AY
Sbjct: 173 SATESDTCQPYPLPSCEHHINGSKPACPSKIAKTPECVHTCHAGYPTSYEQDLHYGESAY 232
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 306
S+ I EI +GPVE + T+YAD YK+G+YK + LG HA+++IGWG+E
Sbjct: 233 SVRRRVAEIQTEIMTNGPVEAAFTVYADFPAYKSGVYKRHSLRQLGGHAVKMIGWGEE-- 290
Query: 307 GEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL+ANS+N++WG++G F+I
Sbjct: 291 -----DGIPYWLIANSWNSDWGDHGYFKI 314
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 72/153 (47%), Positives = 103/153 (67%), Gaps = 8/153 (5%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
C+PY +P CE ++NGS+ +C + TPEC+ C GY SYE DL++G AYS+
Sbjct: 180 CQPYPLPSCEHHINGSKPACPSKIAKTPECVHTCHAGYPTSYEQDLHYGESAYSVRRRVA 239
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I EI +GPVE + T+YAD YK+G+YK + LG HA+++IGWG+E
Sbjct: 240 EIQTEIMTNGPVEAAFTVYADFPAYKSGVYKRHSLRQLGGHAVKMIGWGEE-------DG 292
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEA 488
+ YWL+ANS+N++WG++G F+IVRGQ+ECGIE+
Sbjct: 293 IPYWLIANSWNSDWGDHGYFKIVRGQDECGIES 325
>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
Full=Cysteine protease-related 6; Flags: Precursor
gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
Length = 379
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 121/252 (48%), Positives = 160/252 (63%), Gaps = 10/252 (3%)
Query: 87 LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKR 146
L + D ++PE FD+R NWP C +I+ IRDQ SCGS WA GAVEAMSDR+CIAS G+
Sbjct: 95 LSKTKDLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGEL 154
Query: 147 HVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERY 205
V LS+DDL+SCCK CG GC GG AW+YWV GIV+G Y + GC+PY PCE +
Sbjct: 155 QVTLSADDLLSCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHH 214
Query: 206 MNGSH-SSCQDNEPNTPECIRKCQPGY-DVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
+H C + TP+C +KC Y D +Y +D FG AY + + E I +E+ HG
Sbjct: 215 SKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHG 274
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
P+E + +Y D + Y G+Y H G G HA+++IGWG + +G + YW VANS+
Sbjct: 275 PLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWG---IDDG----IPYWTVANSW 327
Query: 324 NTNWGENGLFRI 335
NT+WGE+G FRI
Sbjct: 328 NTDWGEDGFFRI 339
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/186 (39%), Positives = 108/186 (58%), Gaps = 13/186 (6%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPG 372
+YW V + T G N GC+PY P CE + + C + TP+C +KC
Sbjct: 184 RYW-VKDGIVT--GSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSD 240
Query: 373 Y-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
Y D +Y +D FG AY + + E I +E+ HGP+E + +Y D + Y G+Y H G
Sbjct: 241 YTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGK 300
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
G HA+++IGWG + +G + YW VANS+NT+WGE+G FRI+RG +ECGIE+ +
Sbjct: 301 LGGGHAVKLIGWG---IDDG----IPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVV 353
Query: 492 AGLPKI 497
G+PK+
Sbjct: 354 GGIPKL 359
>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
Length = 337
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 120/270 (44%), Positives = 164/270 (60%), Gaps = 9/270 (3%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
++ G + + +++ P + S E +P+ FDAR WP+CPTI EIRDQ SCGS W
Sbjct: 50 FQLMFGALREPEEQRSKRPTVSHESFSDEHIPKAFDARKQWPHCPTIGEIRDQSSCGSCW 109
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A GAVEAMSDR+CI + G R+S+ DL+SCC CG GCQGGF AW +W T GIV+G
Sbjct: 110 AFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGYCGFGCQGGFPPTAWDFWQTEGIVTG 169
Query: 187 GTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA 245
G+ + GCR Y P C + + + C +TP C++KC D Y D I
Sbjct: 170 GSKENPTGCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTP-DTDYATDKTRANIT 228
Query: 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 305
Y++ A + IM+EI +GPVE + +Y D + YK+G+Y H G LG HAIRI+GWG+E
Sbjct: 229 YNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEE- 287
Query: 306 LGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ V YWL+ANS+N WGE+G F++
Sbjct: 288 ------NGVAYWLIANSWNDGWGEDGYFKM 311
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 100/163 (61%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCR Y P C + + C +TP C++KC D Y D I Y++ A +
Sbjct: 177 GCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTP-DTDYATDKTRANITYNVKAKQ 235
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
IM+EI +GPVE + +Y D + YK+G+Y H G LG HAIRI+GWG+E +
Sbjct: 236 NAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEE-------N 288
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+N WGE+G F+++RG+NECGIE ++TAGLP++
Sbjct: 289 GVAYWLIANSWNDGWGEDGYFKMLRGKNECGIEDEVTAGLPEL 331
>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
Length = 378
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 121/252 (48%), Positives = 160/252 (63%), Gaps = 10/252 (3%)
Query: 87 LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKR 146
L + D ++PE FD+R NWP C +I+ IRDQ SCGS WA GAVEAMSDR+CIAS G+
Sbjct: 94 LSKTKDLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGEL 153
Query: 147 HVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERY 205
V LS+DDL+SCCK CG GC GG AW+YWV GIV+G Y + GC+PY PCE +
Sbjct: 154 QVTLSADDLLSCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHH 213
Query: 206 MNGSH-SSCQDNEPNTPECIRKCQPGY-DVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
+H C + TP+C +KC Y D +Y +D FG AY + + E I +E+ HG
Sbjct: 214 SKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHG 273
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
P+E + +Y D + Y G+Y H G G HA+++IGWG + +G + YW VANS+
Sbjct: 274 PLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWG---IDDG----IPYWTVANSW 326
Query: 324 NTNWGENGLFRI 335
NT+WGE+G FRI
Sbjct: 327 NTDWGEDGFFRI 338
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/186 (39%), Positives = 108/186 (58%), Gaps = 13/186 (6%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPG 372
+YW V + T G N GC+PY P CE + + C + TP+C +KC
Sbjct: 183 RYW-VKDGIVT--GSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSD 239
Query: 373 Y-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
Y D +Y +D FG AY + + E I +E+ HGP+E + +Y D + Y G+Y H G
Sbjct: 240 YTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGK 299
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
G HA+++IGWG + +G + YW VANS+NT+WGE+G FRI+RG +ECGIE+ +
Sbjct: 300 LGGGHAVKLIGWG---IDDG----IPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVV 352
Query: 492 AGLPKI 497
G+PK+
Sbjct: 353 GGIPKL 358
>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 508
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 118/241 (48%), Positives = 155/241 (64%), Gaps = 9/241 (3%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
LP+ FDAR WP+C +I EIRDQ SCGS WA GAVEAMSDR+CI S G + LS+ DL
Sbjct: 85 RLPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDL 144
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+SCCKDCG GC+GG+ AW YW T GIV+GG+ GCR Y P CE ++ G + C
Sbjct: 145 LSCCKDCGFGCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCP 204
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
TPEC+++C DV Y +D ++Y++ A+E +IM+EI GPVE T+Y D
Sbjct: 205 RELYPTPECVQQCDTP-DVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYED 263
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ Y +G+Y H G P+ HA+RI+GWG+ LG V YWL+ANS+N +WGE G +
Sbjct: 264 FLRYSSGVYFHALGAPMSGHAVRILGWGE--LGN-----VPYWLIANSWNEDWGEEGYMK 316
Query: 335 I 335
Sbjct: 317 F 317
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 98/160 (61%), Gaps = 9/160 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCR Y P CE ++ G C TPEC+++C DV Y +D ++Y++ A+E
Sbjct: 183 GCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDTP-DVGYLEDKTRANMSYNIYASE 241
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+IM+EI GPVE T+Y D + Y +G+Y H G P+ HA+RI+GWG+ LG
Sbjct: 242 ISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGE--LGN---- 295
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
V YWL+ANS+N +WGE G + +RG NECGIE D+TA L
Sbjct: 296 -VPYWLIANSWNEDWGEEGYMKFLRGYNECGIEDDVTAVL 334
>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 122/253 (48%), Positives = 166/253 (65%), Gaps = 11/253 (4%)
Query: 86 LLVQLSDPLEELPEGFDARINW-PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
L V+ + LP+ +D R W CP+ EIRDQGSCGS WA GAVEA +DR+CI S G
Sbjct: 66 LEVKQIPVIATLPDSYDTREKWGSTCPSTTEIRDQGSCGSCWAFGAVEAFTDRICIQSNG 125
Query: 145 KRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C 202
++ +S++DL++CC CG GC GG G AW ++ G V+GG Y S +GC+PYEIP C
Sbjct: 126 AKNPHISAEDLLTCCGFWCGFGCNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSC 185
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
E + +GS C+ +EP TP+C R C+ GY+VSY DD + YS+ +EE I EI+ +
Sbjct: 186 EHHTSGSKKPCEGSEP-TPKCKRSCREGYNVSYSDDKHKVSSHYSIANDEEQIKNEIYLN 244
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GPVE + T+Y+D YK+G+YK+ G LG HAI+I+GWG E + V YWLVANS
Sbjct: 245 GPVEAAFTVYSDFPNYKSGVYKYTTGNALGGHAIKILGWGVE-------NNVPYWLVANS 297
Query: 323 FNTNWGENGLFRI 335
+N +WG+ G F+I
Sbjct: 298 WNPDWGDKGFFKI 310
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 81/160 (50%), Positives = 111/160 (69%), Gaps = 9/160 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PYEIP CE + +GS+ C+ +EP TP+C R C+ GY+VSY DD + YS+ +E
Sbjct: 176 GCQPYEIPSCEHHTSGSKKPCEGSEP-TPKCKRSCREGYNVSYSDDKHKVSSHYSIANDE 234
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E I EI+ +GPVE + T+Y+D YK+G+YK+ G LG HAI+I+GWG E +
Sbjct: 235 EQIKNEIYLNGPVEAAFTVYSDFPNYKSGVYKYTTGNALGGHAIKILGWGVE-------N 287
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
V YWLVANS+N +WG+ G F+I+RG NECGIEA + AG+
Sbjct: 288 NVPYWLVANSWNPDWGDKGFFKILRGSNECGIEASVVAGM 327
>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
Length = 369
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 121/252 (48%), Positives = 160/252 (63%), Gaps = 10/252 (3%)
Query: 87 LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKR 146
L + D ++PE FD+R NWP C +I+ IRDQ SCGS WA GAVEAMSDR+CIAS G+
Sbjct: 85 LSKTKDLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGEL 144
Query: 147 HVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERY 205
V LS+DDL+SCCK CG GC GG AW+YWV GIV+G Y + GC+PY PCE +
Sbjct: 145 QVTLSADDLLSCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHH 204
Query: 206 MNGSH-SSCQDNEPNTPECIRKCQPGY-DVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
+H C + TP+C +KC Y D +Y +D FG AY + + E I +E+ HG
Sbjct: 205 SKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHG 264
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
P+E + +Y D + Y G+Y H G G HA+++IGWG + +G + YW VANS+
Sbjct: 265 PLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWG---IDDG----IPYWTVANSW 317
Query: 324 NTNWGENGLFRI 335
NT+WGE+G FRI
Sbjct: 318 NTDWGEDGFFRI 329
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/186 (39%), Positives = 108/186 (58%), Gaps = 13/186 (6%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPG 372
+YW V + T G N GC+PY P CE + + C + TP+C +KC
Sbjct: 174 RYW-VKDGIVT--GSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSD 230
Query: 373 Y-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
Y D +Y +D FG AY + + E I +E+ HGP+E + +Y D + Y G+Y H G
Sbjct: 231 YTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGK 290
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
G HA+++IGWG + +G + YW VANS+NT+WGE+G FRI+RG +ECGIE+ +
Sbjct: 291 LGGGHAVKLIGWG---IDDG----IPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVV 343
Query: 492 AGLPKI 497
G+PK+
Sbjct: 344 GGIPKL 349
>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 952
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 123/276 (44%), Positives = 165/276 (59%), Gaps = 9/276 (3%)
Query: 61 KLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+ + L G +S + R P + +ELP+ FDAR WP+CP+I EIRDQ
Sbjct: 15 RFESASLLHTFGALRESAEQRARRPTVKHEVSDEKELPKSFDARTKWPHCPSISEIRDQS 74
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
SC S WA GAVE+MSDR+CI S G + LS+ DL+SCC+DCG GC GFH AW +W T
Sbjct: 75 SCESFWAFGAVESMSDRLCIHSNGAFNKSLSATDLLSCCEDCGLGCGAGFHPMAWDFWKT 134
Query: 181 TGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
GIV+GG+ GCR + P C G + C + TPECI++C +V+YE D
Sbjct: 135 HGIVTGGSKEEPSGCRSFPFPKCGHRRKGRYPPCPRHIYPTPECIKQCDEP-EVNYEKDK 193
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
I+Y++ ++ +IM+EI +GPVE S IYAD + Y G+Y H GGP+ HAIRI+
Sbjct: 194 TRANISYNVYPSDISIMKEIMLNGPVEASFGIYADFLEYNGGVYFHCWGGPISRHAIRIL 253
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GWG++ V YWL+ANS+N +WGE G R
Sbjct: 254 GWGED-------DGVPYWLIANSWNEDWGEKGYVRF 282
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/298 (41%), Positives = 161/298 (54%), Gaps = 64/298 (21%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ LPE FDAR NWP+CP+I EIRDQ SCGS WA GAVEAMSDR+CI S+G + LS+ D
Sbjct: 637 QHLPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVD 696
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSC 213
LVSCC +CG GC+GG+ AW +W T GIV+GG+ GCR Y P CE G + C
Sbjct: 697 LVSCCTECGCGCRGGYSPIAWDFWKTHGIVTGGSKEKPTGCRSYPFPSCEHRGKGQYPPC 756
Query: 214 QDNEPNTPECIRKCQP-----------GYDVSYEDDL---------NFGR---------- 243
TPECI++C G+D + + L NFG
Sbjct: 757 PHQLYPTPECIKRCDTKEIDYEKDKTRGFDSASSEQLADRHCFHTSNFGEASAQRTLHLT 816
Query: 244 --------------------------IAYSLPANEETIMREIFRHGPVEGSMTIYADMIL 277
I+Y++ E+ +M+EI GPV + +Y D++
Sbjct: 817 CLNFMHHSIDLLSSRLEKAVLRSTANISYNVYPAEQAVMKEIMLRGPVGAILHVYEDLLD 876
Query: 278 YKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
YK+G+Y HV GG LGEH IRI+GWG+E V YWLVANS+N +WGE G R+
Sbjct: 877 YKSGVYFHVWGGHLGEHGIRILGWGEE-------DGVPYWLVANSWNEDWGEKGYMRV 927
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 70/158 (44%), Positives = 96/158 (60%), Gaps = 9/158 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCR + P C G C + TPECI++C +V+YE D I+Y++ ++
Sbjct: 148 GCRSFPFPKCGHRRKGRYPPCPRHIYPTPECIKQCDEP-EVNYEKDKTRANISYNVYPSD 206
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+IM+EI +GPVE S IYAD + Y G+Y H GGP+ HAIRI+GWG++
Sbjct: 207 ISIMKEIMLNGPVEASFGIYADFLEYNGGVYFHCWGGPISRHAIRILGWGED-------D 259
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITA 492
V YWL+ANS+N +WGE G R +RG NECGIE ++TA
Sbjct: 260 GVPYWLIANSWNEDWGEKGYVRFLRGHNECGIEEEVTA 297
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 76/219 (34%), Positives = 102/219 (46%), Gaps = 64/219 (29%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQP-----------GYDVSYEDDL-- 381
GCR Y P CE G C TPECI++C G+D + + L
Sbjct: 736 GCRSYPFPSCEHRGKGQYPPCPHQLYPTPECIKRCDTKEIDYEKDKTRGFDSASSEQLAD 795
Query: 382 -------NFGR------------------------------------IAYSLPANEETIM 398
NFG I+Y++ E+ +M
Sbjct: 796 RHCFHTSNFGEASAQRTLHLTCLNFMHHSIDLLSSRLEKAVLRSTANISYNVYPAEQAVM 855
Query: 399 REIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKY 458
+EI GPV + +Y D++ YK+G+Y HV GG LGEH IRI+GWG+E V Y
Sbjct: 856 KEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWGEE-------DGVPY 908
Query: 459 WLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
WLVANS+N +WGE G R++R +NECGI +TAGLP +
Sbjct: 909 WLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAGLPDL 947
>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
Length = 339
Score = 243 bits (619), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 125/255 (49%), Positives = 178/255 (69%), Gaps = 11/255 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V+ +D ++ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KLPRRVEFADDIK-LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGG-FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G +V +S++D+++CC G + AW +W G+VSGG Y S GC+PY IP
Sbjct: 126 NGHVNVEVSAEDMLTCCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPYSIP 185
Query: 202 -CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
CE ++NGS +C E +TP C + C+PGY SY++D ++G +YS+ ++E I EI+
Sbjct: 186 PCEHHVNGSRPACT-GEGDTPRCSKTCEPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVEG+ T+Y+D ++YK+G+Y+H G +G HAIRI+GWG+E + V YWLVA
Sbjct: 245 KNGPVEGAFTVYSDFLMYKSGVYQHTTGDIMGGHAIRILGWGEE-------NGVPYWLVA 297
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+ G F+I
Sbjct: 298 NSWNTDWGDKGFFKI 312
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 81/164 (49%), Positives = 122/164 (74%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GC+PY IP CE ++NGSR +C E +TP C + C+PGY SY++D ++G +YS+ +
Sbjct: 176 HVGCKPYSIPPCEHHVNGSRPACTG-EGDTPRCSKTCEPGYSPSYKEDKHYGYSSYSVSS 234
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E I EI+++GPVEG+ T+Y+D ++YK+G+Y+H G +G HAIRI+GWG+E
Sbjct: 235 DENEIKAEIYKNGPVEGAFTVYSDFLMYKSGVYQHTTGDIMGGHAIRILGWGEE------ 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ V YWLVANS+NT+WG+ G F+I+RGQ+ CGIE++I AG+P+
Sbjct: 289 -NGVPYWLVANSWNTDWGDKGFFKILRGQDHCGIESEIVAGIPR 331
>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
Length = 351
Score = 243 bits (619), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 129/292 (44%), Positives = 180/292 (61%), Gaps = 22/292 (7%)
Query: 53 GAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPT 112
GA + +S L+ G P R+ L+ LPE F AR WP CPT
Sbjct: 46 GAASYNFYNVDVSYLKRLCGTFLGGPKPPQRVTFTEDLN-----LPESFYAREQWPQCPT 100
Query: 113 IQEIRDQGSCG--SGW-----ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGN 164
I R Q G + W A GAVEA+SDR+CI + V +S++DL++CC CG+
Sbjct: 101 IXXXRAQPGRGGLTRWGSFLQAFGAVEAISDRICIHTNAHISVEVSAEDLLTCCGSMCGD 160
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPEC 223
GC GG+ +AW +W G+VSGG Y S GCRPY IP CE ++NGS C E +TP+C
Sbjct: 161 GCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT-GEGDTPKC 219
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
+ C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y
Sbjct: 220 SKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY 279
Query: 284 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+H+ G +G HAIRI+GWG E GT YWLVANS+NT+WG+NG F+I
Sbjct: 280 QHITGEMMGGHAIRILGWGVE---NGT----PYWLVANSWNTDWGDNGFFKI 324
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 83/164 (50%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 188 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 246
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+H+ G +G HAIRI+GWG E GT
Sbjct: 247 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEMMGGHAIRILGWGVE---NGT 303
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 304 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 343
>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
Length = 350
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 128/284 (45%), Positives = 172/284 (60%), Gaps = 12/284 (4%)
Query: 56 KNALSKLTLSELEMRMGVHPDSKLPQ-NRLPLLVQLSDPLEELPEGFDARINWPYCPTIQ 114
+ +S LT S+ + R+ P NR L+ L D E+PE FDAR W C +I+
Sbjct: 55 QTEISSLTSSDHKARLMSEEYLTQPNLNRNELMTGLLDV--EIPENFDAREKWSQCDSIR 112
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGK 173
IRDQ CGS WA+ A E MSDR CI S GK +V LS+ D++SCC CG GC+GG+ +
Sbjct: 113 TIRDQSHCGSCWAVSAAETMSDRTCIHSDGKINVGLSATDILSCCGTTCGRGCRGGYPIE 172
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNG-SHSSCQDNEPNTPECIRKCQPGY 231
AW+Y++ G+ +GG YA K C+PY PC + N + C TP+C + CQ GY
Sbjct: 173 AWRYFMLHGVCTGGHYAEKDVCKPYAFHPCGHHRNEIYYGECPKEIFPTPQCTQSCQAGY 232
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 291
YEDD +G+ AY+LP NE+ I REI +GPV+ + +Y D Y++GIY H AG
Sbjct: 233 ASDYEDDKIYGKSAYALPNNEKAIQREIMTNGPVQAAFMVYEDFSRYRSGIYVHTAGRRE 292
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA+++IGWG + G KYWL ANS+N++WGENG FRI
Sbjct: 293 GGHAVKLIGWGVDDDGN------KYWLAANSWNSDWGENGYFRI 330
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 101/163 (61%), Gaps = 8/163 (4%)
Query: 337 CRPYEI-PCERYMNG-SRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C+PY PC + N C TP+C + CQ GY YEDD +G+ AY+LP NE
Sbjct: 194 CKPYAFHPCGHHRNEIYYGECPKEIFPTPQCTQSCQAGYASDYEDDKIYGKSAYALPNNE 253
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I REI +GPV+ + +Y D Y++GIY H AG G HA+++IGWG + G
Sbjct: 254 KAIQREIMTNGPVQAAFMVYEDFSRYRSGIYVHTAGRREGGHAVKLIGWGVDDDGN---- 309
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
KYWL ANS+N++WGENG FRIVRG + CGIE+ + AG+P +
Sbjct: 310 --KYWLAANSWNSDWGENGYFRIVRGVDHCGIESAVVAGMPDV 350
>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
Length = 350
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 128/301 (42%), Positives = 171/301 (56%), Gaps = 30/301 (9%)
Query: 54 AEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTI 113
A +N K L + MG DS++ + LP + P FDAR +W CPT+
Sbjct: 43 AGRNFPKKTPLKYIYNLMGTLSDSRM--DNLPQRNYTFSRKTKYPNQFDAREHWKNCPTL 100
Query: 114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGK 173
++IRDQG CGS WA+ AV AM+DR+CI S+GK H S D++SCC CGNGC+GG +
Sbjct: 101 KDIRDQGGCGSCWAVAAVSAMTDRMCILSKGKEHFYFSIKDVLSCCGYCGNGCEGGVLTR 160
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNE--------------- 217
AW Y+ GIVSGG Y SKQGC+PY I PC + G C++
Sbjct: 161 AWIYYKKIGIVSGGGYKSKQGCQPYTIPPCNHLVWGEIEQCKNIPMTPKCKNIPVIPEQC 220
Query: 218 ---PNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
P TPEC +KC Y V Y D + G+ Y + +E I +EI+ +GPV T+Y D
Sbjct: 221 KYIPITPECEKKCNKNYKVCYSKDKHRGKSVYRVKKSE--IFKEIYEYGPVTSYFTVYED 278
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK GIY + +G LG H+++IIGWG+E +KYWL ANSFNT+WG+ G F+
Sbjct: 279 FLNYKEGIYNYTSGQKLGLHSVKIIGWGEE-------RGIKYWLAANSFNTDWGDKGFFK 331
Query: 335 I 335
I
Sbjct: 332 I 332
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 95/178 (53%), Gaps = 29/178 (16%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQ------------------ANEPNTPECIRKCQPGYDVS 376
GC+PY IP C + G C+ P TPEC +KC Y V
Sbjct: 181 GCQPYTIPPCNHLVWGEIEQCKNIPMTPKCKNIPVIPEQCKYIPITPECEKKCNKNYKVC 240
Query: 377 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 436
Y D + G+ Y + +E I +EI+ +GPV T+Y D + YK GIY + +G LG H
Sbjct: 241 YSKDKHRGKSVYRVKKSE--IFKEIYEYGPVTSYFTVYEDFLNYKEGIYNYTSGQKLGLH 298
Query: 437 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVR-GQNECGIEADITAG 493
+++IIGWG+E +KYWL ANSFNT+WG+ G F+I+R G CGI ++ AG
Sbjct: 299 SVKIIGWGEE-------RGIKYWLAANSFNTDWGDKGFFKIIREGVGSCGISDNVVAG 349
>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
Length = 339
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 113/242 (46%), Positives = 159/242 (65%), Gaps = 15/242 (6%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++PE FD+R WP C +++EIR+QG+CGS WA+ A MSDRVCI + G R+V ++++DL
Sbjct: 91 DIPESFDSRDRWPNCDSLREIRNQGTCGSCWAVAAASVMSDRVCIHTNGTRNVAIAAEDL 150
Query: 156 VSCCKDCGNGCQGGF-HGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC 213
+ CC DCGNGC+GGF G +++YWV G+VSGG Y S +GC+PY PC H
Sbjct: 151 MGCCADCGNGCEGGFLDGTSFQYWVDAGLVSGGAYNSTEGCKPYPFKPCLYPFTDCHRE- 209
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
+P+C CQ G D Y D FG +AYS+P +E I EI +GPVEG +Y
Sbjct: 210 -----ESPKCKHHCQHGVDKRYARDKVFGSVAYSVPRDERVIRYEIMTNGPVEGGFDVYE 264
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D+ LYK+G+Y+HV G +G+HA+RIIGWG+E + YWL++NS+ +WG++G F
Sbjct: 265 DVFLYKSGVYRHVYGEHVGKHAVRIIGWGRE-------GGIPYWLISNSYGEDWGDHGYF 317
Query: 334 RI 335
+I
Sbjct: 318 KI 319
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 99/163 (60%), Gaps = 14/163 (8%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY PC + C E +P+C CQ G D Y D FG +AYS+P +E
Sbjct: 190 GCKPYPFKPCLYPF----TDCHREE--SPKCKHHCQHGVDKRYARDKVFGSVAYSVPRDE 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVEG +Y D+ LYK+G+Y+HV G +G+HA+RIIGWG+E
Sbjct: 244 RVIRYEIMTNGPVEGGFDVYEDVFLYKSGVYRHVYGEHVGKHAVRIIGWGRE-------G 296
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL++NS+ +WG++G F+IVRG N GIE+ + GLP +
Sbjct: 297 GIPYWLISNSYGEDWGDHGYFKIVRGINHLGIESKVITGLPLV 339
>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
Length = 332
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 118/260 (45%), Positives = 158/260 (60%), Gaps = 20/260 (7%)
Query: 78 KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDR 137
+LP R + + ++PE FDAR WPYC +I I++QG CG+ WA+ V MSDR
Sbjct: 71 RLPTKRHDVAYNM-----DIPEFFDAREKWPYCKSISTIKNQGLCGACWAVATVSVMSDR 125
Query: 138 VCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF-HGKAWKYWVTTGIVSGGTYASKQGCR 196
+CI S GK V L+++DL+ CCKDCGNGC GGF G +++YWV G+VSG Y + GC+
Sbjct: 126 LCIHSEGKFDVELAAEDLMGCCKDCGNGCNGGFLDGTSFQYWVDVGLVSGAAYNNTDGCK 185
Query: 197 PYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETI 255
PY PC G H TP C C GYD +Y D +G AY LP +E I
Sbjct: 186 PYPFKPCLYPFVGCHPE------KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDERMI 239
Query: 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 315
EI +GPVE ++Y D+ LYKTG+Y+HV G +G+HA+R+IGWG+E V
Sbjct: 240 QLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKE-------RGVP 292
Query: 316 YWLVANSFNTNWGENGLFRI 335
YWL+ANS+ +WGE+G F+
Sbjct: 293 YWLIANSYGEDWGEHGYFKF 312
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 72/163 (44%), Positives = 97/163 (59%), Gaps = 14/163 (8%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY PC G + TP C C GYD +Y D +G AY LP +E
Sbjct: 183 GCKPYPFKPCLYPFVG------CHPEKTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDE 236
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVE ++Y D+ LYKTG+Y+HV G +G+HA+R+IGWG+E
Sbjct: 237 RMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKE-------R 289
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+ +WGE+G F+ +RG N GIE+ + AGLPK+
Sbjct: 290 GVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLPKV 332
>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
sinensis]
gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 120/257 (46%), Positives = 156/257 (60%), Gaps = 11/257 (4%)
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
+ R P + + LP+ FDAR WP+CP+I EIRDQ CGS WA GAVEAMSDR+CI
Sbjct: 70 KARRPTVTHVGFDAMRLPKNFDARTKWPHCPSISEIRDQSGCGSCWAFGAVEAMSDRLCI 129
Query: 141 ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI 200
S G + LS+ DL+SCC++CG GC GG+ AW YW GIV+GG+ GCR Y
Sbjct: 130 HSNGAFNKSLSAVDLLSCCENCGYGCSGGYPAVAWDYWGAHGIVTGGSKEDPSGCRSYPF 189
Query: 201 P-CERYMNGSHSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
P CE ++ G + C TPEC++ C PG D Y D ++Y++ ++E IM+E
Sbjct: 190 PKCEHHVQGHYPPCPHQYYPTPECVQHCDTPGID--YVKDKTRANMSYNIYSSEILIMKE 247
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWL 318
I GPVE T+Y D + YK G+Y H G PL EHAIRI+GWG+E V YWL
Sbjct: 248 IMLRGPVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWGEE-------GDVPYWL 300
Query: 319 VANSFNTNWGENGLFRI 335
+ANS+N +WGE G +
Sbjct: 301 IANSWNEDWGEKGYMKF 317
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 79/178 (44%), Positives = 103/178 (57%), Gaps = 18/178 (10%)
Query: 327 WGENGLFR-------IGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQ-PGYDVSY 377
WG +G+ GCR Y P CE ++ G C TPEC++ C PG D Y
Sbjct: 167 WGAHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCPHQYYPTPECVQHCDTPGID--Y 224
Query: 378 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 437
D ++Y++ ++E IM+EI GPVE T+Y D + YK G+Y H G PL EHA
Sbjct: 225 VKDKTRANMSYNIYSSEILIMKEIMLRGPVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHA 284
Query: 438 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
IRI+GWG+E V YWL+ANS+N +WGE G + +RG NECGIE D+TAGLP
Sbjct: 285 IRILGWGEE-------GDVPYWLIANSWNEDWGEKGYMKFLRGLNECGIEDDVTAGLP 335
>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 337
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 120/270 (44%), Positives = 164/270 (60%), Gaps = 9/270 (3%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
++ G + + +++ P + S E +P+ FDAR WP+CPTI EIRDQ SCGS W
Sbjct: 50 FQLMFGALREPEEQRSKRPTVSHESFSDEHIPKAFDARKQWPHCPTIGEIRDQSSCGSCW 109
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A GAVEAMSDR+CI + G R+S+ DL+SCC CG GCQGGF AW +W T GIV+G
Sbjct: 110 AFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGYCGFGCQGGFPPIAWDFWQTEGIVTG 169
Query: 187 GTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA 245
G+ + GCR Y P C + + + C +TP C++KC D Y D I
Sbjct: 170 GSKENPTGCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTP-DTDYATDKTRANIT 228
Query: 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 305
Y++ A + IM+EI +GPVE + +Y D + YK+G+Y H G LG HAIRI+GWG+E
Sbjct: 229 YNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEE- 287
Query: 306 LGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ V YWL+ANS+N WGE+G F++
Sbjct: 288 ------NGVAYWLIANSWNDGWGEDGCFKM 311
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 100/163 (61%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCR Y P C + + C +TP C++KC D Y D I Y++ A +
Sbjct: 177 GCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTP-DTDYATDKTRANITYNVKAKQ 235
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
IM+EI +GPVE + +Y D + YK+G+Y H G LG HAIRI+GWG+E +
Sbjct: 236 NAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEE-------N 288
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+N WGE+G F+++RG+NECGIE ++TAGLP++
Sbjct: 289 GVAYWLIANSWNDGWGEDGCFKMLRGKNECGIEDEVTAGLPEL 331
>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
Length = 334
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 116/260 (44%), Positives = 164/260 (63%), Gaps = 20/260 (7%)
Query: 78 KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDR 137
K+P R + + ++PE FDAR +WP C +++ IR+QG+CGS WA+ A MSDR
Sbjct: 73 KVPIRRYEYVYDV-----DIPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDR 127
Query: 138 VCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF-HGKAWKYWVTTGIVSGGTYASKQGCR 196
VCI S G +V L+++DL+ CC DCGNGC GGF G +++YWV G+VSGG Y S GC+
Sbjct: 128 VCIHSNGTINVALAAEDLMGCCVDCGNGCNGGFLDGTSFQYWVDAGLVSGGAYNSTDGCK 187
Query: 197 PYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETI 255
PY PCE N H +P+C C+ G D Y D FG++AYS+P +E I
Sbjct: 188 PYPFKPCEYPFNDCHVEI------SPKCTHHCRDGVDRHYSKDKLFGKVAYSVPRDERAI 241
Query: 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 315
EI +GPVE +Y D++LYK+G+Y+HV G +G+HA+RIIGWG++ +
Sbjct: 242 RYEIMTNGPVEAGFDVYEDVLLYKSGVYRHVYGEQIGKHAVRIIGWGRD-------GGIP 294
Query: 316 YWLVANSFNTNWGENGLFRI 335
YWL+ANS+ +WG++G F+
Sbjct: 295 YWLIANSYGDDWGDHGYFKF 314
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 99/163 (60%), Gaps = 14/163 (8%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY PCE N C +P+C C+ G D Y D FG++AYS+P +E
Sbjct: 185 GCKPYPFKPCEYPFN----DCHVEI--SPKCTHHCRDGVDRHYSKDKLFGKVAYSVPRDE 238
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVE +Y D++LYK+G+Y+HV G +G+HA+RIIGWG++
Sbjct: 239 RAIRYEIMTNGPVEAGFDVYEDVLLYKSGVYRHVYGEQIGKHAVRIIGWGRD-------G 291
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL+ANS+ +WG++G F+ VRG N GIE+ I GLP I
Sbjct: 292 GIPYWLIANSYGDDWGDHGYFKFVRGSNHLGIESKIITGLPLI 334
>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 114/241 (47%), Positives = 157/241 (65%), Gaps = 9/241 (3%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LPE FD WP CP+++EIRDQ CGS WA GA EA +DR+CIAS+GK RLS DL
Sbjct: 68 DLPESFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSEQDL 127
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
++CC CG GC GG+ AW+++ +TG+ +GG Y SK C Y P CE + G + C
Sbjct: 128 LTCCDSCGFGCDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAYSFPKCEHHAEGKYPPCG 187
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+++ TPEC+++CQ GY V YE D +F AY + + I E+ +GP+E S +Y D
Sbjct: 188 ESQ-ETPECVKQCQEGYPVEYEKDKHFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYED 246
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+GIY+HVAG LG HA++++GWG E ++YW +ANS+N +WGENG FR
Sbjct: 247 FLTYKSGIYQHVAGKYLGGHAVKLVGWGVE-------DGIEYWKIANSWNEDWGENGYFR 299
Query: 335 I 335
I
Sbjct: 300 I 300
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 99/162 (61%), Gaps = 9/162 (5%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
C Y P CE + G C ++ TPEC+++CQ GY V YE D +F AY + +
Sbjct: 167 CNAYSFPKCEHHAEGKYPPCGESQ-ETPECVKQCQEGYPVEYEKDKHFFGEAYYVQGGID 225
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I E+ +GP+E S +Y D + YK+GIY+HVAG LG HA++++GWG E
Sbjct: 226 AIKTELMTNGPLEVSFFVYEDFLTYKSGIYQHVAGKYLGGHAVKLVGWGVE-------DG 278
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
++YW +ANS+N +WGENG FRIV G+ ECGIE G+PK+
Sbjct: 279 IEYWKIANSWNEDWGENGYFRIVAGKGECGIEVGPIGGIPKL 320
>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
Length = 319
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 116/256 (45%), Positives = 153/256 (59%), Gaps = 8/256 (3%)
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
+N+ PL ++ ELP+ FD+ WP CP+I E+RDQ SC S WA G VE +DR+CI
Sbjct: 53 KNKKPLPIRSIPIKRELPKEFDSSEKWPECPSILEVRDQSSCASCWAFGVVEVATDRICI 112
Query: 141 ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI 200
S+GK VRLS++D++ CCKDCG CQGG+ AW+Y TG+V+GG Y S + C+ Y
Sbjct: 113 ESKGKNQVRLSAEDVLECCKDCGFQCQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSYPF 172
Query: 201 -PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 259
PC + G + C P P+C CQ GY + YE D Y L N + I EI
Sbjct: 173 PPCSHGIEGQYPQCSTKPPVVPKCETTCQEGYPIEYEKDRYKFSNVYQLENNVDQIKNEI 232
Query: 260 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 319
+GPV+ S +Y D + YK+GIY HV G + H ++IIGWG+E GE YW
Sbjct: 233 MENGPVDASFQVYEDFMTYKSGIYHHVEGKFMNLHTVKIIGWGEEN-GEA------YWKA 285
Query: 320 ANSFNTNWGENGLFRI 335
NS+N+ WGENGLFRI
Sbjct: 286 VNSWNSEWGENGLFRI 301
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/159 (42%), Positives = 85/159 (53%), Gaps = 8/159 (5%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
C+ Y P C + G C P P+C CQ GY + YE D Y L N +
Sbjct: 167 CKSYPFPPCSHGIEGQYPQCSTKPPVVPKCETTCQEGYPIEYEKDRYKFSNVYQLENNVD 226
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I EI +GPV+ S +Y D + YK+GIY HV G + H ++IIGWG+E GE
Sbjct: 227 QIKNEIMENGPVDASFQVYEDFMTYKSGIYHHVEGKFMNLHTVKIIGWGEEN-GEA---- 281
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
YW NS+N+ WGENGLFRI G NEC IE+ + GL
Sbjct: 282 --YWKAVNSWNSEWGENGLFRIRLGTNECTIESQVEGGL 318
>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
Length = 340
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 127/277 (45%), Positives = 179/277 (64%), Gaps = 19/277 (6%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARI---NWPYCPTIQEIRDQ 119
L + ++GVH D+ + RLP LV D LE ++P FD+R +WP+ P +++
Sbjct: 58 LETVRRKLGVHRDNH--KYRLPELVH--DTLEMDIPAQFDSRQQWQDWPHHPGDPGTKER 113
Query: 120 GSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWV 179
GAVE+MSDR CI S K V L++DD++SCC CG+GC GGF AW YWV
Sbjct: 114 ADPVG--HFGAVESMSDRHCIHSGAKNIVHLAADDVLSCCWGCGSGCNGGFPAAAWSYWV 171
Query: 180 TTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
GIV+GG Y + +GC PY +P C+ ++NG+ C + P TP+C+R C+ GY+V ++DD
Sbjct: 172 DKGIVTGGNYDTDEGCMPYPVPSCDHHVNGTLGPCGQDPP-TPKCVRLCRKGYNVDFKDD 230
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
++G+ +YS+P+NE I EI ++GPVEG+ T+YAD LYK+G+YK + LG HAIRI
Sbjct: 231 KHYGKSSYSVPSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRI 290
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GWG E + V YWLVANS+NT WG+ G F+I
Sbjct: 291 LGWGVE-------NDVPYWLVANSWNTEWGDKGYFKI 320
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 84/162 (51%), Positives = 116/162 (71%), Gaps = 9/162 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C+ ++NG+ C +P TP+C+R C+ GY+V ++DD ++G+ +YS+P+NE
Sbjct: 186 GCMPYPVPSCDHHVNGTLGPC-GQDPPTPKCVRLCRKGYNVDFKDDKHYGKSSYSVPSNE 244
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI ++GPVEG+ T+YAD LYK+G+YK + LG HAIRI+GWG E +
Sbjct: 245 TQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVE-------N 297
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWLVANS+NT WG+ G F+I+RG NECGIE DI AG+PK
Sbjct: 298 DVPYWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAGIPK 339
>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 241 bits (616), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 116/240 (48%), Positives = 153/240 (63%), Gaps = 9/240 (3%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FDAR WP+CP+I EIRDQ SCGS WA GAVEAMSDR+CI S G + LS+ DL+
Sbjct: 86 IPKSFDARATWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSSGAFNKSLSAVDLL 145
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQD 215
SCCKDCG+GC GGF AW +W T GIV+GG+ GCRPY P C+ + G + C
Sbjct: 146 SCCKDCGDGCDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQHHSQGHYPPCPR 205
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
TP+C++ C + Y+ D +Y++ +E IM+EI +GPVE + ++ D
Sbjct: 206 RIYPTPKCVKHCDTP-KIDYQKDKTRANTSYNVHQSEVAIMKEILLNGPVEATFEVHEDF 264
Query: 276 ILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
YK+GIY H GG +G HAIRI+GWG+E + V YWL+ANS+N +WGE G R
Sbjct: 265 PEYKSGIYFHAWGGSVGGHAIRILGWGEE-------NGVPYWLIANSWNEDWGEKGYLRF 317
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/163 (42%), Positives = 97/163 (59%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY P C+ + G C TP+C++ C + Y+ D +Y++ +E
Sbjct: 183 GCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHCDTP-KIDYQKDKTRANTSYNVHQSE 241
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
IM+EI +GPVE + ++ D YK+GIY H GG +G HAIRI+GWG+E +
Sbjct: 242 VAIMKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEE-------N 294
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+N +WGE G R +RG NECGIE + TAGLP +
Sbjct: 295 GVPYWLIANSWNEDWGEKGYLRFLRGHNECGIEEEATAGLPDL 337
>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
Length = 386
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 120/260 (46%), Positives = 162/260 (62%), Gaps = 16/260 (6%)
Query: 77 SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSD 136
+ L + +LPL + +LP+ FDAR WP CP+++EIRDQG CGS WA+ A AM+D
Sbjct: 105 ADLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTD 164
Query: 137 RVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCR 196
R C+ S+GK S DL+SCC CG GC+GG G AW++WV G+ SGG S+QGC
Sbjct: 165 RWCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCH 224
Query: 197 PYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETI 255
PY I + +TP+C KC+ GY+V+ D ++GR+AYSLP +E I
Sbjct: 225 PYPI--------GECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKI 276
Query: 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 315
M EIF +GPV+ + Y D+ YK+GIY+HV G G HA++++GWG E + VK
Sbjct: 277 MEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVE-------NGVK 329
Query: 316 YWLVANSFNTNWGENGLFRI 335
YWLVANS+ WGENG F+I
Sbjct: 330 YWLVANSWGREWGENGFFKI 349
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 102/165 (61%), Gaps = 16/165 (9%)
Query: 334 RIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPA 392
R GC PY I + +TP+C KC+ GY+V+ D ++GR+AYSLP
Sbjct: 220 RQGCHPYPI--------GECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHYGRVAYSLPN 271
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E IM EIF +GPV+ + Y D+ YK+GIY+HV G G HA++++GWG E
Sbjct: 272 DERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVE------ 325
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ VKYWLVANS+ WGENG F+IVRG+N CGIE +I AGLP
Sbjct: 326 -NGVKYWLVANSWGREWGENGFFKIVRGENHCGIEENIHAGLPNF 369
>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
Length = 343
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 118/240 (49%), Positives = 155/240 (64%), Gaps = 9/240 (3%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP+ FDAR WP+C +I EIRDQ SCGS WA GAVEAMSDR+CI S G + LS+ DL+
Sbjct: 86 LPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLL 145
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQD 215
SCCKDCG GC+GG+ AW YW T GIV+GG+ GCR Y P CE ++ G + C
Sbjct: 146 SCCKDCGFGCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCPR 205
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
TPEC+++C DV Y +D ++Y++ A+E +IM+EI GPVE T+Y D
Sbjct: 206 ELYPTPECVQQCDTP-DVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDF 264
Query: 276 ILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ Y +G+Y H G P+ HA+RI+GWG+ LG V YWL+ANS+N +WGE G +
Sbjct: 265 LRYSSGVYFHALGAPMSGHAVRILGWGE--LGN-----VPYWLIANSWNEDWGEEGYMKF 317
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/163 (44%), Positives = 101/163 (61%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCR Y P CE ++ G C TPEC+++C DV Y +D ++Y++ A+E
Sbjct: 183 GCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDTP-DVGYLEDKTRANMSYNIYASE 241
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+IM+EI GPVE T+Y D + Y +G+Y H G P+ HA+RI+GWG+ LG
Sbjct: 242 ISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGE--LGN---- 295
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+N +WGE G + +RG NECGIE D+TAGLP +
Sbjct: 296 -VPYWLIANSWNEDWGEEGYMKFLRGYNECGIEDDVTAGLPYL 337
>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
Length = 342
Score = 241 bits (614), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 117/274 (42%), Positives = 169/274 (61%), Gaps = 17/274 (6%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
TL GV D +L + RL + ++ +LPE FDAR W CP++ IR+QG C
Sbjct: 54 TLPPAAYFKGVLYD-RLGETRLAPAILVNPQDIQLPESFDARQKWSQCPSLNVIRNQGCC 112
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA+ A AM+DR CI S+GK + D+++CC CG+GC+GG+ G AW++WV G
Sbjct: 113 GSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACCHACGDGCKGGYLGPAWQFWVEQG 172
Query: 183 IVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS-YEDDLNF 241
+ SGG Y S+QGC PY I E +TP+C ++CQ GY+V+ D +
Sbjct: 173 VSSGGPYNSRQGCHPYPIDV--------CDASGEEADTPKCSKRCQSGYNVTDVWQDRRY 224
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
GR+AYS+P +E+ IM EI+ +GPV+ + Y D+ YK+G+Y+HV G G HA++++GW
Sbjct: 225 GRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGW 284
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E + +KYWLVANS+ +WG+NG F+I
Sbjct: 285 GVE-------NGLKYWLVANSWGDDWGDNGFFKI 311
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 75/167 (44%), Positives = 107/167 (64%), Gaps = 20/167 (11%)
Query: 334 RIGCRPYEIPCERYMNGSRSSCQAN--EPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSL 390
R GC PY I C A+ E +TP+C ++CQ GY+V+ D +GR+AYS+
Sbjct: 182 RQGCHPYPI----------DVCDASGEEADTPKCSKRCQSGYNVTDVWQDRRYGRVAYSI 231
Query: 391 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 450
P +E+ IM EI+ +GPV+ + Y D+ YK+G+Y+HV G G HA++++GWG E
Sbjct: 232 PNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGVE---- 287
Query: 451 GTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ +KYWLVANS+ +WG+NG F+IVRG+N CGIE D+ AGLP
Sbjct: 288 ---NGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAGLPSF 331
>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
Length = 206
Score = 241 bits (614), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 111/204 (54%), Positives = 150/204 (73%), Gaps = 3/204 (1%)
Query: 103 ARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-D 161
+R WP CPTI+EIRDQGSCGS WA GAVEAMSDR+CI SRGK +V +S++DL+SCCK +
Sbjct: 1 SREQWPDCPTIKEIRDQGSCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKLE 60
Query: 162 CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNT 220
CGNGC GG+ AW++W G+VSGG Y S GCRPY I PCE ++NGS C E T
Sbjct: 61 CGNGCNGGYPSGAWEFWTNDGLVSGGLYYSHIGCRPYSISPCEHHVNGSRPKCS-GEIET 119
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
P C R+C+ GY Y +D ++G +YS+ ++ IM EI+++GPVE ++ ++ D +LYK+
Sbjct: 120 PRCSRRCEAGYSPKYSEDKHYGLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDFLLYKS 179
Query: 281 GIYKHVAGGPLGEHAIRIIGWGQE 304
G+Y+H GG +G HAI+I+GWG+E
Sbjct: 180 GVYQHKTGGSIGGHAIKILGWGEE 203
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 61/132 (46%), Positives = 89/132 (67%), Gaps = 6/132 (4%)
Query: 320 ANSFNTNWG--ENGLF--RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
A F TN G GL+ IGCRPY I PCE ++NGSR C + E TP C R+C+ GY
Sbjct: 73 AWEFWTNDGLVSGGLYYSHIGCRPYSISPCEHHVNGSRPKC-SGEIETPRCSRRCEAGYS 131
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
Y +D ++G +YS+ ++ IM EI+++GPVE ++ ++ D +LYK+G+Y+H GG +G
Sbjct: 132 PKYSEDKHYGLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDFLLYKSGVYQHKTGGSIG 191
Query: 435 EHAIRIIGWGQE 446
HAI+I+GWG+E
Sbjct: 192 GHAIKILGWGEE 203
>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
Length = 386
Score = 241 bits (614), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 119/260 (45%), Positives = 162/260 (62%), Gaps = 16/260 (6%)
Query: 77 SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSD 136
+ L + +LPL + +LP+ FDAR WP CP+++EIRDQG CGS WA+ A AM+D
Sbjct: 105 ADLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTD 164
Query: 137 RVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCR 196
R C+ S+GK S DL+SCC CG GC+GG G AW++WV G+ SGG S+QGC
Sbjct: 165 RWCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCH 224
Query: 197 PYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETI 255
PY I + +TP+C KC+ GY+V+ D ++GR+AYSLP +E I
Sbjct: 225 PYPI--------GECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKI 276
Query: 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 315
M EIF +GPV+ + Y D+ YK+GIY+HV G G HA++++GWG E + VK
Sbjct: 277 MEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVE-------NGVK 329
Query: 316 YWLVANSFNTNWGENGLFRI 335
YWLVANS+ WGENG F++
Sbjct: 330 YWLVANSWGREWGENGFFKM 349
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 80/179 (44%), Positives = 106/179 (59%), Gaps = 23/179 (12%)
Query: 327 WGENGLF-------RIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YE 378
W E GL R GC PY I + +TP+C KC+ GY+V+
Sbjct: 206 WVEKGLSSGGPLNSRQGCHPYPI--------GECRIPGEDEDTPKCSNKCRSGYNVTDVW 257
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D ++GR+AYSLP +E IM EIF +GPV+ + Y D+ YK+GIY+HV G G HA+
Sbjct: 258 QDRHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAV 317
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+++GWG E + VKYWLVANS+ WGENG F++VRG+N CGIE +I AGLP
Sbjct: 318 KLLGWGVE-------NGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLPNF 369
>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
Length = 342
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 117/274 (42%), Positives = 169/274 (61%), Gaps = 17/274 (6%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
TL GV D +L + RL + ++ +LPE FDAR W CP++ IR+QG C
Sbjct: 54 TLPPAAYFKGVLYD-RLGETRLAPAILVNPQDIQLPESFDARQKWSQCPSLNVIRNQGCC 112
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA+ A AM+DR CI S+GK + D+++CC CG+GC+GG+ G AW++WV G
Sbjct: 113 GSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACCHACGDGCKGGYLGPAWQFWVEQG 172
Query: 183 IVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS-YEDDLNF 241
+ SGG Y S+QGC PY I E +TP+C ++CQ GY+V+ D +
Sbjct: 173 VSSGGPYNSRQGCHPYPIDV--------CDASGEEADTPKCSKRCQSGYNVTDVWQDRRY 224
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
GR+AYS+P +E+ IM EI+ +GPV+ + Y D+ YK+G+Y+HV G G HA++++GW
Sbjct: 225 GRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGW 284
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E + +KYWLVANS+ +WG+NG F+I
Sbjct: 285 GVE-------NGLKYWLVANSWGDDWGDNGFFKI 311
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 75/167 (44%), Positives = 107/167 (64%), Gaps = 20/167 (11%)
Query: 334 RIGCRPYEIPCERYMNGSRSSCQAN--EPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSL 390
R GC PY I C A+ E +TP+C ++CQ GY+V+ D +GR+AYS+
Sbjct: 182 RQGCHPYPI----------DVCDASGEEADTPKCSKRCQSGYNVTDVWQDRRYGRVAYSI 231
Query: 391 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 450
P +E+ IM EI+ +GPV+ + Y D+ YK+G+Y+HV G G HA++++GWG E
Sbjct: 232 PNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGVE---- 287
Query: 451 GTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ +KYWLVANS+ +WG+NG F+IVRG+N CGIE D+ AGLP
Sbjct: 288 ---NGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAGLPSF 331
>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
[Tribolium castaneum]
gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 173/290 (59%), Gaps = 13/290 (4%)
Query: 48 KLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW 107
K + A +N L+ ++ +GV P K +L L V D + +PE FDAR W
Sbjct: 37 KKTTWKAGRNFDIHTPLANIKKLLGVLP-KKANARQLELKVHSVD-VNAIPESFDAREAW 94
Query: 108 PYCPTI-QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGC 166
P C +I +IRDQ SCGS WA GA EAMSDR+CI S V +S++DL +CC +CG+GC
Sbjct: 95 PECASIIGDIRDQASCGSCWAFGAAEAMSDRICIHSNATVKVSISTEDLNTCCYECGDGC 154
Query: 167 QGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIR 225
GG+ +AW YW TGIV+GG Y +K GC+ Y + PCE + G +C D P TP+C +
Sbjct: 155 NGGWPAEAWAYWAETGIVTGGKYETKDGCKAYTVPPCEHHTEGDLPACGDIVP-TPQCKK 213
Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 285
+C G D+ Y+ DL G AY ++E I EI +GPVE +Y D + YK+G+Y+
Sbjct: 214 ECDAGVDIEYKSDLRKGS-AYQTSSDESQIQTEIMTNGPVEADFDVYEDFLNYKSGVYQQ 272
Query: 286 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G G HAI+I+GWG E +GT YWL ANS+N +WG+ G F+I
Sbjct: 273 TTGNYAGGHAIKILGWGVE---DGTP----YWLAANSWNEDWGDKGYFKI 315
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/179 (41%), Positives = 105/179 (58%), Gaps = 17/179 (9%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E G+ + GC+ Y +P CE + G +C P TP+C ++C G D+ Y+
Sbjct: 166 WAETGIVTGGKYETKDGCKAYTVPPCEHHTEGDLPACGDIVP-TPQCKKECDAGVDIEYK 224
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
DL G AY ++E I EI +GPVE +Y D + YK+G+Y+ G G HAI
Sbjct: 225 SDLRKGS-AYQTSSDESQIQTEIMTNGPVEADFDVYEDFLNYKSGVYQQTTGNYAGGHAI 283
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+I+GWG E +GT YWL ANS+N +WG+ G F+I+RGQNECGIE+DI G+P +
Sbjct: 284 KILGWGVE---DGTP----YWLAANSWNEDWGDKGYFKILRGQNECGIESDIIGGIPVV 335
>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
Length = 328
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 124/293 (42%), Positives = 174/293 (59%), Gaps = 13/293 (4%)
Query: 44 ILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDA 103
+L K + A +N ++ L+ V + +P +LPL + P +E+P FDA
Sbjct: 27 LLQSKQMTWKAGRNFAKDISKDFLKSLNCVRKNPDIP--KLPL--KNVTPTKEIPVEFDA 82
Query: 104 RINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCG 163
R WP+CP I EIRDQG+CGS WA+ A M+DR CI + G R SS+++ +CC +CG
Sbjct: 83 REQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSENVAACCTECG 142
Query: 164 NGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPE 222
N C GG A+ +WVT G VSGG + S +GC+PY + CE ++ G C+ + P
Sbjct: 143 NACYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVEECEHHIEGPRPPCEGDMPELV- 201
Query: 223 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
C C Y +YE+DL +G AY LP + I EI +GPV + +Y D + YK+G+
Sbjct: 202 CSETCHEEYGKTYEEDLEYGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLSYKSGV 261
Query: 283 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
Y+H G G HA+R+IGWG+E EGT YWLVANS+NT+WG+NGLF+I
Sbjct: 262 YQHETGLLDGYHAVRVIGWGEE---EGTP----YWLVANSWNTDWGDNGLFKI 307
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 71/159 (44%), Positives = 98/159 (61%), Gaps = 9/159 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY + CE ++ G R C+ + P C C Y +YE+DL +G AY LP +
Sbjct: 173 GCQPYSVEECEHHIEGPRPPCEGDMPELV-CSETCHEEYGKTYEEDLEYGLEAYVLPQDV 231
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPV + +Y D + YK+G+Y+H G G HA+R+IGWG+E EGT
Sbjct: 232 TQIQEEIMTNGPVTAAFAVYDDFLSYKSGVYQHETGLLDGYHAVRVIGWGEE---EGTP- 287
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
YWLVANS+NT+WG+NGLF+I+RG +EC E D+ A
Sbjct: 288 ---YWLVANSWNTDWGDNGLFKILRGSDECEFEGDMAAA 323
>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
Length = 342
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 132/275 (48%), Positives = 178/275 (64%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ + R P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 56 SLDDARILMGARKEDAEMKRKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G++ LS+ DL+SCCKDCG GC+GGF G+AW YWV
Sbjct: 115 CGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCKDCGGGCKGGFPGQAWDYWVKR 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+GG+ + GC+PY P CE G + +C TP+C + CQ GY YE D +
Sbjct: 175 GIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKH 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G Y++ +NE+ I REI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIG
Sbjct: 235 YGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E +G YWL+ANS+N +WGE GLFR+
Sbjct: 295 WGVE---KGKP----YWLIANSWNEDWGEKGLFRM 322
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 78/162 (48%), Positives = 104/162 (64%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE G +C TP+C + CQ GY YE D ++G Y++ +NE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I REI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIGWG E +G
Sbjct: 247 KAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVE---KGKP- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL+ANS+N +WGE GLFR+VRG++EC IE+ + AGL K
Sbjct: 303 ---YWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGLIK 341
>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
Length = 386
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 119/260 (45%), Positives = 162/260 (62%), Gaps = 16/260 (6%)
Query: 77 SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSD 136
+ L + +LPL + +LP+ FDAR WP CP+++EIRDQG CGS WA+ A AM+D
Sbjct: 105 ADLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTD 164
Query: 137 RVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCR 196
R C+ S+GK S DL+SCC CG GC+GG G AW++WV G+ SGG S+QGC
Sbjct: 165 RWCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCH 224
Query: 197 PYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETI 255
PY I + +TP+C KC+ GY+V+ D ++GR+AYSLP +E I
Sbjct: 225 PYPI--------GECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKI 276
Query: 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 315
M EIF +GPV+ + Y D+ YK+GIY+HV G G HA++++GWG E + VK
Sbjct: 277 MEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVE-------NGVK 329
Query: 316 YWLVANSFNTNWGENGLFRI 335
YWLVANS+ WGENG F++
Sbjct: 330 YWLVANSWGREWGENGFFKM 349
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 80/179 (44%), Positives = 106/179 (59%), Gaps = 23/179 (12%)
Query: 327 WGENGLF-------RIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YE 378
W E GL R GC PY I + +TP+C KC+ GY+V+
Sbjct: 206 WVEKGLSSGGPLNSRQGCHPYPI--------GECRIPGEDEDTPKCSNKCRSGYNVTDVW 257
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D ++GR+AYSLP +E IM EIF +GPV+ + Y D+ YK+GIY+HV G G HA+
Sbjct: 258 QDRHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAV 317
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+++GWG E + VKYWLVANS+ WGENG F++VRG+N CGIE +I AGLP
Sbjct: 318 KLLGWGVE-------NGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLPNF 369
>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
Length = 398
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 117/252 (46%), Positives = 158/252 (62%), Gaps = 10/252 (3%)
Query: 87 LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKR 146
L + D ++PE FD+R NWP C +I+ IRDQ SCGS WA GAVEAMSDR+CIAS G+
Sbjct: 110 LSKTKDLDMDIPESFDSRENWPKCESIKAIRDQSSCGSCWAFGAVEAMSDRICIASHGEL 169
Query: 147 HVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERY 205
V LS+DDL+SCC+ CG GC GG AW+YWV GIV+G + + GC+PY PCE +
Sbjct: 170 QVSLSADDLLSCCRSCGFGCNGGDPLAAWRYWVKDGIVTGSNFTANSGCKPYPFPPCEHH 229
Query: 206 MNGSH-SSCQDNEPNTPECIRKCQPGY-DVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
+H C + TP+C ++C Y D +Y +D +G AY + + E I +E+ HG
Sbjct: 230 SKKTHFDPCPHDLYPTPKCEKRCNAEYTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHG 289
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
P+E + +Y D + Y G+Y H G G HA+++IGWG E + YW VANS+
Sbjct: 290 PLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIE-------DGIPYWTVANSW 342
Query: 324 NTNWGENGLFRI 335
NT+WGE+G FRI
Sbjct: 343 NTDWGEDGFFRI 354
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 72/186 (38%), Positives = 106/186 (56%), Gaps = 13/186 (6%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPG 372
+YW V + T G N GC+PY P CE + + C + TP+C ++C
Sbjct: 199 RYW-VKDGIVT--GSNFTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKRCNAE 255
Query: 373 Y-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
Y D +Y +D +G AY + + E I +E+ HGP+E + +Y D + Y G+Y H G
Sbjct: 256 YTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGK 315
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
G HA+++IGWG E + YW VANS+NT+WGE+G FRI+RG +ECGIE+ +
Sbjct: 316 LGGGHAVKLIGWGIE-------DGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVV 368
Query: 492 AGLPKI 497
G+PK+
Sbjct: 369 GGIPKL 374
>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
Length = 375
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 110/269 (40%), Positives = 161/269 (59%), Gaps = 18/269 (6%)
Query: 68 EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWA 127
+ R GV D + + +LPL L + LP FDAR W YCP++ +R+QG C S +A
Sbjct: 103 QYRTGVLSDESM-KFQLPLGFVLKKDEQPLPMSFDARQKWSYCPSMNMVRNQGCCDSSYA 161
Query: 128 LGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGG 187
+ AV M+DR C+ S GK + D++SCC CG GC GG W YWV GI SGG
Sbjct: 162 VAAVSTMTDRWCVHSEGKAQFNFGAYDVLSCCHRCGFGCDGGVPSAVWHYWVENGITSGG 221
Query: 188 TYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
+ S +GC+ Y C++ + +TP C+R CQPGY+V+Y +D ++GR+AY
Sbjct: 222 AFGSHEGCQSYPFDVCKK---------SGDSNDTPRCLRFCQPGYNVTYPEDKHYGRVAY 272
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 306
++P +EE IM E+F GP + + T+Y D + YK+G+Y+H G +G H+++++GWG E
Sbjct: 273 TVPKDEERIMYEVFNFGPAQATFTMYTDFVQYKSGVYRHTFGVRVGTHSVKVMGWGVE-- 330
Query: 307 GEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ VKYWL ANS+ WG+ G F+I
Sbjct: 331 -----NDVKYWLCANSWGAQWGDGGFFKI 354
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 70/170 (41%), Positives = 108/170 (63%), Gaps = 10/170 (5%)
Query: 327 WGENGLFRIGCRPYEIPCERY-MNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGR 385
W ENG+ G C+ Y + + S +N+ TP C+R CQPGY+V+Y +D ++GR
Sbjct: 212 WVENGITSGGAFGSHEGCQSYPFDVCKKSGDSND--TPRCLRFCQPGYNVTYPEDKHYGR 269
Query: 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 445
+AY++P +EE IM E+F GP + + T+Y D + YK+G+Y+H G +G H+++++GWG
Sbjct: 270 VAYTVPKDEERIMYEVFNFGPAQATFTMYTDFVQYKSGVYRHTFGVRVGTHSVKVMGWGV 329
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
E + VKYWL ANS+ WG+ G F+IVRG++ E ++ AGLP
Sbjct: 330 E-------NDVKYWLCANSWGAQWGDGGFFKIVRGEDHLSFETNVVAGLP 372
>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 117/256 (45%), Positives = 159/256 (62%), Gaps = 8/256 (3%)
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
QNR P + D +++PE FDAR +WP C +I+ IRDQ +CGS WA+ A+SDR+CI
Sbjct: 78 QNRKPAVENEDDEGDDIPESFDARTHWPNCTSIRHIRDQANCGSCWAVSTASALSDRICI 137
Query: 141 ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI 200
S G+ + +SS D VSCC+ CG GC GG+ A+ ++ G V+GG Y SK GCRPY
Sbjct: 138 ESNGETQMHISSIDFVSCCESCGYGCDGGWPILAFDFYTYEGAVTGGDYGSKDGCRPYPF 197
Query: 201 -PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 259
PC + N ++ TP+C R+CQ Y +Y D ++G AY +P + + I REI
Sbjct: 198 HPCGHHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREI 257
Query: 260 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 319
++GPV G+ T+Y D YK GIYKH AG G HAI+IIGWG E + V YWL+
Sbjct: 258 MKNGPVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVE-------NDVPYWLI 310
Query: 320 ANSFNTNWGENGLFRI 335
ANS++ +WGE G FR+
Sbjct: 311 ANSWHNDWGEEGYFRM 326
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 98/159 (61%), Gaps = 8/159 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY PC + N + TP+C R+CQ Y +Y D ++G AY +P +
Sbjct: 191 GCRPYPFHPCGHHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSV 250
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I REI ++GPV G+ T+Y D YK GIYKH AG G HAI+IIGWG E +
Sbjct: 251 KAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVE-------N 303
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
V YWL+ANS++ +WGE G FR++RG NECGIE ++ AG
Sbjct: 304 DVPYWLIANSWHNDWGEEGYFRMIRGINECGIEQEVVAG 342
>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
Length = 345
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 120/255 (47%), Positives = 159/255 (62%), Gaps = 14/255 (5%)
Query: 88 VQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRH 147
+ ++ + +P+ FDAR WP C +I IRDQ CGS WA A EA+SDR CIAS G +
Sbjct: 74 IVATEVFDAIPDHFDARDQWPSCVSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVN 133
Query: 148 VRLSSDDLVSCCK---DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCE 203
LSS DL+SCC CGNGC+GG+ +AWK+WV G+V+GG+Y S+ GC+PY I PC
Sbjct: 134 TLLSSQDLLSCCTGLLSCGNGCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCG 193
Query: 204 RYMNG-SHSSCQDNEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPANEETIMREIF 260
+ +NG + C D+ TP+C+ C Y Y D +FG AY++ E I EI
Sbjct: 194 QTVNGVTWPKCPDDTEPTPKCVEACTSNNTYPTPYLQDKHFGATAYAVGKKVEQIQTEIL 253
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++GPVE + T+Y D Y TG+Y H +G LG HA++I+GWG + GT YWLVA
Sbjct: 254 KNGPVEVAFTVYEDFYQYTTGVYVHTSGASLGGHAVKILGWG---VDNGT----PYWLVA 306
Query: 321 NSFNTNWGENGLFRI 335
NS+N NWGE G FRI
Sbjct: 307 NSWNVNWGEKGYFRI 321
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/182 (42%), Positives = 105/182 (57%), Gaps = 18/182 (9%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNG-SRSSCQANEPNTPECIRKCQPG--YDV 375
W ++GL + GC+PY I PC + +NG + C + TP+C+ C Y
Sbjct: 167 WVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPDDTEPTPKCVEACTSNNTYPT 226
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
Y D +FG AY++ E I EI ++GPVE + T+Y D Y TG+Y H +G LG
Sbjct: 227 PYLQDKHFGATAYAVGKKVEQIQTEILKNGPVEVAFTVYEDFYQYTTGVYVHTSGASLGG 286
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
HA++I+GWG + GT YWLVANS+N NWGE G FRI+RG NECGIE AG+P
Sbjct: 287 HAVKILGWG---VDNGT----PYWLVANSWNVNWGEKGYFRIIRGLNECGIEHSAVAGIP 339
Query: 496 KI 497
+
Sbjct: 340 DL 341
>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
Length = 294
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 113/239 (47%), Positives = 157/239 (65%), Gaps = 3/239 (1%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ + R P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 56 SLDDARILMGARKEDAEMKRKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G++ LS+ DL+SCC+DCG+GCQGGF G AW YWV
Sbjct: 115 CGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGDGCQGGFPGVAWDYWVKR 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+GG+ + GC+PY P CE + G + +C TP+C +KCQ GY YE D +
Sbjct: 175 GIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKH 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
+G +Y++ +NE+ I +EI +GPVE + +Y D + YK+GIY+HV G +G HAIRII
Sbjct: 235 YGEESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRII 293
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 52/127 (40%), Positives = 73/127 (57%), Gaps = 4/127 (3%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE + G +C TP+C +KCQ GY
Sbjct: 170 YWVKRGIVTGGSKEN---HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYK 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
YE D ++G +Y++ +NE+ I +EI +GPVE + +Y D + YK+GIY+HV G +G
Sbjct: 227 TPYEQDKHYGEESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVG 286
Query: 435 EHAIRII 441
HAIRII
Sbjct: 287 GHAIRII 293
>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
Length = 386
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 119/260 (45%), Positives = 161/260 (61%), Gaps = 16/260 (6%)
Query: 77 SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSD 136
+ L + +LPL + +LP+ FDAR WP CP+++EIRDQG CGS WA+ A AM+D
Sbjct: 105 ADLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTD 164
Query: 137 RVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCR 196
R C+ S+GK S DL+SCC CG GC+GG G AW++WV G+ SGG S+QGC
Sbjct: 165 RWCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCH 224
Query: 197 PYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETI 255
PY I + +TP+C KC+ GY+V+ D + GR+AYSLP +E I
Sbjct: 225 PYPI--------GECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHIGRVAYSLPNDERKI 276
Query: 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 315
M EIF +GPV+ + Y D+ YK+GIY+HV G G HA++++GWG E + VK
Sbjct: 277 MEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVE-------NGVK 329
Query: 316 YWLVANSFNTNWGENGLFRI 335
YWLVANS+ WGENG F++
Sbjct: 330 YWLVANSWGREWGENGFFKM 349
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 80/179 (44%), Positives = 105/179 (58%), Gaps = 23/179 (12%)
Query: 327 WGENGLF-------RIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YE 378
W E GL R GC PY I + +TP+C KC+ GY+V+
Sbjct: 206 WVEKGLSSGGPLNSRQGCHPYPI--------GECRIPGEDEDTPKCSNKCRSGYNVTDVW 257
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D + GR+AYSLP +E IM EIF +GPV+ + Y D+ YK+GIY+HV G G HA+
Sbjct: 258 QDRHIGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAV 317
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+++GWG E + VKYWLVANS+ WGENG F++VRG+N CGIE +I AGLP
Sbjct: 318 KLLGWGVE-------NGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLPNF 369
>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
Length = 376
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 118/252 (46%), Positives = 157/252 (62%), Gaps = 10/252 (3%)
Query: 87 LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKR 146
L + D ++PE FD+R NWP C +I+ IRDQ SCGS WA GAVEAMSDR+CIAS G+
Sbjct: 96 LSKTKDLDLDIPESFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGEL 155
Query: 147 HVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERY 205
V LS+DDL+SCC+ CG GC GG AW+YWV GIV+G Y + GC+PY PCE +
Sbjct: 156 QVSLSADDLLSCCRSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHH 215
Query: 206 MNGSH-SSCQDNEPNTPECIRKCQPGY-DVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
+H C + TP+C +KC Y D +Y +D +G AY + + E I +E+ HG
Sbjct: 216 SKKTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGHSAYGVKDDVEAIQKELMTHG 275
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
P+E + +Y D + Y G+Y H G G HA+++IGWG E + YW ANS+
Sbjct: 276 PLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIE-------DGIPYWTCANSW 328
Query: 324 NTNWGENGLFRI 335
NT+WGE+G FRI
Sbjct: 329 NTDWGEDGFFRI 340
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 72/186 (38%), Positives = 105/186 (56%), Gaps = 13/186 (6%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPG 372
+YW V + T G N GC+PY P CE + + C + TP+C +KC
Sbjct: 185 RYW-VKDGIVT--GSNYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIAD 241
Query: 373 Y-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
Y D +Y +D +G AY + + E I +E+ HGP+E + +Y D + Y G+Y H G
Sbjct: 242 YTDKTYSEDKFYGHSAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGK 301
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
G HA+++IGWG E + YW ANS+NT+WGE+G FRI+RG +ECGIE+ +
Sbjct: 302 LGGGHAVKLIGWGIE-------DGIPYWTCANSWNTDWGEDGFFRILRGVDECGIESGVV 354
Query: 492 AGLPKI 497
G+PK+
Sbjct: 355 GGIPKL 360
>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
Length = 387
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 117/252 (46%), Positives = 158/252 (62%), Gaps = 10/252 (3%)
Query: 87 LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKR 146
L + D ++PE FD+R NWP C +I+ IRDQ SCGS WA GAVEAMSDR+CIAS G+
Sbjct: 95 LSKTKDLDMDIPENFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGEL 154
Query: 147 HVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERY 205
V LS+DDL+SCC+ CG GC GG AW+YWV GIV+G Y + GC+PY PCE +
Sbjct: 155 QVSLSADDLLSCCRSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHH 214
Query: 206 MNGSH-SSCQDNEPNTPECIRKCQPGY-DVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
+H C + TP+C +KC Y D +Y +D +G AY + + E I +E+ HG
Sbjct: 215 SKKTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHG 274
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
P+E + +Y D + Y G+Y H G G HA++++GWG E + + YW ANS+
Sbjct: 275 PLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLVGWGIE-------NGIPYWTCANSW 327
Query: 324 NTNWGENGLFRI 335
NT+WGE+G FRI
Sbjct: 328 NTDWGEDGFFRI 339
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 71/186 (38%), Positives = 106/186 (56%), Gaps = 13/186 (6%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPG 372
+YW V + T G N GC+PY P CE + + C + TP+C +KC
Sbjct: 184 RYW-VKDGIVT--GSNYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIAD 240
Query: 373 Y-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
Y D +Y +D +G AY + + E I +E+ HGP+E + +Y D + Y G+Y H G
Sbjct: 241 YTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGK 300
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
G HA++++GWG E + + YW ANS+NT+WGE+G FRI+RG +ECGIE+ +
Sbjct: 301 LGGGHAVKLVGWGIE-------NGIPYWTCANSWNTDWGEDGFFRILRGVDECGIESGVV 353
Query: 492 AGLPKI 497
G+PK+
Sbjct: 354 GGVPKL 359
>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 239 bits (610), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 133/275 (48%), Positives = 180/275 (65%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ + R P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 56 SLDDARILMGARKEDAEMKRKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G++ LS+ DL+SCC+DCG GC+GGF G+AW YWV
Sbjct: 115 CGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYWVKR 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+GG+ + GC+PY P CE G + +C TP+C + CQ GY YE D +
Sbjct: 175 GIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKH 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G Y++ +NE+ I REI +GPVE + +Y D + YK+GIY+HVAG +G HAIRIIG
Sbjct: 235 YGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E +G YWL+ANS+N +WGENGLFR+
Sbjct: 295 WGVE---KGKP----YWLIANSWNEDWGENGLFRM 322
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 79/160 (49%), Positives = 105/160 (65%), Gaps = 8/160 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE G +C TP+C + CQ GY YE D ++G Y++ +NE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I REI +GPVE + +Y D + YK+GIY+HVAG +G HAIRIIGWG E +G
Sbjct: 247 KAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVE---KGKP- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
YWL+ANS+N +WGENGLFR+VRG++EC IE+ + AGL
Sbjct: 303 ---YWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 239 bits (610), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 133/275 (48%), Positives = 180/275 (65%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ + R P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 56 SLDDARILMGARKEDAEMKRKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G++ LS+ DL+SCC+DCG GC+GGF G+AW YWV
Sbjct: 115 CGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYWVKR 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+GG+ + GC+PY P CE G + +C TP+C + CQ GY YE D +
Sbjct: 175 GIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKH 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G Y++ +NE+ I REI +GPVE + +Y D + YK+GIY+HVAG +G HAIRIIG
Sbjct: 235 YGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E +G YWL+ANS+N +WGENGLFR+
Sbjct: 295 WGVE---KGKP----YWLIANSWNEDWGENGLFRM 322
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 79/160 (49%), Positives = 105/160 (65%), Gaps = 8/160 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE G +C TP+C + CQ GY YE D ++G Y++ +NE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I REI +GPVE + +Y D + YK+GIY+HVAG +G HAIRIIGWG E +G
Sbjct: 247 KAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVE---KGKP- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
YWL+ANS+N +WGENGLFR+VRG++EC IE+ + AGL
Sbjct: 303 ---YWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 238 bits (606), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 116/256 (45%), Positives = 158/256 (61%), Gaps = 8/256 (3%)
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
QNR P + D +++PE FDAR +WP C +I+ IRDQ +CGS WA+ A+SDR+CI
Sbjct: 78 QNRKPAVENEDDEGDDIPESFDARTHWPNCTSIRHIRDQANCGSCWAVSTASALSDRICI 137
Query: 141 ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI 200
S G+ + +SS D VSCC+ C GC GG+ A+ ++ G V+GG Y SK GCRPY
Sbjct: 138 ESNGETQMHISSIDFVSCCESCSYGCDGGWPILAFDFYTYEGAVTGGDYGSKDGCRPYPF 197
Query: 201 -PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 259
PC + N ++ TP+C R+CQ Y +Y D ++G AY +P + + I REI
Sbjct: 198 HPCGHHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREI 257
Query: 260 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 319
++GPV G+ T+Y D YK GIYKH AG G HAI+IIGWG E + V YWL+
Sbjct: 258 MKNGPVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVE-------NDVPYWLI 310
Query: 320 ANSFNTNWGENGLFRI 335
ANS++ +WGE G FR+
Sbjct: 311 ANSWHNDWGEEGYFRM 326
Score = 148 bits (373), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 98/159 (61%), Gaps = 8/159 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY PC + N + TP+C R+CQ Y +Y D ++G AY +P +
Sbjct: 191 GCRPYPFHPCGHHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSV 250
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I REI ++GPV G+ T+Y D YK GIYKH AG G HAI+IIGWG E +
Sbjct: 251 KAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVE-------N 303
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
V YWL+ANS++ +WGE G FR++RG NECGIE ++ AG
Sbjct: 304 DVPYWLIANSWHNDWGEEGYFRMIRGINECGIEQEVVAG 342
>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
Length = 342
Score = 238 bits (606), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 132/275 (48%), Positives = 179/275 (65%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ +NR P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 56 SLDDARILMGARKEDAEMKRNRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G++ LS+ DL+SCC+DCG GC+GGF G+AW YWV
Sbjct: 115 CGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYWVKR 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+GG+ + GC+PY P CE G + +C TP+C + CQ GY YE D +
Sbjct: 175 GIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKH 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G Y++ +NE+ I REI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIG
Sbjct: 235 YGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E +G YWL+ANS+N +WGE GLFR+
Sbjct: 295 WGVE---KGKP----YWLIANSWNEDWGEKGLFRM 322
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 77/160 (48%), Positives = 103/160 (64%), Gaps = 8/160 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE G +C TP+C + CQ GY YE D ++G Y++ +NE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I REI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIGWG E +G
Sbjct: 247 KAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVE---KGKP- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
YWL+ANS+N +WGE GLFR+VRG++EC IE+ + AGL
Sbjct: 303 ---YWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339
>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
Length = 333
Score = 238 bits (606), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 114/267 (42%), Positives = 166/267 (62%), Gaps = 12/267 (4%)
Query: 71 MGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGA 130
+GV+ NR+ L + P +LP+ FD R WP C ++ EIRDQ +CGS WA G+
Sbjct: 61 LGVNMAENKAYNRIHLKYKQVQPRNDLPDNFDPRTKWPDCASLNEIRDQANCGSCWAFGS 120
Query: 131 VEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYA 190
EAM+DR+CIA +G H+ S++D+ CCK CG GC GG+ AW+++V TG+VSGG Y
Sbjct: 121 AEAMTDRICIAGKGNIHI--SAEDINDCCKSCGMGCNGGYPAAAWEWYVDTGVVSGGQYG 178
Query: 191 SKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 249
+ +GC PY +P C+ + G + C P TP+C +KC GY SY +D G+ +Y +
Sbjct: 179 TNEGCMPYSLPHCDHHTTGKYQPCPAVVP-TPKCEKKCLTGYPKSYSNDKTRGKKSYGV- 236
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
++IM+E+ +GPV + +Y+D + YKTG+Y+H G G HA++IIG+G E
Sbjct: 237 RGVQSIMQELVDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGGHAVKIIGYGTE----- 291
Query: 310 TSSVVKYWLVANSFNTNWGENGLFRIG 336
S YWLVANS+N +WG+ G F+I
Sbjct: 292 --SGQDYWLVANSWNEDWGDKGFFKIA 316
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 69/161 (42%), Positives = 100/161 (62%), Gaps = 10/161 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C+ + G C A P TP+C +KC GY SY +D G+ +Y +
Sbjct: 182 GCMPYSLPHCDHHTTGKYQPCPAVVP-TPKCEKKCLTGYPKSYSNDKTRGKKSYGV-RGV 239
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
++IM+E+ +GPV + +Y+D + YKTG+Y+H G G HA++IIG+G E S
Sbjct: 240 QSIMQELVDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGGHAVKIIGYGTE-------S 292
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLVANS+N +WG+ G F+I +G++ECGIE+ I AG P
Sbjct: 293 GQDYWLVANSWNEDWGDKGFFKIAKGKDECGIESSIVAGDP 333
>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 331
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 121/285 (42%), Positives = 174/285 (61%), Gaps = 17/285 (5%)
Query: 54 AEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCP- 111
A KN L++ E++ +G + +L + + + + ++P FDAR NW C
Sbjct: 41 AGKNFDENLSIQEIKNLLGAK------KGKLGVAKEFTHSEDIQVPNSFDARENWKECSD 94
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
I + DQ CGS WA+ A AMSDR CIAS+GK V +S+++L+SCC CG GC+GG+
Sbjct: 95 VISTVVDQSDCGSCWAVAAASAMSDRRCIASQGKLKVPVSAENLLSCCDSCGYGCEGGYP 154
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW YW+ TGI +GG Y SKQGC+PY + PCE + G+ C + +TP C KC
Sbjct: 155 TMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEHHTEGNKVQCSTLDYDTPSCKHKCDDS 214
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
++Y+ +L FG + + I +EI +GPVE + +Y+D + YK+G+Y+HVAG
Sbjct: 215 -ALNYKSELTFGSGSVRNFYSVANIQKEILTNGPVEAAFDVYSDFVNYKSGVYQHVAGEY 273
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HA+RI+GWG+E S V YWLVANS+N +WG+ GLF+I
Sbjct: 274 LGGHAVRILGWGEE-------SGVPYWLVANSWNEDWGDKGLFKI 311
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/185 (40%), Positives = 109/185 (58%), Gaps = 16/185 (8%)
Query: 316 YWLVANSFNTNWGENGLF--RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPG 372
YW+ +T GL+ + GC+PY + PCE + G++ C + +TP C KC
Sbjct: 160 YWI-----DTGITTGGLYGSKQGCQPYSLQPCEHHTEGNKVQCSTLDYDTPSCKHKCDDS 214
Query: 373 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 432
++Y+ +L FG + + I +EI +GPVE + +Y+D + YK+G+Y+HVAG
Sbjct: 215 -ALNYKSELTFGSGSVRNFYSVANIQKEILTNGPVEAAFDVYSDFVNYKSGVYQHVAGEY 273
Query: 433 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITA 492
LG HA+RI+GWG+E S V YWLVANS+N +WG+ GLF+I RG NE G E I A
Sbjct: 274 LGGHAVRILGWGEE-------SGVPYWLVANSWNEDWGDKGLFKIRRGNNESGFEDSIVA 326
Query: 493 GLPKI 497
++
Sbjct: 327 APAQV 331
>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
Length = 353
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 123/267 (46%), Positives = 163/267 (61%), Gaps = 17/267 (6%)
Query: 70 RMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALG 129
R+GV+ + + P ++ L + ++ LPE FDAR WP CP+++EIR+QG CGS WA+
Sbjct: 72 RVGVNMEELESKRLKPGILILKEDID-LPEQFDARDKWPQCPSLREIRNQGCCGSCWAIS 130
Query: 130 AVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY 189
A EA +DR CI S S DL+SCC CG+GCQGG G AW YWV G+ SGG Y
Sbjct: 131 AAEAFTDRWCIHSPEHTTFSFGSFDLISCCHSCGDGCQGGVLGPAWDYWVQKGVSSGGPY 190
Query: 190 ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSL 248
SKQGC Y + HS D + + P+C RKCQ Y V D FGR+AYS+
Sbjct: 191 NSKQGCHSYP------FDTCHSP--DEDDDAPKCSRKCQSSYSVQDVSKDRRFGRVAYSV 242
Query: 249 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 308
A+E IM EIF +GPV+ + +Y D YK+G+Y+HV G G HAI+I+GWG E
Sbjct: 243 VADEHRIMEEIFVNGPVQAAFQVYLDFKTYKSGVYRHVTGPLEGGHAIKILGWGVE---N 299
Query: 309 GTSSVVKYWLVANSFNTNWGENGLFRI 335
GT KYWL +NS+ +WG++G F+I
Sbjct: 300 GT----KYWLCSNSWGEDWGDHGFFKI 322
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/138 (49%), Positives = 91/138 (65%), Gaps = 8/138 (5%)
Query: 359 EPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 417
+ + P+C RKCQ Y V D FGR+AYS+ A+E IM EIF +GPV+ + +Y D
Sbjct: 210 DDDAPKCSRKCQSSYSVQDVSKDRRFGRVAYSVVADEHRIMEEIFVNGPVQAAFQVYLDF 269
Query: 418 ILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 477
YK+G+Y+HV G G HAI+I+GWG E GT KYWL +NS+ +WG++G F+I
Sbjct: 270 KTYKSGVYRHVTGPLEGGHAIKILGWGVE---NGT----KYWLCSNSWGEDWGDHGFFKI 322
Query: 478 VRGQNECGIEADITAGLP 495
VRG+N GIE D+ AGLP
Sbjct: 323 VRGENHLGIETDVHAGLP 340
>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
Length = 313
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 117/262 (44%), Positives = 160/262 (61%), Gaps = 9/262 (3%)
Query: 71 MGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGA 130
G ++ + + P + + LP+ FDAR WP+C ++ EIRDQ SCGS WA GA
Sbjct: 60 FGAKMETAEQKAQRPTVKHVGFDDTRLPKNFDARSKWPHCSSVSEIRDQSSCGSCWAFGA 119
Query: 131 VEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYA 190
VEAMSDR+CI S G + LS+ DL+SCCKDCG GC+GG+ AW YW T GIV+GG+
Sbjct: 120 VEAMSDRLCIHSNGSFNKSLSAVDLLSCCKDCGFGCRGGYPAVAWDYWRTHGIVTGGSKE 179
Query: 191 SKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 249
GCR Y P C+ ++ G + C TPEC++ C ++ Y +D I+Y++
Sbjct: 180 DPSGCRSYPFPKCDHHVQGHYPPCPRQIYPTPECVQDCDTP-ELGYLEDKTRANISYNIY 238
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
A+E +IM+EI GPVE T+Y D + YK+ +Y H G P+ HAIRI+GWG+E
Sbjct: 239 ASEISIMKEIMLRGPVEAVFTVYEDFLQYKSRVYFHAWGAPMSGHAIRILGWGEE----- 293
Query: 310 TSSVVKYWLVANSFNTNWGENG 331
V YWL+ANS+N +WGE G
Sbjct: 294 --GDVPYWLIANSWNEDWGEKG 313
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 81/139 (58%), Gaps = 9/139 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCR Y P C+ ++ G C TPEC++ C ++ Y +D I+Y++ A+E
Sbjct: 183 GCRSYPFPKCDHHVQGHYPPCPRQIYPTPECVQDCDTP-ELGYLEDKTRANISYNIYASE 241
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+IM+EI GPVE T+Y D + YK+ +Y H G P+ HAIRI+GWG+E
Sbjct: 242 ISIMKEIMLRGPVEAVFTVYEDFLQYKSRVYFHAWGAPMSGHAIRILGWGEE-------G 294
Query: 455 VVKYWLVANSFNTNWGENG 473
V YWL+ANS+N +WGE G
Sbjct: 295 DVPYWLIANSWNEDWGEKG 313
>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 246
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 108/210 (51%), Positives = 147/210 (70%), Gaps = 8/210 (3%)
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A GA EAMSDR+CI S K V LS++DL+SCC+ CG GC GG+ AW +W G+VSG
Sbjct: 25 AFGASEAMSDRICIHSNAKISVELSAEDLLSCCESCGMGCNGGYPSAAWDFWTKDGLVSG 84
Query: 187 GTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA 245
G Y S GCRPY IP CE ++NGS SC TP+C+ +C+ GY SY+ D ++G+ +
Sbjct: 85 GLYDSHIGCRPYTIPPCEHHVNGSRPSCSGEGGETPQCVYRCEAGYTPSYKQDKHYGKTS 144
Query: 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 305
YS+ ++E+ I EI+++GPVEG+ T+Y D +LYKTG+Y+HV G LG HAI+I+GWG+E
Sbjct: 145 YSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGGHAIKILGWGEE- 203
Query: 306 LGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ + YWL ANS+NT+WG NG F+I
Sbjct: 204 ------NGIPYWLCANSWNTDWGNNGFFKI 227
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 87/178 (48%), Positives = 124/178 (69%), Gaps = 15/178 (8%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W ++GL IGCRPY IP CE ++NGSR SC TP+C+ +C+ GY SY+
Sbjct: 76 WTKDGLVSGGLYDSHIGCRPYTIPPCEHHVNGSRPSCSGEGGETPQCVYRCEAGYTPSYK 135
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D ++G+ +YS+ ++E+ I EI+++GPVEG+ T+Y D +LYKTG+Y+HV G LG HAI
Sbjct: 136 QDKHYGKTSYSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGGHAI 195
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+I+GWG+E + + YWL ANS+NT+WG NG F+I+RG N CGIE++I AG+P
Sbjct: 196 KILGWGEE-------NGIPYWLCANSWNTDWGNNGFFKILRGSNHCGIESEIVAGIPN 246
>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
Length = 335
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 129/304 (42%), Positives = 178/304 (58%), Gaps = 21/304 (6%)
Query: 45 LLPKLPF----YGAEKNALSK------LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPL 94
++PK P Y K +L K +T+ +++ R+ + + P +++
Sbjct: 20 VVPKTPEAITEYVNSKQSLWKAEIPKHITIEQVKKRL-MRTEFVAPHTPDVEVIKHDIQE 78
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ +P+ FDAR WP C +I IRDQ CGS WA A EA SDR CIAS G + LS++D
Sbjct: 79 DTIPDTFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAED 138
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PC-ERYMNGSHSS 212
++SCC +CG GC+GG+ AWKY V +G +GG+Y S+ GC+PY + PC E N +
Sbjct: 139 VLSCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYVSQFGCKPYSLAPCGETVGNTTWPD 198
Query: 213 CQDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
C + NTP C+ KC Y+++Y+DD +FG AY++ I EI HGPVE + T+
Sbjct: 199 CPQDGYNTPSCVNKCTNNNYNIAYKDDKHFGSTAYAVGKKVAQIQAEILAHGPVEAAFTV 258
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
Y D YK+G+Y H G LG HAIRI+GW GT + YWLVANS+N NWGENG
Sbjct: 259 YEDFYQYKSGVYVHTTGQELGGHAIRILGW-------GTDNGTPYWLVANSWNVNWGENG 311
Query: 332 LFRI 335
FRI
Sbjct: 312 YFRI 315
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 84/184 (45%), Positives = 112/184 (60%), Gaps = 12/184 (6%)
Query: 317 WLVANSFNTNWGENGLFRIGCRPYEI-PC-ERYMNGSRSSCQANEPNTPECIRKC-QPGY 373
+LV + F T G + + + GC+PY + PC E N + C + NTP C+ KC Y
Sbjct: 161 YLVKSGFCT--GGSYVSQFGCKPYSLAPCGETVGNTTWPDCPQDGYNTPSCVNKCTNNNY 218
Query: 374 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 433
+++Y+DD +FG AY++ I EI HGPVE + T+Y D YK+G+Y H G L
Sbjct: 219 NIAYKDDKHFGSTAYAVGKKVAQIQAEILAHGPVEAAFTVYEDFYQYKSGVYVHTTGQEL 278
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
G HAIRI+GW GT + YWLVANS+N NWGENG FRI+RG NECGIE + G
Sbjct: 279 GGHAIRILGW-------GTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGG 331
Query: 494 LPKI 497
+PK+
Sbjct: 332 VPKV 335
>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
Length = 335
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 127/317 (40%), Positives = 188/317 (59%), Gaps = 27/317 (8%)
Query: 34 SKAFDRVDHSILLPKLPFYGAEKNALSKL-------TLSELEMRMGV-HPDSKLPQNRLP 85
S F +++LP+ + N+ KL T E++ M V H + L ++
Sbjct: 7 SLLFILAASAVVLPRNKLFINHINSAQKLWTAEHYTTPFEVKNLMKVEHVAAHLDKD--- 63
Query: 86 LLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGK 145
++L++ + +P+ +D R +WP C ++ IRDQ CGS WA+ A EA+SDR CIAS G
Sbjct: 64 --IKLAETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGD 121
Query: 146 RHVRLSSDDLVSCCK---DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
+ LS++D+++CC +CG+GC+GG+ +AW+YWV G+V+GG++ S+ GC+PY I P
Sbjct: 122 VNTLLSAEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAP 181
Query: 202 CERYMNG-SHSSCQDNEPNTPECIRKC--QPGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
C ++G + C +TP+C C Y + Y+ D +FG AY++ + + I E
Sbjct: 182 CGETIDGVTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFGASAYAIGRSAKQIQTE 241
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWL 318
I HGPVE +Y D LYKTGIY HVAGG LG HA++++GWG + GT YWL
Sbjct: 242 ILAHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGGHAVKMLGWG---VDNGT----PYWL 294
Query: 319 VANSFNTNWGENGLFRI 335
ANS+NT WGE G FRI
Sbjct: 295 AANSWNTVWGEKGYFRI 311
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/182 (43%), Positives = 107/182 (58%), Gaps = 18/182 (9%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNG-SRSSCQANEPNTPECIRKC--QPGYDV 375
W +NGL + GC+PY I PC ++G + C +TP+C C Y +
Sbjct: 157 WVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPKCEHHCTGNNSYPI 216
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
Y+ D +FG AY++ + + I EI HGPVE +Y D LYKTGIY HVAGG LG
Sbjct: 217 PYDQDKHFGASAYAIGRSAKQIQTEILAHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGG 276
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
HA++++GWG + GT YWL ANS+NT WGE G FRI+RG +ECGIE+ AG+P
Sbjct: 277 HAVKMLGWG---VDNGT----PYWLAANSWNTVWGEKGYFRILRGVDECGIESAAVAGMP 329
Query: 496 KI 497
+
Sbjct: 330 DL 331
>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
Length = 344
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 118/248 (47%), Positives = 157/248 (63%), Gaps = 14/248 (5%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ +P+ FDAR WP C +I IRDQ CGS WA A EA+SDR CIAS G + LSS+D
Sbjct: 80 DAIPDHFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSED 139
Query: 155 LVSCCK---DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNG-S 209
L+SCC CGNGC+GG+ +AWK+W G+V+GG+Y S+ GC+PY I PC + +NG +
Sbjct: 140 LLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVT 199
Query: 210 HSSCQDNEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEG 267
C ++ TP+C+ C Y +Y D +FG AY++ E I EI ++GP+E
Sbjct: 200 WPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEV 259
Query: 268 SMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW 327
+ T+Y D Y TG+Y H AG LG HA++I+GWG + GT YWLVANS+N NW
Sbjct: 260 AFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWG---VDNGT----PYWLVANSWNINW 312
Query: 328 GENGLFRI 335
GE G FRI
Sbjct: 313 GEKGYFRI 320
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 107/182 (58%), Gaps = 18/182 (9%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNG-SRSSCQANEPNTPECIRKCQPG--YDV 375
WG++GL + GC+PY I PC + +NG + C + TP+C+ C Y
Sbjct: 166 WGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPT 225
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
+Y D +FG AY++ E I EI ++GP+E + T+Y D Y TG+Y H AG LG
Sbjct: 226 AYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGG 285
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
HA++I+GWG + GT YWLVANS+N NWGE G FRI+RG NECGIE AG+P
Sbjct: 286 HAVKILGWG---VDNGT----PYWLVANSWNINWGEKGYFRIIRGLNECGIEHSAVAGIP 338
Query: 496 KI 497
+
Sbjct: 339 DL 340
>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 119/250 (47%), Positives = 161/250 (64%), Gaps = 15/250 (6%)
Query: 92 DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLS 151
D EE+PE FDAR NWP C ++++IRDQ SCGS WA GAVEAMSDR+CI S V +S
Sbjct: 79 DESEEVPESFDARENWPRCDSLKQIRDQSSCGSCWAFGAVEAMSDRICIHSDQSNQVYVS 138
Query: 152 SDDLVSCC---KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMN 207
++DL SCC CG GC GG+ + W YW T GIV+GG Y S QGC+ Y + PCE ++
Sbjct: 139 AEDLNSCCFGLFACGLGCDGGYVAEPWDYWRTDGIVTGGAYNSSQGCKDYSLEPCEHHVE 198
Query: 208 -GSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVE 266
GS C +TPEC+R C + Y + L FG+ + NE+ + EI ++GP+E
Sbjct: 199 VGSRPQCSSLNFDTPECVRSCYES-SLDYTESLTFGQQVSTF-TNEKQMQLEILKNGPIE 256
Query: 267 GSMTIYADMILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 325
+ T+Y D + YK+G+Y+ A +G HAI+++GWG E EGT KYWL+ANS+NT
Sbjct: 257 AAFTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGVE---EGT----KYWLIANSWNT 309
Query: 326 NWGENGLFRI 335
+WG+NG F+
Sbjct: 310 DWGDNGYFKF 319
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 69/163 (42%), Positives = 104/163 (63%), Gaps = 12/163 (7%)
Query: 336 GCRPYEI-PCERYMN-GSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC+ Y + PCE ++ GSR C + +TPEC+R C + Y + L FG+ + N
Sbjct: 184 GCKDYSLEPCEHHVEVGSRPQCSSLNFDTPECVRSCYES-SLDYTESLTFGQQVSTF-TN 241
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGT 452
E+ + EI ++GP+E + T+Y D + YK+G+Y+ A +G HAI+++GWG E EGT
Sbjct: 242 EKQMQLEILKNGPIEAAFTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGVE---EGT 298
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
KYWL+ANS+NT+WG+NG F+ +RG + CGIE++ A LP
Sbjct: 299 ----KYWLIANSWNTDWGDNGYFKFLRGVDHCGIESETAASLP 337
>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
Length = 344
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 118/248 (47%), Positives = 157/248 (63%), Gaps = 14/248 (5%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ +P+ FDAR WP C +I IRDQ CGS WA A EA+SDR CIAS G + LSS+D
Sbjct: 80 DAIPDRFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSED 139
Query: 155 LVSCCK---DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNG-S 209
L+SCC CGNGC+GG+ +AWK+W G+V+GG+Y S+ GC+PY I PC + +NG +
Sbjct: 140 LLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVT 199
Query: 210 HSSCQDNEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEG 267
C ++ TP+C+ C Y +Y D +FG AY++ E I EI ++GP+E
Sbjct: 200 WPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEV 259
Query: 268 SMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW 327
+ T+Y D Y TG+Y H AG LG HA++I+GWG + GT YWLVANS+N NW
Sbjct: 260 AFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWG---VDNGT----PYWLVANSWNINW 312
Query: 328 GENGLFRI 335
GE G FRI
Sbjct: 313 GEKGYFRI 320
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 107/182 (58%), Gaps = 18/182 (9%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNG-SRSSCQANEPNTPECIRKCQPG--YDV 375
WG++GL + GC+PY I PC + +NG + C + TP+C+ C Y
Sbjct: 166 WGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPT 225
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
+Y D +FG AY++ E I EI ++GP+E + T+Y D Y TG+Y H AG LG
Sbjct: 226 AYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGG 285
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
HA++I+GWG + GT YWLVANS+N NWGE G FRI+RG NECGIE AG+P
Sbjct: 286 HAVKILGWG---VDNGT----PYWLVANSWNINWGEKGYFRIIRGLNECGIEHSAVAGIP 338
Query: 496 KI 497
+
Sbjct: 339 DL 340
>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
Length = 335
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 129/304 (42%), Positives = 178/304 (58%), Gaps = 21/304 (6%)
Query: 45 LLPKLPF----YGAEKNALSK------LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPL 94
++PK P Y K +L K +T+ +++ R+ + + P + V+
Sbjct: 20 IVPKTPEAITEYVNSKQSLWKAEIPKHITIEQVKKRL-MRTEFVAPHSPDAEFVKHDIQE 78
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ +P FDAR WP C +I IRDQ CGS WA A EA SDR CIAS G + LS++D
Sbjct: 79 DTIPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAED 138
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PC-ERYMNGSHSS 212
++SCC +CG GC+GG+ AWKY V +G +GG+Y ++ GC+PY + PC E N + +
Sbjct: 139 VLSCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPA 198
Query: 213 CQDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
C + +TP C+ KC Y+V+Y+DD +FG AY++ I EI HGPVE + T+
Sbjct: 199 CPTDGYDTPACVNKCTNSNYNVAYKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTV 258
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
Y D YK+G+Y H G LG HAIRI+GW GT + YWLVANS+N NWGENG
Sbjct: 259 YEDFYQYKSGVYVHTTGEELGGHAIRILGW-------GTDNGTPYWLVANSWNVNWGENG 311
Query: 332 LFRI 335
FRI
Sbjct: 312 YFRI 315
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 84/184 (45%), Positives = 110/184 (59%), Gaps = 12/184 (6%)
Query: 317 WLVANSFNTNWGENGLFRIGCRPYEI-PC-ERYMNGSRSSCQANEPNTPECIRKC-QPGY 373
+LV + F T F GC+PY + PC E N + +C + +TP C+ KC Y
Sbjct: 161 YLVKSGFCTGGSYEAQF--GCKPYSLAPCGETVGNTTWPACPTDGYDTPACVNKCTNSNY 218
Query: 374 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 433
+V+Y+DD +FG AY++ I EI HGPVE + T+Y D YK+G+Y H G L
Sbjct: 219 NVAYKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGEEL 278
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
G HAIRI+GW GT + YWLVANS+N NWGENG FRI+RG NECGIE + G
Sbjct: 279 GGHAIRILGW-------GTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGG 331
Query: 494 LPKI 497
+PK+
Sbjct: 332 VPKV 335
>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
Full=Cysteine protease-related 4; Flags: Precursor
gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
Length = 335
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 124/277 (44%), Positives = 167/277 (60%), Gaps = 11/277 (3%)
Query: 62 LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+T+ +++ R+ + + P +V+ + +P FDAR WP C +I IRDQ
Sbjct: 47 ITIEQVKKRL-MRTEFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIRDQSD 105
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA A EA SDR CIAS G + LS++D++SCC +CG GC+GG+ AWKY V +
Sbjct: 106 CGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAWKYLVKS 165
Query: 182 GIVSGGTYASKQGCRPYEI-PC-ERYMNGSHSSCQDNEPNTPECIRKC-QPGYDVSYEDD 238
G +GG+Y ++ GC+PY + PC E N + SC D+ +TP C+ KC Y+V+Y D
Sbjct: 166 GFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTAD 225
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
+FG AY++ I EI HGPVE + T+Y D YKTG+Y H G LG HAIRI
Sbjct: 226 KHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRI 285
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GW GT + YWLVANS+N NWGENG FRI
Sbjct: 286 LGW-------GTDNGTPYWLVANSWNVNWGENGYFRI 315
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 85/184 (46%), Positives = 110/184 (59%), Gaps = 12/184 (6%)
Query: 317 WLVANSFNTNWGENGLFRIGCRPYEI-PC-ERYMNGSRSSCQANEPNTPECIRKC-QPGY 373
+LV + F T G + + GC+PY + PC E N + SC + +TP C+ KC Y
Sbjct: 161 YLVKSGFCT--GGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNY 218
Query: 374 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 433
+V+Y D +FG AY++ I EI HGPVE + T+Y D YKTG+Y H G L
Sbjct: 219 NVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQEL 278
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
G HAIRI+GW GT + YWLVANS+N NWGENG FRI+RG NECGIE + G
Sbjct: 279 GGHAIRILGW-------GTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGG 331
Query: 494 LPKI 497
+PK+
Sbjct: 332 VPKV 335
>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
Length = 339
Score = 234 bits (597), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 117/264 (44%), Positives = 157/264 (59%), Gaps = 21/264 (7%)
Query: 76 DSKLPQNRLPLLVQLSDPLEE--LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEA 133
D LP RL + L+E +P+ FDAR WP+C +I +R+QG+CGS WA+ V
Sbjct: 73 DFTLPSKRLH-----ASSLDEVVIPDRFDAREKWPFCQSIHSVRNQGTCGSCWAVATVSV 127
Query: 134 MSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF-HGKAWKYWVTTGIVSGGTYASK 192
MSDR+CI S G+ ++ L+++DL+ CCKDCGNGC GGF G A++YWV G+VSG Y S
Sbjct: 128 MSDRLCIHSDGEVNLELATEDLMGCCKDCGNGCNGGFLDGTAFQYWVDAGLVSGAPYNSS 187
Query: 193 QGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 251
+GC+PY PC G H +E P+C+ C GYD Y D FG AY +P +
Sbjct: 188 EGCKPYPFEPCSYPFVGCH-----HEKKNPKCLHHCINGYDRKYRKDKFFGATAYKIPND 242
Query: 252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 311
I EI +GPV ++ D Y +G+YKHV G +G HAIRI+GWG E GT
Sbjct: 243 ARMIQLEIMTNGPVATGFEVFEDFYFYHSGVYKHVVGKKVGMHAIRIVGWGTE---NGTP 299
Query: 312 SVVKYWLVANSFNTNWGENGLFRI 335
YWL+ANS+ WG+ G F++
Sbjct: 300 ----YWLIANSYGDTWGDKGFFKM 319
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 67/163 (41%), Positives = 90/163 (55%), Gaps = 13/163 (7%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY PC G +E P+C+ C GYD Y D FG AY +P +
Sbjct: 189 GCKPYPFEPCSYPFVGCH-----HEKKNPKCLHHCINGYDRKYRKDKFFGATAYKIPNDA 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPV ++ D Y +G+YKHV G +G HAIRI+GWG E GT
Sbjct: 244 RMIQLEIMTNGPVATGFEVFEDFYFYHSGVYKHVVGKKVGMHAIRIVGWGTE---NGTP- 299
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+ WG+ G F+++RG N GIE+ + AGLP++
Sbjct: 300 ---YWLIANSYGDTWGDKGFFKMLRGSNHLGIESTVIAGLPQL 339
>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
Length = 335
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 123/277 (44%), Positives = 166/277 (59%), Gaps = 11/277 (3%)
Query: 62 LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
L++ +++ R+ + + P +V+ + +P FDAR WP C +I IRDQ
Sbjct: 47 LSIEQVKKRL-MRTEFVAPHTPDVEVVEHDIQEDTIPATFDARTQWPNCVSINNIRDQSD 105
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA A EA SDR CIAS G + LS++D++SCC +CG GC GG+ AWKY V +
Sbjct: 106 CGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCDGGYPINAWKYLVKS 165
Query: 182 GIVSGGTYASKQGCRPYEI-PC-ERYMNGSHSSCQDNEPNTPECIRKC-QPGYDVSYEDD 238
G +GG+Y ++ GC+PY + PC E N + C D+ NTP C+ KC Y+ +Y+DD
Sbjct: 166 GFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPDCPDDGYNTPACVNKCTNTKYNTAYKDD 225
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
+FG AY++ I EI HGPVE + T+Y D YK+G+Y H G LG HAIRI
Sbjct: 226 KHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRI 285
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GW GT + YWLVANS+N NWGENG FRI
Sbjct: 286 LGW-------GTDNGTPYWLVANSWNVNWGENGYFRI 315
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 84/184 (45%), Positives = 110/184 (59%), Gaps = 12/184 (6%)
Query: 317 WLVANSFNTNWGENGLFRIGCRPYEI-PC-ERYMNGSRSSCQANEPNTPECIRKC-QPGY 373
+LV + F T G + + GC+PY + PC E N + C + NTP C+ KC Y
Sbjct: 161 YLVKSGFCT--GGSYEAQFGCKPYSLAPCGETVGNVTWPDCPDDGYNTPACVNKCTNTKY 218
Query: 374 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 433
+ +Y+DD +FG AY++ I EI HGPVE + T+Y D YK+G+Y H G L
Sbjct: 219 NTAYKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGQEL 278
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
G HAIRI+GW GT + YWLVANS+N NWGENG FRI+RG NECGIE + G
Sbjct: 279 GGHAIRILGW-------GTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGG 331
Query: 494 LPKI 497
+PK+
Sbjct: 332 VPKV 335
>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 129/275 (46%), Positives = 179/275 (65%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ + R P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 56 SLDDARILMGARKEDAEMKRKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G++ LS+ DL+SCC+DCG GC+GGF G+AW YWV
Sbjct: 115 CGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYWVKR 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+GG+ + GC+PY P CE G + +C TP+C + CQ GY Y+ D +
Sbjct: 175 GIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYKQDKH 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G +Y++ +NE+ I +EI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIG
Sbjct: 235 YGDESYNVISNEKAIQKEIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E +G YWL+ANS+N +WGE GLFR+
Sbjct: 295 WGVE---KGKP----YWLIANSWNEDWGEKGLFRM 322
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 75/160 (46%), Positives = 104/160 (65%), Gaps = 8/160 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE G +C TP+C + CQ GY Y+ D ++G +Y++ +NE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYKQDKHYGDESYNVISNE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +EI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIGWG E +G
Sbjct: 247 KAIQKEIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVE---KGKP- 302
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
YWL+ANS+N +WGE GLFR+VRG++EC IE+ + AGL
Sbjct: 303 ---YWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339
>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 116/242 (47%), Positives = 159/242 (65%), Gaps = 9/242 (3%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++PE FDAR WP C +++ IRDQ +CGS WA+ A+SDR+CIAS G++ V +S+ D+
Sbjct: 1 DIPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDI 60
Query: 156 VSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC 213
+SCC + CG GC GG+ +A+ Y+ G V+GG Y + GCRPY PC + ++
Sbjct: 61 LSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGE 120
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
NE TP+C+RKCQ Y SY+ D + G+ AY +P +E+ I REI ++GPV G+ T+Y
Sbjct: 121 CPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYE 180
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D YK GIYKH AG G HAI+IIGWG+E V YWL+ANS++ +WGENG F
Sbjct: 181 DFSYYKKGIYKHTAGKARGGHAIKIIGWGKE-------GGVPYWLIANSWHNDWGENGYF 233
Query: 334 RI 335
RI
Sbjct: 234 RI 235
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/159 (49%), Positives = 102/159 (64%), Gaps = 8/159 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY PC + + NE TP+C+RKCQ Y SY+ D + G+ AY +P +E
Sbjct: 100 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSE 159
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I REI ++GPV G+ T+Y D YK GIYKH AG G HAI+IIGWG+E
Sbjct: 160 KAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE-------G 212
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
V YWL+ANS++ +WGENG FRI+RG N CGIE ++ AG
Sbjct: 213 GVPYWLIANSWHNDWGENGYFRILRGSNHCGIEENVVAG 251
>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 118/263 (44%), Positives = 164/263 (62%), Gaps = 10/263 (3%)
Query: 76 DSKLPQNRLPLLVQLS-DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAM 134
D K +L +V+ +P E++PE +D R + C T IRDQ +CGS WA+ A+
Sbjct: 64 DIKFKNQKLNFVVKNDPEPNEDIPEEYDPREKFK-CSTFY-IRDQANCGSCWAVSTAAAI 121
Query: 135 SDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQ 193
SDR+CIA+ G++ V +SS D+++CC CG GC GG+ +AW+Y+V G+VSGG Y +K
Sbjct: 122 SDRICIATNGEKQVNISSTDILTCCNPQCGFGCGGGWSIRAWEYFVYEGVVSGGEYLTKG 181
Query: 194 GCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
CRPY I PC + N ++ E TP C +KCQPGY + D G++AY + E
Sbjct: 182 VCRPYPIHPCGHHGNDTYYGECPREAATPPCKKKCQPGYKKIFRMDKRQGKVAYGVEPKE 241
Query: 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 312
E I REI RHGPV S +Y D LYKTG+YKH AG G HA++++GWG + + +
Sbjct: 242 EAIQREILRHGPVVASFAVYEDFSLYKTGVYKHTAGALRGYHAVKMMGWGVD-----SKT 296
Query: 313 VVKYWLVANSFNTNWGENGLFRI 335
KYWL+ANS++ +WGENG FR
Sbjct: 297 KAKYWLIANSWHNDWGENGYFRF 319
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 76/168 (45%), Positives = 103/168 (61%), Gaps = 10/168 (5%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
CRPY I PC + N + E TP C +KCQPGY + D G++AY + EE
Sbjct: 183 CRPYPIHPCGHHGNDTYYGECPREAATPPCKKKCQPGYKKIFRMDKRQGKVAYGVEPKEE 242
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I REI RHGPV S +Y D LYKTG+YKH AG G HA++++GWG + + +
Sbjct: 243 AIQREILRHGPVVASFAVYEDFSLYKTGVYKHTAGALRGYHAVKMMGWGVD-----SKTK 297
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDS 503
KYWL+ANS++ +WGENG FR +RG N+C IE + AG+ +++DS
Sbjct: 298 AKYWLIANSWHNDWGENGYFRFIRGINDCEIEDTVAAGI----VDVDS 341
>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 341
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 116/244 (47%), Positives = 164/244 (67%), Gaps = 15/244 (6%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FDAR WP CPTI +RDQG+CGS WA GAVEAMSDR CI+ K V +S+++L+
Sbjct: 86 IPDTFDARQKWPDCPTIGTVRDQGACGSCWAFGAVEAMSDRYCISF--KEQVNISAENLL 143
Query: 157 SCCKDCGNGCQGGFHGKAWKYW----VTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHS 211
SCC+ CG+GC GG+ AW++W + GIV+GG Y S GC+PY IP C+ + G +
Sbjct: 144 SCCETCGSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKCDHHEPGPYE 203
Query: 212 SCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
+C ++ +TP C R C YD SY D ++G+ +YS+ ++ +I EI +GPVEG+ ++
Sbjct: 204 NCSGSQ-STPSCKRSCISSYDKSYRSDKHYGKNSYSISSDVSSIQTEIMTNGPVEGAFSV 262
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
YAD Y +G+Y+H G LG HAI+I+GWG E + V YWLVANS+N +WG++G
Sbjct: 263 YADFPTYTSGVYQHTTGSFLGGHAIKILGWGTE-------NGVPYWLVANSWNPSWGDSG 315
Query: 332 LFRI 335
F+I
Sbjct: 316 FFKI 319
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 109/162 (67%), Gaps = 9/162 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY IP C+ + G +C ++ +TP C R C YD SY D ++G+ +YS+ ++
Sbjct: 185 GCQPYTIPKCDHHEPGPYENCSGSQ-STPSCKRSCISSYDKSYRSDKHYGKNSYSISSDV 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+I EI +GPVEG+ ++YAD Y +G+Y+H G LG HAI+I+GWG E +
Sbjct: 244 SSIQTEIMTNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGGHAIKILGWGTE-------N 296
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWLVANS+N +WG++G F+I+RG++ECGIE+ I AG+P+
Sbjct: 297 GVPYWLVANSWNPSWGDSGFFKIIRGKDECGIESSIVAGMPE 338
>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
Full=Cysteine protease-related 5; Flags: Precursor
gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
Length = 344
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 117/248 (47%), Positives = 155/248 (62%), Gaps = 14/248 (5%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ +P+ FDAR WP C +I IRDQ CGS WA A EA+SDR CIAS G + LSS+D
Sbjct: 80 DAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSED 139
Query: 155 LVSCCK---DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH 210
L+SCC CGNGC+GG+ +AWK+WV G+V+GG+Y ++ GC+PY I PC +NG
Sbjct: 140 LLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVK 199
Query: 211 -SSCQDNEPNTPECIRKC--QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEG 267
+C ++ TP+C+ C + Y Y D +FG AY++ E I EI +GP+E
Sbjct: 200 WPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEV 259
Query: 268 SMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW 327
+ T+Y D Y TG+Y H AG LG HA++I+GWG + GT YWLVANS+N W
Sbjct: 260 AFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWG---VDNGT----PYWLVANSWNVAW 312
Query: 328 GENGLFRI 335
GE G FRI
Sbjct: 313 GEKGYFRI 320
Score = 145 bits (365), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 77/183 (42%), Positives = 104/183 (56%), Gaps = 18/183 (9%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKC--QPGYDV 375
W ++GL + GC+PY I PC +NG + +C + TP+C+ C + Y
Sbjct: 166 WVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYAT 225
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
Y D +FG AY++ E I EI +GP+E + T+Y D Y TG+Y H AG LG
Sbjct: 226 PYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGG 285
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
HA++I+GWG + GT YWLVANS+N WGE G FRI+RG NECGIE AG+P
Sbjct: 286 HAVKILGWG---VDNGT----PYWLVANSWNVAWGEKGYFRIIRGLNECGIEHSAVAGIP 338
Query: 496 KIG 498
+
Sbjct: 339 DLA 341
>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 121/288 (42%), Positives = 176/288 (61%), Gaps = 17/288 (5%)
Query: 52 YGAEKNALSKLTLSELEMRMGV--HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ A +N + TL ++ +G + D PQ + P D + ++P FD+R NW
Sbjct: 52 WTAGENFHEQTTLEDVRSWLGAWSNKDYDWPQ-KYPH----DDLVGDIPATFDSRSNWSD 106
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
C I +IRDQG CGS WA GA EA+SDR+CIAS+G V +++D++SCC CGNGC GG
Sbjct: 107 CSVIGKIRDQGGCGSCWAFGAAEAISDRICIASKGATDVMYAAEDVLSCCLTCGNGCNGG 166
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQ 228
+ A +Y+VT G+V+GG Y +K C+PY + CE ++ G C + TP+C +C
Sbjct: 167 YPLAAMEYFVTRGLVTGGLYGTKDTCQPYTLEACEHHVPGDRPPCTEG-GGTPKCSHQCI 225
Query: 229 PGYDV-SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y +Y+DD G AYS+P + I +EI +GPVE + T+Y+D YK+G+Y+H +
Sbjct: 226 PDYTTKAYKDDKVHGHKAYSVPNDVGKIQQEIMHYGPVEAAFTVYSDFPSYKSGVYRHTS 285
Query: 288 GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G LG HAI+IIGWG E + YWL+ NS+N++WG+ G F+I
Sbjct: 286 GSELGGHAIKIIGWGTEGGDD-------YWLINNSWNSDWGDKGTFKI 326
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 69/158 (43%), Positives = 99/158 (62%), Gaps = 10/158 (6%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDV-SYEDDLNFGRIAYSLPANE 394
C+PY + CE ++ G R C TP+C +C P Y +Y+DD G AYS+P +
Sbjct: 192 CQPYTLEACEHHVPGDRPPCTEG-GGTPKCSHQCIPDYTTKAYKDDKVHGHKAYSVPNDV 250
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI +GPVE + T+Y+D YK+G+Y+H +G LG HAI+IIGWG E +
Sbjct: 251 GKIQQEIMHYGPVEAAFTVYSDFPSYKSGVYRHTSGSELGGHAIKIIGWGTEGGDD---- 306
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITA 492
YWL+ NS+N++WG+ G F+I+RG NECGIE ++ A
Sbjct: 307 ---YWLINNSWNSDWGDKGTFKILRGSNECGIEGEVVA 341
>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 329
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 117/265 (44%), Positives = 158/265 (59%), Gaps = 22/265 (8%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L Q R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRKEDPNLRQKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
A+SDR+CI S GK+ CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAISDRICIQSGGKQSY-------------CGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 171
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
GCRPY P C+ ++ G + +C D TP+C + CQ GY+ SYE D ++G +Y++ +
Sbjct: 172 HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLS 231
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
E I ++I HGPVE + IY D + YK+GIY++ G + HA+R+IGWG E GT
Sbjct: 232 VESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVE---NGT 288
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ YWL AN++N +WGE G FRI
Sbjct: 289 A----YWLAANTWNEDWGEKGYFRI 309
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 79/182 (43%), Positives = 111/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW++ EN GCRPY P C+ ++ G +C TP+C + CQ GY+
Sbjct: 157 YWVLRGIVTGGSKEN---HTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYN 213
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
SYE D ++G +Y++ + E I ++I HGPVE + IY D + YK+GIY++ G +
Sbjct: 214 TSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFIS 273
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA+R+IGWG E GT+ YWL AN++N +WGE G FRIVRG+NEC IE++I AGL
Sbjct: 274 GHAVRLIGWGVE---NGTA----YWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGL 326
Query: 495 PK 496
K
Sbjct: 327 IK 328
>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 341
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 117/259 (45%), Positives = 162/259 (62%), Gaps = 12/259 (4%)
Query: 81 QNRLPLLVQLSDPLEE--LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRV 138
+N+ P L+ DP E +PE +D R W C + IRDQ +CGS WA+ A+SDR+
Sbjct: 68 RNQNPNLIVKDDPEPEDDIPEEYDPRKIWSNCTSFY-IRDQANCGSCWAVSTAAAISDRI 126
Query: 139 CIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRP 197
CIA++ ++ V +S+ DLV+CC CG GC GG+ KAW+Y+ G+VSGG Y SK+ CRP
Sbjct: 127 CIATKARKQVNISATDLVTCCTPTCGFGCDGGWSIKAWEYFTYAGLVSGGEYRSKRCCRP 186
Query: 198 YEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 256
Y I PC + N ++ E +TP C +KCQPGY Y D +G A+ LP + E I
Sbjct: 187 YPIHPCGHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYRMDKRYGTDAFQLPKSVEAIQ 246
Query: 257 REIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKY 316
+E+ ++GPV S +Y D LYK+GIY+H AG G HA+++IGW GT + Y
Sbjct: 247 KELLKNGPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAVKMIGW-------GTENRTDY 299
Query: 317 WLVANSFNTNWGENGLFRI 335
WL+ANS++ +WGENG FRI
Sbjct: 300 WLIANSWHDDWGENGYFRI 318
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 100/159 (62%), Gaps = 8/159 (5%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
CRPY I PC + N + E +TP C +KCQPGY Y D +G A+ LP + E
Sbjct: 184 CRPYPIHPCGHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYRMDKRYGTDAFQLPKSVE 243
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I +E+ ++GPV S +Y D LYK+GIY+H AG G HA+++IGW GT +
Sbjct: 244 AIQKELLKNGPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAVKMIGW-------GTENR 296
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
YWL+ANS++ +WGENG FRI+RG N+CGIE ++ AGL
Sbjct: 297 TDYWLIANSWHDDWGENGYFRIIRGINDCGIEENVAAGL 335
>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 115/242 (47%), Positives = 159/242 (65%), Gaps = 9/242 (3%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++PE FDAR WP C +++ I DQ +CGS WA+ A+SDR+CIAS G++ V +S+ D+
Sbjct: 1 DIPESFDARTKWPKCSSLKHIHDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDI 60
Query: 156 VSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC 213
+SCC + CG GC GG+ +A+ Y+ G V+GG Y + GCRPY PC + ++
Sbjct: 61 LSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGE 120
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
NE TP+C+RKCQ Y SY+ D + G+ AY +P +E+ I REI ++GPV G+ T+Y
Sbjct: 121 CPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYE 180
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D YK GIYKH AG G HAI+IIGWG+E + V YWL+ANS++ +WGENG F
Sbjct: 181 DFSYYKKGIYKHTAGKARGGHAIKIIGWGKE-------NGVPYWLIANSWHNDWGENGYF 233
Query: 334 RI 335
RI
Sbjct: 234 RI 235
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 79/159 (49%), Positives = 103/159 (64%), Gaps = 8/159 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY PC + + NE TP+C+RKCQ Y SY+ D + G+ AY +P +E
Sbjct: 100 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSE 159
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I REI ++GPV G+ T+Y D YK GIYKH AG G HAI+IIGWG+E +
Sbjct: 160 KAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE-------N 212
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
V YWL+ANS++ +WGENG FRI+RG N CGIE ++ AG
Sbjct: 213 GVPYWLIANSWHNDWGENGYFRILRGSNHCGIEENVVAG 251
>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
Length = 350
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 119/287 (41%), Positives = 174/287 (60%), Gaps = 22/287 (7%)
Query: 62 LTLSELEMRMGV--HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQ 119
++ +++ MG P +P R + + LPE FD R +P C ++Q++RDQ
Sbjct: 51 MSFDQIQAMMGTIATPVHMIPDERYTPFETIQNL--SLPESFDLREAYPKCESLQQVRDQ 108
Query: 120 GSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK---DCGNGCQGGFHGKAWK 176
+CGS WA G VEA+SDR+CIAS K R+SS++L+SCC+ CG GC GG+ AW
Sbjct: 109 SNCGSCWAFGTVEAISDRICIASGQKDQTRISSENLLSCCRGTFACGMGCNGGYTAGAWN 168
Query: 177 YWVTTGIVSGGTYA-----SKQGCRPYEI-PCERYMNGSHSSCQD-NEPNTPECIRKCQP 229
Y+V TG+VSG Y SK C+PY PC ++ G + +C D + NTP+C +C
Sbjct: 169 YYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSHHVQGEYQACTDLPQFNTPKCYTECNS 228
Query: 230 GYDV-SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG 288
Y SYE DL+ G +YS+P +EE I EI+++G S +Y+D + Y +G+Y++ +G
Sbjct: 229 QYTQNSYEQDLHKGVSSYSVPKSEEQIKAEIYQYGSTTASFNVYSDFLTYSSGVYQNTSG 288
Query: 289 GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G HAI+++GWG E GT YWL ANS+N++WGENG F+I
Sbjct: 289 SYMGGHAIKMLGWGVE---NGT----PYWLCANSWNSSWGENGFFKI 328
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/184 (40%), Positives = 113/184 (61%), Gaps = 12/184 (6%)
Query: 314 VKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSC-QANEPNTPECIRKCQP 371
VK LV+ + T+ +N + C+PY P C ++ G +C + NTP+C +C
Sbjct: 171 VKTGLVSGNLYTDDNQNS--KTECQPYSFPPCSHHVQGEYQACTDLPQFNTPKCYTECNS 228
Query: 372 GYDV-SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG 430
Y SYE DL+ G +YS+P +EE I EI+++G S +Y+D + Y +G+Y++ +G
Sbjct: 229 QYTQNSYEQDLHKGVSSYSVPKSEEQIKAEIYQYGSTTASFNVYSDFLTYSSGVYQNTSG 288
Query: 431 GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADI 490
+G HAI+++GWG E GT YWL ANS+N++WGENG F+I+RG NECGIE+ +
Sbjct: 289 SYMGGHAIKMLGWGVE---NGT----PYWLCANSWNSSWGENGFFKILRGSNECGIESGM 341
Query: 491 TAGL 494
AG
Sbjct: 342 VAGF 345
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
Length = 337
Score = 231 bits (589), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 115/274 (41%), Positives = 164/274 (59%), Gaps = 9/274 (3%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
+ +++ +GV ++ +N V+ S +LPE FDAR W CP+I EIRDQ SC
Sbjct: 53 IDQVKQNLGVLEETPEDRNTQRQTVRYSVSENDLPESFDARQKWANCPSISEIRDQSSCS 112
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA+ + A++DR+CI S G++ RLS+ D+VSCC CG GC GG +W YW G+
Sbjct: 113 SCWAVSSASAITDRICIHSNGQKKPRLSAIDIVSCCAYCGYGCNGGIPAMSWDYWTREGV 172
Query: 184 VSGGTYASKQGCRPYEIP-CER-YMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
V+GGT + GC PY P C + C + TP+C +KC GY+ +YE D
Sbjct: 173 VTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVK 232
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G+ +Y++ E IM EI ++GPV+G ++ D ++YK+GIY + G +G HAIR+IGW
Sbjct: 233 GKSSYNVGGQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGW 292
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E + VKYWL+ANS+N WGE G FR+
Sbjct: 293 GVE-------NGVKYWLIANSWNEGWGEKGYFRM 319
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/162 (45%), Positives = 99/162 (61%), Gaps = 9/162 (5%)
Query: 336 GCRPYEIP-CER-YMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC PY P C + C + TP+C +KC GY+ +YE D G+ +Y++
Sbjct: 183 GCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGGQ 242
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E IM EI ++GPV+G ++ D ++YK+GIY + G +G HAIR+IGWG E
Sbjct: 243 ETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVE------- 295
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ VKYWL+ANS+N WGE G FR+ RG NECGIEA I AGLP
Sbjct: 296 NGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 337
>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
Length = 343
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 113/246 (45%), Positives = 163/246 (66%), Gaps = 14/246 (5%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ +D R ++ C ++ IRDQ CGS WA+ A EA+SDR CIAS G + LS++D++
Sbjct: 81 IPDHYDVRDDFSQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGVVNTLLSAEDIL 140
Query: 157 SCCKD---CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNG-SHS 211
+CC CG+GC+GG+ +AWKYWV G+V+GG+Y S+ GC+PY I PC + +NG +
Sbjct: 141 TCCIGEYYCGDGCEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWP 200
Query: 212 SCQDNEPNTPECIRKC--QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
C +++ +TP+C+ C Y + YE D ++G AY++ + I EI ++GPVE
Sbjct: 201 KCPNSDADTPKCVDHCTSNSSYPIPYEKDKHYGATAYAVSRKVDQIQSEILKNGPVEVGF 260
Query: 270 TIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 329
T+YAD YK+G+Y HVAG LG HA++++GWG + GT YWL ANS+NTNWGE
Sbjct: 261 TVYADFYQYKSGVYVHVAGPELGGHAVKLLGWG---VDNGTP----YWLAANSWNTNWGE 313
Query: 330 NGLFRI 335
NG FRI
Sbjct: 314 NGYFRI 319
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 81/182 (44%), Positives = 114/182 (62%), Gaps = 18/182 (9%)
Query: 327 WGENGLF-------RIGCRPYEI-PCERYMNG-SRSSCQANEPNTPECIRKC--QPGYDV 375
W +NGL + GC+PY I PC + +NG + C ++ +TP+C+ C Y +
Sbjct: 165 WVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPNSDADTPKCVDHCTSNSSYPI 224
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
YE D ++G AY++ + I EI ++GPVE T+YAD YK+G+Y HVAG LG
Sbjct: 225 PYEKDKHYGATAYAVSRKVDQIQSEILKNGPVEVGFTVYADFYQYKSGVYVHVAGPELGG 284
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
HA++++GWG + GT YWL ANS+NTNWGENG FRI+RG NECGIE+ + AG+P
Sbjct: 285 HAVKLLGWG---VDNGTP----YWLAANSWNTNWGENGYFRILRGVNECGIESQVVAGMP 337
Query: 496 KI 497
+
Sbjct: 338 DL 339
>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 122/282 (43%), Positives = 170/282 (60%), Gaps = 23/282 (8%)
Query: 61 KLTLSELEMRMGVHPDS----KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ + L+ +MG D KLP++ VQ + E+PE FDAR WP C +I+E+
Sbjct: 41 QFNEATLKTQMGTFLDEPDFMKLPEST----VQFENL--EIPESFDARQQWPNCESIKEV 94
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAW 175
RDQ +CGS WA GA EAMSDR+CIA+ + R+S++DL++CC CG GC GGF AW
Sbjct: 95 RDQSTCGSCWAFGAAEAMSDRLCIAT--GKQTRISTEDLLTCCGITCGMGCNGGFPSGAW 152
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIP-CERYMN-GSHSSCQDNEPNTPECIRKCQPGYDV 233
Y+ G+V+G + CRPY P C+ +++ G + C D++P TP C++ C
Sbjct: 153 NYFKNKGLVTGDLFGDNSWCRPYTFPPCDHHVDDGKYGPCGDSQP-TPACVKSCTAQSGR 211
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 293
+Y+ D +YS+ + E I EI GPVE S T+Y D + YK+G+Y++VAG LG
Sbjct: 212 NYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPVEASFTVYEDFLTYKSGVYQNVAGANLGG 271
Query: 294 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA++IIGWG E V YWLV NS+N WGENGLF+I
Sbjct: 272 HAVKIIGWGVE-------KNVPYWLVVNSWNEGWGENGLFKI 306
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 71/159 (44%), Positives = 95/159 (59%), Gaps = 10/159 (6%)
Query: 337 CRPYEIP-CERYMN-GSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
CRPY P C+ +++ G C ++P TP C++ C +Y+ D +YS+ +
Sbjct: 172 CRPYTFPPCDHHVDDGKYGPCGDSQP-TPACVKSCTAQSGRNYDSDKIRSIDSYSVSSKV 230
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E I EI GPVE S T+Y D + YK+G+Y++VAG LG HA++IIGWG E
Sbjct: 231 EQIQNEIMTFGPVEASFTVYEDFLTYKSGVYQNVAGANLGGHAVKIIGWGVE-------K 283
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
V YWLV NS+N WGENGLF+I+RG N GIE I AG
Sbjct: 284 NVPYWLVVNSWNEGWGENGLFKILRGSNHVGIEGGIYAG 322
>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
Length = 366
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/331 (39%), Positives = 178/331 (53%), Gaps = 27/331 (8%)
Query: 16 LDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKL---PFYGAEKNALSKLTLSELEMRMG 72
L+LS + C ++ AF +DH L L +A S T +E+
Sbjct: 32 LNLSHFQKVFVLAALCAVTLAFVPIDHKSALETLTGQALVDYVNSAQSLFTTEHVEVSEE 91
Query: 73 VHP----DSKLPQNRLPLL--VQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
V D K + ++ L+ +P FD+R +W C +I+ IRDQ +CGS W
Sbjct: 92 VMKSRVMDVKYAAAHSDEIRATEVDTVLDTIPASFDSRTHWSECKSIKLIRDQATCGSCW 151
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVS 185
A GA E +SDR CI ++G + +S DDL+SCC CGNGC+GG+ +A ++W + G+V+
Sbjct: 152 AFGAAEVISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVT 211
Query: 186 GGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRI 244
GG Y GC+PY I PC +S E TP C CQ GY +Y D +FG
Sbjct: 212 GGDYHGA-GCKPYPIAPC--------TSGNCPESKTPSCSLSCQSGYTTAYAKDKHFGTS 262
Query: 245 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
AY++ +I EI +GPVE + T+Y D YK+G+YKH AG LG HAI+IIGWG E
Sbjct: 263 AYAVARKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTE 322
Query: 305 PLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
S YWLVANS+ +WGE+G FRI
Sbjct: 323 -------SGSPYWLVANSWGNSWGESGFFRI 346
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 96/163 (58%), Gaps = 16/163 (9%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I PC +S E TP C CQ GY +Y D +FG AY++
Sbjct: 219 GCKPYPIAPC--------TSGNCPESKTPSCSLSCQSGYTTAYAKDKHFGTSAYAVARKV 270
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+I EI +GPVE + T+Y D YK+G+YKH AG LG HAI+IIGWG E S
Sbjct: 271 ASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTE-------S 323
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLVANS+ +WGE+G FRI RG ++CGIE+ + AG K+
Sbjct: 324 GSPYWLVANSWGNSWGESGFFRIFRGDDQCGIESAVVAGKAKV 366
>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
Length = 324
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 116/272 (42%), Positives = 159/272 (58%), Gaps = 26/272 (9%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
L L +G++ D P LP++ + + +P+ FDAR WP+C +I+ IRD+G+CG
Sbjct: 56 LKALADVIGINRD---PNVTLPVV--FHEAISGIPDSFDAREQWPFCESIRTIRDEGACG 110
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA AVE MSDR+C+AS G++ S++++VSCC CG GC+GGF + +KYWVT GI
Sbjct: 111 SCWAFAAVEVMSDRLCLASEGRKKFIFSAEEVVSCCTACGGGCRGGFLNEPYKYWVTNGI 170
Query: 184 VSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGR 243
SGG Y SK GC+PY TP+C + C GY+ S+E DL
Sbjct: 171 PSGGDYGSKLGCKPYTAAV--------------SGETPQCQKACVSGYEKSWEKDLRHAT 216
Query: 244 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 303
AY + I REI +GPV M +Y D Y TGIY+H +G +G HA++IIGWG
Sbjct: 217 SAYQVNGGVLQIQREILDNGPVTAYMEVYEDFYSYGTGIYQHTSGSFVGGHAVKIIGWGS 276
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E + V YW+ ANS+ T +GE+G FRI
Sbjct: 277 E-------NDVPYWIAANSWGTGFGEDGFFRI 301
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 73/181 (40%), Positives = 97/181 (53%), Gaps = 24/181 (13%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
KYW V N + G+ G ++GC+PY A TP+C + C GY+
Sbjct: 163 KYW-VTNGIPSG-GDYGS-KLGCKPYT--------------AAVSGETPQCQKACVSGYE 205
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
S+E DL AY + I REI +GPV M +Y D Y TGIY+H +G +G
Sbjct: 206 KSWEKDLRHATSAYQVNGGVLQIQREILDNGPVTAYMEVYEDFYSYGTGIYQHTSGSFVG 265
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HA++IIGWG E + V YW+ ANS+ T +GE+G FRI+RG N GIE+ I AG
Sbjct: 266 GHAVKIIGWGSE-------NDVPYWIAANSWGTGFGEDGFFRILRGSNCAGIESYIVAGY 318
Query: 495 P 495
P
Sbjct: 319 P 319
>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
Length = 337
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 121/271 (44%), Positives = 162/271 (59%), Gaps = 11/271 (4%)
Query: 66 ELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSG 125
+L R P+ + +++ P + S E +P+ FDAR WP+CPTI +IRDQ SCGS
Sbjct: 51 QLMFRAIREPEEQ--RSKRPTVSHESLGDENIPKTFDAREQWPHCPTIGQIRDQSSCGSC 108
Query: 126 WALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVS 185
WA GAVEAMSDR+CI S G LSS DLVSCC CG GCQGG+ AW +W GIV+
Sbjct: 109 WAFGAVEAMSDRLCIHSNGTFTKSLSSIDLVSCCGYCGFGCQGGYPPAAWDFWQAYGIVT 168
Query: 186 GGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRI 244
GG+ GCR Y P C + + + C +TP+C+ KC ++ YE D I
Sbjct: 169 GGSKEDPMGCRSYPFPKCSHHGSKKYPPCPHRIYDTPKCVPKCDTP-NIDYETDKTRANI 227
Query: 245 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
Y++ ++ IM+EI +GPVE + +Y D YK G+Y H G +G HAIRI+GWG+E
Sbjct: 228 TYNVQRSQMAIMKEIMINGPVEAAFEVYEDFFGYKQGVYFHSTGEFIGGHAIRILGWGEE 287
Query: 305 PLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GT YWL+ANS+N WGE+G F++
Sbjct: 288 ---NGTP----YWLIANSWNEGWGEDGYFKM 311
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 70/166 (42%), Positives = 102/166 (61%), Gaps = 9/166 (5%)
Query: 335 IGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
+GCR Y P C + + C +TP+C+ KC ++ YE D I Y++ +
Sbjct: 176 MGCRSYPFPKCSHHGSKKYPPCPHRIYDTPKCVPKCDTP-NIDYETDKTRANITYNVQRS 234
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ IM+EI +GPVE + +Y D YK G+Y H G +G HAIRI+GWG+E GT
Sbjct: 235 QMAIMKEIMINGPVEAAFEVYEDFFGYKQGVYFHSTGEFIGGHAIRILGWGEE---NGTP 291
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGL 499
YWL+ANS+N WGE+G F+++RG+NECGIE ++TAGLP++ +
Sbjct: 292 ----YWLIANSWNEGWGEDGYFKMLRGKNECGIEDEVTAGLPELSI 333
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
Length = 337
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 114/274 (41%), Positives = 163/274 (59%), Gaps = 9/274 (3%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
+ +++ +GV ++ +N V+ S +LPE FDAR W CP+I EIRDQ SC
Sbjct: 53 IDQVKQNLGVLEETPEDRNTQRQTVRYSVSENDLPESFDARQKWANCPSISEIRDQSSCS 112
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA+ + A++DR+CI S G++ RLS+ D+VSCC CG GC GG +W YW G+
Sbjct: 113 SCWAVSSASAITDRICIHSNGQKKPRLSAIDIVSCCAYCGYGCNGGIPAMSWDYWTREGV 172
Query: 184 VSGGTYASKQGCRPYEIP-CER-YMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
V+GGT + GC PY P C + C + TP+C +KC GY+ +YE D
Sbjct: 173 VTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVK 232
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G+ +Y++ E M EI ++GPV+G ++ D ++YK+GIY + G +G HAIR+IGW
Sbjct: 233 GKSSYNVGEQETDFMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGW 292
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E + VKYWL+ANS+N WGE G FR+
Sbjct: 293 GVE-------NGVKYWLIANSWNEGWGEKGYFRM 319
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 98/162 (60%), Gaps = 9/162 (5%)
Query: 336 GCRPYEIP-CER-YMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC PY P C + C + TP+C +KC GY+ +YE D G+ +Y++
Sbjct: 183 GCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQ 242
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E M EI ++GPV+G ++ D ++YK+GIY + G +G HAIR+IGWG E
Sbjct: 243 ETDFMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVE------- 295
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ VKYWL+ANS+N WGE G FR+ RG NECGIEA I AGLP
Sbjct: 296 NGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 337
>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 338
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 119/288 (41%), Positives = 170/288 (59%), Gaps = 17/288 (5%)
Query: 53 GAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPL--LVQLSDPLEELPEGFDARINWPYC 110
A ++ L+ +G D L +LP+ + Q +DP+ PE FD+R WP C
Sbjct: 45 AARYQKFEEMDPETLQGHLGALIDEPL-WAKLPIKNVEQTNDPI---PESFDSREQWPNC 100
Query: 111 PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF 170
+I+ IRDQ +CGS WA A E SDR+CIAS + +SS+DL+ CC CGNGCQGG+
Sbjct: 101 NSIKTIRDQSTCGSCWAFAATETYSDRICIASNQELQTSISSEDLLECCATCGNGCQGGY 160
Query: 171 HGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AWKY TG+ +GG Y C+PY PC+ ++ G + C +P TP+C+++C
Sbjct: 161 PSAAWKYMKATGVSTGGLYGDDSSCKPYVFPPCDHHVVGQYPPCGPIKP-TPKCVKQCNS 219
Query: 230 GY-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVA 287
Y + +Y+ DL+ Y LP N E I REI HGPV+ S + +D + YK+G+Y +
Sbjct: 220 QYTEKTYQQDLHHPSKVYQLPNNAEAIQREIMAHGPVQASFRVASDFLTYKSGVYIRDPK 279
Query: 288 GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++IIGWG E +GT YWL+ANS+N +WGENGLF++
Sbjct: 280 LKYEGGHSVKIIGWGVE---QGTP----YWLIANSWNEDWGENGLFKM 320
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 105/162 (64%), Gaps = 11/162 (6%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGY-DVSYEDDLNFGRIAYSLPANE 394
C+PY P C+ ++ G C +P TP+C+++C Y + +Y+ DL+ Y LP N
Sbjct: 185 CKPYVFPPCDHHVVGQYPPCGPIKP-TPKCVKQCNSQYTEKTYQQDLHHPSKVYQLPNNA 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E I REI HGPV+ S + +D + YK+G+Y + G H+++IIGWG E +GT
Sbjct: 244 EAIQREIMAHGPVQASFRVASDFLTYKSGVYIRDPKLKYEGGHSVKIIGWGVE---QGTP 300
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWL+ANS+N +WGENGLF+++RG+NECGIEA++ AGLP
Sbjct: 301 ----YWLIANSWNEDWGENGLFKMLRGKNECGIEAEVVAGLP 338
>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
Length = 329
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 114/244 (46%), Positives = 151/244 (61%), Gaps = 18/244 (7%)
Query: 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
L +P FD+R W C +I+ IRDQ +CGS WA GA E +SDR CI ++G + +S D
Sbjct: 82 LASVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPD 141
Query: 154 DLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHS 211
DL+SCC CGNGC+GG+ +A ++W + G+V+GG Y GC+PY I PC +
Sbjct: 142 DLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA-GCKPYPIAPC--------T 192
Query: 212 SCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
S E TP C CQ GY +Y D +FG AY++P N +I EI+ +GPVE + ++
Sbjct: 193 SGNCPESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSV 252
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
Y D YK+G+YKH AG LG HAI+IIGWG E S YWLVANS+ NWGE+G
Sbjct: 253 YEDFYKYKSGVYKHTAGKYLGGHAIKIIGWGTE-------SGSPYWLVANSWGVNWGESG 305
Query: 332 LFRI 335
F+I
Sbjct: 306 FFKI 309
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 99/163 (60%), Gaps = 16/163 (9%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I PC +S E TP C CQ GY +Y D +FG AY++P N
Sbjct: 182 GCKPYPIAPC--------TSGNCPESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNA 233
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+I EI+ +GPVE + ++Y D YK+G+YKH AG LG HAI+IIGWG E S
Sbjct: 234 ASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKIIGWGTE-------S 286
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLVANS+ NWGE+G F+I RG ++CGIE+ + AG K+
Sbjct: 287 GSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAVVAGKAKV 329
>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
Length = 330
Score = 228 bits (581), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 114/244 (46%), Positives = 153/244 (62%), Gaps = 18/244 (7%)
Query: 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
L+ +P FD+R +W C +I+ IR+Q +CGS WA GA E +SDR CI ++G + +S D
Sbjct: 83 LDTIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPD 142
Query: 154 DLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHS 211
DL+SCC CGNGC+GG+ +A ++W + G+V+GG Y GC+PY I PC +
Sbjct: 143 DLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHG-AGCKPYPIAPC------TSG 195
Query: 212 SCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
SC E TP C CQPGY +Y D +FG AY++ +I EI +GPVE + T+
Sbjct: 196 SCP--ESKTPACSLSCQPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTV 253
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
Y D YK+G+YKH AG LG HAI+IIGWG E S YWLVANS+ T+WGE+G
Sbjct: 254 YEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTE-------SGSPYWLVANSWGTSWGESG 306
Query: 332 LFRI 335
F+I
Sbjct: 307 FFKI 310
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 99/163 (60%), Gaps = 16/163 (9%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I PC + SC E TP C CQPGY +Y D +FG AY++
Sbjct: 183 GCKPYPIAPC------TSGSCP--ESKTPACSLSCQPGYTTAYAKDKHFGTSAYAVAKKV 234
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+I EI +GPVE + T+Y D YK+G+YKH AG LG HAI+IIGWG E S
Sbjct: 235 ASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTE-------S 287
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLVANS+ T+WGE+G F+I RG ++CGIE+ + AG ++
Sbjct: 288 GSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESAVVAGKARV 330
>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
Length = 320
Score = 228 bits (581), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 110/240 (45%), Positives = 148/240 (61%), Gaps = 20/240 (8%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
E++P+ FDARI WP C +I++IR+QGSCGS WA GAVE MSDR+CIAS + S+ D
Sbjct: 77 EQIPQNFDARIVWPQCESIRKIRNQGSCGSCWAFGAVETMSDRLCIASNATKKFEFSAQD 136
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQ 214
L++CCK+CG+GC GG+ +AW+YWVT GIVSGG + + QGC PY + R
Sbjct: 137 LLACCKECGHGCGGGYSSRAWQYWVTDGIVSGGDFNTSQGCHPYSVQAFR---------- 186
Query: 215 DNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
+ TP C C P Y +Y +D +G +Y + N E I EI GPV+ S +Y
Sbjct: 187 --DSTTPNCSSFCTNPKYQKNYSEDKRYGARSYRIAKNIEQIQAEIMTSGPVQASYVVYD 244
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D Y+ G+Y+HV G G H+++I+GWG+E GT YWLVANS+ +WG G F
Sbjct: 245 DFYSYQNGVYQHVLGNVSGRHSVKILGWGRE---NGTD----YWLVANSWGRDWGRLGGF 297
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 97/195 (49%), Gaps = 34/195 (17%)
Query: 310 TSSVVKYWLV-----ANSFNTNWGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPE 364
+S +YW+ FNT+ GC PY + R + TP
Sbjct: 153 SSRAWQYWVTDGIVSGGDFNTS--------QGCHPYSVQAFR------------DSTTPN 192
Query: 365 CIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 423
C C P Y +Y +D +G +Y + N E I EI GPV+ S +Y D Y+ G
Sbjct: 193 CSSFCTNPKYQKNYSEDKRYGARSYRIAKNIEQIQAEIMTSGPVQASYVVYDDFYSYQNG 252
Query: 424 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE-NGLFRIVRGQN 482
+Y+HV G G H+++I+GWG+E GT YWLVANS+ +WG G F+ +RG+N
Sbjct: 253 VYQHVLGNVSGRHSVKILGWGRE---NGTD----YWLVANSWGRDWGRLGGFFKFLRGEN 305
Query: 483 ECGIEADITAGLPKI 497
C IE++I G PKI
Sbjct: 306 HCDIESNILGGDPKI 320
>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 118/286 (41%), Positives = 169/286 (59%), Gaps = 13/286 (4%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYC- 110
+ A +N +S ++ +GV P+++ +LP + S +E+P+ FDAR WP C
Sbjct: 41 WKAGRNFDIDTPISHIKQLLGVLPETE-NTPKLPKKIH-SINAQEIPDSFDAREAWPDCA 98
Query: 111 PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF 170
P I IRDQ +CGS WA GAVEAMSDR+CI S V +S++D + CC CG GC GG
Sbjct: 99 PIIGNIRDQSTCGSCWAFGAVEAMSDRICIHSNATVKVNISAEDPLDCCTICGMGCNGGM 158
Query: 171 HGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW +W GIV+GG Y GC+ Y PCE +++G C +P TP+C ++C
Sbjct: 159 PAMAWLHWTVNGIVTGGNYEDTNGCKAYSFAPCEHHVDGDLPPCGPTKP-TPDCKKECDS 217
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
G ++Y++DL G Y + + I EI +GPVE S ++Y D + YK+G+Y+H+ G
Sbjct: 218 GSSLTYQNDLTHGS-NYGIDPYPKQIQTEIMTNGPVEASFSVYEDFLSYKSGVYQHLEGE 276
Query: 290 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HAI+I+GWG E + YWLVANS+N +WG+ G F+I
Sbjct: 277 YAGGHAIKILGWGVE-------NDTPYWLVANSWNEDWGDKGYFKI 315
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 69/163 (42%), Positives = 102/163 (62%), Gaps = 10/163 (6%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+ Y PCE +++G C +P TP+C ++C G ++Y++DL G Y +
Sbjct: 182 GCKAYSFAPCEHHVDGDLPPCGPTKP-TPDCKKECDSGSSLTYQNDLTHGS-NYGIDPYP 239
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EI +GPVE S ++Y D + YK+G+Y+H+ G G HAI+I+GWG E +
Sbjct: 240 KQIQTEIMTNGPVEASFSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVE-------N 292
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLVANS+N +WG+ G F+I+RG NECGIE I AG+P++
Sbjct: 293 DTPYWLVANSWNEDWGDKGYFKILRGSNECGIEGSIVAGIPEL 335
>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
Length = 381
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 115/276 (41%), Positives = 160/276 (57%), Gaps = 21/276 (7%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
L + R G + + +LP + L E PE FDAR W +CP++ IR+QG C
Sbjct: 102 LRHDQYRTGALLYEEAARAKLPQGIVLKLQEEPFPESFDARQKWSFCPSVGTIRNQGCCA 161
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S +A+ AV ++DR CI S GK + D++SCC CG GC GG W YWV GI
Sbjct: 162 SSYAVAAVATITDRWCIHSEGKSQFSFGAYDVLSCCHRCGFGCDGGVPSAVWHYWVENGI 221
Query: 184 VSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPE----CIRKCQPGYDVSYEDDL 239
SGG Y S +GC+ Y C+ E P C+R+CQPGY+ +Y +D
Sbjct: 222 TSGGAYESHEGCQSYPF----------GVCKPQEIFAPHVDLICLRQCQPGYNTTYLEDK 271
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
+FGR+AYS+P +E+ I+ E+F GPV+ S T+Y D I YK+G+Y+H G +G+H+++I+
Sbjct: 272 HFGRVAYSVPRDEDRILYELFYFGPVQASFTVYTDFIQYKSGVYRHTYGVRVGDHSVKIV 331
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GWG E GT K+WL ANS+ WGENG F+I
Sbjct: 332 GWGVE---NGT----KFWLCANSWGAEWGENGFFKI 360
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 110/173 (63%), Gaps = 14/173 (8%)
Query: 327 WGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPE----CIRKCQPGYDVSYEDDLN 382
W ENG+ G C+ Y G C+ E P C+R+CQPGY+ +Y +D +
Sbjct: 216 WVENGITSGGAYESHEGCQSYPFGV---CKPQEIFAPHVDLICLRQCQPGYNTTYLEDKH 272
Query: 383 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 442
FGR+AYS+P +E+ I+ E+F GPV+ S T+Y D I YK+G+Y+H G +G+H+++I+G
Sbjct: 273 FGRVAYSVPRDEDRILYELFYFGPVQASFTVYTDFIQYKSGVYRHTYGVRVGDHSVKIVG 332
Query: 443 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
WG E GT K+WL ANS+ WGENG F+I+RG++ +E+++ AGLP
Sbjct: 333 WGVE---NGT----KFWLCANSWGAEWGENGFFKIIRGEDHLSVESNVVAGLP 378
>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
Length = 247
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 113/234 (48%), Positives = 151/234 (64%), Gaps = 9/234 (3%)
Query: 104 RINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCG 163
R WP C TI EIRDQ SCGS WA A AMSDRVCI S G+ RL++ D +SCC CG
Sbjct: 1 RSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTYCG 60
Query: 164 NGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGS--HSSCQDNEPNTP 221
GC+GG+ KAW YW+ GIV+GGT+ ++ GC+P+ ++ S +S C TP
Sbjct: 61 QGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTP 120
Query: 222 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 281
C R CQ GY+ +YE D +G +Y++ +E IM+EI ++GPVE + I+ D +Y++G
Sbjct: 121 PCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSG 180
Query: 282 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IY HVAG +G HA+R+IGWG E + V YWL+ANS+N WGENG FR+
Sbjct: 181 IYHHVAGKFIGRHAVRMIGWGVE-------NGVNYWLMANSWNEEWGENGYFRM 227
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 76/166 (45%), Positives = 109/166 (65%), Gaps = 9/166 (5%)
Query: 334 RIGCRPYEIPCERYMNGSR--SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
R GC+P+ ++ SR S C TP C R CQ GY+ +YE D +G +Y++
Sbjct: 89 RTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPCARACQTGYNKTYEQDKFYGNSSYNVG 148
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+E IM+EI ++GPVE + I+ D +Y++GIY HVAG +G HA+R+IGWG E
Sbjct: 149 EHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVE----- 203
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ V YWL+ANS+N WGENG FR+VRG+NECGIE+++ AG+P++
Sbjct: 204 --NGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAGMPRL 247
>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
Length = 245
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 108/216 (50%), Positives = 152/216 (70%), Gaps = 10/216 (4%)
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVT 180
C WA GAVEA+SDR+CI + V +S++DL++CC CG+GC GG+ +AW +W
Sbjct: 11 CRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 70
Query: 181 TGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
G+VSGG Y S GCRPY IP CE ++NGS C E +TP+C + C+PGY +Y+ D
Sbjct: 71 KGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDK 129
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
++G +YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+
Sbjct: 130 HYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 189
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GWG E GT YWLVANS+NT+WG+NG F+I
Sbjct: 190 GWGVE---NGT----PYWLVANSWNTDWGDNGFFKI 218
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 86/169 (50%), Positives = 126/169 (74%), Gaps = 11/169 (6%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +
Sbjct: 77 GLYESHVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNS 135
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E
Sbjct: 136 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE- 194
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
GT YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 195 --NGT----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 237
>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
Length = 330
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 113/244 (46%), Positives = 152/244 (62%), Gaps = 18/244 (7%)
Query: 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
L+ +P FD+R +W C +I+ IR+Q +CGS WA GA E +SDR CI ++G + +S D
Sbjct: 83 LDTIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPD 142
Query: 154 DLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHS 211
DL+SCC CGNGC+GG+ +A ++W + G+V+GG Y GC+PY I PC +
Sbjct: 143 DLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHG-AGCKPYPIAPC------TSG 195
Query: 212 SCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
SC E TP C CQ GY +Y D +FG AY++ +I EI +GPVE + T+
Sbjct: 196 SCP--ESKTPACSLSCQSGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTV 253
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
Y D YK+G+YKH AG LG HAI+IIGWG E S YWLVANS+ T+WGE+G
Sbjct: 254 YEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTE-------SGSPYWLVANSWGTSWGESG 306
Query: 332 LFRI 335
F+I
Sbjct: 307 FFKI 310
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 98/163 (60%), Gaps = 16/163 (9%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I PC + SC E TP C CQ GY +Y D +FG AY++
Sbjct: 183 GCKPYPIAPC------TSGSCP--ESKTPACSLSCQSGYTTAYAKDKHFGTSAYAVAKKV 234
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+I EI +GPVE + T+Y D YK+G+YKH AG LG HAI+IIGWG E S
Sbjct: 235 ASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTE-------S 287
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLVANS+ T+WGE+G F+I RG ++CGIE+ + AG ++
Sbjct: 288 GSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESAVVAGKARV 330
>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
Length = 330
Score = 225 bits (573), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 113/250 (45%), Positives = 152/250 (60%), Gaps = 18/250 (7%)
Query: 88 VQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRH 147
+++ L +P FD+R W C +I+ IR+Q +CGS WA GA E +SDR CI ++G +
Sbjct: 77 TEVNTVLASIPASFDSRTQWSECKSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQ 136
Query: 148 VRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERY 205
+S DDL+SCC CGNGC+GG+ +A ++W + G+V+GG Y GC+PY I PC
Sbjct: 137 PIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC--- 192
Query: 206 MNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPV 265
+S E TP C CQ GY +Y D +FG AY++ + I EI +GPV
Sbjct: 193 -----TSGNCPESKTPACSLSCQSGYSTAYAKDKHFGASAYAVARSVAAIQTEIMTNGPV 247
Query: 266 EGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 325
E + T+Y D YK+G+YKH AG LG HAI+IIGWG E S YWLVANS+ T
Sbjct: 248 EAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTE-------SGSPYWLVANSWGT 300
Query: 326 NWGENGLFRI 335
NWGE+G F+I
Sbjct: 301 NWGESGFFKI 310
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 97/163 (59%), Gaps = 16/163 (9%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I PC +S E TP C CQ GY +Y D +FG AY++ +
Sbjct: 183 GCKPYPIAPC--------TSGNCPESKTPACSLSCQSGYSTAYAKDKHFGASAYAVARSV 234
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVE + T+Y D YK+G+YKH AG LG HAI+IIGWG E S
Sbjct: 235 AAIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTE-------S 287
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLVANS+ TNWGE+G F+I+RG ++CGIE + AG ++
Sbjct: 288 GSPYWLVANSWGTNWGESGFFKILRGDDQCGIEGAVVAGKARV 330
>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
pulchellus]
Length = 338
Score = 224 bits (572), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 117/285 (41%), Positives = 166/285 (58%), Gaps = 12/285 (4%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ A +N + S ++ MGV ++ RLP L+ S P + LPE FDAR +W C
Sbjct: 42 WKAGRNFDKNVPFSYIKGLMGV---ARNKTRRLPTLMHSSIP-DNLPESFDARQHWRKCN 97
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
+I IRDQ SCG+ WA GAVEA+SDR+CI ++G V +S+ DL++CC C GC+GG
Sbjct: 98 SIHVIRDQSSCGACWAFGAVEAISDRICIHTKGSVQVNISAQDLLTCCDYCRTGCKGGVP 157
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERY-MNGSHSSCQDNEPNTPECIRKCQPG 230
AW ++ GIV+GG Y ++ GC+PY I RY G ++ P C R+C+
Sbjct: 158 SYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRYTTTGLLPPPINDLSPMPPCKRECRKS 217
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y Y +D ++G Y+L +E I EIF++GPVE +YAD YK+G+Y+ +
Sbjct: 218 YGKKYSEDKHYGEKVYTLSGDEAQIKTEIFKNGPVEADFAVYADFYSYKSGVYQAHSRVR 277
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HAIRI+GWG E + V YWL ANS+ +WG+ G F+I
Sbjct: 278 CGSHAIRILGWGTE-------NGVPYWLAANSWTEHWGDKGYFKI 315
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 94/162 (58%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPN-TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I RY N+ + P C R+C+ Y Y +D ++G Y+L +E
Sbjct: 180 GCQPYSIHTTRYTTTGLLPPPINDLSPMPPCKRECRKSYGKKYSEDKHYGEKVYTLSGDE 239
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EIF++GPVE +YAD YK+G+Y+ + G HAIRI+GWG E +
Sbjct: 240 AQIKTEIFKNGPVEADFAVYADFYSYKSGVYQAHSRVRCGSHAIRILGWGTE-------N 292
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL ANS+ +WG+ G F+I RG NECGIE DI AG+PK
Sbjct: 293 GVPYWLAANSWTEHWGDKGYFKIRRGNNECGIEEDINAGIPK 334
>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 337
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 118/310 (38%), Positives = 172/310 (55%), Gaps = 10/310 (3%)
Query: 45 LLPKLPFYGAEKNALSKLT-LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDA 103
L P G + + ++ T +S++ + + + + + LP++ D + LP+ FD+
Sbjct: 32 FLNNHPSSGLKASKHNRFTAISDVYSALEYYGEKQFRHHILPIISHDDDNIL-LPDYFDS 90
Query: 104 RINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCG 163
R W CP+I+ I DQ C S WA+ +V A+SDR+CI + G V LS+ +LVSCC C
Sbjct: 91 REQWKNCPSIKRIYDQSQCYSSWAMASVAAISDRICIQTNGTVKVELSAIELVSCCSKCA 150
Query: 164 NGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPE 222
GC G+ AW YWV G+V+G + + GC PY P C+ + S+ C P
Sbjct: 151 VGCNFGYSESAWYYWVENGLVTGESNGNNSGCLPYPFPKCDHGSSDSYPMCGYVVYTPPV 210
Query: 223 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
C C+PGY + Y DD +FG+ AY + NE I REI +GPVE S+ IY D + YK+G+
Sbjct: 211 CNGTCRPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYGPVEASIFIYDDFVDYKSGV 270
Query: 283 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI 342
YKH+ G + ++RIIGWG E + + YWL ANS+N WG NG F+I E
Sbjct: 271 YKHLTGRLITIQSVRIIGWGIE-------NGIPYWLCANSWNEEWGLNGFFKILRGSNEC 323
Query: 343 PCERYMNGSR 352
E ++N R
Sbjct: 324 EIEAFVNAGR 333
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 80/186 (43%), Positives = 104/186 (55%), Gaps = 10/186 (5%)
Query: 309 GTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIR 367
G S Y+ V N T GE+ GC PY P C+ + S C P C
Sbjct: 156 GYSESAWYYWVENGLVT--GESNGNNSGCLPYPFPKCDHGSSDSYPMCGYVVYTPPVCNG 213
Query: 368 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 427
C+PGY + Y DD +FG+ AY + NE I REI +GPVE S+ IY D + YK+G+YKH
Sbjct: 214 TCRPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYGPVEASIFIYDDFVDYKSGVYKH 273
Query: 428 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIE 487
+ G + ++RIIGWG E + + YWL ANS+N WG NG F+I+RG NEC IE
Sbjct: 274 LTGRLITIQSVRIIGWGIE-------NGIPYWLCANSWNEEWGLNGFFKILRGSNECEIE 326
Query: 488 ADITAG 493
A + AG
Sbjct: 327 AFVNAG 332
>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
Length = 249
Score = 223 bits (569), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 111/219 (50%), Positives = 146/219 (66%), Gaps = 9/219 (4%)
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
+ S GS WA+ AVEAMSDR+CI S+GK+ V LS+DDL+SCCK CG GC GG AWKYW
Sbjct: 11 KSSSGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLLSCCKTCGFGCFGGEPMAAWKYW 70
Query: 179 VTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYE 236
V GIV+G Y + GCRPY PCE + N +H C+ + TP+C++KC Y SY+
Sbjct: 71 VLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYK 130
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
D +G+ Y++ +N E+I +EI GPVE S +Y D + Y GIYKHVAG G HA+
Sbjct: 131 ADKYYGQSVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAV 190
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+++GWG + +G V YWL ANS+NT+WGE+G FRI
Sbjct: 191 KVLGWG---IDQG----VPYWLAANSWNTDWGEDGYFRI 222
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 88/202 (43%), Positives = 121/202 (59%), Gaps = 25/202 (12%)
Query: 301 WGQEPLGEGTSSVVKYW----LVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SS 354
+G EP+ + KYW +V S TN GCRPY P CE + N +
Sbjct: 59 FGGEPM-----AAWKYWVLRGIVTGSEYTN-------HSGCRPYPFPPCEHHNNKTHYEP 106
Query: 355 CQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 414
C+ + TP+C++KC Y SY+ D +G+ Y++ +N E+I +EI GPVE S +Y
Sbjct: 107 CKHDLYPTPKCVKKCDKNYGKSYKADKYYGQSVYNVESNVESIQKEIMTLGPVEASFEVY 166
Query: 415 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 474
D + Y GIYKHVAG G HA++++GWG + +G V YWL ANS+NT+WGE+G
Sbjct: 167 TDFLYYTGGIYKHVAGSMGGGHAVKVLGWG---IDQG----VPYWLAANSWNTDWGEDGY 219
Query: 475 FRIVRGQNECGIEADITAGLPK 496
FRI+RG NECGIE+ I AG+PK
Sbjct: 220 FRILRGVNECGIESGIIAGIPK 241
>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
Length = 332
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 119/278 (42%), Positives = 163/278 (58%), Gaps = 22/278 (7%)
Query: 62 LTLSELEMR---MGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+ +SE M+ M V S P + + +++ L +PE FDAR WP C +I+ IR+
Sbjct: 50 VDVSEEFMKSRVMNVKYASPPPSDEIRA-TEVNTVLATIPETFDARTKWPKCKSIKLIRN 108
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
Q +CGS WA GA E +SDR+CIA++G R +S D+V CC + CG GC GG+ +A ++
Sbjct: 109 QANCGSCWAFGAAEVISDRICIATKGARQPVISPMDMVDCCGEYCGYGCDGGYSIQALRW 168
Query: 178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
WV G+V+GG Y GC+PY+ + + C D TPEC CQ Y+ Y
Sbjct: 169 WVFDGVVTGGDYQG-DGCKPYQFC-------NSAGCPD--AVTPECALSCQSKYNTEYAK 218
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 297
D NFG AY + I +I +GPVE S +Y D YK+G+YK++AG LG HAI+
Sbjct: 219 DKNFGTSAYYVGMTVNAIQTDIMTNGPVEASFKVYEDFYKYKSGVYKYIAGKMLGGHAIK 278
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IIGWG E GT+ YWL+ANS+ T WGENG F+I
Sbjct: 279 IIGWGTE---NGTA----YWLIANSWGTKWGENGFFKI 309
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 93/162 (57%), Gaps = 16/162 (9%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
GC+PY+ +S + TPEC CQ Y+ Y D NFG AY +
Sbjct: 184 GCKPYQFC---------NSAGCPDAVTPECALSCQSKYNTEYAKDKNFGTSAYYVGMTVN 234
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I +I +GPVE S +Y D YK+G+YK++AG LG HAI+IIGWG E GT+
Sbjct: 235 AIQTDIMTNGPVEASFKVYEDFYKYKSGVYKYIAGKMLGGHAIKIIGWGTE---NGTA-- 289
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+ T WGENG F+I RG NECGIE ++ AG +
Sbjct: 290 --YWLIANSWGTKWGENGFFKIRRGVNECGIENNVVAGKADV 329
>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
Length = 360
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 121/281 (43%), Positives = 163/281 (58%), Gaps = 14/281 (4%)
Query: 57 NALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
NAL+ L + +E R + + ++ L + D EE+P FDAR WP C +I I
Sbjct: 56 NALNILKMRVMESRFLDNEEGEM------LKEEDMDFSEEIPVSFDARDKWPKCTSIGFI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
RDQ CGS WA+ + E MSDR+C+ S G V LS D+++CC +CG GC GG +AW+
Sbjct: 110 RDQSHCGSCWAVSSAETMSDRLCVQSNGTIKVLLSDTDILACCPNCGAGCGGGHTIRAWE 169
Query: 177 YWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
Y+ TG+ +GG Y +K C+PY PC+ + S+ C + TP+C + CQ Y Y
Sbjct: 170 YFKNTGVCTGGLYGTKDSCKPYAFYPCK---DESYGKCPKDSFPTPKCRKICQYKYSKKY 226
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
DD + AY +P NE I EI R+GPV S IY D Y+ G+Y G LG HA
Sbjct: 227 ADDKYYANSAYRIPQNETWIKLEIMRNGPVTASFRIYPDFGFYEKGVYVTSGGRELGGHA 286
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE-NGLFRI 335
I+IIGWG E + GT + YWL+ANS+ T+WGE NG FRI
Sbjct: 287 IKIIGWGTEKVN-GTD--LPYWLIANSWGTDWGENNGYFRI 324
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 78/178 (43%), Positives = 101/178 (56%), Gaps = 10/178 (5%)
Query: 324 NTNWGENGLF--RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDD 380
NT GL+ + C+PY PC+ + S C + TP+C + CQ Y Y DD
Sbjct: 173 NTGVCTGGLYGTKDSCKPYAFYPCK---DESYGKCPKDSFPTPKCRKICQYKYSKKYADD 229
Query: 381 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 440
+ AY +P NE I EI R+GPV S IY D Y+ G+Y G LG HAI+I
Sbjct: 230 KYYANSAYRIPQNETWIKLEIMRNGPVTASFRIYPDFGFYEKGVYVTSGGRELGGHAIKI 289
Query: 441 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGE-NGLFRIVRGQNECGIEADITAGLPKI 497
IGWG E + GT + YWL+ANS+ T+WGE NG FRI+RGQN C IE + AG+ K+
Sbjct: 290 IGWGTEKVN-GTD--LPYWLIANSWGTDWGENNGYFRILRGQNHCQIEQKVIAGMIKV 344
>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
Length = 320
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 112/267 (41%), Positives = 156/267 (58%), Gaps = 21/267 (7%)
Query: 69 MRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWAL 128
M +G+ P+ + N +PLL + + LPE FD+R WP CP++ +IRDQG CGS + +
Sbjct: 55 MNLGLRPNESVA-NAVPLL-ENQRSVRSLPESFDSRQKWPNCPSLNQIRDQGCCGSCYVV 112
Query: 129 GAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGT 188
A++DR CI S G++ + D ++CC DC C GG+ GK W+YWV +G+ S G
Sbjct: 113 STAAAITDRYCIHSGGQKQFTFGATDYLACCTDCFK-CDGGYVGKTWQYWVDSGLTSEGP 171
Query: 189 YASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 248
Y S QGC Y GS+ N+P P C R CQ GY ++Y DL +G AY +
Sbjct: 172 YKSGQGCNSYPF-------GSYCV---NDP-LPTCSRTCQAGYPLTYSQDLKYGGSAYRV 220
Query: 249 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 308
NE IM EI+++GPV ++AD YK+G+Y+HV G G HA+R+IGWG E
Sbjct: 221 MWNENAIMTEIYQNGPVVVQFEVFADFYQYKSGVYRHVTGATEGWHAVRVIGWGVE---- 276
Query: 309 GTSSVVKYWLVANSFNTNWGENGLFRI 335
+ VKYWLVANS+ WG+ G F+
Sbjct: 277 ---NGVKYWLVANSWGVRWGDKGFFKF 300
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 75/170 (44%), Positives = 98/170 (57%), Gaps = 11/170 (6%)
Query: 327 WGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
W ++GL G C Y GS N+P P C R CQ GY ++Y DL +G
Sbjct: 161 WVDSGLTSEGPYKSGQGCNSYPFGSYC---VNDP-LPTCSRTCQAGYPLTYSQDLKYGGS 216
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
AY + NE IM EI+++GPV ++AD YK+G+Y+HV G G HA+R+IGWG E
Sbjct: 217 AYRVMWNENAIMTEIYQNGPVVVQFEVFADFYQYKSGVYRHVTGATEGWHAVRVIGWGVE 276
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ VKYWLVANS+ WG+ G F+ VRG+N GIE + AGLPK
Sbjct: 277 -------NGVKYWLVANSWGVRWGDKGFFKFVRGENHLGIEDFVYAGLPK 319
>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
Length = 325
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 106/252 (42%), Positives = 154/252 (61%), Gaps = 8/252 (3%)
Query: 77 SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSD 136
S + +NR P++ +D +++PE FDAR +WP C ++ IRDQ +CGS WA+ A+SD
Sbjct: 74 SFIGENREPIVGDENDEGDDIPESFDARTHWPNCSSLTHIRDQANCGSCWAVSTAAALSD 133
Query: 137 RVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCR 196
R+CI++ G + V +S+ D+++CC CG GCQGG+ +AW+Y G V+GG +K CR
Sbjct: 134 RICISTNGTKQVNISATDILTCCYKCGYGCQGGWPIEAWEYVAREGAVTGGRLLAKSCCR 193
Query: 197 PYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETI 255
+ PC + N ++ TP+C C PGY SY DD G+ AY LP + + I
Sbjct: 194 SHPFPPCGHHGNETYYGECGGRARTPKCRTSCTPGYKNSYSDDKIRGKDAYELPNSVKAI 253
Query: 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 315
REI ++GPV + T+YAD YK GIYKH AG G HA+++IGWG+E V
Sbjct: 254 QREIMKNGPVVAAFTVYADFSYYKKGIYKHTAGRARGSHAVKVIGWGEE-------GDVP 306
Query: 316 YWLVANSFNTNW 327
YW+V NS++ +W
Sbjct: 307 YWIVKNSWHNDW 318
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 77/139 (55%), Gaps = 8/139 (5%)
Query: 332 LFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 390
L + CR + P C + N + TP+C C PGY SY DD G+ AY L
Sbjct: 187 LAKSCCRSHPFPPCGHHGNETYYGECGGRARTPKCRTSCTPGYKNSYSDDKIRGKDAYEL 246
Query: 391 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 450
P + + I REI ++GPV + T+YAD YK GIYKH AG G HA+++IGWG+E
Sbjct: 247 PNSVKAIQREIMKNGPVVAAFTVYADFSYYKKGIYKHTAGRARGSHAVKVIGWGEE---- 302
Query: 451 GTSSVVKYWLVANSFNTNW 469
V YW+V NS++ +W
Sbjct: 303 ---GDVPYWIVKNSWHNDW 318
>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
Length = 342
Score = 221 bits (564), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 114/264 (43%), Positives = 163/264 (61%), Gaps = 13/264 (4%)
Query: 76 DSKLPQNRLPLLVQLS-DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAM 134
D K +L L+V+ DP ++P +D R W C T IRDQ +CGS WA+ A+
Sbjct: 65 DIKYKHQKLNLMVKEDPDPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAI 123
Query: 135 SDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQ 193
SDR+CIAS+ ++ V +S+ D+++CC+ CG+GC+GG+ +AWKY++ G+VSGG Y +K
Sbjct: 124 SDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKD 183
Query: 194 GCRPYEI-PCERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 251
CRPY I PC + N ++ C+ P TP C RKC+PG Y D +G+ AY + +
Sbjct: 184 VCRPYPIHPCGHHGNDTYYGECRGTAP-TPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQS 242
Query: 252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 311
+ I EI R+GPV S +Y D YK+GIYKH AG G HA+++IGWG E
Sbjct: 243 VKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNE------- 295
Query: 312 SVVKYWLVANSFNTNWGENGLFRI 335
+ +WL+ANS++ +WGE G FRI
Sbjct: 296 NNTDFWLIANSWHNDWGEKGYFRI 319
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 96/160 (60%), Gaps = 10/160 (6%)
Query: 337 CRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
CRPY I PC + N + C+ P TP C RKC+PG Y D +G+ AY + +
Sbjct: 185 CRPYPIHPCGHHGNDTYYGECRGTAP-TPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSV 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EI R+GPV S +Y D YK+GIYKH AG G HA+++IGWG E +
Sbjct: 244 KAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNE-------N 296
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
+WL+ANS++ +WGE G FRI+RG N+CGIE I AG+
Sbjct: 297 NTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAGI 336
>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 221 bits (564), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 113/244 (46%), Positives = 155/244 (63%), Gaps = 9/244 (3%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++PE +R WP C +++ IRDQ +CGS WA+ A+SDR+CIAS G++ V +S+ D+
Sbjct: 1 DIPESPYSRTKWPKCSSLKPIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDI 60
Query: 156 VSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC 213
+SCC + CG GC GG+ +A+ Y+ G V+GG Y + GCRPY PC + ++
Sbjct: 61 LSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGE 120
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
NE TP+C+RKCQ Y SY+ D + G+ AY P E+ REI ++GPV G+ T+Y
Sbjct: 121 CPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEEPNAEKATQREIMKNGPVVGAFTVYE 180
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D YK GIYKH AG G HAI+IIGWG+E V YWL+ANS++ +WGENG F
Sbjct: 181 DFSYYKKGIYKHTAGKARGGHAIKIIGWGKE-------GGVPYWLIANSWHNDWGENGYF 233
Query: 334 RIGC 337
RI C
Sbjct: 234 RILC 237
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 98/159 (61%), Gaps = 8/159 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY PC + + NE TP+C+RKCQ Y SY+ D + G+ AY P E
Sbjct: 100 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEEPNAE 159
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ REI ++GPV G+ T+Y D YK GIYKH AG G HAI+IIGWG+E
Sbjct: 160 KATQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE-------G 212
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
V YWL+ANS++ +WGENG FRI+ G N CGIE ++ AG
Sbjct: 213 GVPYWLIANSWHNDWGENGYFRILCGSNHCGIEENVVAG 251
>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 277
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 111/254 (43%), Positives = 159/254 (62%), Gaps = 12/254 (4%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
RLP+ + P E+LPE FDAR W +C +I IRDQ +CGS A GA EAMSDR+CI +
Sbjct: 12 RLPIRLHEEIP-EDLPESFDAREAWSHCDSIHLIRDQSTCGSCRAFGATEAMSDRICIHT 70
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
+G+ V +S+ DL++CC CG GC GG+ AW Y+ GIV+GG Y + GC+PY P
Sbjct: 71 KGRVQVNISAQDLLTCCHQCGMGCFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPP 130
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE + G +C D +P TP+C++ C+ GY+ SY +D F + YSL ++E I EI++
Sbjct: 131 CEHHTKGPLPNCTDTKP-TPKCLQVCRKGYEKSYSEDKYFAKTVYSLHSDETQIKTEIYK 189
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVE ++Y D + YK+G+Y+ + L E + +GW + SV WLVAN
Sbjct: 190 NGPVEADFSVYTDFLAYKSGVYQRHS-YELWEARHQNLGWALK-----RRSV---WLVAN 240
Query: 322 SFNTNWGENGLFRI 335
S+N +WG+ G F+I
Sbjct: 241 SWNQDWGDKGYFKI 254
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/162 (42%), Positives = 100/162 (61%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE + G +C +P TP+C++ C+ GY+ SY +D F + YSL ++E
Sbjct: 122 GCQPYYFPPCEHHTKGPLPNCTDTKP-TPKCLQVCRKGYEKSYSEDKYFAKTVYSLHSDE 180
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI+++GPVE ++Y D + YK+G+Y+ + L E + +GW + S
Sbjct: 181 TQIKTEIYKNGPVEADFSVYTDFLAYKSGVYQRHS-YELWEARHQNLGWALK-----RRS 234
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V WLVANS+N +WG+ G F+I RG NECGIE DI AG+PK
Sbjct: 235 V---WLVANSWNQDWGDKGYFKIRRGNNECGIENDINAGIPK 273
>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 347
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/278 (45%), Positives = 159/278 (57%), Gaps = 12/278 (4%)
Query: 63 TLSELEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
TL E+ +G + D + + R P + D ELP FDAR +WP C TI +IRDQ
Sbjct: 52 TLEEIRSVLGTMREDQNVKEFRRPTISH-EDITLELPSEFDAREHWPECRTIPQIRDQSG 110
Query: 122 CGSGWALGAVEAMSDRVCIASRGKR-HVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
CGS WA AV AMSDRVCI S +V+LS+ DL++CC CG GC GG+ G AW YW
Sbjct: 111 CGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATDLLACCTTCGFGCVGGWGGMAWDYWRD 170
Query: 181 TGIVSGGTYASKQGCRPYEIPCERYMNGSHSS---CQDNEPNTPECIRKCQPGYDVSYED 237
GIV+GG Y C PY P R+ S C + +TP+C+ +CQ GY YED
Sbjct: 171 NGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYED 230
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 297
D +Y+L + TI +EI+ GPVE +M +Y D Y G+YKH G LG HAIR
Sbjct: 231 DKIRASTSYNLYRSVTTIQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIR 290
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++GWG E G YWL ANS+N +WGE G FRI
Sbjct: 291 LLGWGVEEDG------TPYWLAANSWNPSWGEKGFFRI 322
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 78/180 (43%), Positives = 105/180 (58%), Gaps = 18/180 (10%)
Query: 327 WGENGLFRIG-------CRPYEIPCERYMNGSRSS----CQANEPNTPECIRKCQPGYDV 375
W +NG+ G C PY P R+ +G++ S C +TP+C+ +CQ GY
Sbjct: 168 WRDNGIVTGGEYKDSHTCLPYPFPPCRH-HGAKGSEYPPCPEKMYSTPQCVSECQKGYAT 226
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
YEDD +Y+L + TI +EI+ GPVE +M +Y D Y G+YKH G LG
Sbjct: 227 KYEDDKIRASTSYNLYRSVTTIQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGG 286
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
HAIR++GWG E G YWL ANS+N +WGE G FRI+RG + CGIE+D++AGLP
Sbjct: 287 HAIRLLGWGVEEDG------TPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340
>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
Length = 335
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 117/274 (42%), Positives = 159/274 (58%), Gaps = 11/274 (4%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
+ + + +G ++ +N V+ S +LPE FDAR WP C +I EI DQ SC
Sbjct: 53 IEQFKKHLGALEETPEERNTRRPTVRYSVSENDLPESFDAREKWPNCSSISEIPDQSSCS 112
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA+G AM+DR+CI S G++ RLS+ DLVSCC CG GC+GG+ AW YW GI
Sbjct: 113 SCWAVGTASAMTDRICIHSNGEKKPRLSAVDLVSCCPYCGYGCEGGYPSMAWDYWWRHGI 172
Query: 184 VSGGTYASKQGCRPYEIPCERYMNGSH--SSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
VSGGT + GC PY P ++ + + C TP+C ++CQ GY + E+D
Sbjct: 173 VSGGTLENPTGCLPYPFPKCSHLEETPGLAPCPRELYATPKCEKQCQAGYSKTSEEDKIK 232
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G+ +Y++ E IM EI +GPV I+ D +YK+GIY++ +G +G H IIGW
Sbjct: 233 GKSSYNVGDRETDIMMEIITNGPVSTIYYIFEDFTVYKSGIYQYTSGSLMGGHG--IIGW 290
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E + VKYWL ANS+N WGENG FRI
Sbjct: 291 GVE-------NGVKYWLAANSWNEGWGENGYFRI 317
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 95/162 (58%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIPCERYMNGS--RSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC PY P ++ + + C TP+C ++CQ GY + E+D G+ +Y++
Sbjct: 183 GCLPYPFPKCSHLEETPGLAPCPRELYATPKCEKQCQAGYSKTSEEDKIKGKSSYNVGDR 242
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E IM EI +GPV I+ D +YK+GIY++ +G +G H I IGWG E
Sbjct: 243 ETDIMMEIITNGPVSTIYYIFEDFTVYKSGIYQYTSGSLMGGHGI--IGWGVE------- 293
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ VKYWL ANS+N WGENG FRI RG NECGIE+ I AGLP
Sbjct: 294 NGVKYWLAANSWNEGWGENGYFRIRRGTNECGIESRINAGLP 335
>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
Length = 356
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 117/319 (36%), Positives = 180/319 (56%), Gaps = 21/319 (6%)
Query: 25 HSNGVFCDLSKAFDR----VDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLP 80
HS F + ++A+ + + K + E + ++ ++ MGV +SK
Sbjct: 25 HSKFTFDEPNQAYQNKLGNIAKKVNSLKTTWQAGENQRWQNMDIAGIKAHMGVLRESKSG 84
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINW-PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVC 139
N L ++S +E LP+ FD+R W CP++ E+RDQ +CGS WA A E++SDR+C
Sbjct: 85 IN----LEKVSTVVENLPKNFDSRKQWGSKCPSLNEVRDQSTCGSCWAFAAAESLSDRIC 140
Query: 140 IASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE 199
I + VRLS+++LVSCC CG+GC GG+ A +Y+V TG+V+G + C+ Y
Sbjct: 141 IHT--GEDVRLSTENLVSCCSSCGDGCNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAYS 198
Query: 200 IP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS--YEDDLNFGRIAYSLPANEETIM 256
P C ++ + E TPEC +KC V Y +DL G+ +YS+ ++ + IM
Sbjct: 199 FPPCAHHVASTKYPPCKGEVPTPECKKKCDDDSKVKRPYNEDLYKGQKSYSVSSDPKAIM 258
Query: 257 REIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKY 316
EI +GPVE + T+Y D + YK+G+Y+HV G LG HA+++IGWG E + Y
Sbjct: 259 TEIMNNGPVEVAFTVYEDFVTYKSGVYQHVTGEQLGGHAVKMIGWGVE-------NDTPY 311
Query: 317 WLVANSFNTNWGENGLFRI 335
WL+ NS+N WG+ G F+I
Sbjct: 312 WLIVNSWNETWGDQGTFKI 330
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/163 (40%), Positives = 98/163 (60%), Gaps = 10/163 (6%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVS--YEDDLNFGRIAYSLPAN 393
C+ Y P C ++ ++ E TPEC +KC V Y +DL G+ +YS+ ++
Sbjct: 194 CQAYSFPPCAHHVASTKYPPCKGEVPTPECKKKCDDDSKVKRPYNEDLYKGQKSYSVSSD 253
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ IM EI +GPVE + T+Y D + YK+G+Y+HV G LG HA+++IGWG E
Sbjct: 254 PKAIMTEIMNNGPVEVAFTVYEDFVTYKSGVYQHVTGEQLGGHAVKMIGWGVE------- 306
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ YWL+ NS+N WG+ G F+I+RG NECGIE ++ LP+
Sbjct: 307 NDTPYWLIVNSWNETWGDQGTFKILRGSNECGIEDEVVTALPQ 349
>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
Precursor
gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
Length = 342
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 177/317 (55%), Gaps = 20/317 (6%)
Query: 30 FCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDS-------KLPQN 82
C S A + I L G A + + + E+ PD K
Sbjct: 12 LCSQSGADENAAQGIPLEAQRLTGEPLVAYLRRSQNLFEVNSDPTPDFEQKIMSIKYKHQ 71
Query: 83 RLPLLVQLS-DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIA 141
+L L+V+ DP ++P +D R W C T IRDQ +CGS WA+ A+SDR+CIA
Sbjct: 72 KLNLMVKEDPDPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIA 130
Query: 142 SRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI 200
S+ ++ V +S+ D+++CC+ CG+GC+GG+ +AWKY++ G+VSGG Y +K CRPY I
Sbjct: 131 SKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPI 190
Query: 201 -PCERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
PC + N ++ C+ P TP C RKC+PG Y D +G+ AY + + + I E
Sbjct: 191 HPCGHHGNDTYYGECRGTAP-TPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSE 249
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWL 318
I ++GPV S +Y D YK+GIYKH AG G HA+++IGWG E + +WL
Sbjct: 250 ILKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNE-------NNTDFWL 302
Query: 319 VANSFNTNWGENGLFRI 335
+ANS++ +WGE G FRI
Sbjct: 303 IANSWHNDWGEKGYFRI 319
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 96/160 (60%), Gaps = 10/160 (6%)
Query: 337 CRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
CRPY I PC + N + C+ P TP C RKC+PG Y D +G+ AY + +
Sbjct: 185 CRPYPIHPCGHHGNDTYYGECRGTAP-TPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSV 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EI ++GPV S +Y D YK+GIYKH AG G HA+++IGWG E +
Sbjct: 244 KAIQSEILKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNE-------N 296
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
+WL+ANS++ +WGE G FRIVRG N+CGIE I AG+
Sbjct: 297 NTDFWLIANSWHNDWGEKGYFRIVRGSNDCGIEGTIAAGI 336
>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
Length = 343
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 108/241 (44%), Positives = 149/241 (61%), Gaps = 10/241 (4%)
Query: 98 PEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVS 157
P FDAR +WP C +I IRDQ SCGS WA+ + EAMSD +C+ S V +S D++S
Sbjct: 90 PASFDARTHWPECRSIGTIRDQSSCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDILS 149
Query: 158 CCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQ 214
CC CG GCQGG+ +A+K+ G+V+GG Y K+ C+PY PC + N + C
Sbjct: 150 CCGISCGYGCQGGWPIEAYKWMQRDGVVTGGKYRQKKVCKPYAFYPCGHHQNDPYYGPCP 209
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
TP+C + CQ Y+ SY++D +F AY LP NE I +EI+++GPV + +Y D
Sbjct: 210 GGLWPTPKCRKTCQRKYNKSYQEDKHFATRAYYLPNNERNIRQEIYKNGPVVAAFRVYQD 269
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK GIY H GG G HA++++GWG+E + YWL+ANS+NT+WGE+G FR
Sbjct: 270 FSYYKKGIYVHKWGGQTGAHAVKVVGWGRE-------NATDYWLIANSWNTDWGESGYFR 322
Query: 335 I 335
I
Sbjct: 323 I 323
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 72/163 (44%), Positives = 99/163 (60%), Gaps = 9/163 (5%)
Query: 337 CRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C+PY PC + N C TP+C + CQ Y+ SY++D +F AY LP NE
Sbjct: 188 CKPYAFYPCGHHQNDPYYGPCPGGLWPTPKCRKTCQRKYNKSYQEDKHFATRAYYLPNNE 247
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI+++GPV + +Y D YK GIY H GG G HA++++GWG+E +
Sbjct: 248 RNIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVVGWGRE-------N 300
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+NT+WGE+G FRIVRG NECGIEA + G ++
Sbjct: 301 ATDYWLIANSWNTDWGESGYFRIVRGTNECGIEAQMVGGAMRV 343
>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 113/264 (42%), Positives = 163/264 (61%), Gaps = 13/264 (4%)
Query: 76 DSKLPQNRLPLLVQLS-DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAM 134
D K RL L+V+ DP ++P +D R W C T IRDQ +CGS WA+ A+
Sbjct: 65 DIKYNHQRLNLMVKEDPDPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAI 123
Query: 135 SDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQ 193
SDR+CIAS+ ++ V +S+ D+++CC+ CG+GC+GG+ +AWKY++ G+VSGG Y +K
Sbjct: 124 SDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKG 183
Query: 194 GCRPYEI-PCERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 251
CRPY I PC + N ++ C+ P TP C ++C+PG Y D +G+ AY + +
Sbjct: 184 VCRPYPIHPCGHHGNDTYYGECRGTAP-TPPCKKECRPGVRKVYRIDKRYGKDAYIVKQS 242
Query: 252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 311
+ I EI R+GPV S +Y D YK+GIYKH AG G HA+++IGWG E
Sbjct: 243 VKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNE------- 295
Query: 312 SVVKYWLVANSFNTNWGENGLFRI 335
+ +WL+ANS++ +WGE G FRI
Sbjct: 296 NNTDFWLIANSWHNDWGEKGYFRI 319
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 96/160 (60%), Gaps = 10/160 (6%)
Query: 337 CRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
CRPY I PC + N + C+ P TP C ++C+PG Y D +G+ AY + +
Sbjct: 185 CRPYPIHPCGHHGNDTYYGECRGTAP-TPPCKKECRPGVRKVYRIDKRYGKDAYIVKQSV 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EI R+GPV S +Y D YK+GIYKH AG G HA+++IGWG E +
Sbjct: 244 KAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNE-------N 296
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
+WL+ANS++ +WGE G FRI+RG N+CGIE I AG+
Sbjct: 297 NTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAGI 336
>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
Length = 379
Score = 218 bits (555), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 108/242 (44%), Positives = 145/242 (59%), Gaps = 10/242 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR+ WP CPTI EI +QGSC S WA+ + MSDR+CI S + VRLS+ +L+
Sbjct: 113 IPAEFDARLRWPNCPTIGEIFEQGSCASCWAVAPTDVMSDRICIHSGSRHIVRLSAGNLL 172
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE-IPC--ERYMNGSHSSC 213
SCCK CG GC+GGF G AW +W GIV+GG+Y+S GC+ Y+ PC R + C
Sbjct: 173 SCCKLCGKGCKGGFPGGAWMHWSKHGIVTGGSYSSDYGCQKYQFFPCYQPRTKGSIKNKC 232
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
+ EC C+ Y+ SY+ DL +G Y +P + I EI +GPV+ ++ IY
Sbjct: 233 PKTDNTLLECRETCRTSYNKSYKQDLYYGESVYRIPNDARAIQLEIMENGPVQANLRIYE 292
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D + YK G+Y+HV G L HA++I GWG E GT YWL AN ++ WG G F
Sbjct: 293 DFLHYKFGVYRHVHGQGLEYHAVKIFGWGTE---GGTP----YWLAANPWSKRWGNGGFF 345
Query: 334 RI 335
+I
Sbjct: 346 KI 347
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 94/176 (53%), Gaps = 10/176 (5%)
Query: 336 GCRPYEI-PC--ERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
GC+ Y+ PC R ++ C + EC C+ Y+ SY+ DL +G Y +P
Sbjct: 210 GCQKYQFFPCYQPRTKGSIKNKCPKTDNTLLECRETCRTSYNKSYKQDLYYGESVYRIPN 269
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ I EI +GPV+ ++ IY D + YK G+Y+HV G L HA++I GWG E GT
Sbjct: 270 DARAIQLEIMENGPVQANLRIYEDFLHYKFGVYRHVHGQGLEYHAVKIFGWGTE---GGT 326
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEINL 508
YWL AN ++ WG G F+I+RG N IE + AG+PK+ L + NL
Sbjct: 327 P----YWLAANPWSKRWGNGGFFKILRGSNHAEIEDHVMAGIPKLDLVDEEEHFNL 378
>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
Length = 347
Score = 218 bits (555), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 125/278 (44%), Positives = 158/278 (56%), Gaps = 12/278 (4%)
Query: 63 TLSELEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
TL E+ +G + D + + R P + D ELP FDAR +WP C TI +IRDQ
Sbjct: 52 TLEEIRSVLGTMREDQNVKEFRRPTISH-EDITLELPSEFDAREHWPECRTIPQIRDQSG 110
Query: 122 CGSGWALGAVEAMSDRVCIASRGKR-HVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
CGS WA AV AMSDRVCI S +V+LS+ DL++CC CG GC GG+ G AW YW
Sbjct: 111 CGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATDLLACCTTCGFGCVGGWGGMAWDYWRD 170
Query: 181 TGIVSGGTYASKQGCRPYEIPCERYMNGSHSS---CQDNEPNTPECIRKCQPGYDVSYED 237
GIV+GG Y C PY P R+ S C + +TP+C+ +CQ GY YED
Sbjct: 171 NGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYED 230
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 297
D +Y+L + I +EI+ GPVE +M +Y D Y G+YKH G LG HAIR
Sbjct: 231 DKIRASTSYNLYRSVTAIQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIR 290
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++GWG E G YWL ANS+N +WGE G FRI
Sbjct: 291 LLGWGVEEDG------TPYWLAANSWNPSWGEKGFFRI 322
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 104/180 (57%), Gaps = 18/180 (10%)
Query: 327 WGENGLFRIG-------CRPYEIPCERYMNGSRSS----CQANEPNTPECIRKCQPGYDV 375
W +NG+ G C PY P R+ +G++ S C +TP+C+ +CQ GY
Sbjct: 168 WRDNGIVTGGEYKDSHTCLPYPFPPCRH-HGAKGSEYPPCPEKMYSTPQCVSECQKGYAT 226
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
YEDD +Y+L + I +EI+ GPVE +M +Y D Y G+YKH G LG
Sbjct: 227 KYEDDKIRASTSYNLYRSVTAIQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGG 286
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
HAIR++GWG E G YWL ANS+N +WGE G FRI+RG + CGIE+D++AGLP
Sbjct: 287 HAIRLLGWGVEEDG------TPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340
>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
Length = 346
Score = 218 bits (554), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 116/270 (42%), Positives = 166/270 (61%), Gaps = 16/270 (5%)
Query: 71 MGVHPDSKLPQNRLPLLVQLSDPLEE-LPEGFDARINWPYCPTIQ-EIRDQGSCGSGWAL 128
MGV P + P+ D E LPE FDAR WP C ++ I+DQ +CGS WA+
Sbjct: 63 MGVLPRNFNSFRFAPIKKSAEDESNEALPENFDARERWPECSSLLGSIKDQSNCGSCWAV 122
Query: 129 GAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGT 188
A SDR+CIA+ G LS++ L +CC CGNGC GG AW +++ GIV+GG
Sbjct: 123 SAASVFSDRLCIATGGAVARNLSAEQLNTCCYRCGNGCDGGSPESAWYFFMRHGIVTGGD 182
Query: 189 YASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPEC-IRKC-QPGYDVSYEDDLNFGRIA 245
Y S+ GC+PY I PC + N +C +++P+TP+C I+ C Y +Y DL++
Sbjct: 183 YGSEDGCQPYSIYPCGKGRN----TCIEDDPDTPDCSIKTCTNSNYSKNYRADLHYVDTV 238
Query: 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 305
YSL +EE IM++++++GPV+ + +Y D + YK+G+Y + G G HAI+I+GWG
Sbjct: 239 YSLSRSEEDIMKDLYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWG--- 295
Query: 306 LGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ +GT KYWL ANS++ +WGENGLFRI
Sbjct: 296 VDDGT----KYWLCANSWSRSWGENGLFRI 321
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 72/165 (43%), Positives = 109/165 (66%), Gaps = 14/165 (8%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPEC-IRKC-QPGYDVSYEDDLNFGRIAYSLPA 392
GC+PY I PC + R++C ++P+TP+C I+ C Y +Y DL++ YSL
Sbjct: 188 GCQPYSIYPCGK----GRNTCIEDDPDTPDCSIKTCTNSNYSKNYRADLHYVDTVYSLSR 243
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+EE IM++++++GPV+ + +Y D + YK+G+Y + G G HAI+I+GWG + +GT
Sbjct: 244 SEEDIMKDLYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWG---VDDGT 300
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
KYWL ANS++ +WGENGLFRI+RG NEC IE + AG+P +
Sbjct: 301 ----KYWLCANSWSRSWGENGLFRILRGNNECHIEDRVIAGMPHV 341
>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
Length = 348
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 111/258 (43%), Positives = 158/258 (61%), Gaps = 12/258 (4%)
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
+N LP+ S+ +++PE FD+R W CP+++ I DQ +CGS WA+ A + MSDR+CI
Sbjct: 82 ENVLPIANITSN--DDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCI 139
Query: 141 ASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE 199
S+G++ V LS+ D+++CC K CG GC GG++ +AWK+ G+V+GG Y K C+PY
Sbjct: 140 HSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYV 199
Query: 200 IP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
P C + + ++C + TP C CQ GY YE+D R Y LP +E TI E
Sbjct: 200 FPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKARTWYWLPNDERTIQLE 259
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWL 318
I + GPV + IY D Y+ G+Y H AG G H+I+IIGW G VKYWL
Sbjct: 260 IMQKGPVHATFNIYEDFEHYEGGVYIHTAGAMEGGHSIKIIGW-------GVDKGVKYWL 312
Query: 319 VANSFNTNWGEN-GLFRI 335
+ANS++T+WGE+ G FR+
Sbjct: 313 IANSWSTDWGEDGGYFRV 330
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 69/159 (43%), Positives = 91/159 (57%), Gaps = 9/159 (5%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
C+PY P C + + ++C ++ TP C CQ GY YE+D R Y LP +E
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKARTWYWLPNDER 254
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
TI EI + GPV + IY D Y+ G+Y H AG G H+I+IIGW G
Sbjct: 255 TIQLEIMQKGPVHATFNIYEDFEHYEGGVYIHTAGAMEGGHSIKIIGW-------GVDKG 307
Query: 456 VKYWLVANSFNTNWGEN-GLFRIVRGQNECGIEADITAG 493
VKYWL+ANS++T+WGE+ G FR+VRG N C IE + AG
Sbjct: 308 VKYWLIANSWSTDWGEDGGYFRVVRGINNCDIEGGVLAG 346
>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
Length = 216
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 101/203 (49%), Positives = 137/203 (67%), Gaps = 8/203 (3%)
Query: 134 MSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQ 193
M+DR+CI S G++ LS+ DL+SCC+DCG+GCQGGF G+AW YWVT GIV+GG+ +
Sbjct: 1 MTDRICIQSGGQQSAELSALDLISCCEDCGDGCQGGFPGQAWDYWVTQGIVTGGSKENHT 60
Query: 194 GCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
GC+PY P CE + G + +C TP+C + CQ GY YE D ++G +Y++ +NE
Sbjct: 61 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISNE 120
Query: 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 312
+ I +EI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIGWG E
Sbjct: 121 KAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKR------ 174
Query: 313 VVKYWLVANSFNTNWGENGLFRI 335
YWL+ANS+N +WGE GLFRI
Sbjct: 175 -TPYWLIANSWNEDWGEKGLFRI 196
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 81/182 (44%), Positives = 109/182 (59%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE + G +C TP+C + CQ GY
Sbjct: 44 YWVTQGIVTGGSKEN---HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYK 100
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
YE D ++G +Y++ +NE+ I +EI +GPVE + +Y D + YK+GIY+HV G +G
Sbjct: 101 TPYEQDKHYGDESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVG 160
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HAIRIIGWG E YWL+ANS+N +WGE GLFRIVRG++EC IE+ + AGL
Sbjct: 161 GHAIRIIGWGVEKR-------TPYWLIANSWNEDWGEKGLFRIVRGRDECSIESHVVAGL 213
Query: 495 PK 496
K
Sbjct: 214 IK 215
>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
Length = 278
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 106/243 (43%), Positives = 150/243 (61%), Gaps = 2/243 (0%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
+ ++ +G ++ +N L ++ +LPE FDAR WP C TI EIRDQ SCG
Sbjct: 30 VDHFKLDLGALSETPEERNALRPTIKHDISKNDLPESFDARSQWPQCWTISEIRDQASCG 89
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA A AMSDRVCI S G+ RL++ D +SCC CG GC+GG+ KAW YW+ GI
Sbjct: 90 SCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTYCGQGCRGGYPPKAWDYWMREGI 149
Query: 184 VSGGTYASKQGCRPYEIPCERYMNGS--HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
V+GGT+ ++ GC+P+ ++ S +S C P C R CQ GY+ +YE D +
Sbjct: 150 VTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPKPPCARACQTGYNKTYEQDKFY 209
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G +Y++ +E IM+EI ++GPVE + I+ D +Y++GIY HVAG +G HA+R+IGW
Sbjct: 210 GNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGW 269
Query: 302 GQE 304
G E
Sbjct: 270 GVE 272
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/115 (41%), Positives = 70/115 (60%), Gaps = 2/115 (1%)
Query: 334 RIGCRPYEIPCERYMNGSR--SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
R GC+P+ ++ SR S C P C R CQ GY+ +YE D +G +Y++
Sbjct: 158 RTGCQPWMFTKCDHVGDSRKYSRCPHYTYPKPPCARACQTGYNKTYEQDKFYGNSSYNVG 217
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+E IM+EI ++GPVE + I+ D +Y++GIY HVAG +G HA+R+IGWG E
Sbjct: 218 EHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVE 272
>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 340
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 119/296 (40%), Positives = 165/296 (55%), Gaps = 19/296 (6%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGV--HPD-SKLPQNRLPLLVQLSDPLEELPEGFDA 103
P + A K+T +L +G PD KLP +DP+ PE FDA
Sbjct: 39 PNSTWKAARYPHFEKMTREQLLGHLGSLDEPDWVKLPTKEFDPNAN-ADPI---PEFFDA 94
Query: 104 RINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-C 162
R WP C +I+ IRDQ +CGS WA A E SDR+CIAS +SS+DL+ CC D C
Sbjct: 95 REQWPNCQSIKLIRDQSTCGSCWAFAATETFSDRICIASNQTLQTSISSEDLLECCADYC 154
Query: 163 GNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTP 221
G GC+GG+ AW Y G+ +GG Y C+PY PC+ ++ G + C +P TP
Sbjct: 155 GMGCKGGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQPCGPIQP-TP 213
Query: 222 ECIRKCQPGYDV-SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
+C+++C Y +YE DL+F YS+ N + I REI HGPV+ S + AD + YK+
Sbjct: 214 QCVKECNSEYTQNTYEKDLHFASQTYSIKQNVQAIQREIMAHGPVQASFKVAADFLTYKS 273
Query: 281 GIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y ++ G H+++IIGWG+E YWL+ANS+N +WGE GLFR+
Sbjct: 274 GVYIRNPKLKYEGGHSVKIIGWGKE-------GNTPYWLIANSWNEDWGEKGLFRM 322
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 102/162 (62%), Gaps = 11/162 (6%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDV-SYEDDLNFGRIAYSLPANE 394
C+PY P C+ ++ G C +P TP+C+++C Y +YE DL+F YS+ N
Sbjct: 187 CKPYIFPPCDHHVTGQYQPCGPIQP-TPQCVKECNSEYTQNTYEKDLHFASQTYSIKQNV 245
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ I REI HGPV+ S + AD + YK+G+Y ++ G H+++IIGWG+E
Sbjct: 246 QAIQREIMAHGPVQASFKVAADFLTYKSGVYIRNPKLKYEGGHSVKIIGWGKE------- 298
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWL+ANS+N +WGE GLFR++RG+NECGIEA I AGLP
Sbjct: 299 GNTPYWLIANSWNEDWGEKGLFRMLRGRNECGIEAQIVAGLP 340
>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 325
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 107/240 (44%), Positives = 148/240 (61%), Gaps = 17/240 (7%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LP DAR WP C I +RDQ +CGS WA+ + M+DR+CI S + LS ++L
Sbjct: 83 DLPFEMDARKRWPQCKYIGFVRDQANCGSCWAVSSASVMTDRICIESIAAKQPLLSEEEL 142
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
VSCCK CG GC GG+ KA+ YW T GI +GG Y S +GC+PY I GS+S +
Sbjct: 143 VSCCKICGYGCDGGYPDKAFIYWATRGIPTGGPYGSTKGCKPYSI-------GSNS---E 192
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
+E TP C R+C Y + D +FG Y + +NEE IM+E++++GPV + +Y D
Sbjct: 193 DEAETPLCTRQCINEYPYNLSQDRHFGEKPYWVNSNEEQIMQELYKNGPVVVAFNVYEDF 252
Query: 276 ILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ Y G+Y+H G LG HA+++IGWG E + KYWL++NS+NT WGENG F+I
Sbjct: 253 MYYIKGVYEHRFGKFLGGHAVKLIGWGIE-------NSKKYWLISNSWNTTWGENGFFKI 305
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/162 (41%), Positives = 99/162 (61%), Gaps = 17/162 (10%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
GC+PY I GS S +E TP C R+C Y + D +FG Y + +NEE
Sbjct: 181 GCKPYSI-------GSNSE---DEAETPLCTRQCINEYPYNLSQDRHFGEKPYWVNSNEE 230
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
IM+E++++GPV + +Y D + Y G+Y+H G LG HA+++IGWG E +
Sbjct: 231 QIMQELYKNGPVVVAFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGIE-------NS 283
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
KYWL++NS+NT WGENG F+I+RG+N C IE+ + AG+ +I
Sbjct: 284 KKYWLISNSWNTTWGENGFFKIIRGKNCCAIESYVVAGMARI 325
>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
Schistosoma japonicum [Schistosoma japonicum]
Length = 312
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/229 (45%), Positives = 151/229 (65%), Gaps = 3/229 (1%)
Query: 77 SKLPQNRLPL-LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMS 135
+++ Q++L ++ +D +LP+ FD+R W C +I+ IRDQ SCGS WA GAVE+MS
Sbjct: 71 NRVDQHKLHHPIIHHNDINIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMS 130
Query: 136 DRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGC 195
DR+CI S+G+ + LS+ +L+SCC CG GC GG G AW YW GIV+GG+ + GC
Sbjct: 131 DRICIHSKGRISIELSAVNLLSCCSRCGFGCNGGIPGMAWDYWKDEGIVTGGSNETHTGC 190
Query: 196 RPYEIP-CERYMNG-SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 253
+PY P C + +HSSC+ +TPEC + CQP Y + YE+D +G+ +Y + ++E
Sbjct: 191 QPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSYYVTSDEV 250
Query: 254 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 302
+IM+EI +GPVE + +Y D + YKTG+YK+V G LG HAIRI G
Sbjct: 251 SIMKEILLNGPVEATFYVYDDFLNYKTGVYKYVTGSLLGGHAIRITWLG 299
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 71/111 (63%), Gaps = 2/111 (1%)
Query: 336 GCRPYEIP-CERYMNG-SRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC+PY P C + + SSC+ +TPEC + CQP Y + YE+D +G+ +Y + ++
Sbjct: 189 GCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSYYVTSD 248
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 444
E +IM+EI +GPVE + +Y D + YKTG+YK+V G LG HAIRI G
Sbjct: 249 EVSIMKEILLNGPVEATFYVYDDFLNYKTGVYKYVTGSLLGGHAIRITWLG 299
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
Length = 347
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/245 (42%), Positives = 153/245 (62%), Gaps = 12/245 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++P+ FDAR W C +++EIRDQG+CGS WA+ A +DR+CIAS K + +SS +L
Sbjct: 91 KVPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSREL 150
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQ 214
+SCC CG GC+GGF AW + G+V+GG Y S GC+PY I PCE +M GS +C
Sbjct: 151 MSCCSYCGFGCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEHHMEGSKPNCS 210
Query: 215 DN--EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
+ EP TP C C G ++Y+ D G+ AY +P E+ EIF++GP+ + +Y
Sbjct: 211 ASPTEP-TPACETTCTHGSSLAYQKDRQKGKSAYLVPVGEKQTQLEIFKNGPIVAAFKVY 269
Query: 273 ADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
D +YK+G+YK P G HA+++IGWG++ + + YWLV NS++ +WG+ G
Sbjct: 270 EDFFMYKSGVYKRHPESPFRGRHAVKVIGWGEQ-------NGLPYWLVQNSWDYDWGDKG 322
Query: 332 LFRIG 336
LF+I
Sbjct: 323 LFKIA 327
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 71/165 (43%), Positives = 103/165 (62%), Gaps = 13/165 (7%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQAN--EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
GC+PY I PCE +M GS+ +C A+ EP TP C C G ++Y+ D G+ AY +P
Sbjct: 189 GCQPYPIAPCEHHMEGSKPNCSASPTEP-TPACETTCTHGSSLAYQKDRQKGKSAYLVPV 247
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEG 451
E+ EIF++GP+ + +Y D +YK+G+YK P G HA+++IGWG++
Sbjct: 248 GEKQTQLEIFKNGPIVAAFKVYEDFFMYKSGVYKRHPESPFRGRHAVKVIGWGEQ----- 302
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+ + YWLV NS++ +WG+ GLF+I RG NEC E +TAGLPK
Sbjct: 303 --NGLPYWLVQNSWDYDWGDKGLFKIARG-NECDFEKSMTAGLPK 344
>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
Length = 341
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 106/240 (44%), Positives = 155/240 (64%), Gaps = 9/240 (3%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+PE +D RI W C ++ I DQ +CGS WA+ + AMSDR+CIAS+G + V +S+ D+V
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
SCC CG+GC+GG+ A+++ G+V+GG Y +K CRPYEI PC + N ++
Sbjct: 151 SCCTWCGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGECV 210
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
+TP C R+C GY SY D + + AY L + + I ++I ++GPV + T+Y D
Sbjct: 211 GMADTPRCKRRCLLGYPKSYPSDRYYKK-AYQLKNSVKAIQKDIMKNGPVVATYTVYEDF 269
Query: 276 ILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
Y++GIYKH AG G HA+++IGWG+E +GT YW+VANS++ +WGENG FR+
Sbjct: 270 AHYRSGIYKHKAGRKTGLHAVKVIGWGEE---KGTP----YWIVANSWHDDWGENGFFRM 322
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/158 (43%), Positives = 97/158 (61%), Gaps = 9/158 (5%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
CRPYEI PC + N + +TP C R+C GY SY D + + AY L + +
Sbjct: 189 CRPYEIHPCGHHGNETYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYKK-AYQLKNSVK 247
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I ++I ++GPV + T+Y D Y++GIYKH AG G HA+++IGWG+E +GT
Sbjct: 248 AIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEE---KGTP-- 302
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
YW+VANS++ +WGENG FR+ RG N+CG E + AG
Sbjct: 303 --YWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 338
>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
Length = 279
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 102/222 (45%), Positives = 143/222 (64%), Gaps = 3/222 (1%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ + R P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 56 SLDDARILMGARKEDAEMKRKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G++ LS+ DL+SCC+DCG+GCQGGF G AW YWV
Sbjct: 115 CGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGDGCQGGFPGVAWDYWVKR 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+GG+ + GC+PY P CE + G + +C TP+C + CQ GY YE D +
Sbjct: 175 GIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKH 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
+G +Y++ +NE+ I REI +GPVE + +Y D + YK+GI
Sbjct: 235 YGDESYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGI 276
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 59/110 (53%), Gaps = 4/110 (3%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE + G +C TP+C + CQ GY
Sbjct: 170 YWVKRGIVTGGSKEN---HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYK 226
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
YE D ++G +Y++ +NE+ I REI +GPVE + +Y D + YK+GI
Sbjct: 227 TPYEQDKHYGDESYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGI 276
>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 110/258 (42%), Positives = 157/258 (60%), Gaps = 12/258 (4%)
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
+N LP+ S+ +++PE FD+R W CP+++ I DQ +CGS WA+ A + MSDR+CI
Sbjct: 82 ENVLPVANITSN--DDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCI 139
Query: 141 ASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE 199
S+G++ V LS+ D+++CC K CG GC GG++ +AWK+ G+V+GG Y K C+PY
Sbjct: 140 HSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYV 199
Query: 200 IP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
P C + + ++C + TP C CQ GY YE+D + Y LP +E TI E
Sbjct: 200 FPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKAKTWYWLPNDERTIQLE 259
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWL 318
I + GPV + IY D Y G+Y H AG G H+I+IIGW G VKYWL
Sbjct: 260 IMKKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGW-------GVDKGVKYWL 312
Query: 319 VANSFNTNWGEN-GLFRI 335
+ANS++T+WGE+ G FR+
Sbjct: 313 IANSWSTDWGEDGGYFRV 330
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/159 (42%), Positives = 90/159 (56%), Gaps = 9/159 (5%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
C+PY P C + + ++C ++ TP C CQ GY YE+D + Y LP +E
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKAKTWYWLPNDER 254
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
TI EI + GPV + IY D Y G+Y H AG G H+I+IIGW G
Sbjct: 255 TIQLEIMKKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGW-------GVDKG 307
Query: 456 VKYWLVANSFNTNWGEN-GLFRIVRGQNECGIEADITAG 493
VKYWL+ANS++T+WGE+ G FR+VRG N C IE + AG
Sbjct: 308 VKYWLIANSWSTDWGEDGGYFRVVRGINNCDIEGGVLAG 346
>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 113/293 (38%), Positives = 169/293 (57%), Gaps = 18/293 (6%)
Query: 48 KLPFYGAEKNALSKLTLSEL---EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDAR 104
+ PF+ A+ + ++ L+ L E V K+P+ + +++PE FD+R
Sbjct: 49 RQPFFEAKYSPEAEQRLNHLMDTEFVRNVRKLHKIPRAEKAI------SNDDIPESFDSR 102
Query: 105 INWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCG 163
+ W C +I IRDQ +CGS WA+ A E MSDR+C+ S+G+ +S D+++CC ++CG
Sbjct: 103 VVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECG 162
Query: 164 NGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPE 222
GC GG KAW+Y G+V+GG Y K C+PY + PC + S +D+ TP
Sbjct: 163 RGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA 222
Query: 223 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
C + CQ GY YE D ++ + Y L +E+ I RE+ ++GPV+ + Y D Y+ GI
Sbjct: 223 CKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAASITYEDFSFYRRGI 282
Query: 283 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
Y H G G HA++++GWG E GT KYW VANS++T+WGE+G FRI
Sbjct: 283 YVHTRGRQRGAHAVKVVGWGVE---NGT----KYWNVANSWSTDWGEDGYFRI 328
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 90/167 (53%), Gaps = 8/167 (4%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGY 373
K W F G + C+PY + PC + S + + TP C + CQ GY
Sbjct: 172 KAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGY 231
Query: 374 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 433
YE D ++ + Y L +E+ I RE+ ++GPV+ + Y D Y+ GIY H G
Sbjct: 232 GKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAASITYEDFSFYRRGIYVHTRGRQR 291
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG 480
G HA++++GWG E GT KYW VANS++T+WGE+G FRI+RG
Sbjct: 292 GAHAVKVVGWGVE---NGT----KYWNVANSWSTDWGEDGYFRILRG 331
>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
Length = 288
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 115/270 (42%), Positives = 158/270 (58%), Gaps = 23/270 (8%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
++M +G++ +S+L N LP L Q + LP FDAR WPYCP++ +IR QGSCGS +
Sbjct: 21 MKMSLGLN-ESEL--NNLPRL-QNQRSVRALPASFDARQKWPYCPSLNQIRSQGSCGSCY 76
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A+ ++DR CI S G+R S +SCC DC C GG+ K + YWV G+ SG
Sbjct: 77 AVSTAAVITDRYCIHSGGERQFYFGSTGYLSCCTDCYK-CDGGYVHKTFDYWVKYGLTSG 135
Query: 187 GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
G Y S QGC+PY + QD +C R+CQ GY ++Y DL G +Y
Sbjct: 136 GPYHSGQGCKPYPFG---------GATQDVNI-VLKCDRQCQAGYPLTYSQDLKHGASSY 185
Query: 247 SLPANEETIMR-EIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 305
LP +E M+ EI+++GP+ S +Y D Y++G+Y+HV G G HA+R+IGWG E
Sbjct: 186 ILPWGDENAMKAEIYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKGSHAVRVIGWGVE- 244
Query: 306 LGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ VKYWL ANS+N WGENG F+I
Sbjct: 245 ------NGVKYWLCANSWNERWGENGFFKI 268
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 64/134 (47%), Positives = 87/134 (64%), Gaps = 8/134 (5%)
Query: 364 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMR-EIFRHGPVEGSMTIYADMILYKT 422
+C R+CQ GY ++Y DL G +Y LP +E M+ EI+++GP+ S +Y D Y++
Sbjct: 161 KCDRQCQAGYPLTYSQDLKHGASSYILPWGDENAMKAEIYQNGPIVTSFDVYGDFFQYRS 220
Query: 423 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN 482
G+Y+HV G G HA+R+IGWG E + VKYWL ANS+N WGENG F+IVRG+N
Sbjct: 221 GVYRHVTGAYKGSHAVRVIGWGVE-------NGVKYWLCANSWNERWGENGFFKIVRGEN 273
Query: 483 ECGIEADITAGLPK 496
G+E AGLPK
Sbjct: 274 HVGVEDISYAGLPK 287
>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 114/293 (38%), Positives = 167/293 (56%), Gaps = 18/293 (6%)
Query: 48 KLPFYGAEKNALSKLTLSEL---EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDAR 104
+ PF+ A+ + ++ L+ L E V K+P+ + +++PE FD+R
Sbjct: 49 RQPFFEAKYSPEAEQRLNHLMDTEFVRNVRKLHKIPRAEKAI------SNDDIPESFDSR 102
Query: 105 INWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCG 163
W C +I IRDQ +CGS WA+ A E MSDR+C+ S+G+ +S D+++CC ++CG
Sbjct: 103 EVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECG 162
Query: 164 NGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPE 222
GC GG KAW+Y G+V+GG Y K C+PY + PC + S +D+ TP
Sbjct: 163 RGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA 222
Query: 223 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
C + CQ GY YE D ++ + Y L +E+ I RE+ ++GPV+ + Y D Y GI
Sbjct: 223 CKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGI 282
Query: 283 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
Y H G G HA++++GWG E GT KYW VANS++T+WGENG FRI
Sbjct: 283 YVHTRGRQRGAHAVKVVGWGVE---NGT----KYWNVANSWSTDWGENGYFRI 328
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 89/167 (53%), Gaps = 8/167 (4%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGY 373
K W F G + C+PY + PC + S + + TP C + CQ GY
Sbjct: 172 KAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGY 231
Query: 374 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 433
YE D ++ + Y L +E+ I RE+ ++GPV+ + Y D Y GIY H G
Sbjct: 232 GKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQR 291
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG 480
G HA++++GWG E GT KYW VANS++T+WGENG FRI+RG
Sbjct: 292 GAHAVKVVGWGVE---NGT----KYWNVANSWSTDWGENGYFRILRG 331
>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
Length = 278
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 111/212 (52%), Positives = 136/212 (64%), Gaps = 4/212 (1%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LPE FDAR WP C +I++I DQ SCGS WA+ V AMSDRVCI S G LS+ DL
Sbjct: 62 DLPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAIDL 121
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSS--- 212
VSCC CGNGCQGG AW YW GIV+GGT + GC PY P R+ GS S
Sbjct: 122 VSCCSYCGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRH-PGSRSQLNP 180
Query: 213 CQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
C TP C CQ GYD +YE+D +G+ +Y++ +E TIM+EI ++GPVE +Y
Sbjct: 181 CPGYIYPTPSCYPYCQAGYDKTYEEDKVYGKTSYNVDRHEYTIMQEIMKNGPVEAGFIVY 240
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
D +YK+GIY HV+G G+HAIRIIGWG E
Sbjct: 241 TDFAVYKSGIYHHVSGRYAGKHAIRIIGWGVE 272
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/130 (44%), Positives = 76/130 (58%), Gaps = 11/130 (8%)
Query: 327 WGENGLFR-------IGCRPYEIPCERYMNGSRSS---CQANEPNTPECIRKCQPGYDVS 376
W NG+ GC PY P R+ GSRS C TP C CQ GYD +
Sbjct: 144 WWRNGIVTGGTLENPTGCLPYPFPQCRH-PGSRSQLNPCPGYIYPTPSCYPYCQAGYDKT 202
Query: 377 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 436
YE+D +G+ +Y++ +E TIM+EI ++GPVE +Y D +YK+GIY HV+G G+H
Sbjct: 203 YEEDKVYGKTSYNVDRHEYTIMQEIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKH 262
Query: 437 AIRIIGWGQE 446
AIRIIGWG E
Sbjct: 263 AIRIIGWGVE 272
>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
Length = 339
Score = 214 bits (545), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 119/273 (43%), Positives = 164/273 (60%), Gaps = 22/273 (8%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
++++ K P+ + L V ELPE FDAR WP+C +I IRD +CGS W
Sbjct: 65 MDIKYMTEASHKYPRKGINLNV-------ELPERFDAREKWPHCASIGLIRDHSACGSCW 117
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVS 185
A+ A MSDR+CI + G LSS D+++CC +DCG+GC+GG+ +A+ Y TG+ S
Sbjct: 118 AVSAASVMSDRLCIQTNGTNQKILSSADILACCGEDCGSGCEGGYPIQAYFYLENTGVCS 177
Query: 186 GGTYASKQGCRPYEI-PCERYMNGSHSSC-QDNEPNTPECIRKCQPGYDVSYEDDLNFGR 243
GG Y K C+PY PC+ G++ C ++ +TP+C + CQ Y V YE+D FG+
Sbjct: 178 GGEYREKNVCKPYPFYPCD----GNYGPCPKEGAFDTPKCRKICQFRYPVPYEEDKVFGK 233
Query: 244 IAYS-LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 302
++ L NE I +EIF +GPV + ++ D I YK GIYK G +G HAI++IGWG
Sbjct: 234 NSHILLQDNEARIRQEIFINGPVGANFYVFEDFIHYKEGIYKQTYGKWIGVHAIKLIGWG 293
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E GT YWLVANS+N +WGENG FRI
Sbjct: 294 TE---NGTD----YWLVANSYNYDWGENGTFRI 319
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 108/188 (57%), Gaps = 15/188 (7%)
Query: 308 EGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSC-QANEPNTPEC 365
EG + Y+ + N+ + GE + C+PY PC+ G+ C + +TP+C
Sbjct: 159 EGGYPIQAYFYLENTGVCSGGEYREKNV-CKPYPFYPCD----GNYGPCPKEGAFDTPKC 213
Query: 366 IRKCQPGYDVSYEDDLNFGRIAYSL-PANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
+ CQ Y V YE+D FG+ ++ L NE I +EIF +GPV + ++ D I YK GI
Sbjct: 214 RKICQFRYPVPYEEDKVFGKNSHILLQDNEARIRQEIFINGPVGANFYVFEDFIHYKEGI 273
Query: 425 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNEC 484
YK G +G HAI++IGWG E GT YWLVANS+N +WGENG FRI+RG N C
Sbjct: 274 YKQTYGKWIGVHAIKLIGWGTE---NGTD----YWLVANSYNYDWGENGTFRILRGTNHC 326
Query: 485 GIEADITA 492
IE+ + A
Sbjct: 327 LIESQVIA 334
>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
Length = 216
Score = 214 bits (545), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 100/203 (49%), Positives = 134/203 (66%), Gaps = 8/203 (3%)
Query: 134 MSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQ 193
M+DR+CI S G + LS+ DL+SCC+DCG GCQGGF G AW YWVT GIV+GG+ +
Sbjct: 1 MTDRICIQSGGGQSAELSALDLISCCEDCGQGCQGGFPGVAWDYWVTQGIVTGGSKENHT 60
Query: 194 GCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
GC+PY P CE + G + +C TP+C +KCQ GY Y+ D ++G +Y++ +NE
Sbjct: 61 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYKQDKHYGDESYNVISNE 120
Query: 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 312
+ I +EI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIGW G
Sbjct: 121 KAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGW-------GVKK 173
Query: 313 VVKYWLVANSFNTNWGENGLFRI 335
YWL+ANS+N +WGE GLFRI
Sbjct: 174 RTPYWLIANSWNEDWGEKGLFRI 196
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 80/182 (43%), Positives = 110/182 (60%), Gaps = 11/182 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE + G +C TP+C +KCQ GY
Sbjct: 44 YWVTQGIVTGGSKEN---HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYK 100
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
Y+ D ++G +Y++ +NE+ I +EI +GPVE + +Y D + YK+GIY+HV G +G
Sbjct: 101 TPYKQDKHYGDESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVG 160
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
HAIRIIGW G YWL+ANS+N +WGE GLFRIVRG++EC IE+++ AGL
Sbjct: 161 GHAIRIIGW-------GVKKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 213
Query: 495 PK 496
K
Sbjct: 214 IK 215
>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 332
Score = 214 bits (545), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/293 (38%), Positives = 165/293 (56%), Gaps = 18/293 (6%)
Query: 48 KLPFYGAEKNALSKLTLSEL---EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDAR 104
+ PF+ A+ + ++ L L E V K+P+ + +++PE FD+R
Sbjct: 49 RQPFFEAKYSPEAEQRLDHLMDTEFVRNVRKLHKIPRAEKAI------SNDDIPESFDSR 102
Query: 105 INWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCG 163
W C +I IRDQ +CGS WA+ A E MSDR+C+ S+G+ +S D+++CC +CG
Sbjct: 103 EVWKSCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGSECG 162
Query: 164 NGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPE 222
GC GG KAW+Y G+V+GG Y K C+PY + PC + S +D+ TP
Sbjct: 163 RGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA 222
Query: 223 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
C + CQ GY YE D ++ + Y L +E+ I RE+ ++GPV+ + Y D Y GI
Sbjct: 223 CKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGI 282
Query: 283 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
Y H G G HA++++GWG E GT KYW VANS++T+WGENG FRI
Sbjct: 283 YVHTRGRQRGAHAVKVVGWGVE---NGT----KYWNVANSWSTDWGENGYFRI 328
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 89/167 (53%), Gaps = 8/167 (4%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGY 373
K W F G + C+PY + PC + S + + TP C + CQ GY
Sbjct: 172 KAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGY 231
Query: 374 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 433
YE D ++ + Y L +E+ I RE+ ++GPV+ + Y D Y GIY H G
Sbjct: 232 GKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQR 291
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG 480
G HA++++GWG E GT KYW VANS++T+WGENG FRI+RG
Sbjct: 292 GAHAVKVVGWGVE---NGT----KYWNVANSWSTDWGENGYFRILRG 331
>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
Length = 319
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 106/272 (38%), Positives = 155/272 (56%), Gaps = 24/272 (8%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
L +L +G+HPD N +P ++ + +++P+ FDAR WP C ++ IRDQGSCG
Sbjct: 52 LYKLNGFLGLHPDP----NYMPEKIKHNFNPQDIPKTFDARKKWPKCDSLNRIRDQGSCG 107
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA AVE MSDR+CI S G + S++DL+SCC CG+ C GG+ A+ +++ G+
Sbjct: 108 SCWAFAAVETMSDRICIHSSGAKKFFFSAEDLLSCCTACGS-CSGGYMMAAFDFYIKQGV 166
Query: 184 VSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGR 243
VSGG S +GCRPY ++ TP C + C+ GY SY D ++G
Sbjct: 167 VSGGDLNSNEGCRPYTADAH------------DKGVTPSCTKSCRKGYPTSYSSDKHYGS 214
Query: 244 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 303
Y + A I EI +GP+ S +Y D Y +G+Y HV+G G H ++I+GWG
Sbjct: 215 KDYIVDAGVSNIQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIVGWGT 274
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E + YWL+ANS+ ++WGE+G F+I
Sbjct: 275 EKEQD-------YWLIANSWGSSWGEHGFFKI 299
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 89/162 (54%), Gaps = 19/162 (11%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
GCRPY ++ TP C + C+ GY SY D ++G Y + A
Sbjct: 177 GCRPYTADAH------------DKGVTPSCTKSCRKGYPTSYSSDKHYGSKDYIVDAGVS 224
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I EI +GP+ S +Y D Y +G+Y HV+G G H ++I+GWG E +
Sbjct: 225 NIQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIVGWGTEKEQD----- 279
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+ ++WGE+G F+I+RG+NECGIE + A LPK+
Sbjct: 280 --YWLIANSWGSSWGEHGFFKILRGKNECGIENNPYAVLPKL 319
>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
Length = 332
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 113/293 (38%), Positives = 167/293 (56%), Gaps = 18/293 (6%)
Query: 48 KLPFYGAEKNALSKLTLSEL---EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDAR 104
+ PF+ A+ + ++ L+ L E V K+P+ + +++PE FD+R
Sbjct: 49 RQPFFEAKYSPEAEQRLNHLMDTEFVRNVRKLHKIPRAEKAI------SNDDIPESFDSR 102
Query: 105 INWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCG 163
W C +I IRDQ +CGS WA+ A E MSDR+C+ S+G+ +S D+++CC ++CG
Sbjct: 103 EVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECG 162
Query: 164 NGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPE 222
GC GG KAW+Y G+V+GG Y K C+PY + PC + S +D+ TP
Sbjct: 163 RGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA 222
Query: 223 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
C + CQ GY YE D ++ + Y L +E+ I RE+ ++GPV+ + Y D Y GI
Sbjct: 223 CKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGI 282
Query: 283 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
Y H G G HA++++GWG E GT KYW VANS++T+WGE+G FRI
Sbjct: 283 YVHTRGRQRGAHAVKVVGWGVE---NGT----KYWNVANSWSTDWGEDGYFRI 328
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 89/167 (53%), Gaps = 8/167 (4%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGY 373
K W F G + C+PY + PC + S + + TP C + CQ GY
Sbjct: 172 KAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGY 231
Query: 374 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 433
YE D ++ + Y L +E+ I RE+ ++GPV+ + Y D Y GIY H G
Sbjct: 232 GKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQR 291
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG 480
G HA++++GWG E GT KYW VANS++T+WGE+G FRI+RG
Sbjct: 292 GAHAVKVVGWGVE---NGT----KYWNVANSWSTDWGEDGYFRILRG 331
>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 232
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 106/219 (48%), Positives = 141/219 (64%), Gaps = 9/219 (4%)
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
Q SCGS WA+GAVEAM+DR+CIAS+G + V +S+DDL+SCC +CG GC G AW YW
Sbjct: 2 QSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCDECGFGCDGRDPYAAWSYW 61
Query: 179 VTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYE 236
V+ GIV+G Y SK GC+PY PCE ++ H C + T C KCQ GY +SY
Sbjct: 62 VSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSISYN 121
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
D ++G Y++ + +I +EI +GPVE + +Y D Y +GIYKH G LG HA+
Sbjct: 122 SDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAV 181
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+++GWG E GT YW+ ANS+N++WGENG FRI
Sbjct: 182 KMLGWGTE---NGT----DYWICANSWNSDWGENGFFRI 213
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 77/183 (42%), Positives = 107/183 (58%), Gaps = 12/183 (6%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYM-NGSRSSCQANEPNTPECIRKCQPGY 373
YW V+N T G N + GC+PY P CE ++ C + T C KCQ GY
Sbjct: 60 YW-VSNGIVT--GSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGY 116
Query: 374 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 433
+SY D ++G Y++ + +I +EI +GPVE + +Y D Y +GIYKH G L
Sbjct: 117 SISYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYL 176
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
G HA++++GWG E GT YW+ ANS+N++WGENG FRI+RG +EC IE+ + AG
Sbjct: 177 GGHAVKMLGWGTE---NGT----DYWICANSWNSDWGENGFFRILRGVDECEIESGVVAG 229
Query: 494 LPK 496
PK
Sbjct: 230 EPK 232
>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 110/258 (42%), Positives = 156/258 (60%), Gaps = 12/258 (4%)
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
+N LP+ S+ +++PE FD+R W CP+++ I DQ +CGS WA+ A + MSDR+CI
Sbjct: 82 ENVLPIANITSN--DDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCI 139
Query: 141 ASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE 199
S+G++ V LS+ D+++CC K CG GC GG++ +AWK+ G+V+GG Y K C+PY
Sbjct: 140 HSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYV 199
Query: 200 IP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
P C + + ++C + TP CQ GY YE+D R Y LP +E TI E
Sbjct: 200 FPQCGAHKGKAFNNCPSHPYATPARKPYCQYGYGKRYENDKIKARTWYWLPNDERTIQLE 259
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWL 318
I + GPV + IY D Y G+Y H AG G H+I+IIGW G VKYWL
Sbjct: 260 IMQKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGW-------GVDKGVKYWL 312
Query: 319 VANSFNTNWGEN-GLFRI 335
+ANS++T+WGE+ G FR+
Sbjct: 313 IANSWSTDWGEDGGYFRV 330
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/159 (42%), Positives = 89/159 (55%), Gaps = 9/159 (5%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
C+PY P C + + ++C ++ TP CQ GY YE+D R Y LP +E
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCPSHPYATPARKPYCQYGYGKRYENDKIKARTWYWLPNDER 254
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
TI EI + GPV + IY D Y G+Y H AG G H+I+IIGW G
Sbjct: 255 TIQLEIMQKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGW-------GVDKG 307
Query: 456 VKYWLVANSFNTNWGEN-GLFRIVRGQNECGIEADITAG 493
VKYWL+ANS++T+WGE+ G FR+VRG N C IE + AG
Sbjct: 308 VKYWLIANSWSTDWGEDGGYFRVVRGINNCDIEGGVLAG 346
>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
Length = 375
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 119/285 (41%), Positives = 166/285 (58%), Gaps = 39/285 (13%)
Query: 74 HPDSKLPQNRLPLLVQ-LSDPLEE----------------LPEGFDARINWPYCPTIQEI 116
H D + + ++ Q +DPLEE LP+ FDAR WP C +++ I
Sbjct: 55 HNDISEDEMKFKVMDQRFADPLEEEVQDEGLVRGEVVPEPLPDTFDARDQWPDCKSLKFI 114
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAW 175
R+Q SCGS WA GA E +SDRVCI S G + +S++D++SCC CG GCQGG+ +A
Sbjct: 115 RNQASCGSCWAFGAAEVISDRVCIQSNGTQQPIISAEDILSCCGSTCGKGCQGGYTIEAM 174
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
KYW+ +G+V+GG Y + GC PY PC++ S C E +TP C CQ Y +
Sbjct: 175 KYWMNSGVVTGGDY-NGAGCMPYSFPPCKK------SPCV--EFSTPSCKTTCQEKYTTA 225
Query: 235 -YEDDLNFGRIAYSLPANEE---TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
Y++D +F AY L + TI EI+ +GPVE S ++ D YK+G+Y HV+G
Sbjct: 226 DYKNDKHFATSAYKLSTTKNAVPTIQYEIYHNGPVEASYRVFEDFYQYKSGVYHHVSGNL 285
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G HA++IIGWG E + V YWLVANS+ T++GE G F+I
Sbjct: 286 VGGHAVKIIGWGTE-------NGVDYWLVANSWGTSFGEKGFFKI 323
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 84/210 (40%), Positives = 117/210 (55%), Gaps = 26/210 (12%)
Query: 310 TSSVVKYWLVANSFNTNWGE-NGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIR 367
T +KYW+ NS G+ NG GC PY P C++ S C E +TP C
Sbjct: 170 TIEAMKYWM--NSGVVTGGDYNGA---GCMPYSFPPCKK------SPCV--EFSTPSCKT 216
Query: 368 KCQPGYDVS-YEDDLNFGRIAYSLPANEE---TIMREIFRHGPVEGSMTIYADMILYKTG 423
CQ Y + Y++D +F AY L + TI EI+ +GPVE S ++ D YK+G
Sbjct: 217 TCQEKYTTADYKNDKHFATSAYKLSTTKNAVPTIQYEIYHNGPVEASYRVFEDFYQYKSG 276
Query: 424 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
+Y HV+G +G HA++IIGWG E + V YWLVANS+ T++GE G F+I RG NE
Sbjct: 277 VYHHVSGNLVGGHAVKIIGWGTE-------NGVDYWLVANSWGTSFGEKGFFKIRRGTNE 329
Query: 484 CGIEADITAGLPKIGLEIDSNEINLGKMMT 513
C IE++I AGL K+G + + + G +
Sbjct: 330 CQIESNIVAGLAKLGTHNEKTDDDDGSATS 359
>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
Length = 323
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 109/272 (40%), Positives = 156/272 (57%), Gaps = 20/272 (7%)
Query: 66 ELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSG 125
E G++ P + ++++ L +P FD+R W C +I+ IRDQ CGS
Sbjct: 50 EFMKSRGMNVKYAAPHSDEIRSTEVNNVLPFIPPSFDSRTRWSNCTSIEMIRDQAQCGSC 109
Query: 126 WALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIV 184
WA E +SDR+CIA++G + +S D+++CC CG+GC+GG+ +A+++W + G+V
Sbjct: 110 WAFSTAEVISDRICIATKGTQQPTISPTDMLACCGNSCGDGCKGGYPIQAFRWWNSRGVV 169
Query: 185 SGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGR 243
+GG + GCRPY PC SC E TP C CQ GY +Y D FG
Sbjct: 170 TGGDFRG-SGCRPYPFAPC--------ISCP--EEKTPTCSLSCQFGYSTAYAKDKRFGV 218
Query: 244 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 303
AY++ N I EI +GPV G+ T+Y DM YK+G+Y+H AG LG HAI+IIGW
Sbjct: 219 SAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGW-- 276
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GT + + YWL+ANS+ NWGENG ++
Sbjct: 277 -----GTQNGIPYWLIANSWGANWGENGFLKM 303
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 96/163 (58%), Gaps = 18/163 (11%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY PC SC E TP C CQ GY +Y D FG AY++ N
Sbjct: 178 GCRPYPFAPC--------ISCP--EEKTPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNV 227
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPV G+ T+Y DM YK+G+Y+H AG LG HAI+IIGW GT +
Sbjct: 228 AAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGW-------GTQN 280
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL+ANS+ NWGENG ++ RG NECGIE + AG+P++
Sbjct: 281 GIPYWLIANSWGANWGENGFLKMRRGVNECGIERAVVAGMPRV 323
>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
Length = 342
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 119/287 (41%), Positives = 162/287 (56%), Gaps = 15/287 (5%)
Query: 51 FYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYC 110
FY AE S L + M +K QN +V+ D LPE FDAR WP C
Sbjct: 49 FYKAE---YSPLVEQYAKAVMRSEFMTKPNQN---YVVKDVDLNINLPETFDAREKWPNC 102
Query: 111 PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF 170
+I+ IRDQ +CGS WA+ A MSDR+CI S G S D++SCC +CG GC GG
Sbjct: 103 TSIRTIRDQSNCGSCWAVSAASVMSDRLCIQSNGTIQSWASDTDILSCCWNCGMGCDGGR 162
Query: 171 HGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQDNEPNTPECIRKCQ 228
A+ + + G+ +GG + C+PY PC R+ N + C TP+C + CQ
Sbjct: 163 PFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYFGPCPKELWPTPKCRKMCQ 222
Query: 229 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG 288
Y+V+Y+DD +G AYSLP NE IM+EIF +GPV GS +++AD +YK G+Y
Sbjct: 223 LKYNVAYKDDKIYGNDAYSLPNNETRIMQEIFTNGPVVGSFSVFADFAIYKKGVYVSNGI 282
Query: 289 GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA++IIGWG + +G +KYWL+ANS+N +WG+ G R
Sbjct: 283 QQNGAHAVKIIGWGVQ---DG----LKYWLIANSWNNDWGDEGYVRF 322
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 72/163 (44%), Positives = 100/163 (61%), Gaps = 9/163 (5%)
Query: 337 CRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C+PY PC R+ N C TP+C + CQ Y+V+Y+DD +G AYSLP NE
Sbjct: 187 CKPYAFYPCGRHQNQKYFGPCPKELWPTPKCRKMCQLKYNVAYKDDKIYGNDAYSLPNNE 246
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
IM+EIF +GPV GS +++AD +YK G+Y G HA++IIGWG + +G
Sbjct: 247 TRIMQEIFTNGPVVGSFSVFADFAIYKKGVYVSNGIQQNGAHAVKIIGWGVQ---DG--- 300
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+KYWL+ANS+N +WG+ G R +RG N CGIE+ + G K+
Sbjct: 301 -LKYWLIANSWNNDWGDEGYVRFLRGDNHCGIESRVVTGTMKV 342
>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 551
Score = 211 bits (538), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 115/304 (37%), Positives = 174/304 (57%), Gaps = 22/304 (7%)
Query: 63 TLSELEMRMGVH--PDSKLPQNRLPLLVQLSDPLEEL--PEGFDARINWPYC-PTIQEIR 117
++ E++ +G P + +N +P+ L + LE P FD+R +WP C I I+
Sbjct: 252 SIGEIKKLLGYRMLPKTVKERNEMPMPEDLLN-LENFNYPVEFDSRKHWPQCEKVISFIK 310
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQ +CGS WA+ + MSDR CIA+ G+ LS +L+SCC CG GC GG+ + +KY
Sbjct: 311 DQANCGSCWAVSSASVMSDRTCIATDGQFTTLLSDAELLSCCTSCGYGCNGGYPQRTFKY 370
Query: 178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
WV +G+ +GG Y S C+PY IP S+C +E TP+C + C Y +S +
Sbjct: 371 WVYSGMPTGGPYGSNDTCKPYPIP-------PCSNC--SETRTPKCSKSCISTYPLSLNE 421
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 297
D ++G Y E+++M++I +GP+ M++Y D + YK G+Y +G LG HA+R
Sbjct: 422 DRHYGSTYYQFWLGEKSMMKDISLYGPIVAGMSVYEDFLHYKEGVYTQESGIFLGGHAVR 481
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMNGSRSSCQA 357
IIGWG++ + YWLVANS+NT +GE+GLF+I E E Y++ R+ C+
Sbjct: 482 IIGWGEQ-------DNIPYWLVANSWNTTFGEDGLFKIRRGFDECGIESYVSAGRAKCKQ 534
Query: 358 NEPN 361
N N
Sbjct: 535 NISN 538
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 67/168 (39%), Positives = 101/168 (60%), Gaps = 16/168 (9%)
Query: 337 CRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEET 396
C+PY IP S+C +E TP+C + C Y +S +D ++G Y E++
Sbjct: 388 CKPYPIP-------PCSNC--SETRTPKCSKSCISTYPLSLNEDRHYGSTYYQFWLGEKS 438
Query: 397 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 456
+M++I +GP+ M++Y D + YK G+Y +G LG HA+RIIGWG++ +
Sbjct: 439 MMKDISLYGPIVAGMSVYEDFLHYKEGVYTQESGIFLGGHAVRIIGWGEQ-------DNI 491
Query: 457 KYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSN 504
YWLVANS+NT +GE+GLF+I RG +ECGIE+ ++AG K I +N
Sbjct: 492 PYWLVANSWNTTFGEDGLFKIRRGFDECGIESYVSAGRAKCKQNISNN 539
>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
Length = 347
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 110/242 (45%), Positives = 144/242 (59%), Gaps = 11/242 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LPE FDAR WP CP+I IRDQ + G WA+ + E M+DR+CI S G + V +S D++
Sbjct: 94 LPESFDAREKWPECPSIGLIRDQSAGGGCWAVSSAEVMTDRICIQSNGTKQVYVSETDIL 153
Query: 157 SCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMN-GSHSSC 213
SCC + CG+GC G +A+ Y + G+ SGG Y +K C+PY PC + + + C
Sbjct: 154 SCCGQRCGSGCTSGVPRQAFNYAIRKGVCSGGPYGTKGVCKPYPFYPCGYHAHLPYYGPC 213
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
D TP C + CQ Y V Y DD FG L EE I REIF +GP+ + T+Y
Sbjct: 214 PDGMWPTPTCEKACQSDYTVPYNDDRIFGSKTIVL-TGEEKIKREIFNNGPLVATYTVYE 272
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D YK GIY G G HA++IIGWG+E + VKYWL+ANS+NT+WGENG F
Sbjct: 273 DFAYYKNGIYMTGLGRATGAHAVKIIGWGEE-------NGVKYWLIANSWNTDWGENGFF 325
Query: 334 RI 335
R+
Sbjct: 326 RM 327
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 72/163 (44%), Positives = 90/163 (55%), Gaps = 10/163 (6%)
Query: 337 CRPYEI-PCERYMN-GSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C+PY PC + + C TP C + CQ Y V Y DD FG L E
Sbjct: 193 CKPYPFYPCGYHAHLPYYGPCPDGMWPTPTCEKACQSDYTVPYNDDRIFGSKTIVL-TGE 251
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E I REIF +GP+ + T+Y D YK GIY G G HA++IIGWG+E +
Sbjct: 252 EKIKREIFNNGPLVATYTVYEDFAYYKNGIYMTGLGRATGAHAVKIIGWGEE-------N 304
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
VKYWL+ANS+NT+WGENG FR++RG N C IE T G K+
Sbjct: 305 GVKYWLIANSWNTDWGENGFFRMLRGTNLCDIELSATGGTFKV 347
>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 223
Score = 211 bits (537), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 104/225 (46%), Positives = 142/225 (63%), Gaps = 9/225 (4%)
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A GAVEAMSDRVCI S G+ V +S++DL+ CC CG+GC GG AW+YW G+VSG
Sbjct: 1 AFGAVEAMSDRVCIHSNGRVQVDISAEDLMDCCDKCGSGCSGGVSAAAWQYWKDAGLVSG 60
Query: 187 GTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA 245
G Y + GC+PY + PCE GS C P TP+C R+C+ GY+ SY+DD F +
Sbjct: 61 GLYNTTDGCKPYSLAPCEHSSQGSLPECVGTLP-TPKCKRQCREGYERSYDDDKYFAKNV 119
Query: 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 305
YS+ +E+ I EIF++GPVE T YAD + YK+G+Y+H + +G HAIRI+GWG E
Sbjct: 120 YSINGSEKQIRTEIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWGSED 179
Query: 306 LGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMNG 350
YWL+ANS+N +WG++G F++ E E ++N
Sbjct: 180 NN-------PYWLLANSWNEDWGDHGYFKMLRGVNECDIESFVNA 217
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 107/163 (65%), Gaps = 9/163 (5%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY + PCE GS C P TP+C R+C+ GY+ SY+DD F + YS+ +E
Sbjct: 68 GCKPYSLAPCEHSSQGSLPECVGTLP-TPKCKRQCREGYERSYDDDKYFAKNVYSINGSE 126
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I EIF++GPVE T YAD + YK+G+Y+H + +G HAIRI+GWG E
Sbjct: 127 KQIRTEIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWGSEDNN----- 181
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+N +WG++G F+++RG NEC IE+ + AG+PK+
Sbjct: 182 --PYWLLANSWNEDWGDHGYFKMLRGVNECDIESFVNAGIPKL 222
>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
Length = 324
Score = 211 bits (537), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 106/244 (43%), Positives = 145/244 (59%), Gaps = 22/244 (9%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
E +PE FD R +W CP+++ IR+QG+CGS WA G+VE M+DR+CIAS+GK S+DD
Sbjct: 80 EAIPETFDGRTHWSQCPSLKNIRNQGNCGSCWAFGSVEVMTDRLCIASKGKTKFEFSADD 139
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQ 214
L++CC CG GC GG +A++YWV GIVSGG Y S +GC+PYE ++N
Sbjct: 140 LLACCTACGKGCDGGAPYRAFEYWVAKGIVSGGDYNSNEGCQPYE--GSAFLNSV----- 192
Query: 215 DNEPNTPECIRKC-QPGYDVSYEDDLNFGR-IAYSLPANEETIMREIFRHGPVEGSMTIY 272
TP+C KC Y Y D ++G Y N I EI +GPV M +Y
Sbjct: 193 -----TPKCSTKCLNSKYTTPYAKDKHYGTDFIYMTSKNVAEIQTEIMNNGPVVTHMDVY 247
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE-NG 331
D YK+G+Y+HV+G +G HA++IIGWG E V YWL+ANS+ W + +G
Sbjct: 248 EDFYSYKSGVYQHVSGNSMGGHAVKIIGWGTE-------KGVPYWLIANSWGAKWADLDG 300
Query: 332 LFRI 335
++I
Sbjct: 301 FYKI 304
Score = 108 bits (271), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 86/165 (52%), Gaps = 22/165 (13%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKC-QPGYDVSYEDDLNFGR-IAYSLPAN 393
GC+PYE ++N TP+C KC Y Y D ++G Y N
Sbjct: 179 GCQPYE--GSAFLNSV----------TPKCSTKCLNSKYTTPYAKDKHYGTDFIYMTSKN 226
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
I EI +GPV M +Y D YK+G+Y+HV+G +G HA++IIGWG E
Sbjct: 227 VAEIQTEIMNNGPVVTHMDVYEDFYSYKSGVYQHVSGNSMGGHAVKIIGWGTE------- 279
Query: 454 SVVKYWLVANSFNTNWGE-NGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+ W + +G ++I+RG+N C IE I G P++
Sbjct: 280 KGVPYWLIANSWGAKWADLDGFYKILRGKNHCKIETYIYGGTPQV 324
>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
pisum]
gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
Length = 339
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 105/246 (42%), Positives = 145/246 (58%), Gaps = 11/246 (4%)
Query: 92 DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLS 151
+ +P FDAR W +C TI E+RDQG CGS WA G A +DR+C+A+ G + LS
Sbjct: 82 NKFSYIPRTFDARKRWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLS 141
Query: 152 SDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH 210
+++L CC CG+GC GG+ KAWKY+ T G+V+GG Y S +GC PY + PC R +G
Sbjct: 142 AEELTFCCHACGHGCNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPRNEDGKS 201
Query: 211 SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
S + C R C D+ Y+DD F R Y L +I +++ +GP+E S
Sbjct: 202 SCAGKPKEKNHRCTRMCYGNQDLDYDDDHRFTRDFYYLTYG--SIQKDVLNYGPIEASFD 259
Query: 271 IYADMILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 329
+Y D YK+G+Y+ LG HA+++IGWG E EGT YWL+ NS+N WG+
Sbjct: 260 VYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVE---EGTP----YWLMVNSWNAQWGD 312
Query: 330 NGLFRI 335
NGLF+I
Sbjct: 313 NGLFKI 318
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/184 (37%), Positives = 98/184 (53%), Gaps = 11/184 (5%)
Query: 314 VKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPG 372
+K W ++ G N GC PY +P C R +G S + C R C
Sbjct: 162 IKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPRNEDGKSSCAGKPKEKNHRCTRMCYGN 221
Query: 373 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG- 431
D+ Y+DD F R Y L +I +++ +GP+E S +Y D YK+G+Y+
Sbjct: 222 QDLDYDDDHRFTRDFYYLTYG--SIQKDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNAT 279
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
LG HA+++IGWG E EGT YWL+ NS+N WG+NGLF+I RG +EC I++ T
Sbjct: 280 KLGGHAVKLIGWGVE---EGTP----YWLMVNSWNAQWGDNGLFKIRRGTDECRIDSATT 332
Query: 492 AGLP 495
AG+P
Sbjct: 333 AGVP 336
>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 116/278 (41%), Positives = 165/278 (59%), Gaps = 24/278 (8%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
LS T+S+ + +GV P + +P+L L+ELP+ FDAR WP C TI +I D
Sbjct: 64 LSNFTVSQFKRLLGVKPAREGDLEGIPVLTH--PRLKELPKEFDARKAWPQCSTIGKILD 121
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CGS WA GAVE++SDR CI + LS +DL++CC CG+GC GG+ AW+Y
Sbjct: 122 QGHCGSCWAFGAVESLSDRFCI--HYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRY 179
Query: 178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
+ +G+V+ + C PY SH C+ P TP+C RKC G +V +
Sbjct: 180 FKRSGVVT-------EECDPYF----DTTGCSHPGCEPLYP-TPKCHRKCVKG-NVLWRK 226
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 297
++G AY + + ++IM E++++GPVE S T+Y D YK+G+YKHV GG +G HA++
Sbjct: 227 SKHYGVNAYRVSHDPQSIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVK 286
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+IGWG GE YWL+ NS+N WGE+G F+I
Sbjct: 287 LIGWGTSEQGE------DYWLIVNSWNRGWGEDGYFKI 318
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 69/154 (44%), Positives = 97/154 (62%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C RKC G +V + ++G AY + + ++IM E+
Sbjct: 190 CDPYFDTTGCSHPGCEPLYPTPKCHRKCVKG-NVLWRKSKHYGVNAYRVSHDPQSIMAEV 248
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKHV GG +G HA+++IGWG GE YWL+
Sbjct: 249 YKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGE------DYWLI 302
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
NS+N WGE+G F+I RG NECGIE + AGLP
Sbjct: 303 VNSWNRGWGEDGYFKIRRGTNECGIEHSVVAGLP 336
>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
Length = 407
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 109/217 (50%), Positives = 140/217 (64%), Gaps = 12/217 (5%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA+ AVEAMSDR+CI S+GK+ V LS+DDL+SCCK CG GC GG AWKYWV +GI
Sbjct: 163 SCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCKTCGFGCFGGEPMAAWKYWVLSGI 222
Query: 184 VSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
V+G Y + GCRPY PCE + N +H C+ + TP+C R+C Y Y+ D +
Sbjct: 223 VTGSDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCDRQCDKNYKKPYKADKYY 282
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G AY++ + E I +EI GPVE S +Y D + Y GIYKHVAG G HA++I+GW
Sbjct: 283 GEQAYNVENDVELIQKEIMTLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGGHAVKILGW 342
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGEN---GLFRI 335
G + +G S YWL ANS+NT+WGE+ G FRI
Sbjct: 343 G---IDQGVS----YWLAANSWNTDWGEDVFSGYFRI 372
Score = 144 bits (364), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 83/201 (41%), Positives = 117/201 (58%), Gaps = 20/201 (9%)
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQAN 358
+G EP+ + KYW+++ G + GCRPY P CE + N + C+ +
Sbjct: 206 FGGEPM-----AAWKYWVLSGIVT---GSDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHD 257
Query: 359 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 418
TP+C R+C Y Y+ D +G AY++ + E I +EI GPVE S +Y D +
Sbjct: 258 LYPTPKCDRQCDKNYKKPYKADKYYGEQAYNVENDVELIQKEIMTLGPVEASFEVYTDFL 317
Query: 419 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN---GLF 475
Y GIYKHVAG G HA++I+GWG + +G S YWL ANS+NT+WGE+ G F
Sbjct: 318 HYIGGIYKHVAGSVGGGHAVKILGWG---IDQGVS----YWLAANSWNTDWGEDVFSGYF 370
Query: 476 RIVRGQNECGIEADITAGLPK 496
RI+RG +ECGIE+ I AG+P+
Sbjct: 371 RILRGVDECGIESGIVAGIPR 391
>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
Length = 356
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 115/280 (41%), Positives = 164/280 (58%), Gaps = 28/280 (10%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
S T+S+ + +GV P K +P+L L ELP+ FDAR+ WP C TI I D
Sbjct: 64 FSNFTVSQFKRLLGVKPTRKGDLKGIPILTH--PKLLELPQEFDARVAWPNCSTIGRILD 121
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CGS WA GAVE++SDR CI ++ LS++DL++CC CG+GC GG+ +AWKY
Sbjct: 122 QGHCGSCWAFGAVESLSDRFCI--HYGLNISLSANDLLACCGFLCGDGCDGGYPLQAWKY 179
Query: 178 WVTTGIVSG--GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+V G+V+ Y +GC SH C+ P TP+C RKC ++ +
Sbjct: 180 FVRKGVVTDECDPYFDNEGC-------------SHPGCEPAYP-TPKCHRKCVK-QNLLW 224
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
+FG AY + ++ +IM E++++GPVE S T+Y D YK+G+YKHV G +G HA
Sbjct: 225 SKSKHFGVNAYMISSDPHSIMTELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHA 284
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 285 VKLIGWGTSEDGE------DYWLLANQWNRGWGDDGYFKI 318
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 103/174 (59%), Gaps = 12/174 (6%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C RKC ++ + +FG AY + ++ +IM E+
Sbjct: 190 CDPYFDNEGCSHPGCEPAYPTPKCHRKCVK-QNLLWSKSKHFGVNAYMISSDPHSIMTEL 248
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKHV G +G HA+++IGWG GE YWL+
Sbjct: 249 YKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGE------DYWLL 302
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP---KIGLEIDSNEINLGKMM 512
AN +N WG++G F+I RG +EC IE ++ AGLP + +E+D ++ L M
Sbjct: 303 ANQWNRGWGDDGYFKIRRGTDECEIEDEVVAGLPSARNLNMELDVSDAFLDAAM 356
>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
Length = 321
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 102/272 (37%), Positives = 159/272 (58%), Gaps = 24/272 (8%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
L +L +G+HPD P + P+LV + ++PE FDAR WP C ++ IRDQG+CG
Sbjct: 54 LYKLNGFIGLHPD---PNYKPPVLVHTFNA-RDVPESFDARTKWPNCDSLNRIRDQGACG 109
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA ++E+MSDR+CI S G S +DL+SCC CG+ C GG+ A +++ GI
Sbjct: 110 SCWAFASIESMSDRICIHSSGSAQFMFSPEDLLSCCTSCGD-CGGGYMMSALDFYINEGI 168
Query: 184 VSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGR 243
VSGG S +GCRPY ++ ++ TP C + C+ GY SY D ++G
Sbjct: 169 VSGGDVNSNEGCRPY------------TADAHDQGQTPACTKSCRNGYSTSYSADKHYGS 216
Query: 244 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 303
Y + + + I E+ +GP+ + ++ D Y +G+Y+HV+G +G H ++I+GWG
Sbjct: 217 NDYVVSSVIDQIQYEVMTNGPIIVNFEVFQDFYNYVSGVYRHVSGESVGFHVVKIVGWGV 276
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E + V YWL+ANS+ ++WG++G F++
Sbjct: 277 E-------NGVPYWLIANSWGSSWGDHGFFKM 301
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 92/162 (56%), Gaps = 19/162 (11%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
GCRPY ++ TP C + C+ GY SY D ++G Y + + +
Sbjct: 179 GCRPYTADAH------------DQGQTPACTKSCRNGYSTSYSADKHYGSNDYVVSSVID 226
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I E+ +GP+ + ++ D Y +G+Y+HV+G +G H ++I+GWG E +
Sbjct: 227 QIQYEVMTNGPIIVNFEVFQDFYNYVSGVYRHVSGESVGFHVVKIVGWGVE-------NG 279
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+ ++WG++G F+++RGQNECGIE A +P++
Sbjct: 280 VPYWLIANSWGSSWGDHGFFKMLRGQNECGIENYPYAVMPRL 321
>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
Length = 323
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 108/272 (39%), Positives = 155/272 (56%), Gaps = 20/272 (7%)
Query: 66 ELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSG 125
E G++ P + ++++ L +P FD+R W C +I+ IRDQ CGS
Sbjct: 50 EFMKSRGMNVKYAAPHSDEIRSTEVNNVLPFIPPSFDSRTRWSNCTSIEMIRDQAQCGSC 109
Query: 126 WALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIV 184
WA E +SDR+CIA++G + +S D+++CC CG+GC+G + +A+++W + G+V
Sbjct: 110 WAFSTAEVISDRICIATKGTQQPTISPTDMLACCGNSCGDGCKGRYPIQAFRWWNSRGVV 169
Query: 185 SGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGR 243
+GG + GCRPY PC SC E TP C CQ GY +Y D FG
Sbjct: 170 TGGDFRG-SGCRPYPFAPC--------ISCP--EEKTPTCSLSCQFGYSTAYAKDKRFGV 218
Query: 244 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 303
AY++ N I EI +GPV G+ T+Y DM YK+G+Y+H AG LG HAI+IIGW
Sbjct: 219 SAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGW-- 276
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GT + + YWL+ANS+ NWGENG ++
Sbjct: 277 -----GTQNGIPYWLIANSWGANWGENGFLKM 303
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 96/163 (58%), Gaps = 18/163 (11%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY PC SC E TP C CQ GY +Y D FG AY++ N
Sbjct: 178 GCRPYPFAPC--------ISCP--EEKTPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNV 227
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPV G+ T+Y DM YK+G+Y+H AG LG HAI+IIGW GT +
Sbjct: 228 AAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGW-------GTQN 280
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL+ANS+ NWGENG ++ RG NECGIE + AG+P++
Sbjct: 281 GIPYWLIANSWGANWGENGFLKMRRGVNECGIERAVVAGMPRV 323
>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
Length = 353
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 113/269 (42%), Positives = 154/269 (57%), Gaps = 27/269 (10%)
Query: 71 MGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALG 129
+G+HPD P +L +L + +P +P FDAR WP C I IR+QG CGS WA
Sbjct: 51 LGIHPD---PNFQLEVL-EWEEPRTVIPATFDAREYWPQCKDVIGNIRNQGKCGSCWAFA 106
Query: 130 AVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY 189
A E MSDR+C+A+ G S +DL++CC+ CG C+GG+ AWKY+ +TG+VSGG Y
Sbjct: 107 AAEVMSDRLCVATNGSVKFEFSPEDLINCCETCGKKCKGGYSYYAWKYYTSTGLVSGGDY 166
Query: 190 ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSL 248
+ +GC+PY S N+ +PEC + CQ Y SY +D +FG Y +
Sbjct: 167 NTSRGCQPY------------SKSNFNDGVSPECSKTCQNTKYPTSYLNDRHFGDGTYYI 214
Query: 249 PANEETIMREI-FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 307
N TI +EI R GPV +Y D LY+ G+Y H +G LG HA++IIGWG E
Sbjct: 215 LKNVTTIQQEILLRGGPVMAGFDVYEDFKLYREGVYVHTSGALLGSHAVKIIGWGTE--- 271
Query: 308 EGTSSVVKYWLVANSFNTNWGE-NGLFRI 335
+ YWLVANS+ +WG G+F+I
Sbjct: 272 ----NGWAYWLVANSWGKDWGALGGVFKI 296
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 66/161 (40%), Positives = 85/161 (52%), Gaps = 22/161 (13%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY S N+ +PEC + CQ Y SY +D +FG Y + N
Sbjct: 171 GCQPY------------SKSNFNDGVSPECSKTCQNTKYPTSYLNDRHFGDGTYYILKNV 218
Query: 395 ETIMREIF-RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
TI +EI R GPV +Y D LY+ G+Y H +G LG HA++IIGWG E
Sbjct: 219 TTIQQEILLRGGPVMAGFDVYEDFKLYREGVYVHTSGALLGSHAVKIIGWGTE------- 271
Query: 454 SVVKYWLVANSFNTNWGE-NGLFRIVRGQNECGIEADITAG 493
+ YWLVANS+ +WG G+F+I RG NEC IE I G
Sbjct: 272 NGWAYWLVANSWGKDWGALGGVFKIRRGTNECKIEQSIITG 312
>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 207 bits (528), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 113/261 (43%), Positives = 158/261 (60%), Gaps = 14/261 (5%)
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
QN P++ +D +LPE +D RI W C + IRDQ +CGS WA+ A+SDR+CI
Sbjct: 73 QNLNPVVNDDNDTGADLPENYDPRIVWKNCSSFHTIRDQANCGSCWAVSTAAAISDRICI 132
Query: 141 ASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE 199
A++GK+ V S D+++CC CG GC+GG+ +AWK++ G+VSGG Y K C PY
Sbjct: 133 ATKGKKQVYASDTDILTCCGARCGLGCRGGWPIEAWKFFEYDGVVSGGPYLGKGCCSPYP 192
Query: 200 I-PCERYMNGS-HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGR--IAYSLPANEETI 255
+ PC R+ N + + +C P TP C RKCQPG+ Y D +G Y+LP +E I
Sbjct: 193 LHPCGRHGNDTFYGNCVGMAP-TPPCKRKCQPGFRGMYRVDKRYGEPGRTYTLPRSEVKI 251
Query: 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEGTSSVV 314
R+I G V +Y D Y++GIYKH AG G HA+++IGWG++ GT
Sbjct: 252 RRDIKERGSVVAVFAVYEDFSHYQSGIYKHTAGRFTGGYHAVKMIGWGKD---NGTD--- 305
Query: 315 KYWLVANSFNTNWGENGLFRI 335
YWL+ANS++ +WGENG FR+
Sbjct: 306 -YWLIANSWHDDWGENGFFRM 325
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 71/166 (42%), Positives = 97/166 (58%), Gaps = 13/166 (7%)
Query: 337 CRPYEI-PCERYMNGS-RSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGR--IAYSLPA 392
C PY + PC R+ N + +C P TP C RKCQPG+ Y D +G Y+LP
Sbjct: 188 CSPYPLHPCGRHGNDTFYGNCVGMAP-TPPCKRKCQPGFRGMYRVDKRYGEPGRTYTLPR 246
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG-EHAIRIIGWGQEPLGEG 451
+E I R+I G V +Y D Y++GIYKH AG G HA+++IGWG++ G
Sbjct: 247 SEVKIRRDIKERGSVVAVFAVYEDFSHYQSGIYKHTAGRFTGGYHAVKMIGWGKD---NG 303
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
T YWL+ANS++ +WGENG FR++RG N CGIE + AG+ +
Sbjct: 304 TD----YWLIANSWHDDWGENGFFRMIRGINNCGIEEQVDAGIVDV 345
>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 339
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/241 (42%), Positives = 143/241 (59%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W +C TI E+RDQG CGS WA G A +DR+C+A+ G + LS+++L
Sbjct: 87 IPRTFDARRRWRHCKTIGEVRDQGYCGSCWAFGTSSAFADRLCVATDGDFNELLSAEELT 146
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CGNGC GG+ KAWKY+ + G+V+GG Y S +GC PY + PC R +G+ S
Sbjct: 147 FCCHTCGNGCNGGYPIKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPRNEDGTSSCAGQ 206
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C D+ Y DD F R Y L +I +++ +GP+E S +Y D
Sbjct: 207 PIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLTYG--SIQKDVMNYGPIEASFDVYDDF 264
Query: 276 ILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y+ LG HA+++IGWG E EG + YWL+ NS++ WG+NGLF+
Sbjct: 265 YSYKSGVYQRTPNATKLGGHAVKLIGWGVE---EG----IPYWLMVNSWSAQWGDNGLFK 317
Query: 335 I 335
I
Sbjct: 318 I 318
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 69/184 (37%), Positives = 98/184 (53%), Gaps = 11/184 (5%)
Query: 314 VKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPG 372
+K W +S G N GC PY +P C R +G+ S C R C
Sbjct: 162 IKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPRNEDGTSSCAGQPIEKNHRCTRMCYGN 221
Query: 373 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG- 431
D+ Y DD F R Y L +I +++ +GP+E S +Y D YK+G+Y+
Sbjct: 222 QDLDYNDDHRFTRDYYYLTYG--SIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNAT 279
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
LG HA+++IGWG E EG + YWL+ NS++ WG+NGLF+I RG +ECGI++ T
Sbjct: 280 KLGGHAVKLIGWGVE---EG----IPYWLMVNSWSAQWGDNGLFKIRRGTDECGIDSATT 332
Query: 492 AGLP 495
AG+P
Sbjct: 333 AGVP 336
>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
Length = 342
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 109/245 (44%), Positives = 145/245 (59%), Gaps = 12/245 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+L FDAR WP C +I I D C S WA A E+MSDR+CI S G LS+ +L
Sbjct: 84 DLSPFFDARERWPECSSIPLINDISECKSSWAFAAAESMSDRLCINSGGMIDTILSAQEL 143
Query: 156 VSCCK---DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYM-NGSH 210
+SCC CG GC GG KAW+YW GI +GG+Y S+ GC+PY I PC + + N ++
Sbjct: 144 LSCCTGVLSCGEGCAGGNPLKAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTY 203
Query: 211 SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
C + TP C +KC+PGY V + D ++G LP + I ++ +GPVE +M
Sbjct: 204 PPCTNTTLPTPTCEKKCKPGYPVDLDKDRHYGVSVDQLPNRQIEIQSDVMLNGPVEATME 263
Query: 271 IYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
IY D + Y TGIY H+AG G ++RI+GWG + EG V YWL+ANS+ WGEN
Sbjct: 264 IYDDFLQYTTGIYVHLAGNKQGHLSVRILGWG---MFEG----VPYWLLANSWGKEWGEN 316
Query: 331 GLFRI 335
G FR+
Sbjct: 317 GTFRV 321
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/166 (43%), Positives = 102/166 (61%), Gaps = 9/166 (5%)
Query: 334 RIGCRPYEI-PCERYM-NGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
+ GC+PY I PC + + N + C TP C +KC+PGY V + D ++G LP
Sbjct: 183 QFGCKPYSIAPCGKTIGNVTYPPCTNTTLPTPTCEKKCKPGYPVDLDKDRHYGVSVDQLP 242
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+ I ++ +GPVE +M IY D + Y TGIY H+AG G ++RI+GWG + EG
Sbjct: 243 NRQIEIQSDVMLNGPVEATMEIYDDFLQYTTGIYVHLAGNKQGHLSVRILGWG---MFEG 299
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+ WGENG FR++RG NECG+EA+ +G+PK+
Sbjct: 300 ----VPYWLLANSWGKEWGENGTFRVLRGVNECGLEANCISGMPKL 341
>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
Length = 278
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 101/243 (41%), Positives = 146/243 (60%), Gaps = 2/243 (0%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
+ +++ +GV ++ +N V+ S +LPE FDAR WP CP+I EIRDQ SC
Sbjct: 30 IDQVKQNLGVLEETPEDRNTQRQTVRYSVSENDLPESFDARQKWPNCPSISEIRDQSSCS 89
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA+ + A++DR+CI S G++ RLS+ D+VSCC CG GC GG +W YW G+
Sbjct: 90 SCWAVSSASAITDRICIHSNGQKKPRLSAIDIVSCCAYCGYGCNGGIPAMSWDYWTREGV 149
Query: 184 VSGGTYASKQGCRPYEIP-CER-YMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
V+GGT + GC PY P C + C + TP+C +KC GY+ +YE D
Sbjct: 150 VTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVK 209
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G+ +Y++ E IM EI ++GPV+G ++ D ++YK+GIY + G +G HAIR+IGW
Sbjct: 210 GKSSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGW 269
Query: 302 GQE 304
G E
Sbjct: 270 GVE 272
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 66/113 (58%), Gaps = 2/113 (1%)
Query: 336 GCRPYEIP-CER-YMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC PY P C + C + TP+C +KC GY+ +YE D G+ +Y++
Sbjct: 160 GCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQ 219
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
E IM EI ++GPV+G ++ D ++YK+GIY + G +G HAIR+IGWG E
Sbjct: 220 ETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVE 272
>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
Length = 217
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 98/202 (48%), Positives = 135/202 (66%), Gaps = 9/202 (4%)
Query: 135 SDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQG 194
+DRVC S G +H S++DL+SCC CG GC GG AW+YW G+VSGG Y S QG
Sbjct: 1 TDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCNGGMPTLAWEYWKHMGLVSGGNYNSSQG 60
Query: 195 CRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 253
C PY IP CE ++ G+ C + + TP+C + C+ GY+V Y+ D +G+ Y++ E+
Sbjct: 61 CSPYVIPPCEHHVPGNRLPC-NGDTKTPKCSKTCENGYNVLYKKDKRYGKHVYAVRGGED 119
Query: 254 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 313
I E+F++GPVE + T+YAD++ YK+G+YKHV G LG HAI+IIGWG E +
Sbjct: 120 HIKAELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGVE-------NG 172
Query: 314 VKYWLVANSFNTNWGENGLFRI 335
KYWL+ANS+NT+WG NG F+I
Sbjct: 173 NKYWLIANSWNTDWGNNGFFKI 194
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 78/161 (48%), Positives = 111/161 (68%), Gaps = 9/161 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY IP CE ++ G+R C + TP+C + C+ GY+V Y+ D +G+ Y++ E
Sbjct: 60 GCSPYVIPPCEHHVPGNRLPCNG-DTKTPKCSKTCENGYNVLYKKDKRYGKHVYAVRGGE 118
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I E+F++GPVE + T+YAD++ YK+G+YKHV G LG HAI+IIGWG E +
Sbjct: 119 DHIKAELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGVE-------N 171
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
KYWL+ANS+NT+WG NG F+I+RG++ CGIE+ I AG P
Sbjct: 172 GNKYWLIANSWNTDWGNNGFFKILRGEDHCGIESSIVAGEP 212
>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 114/280 (40%), Positives = 162/280 (57%), Gaps = 28/280 (10%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
S T+S+ + +GV P K +P+L L ELP+ FDAR+ W C TI I D
Sbjct: 64 FSNFTVSQFKRLLGVKPTRKGDLKGIPILTH--PKLLELPQEFDARVAWSNCSTIGRILD 121
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CGS WA GAVE++SDR CI ++ LS++DL +CC CG+GC GG+ +AWKY
Sbjct: 122 QGHCGSCWAFGAVESLSDRFCI--HYGLNISLSANDLYACCGFLCGDGCDGGYPLQAWKY 179
Query: 178 WVTTGIVSG--GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+V G+V+ Y +GC SH C+ P TP+C RKC ++ +
Sbjct: 180 FVRKGVVTDECDPYFDNEGC-------------SHPGCEPAYP-TPKCHRKCVK-QNLLW 224
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
+FG AY + ++ +IM E++++GPVE S T+Y D YK+G+YKHV G +G HA
Sbjct: 225 SRSKHFGVNAYMISSDPHSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHA 284
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 285 VKLIGWGTSEDGE------DYWLLANQWNRGWGDDGYFKI 318
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 69/174 (39%), Positives = 103/174 (59%), Gaps = 12/174 (6%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C RKC ++ + +FG AY + ++ +IM E+
Sbjct: 190 CDPYFDNEGCSHPGCEPAYPTPKCHRKCVK-QNLLWSRSKHFGVNAYMISSDPHSIMTEV 248
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKHV G +G HA+++IGWG GE YWL+
Sbjct: 249 YKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHAVKLIGWGTSEDGE------DYWLL 302
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP---KIGLEIDSNEINLGKMM 512
AN +N WG++G F+I RG NEC IE ++ AGLP + +E+D ++ L M
Sbjct: 303 ANQWNRGWGDDGYFKIRRGTNECEIEDEVVAGLPSARNLNVELDVSDAFLDAAM 356
>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
Length = 195
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 103/205 (50%), Positives = 144/205 (70%), Gaps = 12/205 (5%)
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVT 180
CGS WA GAVEA+SDR+CI + V +S++DL++CC CG+GC GG+ +AW +W
Sbjct: 1 CGSCWAFGAVEAISDRICIHTN--VSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 58
Query: 181 TGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
G+VSGG Y S GCRPY IP CE ++NGS C E +TP+C + C+PGY +Y+ D
Sbjct: 59 KGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDK 117
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
++G +YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+
Sbjct: 118 HYGYDSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 177
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFN 324
GWG E GT YWLVANS+N
Sbjct: 178 GWGVE---NGT----PYWLVANSWN 195
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 69/139 (49%), Positives = 99/139 (71%), Gaps = 11/139 (7%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +
Sbjct: 65 GLYESHVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYDS 123
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E
Sbjct: 124 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE- 182
Query: 448 LGEGTSSVVKYWLVANSFN 466
GT YWLVANS+N
Sbjct: 183 --NGT----PYWLVANSWN 195
>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
Length = 326
Score = 206 bits (524), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 107/246 (43%), Positives = 143/246 (58%), Gaps = 22/246 (8%)
Query: 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
L+ P FDAR WP C +++ IR+Q +CGS WA E +SDR CIAS G + +S
Sbjct: 80 LDATPLNFDARTRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPT 139
Query: 154 DLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHS 211
DL++CC CG GC GGF +A+++W G+V+GG Y GC+PY I PC
Sbjct: 140 DLLTCCGMSCGEGCDGGFPYRAFQWWARRGVVTGGDYLGT-GCKPYPIRPCN-------- 190
Query: 212 SCQDNEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
DN N TP C CQPGY +Y +D N+G AY +P I +I+ +GPV +
Sbjct: 191 --SDNCVNLQTPPCRLSCQPGYRTTYTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAF 248
Query: 270 TIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 329
+Y D YK+GIY+H+AG G HA+++IGWG E GT YWL NS+ + WGE
Sbjct: 249 IVYEDFEKYKSGIYRHIAGRSKGGHAVKLIGWGTE---RGT----PYWLAVNSWGSQWGE 301
Query: 330 NGLFRI 335
+G FRI
Sbjct: 302 SGTFRI 307
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 96/162 (59%), Gaps = 16/162 (9%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY I PC +S TP C CQPGY +Y +D N+G AY +P
Sbjct: 180 GCKPYPIRPC--------NSDNCVNLQTPPCRLSCQPGYRTTYTNDKNYGNSAYPVPRTV 231
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +I+ +GPV + +Y D YK+GIY+H+AG G HA+++IGWG E GT
Sbjct: 232 AAIQADIYYNGPVVAAFIVYEDFEKYKSGIYRHIAGRSKGGHAVKLIGWGTE---RGT-- 286
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL NS+ + WGE+G FRI+RG +ECGIE+ I AGLP+
Sbjct: 287 --PYWLAVNSWGSQWGESGTFRILRGVDECGIESRIVAGLPR 326
>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 170/293 (58%), Gaps = 30/293 (10%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARI 105
PK + + S T+++ + +GV P K +P++ S P LPE FDAR
Sbjct: 53 PKAGWKATMNHHFSNYTVAQFKYLLGVKPTPKEELRGIPVI---SHPKSLRLPEEFDART 109
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGN 164
WP C TI +I DQG CGS WA GAVE++SDR CI ++ LS +DL++CC CG+
Sbjct: 110 AWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCI--HYGMNISLSVNDLLACCGFLCGS 167
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPE 222
GC GG+ AW+Y+V G+V+ + C PY +I C SH C+ P TP+
Sbjct: 168 GCNGGYPISAWRYFVHHGVVT-------EECDPYFDDIGC------SHPGCEPGYP-TPK 213
Query: 223 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
C RKC + ++ ++G Y + ++ E+IM EI+++GPVE + T+Y D YK+G+
Sbjct: 214 CARKCVNKNQL-WKKSKHYGVKPYRIDSDPESIMAEIYKNGPVEVAFTVYEDFAHYKSGV 272
Query: 283 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
YKH+ GG +G HA+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 273 YKHITGGMMGGHAVKLIGWGTSEDGEA------YWLLANQWNRGWGDDGYFKI 319
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 102/169 (60%), Gaps = 16/169 (9%)
Query: 329 ENGLFRIGCRPY--EIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
+G+ C PY +I C S C+ P TP+C RKC + ++ ++G
Sbjct: 183 HHGVVTEECDPYFDDIGC------SHPGCEPGYP-TPKCARKCVNKNQL-WKKSKHYGVK 234
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
Y + ++ E+IM EI+++GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG
Sbjct: 235 PYRIDSDPESIMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTS 294
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GE YWL+AN +N WG++G F+I RG NECGIE D+ AGLP
Sbjct: 295 EDGEA------YWLLANQWNRGWGDDGYFKIRRGTNECGIEGDVVAGLP 337
>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
Length = 330
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 111/286 (38%), Positives = 164/286 (57%), Gaps = 21/286 (7%)
Query: 48 KLPFYGAEKNALSKLTLSEL---EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDAR 104
+ PF+ A+ + ++ L+ L E V K+P+ + E++PE FD+R
Sbjct: 49 RQPFFEAKYSPEAEQRLNHLMDTEFVRNVRKLHKIPRAEKAI------SNEDIPESFDSR 102
Query: 105 INWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCG 163
W C +I IRDQ + GS WA+ A E MSDR+C+ S+G+ +S D+++CC ++CG
Sbjct: 103 EVWKNCSSITYIRDQSNSGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECG 162
Query: 164 NGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC-QDNEPNTP 221
GC GG KAW+Y G+V+GG Y K C+PY + PCE + G SC +D+ TP
Sbjct: 163 RGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYHLHPCE--ITGKFWSCPRDHSFRTP 220
Query: 222 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 281
C + CQ GY YE D ++ + Y L +E+ I RE+ ++GPV+ + T Y D Y+ G
Sbjct: 221 ACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFTTYEDFSFYRKG 280
Query: 282 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW 327
IY H G G HA++++GWG E GT KYW VANS++T+W
Sbjct: 281 IYVHSYGRQRGAHAVKVVGWGVE---NGT----KYWNVANSWSTDW 319
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/157 (37%), Positives = 83/157 (52%), Gaps = 11/157 (7%)
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEP-NTPECIRKCQPG 372
K W F G + C+PY + PCE + G SC + TP C + CQ G
Sbjct: 172 KAWEYVKEFGVVTGGRYQEKGVCKPYHLHPCE--ITGKFWSCPRDHSFRTPACKKYCQYG 229
Query: 373 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 432
Y YE D ++ + Y L +E+ I RE+ ++GPV+ + T Y D Y+ GIY H G
Sbjct: 230 YGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFTTYEDFSFYRKGIYVHSYGRQ 289
Query: 433 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW 469
G HA++++GWG E GT KYW VANS++T+W
Sbjct: 290 RGAHAVKVVGWGVE---NGT----KYWNVANSWSTDW 319
>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
Length = 350
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 111/283 (39%), Positives = 156/283 (55%), Gaps = 13/283 (4%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLL----VQLSDPLEELPEGFDARINW 107
Y + + S+L S+ E RM + +N L + E++PE FD+RI W
Sbjct: 45 YVNKHQSFSRLNTSKAEERMAHLMKTDYIRNARKLYKVKKAEEQTTNEDIPESFDSRIVW 104
Query: 108 PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGC 166
C +I +RDQ CGS WA+ A MSDR+C+ ++GK LS D++SCC + CG+GC
Sbjct: 105 KNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDILSCCGRMCGDGC 164
Query: 167 QGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIR 225
+GG+ AW++ G+V+GG Y K CRPY PC + + D+ +TP C
Sbjct: 165 EGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTPACKP 224
Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 285
CQ GY YE D F + Y L +E+ I RE+ ++GPV+ + Y D YK GIY H
Sbjct: 225 YCQFGYGKRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAFITYEDFSPYKGGIYVH 284
Query: 286 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
V G G HA+++IGWG E GT KYW VANS++ +WG
Sbjct: 285 VKGRERGAHAVKLIGWGVE---NGT----KYWTVANSWHDDWG 320
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/136 (42%), Positives = 76/136 (55%), Gaps = 10/136 (7%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEP-NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
CRPY PC + +G R C + +TP C CQ GY YE D F + Y L +E
Sbjct: 193 CRPYAFHPCGLH-HGRRYDCPWDHSFSTPACKPYCQFGYGKRYEKDKFFVKSTYILDNDE 251
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I RE+ ++GPV+ + Y D YK GIY HV G G HA+++IGWG E GT
Sbjct: 252 KVIQREMMKNGPVQAAFITYEDFSPYKGGIYVHVKGRERGAHAVKLIGWGVE---NGT-- 306
Query: 455 VVKYWLVANSFNTNWG 470
KYW VANS++ +WG
Sbjct: 307 --KYWTVANSWHDDWG 320
>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 205 bits (522), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 109/277 (39%), Positives = 163/277 (58%), Gaps = 19/277 (6%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW-PYCPTIQEIRDQGSC 122
++ ++ MGV KL Q L +S LPE FDAR+ W C ++ E+RDQ +C
Sbjct: 64 IAGVKAHMGV----KLGQESGIKLETVSAQANGLPEEFDARVQWGDKCSSLWEVRDQSTC 119
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GA E++SDR CI + +RLS+ +L++CC CG+GC GG+ A Y+V TG
Sbjct: 120 GSCWAFGAAESLSDRHCI--HLGQDIRLSTQNLLTCCAACGDGCDGGWPEAAMDYYVNTG 177
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGS-HSSCQDNEPNTPECIRKCQPG--YDVSYEDD 238
+V+G Y + C+ Y PC ++ + C P TP CI C + + Y D
Sbjct: 178 LVTGDLYGNNSWCQAYTFAPCAHHVTSDIYPPCTGELP-TPPCINSCDSNSTHTIPYSKD 236
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
++ G AY + +E+ IM EI+++GP+E ++T+Y D + YKTG+Y+HV G LG HA+++
Sbjct: 237 IHRGSKAYGIAKDEKAIMAEIYKNGPIEVALTVYEDFLTYKTGVYQHVTGDELGGHAVKM 296
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GWG E GT YW + NS+N +WG+ G F+I
Sbjct: 297 VGWGVE---NGT----PYWTIVNSWNESWGDKGTFKI 326
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/162 (39%), Positives = 96/162 (59%), Gaps = 10/162 (6%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPAN 393
C+ Y PC ++ E TP CI C + + Y D++ G AY + +
Sbjct: 190 CQAYTFAPCAHHVTSDIYPPCTGELPTPPCINSCDSNSTHTIPYSKDIHRGSKAYGIAKD 249
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E+ IM EI+++GP+E ++T+Y D + YKTG+Y+HV G LG HA++++GWG E GT
Sbjct: 250 EKAIMAEIYKNGPIEVALTVYEDFLTYKTGVYQHVTGDELGGHAVKMVGWGVE---NGT- 305
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YW + NS+N +WG+ G F+I+RG+NECGIE+ LP
Sbjct: 306 ---PYWTIVNSWNESWGDKGTFKILRGKNECGIESSCVTALP 344
>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
Length = 373
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 116/282 (41%), Positives = 161/282 (57%), Gaps = 26/282 (9%)
Query: 64 LSELEMRMGVHPD---SKLPQNRL-PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQ 119
+S+ EM+ V + LP+ +LV E +P+ FDAR NWP C +I+ IR+Q
Sbjct: 56 ISDSEMKFKVMDERFADPLPEEESGEILVSGEIVPEPIPDTFDARENWPDCKSIKLIRNQ 115
Query: 120 GSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYW 178
+CGS WA GA E +SDR+CI S G + +S +D++SCC CG GCQGG+ +A ++W
Sbjct: 116 ATCGSCWAFGAAEVISDRICIQSNGTQQPIISVEDILSCCGTTCGKGCQGGYSIEAMRFW 175
Query: 179 VTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS-YE 236
+ G V+GG Y + GC PY PC++ S C E TP C CQ Y + Y
Sbjct: 176 KSNGAVTGGDY-NGNGCMPYSFAPCQK------SPCV--ESTTPTCKTTCQSSYTTANYT 226
Query: 237 DDLNFGRIAYSLPANE---ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 293
D ++G AY L TI EI+ +GPVE S +Y D YK+G+Y +V+G +G
Sbjct: 227 TDKHYGTSAYRLATTNNVVSTIQYEIYHNGPVEASYKVYEDFYQYKSGVYHYVSGKLVGG 286
Query: 294 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA++IIGWG E + V YWLVANS+ +GE G F+I
Sbjct: 287 HAVKIIGWGTE-------NDVDYWLVANSWGIKFGEGGFFKI 321
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 72/179 (40%), Positives = 99/179 (55%), Gaps = 20/179 (11%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPAN 393
GC PY PC++ S C E TP C CQ Y + Y D ++G AY L
Sbjct: 190 GCMPYSFAPCQK------SPCV--ESTTPTCKTTCQSSYTTANYTTDKHYGTSAYRLATT 241
Query: 394 E---ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 450
TI EI+ +GPVE S +Y D YK+G+Y +V+G +G HA++IIGWG E
Sbjct: 242 NNVVSTIQYEIYHNGPVEASYKVYEDFYQYKSGVYHYVSGKLVGGHAVKIIGWGTE---- 297
Query: 451 GTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEINLG 509
+ V YWLVANS+ +GE G F+I RG NEC IE+++ AG+ K+G + + + G
Sbjct: 298 ---NDVDYWLVANSWGIKFGEGGFFKIRRGTNECQIESNVVAGVAKLGTHAEKGDDDDG 353
>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
Length = 340
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 115/292 (39%), Positives = 166/292 (56%), Gaps = 21/292 (7%)
Query: 50 PFYGAE--KNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW 107
PFY AE NA + + ++ + V P + + V DP P+ FDAR +W
Sbjct: 46 PFYRAEYSPNAEAFVKARIMDSKFLVEPK----KEEVLTEVFGDDP----PDSFDARAHW 97
Query: 108 PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGC 166
P C +I IRDQ +CGS WA+ + EAMSD++C+ S V +S D++SCC CG GC
Sbjct: 98 PECRSIGTIRDQSACGSCWAVSSAEAMSDQICVQSNRTTRVMISDTDILSCCGISCGYGC 157
Query: 167 QGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQDNEPNTPECI 224
+ +A+++ + +V+GG Y K C+PY PC + N + C TP+C
Sbjct: 158 EV-LPIEAYRWMQRSVVVTGGKYRQKDVCKPYAFYPCGNHTNERYYGPCPRGLWPTPKCR 216
Query: 225 RKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK 284
+ CQ Y+ SY +D F +Y LP+NE +I EI+++GPV + +Y D Y+ GIY
Sbjct: 217 KACQRKYNKSYNEDKYFATRSYYLPSNERSIREEIYKNGPVVAAFKVYQDFSYYRGGIYV 276
Query: 285 HVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
H GG G HA++++GWG+E GT YWL+ANS+NT+WGENG FRI
Sbjct: 277 HKWGGQTGAHAVKVVGWGRE---NGTD----YWLIANSWNTDWGENGYFRIA 321
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 99/163 (60%), Gaps = 9/163 (5%)
Query: 337 CRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C+PY PC + N C TP+C + CQ Y+ SY +D F +Y LP+NE
Sbjct: 185 CKPYAFYPCGNHTNERYYGPCPRGLWPTPKCRKACQRKYNKSYNEDKYFATRSYYLPSNE 244
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+I EI+++GPV + +Y D Y+ GIY H GG G HA++++GWG+E GT
Sbjct: 245 RSIREEIYKNGPVVAAFKVYQDFSYYRGGIYVHKWGGQTGAHAVKVVGWGRE---NGTD- 300
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ANS+NT+WGENG FRI RG NECGIE + +G+ ++
Sbjct: 301 ---YWLIANSWNTDWGENGYFRIARGSNECGIEGQMVSGVMRV 340
>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
Length = 255
Score = 204 bits (519), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 109/247 (44%), Positives = 151/247 (61%), Gaps = 27/247 (10%)
Query: 93 PLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
P ++P+ FDAR NWP CP+I IRDQ +CGS WA GAVEAMSDR+CIAS G LS+
Sbjct: 11 PNVKIPDNFDARTNWPQCPSIAHIRDQSTCGSCWAFGAVEAMSDRLCIASNGTVKDELSA 70
Query: 153 DDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH 210
+D++SCC CG GC GGF AW+++ G+ + Y PY PCE ++N +H
Sbjct: 71 EDMLSCCLVQCGMGCNGGFPTGAWRFFKMHGLTTESKY-------PYVFPPCEHHINKTH 123
Query: 211 -SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
C ++P TP+C+R S + G+ YS+ + I EI +GPVE +
Sbjct: 124 YKPCGPSQP-TPKCVR-------ASEKKPRYHGKSVYSV--SPAKIQAEIMTNGPVEAAF 173
Query: 270 TIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 329
T+Y D + Y++G+Y+HV+G LG HAI+I+GWG E + KYWLVANS+N +WG+
Sbjct: 174 TVYQDFLAYQSGVYRHVSGPELGGHAIKIMGWGVE-------AGNKYWLVANSWNEDWGD 226
Query: 330 NGLFRIG 336
G F+I
Sbjct: 227 KGTFKIA 233
Score = 121 bits (304), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 94/153 (61%), Gaps = 18/153 (11%)
Query: 343 PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
PCE ++N + C ++P TP+C+R S + G+ YS+ + I EI
Sbjct: 114 PCEHHINKTHYKPCGPSQP-TPKCVR-------ASEKKPRYHGKSVYSV--SPAKIQAEI 163
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+GPVE + T+Y D + Y++G+Y+HV+G LG HAI+I+GWG E + KYWLV
Sbjct: 164 MTNGPVEAAFTVYQDFLAYQSGVYRHVSGPELGGHAIKIMGWGVE-------AGNKYWLV 216
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
ANS+N +WG+ G F+I RG +ECGIE+ + AG+
Sbjct: 217 ANSWNEDWGDKGTFKIARGDDECGIESSVVAGM 249
>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
Full=Cysteine protease-related 3; Flags: Precursor
gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
Length = 370
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 114/284 (40%), Positives = 157/284 (55%), Gaps = 30/284 (10%)
Query: 64 LSELEMRMGV------HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIR 117
+SE EM+ V P K L V+ E LP+ FDAR WP C TI+ IR
Sbjct: 53 ISEFEMKFKVMDVKFAEPLEKDSDVASELFVRGEIVPEPLPDTFDAREKWPDCNTIKLIR 112
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWK 176
+Q +CGS WA GA E +SDRVCI S G + +S +D++SCC CG GC+GG+ +A +
Sbjct: 113 NQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYSIEALR 172
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDN--EPNTPECIRKCQPGYDV- 233
+W ++G V+GG Y GC PY S + C N E TP C CQ Y
Sbjct: 173 FWASSGAVTGGDYGG-HGCMPY----------SFAPCTKNCPESTTPSCKTTCQSSYKTE 221
Query: 234 SYEDDLNFGRIAYSLPANEET--IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 291
Y+ D ++G AY + + I EI+ +GPVE S +Y D YK+G+Y + +G +
Sbjct: 222 EYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLV 281
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA++IIGWG E + V YWL+ANS+ T++GE G F+I
Sbjct: 282 GGHAVKIIGWGVE-------NGVDYWLIANSWGTSFGEKGFFKI 318
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 95/174 (54%), Gaps = 20/174 (11%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPAN 393
GC PY PC + E TP C CQ Y Y+ D ++G AY +
Sbjct: 189 GCMPYSFAPCTK---------NCPESTTPSCKTTCQSSYKTEEYKKDKHYGASAYKVTTT 239
Query: 394 EET--IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+ I EI+ +GPVE S +Y D YK+G+Y + +G +G HA++IIGWG E
Sbjct: 240 KSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGVE----- 294
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNE 505
+ V YWL+ANS+ T++GE G F+I RG NEC IE ++ AG+ K+G ++ E
Sbjct: 295 --NGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAKLGTHSETYE 346
>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
Length = 339
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 104/245 (42%), Positives = 149/245 (60%), Gaps = 11/245 (4%)
Query: 95 EELPEGFDARINWPYC-PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
+E PE FDAR WPYC I +RDQ CGS WA+ A MSDR+C+ S GK + +S
Sbjct: 82 KEPPEKFDARDAWPYCREIIGHVRDQSRCGSCWAVSAASVMSDRLCVQSNGKIKLHVSDT 141
Query: 154 DLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH- 210
D+++CC + CG+GC GG+ +AW++ G+ +GG Y +K C+PY PC + N +
Sbjct: 142 DILACCGEFCGDGCSGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPYAFHPCGNHENQVYY 201
Query: 211 SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
C TP C + CQ GY Y+ D + + +Y LP +E+ I +I ++GPV+ +
Sbjct: 202 GVCPKGSWPTPRCEKFCQRGYIKPYKKDKFYAKKSYWLPNDEKEIRLDIMKNGPVQAAFD 261
Query: 271 IYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
+Y D LYK GIYKH G G HA++IIGWG++ GT YWL+ANS++ +WGE+
Sbjct: 262 VYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWGKD---NGTD----YWLIANSWSKDWGES 314
Query: 331 GLFRI 335
G FR+
Sbjct: 315 GFFRM 319
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 69/160 (43%), Positives = 97/160 (60%), Gaps = 9/160 (5%)
Query: 337 CRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C+PY PC + N C TP C + CQ GY Y+ D + + +Y LP +E
Sbjct: 184 CKPYAFHPCGNHENQVYYGVCPKGSWPTPRCEKFCQRGYIKPYKKDKFYAKKSYWLPNDE 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +I ++GPV+ + +Y D LYK GIYKH G G HA++IIGWG++ GT
Sbjct: 244 KEIRLDIMKNGPVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWGKD---NGTD- 299
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
YWL+ANS++ +WGE+G FR+VRG+N+C IE ITAG+
Sbjct: 300 ---YWLIANSWSKDWGESGFFRMVRGENDCEIEDMITAGI 336
>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 348
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 108/272 (39%), Positives = 153/272 (56%), Gaps = 14/272 (5%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
++++ V P+ P ++ ++ ++P+ FDAR WP C +++ IRDQ SCGS W
Sbjct: 68 MDVKFAVDPEKTEPN----YVLANTEMKVDIPDTFDARDRWPNCTSMKHIRDQSSCGSCW 123
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVS 185
A+ A AMSDRVC + G+ + LS +++SCC CG GC+GG+ +A+ Y G+ +
Sbjct: 124 AVAAASAMSDRVCALTNGRINRILSDTEVLSCCFGSCGFGCKGGYPARAFGYAWRYGLST 183
Query: 186 GGTYASKQGCRPYEI-PCERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGR 243
GG Y K C+PY PC + + + C D TP C R CQ GY + +E D F
Sbjct: 184 GGPYGEKDACQPYAFYPCGNHAHEPYYGPCPDELWPTPTCRRTCQLGYPIPFEKDKIFND 243
Query: 244 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 303
Y + NE I EI GPV + +Y D YK G+Y H G G HA++IIGWG+
Sbjct: 244 QTYYIFGNETEIKYEIMTRGPVVATYKVYRDFDYYKKGVYIHREGEVTGLHAVKIIGWGK 303
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ V YWLVANS+NT+WG+NG FRI
Sbjct: 304 -------GNDVPYWLVANSWNTDWGDNGYFRI 328
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 66/164 (40%), Positives = 87/164 (53%), Gaps = 9/164 (5%)
Query: 336 GCRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
C+PY PC + + C TP C R CQ GY + +E D F Y + N
Sbjct: 192 ACQPYAFYPCGNHAHEPYYGPCPDELWPTPTCRRTCQLGYPIPFEKDKIFNDQTYYIFGN 251
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E I EI GPV + +Y D YK G+Y H G G HA++IIGWG+
Sbjct: 252 ETEIKYEIMTRGPVVATYKVYRDFDYYKKGVYIHREGEVTGLHAVKIIGWGK-------G 304
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ V YWLVANS+NT+WG+NG FRIVRG + C IE + G+ ++
Sbjct: 305 NDVPYWLVANSWNTDWGDNGYFRIVRGTDNCEIERQMVGGIMRV 348
>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 333
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 101/258 (39%), Positives = 151/258 (58%), Gaps = 13/258 (5%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LP+ FD+R W CP+I I DQ C SGWA+ + ++SDR CI + G V+LS+ +L
Sbjct: 83 QLPDYFDSREQWKDCPSINIIHDQSKCDSGWAVASAASISDRTCIQTNGTMKVQLSAIEL 142
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+SC K+ GCQ GF +W YW+ G+V+G GC PY P C+ + S+ C
Sbjct: 143 ISCSKN-KLGCQIGFSEFSWDYWLKNGLVTG----DPTGCLPYPFPKCDHRSSNSYPKCG 197
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
P C + C+ GY + Y+ D ++GR+ YSL NE I +EI +GPVE + +++D
Sbjct: 198 YITYTAPPCTKTCRSGYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSD 257
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+G+Y+H+ G + H++RIIGWG E + + YWL ANS+N +WG NG F+
Sbjct: 258 FLNYKSGVYRHITGQLVTIHSVRIIGWGIE-------NDIPYWLCANSWNEDWGLNGYFK 310
Query: 335 IGCRPYEIPCERYMNGSR 352
I E E ++N +
Sbjct: 311 ILRGSNECEIESFVNAGK 328
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 102/171 (59%), Gaps = 11/171 (6%)
Query: 327 WGENGLFR---IGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLN 382
W +NGL GC PY P C+ + S C P C + C+ GY + Y+ D +
Sbjct: 164 WLKNGLVTGDPTGCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIPYKADKH 223
Query: 383 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 442
+GR+ YSL NE I +EI +GPVE + +++D + YK+G+Y+H+ G + H++RIIG
Sbjct: 224 YGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIG 283
Query: 443 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
WG E + + YWL ANS+N +WG NG F+I+RG NEC IE+ + AG
Sbjct: 284 WGIE-------NDIPYWLCANSWNEDWGLNGYFKILRGSNECEIESFVNAG 327
>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 105/268 (39%), Positives = 157/268 (58%), Gaps = 15/268 (5%)
Query: 73 VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW-PYCPTIQEIRDQGSCGSGWALGAV 131
H + L Q L +++ LP FD+R+ W C ++ E+RDQ +CGS WA GA
Sbjct: 69 AHMGTLLNQKSGVKLEKVNRQANNLPSEFDSRVQWGDKCSSLWEVRDQSNCGSCWAFGAA 128
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
E++SDR CI + +RLS+ +LV+CC +CG GC GG+ A Y+V G+V+G Y +
Sbjct: 129 ESLSDRHCI--HLGQDIRLSTQNLVTCCDECGFGCDGGWPEAAMDYYVNNGLVTGDLYGN 186
Query: 192 KQGCRPYEI-PCERYMNGS-HSSCQDNEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYS 247
C+ Y + PC ++ + C P TP C++ C Y + Y DL+ G AYS
Sbjct: 187 NSWCQAYSLAPCAHHVTSDVYPPCTGELP-TPPCVKSCDSNSTYTIPYPKDLHKGSKAYS 245
Query: 248 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 307
+ NE+ IM EI +GP+E + T+Y D + YK+G+Y+HV G LG HA++++GWG E
Sbjct: 246 IDQNEQAIMTEIQTNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGGHAVKMVGWGVE--- 302
Query: 308 EGTSSVVKYWLVANSFNTNWGENGLFRI 335
GT YW++ NS+N +WG+ G F+I
Sbjct: 303 NGT----PYWIIVNSWNESWGDKGTFKI 326
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 67/162 (41%), Positives = 98/162 (60%), Gaps = 10/162 (6%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPAN 393
C+ Y + PC ++ E TP C++ C Y + Y DL+ G AYS+ N
Sbjct: 190 CQAYSLAPCAHHVTSDVYPPCTGELPTPPCVKSCDSNSTYTIPYPKDLHKGSKAYSIDQN 249
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E+ IM EI +GP+E + T+Y D + YK+G+Y+HV G LG HA++++GWG E GT
Sbjct: 250 EQAIMTEIQTNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGGHAVKMVGWGVE---NGT- 305
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YW++ NS+N +WG+ G F+I+RGQNECGIE++ LP
Sbjct: 306 ---PYWIIVNSWNESWGDKGTFKILRGQNECGIESECVTALP 344
>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
Length = 721
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 116/289 (40%), Positives = 160/289 (55%), Gaps = 41/289 (14%)
Query: 62 LTLSELEMRMGVHPDSKL----------PQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+SE EM+ V DSK P N LP L P FDAR WP C
Sbjct: 42 FEISEEEMKFKVM-DSKFAFPEEQISSEPNNSLP------GSLSRAPTSFDARDYWPNCK 94
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
+I+ IRDQ CGS WA GA E +SDR+CI S G +S +D+++CC + +GCQGGF
Sbjct: 95 SIKMIRDQAYCGSCWAFGAAEVISDRICIQSNGTDQPIISPEDILTCCTN-SHGCQGGFV 153
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD--NEPNTPECIRKCQP 229
+A K+W + G+V+GG + GC PY S+ SC D TP+C +CQ
Sbjct: 154 LEAMKFWKSKGVVTGGDFQG-DGCIPY----------SYGSCSDCHTAQTTPKCKNECQV 202
Query: 230 GYDVS-YEDDLNFGRIAYSLPANE--ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV 286
Y + Y++D +G AY L + TI EI R+GPVE + +Y D YK+G+Y+++
Sbjct: 203 KYTKNEYKEDKYYGSSAYRLSTSNAVRTIQSEILRNGPVEATYQVYEDFYYYKSGVYEYI 262
Query: 287 AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G +G HA++IIGWG E V YWL+ANS+ T +GENG F++
Sbjct: 263 SGRHMGGHAVKIIGWGVE-------ENVNYWLIANSWGTGFGENGFFKM 304
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 70/166 (42%), Positives = 97/166 (58%), Gaps = 18/166 (10%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANE 394
GC PY GS S C + TP+C +CQ Y + Y++D +G AY L +
Sbjct: 175 GCIPYSY-------GSCSDCHTAQ-TTPKCKNECQVKYTKNEYKEDKYYGSSAYRLSTSN 226
Query: 395 --ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
TI EI R+GPVE + +Y D YK+G+Y++++G +G HA++IIGWG E
Sbjct: 227 AVRTIQSEILRNGPVEATYQVYEDFYYYKSGVYEYISGRHMGGHAVKIIGWGVE------ 280
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
V YWL+ANS+ T +GENG F++ RG NECGIE + AG+ K G
Sbjct: 281 -ENVNYWLIANSWGTGFGENGFFKMRRGNNECGIENYVVAGMAKSG 325
>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 202 bits (513), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 102/241 (42%), Positives = 140/241 (58%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W +C TI E+RDQG CGS WA+ A +DR+C+A+ G + LS++++
Sbjct: 88 IPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 147
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ KAWKY+ + GIV+GG Y S +GC PY + PC + G S
Sbjct: 148 FCCHTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQDEEGKSSCAGK 207
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C D+ Y DD F R Y L +I +++ +GP+E S +Y D
Sbjct: 208 PIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLTYG--SIQKDVMNYGPIEASFDVYDDF 265
Query: 276 ILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y+ LG HA+++IGWG E EGT YWL+ NS+N WG+NGLF+
Sbjct: 266 PSYKSGVYQRTPNATKLGGHAVKLIGWGVE---EGT----PYWLMVNSWNAQWGDNGLFK 318
Query: 335 I 335
I
Sbjct: 319 I 319
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 74/186 (39%), Positives = 102/186 (54%), Gaps = 15/186 (8%)
Query: 314 VKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEP--NTPECIRKCQ 370
+K W +S G N GC PY +P C + G +SSC A +P C R C
Sbjct: 163 IKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQDEEG-KSSC-AGKPIEKNHRCTRMCY 220
Query: 371 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG 430
D+ Y DD F R Y L +I +++ +GP+E S +Y D YK+G+Y+
Sbjct: 221 GNQDLDYNDDHRFTRDYYYLTYG--SIQKDVMNYGPIEASFDVYDDFPSYKSGVYQRTPN 278
Query: 431 G-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEAD 489
LG HA+++IGWG E EGT YWL+ NS+N WG+NGLF+I RG +ECGI++
Sbjct: 279 ATKLGGHAVKLIGWGVE---EGTP----YWLMVNSWNAQWGDNGLFKIRRGTDECGIDSA 331
Query: 490 ITAGLP 495
TAG+P
Sbjct: 332 ATAGVP 337
>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 201 bits (512), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 110/290 (37%), Positives = 165/290 (56%), Gaps = 24/290 (8%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + A + S T++E + +GV P K +P++ DP +LP+ FDAR
Sbjct: 55 PNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGVPIVSH--DPSLKLPKAFDARTA 112
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNG 165
WP C +I I DQG CGS WA GAVE++SDR CI + ++ LS +DL++CC CG+G
Sbjct: 113 WPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCI--QFGMNISLSVNDLLACCGFRCGDG 170
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIR 225
C GG+ AW+Y+ +G+V+ + C PY SH C+ P TP+C R
Sbjct: 171 CDGGYPIAAWQYFSYSGVVT-------EECDPYF----DNTGCSHPGCEPAYP-TPKCSR 218
Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 285
KC + + + ++ Y++ +N + IM E++++GPVE S T+Y D YK+G+YKH
Sbjct: 219 KCVSDNKL-WSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKH 277
Query: 286 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ G +G HA+++IGWG GE YWL+AN +N WG++G F I
Sbjct: 278 ITGSNIGGHAVKLIGWGTSSEGE------DYWLMANQWNRGWGDDGYFMI 321
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 93/154 (60%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C RKC + + + ++ Y++ +N + IM E+
Sbjct: 193 CDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNKL-WSESKHYSVSTYTVKSNPQDIMAEV 251
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 252 YKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGE------DYWLM 305
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N WG++G F I RG NECGIE + AGLP
Sbjct: 306 ANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLP 339
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
Length = 302
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 103/246 (41%), Positives = 147/246 (59%), Gaps = 13/246 (5%)
Query: 92 DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLS 151
DPL LP+ FDAR W CP+I + DQG+C S +A+ A+SDR+CI S G +LS
Sbjct: 49 DPLN-LPKSFDARAKWYMCPSIGMVYDQGNCKSSYAISVASAVSDRICIHSNGTVKPKLS 107
Query: 152 SDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH 210
+ ++SCC CG+GC GG H ++W ++ G+VSGG Y S +GC+PY I PC+
Sbjct: 108 AQQILSCCYLCGDGCSGGQHFESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTETAVE 167
Query: 211 SSCQDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
++C + TPEC +C P Y Y D N Y +PA T M+EI+ +GP+ S
Sbjct: 168 NACSNKTLFTPECKVQCYNPDYGTRYVKD-NHQGTHYRVPA--YTAMKEIYENGPITASF 224
Query: 270 TIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 329
+Y D + Y++G+Y + +G + A++I+GWG+E GT YWL ANSFNT WG+
Sbjct: 225 YMYQDFVNYQSGVYAYNSGKYVTTQAVKILGWGEE---NGT----PYWLAANSFNTYWGD 277
Query: 330 NGLFRI 335
NG +I
Sbjct: 278 NGFVKI 283
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/162 (41%), Positives = 93/162 (57%), Gaps = 12/162 (7%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPAN 393
GC+PY I PC+ ++C TPEC +C P Y Y D N Y +PA
Sbjct: 150 GCQPYTIEPCQHTETAVENACSNKTLFTPECKVQCYNPDYGTRYVKD-NHQGTHYRVPA- 207
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
T M+EI+ +GP+ S +Y D + Y++G+Y + +G + A++I+GWG+E GT
Sbjct: 208 -YTAMKEIYENGPITASFYMYQDFVNYQSGVYAYNSGKYVTTQAVKILGWGEE---NGT- 262
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWL ANSFNT WG+NG +I+RG NEC IE + AGLP
Sbjct: 263 ---PYWLAANSFNTYWGDNGFVKILRGANECYIEEFMYAGLP 301
>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
Length = 362
Score = 201 bits (511), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 111/292 (38%), Positives = 167/292 (57%), Gaps = 28/292 (9%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + A + S T++E + +GV P K +P++ D +LP+ FDAR
Sbjct: 58 PDAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGVPIVSH--DRSLKLPKEFDARTA 115
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNG 165
WP C +I I DQG CGS WA GAVE++SDR CI ++ LS +DL++CC CG+G
Sbjct: 116 WPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCI--EFGMNISLSVNDLLACCGFRCGDG 173
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPEC 223
C GG+ AW+Y+ +G+V+ + C PY + C SH C+ P TP+C
Sbjct: 174 CDGGYPIAAWQYFSYSGVVT-------EECDPYFDDTGC------SHPGCEPAYP-TPKC 219
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
+RKC G + + ++ Y++ +N + IM E++++GPVE S T+Y D YK+G+Y
Sbjct: 220 MRKCVSGNQL-WSQSKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVY 278
Query: 284 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
KH+ G +G HA+++IGWG GE YWL+AN +N +WG++G F I
Sbjct: 279 KHITGSNIGGHAVKLIGWGTTDEGE------DYWLLANQWNRSWGDDGYFMI 324
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 95/154 (61%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C+RKC G + + ++ Y++ +N + IM E+
Sbjct: 196 CDPYFDDTGCSHPGCEPAYPTPKCMRKCVSGNQL-WSQSKHYSVSTYTVKSNPQDIMAEV 254
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 255 YKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEGE------DYWLL 308
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N +WG++G F I RG NECGIE + AGLP
Sbjct: 309 ANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLP 342
>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
Length = 343
Score = 201 bits (511), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 114/294 (38%), Positives = 171/294 (58%), Gaps = 32/294 (10%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + A + S T++E + +GV P K +P++ D +LP+ FDAR +
Sbjct: 56 PNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKLLLGVPVVSH--DQSLKLPKSFDARTH 113
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNG 165
WP C +I +I DQG CGS WA GAVE++SDR CI + ++ LS +DL++CC CG+G
Sbjct: 114 WPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCI--QFGMNITLSVNDLLACCGFRCGDG 171
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEP--NTP 221
C GG+ AW+Y+ +G+V+ + C PY + C SH C EP NTP
Sbjct: 172 CDGGYPISAWQYFSYSGVVT-------EECDPYFDQTGC------SHPGC---EPAYNTP 215
Query: 222 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 281
+C+RKC G + + + ++ Y + +N + IM EI+++GPVE S T+Y D YK+G
Sbjct: 216 QCLRKCV-GRNQLWSESKHYSINTYVVESNPQDIMAEIYKNGPVEVSFTVYEDFAHYKSG 274
Query: 282 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+YKH+ G +G HA+++IGWG GE YWL+AN +N +WG++G F I
Sbjct: 275 VYKHITGSNIGGHAVKLIGWGTTDDGE------DYWLLANQWNRSWGDDGYFMI 322
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 96/154 (62%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP NTP+C+RKC G + + + ++ Y + +N + IM EI
Sbjct: 194 CDPYFDQTGCSHPGCEPAYNTPQCLRKCV-GRNQLWSESKHYSINTYVVESNPQDIMAEI 252
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 253 YKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDDGE------DYWLL 306
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N +WG++G F I RG NECGIE + AGLP
Sbjct: 307 ANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLP 340
>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 110/290 (37%), Positives = 167/290 (57%), Gaps = 24/290 (8%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + A + S T++E + +GV P K ++ L + V DP +LP+ FDAR
Sbjct: 55 PNAGWKAAINDRFSNATVAEFKRLLGVKPTPK--KHFLGVPVVSHDPSLKLPKAFDARTA 112
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNG 165
WP C +I +I DQG CGS WA GAVE++SDR CI + ++ LS +DL++CC CG+G
Sbjct: 113 WPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCI--QFGMNISLSVNDLLACCGFRCGDG 170
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIR 225
C GG+ AW+Y+ +G+V+ + C PY SH C+ P TP C+R
Sbjct: 171 CDGGYPIAAWQYFSYSGVVT-------EECDPYF----DNTGCSHPGCEPAYP-TPRCLR 218
Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 285
KC + + + ++ Y++ ++ + IM E++++GPVE S T+Y D YK+G+YKH
Sbjct: 219 KCVSDNKL-WSESKHYSVSTYTVNSSPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKH 277
Query: 286 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ G +G HA+++IGWG GE YWL+AN +N WG++G F I
Sbjct: 278 ITGSNIGGHAVKLIGWGTSNEGE------DYWLMANQWNRGWGDDGYFMI 321
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 93/154 (60%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP C+RKC + + + ++ Y++ ++ + IM E+
Sbjct: 193 CDPYFDNTGCSHPGCEPAYPTPRCLRKCVSDNKL-WSESKHYSVSTYTVNSSPQDIMAEV 251
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 252 YKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSNEGE------DYWLM 305
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N WG++G F I RG NECGIE + AGLP
Sbjct: 306 ANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLP 339
>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
Length = 340
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 102/250 (40%), Positives = 142/250 (56%), Gaps = 11/250 (4%)
Query: 88 VQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRH 147
V + +P FDAR W +C TI E+RDQG CGS WA G A +DR+C+A+ G +
Sbjct: 79 VAYDNHFGHIPRTFDARKKWRHCRTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFN 138
Query: 148 VRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYM 206
LS++++ CC CG GC GG+ KAWKY+ G+V+GG Y S +GC PY + PC R
Sbjct: 139 ELLSAEEITFCCHTCGFGCHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCPRDD 198
Query: 207 NGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVE 266
G+++ C R C D+ Y DD F R Y L +I +++ +GP+E
Sbjct: 199 KGNNTCAGKPIEKNHRCTRMCYGDQDLDYNDDHRFTRDFYYLTYG--SIQKDVMTYGPIE 256
Query: 267 GSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 325
S +Y D YK+G+Y+ LG HA+++IGWG E EGT YWL+ NS+N
Sbjct: 257 ASFDVYDDFPSYKSGVYEKTENASYLGGHAVKLIGWGVE---EGTP----YWLMVNSWNA 309
Query: 326 NWGENGLFRI 335
WG+ GLF+I
Sbjct: 310 QWGDKGLFKI 319
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 65/162 (40%), Positives = 88/162 (54%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C R G+ + C R C D+ Y DD F R Y L
Sbjct: 185 GCEPYRVPPCPRDDKGNNTCAGKPIEKNHRCTRMCYGDQDLDYNDDHRFTRDFYYLTYG- 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
+I +++ +GP+E S +Y D YK+G+Y+ LG HA+++IGWG E EGT
Sbjct: 244 -SIQKDVMTYGPIEASFDVYDDFPSYKSGVYEKTENASYLGGHAVKLIGWGVE---EGTP 299
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWL+ NS+N WG+ GLF+I RG NECGI+ TAG+P
Sbjct: 300 ----YWLMVNSWNAQWGDKGLFKIRRGTNECGIDNSTTAGVP 337
>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 341
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 101/242 (41%), Positives = 145/242 (59%), Gaps = 13/242 (5%)
Query: 96 ELPEGFDARINW-PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+LP FDAR W C ++ E+RDQ +CGS WA GAVE+++DR CI + +RLS+ +
Sbjct: 91 DLPTAFDARQQWGDKCTSLWEVRDQSNCGSCWAFGAVESLTDRHCI--HLGQDIRLSAQN 148
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC 213
+++CC CG GC GG+ A Y+V TG+V+G Y + C+ Y PC +++
Sbjct: 149 MLTCCATCGQGCNGGYPASAMSYYVKTGLVTGDLYNTTGWCQAYSFAPCAHHVDTPLYPA 208
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
E TP+C + C G +Y ++ G AYS+ +E IM EI +GPVE + T+Y
Sbjct: 209 CTGELPTPKCAKTCDSGSGQTYT--VHKGSKAYSVGKTQEAIMTEIQTNGPVEAAFTVYE 266
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D + YK+G+YKHV G LG HAI+I+GWG E + YW+V NS+N WG+NG F
Sbjct: 267 DFLNYKSGVYKHVTGKALGGHAIKIVGWGVE-------NNTPYWIVVNSWNQTWGDNGTF 319
Query: 334 RI 335
+I
Sbjct: 320 KI 321
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 68/160 (42%), Positives = 95/160 (59%), Gaps = 10/160 (6%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
C+ Y PC +++ E TP+C + C G +Y ++ G AYS+ +E
Sbjct: 189 CQAYSFAPCAHHVDTPLYPACTGELPTPKCAKTCDSGSGQTYT--VHKGSKAYSVGKTQE 246
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
IM EI +GPVE + T+Y D + YK+G+YKHV G LG HAI+I+GWG E +
Sbjct: 247 AIMTEIQTNGPVEAAFTVYEDFLNYKSGVYKHVTGKALGGHAIKIVGWGVE-------NN 299
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YW+V NS+N WG+NG F+I+RG+NECGIEA + LP
Sbjct: 300 TPYWIVVNSWNQTWGDNGTFKILRGKNECGIEAQVVTALP 339
>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
Length = 317
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 106/249 (42%), Positives = 147/249 (59%), Gaps = 19/249 (7%)
Query: 89 QLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHV 148
Q++ L +P FDAR WP C +I+ IR+Q +CGS WA GA E MSDR+CIAS G +
Sbjct: 67 QVNTVLPSIPTYFDARTRWPNCRSIKMIRNQATCGSCWAFGAAEVMSDRICIASMGTKQP 126
Query: 149 RLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYM 206
+S DL+SCC + CG GC+G +A+++W G+V+GG Y GC+PY PC
Sbjct: 127 IISPTDLLSCCGNFCGYGCKGASPLQAFRWWNKKGVVTGGDYRG-SGCKPYPFAPC---- 181
Query: 207 NGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVE 266
+ C +E TP C CQP Y +Y D FG AY + + I EI +GPVE
Sbjct: 182 --TALPCTKSE--TPRCSLNCQPAYSKAYSKDKYFGTPAYIVGMDVAAIQTEI-TNGPVE 236
Query: 267 GSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
+ +Y D Y++G+Y+HVAG +G HA++IIGWG + + YWL+ANS+
Sbjct: 237 AAFIVYDDFNHYRSGVYRHVAGKLVGGHAVKIIGWGIQ-------NGAPYWLMANSWGPY 289
Query: 327 WGENGLFRI 335
WGENG F++
Sbjct: 290 WGENGFFKM 298
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 66/161 (40%), Positives = 92/161 (57%), Gaps = 17/161 (10%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY PC ++ + TP C CQP Y +Y D FG AY + +
Sbjct: 172 GCKPYPFAPC--------TALPCTKSETPRCSLNCQPAYSKAYSKDKYFGTPAYIVGMDV 223
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPVE + +Y D Y++G+Y+HVAG +G HA++IIGWG + +
Sbjct: 224 AAIQTEI-TNGPVEAAFIVYDDFNHYRSGVYRHVAGKLVGGHAVKIIGWGIQ-------N 275
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWL+ANS+ WGENG F+++RG +ECGIE+ I AG P
Sbjct: 276 GAPYWLMANSWGPYWGENGFFKMLRGVDECGIESTIVAGKP 316
>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
Length = 374
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 148/244 (60%), Gaps = 19/244 (7%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
E LP+ FD+R WP C +I+ IR+Q +CGS WA GA E +SDR+CI S + +S +D
Sbjct: 95 EPLPDTFDSREQWPECKSIKLIRNQATCGSCWAFGAAEIISDRICIQSNATQTPIISVED 154
Query: 155 LVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSS 212
++SCC CG GCQGG+ +A ++W ++G V+GG Y + GC PY PC++ S
Sbjct: 155 ILSCCGVSCGKGCQGGYSIEALRFWKSSGAVTGGDY-NGAGCMPYSFAPCKK------DS 207
Query: 213 CQDNEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
C + TP C CQ Y + Y D +FG AY + + I EI+ +GPVE S +
Sbjct: 208 CA--QGTTPSCKTTCQSSYKTAEYTKDKHFGTTAYKITNSVAAIQTEIYHNGPVEASFKV 265
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
Y D YK+G+Y++ +G +G HA++IIGWG E + V YWL+ANS+ T +G++G
Sbjct: 266 YEDFYKYKSGVYQYTSGKLVGGHAVKIIGWGTE-------NGVDYWLIANSWGTTFGDSG 318
Query: 332 LFRI 335
F++
Sbjct: 319 FFKM 322
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 68/176 (38%), Positives = 99/176 (56%), Gaps = 17/176 (9%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPAN 393
GC PY PC++ SC + TP C CQ Y + Y D +FG AY + +
Sbjct: 194 GCMPYSFAPCKK------DSCA--QGTTPSCKTTCQSSYKTAEYTKDKHFGTTAYKITNS 245
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
I EI+ +GPVE S +Y D YK+G+Y++ +G +G HA++IIGWG E
Sbjct: 246 VAAIQTEIYHNGPVEASFKVYEDFYKYKSGVYQYTSGKLVGGHAVKIIGWGTE------- 298
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEINLG 509
+ V YWL+ANS+ T +G++G F++ RG NE GIE ++ AG K+G + E + G
Sbjct: 299 NGVDYWLIANSWGTTFGDSGFFKMRRGTNEVGIEGNVVAGTAKLGTHDEKREDDDG 354
>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
Length = 334
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 112/278 (40%), Positives = 158/278 (56%), Gaps = 25/278 (8%)
Query: 59 LSKLTLSELEMRMGVH-PDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIR 117
+++LT ++ MG D+ + R +L PL PE FDA WP CPTI+ I
Sbjct: 55 MARLTRQGVKRLMGAKLRDAPVLPRRHFTEEELRAPL---PESFDAATAWPDCPTIKRIA 111
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQ SCGS WA+ A AMSDR C+ G R + +S+ DL+SCC CG+GC GG+ +AW Y
Sbjct: 112 DQSSCGSCWAVAAATAMSDRFCVTG-GVRDLGISAGDLLSCCTSCGDGCDGGYPDEAWLY 170
Query: 178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNG--SHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ +G+VS C+PY P ++ G + SC D +TP+C C D
Sbjct: 171 FTESGLVS-------DYCQPYPFPPCKHSGGRSKNPSCHDMHFHTPKCNATCT---DKRI 220
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
F +YSL EE RE++ GP E + T+Y D + Y++G+YKHV+GGP+G HA
Sbjct: 221 PVVRYFASESYSL-QGEEDYKRELYLRGPFEVAFTVYEDFLAYESGVYKHVSGGPVGGHA 279
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
+R++GWG+ + V YW +ANS+NT+WGENG
Sbjct: 280 VRVVGWGER-------NGVPYWKIANSWNTDWGENGYL 310
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/169 (40%), Positives = 97/169 (57%), Gaps = 13/169 (7%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRS--SCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E+GL C+PY P ++ G SC +TP+C C D F
Sbjct: 173 ESGLVSDYCQPYPFPPCKHSGGRSKNPSCHDMHFHTPKCNATCT---DKRIPVVRYFASE 229
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+YSL EE RE++ GP E + T+Y D + Y++G+YKHV+GGP+G HA+R++GWG+
Sbjct: 230 SYSL-QGEEDYKRELYLRGPFEVAFTVYEDFLAYESGVYKHVSGGPVGGHAVRVVGWGER 288
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ V YW +ANS+NT+WGENG RG++ECGIE+ +AG P
Sbjct: 289 -------NGVPYWKIANSWNTDWGENGYLYFYRGKDECGIESQGSAGTP 330
>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
Length = 296
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 107/255 (41%), Positives = 145/255 (56%), Gaps = 54/255 (21%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+ P V ++ L+ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 67 KPPQRVMFTEDLK-LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI- 200
V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GCRPY I
Sbjct: 126 NAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIP 185
Query: 201 PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
PCE ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIY 244
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
++G YWLVA
Sbjct: 245 KNG--------------------------------------------------TPYWLVA 254
Query: 321 NSFNTNWGENGLFRI 335
NS+NT+WG+NG F+I
Sbjct: 255 NSWNTDWGDNGFFKI 269
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/169 (35%), Positives = 90/169 (53%), Gaps = 54/169 (31%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +
Sbjct: 171 GLYDSHVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
YS+ +E+ IM EI+++G
Sbjct: 230 YSVSNSEKDIMAEIYKNG------------------------------------------ 247
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 248 --------TPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 288
>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
Length = 194
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 98/202 (48%), Positives = 136/202 (67%), Gaps = 10/202 (4%)
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVT 180
CGS WA GAVEA+SDR CI + G+ +V +S++DL++CC CG+GC GG+ AW +W
Sbjct: 1 CGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTK 60
Query: 181 TGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
G+VSGG Y S GC PY IP CE ++NGS E +TP C + C+ GY SY++D
Sbjct: 61 KGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPP-MHGEGDTPRCNKSCEAGYSPSYKEDK 119
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
+FG +YS+ + + IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+
Sbjct: 120 HFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRIL 179
Query: 300 GWGQEPLGEGTSSVVKYWLVAN 321
GWG E + V YWL AN
Sbjct: 180 GWGVE-------NGVPYWLAAN 194
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 62/131 (47%), Positives = 87/131 (66%), Gaps = 9/131 (6%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
IGC PY IP CE ++NGSR E +TP C + C+ GY SY++D +FG +YS+
Sbjct: 72 HIGCLPYTIPPCEHHVNGSRPPMHG-EGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSN 130
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + IM EI+++GPVEG+ T+++D + YK+G+YKH AG +G HAIRI+GWG E
Sbjct: 131 SVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVE------ 184
Query: 453 SSVVKYWLVAN 463
+ V YWL AN
Sbjct: 185 -NGVPYWLAAN 194
>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
Cathepsin B
Length = 205
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 95/190 (50%), Positives = 136/190 (71%), Gaps = 10/190 (5%)
Query: 148 VRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERY 205
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP CE +
Sbjct: 3 VEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHH 62
Query: 206 MNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPV 265
+NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GPV
Sbjct: 63 VNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV 121
Query: 266 EGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 325
EG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS+NT
Sbjct: 122 EGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANSWNT 174
Query: 326 NWGENGLFRI 335
+WG+NG F+I
Sbjct: 175 DWGDNGFFKI 184
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 48 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 106
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 107 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 163
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 164 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 203
>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
Length = 209
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 95/190 (50%), Positives = 136/190 (71%), Gaps = 10/190 (5%)
Query: 148 VRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERY 205
V +S++DL++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP CE +
Sbjct: 1 VEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHH 60
Query: 206 MNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPV 265
+NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GPV
Sbjct: 61 VNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV 119
Query: 266 EGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 325
EG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS+NT
Sbjct: 120 EGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANSWNT 172
Query: 326 NWGENGLFRI 335
+WG+NG F+I
Sbjct: 173 DWGDNGFFKI 182
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 46 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 104
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 105 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 161
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 162 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 201
>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
Length = 350
Score = 198 bits (504), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 113/282 (40%), Positives = 166/282 (58%), Gaps = 31/282 (10%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPL-EELPEGFDARINWPYCPTIQEIR 117
S T+ + + +GV P P+N L + ++ P LP+ FDAR WP C ++Q I
Sbjct: 60 FSNHTVGQFKRLLGVLP---TPRNFLENVPVITYPKGMNLPKQFDAREAWPQCTSVQTIL 116
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWK 176
DQG CGS WA GAVEA+SDR CI K +V LS +DLV+CC CG+GC GG+ AW+
Sbjct: 117 DQGHCGSCWAFGAVEALSDRFCI--HHKVNVTLSENDLVACCGFMCGDGCDGGYPISAWQ 174
Query: 177 YWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+++TG+V+ C PY + C+ H C+ P TP+C+++C+ +
Sbjct: 175 YFISTGVVTA-------ECDPYFDDAGCQ------HPGCEPLYP-TPQCVKQCK-DENQK 219
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
+ + F AY + + IM E++ +GPVE S ++Y D YK+G+YK+ G +G H
Sbjct: 220 WGNSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGH 279
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
A++++GWG E +GT YWLVANS+NT WGE+G F+I
Sbjct: 280 AVKLVGWGTE---DGTD----YWLVANSWNTAWGEDGYFKIA 314
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 95/154 (61%), Gaps = 10/154 (6%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + EP TP+C+++C+ + + + F AY + + IM E+
Sbjct: 186 CDPYFDDAGCQHPGCEPLYPTPQCVKQCK-DENQKWGNSKRFSATAYRISSKPYDIMAEV 244
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+ +GPVE S ++Y D YK+G+YK+ G +G HA++++GWG E +GT YWLV
Sbjct: 245 YTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGTE---DGTD----YWLV 297
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
ANS+NT WGE+G F+I RG NECGIE D+ AG+P
Sbjct: 298 ANSWNTAWGEDGYFKIARGSNECGIEGDVVAGMP 331
>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
Length = 350
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 113/282 (40%), Positives = 166/282 (58%), Gaps = 31/282 (10%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPTIQEIR 117
S T+ + + +GV P P+N L + ++ P LP+ FDAR WP C ++Q I
Sbjct: 60 FSNHTVGQFKRLLGVLP---TPRNFLENVPVITYPKGINLPKQFDAREAWPQCTSVQTIL 116
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWK 176
DQG CGS WA GAVEA+SDR CI K +V LS +DLV+CC CG+GC GG+ AW+
Sbjct: 117 DQGHCGSCWAFGAVEALSDRFCI--HHKVNVTLSENDLVACCGFMCGDGCDGGYPISAWQ 174
Query: 177 YWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+++TG+V+ C PY + C+ H C+ P TP+C+++C+ +
Sbjct: 175 YFISTGVVTA-------ECDPYFDDAGCQ------HPGCEPLYP-TPQCVKQCK-DENQK 219
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
+ + F AY + + IM E++ +GPVE S ++Y D YK+G+YK+ G +G H
Sbjct: 220 WGNSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGH 279
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
A++++GWG E +GT YWLVANS+NT WGE+G F+I
Sbjct: 280 AVKLVGWGTE---DGTD----YWLVANSWNTAWGEDGYFKIA 314
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 95/154 (61%), Gaps = 10/154 (6%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + EP TP+C+++C+ + + + F AY + + IM E+
Sbjct: 186 CDPYFDDAGCQHPGCEPLYPTPQCVKQCK-DENQKWGNSKRFSATAYRISSKPYDIMAEV 244
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+ +GPVE S ++Y D YK+G+YK+ G +G HA++++GWG E +GT YWLV
Sbjct: 245 YTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGTE---DGTD----YWLV 297
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
ANS+NT WGE+G F+I RG NECGIE D+ AG+P
Sbjct: 298 ANSWNTAWGEDGYFKIARGSNECGIEGDVVAGMP 331
>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
Length = 350
Score = 197 bits (502), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 116/294 (39%), Positives = 167/294 (56%), Gaps = 31/294 (10%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARI 105
PK + + S T+ + + +GV P P+N L + + P LP+ FDAR
Sbjct: 48 PKAGWKAGMNSRFSNHTVGQFKRLLGVLP---TPRNLLENVPVRTYPKGLNLPKQFDARK 104
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGN 164
WP C +++ I DQG CGS WA GAVEA+SDR CI K +V LS +DLV+CC CG+
Sbjct: 105 AWPQCTSVRTILDQGHCGSCWAFGAVEALSDRFCI--HYKVNVTLSENDLVACCGFRCGD 162
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPE 222
GC GG+ AW+Y+++TG+V+ C PY E C+ H C+ P TP+
Sbjct: 163 GCDGGYPLSAWQYFISTGVVTA-------ECDPYFDEAGCQ------HPGCEPLYP-TPQ 208
Query: 223 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
C+++C+ + ++ + F AY + + IM E++ GPVE +Y D YK+G+
Sbjct: 209 CVKQCK-DENQNWGNSKRFSATAYRITSKPYDIMAEVYTKGPVEVDFLVYEDFAHYKSGV 267
Query: 283 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
YK++ G LG HA+++IGWG E GT YWLVANS+NT WGE+G F+I
Sbjct: 268 YKYITGDFLGGHAVKLIGWGTE---NGTD----YWLVANSWNTAWGEDGYFKIA 314
Score = 125 bits (314), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 67/168 (39%), Positives = 96/168 (57%), Gaps = 17/168 (10%)
Query: 330 NGLFRIGCRPY--EIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
G+ C PY E C+ C+ P TP+C+++C+ + ++ + F A
Sbjct: 179 TGVVTAECDPYFDEAGCQH------PGCEPLYP-TPQCVKQCK-DENQNWGNSKRFSATA 230
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
Y + + IM E++ GPVE +Y D YK+G+YK++ G LG HA+++IGWG E
Sbjct: 231 YRITSKPYDIMAEVYTKGPVEVDFLVYEDFAHYKSGVYKYITGDFLGGHAVKLIGWGTE- 289
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT YWLVANS+NT WGE+G F+I RG NEC IE D+ AG+P
Sbjct: 290 --NGTD----YWLVANSWNTAWGEDGYFKIARGSNECSIEEDVVAGMP 331
>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
Length = 396
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 111/287 (38%), Positives = 162/287 (56%), Gaps = 24/287 (8%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
+ AE +++S + +++R P+S+ + L + L +LP FD+R+ WP C
Sbjct: 44 WKAEYSSIS-MKAKTMDVRFAEVPESEKSEKSDDLEFET---LIQLPTAFDSRVQWPNCN 99
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGF 170
+I+ IRDQ CGS WA A E +SDR+CI S G + +S +D++SCC C NGCQGG+
Sbjct: 100 SIKLIRDQTYCGSCWAFAAAEIISDRICIQSNGTQQPIISPEDILSCCGSSCNNGCQGGY 159
Query: 171 HGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQP 229
+A KYW+ +G+V+GG Y GC PY PC S+C++ + + P C CQ
Sbjct: 160 TIEAMKYWMNSGVVTGGDYQGA-GCIPYSFRPC--------STCKEPK-DAPSCKTTCQA 209
Query: 230 GYDVSYEDDLNFGRIAYSLPANE-ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG 288
Y L + ++ AN + I EI+ +GPVE + +Y D YK+G+Y HV G
Sbjct: 210 SYKAKSAYRLPTTTSSNAIVANAVQMIQTEIYNNGPVEVAYQVYDDFYHYKSGVYYHVYG 269
Query: 289 GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA++IIGWG E V YWLVANS++T +GENG F+I
Sbjct: 270 DKPSGHAVKIIGWGTEKK-------VDYWLVANSWSTTFGENGFFKI 309
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 76/191 (39%), Positives = 102/191 (53%), Gaps = 26/191 (13%)
Query: 310 TSSVVKYWLVANSFNTNWGENGLFR-IGCRPYEI-PCERYMNGSRSSCQANEP-NTPECI 366
T +KYW+ N+ G ++ GC PY PC S+C+ EP + P C
Sbjct: 160 TIEAMKYWM-----NSGVVTGGDYQGAGCIPYSFRPC--------STCK--EPKDAPSCK 204
Query: 367 RKCQPGYDVSYEDDLNFGRIAYSLPANE-ETIMREIFRHGPVEGSMTIYADMILYKTGIY 425
CQ Y L + ++ AN + I EI+ +GPVE + +Y D YK+G+Y
Sbjct: 205 TTCQASYKAKSAYRLPTTTSSNAIVANAVQMIQTEIYNNGPVEVAYQVYDDFYHYKSGVY 264
Query: 426 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECG 485
HV G HA++IIGWG E V YWLVANS++T +GENG F+I RG NECG
Sbjct: 265 YHVYGDKPSGHAVKIIGWGTEKK-------VDYWLVANSWSTTFGENGFFKIRRGTNECG 317
Query: 486 IEADITAGLPK 496
IE ++ AGLPK
Sbjct: 318 IEENVVAGLPK 328
>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 341
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 106/259 (40%), Positives = 154/259 (59%), Gaps = 11/259 (4%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
++ P+ FDAR WP C +I IRDQ +CGS WA+ + EAMSD +C+ S V +S D
Sbjct: 86 DDPPDSFDARTQWPECRSIGTIRDQSACGSCWAVSSAEAMSDEICVQSNSTIKVMISDTD 145
Query: 155 LVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMN-GSHS 211
++SCC DCG GCQGG+ +A+++ G+V+GG Y + C+PY PC ++ + +
Sbjct: 146 ILSCCGLDCGYGCQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPYSFYPCGQHKDVPYYG 205
Query: 212 SCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
C TP+C + Q Y+ +Y++D +F +YSLP NE +I +EI+++GPV + +
Sbjct: 206 PCPGGLWPTPKCRKSSQRKYNKTYQEDKHFATRSYSLPNNERSIRQEIYKNGPVVAAFKV 265
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
Y D GIY H G G HA ++IGWG+E GT YWL+ANS+NT+WGE+G
Sbjct: 266 YEDYSS-TGGIYVHKWGIQTGAHADKVIGWGRE---NGTD----YWLIANSWNTDWGEDG 317
Query: 332 LFRIGCRPYEIPCERYMNG 350
+RI ER M G
Sbjct: 318 YYRIVRETDNCEIERQMVG 336
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 94/166 (56%), Gaps = 10/166 (6%)
Query: 334 RIGCRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
R C+PY PC ++ + C TP+C + Q Y+ +Y++D +F +YSLP
Sbjct: 184 RDVCKPYSFYPCGQHKDVPYYGPCPGGLWPTPKCRKSSQRKYNKTYQEDKHFATRSYSLP 243
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
NE +I +EI+++GPV + +Y D GIY H G G HA ++IGWG+E G
Sbjct: 244 NNERSIRQEIYKNGPVVAAFKVYEDYSS-TGGIYVHKWGIQTGAHADKVIGWGRE---NG 299
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
T YWL+ANS+NT+WGE+G +RIVR + C IE + ++
Sbjct: 300 TD----YWLIANSWNTDWGEDGYYRIVRETDNCEIERQMVGEFMRV 341
>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
Length = 348
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 113/294 (38%), Positives = 166/294 (56%), Gaps = 32/294 (10%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + L+ T+ + + +GV P P + + EELP+ FDAR
Sbjct: 46 PNAGWTAGHNAYLANYTIEQFKHILGVKPTP--PGLLAGVPTKTYSKSEELPKQFDARSK 103
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNG 165
W C TI I DQG CGS WA GAVE + DR CI ++ LS++DLV+CC CG+G
Sbjct: 104 WSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHQN--INISLSANDLVACCGFMCGDG 161
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEP--NTP 221
C GG+ KAW+Y+V +G+V+ + C PY ++ C+ H C EP +TP
Sbjct: 162 CDGGYPIKAWQYFVQSGVVT-------EECDPYFDQVGCK------HPGC---EPAYDTP 205
Query: 222 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 281
+C +KC+ V +E+ +F AY + ++ IM E++++GPVE + T+Y D YK+G
Sbjct: 206 KCEKKCKVQNQV-WEEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSG 264
Query: 282 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+YKHV GG +G HA+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 265 VYKHVTGGVMGGHAVKLIGWGTSDAGE------DYWLLANQWNRGWGDDGYFKI 312
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 98/154 (63%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + EP +TP+C +KC+ V +E+ +F AY + ++ IM E+
Sbjct: 184 CDPYFDQVGCKHPGCEPAYDTPKCEKKCKVQNQV-WEEKKHFSINAYRVNSDPHDIMAEV 242
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKHV GG +G HA+++IGWG GE YWL+
Sbjct: 243 YKNGPVEVAFTVYEDFAHYKSGVYKHVTGGVMGGHAVKLIGWGTSDAGE------DYWLL 296
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N WG++G F+I+RG+NECGIE ++ AG+P
Sbjct: 297 ANQWNRGWGDDGYFKIIRGKNECGIEEEVVAGMP 330
>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/280 (38%), Positives = 159/280 (56%), Gaps = 28/280 (10%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
S T+S+ + +GV K R P++ + ELP+ FDAR WP C +I +I D
Sbjct: 60 FSDFTVSQFKRLLGVKKAPKSLLKRTPVVTHSKEI--ELPKTFDARTAWPQCLSIADILD 117
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CGS WA GAVE+++DR CI +V LS +DL++CC CG GC GG+ AW+Y
Sbjct: 118 QGHCGSCWAFGAVESLTDRFCI--HYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQY 175
Query: 178 WVTTGIVSG--GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ TG+V+ Y + GC SH C+ P TP C +KC ++ +
Sbjct: 176 FKRTGVVTSECDPYFDQTGC-------------SHPGCEPAYP-TPACEKKCVKK-NLLW 220
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
+ +F AY + +++ +IM E++ +GP E S T+Y D YK+G+YKHV G +G HA
Sbjct: 221 SESKHFSVNAYRVNSDQHSIMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHA 280
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+++IGWG GE YWL+AN +N +WG++G F+I
Sbjct: 281 VKLIGWGTSEDGE------DYWLLANQWNRSWGDDGYFKI 314
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 67/163 (41%), Positives = 102/163 (62%), Gaps = 11/163 (6%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP C +KC ++ + + +F AY + +++ +IM E+
Sbjct: 186 CDPYFDQTGCSHPGCEPAYPTPACEKKCVKK-NLLWSESKHFSVNAYRVNSDQHSIMTEV 244
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+ +GP E S T+Y D YK+G+YKHV G +G HA+++IGWG GE YWL+
Sbjct: 245 YTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGE------DYWLL 298
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI-GLEIDS 503
AN +N +WG++G F+I+RG NECGIE D+TAG+P L+I+S
Sbjct: 299 ANQWNRSWGDDGYFKIIRGTNECGIE-DVTAGMPSTKNLDIES 340
>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
Length = 305
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 109/280 (38%), Positives = 160/280 (57%), Gaps = 28/280 (10%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
L+ T+ + + +GV P P R + + E+LP+ FDAR W C TI +I D
Sbjct: 21 LANYTIEQFKHMLGVKPTP--PGLRAAVRTKTHSRSEQLPKVFDARSKWSGCSTIGKILD 78
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CGS WA GAVE + DR CI ++ LS++DLV+CC CG+GC GG+ AW+Y
Sbjct: 79 QGHCGSCWAFGAVECLQDRFCI--HHNMNITLSANDLVACCGFMCGDGCDGGYPISAWQY 136
Query: 178 WVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+V G+V+ C PY ++ C+ H C+ P TP C +KC+ V +
Sbjct: 137 FVQNGVVT-------DECDPYFDQVGCK------HPGCEPAYP-TPVCEKKCKVQNQV-W 181
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
E+ +F AY + ++ IM E++ +GPVE + T+Y D YK+G+YKH+ GG +G HA
Sbjct: 182 EEKKHFSINAYQVNSDPHDIMAEVYNNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHA 241
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 242 VKLIGWGTSDAGE------DYWLLANQWNRGWGDDGYFKI 275
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 104/169 (61%), Gaps = 16/169 (9%)
Query: 329 ENGLFRIGCRPY--EIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
+NG+ C PY ++ C+ C+ P TP C +KC+ V +E+ +F
Sbjct: 139 QNGVVTDECDPYFDQVGCKH------PGCEPAYP-TPVCEKKCKVQNQV-WEEKKHFSIN 190
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
AY + ++ IM E++ +GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG
Sbjct: 191 AYQVNSDPHDIMAEVYNNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTS 250
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GE YWL+AN +N WG++G F+I+RG+NECGIE D+TAG+P
Sbjct: 251 DAGE------DYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMP 293
>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
Length = 350
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 118/308 (38%), Positives = 165/308 (53%), Gaps = 33/308 (10%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLL-VQLS 91
L K+F V+H P + A S T+ E +GV P PQ L + V++
Sbjct: 35 LQKSF--VEHINKHPNAGWKAAMSTRFSNYTVREFAHLLGVLP---TPQKLLETVPVRVY 89
Query: 92 DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLS 151
+LP FDAR WP+C + + I DQG CGS WA AVEA+SDR CI + + LS
Sbjct: 90 PKGLKLPSKFDARKAWPHCTSTRSILDQGHCGSCWAFAAVEALSDRFCI--HFQVNATLS 147
Query: 152 SDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSG--GTYASKQGCRPYEIPCERYMNG 208
+DLV+CC CG+GC GGF AW+Y+ G+V+ Y GC
Sbjct: 148 ENDLVACCGFRCGSGCNGGFPLSAWRYFSRRGVVTDECDPYFDNDGC------------- 194
Query: 209 SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
+H C+ + P TP C++ C+ S+ ++ AY + ++ IM E+F +GPVE S
Sbjct: 195 NHPGCEPSYP-TPRCVKNCKDNQRWSHSK--HYSANAYRIKSDPYNIMAEVFNNGPVEVS 251
Query: 269 MTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
++Y D Y+TG+YKHV G LG HA+++IGWG T + YWL+ANS+NT WG
Sbjct: 252 FSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGT------TDDGIDYWLIANSWNTAWG 305
Query: 329 ENGLFRIG 336
E G F+I
Sbjct: 306 EGGYFKIA 313
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 68/169 (40%), Positives = 96/169 (56%), Gaps = 10/169 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + EP+ TP C++ C+ S+ ++ AY + ++ IM E+
Sbjct: 185 CDPYFDNDGCNHPGCEPSYPTPRCVKNCKDNQRWSHSK--HYSANAYRIKSDPYNIMAEV 242
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
F +GPVE S ++Y D Y+TG+YKHV G LG HA+++IGWG T + YWL+
Sbjct: 243 FNNGPVEVSFSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGT------TDDGIDYWLI 296
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEINLGK 510
ANS+NT WGE G F+I RG NECGIE D AG+P I +GK
Sbjct: 297 ANSWNTAWGEGGYFKIARGVNECGIERDPVAGMPSAKNLIQDPTDQIGK 345
>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
Length = 357
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 114/283 (40%), Positives = 161/283 (56%), Gaps = 34/283 (12%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPTIQEIR 117
S T+++ + +GV P K P++ S P +LP+ FDAR W C TI I
Sbjct: 65 FSNYTVAQFKRLLGVKPSPKKELRSTPVV---SHPRSLKLPKSFDARTAWSQCSTIGRIL 121
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWK 176
DQG CGS WA GAVE++SDR CI +V LS +DL++CC CG+GC GG+ AW+
Sbjct: 122 DQGHCGSCWAFGAVESLSDRFCI--HLDVNVSLSVNDLLACCGFLCGSGCDGGYPLYAWR 179
Query: 177 YWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEP--NTPECIRKCQPGYD 232
Y G+V+ + C PY +I C SH C EP TP+C+RKC G
Sbjct: 180 YLAHHGVVT-------EECDPYFDQIGC------SHPGC---EPAYQTPKCVRKCVKGNQ 223
Query: 233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 292
+ ++ F AYS+ ++ IM E++++GPVE + T+Y D YK+G+YKH+ G LG
Sbjct: 224 I-WKKSKYFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLG 282
Query: 293 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA+++IGWG GE YWL+AN +N +WG++G F I
Sbjct: 283 GHAVKLIGWGTTDEGE------DYWLIANQWNRSWGDDGYFMI 319
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 96/154 (62%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C+RKC G + ++ F AYS+ ++ IM E+
Sbjct: 191 CDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQI-WKKSKYFSVNAYSVKSDPYDIMAEV 249
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ G LG HA+++IGWG GE YWL+
Sbjct: 250 YKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGE------DYWLI 303
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N +WG++G F I RG NECGIE D+TAGLP
Sbjct: 304 ANQWNRSWGDDGYFMIRRGTNECGIEEDVTAGLP 337
>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
Length = 331
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 111/277 (40%), Positives = 159/277 (57%), Gaps = 31/277 (11%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPTIQEIRDQGS 121
T+ + + GV P S + PL S P +LP+ FDAR WP C +I+ I DQG
Sbjct: 55 TVRDFKRLCGVLPKSS--EEVQPLRPLRSHPRTLDLPKHFDAREAWPQCASIKTILDQGH 112
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEA++DR CI + +V LS +DLV+CC CG GC+GG+ AW+Y+ T
Sbjct: 113 CGSCWAFGAVEALTDRFCILN--NENVSLSENDLVACCSSCGFGCEGGYPYAAWEYFAQT 170
Query: 182 GIVSGGTYASKQGCRPYEIPCERYMNGS---HSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
G+V+ C+ Y +G H C+ E +TP C+++C + + D
Sbjct: 171 GVVTS--------------QCDPYFDGKGCKHPGCEP-EYDTPVCVKQCVD--NEQWRDS 213
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
+F Y++ ++ I EI+++GPVE S T+Y D YK+G+YKHV G LG HA++
Sbjct: 214 KHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGQVLGGHAVKF 273
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IGWG G+ YW+VANS+N +WGE+G F+I
Sbjct: 274 IGWGTTDDGK------DYWIVANSWNRSWGEDGFFQI 304
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 94/154 (61%), Gaps = 10/154 (6%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y +G EP +TP C+++C + + D +F Y++ ++ I EI
Sbjct: 177 CDPYFDGKGCKHPGCEPEYDTPVCVKQCVD--NEQWRDSKHFTVQTYAVNSDIYDIQAEI 234
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKHV G LG HA++ IGWG G+ YW+V
Sbjct: 235 YKNGPVEVSYTVYEDFAHYKSGVYKHVFGQVLGGHAVKFIGWGTTDDGK------DYWIV 288
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
ANS+N +WGE+G F+I RG NECGIE++ AG+P
Sbjct: 289 ANSWNRSWGEDGFFQISRGSNECGIESEPVAGIP 322
>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 271
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 97/211 (45%), Positives = 133/211 (63%), Gaps = 8/211 (3%)
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A GAVE+MSDR+CI S+ K V LS+ +L+SCC CG GC+GG G AW YW GIV+G
Sbjct: 45 AFGAVESMSDRICIHSKNKISVELSAINLLSCCTRCGFGCRGGIPGMAWDYWKYEGIVTG 104
Query: 187 GTYASKQGCRPYEIP-CERYMNG-SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRI 244
G+ + GC+PY P C + + S+ C+ TPEC CQ Y Y+ D +G+
Sbjct: 105 GSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQDDYGKPYKKDKFYGKS 164
Query: 245 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
+Y++ + E +IM+EI +GPVEG +Y D + YK+G+YKH+ G LG HAIRIIGWG +
Sbjct: 165 SYNVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGIQ 224
Query: 305 PLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ + YWL ANS+N WG+ G F+I
Sbjct: 225 ------QNHIPYWLCANSWNNQWGDQGYFKI 249
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 74/166 (44%), Positives = 105/166 (63%), Gaps = 8/166 (4%)
Query: 334 RIGCRPYEIPCERYMNGSRS--SCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
GC+PY P + + S+S C++ TPEC CQ Y Y+ D +G+ +Y++
Sbjct: 110 HTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQDDYGKPYKKDKFYGKSSYNVA 169
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+ E +IM+EI +GPVEG +Y D + YK+G+YKH+ G LG HAIRIIGWG +
Sbjct: 170 SEEISIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGIQ----- 224
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ + YWL ANS+N WG+ G F+I+RG NECGIE+ +TAGLP +
Sbjct: 225 -QNHIPYWLCANSWNNQWGDQGYFKILRGTNECGIESMVTAGLPNL 269
>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 108/292 (36%), Positives = 166/292 (56%), Gaps = 28/292 (9%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + A + + T++E + +GV P K +P++ D +LP+ FDAR
Sbjct: 56 PNAGWKAAFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSH--DISLKLPKEFDARTA 113
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNG 165
W C ++ I DQG CGS WA GAVE++SDR CI + ++ LS +DL++CC CG G
Sbjct: 114 WSQCTSVGRILDQGHCGSCWAFGAVESLSDRFCI--KYNMNISLSVNDLLACCGFLCGQG 171
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPEC 223
C GG+ AW+Y+ G+V+ + C PY C SH C+ P TP+C
Sbjct: 172 CNGGYPIAAWRYFKHHGVVT-------EECDPYFDNTGC------SHPGCEPAYP-TPKC 217
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
RKC G + + + ++G AY + ++ + IM E++++GPVE + T+Y D YK+G+Y
Sbjct: 218 ARKCVSGNQL-WRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY 276
Query: 284 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
KH+ G +G HA+++IGWG GE YWL+AN +N +WG++G F+I
Sbjct: 277 KHITGTNIGGHAVKLIGWGTSDDGE------DYWLLANQWNRSWGDDGYFKI 322
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 97/154 (62%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C RKC G + + + ++G AY + ++ + IM E+
Sbjct: 194 CDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQL-WRESKHYGVSAYKVRSHPDDIMAEV 252
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 253 YKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGE------DYWLL 306
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N +WG++G F+I RG NECGIE + AGLP
Sbjct: 307 ANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLP 340
>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 362
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 109/292 (37%), Positives = 166/292 (56%), Gaps = 28/292 (9%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + + + + T++E + +GV P K +P++ D +LP+ FDAR
Sbjct: 58 PNAGWKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSH--DISLKLPKEFDARTA 115
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNG 165
W C +I I DQG CGS WA GAVE++SDR CI + +V LS +DL++CC CG G
Sbjct: 116 WSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCI--KYNMNVSLSVNDLLACCGFLCGQG 173
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPEC 223
C GG+ AW+Y+ G+V+ + C PY C SH C+ P TP+C
Sbjct: 174 CNGGYPIAAWRYFKHHGVVT-------EECDPYFDNTGC------SHPGCEPAYP-TPKC 219
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
RKC G + + + ++G AY + ++ + IM E++++GPVE + T+Y D YK+G+Y
Sbjct: 220 ARKCVSGNQL-WRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY 278
Query: 284 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
KH+ G +G HA+++IGWG GE YWL+AN +N +WG++G F+I
Sbjct: 279 KHITGTNIGGHAVKLIGWGTSDDGE------DYWLLANQWNRSWGDDGYFKI 324
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 97/154 (62%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C RKC G + + + ++G AY + ++ + IM E+
Sbjct: 196 CDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQL-WRESKHYGVSAYKVRSHPDDIMAEV 254
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 255 YKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGE------DYWLL 308
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N +WG++G F+I RG NECGIE + AGLP
Sbjct: 309 ANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLP 342
>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 105/263 (39%), Positives = 149/263 (56%), Gaps = 19/263 (7%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P+ + + LE LP F A+ WP CP+I+ I DQG+CGS WA+ A MSDR+CIAS
Sbjct: 59 PVEYKYHEKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQ 118
Query: 145 KRHVRLSSDDLVSCCK-----DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE 199
++S++DL+SCC D GC GG+ AWKY GIV+GGTY C+PY
Sbjct: 119 TDKRQISAEDLLSCCGINCELDGNGGCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYS 178
Query: 200 IPCERYMN--GSHSSCQDN----EPNTPECIRKCQPGYDVSYE-DDLNFGRIAYSLPANE 252
P + N G +S C+++ TP C +KC P + +Y+ D + Y L ++
Sbjct: 179 FPPCSHGNDSGKYSKCENDFFMLTEVTPSCTKKCHPQFSRTYDVDKIRSRENPYKLIKDQ 238
Query: 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 312
E I EI+ +GPV+ T++ D + YK+G+Y+ G G+HA++IIGW GT +
Sbjct: 239 EQIKNEIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGW-------GTEN 291
Query: 313 VVKYWLVANSFNTNWGENGLFRI 335
V YW NS+N WG NG F+I
Sbjct: 292 GVPYWEAINSWNDGWGINGKFKI 314
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 89/165 (53%), Gaps = 14/165 (8%)
Query: 337 CRPYEIPCERYMN--GSRSSCQAN----EPNTPECIRKCQPGYDVSYE-DDLNFGRIAYS 389
C+PY P + N G S C+ + TP C +KC P + +Y+ D + Y
Sbjct: 174 CKPYSFPPCSHGNDSGKYSKCENDFFMLTEVTPSCTKKCHPQFSRTYDVDKIRSRENPYK 233
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
L ++E I EI+ +GPV+ T++ D + YK+G+Y+ G G+HA++IIGW
Sbjct: 234 LIKDQEQIKNEIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGW------ 287
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
GT + V YW NS+N WG NG F+I+RG N IE ++ A +
Sbjct: 288 -GTENGVPYWEAINSWNDGWGINGKFKILRGFNHLDIEGEVYASI 331
>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
E=1.3e-79, N=1) [Arabidopsis thaliana]
gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 196 bits (497), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 108/290 (37%), Positives = 163/290 (56%), Gaps = 24/290 (8%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + A + S T++E + +GV P K +P++ DP +LP+ FDAR
Sbjct: 55 PNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGVPIVSH--DPSLKLPKAFDARTA 112
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNG 165
WP C +I I G CGS WA GAVE++SDR CI + ++ LS +DL++CC CG+G
Sbjct: 113 WPQCTSIGNILGLGHCGSCWAFGAVESLSDRFCI--QFGMNISLSVNDLLACCGFRCGDG 170
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIR 225
C GG+ AW+Y+ +G+V+ + C PY SH C+ P TP+C R
Sbjct: 171 CDGGYPIAAWQYFSYSGVVT-------EECDPYF----DNTGCSHPGCEPAYP-TPKCSR 218
Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 285
KC + + + ++ Y++ +N + IM E++++GPVE S T+Y D YK+G+YKH
Sbjct: 219 KCVSDNKL-WSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKH 277
Query: 286 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ G +G HA+++IGWG GE YWL+AN +N WG++G F I
Sbjct: 278 ITGSNIGGHAVKLIGWGTSSEGE------DYWLMANQWNRGWGDDGYFMI 321
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 93/154 (60%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C RKC + + + ++ Y++ +N + IM E+
Sbjct: 193 CDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNKL-WSESKHYSVSTYTVKSNPQDIMAEV 251
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 252 YKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGE------DYWLM 305
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N WG++G F I RG NECGIE + AGLP
Sbjct: 306 ANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLP 339
>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
Length = 372
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 117/309 (37%), Positives = 168/309 (54%), Gaps = 57/309 (18%)
Query: 64 LSELEMRMGVH--PDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+SELEM+ V S++ PL VQ +P FDAR +WP C +I+ IR+Q
Sbjct: 45 ISELEMKFKVMDLKFSEISPKDEPLTVQGV----YVPISFDARDHWPNCKSIKLIRNQAY 100
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVT 180
CG+ WA GA E +SDR+CI S G +S +D++SCC CG GC+GG+ + K+W+
Sbjct: 101 CGACWAFGAAEIISDRICIQSGGAHQPIISVEDILSCCGSSCGEGCKGGYPLEGLKFWMN 160
Query: 181 TGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY-DVSYEDDL 239
+G+V+GG Y + GC+PY P SSC+ ++ +TP C +KCQ GY + +Y++D
Sbjct: 161 SGVVTGGDY-NGTGCQPYTFP-------PCSSCEASK-STPSCQKKCQTGYLEATYKNDK 211
Query: 240 NF-----------------------GRIAYSLPANEE----------TIMREIFRHGPVE 266
F G+ AY L TI EI+ +GPVE
Sbjct: 212 RFENEEQDSSYMSENFYQVLIILKGGKSAYRLSTTTSSNKISTDAIITIQTEIYNNGPVE 271
Query: 267 GSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
S ++ D YK+G+Y +V+G G HA++IIGWG E + V YWLVANS+ T+
Sbjct: 272 VSYRVFEDFYQYKSGVYHYVSGKLTGAHAVKIIGWGTE-------NKVDYWLVANSWGTD 324
Query: 327 WGENGLFRI 335
+GE G F+I
Sbjct: 325 FGEKGFFKI 333
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 77/198 (38%), Positives = 105/198 (53%), Gaps = 51/198 (25%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGY-DVSYEDDLNF---------- 383
GC+PY P C SSC+A++ +TP C +KCQ GY + +Y++D F
Sbjct: 173 GCQPYTFPPC--------SSCEASK-STPSCQKKCQTGYLEATYKNDKRFENEEQDSSYM 223
Query: 384 -------------GRIAYSLPANEE----------TIMREIFRHGPVEGSMTIYADMILY 420
G+ AY L TI EI+ +GPVE S ++ D Y
Sbjct: 224 SENFYQVLIILKGGKSAYRLSTTTSSNKISTDAIITIQTEIYNNGPVEVSYRVFEDFYQY 283
Query: 421 KTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG 480
K+G+Y +V+G G HA++IIGWG E + V YWLVANS+ T++GE G F+I RG
Sbjct: 284 KSGVYHYVSGKLTGAHAVKIIGWGTE-------NKVDYWLVANSWGTDFGEKGFFKIRRG 336
Query: 481 QNECGIEADITAGLPKIG 498
NECGIE ++ AGL K G
Sbjct: 337 TNECGIEENVVAGLAKNG 354
>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 108/280 (38%), Positives = 158/280 (56%), Gaps = 28/280 (10%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
S T+S+ + +GV K R P++ + ELP+ FDAR WP C +I +I D
Sbjct: 60 FSDFTVSQFKRLLGVKKAPKSLLKRTPVVTHSKEI--ELPKTFDARTAWPQCLSIADILD 117
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CGS WA GAVE+++DR CI +V LS +DL++CC CG GC GG+ AW+Y
Sbjct: 118 QGHCGSCWAFGAVESLTDRFCI--HYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQY 175
Query: 178 WVTTGIVSG--GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ TG+V+ Y + GC SH C+ P TP C +KC ++ +
Sbjct: 176 FKRTGVVTSECDPYFDQTGC-------------SHPGCEPAYP-TPACEKKCVKK-NLLW 220
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
+ +F AY + +++ +IM E++ +GP E S T+Y D YK+G+YKHV G +G HA
Sbjct: 221 SESKHFSVNAYRVNSDQHSIMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHA 280
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+++IGWG GE YWL+AN +N +WG +G F+I
Sbjct: 281 VKLIGWGTSEDGE------DYWLLANQWNRSWGGDGYFKI 314
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/163 (41%), Positives = 100/163 (61%), Gaps = 11/163 (6%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP C +KC ++ + + +F AY + +++ +IM E+
Sbjct: 186 CDPYFDQTGCSHPGCEPAYPTPACEKKCVKK-NLLWSESKHFSVNAYRVNSDQHSIMTEV 244
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+ +GP E S T+Y D YK+G+YKHV G +G HA+++IGWG GE YWL+
Sbjct: 245 YTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGE------DYWLL 298
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI-GLEIDS 503
AN +N +WG +G F+I+RG NECGIE D+TAG P L+I+S
Sbjct: 299 ANQWNRSWGGDGYFKIIRGTNECGIE-DVTAGTPSTKNLDIES 340
>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
Length = 342
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 111/277 (40%), Positives = 158/277 (57%), Gaps = 31/277 (11%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPTIQEIRDQGS 121
T+ + + GV P S + PL S P +LP+ FDAR WP C +I+ I DQG
Sbjct: 66 TVRDFKRLCGVLPKSS--EEVQPLRPLRSHPRTLDLPKHFDAREAWPQCSSIKNILDQGH 123
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEA++DR CI + +V LS +DLV+CC CG GC GG+ AW+Y+ T
Sbjct: 124 CGSCWAFGAVEALTDRFCILN--NENVSLSENDLVACCSSCGFGCDGGYPYAAWEYFAQT 181
Query: 182 GIVSGGTYASKQGCRPYEIPCERYMNGS---HSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
G+V+ C+ Y +G H C+ E +TP C+++C + + D
Sbjct: 182 GVVTS--------------QCDPYFDGKGCKHPGCEP-EYDTPVCVKQCVD--NEQWRDS 224
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
+F Y++ ++ I EI+++GPVE S T+Y D YK+G+YKHV G LG HA++
Sbjct: 225 KHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGEVLGGHAVKF 284
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IGWG G+ YW+VANS+N +WGE+G F+I
Sbjct: 285 IGWGTTDDGK------DYWIVANSWNRSWGEDGFFQI 315
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 94/154 (61%), Gaps = 10/154 (6%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y +G EP +TP C+++C + + D +F Y++ ++ I EI
Sbjct: 188 CDPYFDGKGCKHPGCEPEYDTPVCVKQCVD--NEQWRDSKHFTVQTYAVNSDIYDIQAEI 245
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKHV G LG HA++ IGWG G+ YW+V
Sbjct: 246 YKNGPVEVSYTVYEDFAHYKSGVYKHVFGEVLGGHAVKFIGWGTTDDGK------DYWIV 299
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
ANS+N +WGE+G F+I RG NECGIE++ AG+P
Sbjct: 300 ANSWNRSWGEDGFFQISRGSNECGIESEPVAGIP 333
>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
Length = 347
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 110/279 (39%), Positives = 160/279 (57%), Gaps = 26/279 (9%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLP-LLVQLSDPLEELPEGFDARINWPYCPTIQEIR 117
S T+++ + +GV P PQN L + V+ ELP+ FDAR W C TI I
Sbjct: 57 FSNYTIAQFKHILGVKP---APQNALSNVPVKTYSRSLELPKEFDARSAWSRCSTIGNIL 113
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWK 176
DQG CGS WA GAVE + DR CI + LS +DL++CC CG+GC GG+ +AW+
Sbjct: 114 DQGHCGSCWAFGAVECLQDRFCI--HLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWR 171
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
Y+V G+V+ C PY P + H C+ P TP+C +KC+ V ++
Sbjct: 172 YFVQNGVVT-------DECDPYFDP----VGCKHPGCEPAYP-TPKCEKKCKEQNQV-WQ 218
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
+ +F AY + ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+
Sbjct: 219 EKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAV 278
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 279 KLIGWGTSDAGE------DYWLLANQWNRGWGDDGYFKI 311
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 102/167 (61%), Gaps = 12/167 (7%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+NG+ C PY P + C+ P TP+C +KC+ V +++ +F AY
Sbjct: 175 QNGVVTDECDPYFDP----VGCKHPGCEPAYP-TPKCEKKCKEQNQV-WQEKKHFSIDAY 228
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
+ ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG
Sbjct: 229 RINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDA 288
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GE YWL+AN +N WG++G F+I+RG+NECGIE + AG+P
Sbjct: 289 GE------DYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMP 329
>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 109/281 (38%), Positives = 163/281 (58%), Gaps = 30/281 (10%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPL-EELPEGFDARINWPYCPTIQEIR 117
L+ T+ + + +GV P P L + + P E+LP+ FDAR W C TI +I
Sbjct: 60 LANYTIEQFKHMLGVKP---TPPGLLAGVRTKTHPRSEQLPKEFDARSKWSGCSTIGKIL 116
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWK 176
DQG CGS WA GAVE + DR CI ++ LS++DLV+CC CG+GC GG+ AW+
Sbjct: 117 DQGHCGSCWAFGAVECLQDRFCI--HHNMNISLSANDLVACCGFMCGDGCDGGYPISAWQ 174
Query: 177 YWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V G+V+ + C PY ++ C+ H C+ P TP C +KC+ V
Sbjct: 175 YFVQNGVVT-------EECDPYFDQVGCK------HPGCEPAYP-TPVCEKKCKVQNQV- 219
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
+++ +F AY + ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G H
Sbjct: 220 WQEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGH 279
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 280 AVKLIGWGTSDAGE------DYWLLANQWNRGWGDDGYFKI 314
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 106/171 (61%), Gaps = 16/171 (9%)
Query: 329 ENGLFRIGCRPY--EIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
+NG+ C PY ++ C+ C+ P TP C +KC+ V +++ +F
Sbjct: 178 QNGVVTEECDPYFDQVGCKH------PGCEPAYP-TPVCEKKCKVQNQV-WQEKKHFSID 229
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
AY + ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG
Sbjct: 230 AYQVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTS 289
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GE YWL+AN +N WG++G F+I+RG+NECGIE D+TAG+P +
Sbjct: 290 DAGE------DYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMPSM 334
>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 337
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 167/315 (53%), Gaps = 33/315 (10%)
Query: 43 SILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLL-----VQLSDPL--- 94
S+ L + ++ EK+ + + + GV+ P+ + L VQ+ D +
Sbjct: 14 SVYLTEQAYF-LEKDFIDNINKQATTWKAGVNSAPNTPKEHILRLLGSRGVQIPDKVNYN 72
Query: 95 -----------EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASR 143
+E+P FDAR W C TI E+RDQG+CGS WAL A +DR+C+A+
Sbjct: 73 MYKNDDHADNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSAFADRLCVATN 132
Query: 144 GKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCE 203
G + LS++++ CC CGNGC GG+ +AWK + G+V+GG Y S +GC PY +P
Sbjct: 133 GDFNQLLSAEEITFCCHKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPC 192
Query: 204 RYMNGSHSSC--QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
Y ++C Q E N +C +KC D+ + D + R Y L I +++
Sbjct: 193 PYDKDGKNTCSGQPMESNH-KCSKKCYGDEDIDFNKDHRYTRDDYYLTY--RGIQKDVIN 249
Query: 262 HGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
+GP+E S +Y D YK+GIY K LG H++++IGWG+E V YWL+
Sbjct: 250 YGPIETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWGEE-------YGVLYWLMV 302
Query: 321 NSFNTNWGENGLFRI 335
NS+N +WG+ GLF+I
Sbjct: 303 NSWNADWGDKGLFKI 317
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 89/163 (54%), Gaps = 13/163 (7%)
Query: 336 GCRPYEIPCERYMNGSRSSC--QANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC PY +P Y +++C Q E N +C +KC D+ + D + R Y L
Sbjct: 183 GCEPYRVPPCPYDKDGKNTCSGQPMESNH-KCSKKCYGDEDIDFNKDHRYTRDDYYLTY- 240
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGT 452
I +++ +GP+E S +Y D YK+GIY K LG H++++IGWG+E
Sbjct: 241 -RGIQKDVINYGPIETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWGEE------ 293
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N +WG+ GLF+I RG NEC ++ T G+P
Sbjct: 294 -YGVLYWLMVNSWNADWGDKGLFKIRRGTNECRVDNSTTGGVP 335
>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
Length = 342
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 118/275 (42%), Positives = 163/275 (59%), Gaps = 10/275 (3%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ + R P V + E+P FD+R WP+C +I +IRDQ
Sbjct: 56 SLDDARILMGARKEDAEMKRKRRPT-VDHHNLNVEIPSQFDSRKKWPHCKSISQIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G + LS+ DL+SCC+DCG GC+GGF G+AW T
Sbjct: 115 CGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCEDCGGGCKGGFPGQAWDMGKTR 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
+ GC+PY P CE G + +C TP+C + CQ GY +E D
Sbjct: 175 DSHWRFRKKNHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPFEQDKP 234
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
FG + ++ NE+ R+I +GPVE + +Y D + K+GI +HV G +G H IRIIG
Sbjct: 235 FGEGSSNVQNNEKVFQRDIMMYGPVEAAFDVYEDFLNSKSGISRHVTGSIVGGHPIRIIG 294
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E +G YWL+ANS+N +WGENGLFR+
Sbjct: 295 WGVE---KGNP----YWLIANSWNEDWGENGLFRM 322
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 74/179 (41%), Positives = 105/179 (58%), Gaps = 8/179 (4%)
Query: 317 WLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDV 375
W + + +++W GC+PY P CE G +C TP+C + CQ GY
Sbjct: 168 WDMGKTRDSHWRFRKKNHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKT 227
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
+E D FG + ++ NE+ R+I +GPVE + +Y D + K+GI +HV G +G
Sbjct: 228 PFEQDKPFGEGSSNVQNNEKVFQRDIMMYGPVEAAFDVYEDFLNSKSGISRHVTGSIVGG 287
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
H IRIIGWG E +G YWL+ANS+N +WGENGLFR+VRG++EC IE+ + AGL
Sbjct: 288 HPIRIIGWGVE---KGNP----YWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
Length = 171
Score = 194 bits (494), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 87/171 (50%), Positives = 121/171 (70%), Gaps = 1/171 (0%)
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GA EA+SDR+CI S GK V +SS+DL++CC CG GC GG+ AW +W G
Sbjct: 1 GSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLACCDSCGMGCNGGYPSAAWDFWTDVG 60
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
+VSGG Y S GCRPY I PCE ++NG+ C +TP+CI +C+ GY SY+ D ++
Sbjct: 61 LVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHY 120
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 292
G+ +YS+P++EE I EI+++GPVEG+ T+Y D +LYKTG+Y+H+ G +G
Sbjct: 121 GKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVG 171
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 51/107 (47%), Positives = 78/107 (72%), Gaps = 3/107 (2%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ +GCRPY IP CE ++NG+R C +TP+CI +C+ GY SY+ D ++G+ +
Sbjct: 65 GLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHYGKSS 124
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
YS+P++EE I EI+++GPVEG+ T+Y D +LYKTG+Y+H+ G +G
Sbjct: 125 YSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVG 171
>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 340
Score = 194 bits (493), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 96/242 (39%), Positives = 141/242 (58%), Gaps = 11/242 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+P+ FDAR W C TI ++RDQG+CGS WA+ A +DR+C+A+ + LS++++
Sbjct: 87 RIPKHFDARKKWKRCHTIGKVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 146
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
CC CG GC GG+ KAW+ + G+V+GG Y S +GC PY +P Y H++C
Sbjct: 147 TFCCSSCGYGCNGGYPIKAWESFNNRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAG 206
Query: 216 N-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
C R C D+ Y DD F R +Y L + +I +++ R+GP+E S +Y D
Sbjct: 207 KPREKNHRCTRTCYGNQDLDYNDDHRFTRDSYYLTYS--SIQKDVMRYGPIEASFDMYDD 264
Query: 275 MILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
YK+G+Y + LG HA+++IGWG+E V YWL+ NS+N WG+NGLF
Sbjct: 265 FPSYKSGVYVRSENASYLGGHAVKLIGWGEE-------HGVLYWLMVNSWNEGWGDNGLF 317
Query: 334 RI 335
+I
Sbjct: 318 KI 319
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/162 (38%), Positives = 88/162 (54%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C G + C R C D+ Y DD F R +Y L +
Sbjct: 185 GCEPYRVPPCPYDAEGHNTCAGKPREKNHRCTRTCYGNQDLDYNDDHRFTRDSYYLTYS- 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+I +++ R+GP+E S +Y D YK+G+Y + LG HA+++IGWG+E
Sbjct: 244 -SIQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENASYLGGHAVKLIGWGEE------- 295
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+NGLF+I RG NECGI+ T G+P
Sbjct: 296 HGVLYWLMVNSWNEGWGDNGLFKIRRGTNECGIDNSTTGGVP 337
>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
Length = 293
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 107/278 (38%), Positives = 161/278 (57%), Gaps = 24/278 (8%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+ T++E + +GV P K +P++ D +LP+ FDAR W C +I I D
Sbjct: 1 FANATVAEFKRLLGVKPTPKTEFLGVPIVSH--DISLKLPKEFDARTAWSQCTSIGRILD 58
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CGS WA GAVE++SDR CI + +V LS +DL++CC CG GC GG+ AW+Y
Sbjct: 59 QGHCGSCWAFGAVESLSDRFCI--KYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRY 116
Query: 178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
+ G+V+ + C PY SH C+ P TP+C RKC G + + +
Sbjct: 117 FKHHGVVT-------EECDPYF----DNTGCSHPGCEPAYP-TPKCARKCVSGNQL-WRE 163
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 297
++G AY + ++ + IM E++++GPVE + T+Y D YK+G+YKH+ G +G HA++
Sbjct: 164 SKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVK 223
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+IGWG GE YWL+AN +N +WG++G F+I
Sbjct: 224 LIGWGTSDDGE------DYWLLANQWNRSWGDDGYFKI 255
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 97/154 (62%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C RKC G + + + ++G AY + ++ + IM E+
Sbjct: 127 CDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQL-WRESKHYGVSAYKVRSHPDDIMAEV 185
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 186 YKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGE------DYWLL 239
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N +WG++G F+I RG NECGIE + AGLP
Sbjct: 240 ANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLP 273
>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
Length = 287
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 103/246 (41%), Positives = 145/246 (58%), Gaps = 14/246 (5%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+L + FDAR WP C +I +I D C S WA A E+MSDR+CI S G + LS+ +L
Sbjct: 27 DLSQFFDARERWPECMSIPQINDISECKSSWAFAAAESMSDRLCINSGGTINTILSAQEL 86
Query: 156 VSCCK---DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYM-NGSH 210
+SCC CG GC GG KAW+YW G+ +GG+Y S+ GC+PY I PC + + N ++
Sbjct: 87 LSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSIAPCGKTVGNVTY 146
Query: 211 SSCQDNEPNTPECIRKC--QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
+C + TP C +KC + GY V + D ++G LP + I ++ +GP+E +
Sbjct: 147 PACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASVDQLPNRQIEIQSDVMLNGPIETT 206
Query: 269 MTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
+Y D + Y TGIY H+ G G ++RI+GWG + EG V YWL+ANS+ WG
Sbjct: 207 FEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWG---MYEG----VPYWLLANSWGKEWG 259
Query: 329 ENGLFR 334
ENG FR
Sbjct: 260 ENGTFR 265
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/183 (39%), Positives = 106/183 (57%), Gaps = 18/183 (9%)
Query: 327 WGENGL-------FRIGCRPYEI-PCERYM-NGSRSSCQANEPNTPECIRKC--QPGYDV 375
WG++GL + GC+PY I PC + + N + +C TP C +KC + GY V
Sbjct: 112 WGKHGLPTGGSYESQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPV 171
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
+ D ++G LP + I ++ +GP+E + +Y D + Y TGIY H+ G G
Sbjct: 172 DIDKDRHYGASVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGH 231
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
++RI+GWG + EG V YWL+ANS+ WGENG FR +RG NECG+EA+ +G+P
Sbjct: 232 LSVRILGWG---MYEG----VPYWLLANSWGKEWGENGTFRALRGTNECGLEANCVSGMP 284
Query: 496 KIG 498
K+G
Sbjct: 285 KLG 287
>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 340
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 122/335 (36%), Positives = 175/335 (52%), Gaps = 23/335 (6%)
Query: 3 KSTADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKL 62
KS VA F+ L + S+ L K+F +S + + ++
Sbjct: 6 KSALCLVAVFVVLLATTVSALYAKPSDIPLLGKSFVAETNSKAKGQWTASADNGHLVTGK 65
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
+L E+ MGV S + P + + ++LPE FDA WP C TI EIRDQ +C
Sbjct: 66 SLEEVRKLMGV--TSMSTEAVPPRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNC 123
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA+ AVEAMSDR C S G R+S+ +L+SCC CG GC GG AW +WV G
Sbjct: 124 GSCWAIAAVEAMSDRYCTMS-GIPDRRISTTNLLSCCFICGFGCYGGIPAMAWLWWVWVG 182
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGS-HSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
+ + + C+PY PC + N S + C + NTP+C C +V E
Sbjct: 183 VTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKCNTTCD---NVEMELVKY 232
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
G +YS+ E +M E+ +GP+E +M +YAD + YK+G+YKHV+G LG HA++++G
Sbjct: 233 KGVSSYSIKGERE-LMVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVG 291
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
W G + YW +ANS+NT+WG+ G F I
Sbjct: 292 W-------GVKDGIPYWKIANSWNTDWGDKGYFLI 319
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 65/161 (40%), Positives = 93/161 (57%), Gaps = 13/161 (8%)
Query: 337 CRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C+PY PC + N S+ C NTP+C C +V E G +YS+
Sbjct: 188 CQPYPFGPCSHHGNSSKYPPCPNTIYNTPKCNTTCD---NVEMELVKYKGVSSYSIKGER 244
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E +M E+ +GP+E +M +YAD + YK+G+YKHV+G LG HA++++GW G
Sbjct: 245 E-LMVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGW-------GVKD 296
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YW +ANS+NT+WG+ G F I RG +ECGIE+ AG P
Sbjct: 297 GIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337
>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 357
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 113/295 (38%), Positives = 165/295 (55%), Gaps = 34/295 (11%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARI 105
P+ + A S T+ + + +GV P +P+ L +S P +LP+ FDAR
Sbjct: 53 PEAGWEAAINPRFSNYTVEQFKRLLGVKP---MPKKELRSTPAISHPKTLKLPKNFDART 109
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGN 164
W C TI I DQG CGS WA GAVE++SDR CI ++ LS +DL++CC CG+
Sbjct: 110 AWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HFDVNISLSVNDLLACCGFLCGS 167
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEP--NT 220
GC GG+ AW+Y G+V+ + C PY +I C SH C EP T
Sbjct: 168 GCDGGYPLYAWRYLAHHGVVT-------EECDPYFDQIGC------SHPGC---EPAYRT 211
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
P+C++KC G V ++ ++ AY + ++ IM E++++GPVE + T+Y D YK+
Sbjct: 212 PKCVKKCVSGNQV-WKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKS 270
Query: 281 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+YKH+ G LG HA+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 271 GVYKHITGYELGGHAVKLIGWGTTDDGE------DYWLLANQWNREWGDDGYFKI 319
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 96/154 (62%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C++KC G V ++ ++ AY + ++ IM E+
Sbjct: 191 CDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQV-WKKSKHYSVSAYRVNSDPHDIMAEV 249
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ G LG HA+++IGWG GE YWL+
Sbjct: 250 YKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGE------DYWLL 303
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N WG++G F+I RG NECGIE D+TAGLP
Sbjct: 304 ANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLP 337
>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
Length = 356
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 118/331 (35%), Positives = 181/331 (54%), Gaps = 44/331 (13%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPTIQEIR 117
S T+ + + +GV + P++ L ++ P +LP+ FDAR W C TI I
Sbjct: 64 FSNFTVGQFKRLLGV---KQTPRSELSSAPVVTHPKSLKLPKDFDARTAWSQCSTIGRIL 120
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWK 176
DQG CGS WA GAVE++SDR CI +V LS +D+++CC CG GC GG AW
Sbjct: 121 DQGHCGSCWAFGAVESLSDRFCI--HFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWI 178
Query: 177 YWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y G+V+ + C PY +I C SH C+ TP+C++KC G +
Sbjct: 179 YLAHHGVVT-------EECDPYFDQIGC------SHPGCEPTY-RTPKCVKKCVNGNQL- 223
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
+E ++ AY++ ++ + IM E++++GPVE + T+Y D YK+G+YKH+ G LG H
Sbjct: 224 WETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGH 283
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMN--GSR 352
A++++GWG GE YWL+AN +NTNWG++G F+I +R N G
Sbjct: 284 AVKLVGWGTSHEGE------DYWLLANQWNTNWGDDGYFKI---------KRGTNECGIE 328
Query: 353 SSCQANEPNTPECIRKCQPGYDVSYEDDLNF 383
++ A P+T +R+ D+ + D++F
Sbjct: 329 NAVTAGLPSTKNIVREVT---DMDVDADVSF 356
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 99/154 (64%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C++KC G + +E ++ AY++ ++ + IM E+
Sbjct: 190 CDPYFDQIGCSHPGCEPTYRTPKCVKKCVNGNQL-WETSKHYSVKAYTVNSDPQDIMAEV 248
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ G LG HA++++GWG GE YWL+
Sbjct: 249 YKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGE------DYWLL 302
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +NTNWG++G F+I RG NECGIE +TAGLP
Sbjct: 303 ANQWNTNWGDDGYFKIKRGTNECGIENAVTAGLP 336
>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
Length = 351
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 118/331 (35%), Positives = 181/331 (54%), Gaps = 44/331 (13%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPTIQEIR 117
S T+ + + +GV + P++ L ++ P +LP+ FDAR W C TI I
Sbjct: 59 FSNFTVGQFKRLLGV---KQTPRSELSSAPVVTHPKSLKLPKDFDARTAWSQCSTIGRIL 115
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWK 176
DQG CGS WA GAVE++SDR CI +V LS +D+++CC CG GC GG AW
Sbjct: 116 DQGHCGSCWAFGAVESLSDRFCI--HFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWI 173
Query: 177 YWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y G+V+ + C PY +I C SH C+ TP+C++KC G +
Sbjct: 174 YLAHHGVVT-------EECDPYFDQIGC------SHPGCEPTY-RTPKCVKKCVNGNQL- 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
+E ++ AY++ ++ + IM E++++GPVE + T+Y D YK+G+YKH+ G LG H
Sbjct: 219 WETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGH 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMN--GSR 352
A++++GWG GE YWL+AN +NTNWG++G F+I +R N G
Sbjct: 279 AVKLVGWGTSHEGE------DYWLLANQWNTNWGDDGYFKI---------KRGTNECGIE 323
Query: 353 SSCQANEPNTPECIRKCQPGYDVSYEDDLNF 383
++ A P+T +R+ D+ + D++F
Sbjct: 324 NAVTAGLPSTKNIVREVT---DMDVDADVSF 351
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 99/154 (64%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C++KC G + +E ++ AY++ ++ + IM E+
Sbjct: 185 CDPYFDQIGCSHPGCEPTYRTPKCVKKCVNGNQL-WETSKHYSVKAYTVNSDPQDIMAEV 243
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ G LG HA++++GWG GE YWL+
Sbjct: 244 YKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGE------DYWLL 297
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +NTNWG++G F+I RG NECGIE +TAGLP
Sbjct: 298 ANQWNTNWGDDGYFKIKRGTNECGIENAVTAGLP 331
>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
Length = 226
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 91/179 (50%), Positives = 122/179 (68%), Gaps = 1/179 (0%)
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A+ AV AMSDR+CI S GK+ V LS+ DL+SCC++CG+GC GGF G AW YWV+ GIV+G
Sbjct: 42 AVSAVGAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDYWVSHGIVTG 101
Query: 187 GTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA 245
G+ + GC+PY P CE + G + SC D TP+C RKCQ GY YE D ++G I+
Sbjct: 102 GSKENHTGCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGIS 161
Query: 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
++ NE I +EI +GPVE + I+ D + YK+GIY++ G +GEH +RIIGWG E
Sbjct: 162 INVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIE 220
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 55/132 (41%), Positives = 74/132 (56%), Gaps = 4/132 (3%)
Query: 316 YWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD 374
YW+ EN GC+PY P CE + G SC TP+C RKCQ GY
Sbjct: 92 YWVSHGIVTGGSKEN---HTGCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYT 148
Query: 375 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
YE D ++G I+ ++ NE I +EI +GPVE + I+ D + YK+GIY++ G +G
Sbjct: 149 TPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVG 208
Query: 435 EHAIRIIGWGQE 446
EH +RIIGWG E
Sbjct: 209 EHYVRIIGWGIE 220
>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
Length = 330
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 103/249 (41%), Positives = 143/249 (57%), Gaps = 18/249 (7%)
Query: 92 DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLS 151
D ++LPE FDAR W C +I+EIRDQ CGS WA+ + MSDR+CI S K +R+S
Sbjct: 76 DDGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRIS 135
Query: 152 SDDLVSCCKDCG---NGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNG 208
+ D++ CC+ C +GC GG + W +G VSGG Y S GC Y +P
Sbjct: 136 AADMIECCESCTFSVDGCHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLP------R 189
Query: 209 SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEG 267
+ SC+ + P C ++C G + YE+D ++ + AY + + E I EI ++GPV
Sbjct: 190 CNPSCK-TLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVA 248
Query: 268 SMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
S T+YAD I Y +G+YK LG HA+RIIGWG E + YWLV+NS+N
Sbjct: 249 SFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIE------NGTYPYWLVSNSWNER 302
Query: 327 WGENGLFRI 335
WG+ GLF+I
Sbjct: 303 WGDQGLFKI 311
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/138 (46%), Positives = 88/138 (63%), Gaps = 8/138 (5%)
Query: 361 NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYADMIL 419
+ P C ++C G + YE+D ++ + AY + + E I EI ++GPV S T+YAD I
Sbjct: 199 DAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASFTVYADFIH 258
Query: 420 YKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIV 478
Y +G+YK L G HA+RIIGWG E + YWLV+NS+N WG+ GLF+I
Sbjct: 259 YLSGVYKFDGESKLLGGHAVRIIGWGIE------NGTYPYWLVSNSWNERWGDQGLFKIW 312
Query: 479 RGQNECGIEADITAGLPK 496
RG+NECGIE +ITAGLP+
Sbjct: 313 RGKNECGIEEEITAGLPR 330
>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
Length = 341
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 98/252 (38%), Positives = 142/252 (56%), Gaps = 16/252 (6%)
Query: 90 LSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
L D ++PE FDAR W C +I I +QG+C + WA+ A++DR+CI S+
Sbjct: 80 LDDGSNDMPETFDARNKWFECVSIAHIWNQGNCAADWAISVTSAINDRICIKSKKNITAF 139
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNG 208
S ++SCC DCG+GC GG+ G AW+YW+ G+V+GG Y S +GC+P+ I PC +
Sbjct: 140 YSPQKMLSCCDDCGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQPWLIPPCNHTVMD 199
Query: 209 SHS---SCQDNEPNTPECIRKC-QPGYDVSYEDDLNFG-RIAYSLPANEETIMREIFRHG 263
S C + TP+C C P Y + D++ G RI + I E+ +HG
Sbjct: 200 ERSPSYMCGKYKSETPQCTLNCYNPNYSKPFLKDISKGIRIDWHCSG---MIRNELKKHG 256
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
P M +Y D + YK+GIY+HV G LG+ +++IGW G V+YWL ANS+
Sbjct: 257 PATAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGW-------GVYRGVQYWLAANSW 309
Query: 324 NTNWGENGLFRI 335
T+WG+ G F+I
Sbjct: 310 GTSWGDKGFFKI 321
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/168 (36%), Positives = 88/168 (52%), Gaps = 16/168 (9%)
Query: 336 GCRPYEIP-CERYMNGSRS---SCQANEPNTPECIRKC-QPGYDVSYEDDLNFG-RIAYS 389
GC+P+ IP C + RS C + TP+C C P Y + D++ G RI +
Sbjct: 184 GCQPWLIPPCNHTVMDERSPSYMCGKYKSETPQCTLNCYNPNYSKPFLKDISKGIRIDWH 243
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
I E+ +HGP M +Y D + YK+GIY+HV G LG+ +++IGW
Sbjct: 244 CSG---MIRNELKKHGPATAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGW------ 294
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
G V+YWL ANS+ T+WG+ G F+I RG NEC E +G P +
Sbjct: 295 -GVYRGVQYWLAANSWGTSWGDKGFFKIRRGYNECLFEDYFISGRPVL 341
>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
Length = 360
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 108/280 (38%), Positives = 152/280 (54%), Gaps = 33/280 (11%)
Query: 64 LSELEMRMG---VHPDSKL-PQNRLPLLVQLSDPLEELPEGFDARINWPYCP-TIQEIRD 118
++E R+G +HPD P+ + P Q +PE FDAR WP C I IR+
Sbjct: 40 FDDIESRLGFLGIHPDPNFKPEIKEPQATQ-----NVIPETFDAREYWPECADIIGNIRN 94
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
QG C S WA A E MSDR+CIA+ GK ++LS +DL+ CC CGN C+GG+ AW Y+
Sbjct: 95 QGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHYCGNQCKGGYTYYAWNYF 154
Query: 179 VTTGIVSGGTYASKQGCRPY-EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
+ TG+VSGG Y + GC+PY E+ R +++CQ+++ Y + Y
Sbjct: 155 MLTGLVSGGDYNTSTGCQPYSELNYYRITPPCNTTCQNDK-------------YPIPYVS 201
Query: 238 DLNFGRIAYSLPANEETIMREIFR-HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
D +FG Y +P NE I EI GPV + +Y D +Y+ G+Y + +G G A+
Sbjct: 202 DKHFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGVYIYTSGALFGRTAV 261
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE-NGLFRI 335
+IIGWG E + YWL ANS+ +WG G F+I
Sbjct: 262 KIIGWGTE-------NGWAYWLAANSWGKDWGALGGFFKI 294
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 73/140 (52%), Gaps = 10/140 (7%)
Query: 362 TPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHG-PVEGSMTIYADMIL 419
TP C CQ Y + Y D +FG Y +P NE I EI G PV + +Y D +
Sbjct: 183 TPPCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKI 242
Query: 420 YKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE-NGLFRIV 478
Y+ G+Y + +G G A++IIGWG E + YWL ANS+ +WG G F+I
Sbjct: 243 YRDGVYIYTSGALFGRTAVKIIGWGTE-------NGWAYWLAANSWGKDWGALGGFFKIR 295
Query: 479 RGQNECGIEADITAGLPKIG 498
RG NECG E I AG + G
Sbjct: 296 RGTNECGFEESIIAGQVREG 315
>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
Length = 347
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 109/279 (39%), Positives = 160/279 (57%), Gaps = 26/279 (9%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLP-LLVQLSDPLEELPEGFDARINWPYCPTIQEIR 117
S T+++ + +GV P PQN L + V+ ELP+ FDAR W C TI I
Sbjct: 57 FSNYTIAQFKHILGVKP---APQNALSNVPVKTYSRSLELPKEFDARSAWSRCSTIGNIL 113
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWK 176
+QG CGS WA GAVE + DR CI + LS +DL++CC CG+GC GG+ +AW+
Sbjct: 114 EQGHCGSCWAFGAVECLQDRFCI--HLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWR 171
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
Y+V G+V+ C PY P + H C+ P TP+C +KC+ V ++
Sbjct: 172 YFVQNGVVT-------DECDPYFDP----VGCKHPGCEPAYP-TPKCEKKCKEQNQV-WQ 218
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
+ +F AY + ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+
Sbjct: 219 EKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAV 278
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 279 KLIGWGTSDAGE------DYWLLANQWNRGWGDDGYFKI 311
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 102/167 (61%), Gaps = 12/167 (7%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+NG+ C PY P + C+ P TP+C +KC+ V +++ +F AY
Sbjct: 175 QNGVVTDECDPYFDP----VGCKHPGCEPAYP-TPKCEKKCKEQNQV-WQEKKHFSIDAY 228
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
+ ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG
Sbjct: 229 RINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDA 288
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GE YWL+AN +N WG++G F+I+RG+NECGIE + AG+P
Sbjct: 289 GE------DYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMP 329
>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
Length = 327
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 111/293 (37%), Positives = 164/293 (55%), Gaps = 30/293 (10%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARI 105
P+ + A S T+ + + +GV P +P+ L +S P +LP+ FDAR
Sbjct: 53 PEAGWEAAINPRFSNYTVEQFKRLLGVKP---MPKKELRSTPAISHPKTLKLPKNFDART 109
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGN 164
W C TI I DQG CGS WA GAVE++SDR CI ++ LS +DL++CC CG+
Sbjct: 110 AWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HFDVNISLSVNDLLACCGFLCGS 167
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPE 222
GC GG+ AW+Y G+V+ + C PY +I C SH C+ TP+
Sbjct: 168 GCDGGYPLYAWRYLAHHGVVT-------EECDPYFDQIGC------SHPGCEPAY-RTPK 213
Query: 223 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
C++KC G V ++ ++ AY + ++ IM E++++GPVE + T+Y D YK+G+
Sbjct: 214 CVKKCVSGNQV-WKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGV 272
Query: 283 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
YKH+ G LG HA+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 273 YKHITGYELGGHAVKLIGWGTTDDGE------DYWLLANQWNREWGDDGYFKI 319
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 58/144 (40%), Positives = 87/144 (60%), Gaps = 9/144 (6%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C++KC G V ++ ++ AY + ++ IM E+
Sbjct: 191 CDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQV-WKKSKHYSVSAYRVNSDPHDIMAEV 249
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ G LG HA+++IGWG GE YWL+
Sbjct: 250 YKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGE------DYWLL 303
Query: 462 ANSFNTNWGENGLFRIVRGQNECG 485
AN +N WG++G F+I RG NECG
Sbjct: 304 ANQWNREWGDDGYFKIRRGTNECG 327
>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
Length = 335
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 148/253 (58%), Gaps = 14/253 (5%)
Query: 90 LSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
+S +L FDAR WP C +I +I D C + WA A E+MSDR+CI S G ++
Sbjct: 69 VSQANSDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRLCINSGGFKNTI 128
Query: 150 LSSDDLVSCCK---DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERY 205
LS+++L+SCC CG GC+GG KAW+Y GI +GG+Y S+ GC+PY I PC +
Sbjct: 129 LSAEELLSCCTGMFSCGEGCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKT 188
Query: 206 M-NGSHSSCQDNEPNTPECIRKCQP--GYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
+ N ++ +C + TP C +KC GY + + D ++G LP ++ I ++ +
Sbjct: 189 VGNVTYPACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQSDVMLN 248
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
GP++ + +Y D + Y TGIY H+ G G ++RIIGWG + +G V YWL ANS
Sbjct: 249 GPIQATFEVYDDFLQYTTGIYVHLTGNKQGHLSVRIIGWG---VWQG----VPYWLCANS 301
Query: 323 FNTNWGENGLFRI 335
+ WGENG FR+
Sbjct: 302 WGRQWGENGTFRV 314
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 64/168 (38%), Positives = 99/168 (58%), Gaps = 11/168 (6%)
Query: 334 RIGCRPYEIP-CERYM-NGSRSSCQANEPNTPECIRKCQP--GYDVSYEDDLNFGRIAYS 389
+ GC+PY IP C + + N + +C TP C +KC GY + + D ++G
Sbjct: 174 QFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYGVSVDQ 233
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
LP ++ I ++ +GP++ + +Y D + Y TGIY H+ G G ++RIIGWG +
Sbjct: 234 LPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTGIYVHLTGNKQGHLSVRIIGWG---VW 290
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+G V YWL ANS+ WGENG FR++RG NECG+E++ +G+PK+
Sbjct: 291 QG----VPYWLCANSWGRQWGENGTFRVLRGTNECGLESNCVSGMPKL 334
>gi|86279341|gb|ABC88766.1| putative cathepsin B-like like proteinase [Tenebrio molitor]
Length = 301
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 103/235 (43%), Positives = 139/235 (59%), Gaps = 6/235 (2%)
Query: 48 KLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW 107
K + A +N +S + +GV P K +LP+ + L+ +PE FDAR W
Sbjct: 37 KQTTWKAGRNFDVNTPISHVRRLLGVLP-KKANAPKLPVKTHAVN-LDAIPESFDAREAW 94
Query: 108 PYCPTI-QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGC 166
P C +I EIRDQ SCGS WA GAVEAMSDR+CI S VR+S++DL CC DCG+GC
Sbjct: 95 PECTSIIGEIRDQASCGSCWAFGAVEAMSDRICIHSDASVKVRISAEDLNDCCYDCGDGC 154
Query: 167 QGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIR 225
GG+ AW YW +TGIV+GG Y +GC+ Y I PC+ +++G+ C D + TP C +
Sbjct: 155 NGGWPDLAWSYWSSTGIVTGGLYGVDEGCKAYSIKPCDHHVDGNLGPCGDIQ-RTPACKK 213
Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
C D+ Y+ DL G AYS+P +E I EI +GPVE +Y+D + YK
Sbjct: 214 SCDSTSDLEYKSDLRRGS-AYSIPKSESQIQTEIMTNGPVEADYDVYSDFLTYKA 267
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 53/95 (55%), Gaps = 5/95 (5%)
Query: 331 GLFRI--GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL+ + GC+ Y I PC+ +++G+ C + TP C + C D+ Y+ DL G A
Sbjct: 175 GLYGVDEGCKAYSIKPCDHHVDGNLGPC-GDIQRTPACKKSCDSTSDLEYKSDLRRGS-A 232
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 422
YS+P +E I EI +GPVE +Y+D + YK
Sbjct: 233 YSIPKSESQIQTEIMTNGPVEADYDVYSDFLTYKA 267
>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 356
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 113/295 (38%), Positives = 164/295 (55%), Gaps = 34/295 (11%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARI 105
P+ + A S T+ + + +GV P P+ L +S P +LP+ FDAR
Sbjct: 52 PEAGWEAAINPHFSNYTVEQFKRLLGVKP---TPKKELRSTPAISHPKSLKLPKNFDART 108
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGN 164
W C TI I DQG CGS WA GAVE++SDR CI ++ LS +DL++CC CG+
Sbjct: 109 AWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HFDVNISLSVNDLLACCGFLCGS 166
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEP--NT 220
GC GG+ AW+Y G+V+ + C PY +I C SH C EP T
Sbjct: 167 GCDGGYPLYAWQYLAHHGVVT-------EECDPYFDQIGC------SHPGC---EPAYRT 210
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
P+C++KC G V ++ ++ AY + ++ IM E++++GPVE + T+Y D YK+
Sbjct: 211 PKCVKKCVSGNQV-WKKSKHYSVNAYRVSSDPHDIMTEVYKNGPVEVAFTVYEDFAHYKS 269
Query: 281 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+YKH+ G LG HA+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 270 GVYKHITGYELGGHAVKLIGWGTTEDGE------DYWLLANQWNREWGDDGYFKI 318
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 96/154 (62%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C++KC G V ++ ++ AY + ++ IM E+
Sbjct: 190 CDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQV-WKKSKHYSVNAYRVSSDPHDIMTEV 248
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ G LG HA+++IGWG GE YWL+
Sbjct: 249 YKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLIGWGTTEDGE------DYWLL 302
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N WG++G F+I RG NECGIE D+TAGLP
Sbjct: 303 ANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLP 336
>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 115/278 (41%), Positives = 155/278 (55%), Gaps = 22/278 (7%)
Query: 62 LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP-TIQEIRDQG 120
+ E E+ + K PQ + + EE+PE FD+R WP C I IRDQ
Sbjct: 46 FAIDEYELFKSLASGVKKPQGLKTAQKLVREITEEIPESFDSRTAWPECTQIIGMIRDQS 105
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
CGS WA AVEAMSDR+CI S + + +SS DL++C GC GG+ AW W T
Sbjct: 106 RCGSCWAFAAVEAMSDRICIHSNATKKLLVSSQDLLTC--GTAGGCNGGWPAVAWSDW-T 162
Query: 181 TGIVSGGTY-ASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKC-QPGYDVSYED 237
GIV+GG Y A +QGC+ Y + C+ + N C+ N +TP C+ +C +P + Y+
Sbjct: 163 NGIVTGGLYGALEQGCKSYFLEGCDDHPN----KCR-NYVSTPACVEQCDEP--SLYYKA 215
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 297
+G+ Y + EE I EI +GPVE +M +Y D Y++GIY+ G HA++
Sbjct: 216 QETYGQTPYEIQG-EEQIQYEIMTNGPVEATMDVYVDFAQYQSGIYQLTTDEYEGGHAVK 274
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
I+GWG E VKYWLVANS+N WGENGLFRI
Sbjct: 275 ILGWGVE-------DGVKYWLVANSWNERWGENGLFRI 305
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 70/170 (41%), Positives = 97/170 (57%), Gaps = 17/170 (10%)
Query: 328 GENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKC-QPGYDVSYEDDLNFGR 385
G G GC+ Y + C+ + N R N +TP C+ +C +P + Y+ +G+
Sbjct: 169 GLYGALEQGCKSYFLEGCDDHPNKCR-----NYVSTPACVEQCDEP--SLYYKAQETYGQ 221
Query: 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 445
Y + EE I EI +GPVE +M +Y D Y++GIY+ G HA++I+GWG
Sbjct: 222 TPYEIQG-EEQIQYEIMTNGPVEATMDVYVDFAQYQSGIYQLTTDEYEGGHAVKILGWGV 280
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
E VKYWLVANS+N WGENGLFRI+RG++E GIE+ I A LP
Sbjct: 281 E-------DGVKYWLVANSWNERWGENGLFRIIRGRDEVGIESTIDAALP 323
>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
Length = 332
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 109/305 (35%), Positives = 163/305 (53%), Gaps = 30/305 (9%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLP-QNRLPLL----VQLSDPL------------ 94
Y E++ ++++ + + G++ D KL +N + LL VQ +
Sbjct: 19 YFLEEDYINQINENAKTWKAGINFDPKLSVENFVKLLGSKGVQAAKKASPDMFKTDDKTY 78
Query: 95 --EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
+ +P+ FDAR W C TI E+RDQG CGS WA G A +DR+CIA+ G + LS+
Sbjct: 79 ENQRIPKFFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGDFNELLSA 138
Query: 153 DDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHS 211
++L CC CG GC GG+ KAW+ + G+V+GG Y S +GC+PY + PC G+++
Sbjct: 139 EELTFCCHTCGYGCHGGYPIKAWERFKKHGLVTGGNYDSSEGCQPYRVSPCPLDEYGNNT 198
Query: 212 SCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
C R C D +++D F R AY L TI +++ +GP+E S +
Sbjct: 199 CRGKPAEKNHRCTRMCYGDQDRDFKEDHRFTRDAYYLTYG--TIQKDVMTYGPIEASYEV 256
Query: 272 YADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
Y D YK+G+Y LG HA+++IGWG+E V YWL+ NS+N WG+
Sbjct: 257 YDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEE-------YGVPYWLMVNSWNDQWGDR 309
Query: 331 GLFRI 335
GLF+I
Sbjct: 310 GLFKI 314
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 86/162 (53%), Gaps = 11/162 (6%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY + PC G+ + C R C D +++D F R AY L
Sbjct: 180 GCQPYRVSPCPLDEYGNNTCRGKPAEKNHRCTRMCYGDQDRDFKEDHRFTRDAYYLTYG- 238
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
TI +++ +GP+E S +Y D YK+G+Y LG HA+++IGWG+E
Sbjct: 239 -TIQKDVMTYGPIEASYEVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEE------- 290
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECGI+ T G+P
Sbjct: 291 YGVPYWLMVNSWNDQWGDRGLFKIRRGTNECGIDNSTTGGVP 332
>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
Length = 332
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 109/305 (35%), Positives = 165/305 (54%), Gaps = 30/305 (9%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLP-QNRLPLL----VQLSDPL------------ 94
Y E++ ++++ + + G++ D KL +N + LL VQ +
Sbjct: 19 YFLEEDYINQINENAKTWKAGINFDPKLSIENFVKLLGSKGVQAAKKASPDMFKTIDKAY 78
Query: 95 --EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
+++P+ FDAR W C TI E+RDQG CGS WA G A +DR+CIA+ G+ + LS+
Sbjct: 79 ENQKIPKFFDARKKWRKCFTIGEVRDQGKCGSCWAFGTSSAFADRLCIATNGEFNELLSA 138
Query: 153 DDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHS 211
++L CC CG GC GG+ KAW+ + G+V+GG Y S +GC+PY + PC G+++
Sbjct: 139 EELTFCCHKCGFGCHGGYPIKAWERFQKHGLVTGGDYDSGEGCQPYRVSPCPLDEYGNNT 198
Query: 212 SCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
C R C D+ ++ D +F R AY L I R++ +GP+E S +
Sbjct: 199 CRGKPAEKNHRCTRMCYGNQDLDFKKDHHFTRDAYYLTFG--IIQRDVMAYGPIEASYDV 256
Query: 272 YADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
Y D YK+G+Y LG HA+++IGWG+E V YWL+ NS+N WG+
Sbjct: 257 YDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEE-------YGVPYWLMVNSWNDQWGDK 309
Query: 331 GLFRI 335
GLF+I
Sbjct: 310 GLFKI 314
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 86/162 (53%), Gaps = 11/162 (6%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY + PC G+ + C R C D+ ++ D +F R AY L
Sbjct: 180 GCQPYRVSPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKKDHHFTRDAYYLTFG- 238
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
I R++ +GP+E S +Y D YK+G+Y LG HA+++IGWG+E
Sbjct: 239 -IIQRDVMAYGPIEASYDVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEE------- 290
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECGI+ T G+P
Sbjct: 291 YGVPYWLMVNSWNDQWGDKGLFKIRRGTNECGIDNSTTGGVP 332
>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 349
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 108/279 (38%), Positives = 157/279 (56%), Gaps = 26/279 (9%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPTIQEIR 117
S ++S+ + +GV + P+ L LS P +LP+ FDAR WP C +I I
Sbjct: 66 FSNYSVSQFKYLLGV---KQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWPQCISIGTIL 122
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWK 176
DQG CGS WA GAVE++SDR CI ++ LS +DL++CC CG+GC GG+ AW+
Sbjct: 123 DQGHCGSCWAFGAVESLSDRFCI--HFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWR 180
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
Y+V G+V+ + C PY SH C+ P TP C+R C + +
Sbjct: 181 YFVRHGVVT-------EQCDPYF----DTTGCSHPGCEPAYP-TPRCVRHCVDKNQI-WR 227
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
++G AY + + IM E++++GPVE S T+Y D YK+G+YKH+ G +G HA+
Sbjct: 228 KTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 287
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 288 KLIGWGTTDDGE------DYWLLANQWNRGWGDDGYFKI 320
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 92/154 (59%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP C+R C + + ++G AY + + IM E+
Sbjct: 192 CDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQI-WRKTKHYGVSAYRVKRDPNDIMAEV 250
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 251 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGE------DYWLL 304
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N WG++G F+I RG NECGIE D+ AGLP
Sbjct: 305 ANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLP 338
>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
Length = 359
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 111/295 (37%), Positives = 165/295 (55%), Gaps = 34/295 (11%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARI 105
P+ + A S T+ + + +GV + P+ L ++ P +LP+ FDAR
Sbjct: 55 PEAGWEAAINPRFSNFTVGQFKRLLGV---KQAPKKELLSTPVVTHPKSLKLPKEFDART 111
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGN 164
W C TI +I DQG CGS WA GAVE++ DR CI ++ LS +DL++CC CG
Sbjct: 112 AWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCI--HFDMNISLSVNDLLACCGFLCGA 169
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEP--NT 220
GC GG AW+Y G+V+ + C PY +I C SH C EP T
Sbjct: 170 GCDGGTPIYAWRYLAHHGVVT-------EECDPYFDQIGC------SHPGC---EPAYQT 213
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
P+C+RKC G + ++ ++ AY + ++ + IM E++++GPVE + T++ D YK+
Sbjct: 214 PKCVRKCVKGNQI-WKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKS 272
Query: 281 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+YKH+ G LG HA+++IGWG GE YWL+AN +NTNWG++G F+I
Sbjct: 273 GVYKHITGSALGGHAVKLIGWGTSDEGE------DYWLLANQWNTNWGDDGYFKI 321
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 106/171 (61%), Gaps = 12/171 (7%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C+RKC G + ++ ++ AY + ++ + IM E+
Sbjct: 193 CDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQI-WKRSKHYSVKAYRVKSDPQDIMAEV 251
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T++ D YK+G+YKH+ G LG HA+++IGWG GE YWL+
Sbjct: 252 YKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGE------DYWLL 305
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK---IGLEIDSNEINLG 509
AN +NTNWG++G F+I RG NECGIE D+TAGLP I E+ +++ G
Sbjct: 306 ANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDVDAG 356
>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 348
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 108/279 (38%), Positives = 157/279 (56%), Gaps = 26/279 (9%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPTIQEIR 117
S ++S+ + +GV + P+ L LS P +LP+ FDAR WP C +I I
Sbjct: 65 FSNYSVSQFKYLLGV---KQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWPQCISIGTIL 121
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWK 176
DQG CGS WA GAVE++SDR CI ++ LS +DL++CC CG+GC GG+ AW+
Sbjct: 122 DQGHCGSCWAFGAVESLSDRFCI--HFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWR 179
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
Y+V G+V+ + C PY SH C+ P TP C+R C + +
Sbjct: 180 YFVRHGVVT-------EQCDPYF----DTTGCSHPGCEPAYP-TPRCVRHCVDKNQI-WR 226
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
++G AY + + IM E++++GPVE S T+Y D YK+G+YKH+ G +G HA+
Sbjct: 227 KTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 286
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 287 KLIGWGTTDDGE------DYWLLANQWNRGWGDDGYFKI 319
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 92/154 (59%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP C+R C + + ++G AY + + IM E+
Sbjct: 191 CDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQI-WRKTKHYGVSAYRVKRDPNDIMAEV 249
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 250 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGE------DYWLL 303
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N WG++G F+I RG NECGIE D+ AGLP
Sbjct: 304 ANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLP 337
>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
Length = 357
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 111/295 (37%), Positives = 165/295 (55%), Gaps = 34/295 (11%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARI 105
P+ + A S T+ + + +GV + P+ L ++ P +LP+ FDAR
Sbjct: 53 PEAGWEAAINPRFSNFTVGQFKRLLGV---KQAPKKELLSTPVVTHPKSLKLPKEFDART 109
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGN 164
W C TI +I DQG CGS WA GAVE++ DR CI ++ LS +DL++CC CG
Sbjct: 110 AWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCI--HFDMNISLSVNDLLACCGFLCGA 167
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEP--NT 220
GC GG AW+Y G+V+ + C PY +I C SH C EP T
Sbjct: 168 GCDGGTPIYAWRYLAHHGVVT-------EECDPYFDQIGC------SHPGC---EPAYQT 211
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
P+C+RKC G + ++ ++ AY + ++ + IM E++++GPVE + T++ D YK+
Sbjct: 212 PKCVRKCVKGNQI-WKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKS 270
Query: 281 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+YKH+ G LG HA+++IGWG GE YWL+AN +NTNWG++G F+I
Sbjct: 271 GVYKHITGSALGGHAVKLIGWGTSDEGE------DYWLLANQWNTNWGDDGYFKI 319
Score = 141 bits (355), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 106/171 (61%), Gaps = 12/171 (7%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C+RKC G + ++ ++ AY + ++ + IM E+
Sbjct: 191 CDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQI-WKRSKHYSVKAYRVKSDPQDIMAEV 249
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T++ D YK+G+YKH+ G LG HA+++IGWG GE YWL+
Sbjct: 250 YKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGE------DYWLL 303
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK---IGLEIDSNEINLG 509
AN +NTNWG++G F+I RG NECGIE D+TAGLP I E+ +++ G
Sbjct: 304 ANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDVDAG 354
>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
Length = 332
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 102/247 (41%), Positives = 146/247 (59%), Gaps = 15/247 (6%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+L + FDAR WP C +I +I D C S WA A E+MSDR+CI S G + LS+ +L
Sbjct: 71 DLSQFFDARERWPECTSIPQINDISECKSSWAFAAAESMSDRLCINSGGMINTILSAQEL 130
Query: 156 VSCCK---DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYM-NGSH 210
+SCC CG GC GG KAW+YW G+ +GG+Y ++ GC+PY I PC + + N ++
Sbjct: 131 LSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSIAPCGKTVGNVTY 190
Query: 211 SSCQDNEPNTPECIRKC--QPGYDVSYEDDLNFGRIAY-SLPANEETIMREIFRHGPVEG 267
+C + TP C +KC + GY V + D ++G + LP + I ++ +GP+E
Sbjct: 191 PACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASSVDQLPNRQIEIQSDVMLNGPIET 250
Query: 268 SMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW 327
+ +Y D + Y TGIY H+ G G ++RI+GWG + EG V YWL+ANS+ W
Sbjct: 251 TFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWG---MYEG----VPYWLLANSWGKEW 303
Query: 328 GENGLFR 334
GENG FR
Sbjct: 304 GENGTFR 310
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 71/184 (38%), Positives = 106/184 (57%), Gaps = 19/184 (10%)
Query: 327 WGENGL-------FRIGCRPYEI-PCERYM-NGSRSSCQANEPNTPECIRKC--QPGYDV 375
WG++GL + GC+PY I PC + + N + +C TP C +KC + GY V
Sbjct: 156 WGKHGLPTGGSYETQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPV 215
Query: 376 SYEDDLNFGRIAY-SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 434
+ D ++G + LP + I ++ +GP+E + +Y D + Y TGIY H+ G G
Sbjct: 216 DIDKDRHYGASSVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQG 275
Query: 435 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
++RI+GWG + EG V YWL+ANS+ WGENG FR +RG NECG+EA+ + +
Sbjct: 276 HLSVRILGWG---MYEG----VPYWLLANSWGKEWGENGTFRALRGTNECGLEANCVSAM 328
Query: 495 PKIG 498
PK+G
Sbjct: 329 PKLG 332
>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
Length = 359
Score = 191 bits (486), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 111/295 (37%), Positives = 165/295 (55%), Gaps = 34/295 (11%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARI 105
P+ + A S T+ + + +GV + P+ L ++ P +LP+ FDAR
Sbjct: 55 PEAGWEAAINPRFSNFTVGQFKRLLGV---KQAPKKELLSTPVVTHPKSLKLPKEFDARA 111
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGN 164
W C TI +I DQG CGS WA GAVE++ DR C S ++ LS +DL++CC CG
Sbjct: 112 AWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFC--SHFDMNISLSVNDLLACCGFLCGA 169
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEP--NT 220
GC GG AW+Y G+V+ + C PY +I C SH C EP T
Sbjct: 170 GCDGGTPIYAWRYLAHHGVVT-------EECDPYFDQIGC------SHPGC---EPAYQT 213
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
P+C+RKC G + ++ ++ AY + ++ + IM E++++GPVE + T++ D YK+
Sbjct: 214 PKCVRKCVKGNQI-WKRSKHYSVKAYRVKSDPQDIMTEVYKNGPVEVAFTVFEDFAHYKS 272
Query: 281 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+YKH+ G LG HA+++IGWG GE YWL+AN +NTNWG++G F+I
Sbjct: 273 GVYKHITGSALGGHAVKLIGWGTSDEGE------DYWLLANQWNTNWGDDGYFKI 321
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 106/171 (61%), Gaps = 12/171 (7%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C+RKC G + ++ ++ AY + ++ + IM E+
Sbjct: 193 CDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQI-WKRSKHYSVKAYRVKSDPQDIMTEV 251
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T++ D YK+G+YKH+ G LG HA+++IGWG GE YWL+
Sbjct: 252 YKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGE------DYWLL 305
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK---IGLEIDSNEINLG 509
AN +NTNWG++G F+I RG NECGIE D+TAGLP I E+ +++ G
Sbjct: 306 ANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDVDAG 356
>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
gi|1586011|prf||2202319A cathepsin B-like Cys protease
Length = 340
Score = 191 bits (485), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 121/335 (36%), Positives = 174/335 (51%), Gaps = 23/335 (6%)
Query: 3 KSTADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKL 62
KS VA F+ L + S+ L K+F +S + + ++
Sbjct: 6 KSALCLVAVFVVLLATTVSALYAKPSDIPLLGKSFVAETNSKAKGQWTASADNGHLVTGK 65
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
+L E+ MGV S + P + + ++LPE FDA WP C TI EIRDQ +C
Sbjct: 66 SLEEVRKLMGV--TSMSTEAVPPRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNC 123
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA+ AVEAMSDR C S G R+S+ +L+SCC CG GC GG AW +WV G
Sbjct: 124 GSCWAIAAVEAMSDRYCTMS-GIPDRRISTTNLLSCCFICGFGCYGGIPAMAWLWWVWVG 182
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGS-HSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
+ + + C+PY PC + N S + C + NTP+C C +V E
Sbjct: 183 VTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKCNTTCD---NVEMELVKY 232
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
G +YS+ E + E+ +GP+E +M +YAD + YK+G+YKHV+G LG HA++++G
Sbjct: 233 KGVSSYSIKGERE-LDHELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVG 291
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
W G + YW +ANS+NT+WG+ G F I
Sbjct: 292 W-------GVKDGIPYWKIANSWNTDWGDKGYFLI 319
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/161 (39%), Positives = 92/161 (57%), Gaps = 13/161 (8%)
Query: 337 CRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C+PY PC + N S+ C NTP+C C +V E G +YS+
Sbjct: 188 CQPYPFGPCSHHGNSSKYPPCPNTIYNTPKCNTTCD---NVEMELVKYKGVSSYSIKGER 244
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E + E+ +GP+E +M +YAD + YK+G+YKHV+G LG HA++++GW G
Sbjct: 245 E-LDHELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGW-------GVKD 296
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YW +ANS+NT+WG+ G F I RG +ECGIE+ AG P
Sbjct: 297 GIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337
>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
Length = 340
Score = 191 bits (485), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 109/308 (35%), Positives = 162/308 (52%), Gaps = 34/308 (11%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLL-----VQLSDPLEE---------- 96
Y E++ ++K+ + GV+ D K P+ + L VQ+ L
Sbjct: 22 YFLEEDYINKINEQATTWKAGVNFDPKTPKEHILKLLGSKGVQIPSKLNHKMYKSEDENY 81
Query: 97 ------LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+P FDAR W C TI IRDQG+CGS WAL A +DR+C+ S + L
Sbjct: 82 DNLFGRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAFADRLCVVSNEDFNQLL 141
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSH 210
S+++L CC CG GC GG+ KAW+++ G+V+GG Y S +GC PY +P Y +
Sbjct: 142 SAEELTFCCHKCGFGCNGGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRVPPCPYDESGN 201
Query: 211 SSCQDN--EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
++C E N C R C D+ +++D + R +Y L +I +++ +GPVE S
Sbjct: 202 NTCAGKPMEANH-RCTRMCYGDQDLDFDEDHRYTRDSYYLTYG--SIQKDVLTYGPVEAS 258
Query: 269 MTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW 327
+Y D YK+G+Y + LG HA ++IGWG+E V YWL+ NS+N +W
Sbjct: 259 FDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWGEE-------YGVPYWLMVNSWNADW 311
Query: 328 GENGLFRI 335
G+NGLF+I
Sbjct: 312 GDNGLFKI 319
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 62/163 (38%), Positives = 91/163 (55%), Gaps = 13/163 (7%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC PY +P Y ++C A +P C R C D+ +++D + R +Y L
Sbjct: 185 GCEPYRVPPCPYDESGNNTC-AGKPMEANHRCTRMCYGDQDLDFDEDHRYTRDSYYLTYG 243
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+I +++ +GPVE S +Y D YK+G+Y + LG HA ++IGWG+E
Sbjct: 244 --SIQKDVLTYGPVEASFDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWGEE------ 295
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N +WG+NGLF+I RG NECGI+ T G+P
Sbjct: 296 -YGVPYWLMVNSWNADWGDNGLFKIQRGTNECGIDNSTTGGVP 337
>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
Length = 256
Score = 191 bits (485), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 99/245 (40%), Positives = 143/245 (58%), Gaps = 13/245 (5%)
Query: 91 SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+D +E+P FDAR W C TI E+RDQG+CGS WAL A +DR+C+A+ G + L
Sbjct: 22 ADNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSAFADRLCVATNGDFNQLL 81
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSH 210
S++++ CC CGNGC GG+ +AWK + G+V+GG Y S +GC PY +P Y
Sbjct: 82 SAEEITFCCHKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGK 141
Query: 211 SSC--QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
++C Q EPN +C +KC D+ + D + R Y L I +++ +GP+E S
Sbjct: 142 NTCSGQPMEPNH-KCSKKCYGDEDIDFNKDHRYTRDDYYLTY--RGIQKDVINYGPIEAS 198
Query: 269 MTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW 327
+Y D YK+GIY K LG H++++IGWG+E V YWL+ NS+N +W
Sbjct: 199 FDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWGEE-------YGVLYWLMVNSWNADW 251
Query: 328 GENGL 332
G+ GL
Sbjct: 252 GDKGL 256
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 76/142 (53%), Gaps = 13/142 (9%)
Query: 336 GCRPYEIPCERYMNGSRSSC--QANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC PY +P Y +++C Q EPN +C +KC D+ + D + R Y L
Sbjct: 125 GCEPYRVPPCPYDKDGKNTCSGQPMEPNH-KCSKKCYGDEDIDFNKDHRYTRDDYYLTY- 182
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGT 452
I +++ +GP+E S +Y D YK+GIY K LG H++++IGWG+E
Sbjct: 183 -RGIQKDVINYGPIEASFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWGEE------ 235
Query: 453 SSVVKYWLVANSFNTNWGENGL 474
V YWL+ NS+N +WG+ GL
Sbjct: 236 -YGVLYWLMVNSWNADWGDKGL 256
>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
Length = 340
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 160/307 (52%), Gaps = 32/307 (10%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRL----------------PLLVQLSDPL- 94
Y EK+ ++++ + + GV+ D KL + P + + D
Sbjct: 22 YFLEKDYINQINANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPDMFKTHDEAY 81
Query: 95 ----EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+P FDAR W C TI E+RDQG CGS WA G A +DR+CIA+ G+ + L
Sbjct: 82 NSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELL 141
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGS 209
S ++L CC CG GC GG+ +AW+ + G+V+GG Y S +GC+PY + PC G+
Sbjct: 142 SPEELAFCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGN 201
Query: 210 HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
++ C R C D+ +++D ++ R AY L TI +I +GP+E S
Sbjct: 202 NTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG--TIQNDILAYGPIEASF 259
Query: 270 TIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
+Y D YK+G+Y + LG HA+++IGWG+E V YWL+ NS+N WG
Sbjct: 260 EVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEE-------YGVPYWLLVNSWNDQWG 312
Query: 329 ENGLFRI 335
+ GLF+I
Sbjct: 313 DQGLFKI 319
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 88/162 (54%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C G+ + C R C D+ +++D ++ R AY L
Sbjct: 185 GCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG- 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
TI +I +GP+E S +Y D YK+G+Y + LG HA+++IGWG+E
Sbjct: 244 -TIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEE------- 295
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECGI+ T G+P
Sbjct: 296 YGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 337
>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
Length = 334
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 97/241 (40%), Positives = 140/241 (58%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W C TI E+RDQG CGS WA G A +DR+CIA+ G+ + LS+++L
Sbjct: 85 IPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELA 144
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ +AW+ + G+V+GG Y S +GC+PY + PC G+++
Sbjct: 145 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNTCRGK 204
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C D+ +++D ++ R AY L TI +I +GP+E S +Y D
Sbjct: 205 PAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG--TIQNDILAYGPIEASFEVYDDF 262
Query: 276 ILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y + LG HA+++IGWG+E V YWL+ NS+N WG+ GLF+
Sbjct: 263 PSYKSGVYTKMENATYLGGHAVKLIGWGEE-------YGVPYWLLVNSWNDQWGDQGLFK 315
Query: 335 I 335
I
Sbjct: 316 I 316
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 88/162 (54%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C G+ + C R C D+ +++D ++ R AY L
Sbjct: 182 GCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG- 240
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
TI +I +GP+E S +Y D YK+G+Y + LG HA+++IGWG+E
Sbjct: 241 -TIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEE------- 292
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECGI+ T G+P
Sbjct: 293 YGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
Length = 313
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 115/266 (43%), Positives = 156/266 (58%), Gaps = 20/266 (7%)
Query: 75 PDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAM 134
P KL + RL + + LP+ FDAR WP C ++ EIR QG CGS + AM
Sbjct: 40 PLEKLKETRLHPAINVFAEDLVLPKSFDARQQWPQCSSLNEIRTQGCCGSCAYVSGASAM 99
Query: 135 SDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA-WKYWVTTGIVSGGTYASKQ 193
+DR CI S+GK+ + DL+SCC +CG GC GG W YWV G+ SGG Y S Q
Sbjct: 100 TDRWCIHSKGKKQFTFGAFDLLSCCYECGGGCTGGGIPGPIWSYWVKQGVSSGGPYGSNQ 159
Query: 194 GCRPYEIP--CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED--DLNFGRIAYSLP 249
GC PY +P C + G + P+ P C +C GY+V+ ED D FGR+AYS+P
Sbjct: 160 GCHPYPMPPSCPKPSEGDY-------PDEPNCSTRCNAGYNVT-EDLRDRRFGRVAYSIP 211
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
A+E IM +IF +GPV+ Y D++ Y G+Y+H +G G HA+++IGWG E +G
Sbjct: 212 ADERKIMEDIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVKLIGWGVE---DG 268
Query: 310 TSSVVKYWLVANSFNTNWGENGLFRI 335
T KYWLVANS+ WG++G F++
Sbjct: 269 T----KYWLVANSWGRVWGDDGFFKM 290
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 74/166 (44%), Positives = 104/166 (62%), Gaps = 19/166 (11%)
Query: 336 GCRPYEIP--CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYED--DLNFGRIAYSLP 391
GC PY +P C + G + P+ P C +C GY+V+ ED D FGR+AYS+P
Sbjct: 160 GCHPYPMPPSCPKPSEG-------DYPDEPNCSTRCNAGYNVT-EDLRDRRFGRVAYSIP 211
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
A+E IM +IF +GPV+ Y D++ Y G+Y+H +G G HA+++IGWG E +G
Sbjct: 212 ADERKIMEDIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVKLIGWGVE---DG 268
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
T KYWLVANS+ WG++G F++VRG+N CGIE ++ AGLP
Sbjct: 269 T----KYWLVANSWGRVWGDDGFFKMVRGENHCGIEENVHAGLPSF 310
>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
Length = 335
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 108/292 (36%), Positives = 169/292 (57%), Gaps = 23/292 (7%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPL----EELPEGFDARINW 107
+ A++N +T ++ +G + +P++ L++ +D E+P FDARI W
Sbjct: 39 WKAKQNFPEYMTKEQIVRLLGSKNLTSVPKS----LIKENDSEYINDSEIPNFFDARIQW 94
Query: 108 PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQ 167
+C TI E+R+QG+CGS WA G A +DR+CIA+ G + +S+++L CC CG GC
Sbjct: 95 SHCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATNGDFNELISAEELTFCCHRCGFGCN 154
Query: 168 GGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC--QDNEPNTPECI 224
GG KAW+Y+ G+V+GG Y + GC+PY++ PC + G H+SC Q EPN +C
Sbjct: 155 GGNPLKAWQYFKRHGVVTGGNYNTTDGCQPYKVPPCVKDEEG-HNSCSGQPTEPNH-KCS 212
Query: 225 RKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK 284
R C Y+ + AY L N +T+ ++ +GP+E S +Y D + Y++G+Y+
Sbjct: 213 RSCYGDKTCDYKKGHYKTKNAYYL--NIDTMQKDTIAYGPIEASFDVYDDFVNYESGVYQ 270
Query: 285 HVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HA+++IGWG+E +GT YWL+ NS+ WG NG+F+I
Sbjct: 271 KTEDAKYLGGHAVKMIGWGEE---DGTP----YWLMVNSWGEQWGANGMFKI 315
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/165 (39%), Positives = 96/165 (58%), Gaps = 13/165 (7%)
Query: 336 GCRPYEIP-CERYMNGSRS-SCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC+PY++P C + G S S Q EPN +C R C Y+ + AY L N
Sbjct: 181 GCQPYKVPPCVKDEEGHNSCSGQPTEPNH-KCSRSCYGDKTCDYKKGHYKTKNAYYL--N 237
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGT 452
+T+ ++ +GP+E S +Y D + Y++G+Y+ LG HA+++IGWG+E +GT
Sbjct: 238 IDTMQKDTIAYGPIEASFDVYDDFVNYESGVYQKTEDAKYLGGHAVKMIGWGEE---DGT 294
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ NS+ WG NG+F+I+RG NECGIE TAG+P +
Sbjct: 295 P----YWLMVNSWGEQWGANGMFKILRGTNECGIEGSPTAGVPLV 335
>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
Length = 356
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 97/243 (39%), Positives = 151/243 (62%), Gaps = 18/243 (7%)
Query: 96 ELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+LP+ FD+R + C I I+DQ +CGS WA+ + + DR+CIAS G++ V +S+ D
Sbjct: 107 KLPQHFDSRKQFTKCAKVIGTIQDQSNCGSCWAVSSASVIQDRICIASNGEQKVHISAQD 166
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQ 214
++SC D GC GG+ +A++++ +G+V+G ++ QGC+PY H++
Sbjct: 167 ILSCATDRSQGCNGGYPDEAFEHYAQSGVVTGSGNSANQGCKPYPFL-------PHTTV- 218
Query: 215 DNEPNTPECIRKCQP-GYDVSYEDDLNFGRIAYSLPANEET-IMREIFRHGPVEGSMTIY 272
E +TPEC +KC+ Y +Y+ D +FG Y++ ++ I EI +GPVE +M +Y
Sbjct: 219 --EYSTPECSKKCENYQYKKAYKQDKHFGMSVYNVQFSDPVDIQYEIMNNGPVEANMIVY 276
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
D + YK+G+Y+ V PLG HA+RI+GWG + + V YWLVANS+NT+WGE+G
Sbjct: 277 YDFMFYKSGVYQTVFPWPLGGHAVRIVGWGVD-----GPTKVPYWLVANSWNTDWGEDGY 331
Query: 333 FRI 335
FRI
Sbjct: 332 FRI 334
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 64/155 (41%), Positives = 92/155 (59%), Gaps = 17/155 (10%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQP-GYDVSYEDDLNFGRIAYSLPANE 394
GC+PY + E +TPEC +KC+ Y +Y+ D +FG Y++ ++
Sbjct: 206 GCKPYPFLPHTTV----------EYSTPECSKKCENYQYKKAYKQDKHFGMSVYNVQFSD 255
Query: 395 ET-IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
I EI +GPVE +M +Y D + YK+G+Y+ V PLG HA+RI+GWG +
Sbjct: 256 PVDIQYEIMNNGPVEANMIVYYDFMFYKSGVYQTVFPWPLGGHAVRIVGWGVD-----GP 310
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEA 488
+ V YWLVANS+NT+WGE+G FRI RG +E IE+
Sbjct: 311 TKVPYWLVANSWNTDWGEDGYFRIRRGTDESYIES 345
>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 276
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/246 (39%), Positives = 141/246 (57%), Gaps = 11/246 (4%)
Query: 92 DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLS 151
D +E+P FDAR W C TI E+RDQG CGS WA+ A SDR+C+A+ G + LS
Sbjct: 20 DNYQEIPIKFDARKKWLRCKTIGEVRDQGHCGSDWAMSTSSAFSDRLCVATNGDFNQLLS 79
Query: 152 SDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSH 210
++++ CC CG+GC GG+ +AWK + G+V+GG Y S +GC PY + PC G++
Sbjct: 80 AEEITFCCHTCGDGCSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYRVPPCPNDDQGNN 139
Query: 211 SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
+ C R C D+ +++D + R Y L I +++ +GP+E S
Sbjct: 140 TCSGQPMEKNHRCTRMCYGDQDLDFDEDHRYTRDHYYLTY--RGIQKDVINYGPIEASFD 197
Query: 271 IYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 329
+Y D YK+GIY K LG H++++IGWG+E V YWL+ NS+N +WG+
Sbjct: 198 VYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEE-------YGVLYWLMVNSWNADWGD 250
Query: 330 NGLFRI 335
GLF+I
Sbjct: 251 KGLFKI 256
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 86/162 (53%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C G+ + C R C D+ +++D + R Y L
Sbjct: 122 GCEPYRVPPCPNDDQGNNTCSGQPMEKNHRCTRMCYGDQDLDFDEDHRYTRDHYYLTY-- 179
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
I +++ +GP+E S +Y D YK+GIY K LG H++++IGWG+E
Sbjct: 180 RGIQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEE------- 232
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N +WG+ GLF+I RG NECG++ T G+P
Sbjct: 233 YGVLYWLMVNSWNADWGDKGLFKIRRGTNECGVDNSTTGGVP 274
>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
Length = 196
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 92/189 (48%), Positives = 124/189 (65%), Gaps = 6/189 (3%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTG 182
S WA GA EAMSDR+CIAS+GK V +S+DD++SCC K CGNGC+GG+ +AWKYWV TG
Sbjct: 1 SCWAFGAAEAMSDRICIASQGKTQVTISADDVLSCCGKKCGNGCEGGYPIEAWKYWVKTG 60
Query: 183 IVSGGTYASKQGCRPYEIP-CERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
I +GG+Y S+ GC+PY IP C + N ++ C +E +TP C KC Y Y DD +
Sbjct: 61 ICTGGSYESQSGCKPYPIPPCGHHKNQTYFGPCPTDEYDTPVCTNKCIAAYKTPYSDDKH 120
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+G AY++ I +EI +GPVE + T+Y D Y G+Y H G +G HA+RI+G
Sbjct: 121 YGTSAYNVAKTVAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVYTHTGGAEVGGHAVRILG 180
Query: 301 WG---QEPL 306
WG Q+P+
Sbjct: 181 WGVRQQDPI 189
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 66/118 (55%), Gaps = 5/118 (4%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC+PY IP C + N + C +E +TP C KC Y Y DD ++G AY++
Sbjct: 72 GCKPYPIPPCGHHKNQTYFGPCPTDEYDTPVCTNKCIAAYKTPYSDDKHYGTSAYNVAKT 131
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG---QEPL 448
I +EI +GPVE + T+Y D Y G+Y H G +G HA+RI+GWG Q+P+
Sbjct: 132 VAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVYTHTGGAEVGGHAVRILGWGVRQQDPI 189
>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
Length = 324
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 107/272 (39%), Positives = 149/272 (54%), Gaps = 40/272 (14%)
Query: 86 LLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGK 145
L V+ E LP+ FDAR WP C TI+ IR+Q +CGS WA GA E +SDRVCI S G
Sbjct: 19 LFVRGEIVPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGT 78
Query: 146 RHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCER 204
+ +S +D++SCC CG GC+GG+ +A ++W ++G V+GG Y GC PY
Sbjct: 79 QQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGG-HGCMPY------ 131
Query: 205 YMNGSHSSCQDN--EPNTPECIRKCQPGYDV-SYEDDLNFGRI----------------A 245
S + C N E TP C CQ Y Y+ D ++G + A
Sbjct: 132 ----SFAPCTKNCPESTTPSCKTTCQSSYKTEEYKKDKHYGELVWHSFNRFQRFLNRASA 187
Query: 246 YSLPANEET--IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 303
Y + + I EI+ +GPVE S +Y D YK+G+Y + +G +G HA++IIGWG
Sbjct: 188 YKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGV 247
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E + V YWL+ANS+ T++GE G F+I
Sbjct: 248 E-------NGVDYWLIANSWGTSFGEKGFFKI 272
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 99/201 (49%), Gaps = 36/201 (17%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YEDDLNFGRI------- 386
GC PY PC + E TP C CQ Y Y+ D ++G +
Sbjct: 127 GCMPYSFAPCTK---------NCPESTTPSCKTTCQSSYKTEEYKKDKHYGELVWHSFNR 177
Query: 387 ---------AYSLPANEET--IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
AY + + I EI+ +GPVE S +Y D YK+G+Y + +G +G
Sbjct: 178 FQRFLNRASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGG 237
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
HA++IIGWG E + V YWL+ANS+ T++GE G F+I RG NEC IE ++ AG+
Sbjct: 238 HAVKIIGWGVE-------NGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAGIA 290
Query: 496 KIGLEIDSNEINLGKMMTLPL 516
K+G ++ E + G +
Sbjct: 291 KLGTHSETYEDDGGAATSCSF 311
>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
Length = 334
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 97/241 (40%), Positives = 142/241 (58%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W C TI E+RDQG CGS WA G A +DR+CIA+ G+ + LS+++L
Sbjct: 85 IPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELA 144
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ KAW+++ G+V+GG Y S +GC+PY + PC G+++
Sbjct: 145 FCCHKCGFGCHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYGNNTCRGK 204
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C ++ +++D ++ R AY L TI +++ +GP+E S +Y D
Sbjct: 205 PAEKNHRCTRMCYGNQELDFKEDHHWTRDAYYLTYT--TIQKDVMAYGPIEASFDVYDDF 262
Query: 276 ILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y K LG HA+++IGWG+E V YWL+ NS+N WG+ GLF+
Sbjct: 263 PNYKSGVYMKTENASYLGGHAVKLIGWGEE-------YGVPYWLLVNSWNDQWGDQGLFK 315
Query: 335 I 335
I
Sbjct: 316 I 316
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 90/162 (55%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C G+ + C R C ++ +++D ++ R AY L
Sbjct: 182 GCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQELDFKEDHHWTRDAYYLTYT- 240
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
TI +++ +GP+E S +Y D YK+G+Y K LG HA+++IGWG+E
Sbjct: 241 -TIQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWGEE------- 292
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I+RG NECGI+ T G+P
Sbjct: 293 YGVPYWLLVNSWNDQWGDQGLFKILRGTNECGIDNSTTGGVP 334
>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 355
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 97/251 (38%), Positives = 142/251 (56%), Gaps = 19/251 (7%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
E PE FDAR +W C +I I +QG+C + WA+ AM+DR+CIAS+G S
Sbjct: 94 NETPESFDARYHWFNCTSISHIWNQGNCAADWAISVTSAMNDRICIASQGNITALYSPQK 153
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMN------ 207
LVSCC+DCGNGC GG+ AW+Y + GIV+GG Y S +GC+P+ + PC
Sbjct: 154 LVSCCEDCGNGCSGGYTAAAWRYILKKGIVTGGDYGSNEGCQPWLVQPCNASTTAADPSS 213
Query: 208 --GSHSSCQDNEPNTPECIRKCQPG-YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP 264
G H C + TP+C C ++ Y DD+ + ++ + + + +HGP
Sbjct: 214 VLGPHGVCGGDPATTPKCDLSCYNARHEGKYLDDIIKAKKVFTFDGC--SARKNLRKHGP 271
Query: 265 VEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
+M +Y D + YK+G+Y HV G LG ++R+IGWG E G+ +WL+ANS+
Sbjct: 272 YVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSVRMIGWGLEG-GQA------FWLLANSWG 324
Query: 325 TNWGENGLFRI 335
T+WG+ G F+I
Sbjct: 325 TSWGDKGFFKI 335
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 56/172 (32%), Positives = 89/172 (51%), Gaps = 19/172 (11%)
Query: 336 GCRPYEI-PCERYMNGSRSS--------CQANEPNTPECIRKCQPG-YDVSYEDDLNFGR 385
GC+P+ + PC + S C + TP+C C ++ Y DD+ +
Sbjct: 193 GCQPWLVQPCNASTTAADPSSVLGPHGVCGGDPATTPKCDLSCYNARHEGKYLDDIIKAK 252
Query: 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 445
++ + + + +HGP +M +Y D + YK+G+Y HV G LG ++R+IGWG
Sbjct: 253 KVFTFDGC--SARKNLRKHGPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSVRMIGWGL 310
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
E G+ +WL+ANS+ T+WG+ G F+I R NEC IE AG+P +
Sbjct: 311 EG-GQA------FWLLANSWGTSWGDKGFFKIRRFVNECWIENFRYAGVPNL 355
>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
Length = 228
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 95/210 (45%), Positives = 128/210 (60%), Gaps = 9/210 (4%)
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A GAVEAMSDR+CI + G R+S+ DL+SCC CG GCQGGF AW +W T GIV+G
Sbjct: 1 AFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGYCGFGCQGGFPPTAWDFWQTEGIVTG 60
Query: 187 GTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA 245
G+ + GCR Y P C + + + C +TP C++KC D Y D I
Sbjct: 61 GSKENPTGCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTP-DTDYATDKTRANIT 119
Query: 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 305
Y++ A + IM+EI +GPVE + +Y D + YK+G+Y H G LG HAIRI+GWG+E
Sbjct: 120 YNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEE- 178
Query: 306 LGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ V YWL+ANS+N WGE+G F++
Sbjct: 179 ------NGVAYWLIANSWNDGWGEDGYFKM 202
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 100/163 (61%), Gaps = 9/163 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCR Y P C + + C +TP C++KC D Y D I Y++ A +
Sbjct: 68 GCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTP-DTDYATDKTRANITYNVKAKQ 126
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
IM+EI +GPVE + +Y D + YK+G+Y H G LG HAIRI+GWG+E +
Sbjct: 127 NAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEE-------N 179
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ANS+N WGE+G F+++RG+NECGIE ++TAGLP++
Sbjct: 180 GVAYWLIANSWNDGWGEDGYFKMLRGKNECGIEDEVTAGLPEL 222
>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 112/281 (39%), Positives = 155/281 (55%), Gaps = 27/281 (9%)
Query: 59 LSKLTLSELEMRMGV--HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ LT SE + G S LP P+ ELPE FDA +WP+CPTI+EI
Sbjct: 55 MQNLTFSEAKRLTGAFSRKTSSLP----PVRFTEEQLRTELPESFDAAEHWPHCPTIREI 110
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +C + WA+ A+SDR C +GK+ +R+S+ DL++CCKDCG GC+GG+ AW+
Sbjct: 111 ADQSACRASWAVATASAISDRYCTVGKGKQ-LRISAADLMACCKDCGGGCEGGYPDAAWE 169
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CE-RYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V+ GI S C+PY P CE R G C + TP+C C D S
Sbjct: 170 YYVSHGITS-------SQCQPYPFPRCEHRGAQGKKPPCSKYKFVTPQCNATCT---DKS 219
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +Y + EE RE++ +GP +++D + YK+G+Y+HVAG LG
Sbjct: 220 VPLIKYRGNHSYEV-RGEEDYKRELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGK 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+RI+GWG+ GT YW VANS++T+WG NG F I
Sbjct: 279 AVRIVGWGKL---NGTP----YWKVANSWDTDWGMNGYFLI 312
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/168 (39%), Positives = 91/168 (54%), Gaps = 13/168 (7%)
Query: 330 NGLFRIGCRPYEIP-CE-RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
+G+ C+PY P CE R G + C + TP+C C D S G +
Sbjct: 174 HGITSSQCQPYPFPRCEHRGAQGKKPPCSKYKFVTPQCNATCT---DKSVPLIKYRGNHS 230
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
Y + EE RE++ +GP +++D + YK+G+Y+HVAG LG A+RI+GWG+
Sbjct: 231 YEV-RGEEDYKRELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKL- 288
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT YW VANS++T+WG NG F I+RG NEC IE AG P
Sbjct: 289 --NGTP----YWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAGTP 330
>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
Length = 334
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 97/241 (40%), Positives = 139/241 (57%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W C TI E+RDQG CGS WA G A +DR+CIA+ G+ + LS ++L
Sbjct: 85 IPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ +AW+ + G+V+GG Y S +GC+PY + PC G+++
Sbjct: 145 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNTCRGK 204
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C D+ +++D ++ R AY L TI +I +GP+E S +Y D
Sbjct: 205 PAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG--TIQNDILAYGPIEASFEVYDDF 262
Query: 276 ILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y + LG HA+++IGWG+E V YWL+ NS+N WG+ GLF+
Sbjct: 263 PSYKSGVYTKMENATYLGGHAVKLIGWGEE-------YGVPYWLLVNSWNDQWGDQGLFK 315
Query: 335 I 335
I
Sbjct: 316 I 316
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 88/162 (54%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C G+ + C R C D+ +++D ++ R AY L
Sbjct: 182 GCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG- 240
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
TI +I +GP+E S +Y D YK+G+Y + LG HA+++IGWG+E
Sbjct: 241 -TIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEE------- 292
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECGI+ T G+P
Sbjct: 293 YGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
Length = 340
Score = 189 bits (481), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 97/241 (40%), Positives = 139/241 (57%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W C TI E+RDQG CGS WA G A +DR+CIA+ G+ + LS ++L
Sbjct: 88 IPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 147
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ +AW+ + G+V+GG Y S +GC+PY + PC G+++
Sbjct: 148 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNTCRGK 207
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C D+ +++D ++ R AY L TI +I +GP+E S +Y D
Sbjct: 208 PAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG--TIQNDILAYGPIEASFEVYDDF 265
Query: 276 ILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y + LG HA+++IGWG+E V YWL+ NS+N WG+ GLF+
Sbjct: 266 PSYKSGVYTKMENATYLGGHAVKLIGWGEE-------YGVPYWLLVNSWNDQWGDQGLFK 318
Query: 335 I 335
I
Sbjct: 319 I 319
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 88/162 (54%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C G+ + C R C D+ +++D ++ R AY L
Sbjct: 185 GCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG- 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
TI +I +GP+E S +Y D YK+G+Y + LG HA+++IGWG+E
Sbjct: 244 -TIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEE------- 295
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECGI+ T G+P
Sbjct: 296 YGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 337
>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
Length = 348
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 101/251 (40%), Positives = 143/251 (56%), Gaps = 20/251 (7%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FD R W C ++ IRDQ CGS WA+ A E MSDR+C+ S +S D++
Sbjct: 84 IPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKACISDTDIL 142
Query: 157 SCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI--PCERYMN-GSHSS 212
SCC CG GC GGF +AW+++ G +GG K GC+PY+ P R++ ++
Sbjct: 143 SCCGLYCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRNDYAP 202
Query: 213 CQDNE--------PNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP 264
C ++ +TP C R+C GY SY D +G+ AY + + + I REI ++GP
Sbjct: 203 CPNDTYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKNGP 262
Query: 265 VEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
V S +Y D YK+GIYKH AG G HA++IIGWG+E + +WL+ANS++
Sbjct: 263 VVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKIIGWGKE-------NNTDFWLIANSWH 315
Query: 325 TNWGENGLFRI 335
+WGE G FRI
Sbjct: 316 QDWGEKGYFRI 326
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 106/181 (58%), Gaps = 18/181 (9%)
Query: 328 GENGLFRIGCRPYEI--PCERYMNGSRSSCQANE---------PNTPECIRKCQPGYDVS 376
G + + GC+PY+ P R++ + + N+ +TP C R+C GY S
Sbjct: 173 GGKTIDKYGCKPYKPTGPIGRHLKRNDYAPCPNDTYYGECVGMADTPRCKRRCLLGYPKS 232
Query: 377 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 436
Y D +G+ AY + + + I REI ++GPV S +Y D YK+GIYKH AG G H
Sbjct: 233 YPSDRYYGKSAYIVKQSVKAIQREIMKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYH 292
Query: 437 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
A++IIGWG+E + +WL+ANS++ +WGE G FRIVRG+NECGIE D+ AG+
Sbjct: 293 AVKIIGWGKE-------NNTDFWLIANSWHQDWGEKGYFRIVRGKNECGIETDVVAGIVT 345
Query: 497 I 497
I
Sbjct: 346 I 346
>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
Length = 199
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 94/195 (48%), Positives = 124/195 (63%), Gaps = 4/195 (2%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA+ + AMSDRVCIA++G + V +S D+VSCC CG GCQGG+ +AW Y+ G+
Sbjct: 1 SCWAVSSASAMSDRVCIATQGAKQVLISDQDIVSCCTWCGYGCQGGWSIRAWYYFAEQGV 60
Query: 184 VSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
V+GG Y +K CRPYEI PC + + + D+ +TP C R+CQ GY SY D ++G
Sbjct: 61 VTGGNYNTKGSCRPYEIHPCGYHKDEPYYGECDDLADTPRCKRRCQLGYPKSYPSDKHYG 120
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 302
R AY LP + E+I REI R+GPV T+Y D YK GIYKH +G G HA+++IGWG
Sbjct: 121 RTAYQLPMSVESIQREIMRNGPVVAGFTVYEDFAHYKGGIYKHTSGKKTGGHAVKVIGWG 180
Query: 303 QEPLGEGTSSVVKYW 317
E G S + YW
Sbjct: 181 SEQKG---SEKIPYW 192
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 57/124 (45%), Positives = 73/124 (58%), Gaps = 4/124 (3%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
CRPYEI PC + + + +TP C R+CQ GY SY D ++GR AY LP + E
Sbjct: 72 CRPYEIHPCGYHKDEPYYGECDDLADTPRCKRRCQLGYPKSYPSDKHYGRTAYQLPMSVE 131
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
+I REI R+GPV T+Y D YK GIYKH +G G HA+++IGWG E G S
Sbjct: 132 SIQREIMRNGPVVAGFTVYEDFAHYKGGIYKHTSGKKTGGHAVKVIGWGSEQKG---SEK 188
Query: 456 VKYW 459
+ YW
Sbjct: 189 IPYW 192
>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
Length = 334
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 97/241 (40%), Positives = 139/241 (57%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W C TI E+RDQG CGS WA G A +DR+CIA+ G+ + LS ++L
Sbjct: 85 IPSYFDARKKWRKCLTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ KAW+ + G+V+GG Y S +GC+PY + PC G+++
Sbjct: 145 FCCHKCGFGCSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPPCPLDEYGNNTCSGK 204
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C D+ +++D ++ R AY L TI ++ +GP+E S +Y D
Sbjct: 205 PTEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG--TIQNDVLAYGPIEASFEVYDDF 262
Query: 276 ILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y + LG HA+++IGWG+E V YWL+ NS+N WG+ GLF+
Sbjct: 263 PSYKSGVYTKMENATYLGGHAVKLIGWGEE-------YGVPYWLLVNSWNDQWGDQGLFK 315
Query: 335 I 335
I
Sbjct: 316 I 316
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 88/162 (54%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C G+ + C R C D+ +++D ++ R AY L
Sbjct: 182 GCQPYRVPPCPLDEYGNNTCSGKPTEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG- 240
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
TI ++ +GP+E S +Y D YK+G+Y + LG HA+++IGWG+E
Sbjct: 241 -TIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEE------- 292
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECGI+ T G+P
Sbjct: 293 YGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 217
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 91/202 (45%), Positives = 131/202 (64%), Gaps = 9/202 (4%)
Query: 135 SDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQG 194
SDR+CI ++GK V +S++DL++CC CG+GC GG+ AW+++ GIV+GG Y ++ G
Sbjct: 1 SDRICIHTKGKVQVNISAEDLLTCCDSCGSGCNGGYPSAAWQFYKDEGIVTGGLYGTEDG 60
Query: 195 CRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 253
C+PY P CE + G +C +P TPEC + C+ GY+ SY D +FG+ YS+ ++E
Sbjct: 61 CQPYYFPPCEHHTVGPLPNCTGIKP-TPECAKTCREGYEKSYTRDKHFGKKVYSISSDET 119
Query: 254 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 313
I EI ++GPVE +YAD YK+G+Y+ + LG HAIRI+GW GT
Sbjct: 120 QIKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRILGW-------GTEDG 172
Query: 314 VKYWLVANSFNTNWGENGLFRI 335
V YWLVANS+N +WG+ G F+I
Sbjct: 173 VPYWLVANSWNEDWGDKGYFKI 194
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 77/162 (47%), Positives = 103/162 (63%), Gaps = 9/162 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE + G +C +P TPEC + C+ GY+ SY D +FG+ YS+ ++E
Sbjct: 60 GCQPYYFPPCEHHTVGPLPNCTGIKP-TPECAKTCREGYEKSYTRDKHFGKKVYSISSDE 118
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI ++GPVE +YAD YK+G+Y+ + LG HAIRI+GW GT
Sbjct: 119 TQIKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRILGW-------GTED 171
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWLVANS+N +WG+ G F+I RG +ECGIE DI AG+PK
Sbjct: 172 GVPYWLVANSWNEDWGDKGYFKIRRGNDECGIENDINAGIPK 213
>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
Length = 358
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 117/333 (35%), Positives = 172/333 (51%), Gaps = 34/333 (10%)
Query: 6 ADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLS 65
A A + +L L S + D+ KA ++ P + A + T +
Sbjct: 19 AGRAAKPIPNLQLMTKEGGSSRIIQDDIIKAINK------HPNAGWTAARNPYFANYTTA 72
Query: 66 ELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSG 125
+ + +GV P N +P+ + LP+ FDAR W C TI I DQG CGS
Sbjct: 73 QFKHILGVKPTPHSVLNDVPVKTYPRSLM--LPKEFDARSAWSQCNTIGTILDQGHCGSC 130
Query: 126 WALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIV 184
WA GAVE + DR CI ++ LS +DLV+CC CG+GC GG+ AW+Y+V G+V
Sbjct: 131 WAFGAVECLQDRFCI--HFNMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVV 188
Query: 185 SGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
+ C PY ++ C+ H C+ P TP C +KC+ V E +F
Sbjct: 189 T-------DECDPYFDQVGCK------HPGCEPAYP-TPVCEKKCKVQNQVWLEKK-HFS 233
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 302
AY + ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG
Sbjct: 234 VNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWG 293
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GE YWL+AN +N WG++G F+I
Sbjct: 294 TTDAGE------DYWLLANQWNRGWGDDGYFKI 320
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 68/169 (40%), Positives = 100/169 (59%), Gaps = 16/169 (9%)
Query: 329 ENGLFRIGCRPY--EIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
NG+ C PY ++ C+ C+ P TP C +KC+ V E +F
Sbjct: 184 RNGVVTDECDPYFDQVGCKH------PGCEPAYP-TPVCEKKCKVQNQVWLEKK-HFSVN 235
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
AY + ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG
Sbjct: 236 AYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTT 295
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GE YWL+AN +N WG++G F+I+RG NECGIE D+ AG+P
Sbjct: 296 DAGE------DYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMP 338
>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 111/292 (38%), Positives = 160/292 (54%), Gaps = 28/292 (9%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
PK + S ++ E + +GV + +PLL +LP FDAR
Sbjct: 35 PKAGWEATMNPQFSNYSVGEFKYLLGVKQTPRKELRGVPLLRHPKS--MKLPIEFDARTA 92
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNG 165
WP+C TI I DQG CGS WA GAVE++SDR CI ++ LS +DL++CC CG G
Sbjct: 93 WPHCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HYGMNLSLSVNDLLACCGWMCGAG 150
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPEC 223
C GG AW+Y+V +G+V+ + C PY +I C SH C+ P TP+C
Sbjct: 151 CDGGSPIDAWRYFVQSGVVT-------EECDPYFDDIGC------SHPGCEPGFP-TPKC 196
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
RKC + + + +F AY + ++ +IM E+ +GPVE + T+Y D YK+G+Y
Sbjct: 197 ERKCADKNKL-WAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEVAFTVYEDFAHYKSGVY 255
Query: 284 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
KH+ G +G HA+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 256 KHITGDAMGGHAVKLIGWGTSEDGE------DYWLLANQWNRGWGDDGYFKI 301
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 67/169 (39%), Positives = 98/169 (57%), Gaps = 16/169 (9%)
Query: 329 ENGLFRIGCRPY--EIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
++G+ C PY +I C S C+ P TP+C RKC + + + +F
Sbjct: 165 QSGVVTEECDPYFDDIGC------SHPGCEPGFP-TPKCERKCADKNKL-WAESKHFSVN 216
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
AY + ++ +IM E+ +GPVE + T+Y D YK+G+YKH+ G +G HA+++IGWG
Sbjct: 217 AYRIDSDPHSIMAEVSSNGPVEVAFTVYEDFAHYKSGVYKHITGDAMGGHAVKLIGWGTS 276
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GE YWL+AN +N WG++G F+I RG NECGIE + AGLP
Sbjct: 277 EDGE------DYWLLANQWNRGWGDDGYFKIKRGTNECGIEGAVVAGLP 319
>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
Length = 333
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 102/279 (36%), Positives = 150/279 (53%), Gaps = 11/279 (3%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
LS + L GV K + + + + +P FDAR W C +I E+RD
Sbjct: 46 LSIDSFVNLLGSKGVQAAKKASPDMFKTGDKAYNLAQRIPSNFDARKKWKKCLSIGEVRD 105
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
QG CGS WA G A +DR+CIA+ G+ + LS+++L CC CG GC GG+ +AW+ +
Sbjct: 106 QGHCGSCWAFGTSSAFADRLCIATEGEFNELLSAEELTFCCHKCGFGCNGGYPIRAWERF 165
Query: 179 VTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
G+V+GG Y S +GC+PY + PC G+++ C R C D+ + +
Sbjct: 166 RKHGLVTGGNYDSYEGCQPYRVPPCPLDEYGNNTCHGKPMEKNHRCTRMCYGDQDLDFNN 225
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAI 296
D ++ R AY L TI ++ +GP+E S +Y D YK+G+Y K LG HA+
Sbjct: 226 DHHYTRDAYYLTYG--TIQNDVLTYGPIEASFEVYDDFPSYKSGVYVKTENASYLGGHAV 283
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++IGWG+E V YWL+ NS+N WG+ GLF+I
Sbjct: 284 KLIGWGEE-------YGVPYWLLVNSWNDQWGDQGLFKI 315
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 87/162 (53%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C G+ + C R C D+ + +D ++ R AY L
Sbjct: 181 GCQPYRVPPCPLDEYGNNTCHGKPMEKNHRCTRMCYGDQDLDFNNDHHYTRDAYYLTYG- 239
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
TI ++ +GP+E S +Y D YK+G+Y K LG HA+++IGWG+E
Sbjct: 240 -TIQNDVLTYGPIEASFEVYDDFPSYKSGVYVKTENASYLGGHAVKLIGWGEE------- 291
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECGI+ T G+P
Sbjct: 292 YGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 333
>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 113/281 (40%), Positives = 155/281 (55%), Gaps = 26/281 (9%)
Query: 59 LSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T +E + G + S LP R QL +LPE FDA +WP+CPTI+EI
Sbjct: 54 MQNITFAEAKRLTGAWIQKSSTLPPARF-TEEQLR---TKLPETFDAAEHWPHCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +C + WA+ A+SDR C GK+ +R+S+ DL+SCCK CG+GC+GGF G AW
Sbjct: 110 ADQSACRASWAVSTASAISDRYCTVGGGKQ-LRISAADLLSCCKQCGDGCKGGFPGFAWL 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CE-RYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V GI S GC+PY P CE R G+ + C + +TP+C C D S
Sbjct: 169 YYVEYGIAS-------SGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCT---DKS 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G Y L EE RE++ +GP +Y D+ YK+G+Y++V G LG
Sbjct: 219 IPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQ 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+RI+GWG+ GT YW VANS++T+WG NG I
Sbjct: 279 AVRIVGWGKL---NGTP----YWKVANSWDTDWGMNGYMLI 312
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 90/169 (53%), Gaps = 12/169 (7%)
Query: 329 ENGLFRIGCRPYEIP-CE-RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E G+ GC+PY P CE R G+++ C + +TP+C C D S G
Sbjct: 172 EYGIASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCT---DKSIPLVKYRGNA 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
Y L EE RE++ +GP +Y D+ YK+G+Y++V G LG A+RI+GWG+
Sbjct: 229 TYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL 288
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT YW VANS++T+WG NG I+RG NEC IE G P
Sbjct: 289 ---NGTP----YWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFP 330
>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
Length = 334
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 163/307 (53%), Gaps = 32/307 (10%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRL----------------PLLVQLSDPL- 94
Y E++ ++++ + + GV+ D KL + P++ + D
Sbjct: 19 YFLEEDYINQINANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPVMFKTHDEAY 78
Query: 95 ----EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+P FDAR W C TI E+RDQG+CGS WA G A +DR+CIA+ G+ + L
Sbjct: 79 NSWSNRIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEFNELL 138
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGS 209
S ++L CC CG GC GG+ +AW+ + G+V+GG Y S +GC+PY++ PC G+
Sbjct: 139 SPEELAFCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPCPLDEYGN 198
Query: 210 HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
++ C + C ++ +++D ++ R AY L TI ++ +GP+E S
Sbjct: 199 NTCSGKPAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYG--TIQNDVLAYGPIEASF 256
Query: 270 TIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
+Y D YK+G+Y + LG HA+++IGWG+E V YWL+ NS+N WG
Sbjct: 257 EVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEE-------YGVPYWLLVNSWNDQWG 309
Query: 329 ENGLFRI 335
+ GLF+I
Sbjct: 310 DQGLFKI 316
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 92/164 (56%), Gaps = 15/164 (9%)
Query: 336 GCRPYEI---PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
GC+PY++ P + Y N + S A + + C + C ++ +++D ++ R AY L
Sbjct: 182 GCQPYKVSPCPLDEYGNNTCSGKPAEKNH--RCTQMCYGNQNLDFKEDHHYTRDAYYLTY 239
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEG 451
TI ++ +GP+E S +Y D YK+G+Y + LG HA+++IGWG+E
Sbjct: 240 G--TIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEE----- 292
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECG + T G+P
Sbjct: 293 --YGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334
>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
Length = 358
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 104/290 (35%), Positives = 161/290 (55%), Gaps = 24/290 (8%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
PK + A S ++ + +GV P + +P++ +LP+ FDAR
Sbjct: 53 PKAGWKAAMNPRFSNYSVGQFMHLLGVKPTLQKDLEGVPVITH--PKTLKLPKHFDARTA 110
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNG 165
WP C TI +I DQG CGS WA GAVE++SDR CI ++ LS +DL++CC CG+G
Sbjct: 111 WPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCI--HFGMNISLSVNDLLACCGFLCGSG 168
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIR 225
C GG+ AW+Y++ G+V+ + C PY SH C+ P TP+C+R
Sbjct: 169 CDGGYPLYAWRYFIHHGVVT-------EECDPYF----DATGCSHPGCEPGYP-TPKCVR 216
Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 285
KC + + +G+ AY + ++ IM E++++GPVE + T+Y D Y++G+Y++
Sbjct: 217 KCTDENQL-WRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAHYESGVYRY 275
Query: 286 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G +G HA+++IGWG GE YW++AN +N NWG++G F I
Sbjct: 276 TTGDVMGGHAVKLIGWGTTDDGE------DYWILANQWNRNWGDDGYFMI 319
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 93/154 (60%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C+RKC + + +G+ AY + ++ IM E+
Sbjct: 191 CDPYFDATGCSHPGCEPGYPTPKCVRKCTDENQL-WRKAKRYGQSAYRISSDPYQIMAEV 249
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D Y++G+Y++ G +G HA+++IGWG GE YW++
Sbjct: 250 YKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGE------DYWIL 303
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N NWG++G F I RG NECGIE + AGLP
Sbjct: 304 ANQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLP 337
>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 110/283 (38%), Positives = 159/283 (56%), Gaps = 33/283 (11%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRL-PLLVQLSDPLEEL--PEGFDARINWPYCPTIQE 115
+K T+ L+ G P N+L P + +S ++L P+ FDAR W +CPTI +
Sbjct: 65 FAKHTIEHLKKMCGA---ILTPANKLEPSIETISHKHKKLYLPKEFDARKQWSHCPTIGD 121
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKA 174
I QG CGS WA GAVE+++DR CI V LS +DL++CC +CG GC+GG+ +A
Sbjct: 122 ILGQGHCGSCWAFGAVESLTDRFCI--HLNESVSLSENDLLACCGFECGYGCEGGYPIRA 179
Query: 175 WKYWVTTGIVSGGT--YASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
WKY+ +G+V+ Y ++GC +H C TP+C ++C D
Sbjct: 180 WKYFKHSGVVTNKCDPYFDQKGC-------------AHPGCYPTY-ETPKCEKQCVD--D 223
Query: 233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG 292
+ + G AY + E +M E++ +GPVE + +Y D YKTG+YKH+ GG +G
Sbjct: 224 EFWVQSKHLGVNAYEMSMEPEDLMAELYTNGPVEVAFEVYEDFAHYKTGVYKHLFGGFMG 283
Query: 293 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA+++IGWG T V YW + NS+NTNWGE+GLFRI
Sbjct: 284 GHAVKLIGWGT------TDDGVDYWTIVNSWNTNWGEDGLFRI 320
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 62/134 (46%), Positives = 86/134 (64%), Gaps = 8/134 (5%)
Query: 362 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 421
TP+C ++C D + + G AY + E +M E++ +GPVE + +Y D YK
Sbjct: 213 TPKCEKQCVD--DEFWVQSKHLGVNAYEMSMEPEDLMAELYTNGPVEVAFEVYEDFAHYK 270
Query: 422 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ 481
TG+YKH+ GG +G HA+++IGWG T V YW + NS+NTNWGE+GLFRIVRG
Sbjct: 271 TGVYKHLFGGFMGGHAVKLIGWGT------TDDGVDYWTIVNSWNTNWGEDGLFRIVRGN 324
Query: 482 NECGIEADITAGLP 495
+ECGIE++ AGLP
Sbjct: 325 DECGIESNAVAGLP 338
>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 303
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 110/275 (40%), Positives = 140/275 (50%), Gaps = 47/275 (17%)
Query: 63 TLSELEMRMGVHPDS-KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + ++G + L + R P V +D E+P FD+R WP C +I IRDQ
Sbjct: 55 SLDDARFQLGARREEPDLRRTRRPT-VDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSR 113
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS A GAVEAMS+R CI S GK++V LS+ DL
Sbjct: 114 CGSCCAFGAVEAMSERSCIQSGGKQNVELSAVDL-------------------------E 148
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
GIV+G + + GC PY P CE + G + C TP C CQ Y SY D
Sbjct: 149 GIVTGSSKENNTGCEPYPFPKCEHFTKGQYPPCGSKIYKTPRCKTTCQKRYKTSYAQD-- 206
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
I +EI ++GPVE S T+Y D + YK+GIYKH+ G LG HAIRIIG
Sbjct: 207 ----------KHRAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIG 256
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG E + YWL+ANS+N +WGENG FRI
Sbjct: 257 WGVE-------NKTPYWLIANSWNEDWGENGYFRI 284
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 92/159 (57%), Gaps = 20/159 (12%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY P CE + G C + TP C CQ Y SY D
Sbjct: 161 GCEPYPFPKCEHFTKGQYPPCGSKIYKTPRCKTTCQKRYKTSYAQD------------KH 208
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI ++GPVE S T+Y D + YK+GIYKH+ G LG HAIRIIGWG E +
Sbjct: 209 RAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVE-------N 261
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
YWL+ANS+N +WGENG FRIVRG++EC IE+++TAG
Sbjct: 262 KTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 300
>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
Length = 340
Score = 188 bits (477), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 93/242 (38%), Positives = 140/242 (57%), Gaps = 11/242 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+P FDAR W C TI +RDQG+CGS WA+ A +DR+C+A+ + LS++++
Sbjct: 87 RIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 146
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
CC CG GC GG+ KAW+ + G+V+GG Y S +GC PY +P Y H++C
Sbjct: 147 TFCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAG 206
Query: 216 N-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+ C R C D+ +++D + R +Y L +I +++ +GP+E S +Y D
Sbjct: 207 KPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG--SIQKDVMTYGPIEASFDVYDD 264
Query: 275 MILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
YK+G+Y K LG HA+++IGWG+E V YWL+ NS+N +WG+NGLF
Sbjct: 265 FPSYKSGVYVKSENATYLGGHAVKLIGWGEE-------YGVPYWLMVNSWNADWGDNGLF 317
Query: 334 RI 335
+I
Sbjct: 318 KI 319
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 90/162 (55%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C G + + C R C D+ +++D + R +Y L
Sbjct: 185 GCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG- 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+I +++ +GP+E S +Y D YK+G+Y K LG HA+++IGWG+E
Sbjct: 244 -SIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWGEE------- 295
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N +WG+NGLF+I RG NECGI+ TAG+P
Sbjct: 296 YGVPYWLMVNSWNADWGDNGLFKIRRGTNECGIDNSTTAGVP 337
>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 188 bits (477), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 111/281 (39%), Positives = 155/281 (55%), Gaps = 27/281 (9%)
Query: 59 LSKLTLSELEMRMGV--HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ LT SE + G S LP R +D LPE FDA +WP+CPTI+EI
Sbjct: 55 MQNLTFSEAKRLTGAFSRKTSTLPPARFTEEQLRTD----LPESFDAAEHWPHCPTIREI 110
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +C + WA+ A+SDR C +GK+ +R+S+ DL++CCKDCG GC+GG+ AW+
Sbjct: 111 ADQSACRASWAVATASAISDRYCTVGKGKQ-LRISAADLMACCKDCGGGCEGGYPDAAWE 169
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CE-RYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V+ GI S C+PY P CE R G + C + TP+C C D +
Sbjct: 170 YYVSHGIAS-------SQCQPYPFPRCEHRGAQGKKTPCSKYKFVTPQCNATCT---DKT 219
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +Y + EE RE++ +GP +++D + YK G+Y+HVAG LG
Sbjct: 220 IPLIKYRGNHSYEV-RGEEDYKRELYFNGPFVVRFQVHSDFLAYKNGVYQHVAGNFLGGK 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+RI+GWG+ GT YW VANS++T+WG NG F I
Sbjct: 279 AVRIVGWGKL---NGTP----YWKVANSWDTDWGMNGYFLI 312
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 65/168 (38%), Positives = 91/168 (54%), Gaps = 13/168 (7%)
Query: 330 NGLFRIGCRPYEIP-CE-RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
+G+ C+PY P CE R G ++ C + TP+C C D + G +
Sbjct: 174 HGIASSQCQPYPFPRCEHRGAQGKKTPCSKYKFVTPQCNATCT---DKTIPLIKYRGNHS 230
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
Y + EE RE++ +GP +++D + YK G+Y+HVAG LG A+RI+GWG+
Sbjct: 231 YEV-RGEEDYKRELYFNGPFVVRFQVHSDFLAYKNGVYQHVAGNFLGGKAVRIVGWGKL- 288
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT YW VANS++T+WG NG F I+RG NEC IE AG P
Sbjct: 289 --NGTP----YWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAGTP 330
>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 188 bits (477), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 110/252 (43%), Positives = 142/252 (56%), Gaps = 29/252 (11%)
Query: 89 QLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHV 148
+L +PL++ FDA WP CPTI EIRDQ SCGS WA+ A A+SDR C G R +
Sbjct: 87 ELREPLQDR---FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDL 142
Query: 149 RLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMN 207
R+S+ DL+SCC CG GC GG+ AW+Y+ GIVS + C+PY P C ++N
Sbjct: 143 RISAGDLMSCCDVCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVN 195
Query: 208 GSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEG 267
S S E +TP C C D G +Y L + EE+ RE+ +GP E
Sbjct: 196 SSDLSPCSGEYDTPTCNSTCT---DKKVPLIKYRGNTSYLL-SGEESFKRELLLNGPFEV 251
Query: 268 SMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ---EPLGEGTSSVVKYWLVANSFN 324
S ++YAD + Y G+YKHVAG LG HA+RI+GWG+ EP YW +ANS+N
Sbjct: 252 SFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWGELNGEP----------YWKIANSWN 301
Query: 325 TNWGENGLFRIG 336
WG NG F I
Sbjct: 302 REWGMNGYFLIA 313
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 71/172 (41%), Positives = 94/172 (54%), Gaps = 18/172 (10%)
Query: 330 NGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+G+ C+PY P C ++N S S + E +TP C C D G +Y
Sbjct: 175 HGIVSEYCQPYPFPSCAHHVNSSDLSPCSGEYDTPTCNSTCT---DKKVPLIKYRGNTSY 231
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ--- 445
L + EE+ RE+ +GP E S ++YAD + Y G+YKHVAG LG HA+RI+GWG+
Sbjct: 232 LL-SGEESFKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWGELNG 290
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
EP YW +ANS+N WG NG F I RG +ECGIE AG P+I
Sbjct: 291 EP----------YWKIANSWNREWGMNGYFLIARGVDECGIEGSGVAGTPRI 332
>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 188 bits (477), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 93/242 (38%), Positives = 140/242 (57%), Gaps = 11/242 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+P FDAR W C TI +RDQG+CGS WA+ A +DR+C+A+ + LS++++
Sbjct: 87 RIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 146
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
CC CG GC GG+ KAW+ + G+V+GG Y S +GC PY +P Y H++C
Sbjct: 147 TFCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAG 206
Query: 216 N-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+ C R C D+ +++D + R +Y L +I +++ +GP+E S +Y D
Sbjct: 207 KPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG--SIQKDVMTYGPIEASFDVYDD 264
Query: 275 MILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
YK+G+Y K LG HA+++IGWG+E V YWL+ NS+N +WG+NGLF
Sbjct: 265 FPSYKSGVYVKSENATYLGGHAVKLIGWGEE-------YGVPYWLMVNSWNADWGDNGLF 317
Query: 334 RI 335
+I
Sbjct: 318 KI 319
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 90/162 (55%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C G + + C R C D+ +++D + R +Y L
Sbjct: 185 GCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG- 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+I +++ +GP+E S +Y D YK+G+Y K LG HA+++IGWG+E
Sbjct: 244 -SIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWGEE------- 295
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N +WG+NGLF+I RG NECGI+ TAG+P
Sbjct: 296 YGVPYWLMVNSWNADWGDNGLFKIRRGTNECGIDNSTTAGVP 337
>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
Length = 334
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 93/241 (38%), Positives = 143/241 (59%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W C T+ ++RDQG+CG+ WA G A +DR+CIA+ G+ + LS+++L
Sbjct: 85 IPSNFDARKKWRKCSTVGKVRDQGNCGTCWAFGTSSAFADRLCIATNGEFNELLSAEELA 144
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDN 216
CC CG+GC GG+ KAW+ + G+V+GG Y S +GC+PY +P + +++C+
Sbjct: 145 FCCHKCGSGCHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPPCPFDEYGNNTCRGK 204
Query: 217 -EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C ++ +++D + R AY L N + I ++ +GP+E S +Y D
Sbjct: 205 PAEKNHRCTRMCYGNQNLDFKEDHRYTRDAYYL--NYQIIQNDLMTYGPIEASYDVYDDF 262
Query: 276 ILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y K LG HA+++IGWG+E V YWL+ NS+N WG+ GLF+
Sbjct: 263 PNYKSGVYMKTENASYLGGHAVKLIGWGEE-------YGVPYWLLVNSWNDQWGDQGLFK 315
Query: 335 I 335
I
Sbjct: 316 I 316
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 89/162 (54%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIPCERYMNGSRSSCQAN-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P + ++C+ C R C ++ +++D + R AY L N
Sbjct: 182 GCQPYRVPPCPFDEYGNNTCRGKPAEKNHRCTRMCYGNQNLDFKEDHRYTRDAYYL--NY 239
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ I ++ +GP+E S +Y D YK+G+Y K LG HA+++IGWG+E
Sbjct: 240 QIIQNDLMTYGPIEASYDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWGEE------- 292
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECGI+ T G+P
Sbjct: 293 YGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
Length = 334
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 94/241 (39%), Positives = 141/241 (58%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W C TI E+RDQG+CGS WA G A +DR+CIA+ G+ + LS ++L
Sbjct: 85 IPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ +AW+ + G+V+GG Y S +GC+PY++ PC G+++
Sbjct: 145 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGK 204
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C + C ++ +++D ++ R AY L TI ++ +GP+E S +Y D
Sbjct: 205 PAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYG--TIQNDVLAYGPIEASFEVYDDF 262
Query: 276 ILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y + LG HA+++IGWG+E V YWL+ NS+N WG+ GLF+
Sbjct: 263 PSYKSGVYTKMENATYLGGHAVKLIGWGEE-------YGVPYWLLVNSWNDQWGDQGLFK 315
Query: 335 I 335
I
Sbjct: 316 I 316
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 92/164 (56%), Gaps = 15/164 (9%)
Query: 336 GCRPYEIP---CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
GC+PY++P + Y N + S A + + C + C ++ +++D ++ R AY L
Sbjct: 182 GCQPYKVPPCPLDEYGNNTCSGKPAEKNH--RCTQMCYGNQNLDFKEDHHYTRDAYYLTY 239
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEG 451
TI ++ +GP+E S +Y D YK+G+Y + LG HA+++IGWG+E
Sbjct: 240 G--TIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEE----- 292
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECG + T G+P
Sbjct: 293 --YGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334
>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 104/280 (37%), Positives = 157/280 (56%), Gaps = 28/280 (10%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+ T+ + + +GV P P + ++ +LP+ FDAR W C TI I D
Sbjct: 65 FANYTIEQFKHILGVKPTP--PGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNILD 122
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CG+ WA AVE++ DR CI V LS +DL++CC CG+GC GG+ AW+Y
Sbjct: 123 QGHCGACWAFAAVESLQDRFCI--HLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRY 180
Query: 178 WVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ +G+V+ + C PY + C+ H C+ P TP+C RKC+ V +
Sbjct: 181 FRRSGVVT-------EECDPYFDQTGCQ------HPGCEPAYP-TPKCHRKCKVENQV-W 225
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
+ + +F AY + +N IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA
Sbjct: 226 KKNKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHA 285
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 286 VKLIGWGTSDAGE------DYWLLANQWNRGWGDDGYFKI 319
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 105/171 (61%), Gaps = 9/171 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + EP TP+C RKC+ V ++ + +F AY + +N IM E+
Sbjct: 191 CDPYFDQTGCQHPGCEPAYPTPKCHRKCKVENQV-WKKNKHFSVNAYRVHSNPHDIMAEV 249
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG GE YWL+
Sbjct: 250 YKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGE------DYWLL 303
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEINLGKMM 512
AN +N WG++G F+I+RG+NECGIE D+TAG+P +N++ G +
Sbjct: 304 ANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMPSTKNMDRNNDVAFGTAI 354
>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
Length = 340
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 122/337 (36%), Positives = 176/337 (52%), Gaps = 25/337 (7%)
Query: 2 GKSTADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSK 61
KS VA F L + S F L K+F +S + +S
Sbjct: 5 AKSALCLVAVFAVLLATTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSG 64
Query: 62 LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L E+ MGV S + P + + ++LPE FDA +WP C TI EIRDQ +
Sbjct: 65 KSLEEVRKLMGVTDMST--EAVPPRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSN 122
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA+ AVEA+SDR C G R+S+ +L+SCC CG GC GG AW +WV
Sbjct: 123 CGSCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWV 181
Query: 182 GIVSGGTYASKQGCRPYEI-PCERYMNG-SHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
GI + + C+PY PC + N + C + +TP+C C+ S D +
Sbjct: 182 GITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEK----SEMDLV 230
Query: 240 NF-GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
+ G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G+YKHV+G LG HA+++
Sbjct: 231 KYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKL 289
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GW GT V YW +ANS+NT+WG+ G F I
Sbjct: 290 VGW-------GTQGGVPYWKIANSWNTDWGDKGYFLI 319
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 99/172 (57%), Gaps = 15/172 (8%)
Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNF- 383
W G+ C+PY PC + N + C +TP+C C+ S D + +
Sbjct: 178 WVWVGITTEVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEK----SEMDLVKYK 233
Query: 384 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 443
G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G+YKHV+G LG HA++++GW
Sbjct: 234 GGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGW 292
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT V YW +ANS+NT+WG+ G F I RG NECGIE+ AG P
Sbjct: 293 -------GTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337
>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 116/307 (37%), Positives = 162/307 (52%), Gaps = 28/307 (9%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQL 90
L+K F VDH L + + +T SE + G + S LP R QL
Sbjct: 31 LTKTF--VDHINQLNGGMWRAVYNGKMQNITFSEAKRLTGARIQKSSALPPARF-TEEQL 87
Query: 91 SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+LPE FDA +WP+CPTI+EI DQ C + WA+ A+SDR C +GK+ +R+
Sbjct: 88 R---TKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGKQ-LRI 143
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERY-MNG 208
S+ L+SCCKDCG+GC+GGF G AW+Y+V GI S C+PY P CE G
Sbjct: 144 SAAHLLSCCKDCGDGCKGGFPGFAWRYYVEYGITS-------SSCQPYPFPRCEHQGAQG 196
Query: 209 SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
+ + C +TP+C C D S G Y L EE RE++ +GP
Sbjct: 197 NKTPCSKYNFDTPKCNATCT---DKSVPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAV 253
Query: 269 MTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
+Y D+ YK+G+Y++V G LG A++++GWG+ GT YW VANS++T+WG
Sbjct: 254 FYVYTDLFAYKSGVYRNVDGDFLGGTAVKVVGWGKL---NGTP----YWKVANSWDTDWG 306
Query: 329 ENGLFRI 335
+G I
Sbjct: 307 MDGYLLI 313
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 89/170 (52%), Gaps = 12/170 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERY-MNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E G+ C+PY P CE G+++ C +TP+C C D S G
Sbjct: 173 EYGITSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCT---DKSVPLIKYRGNA 229
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
Y L EE RE++ +GP +Y D+ YK+G+Y++V G LG A++++GWG+
Sbjct: 230 TYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRNVDGDFLGGTAVKVVGWGKL 289
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
GT YW VANS++T+WG +G I+RG NEC IE AG P+
Sbjct: 290 ---NGTP----YWKVANSWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 332
>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
Length = 345
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 122/336 (36%), Positives = 176/336 (52%), Gaps = 25/336 (7%)
Query: 3 KSTADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKL 62
KS VA F L + S F L K+F +S + +S
Sbjct: 11 KSALCLVAVFAVLLATTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSGK 70
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
+L E+ MGV S + P + + ++LPE FDA +WP C TI EIRDQ +C
Sbjct: 71 SLEEVRKLMGVTDMST--EAVPPRNFSVVEMQQDLPEFFDAAEHWPMCVTISEIRDQSNC 128
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA+ AVEA+SDR C G R+S+ +L+SCC CG GC GG AW +WV G
Sbjct: 129 GSCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWVG 187
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNG-SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
I + + C+PY PC + N + C + +TP+C C+ S D +
Sbjct: 188 ITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEK----SEMDLVK 236
Query: 241 F-GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
+ G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G+YKHV+G LG HA++++
Sbjct: 237 YKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLV 295
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GW GT V YW +ANS+NT+WG+ G F I
Sbjct: 296 GW-------GTQGGVPYWKIANSWNTDWGDKGYFLI 324
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 99/172 (57%), Gaps = 15/172 (8%)
Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNF- 383
W G+ C+PY PC + N + C +TP+C C+ S D + +
Sbjct: 183 WVWVGITTEVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEK----SEMDLVKYK 238
Query: 384 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 443
G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G+YKHV+G LG HA++++GW
Sbjct: 239 GGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGW 297
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT V YW +ANS+NT+WG+ G F I RG NECGIE+ AG P
Sbjct: 298 -------GTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 342
>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
Length = 195
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 89/175 (50%), Positives = 125/175 (71%), Gaps = 9/175 (5%)
Query: 162 CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNT 220
CG+GC GG+ +AW +W G+VSGG Y S GCRPY IP CE ++NGS C E +T
Sbjct: 2 CGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT-GEGDT 60
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
P+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+
Sbjct: 61 PKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAFSVYSDFLLYKS 120
Query: 281 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y+HV G +G HAIRI+GWG E GT YWLVANS+NT+WG+NG F+I
Sbjct: 121 GVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANSWNTDWGDNGFFKI 168
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 32 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 90
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 91 SEKGIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 147
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 148 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 187
>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 115/307 (37%), Positives = 161/307 (52%), Gaps = 28/307 (9%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQL 90
L+K F VDH L + + +T SE + G + S LP R QL
Sbjct: 31 LTKTF--VDHINQLNGGMWKAVYNGKMQNITFSEAKRLTGARIQKSSALPPARF-TEEQL 87
Query: 91 SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+LPE FDA +WP+CPTI+EI DQ C + WA+ A+SDR C +GK+ +R+
Sbjct: 88 R---TKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGKQ-LRI 143
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERY-MNG 208
S+ L+SCCKDCG+GC+GGF G AW+Y+V GI S C+PY P CE G
Sbjct: 144 SAAHLLSCCKDCGDGCKGGFPGFAWRYYVEYGITS-------SSCQPYPFPRCEHQGAQG 196
Query: 209 SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
+ + C +TP+C C D + G Y L EE RE++ +GP
Sbjct: 197 NKTPCSKYNFDTPKCNATCT---DKAIPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAV 253
Query: 269 MTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
+Y D+ YK+G+Y+HV G LG A++++GWG+ GT YW +ANS++T+WG
Sbjct: 254 FYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL---NGTP----YWKLANSWDTDWG 306
Query: 329 ENGLFRI 335
G I
Sbjct: 307 MGGYLLI 313
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 88/170 (51%), Gaps = 12/170 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERY-MNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E G+ C+PY P CE G+++ C +TP+C C D + G
Sbjct: 173 EYGITSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCT---DKAIPLIKYRGNA 229
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
Y L EE RE++ +GP +Y D+ YK+G+Y+HV G LG A++++GWG+
Sbjct: 230 TYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL 289
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
GT YW +ANS++T+WG G I+RG NEC IE AG P+
Sbjct: 290 ---NGTP----YWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPE 332
>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 115/307 (37%), Positives = 161/307 (52%), Gaps = 28/307 (9%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQL 90
L+K F VDH L + + +T SE + G + S LP R QL
Sbjct: 31 LTKTF--VDHINQLNGGMWRAVYNGKMQNITFSEAKRLTGARIQKSSALPPARF-TEEQL 87
Query: 91 SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+LPE FDA +WP+CPTI+EI DQ C + WA+ A+SDR C +GK+ +R+
Sbjct: 88 R---TKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGKQ-LRI 143
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERY-MNG 208
S+ L+SCCKDCG+GC+GGF G AW+Y+V GI S C+PY P CE G
Sbjct: 144 SAAHLLSCCKDCGDGCKGGFPGFAWRYYVEYGITS-------SSCQPYPFPRCEHQGAQG 196
Query: 209 SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
+ + C +TP+C C D + G Y L EE RE++ +GP
Sbjct: 197 NKTPCSKYNFDTPKCNATCT---DKAIPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAV 253
Query: 269 MTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
+Y D+ YK+G+Y+HV G LG A++++GWG+ GT YW +ANS++T+WG
Sbjct: 254 FYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL---NGTP----YWKLANSWDTDWG 306
Query: 329 ENGLFRI 335
G I
Sbjct: 307 MGGYLLI 313
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 88/170 (51%), Gaps = 12/170 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERY-MNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E G+ C+PY P CE G+++ C +TP+C C D + G
Sbjct: 173 EYGITSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCT---DKAIPLIKYRGNA 229
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
Y L EE RE++ +GP +Y D+ YK+G+Y+HV G LG A++++GWG+
Sbjct: 230 TYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL 289
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
GT YW +ANS++T+WG G I+RG NEC IE AG P+
Sbjct: 290 ---NGTP----YWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPE 332
>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 517
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 106/285 (37%), Positives = 150/285 (52%), Gaps = 41/285 (14%)
Query: 59 LSKLTLSELEMRMGVHP-------DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
S ++ E+ +RMG DSKL N L+ ++LP+ FD+R WP C
Sbjct: 242 FSGMSKEEILIRMGTKLMNSSTEFDSKLSNNNEALI-------KKLPKHFDSREKWPECE 294
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
I+ IRDQ +CGS WA+ A M+DR CIAS+G+ +S + +++C G
Sbjct: 295 WIRFIRDQSNCGSCWAVSAASVMTDRHCIASKGQETPYISDEQILAC----------GMI 344
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
+ YW GI +GG Y K C+PY I PC + S+++ +TP C CQ
Sbjct: 345 PSPFNYWKKMGIATGGPYGDKSCCQPYSIAPCSK---CSYTA------STPSCKYDCQAD 395
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
YD+ DD + Y + +N+ IM EI+ HGPV +Y D Y +GIY+
Sbjct: 396 YDIPISDDKFYASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVA 455
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G HAIRIIGWG+E + + YWL+ANS+NT +GE G FRI
Sbjct: 456 MGGHAIRIIGWGEE-------NGIPYWLIANSWNTTFGEKGFFRI 493
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 69/166 (41%), Positives = 94/166 (56%), Gaps = 17/166 (10%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
C+PY I PC S C + +TP C CQ YD+ DD + Y + +N+
Sbjct: 368 CQPYSIAPC--------SKC-SYTASTPSCKYDCQADYDIPISDDKFYASEHYHVSSNQY 418
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
IM EI+ HGPV +Y D Y +GIY+ +G HAIRIIGWG+E +
Sbjct: 419 EIMNEIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVAMGGHAIRIIGWGEE-------NG 471
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEI 501
+ YWL+ANS+NT +GE G FRI RG NEC IE+++ G+PK+ L +
Sbjct: 472 IPYWLIANSWNTTFGEKGFFRIRRGTNECRIESEVYTGIPKLRLTL 517
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 66/141 (46%), Gaps = 18/141 (12%)
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP----CERYMNGSHSSCQDNEPNT 220
GC+ G A+ YW +G+V+GG Y K C PY I C YM
Sbjct: 69 GCRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPCTMCRPYMLA------------ 116
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
P+C R CQ Y++S + D +G+ Y + +E IM+EI++ GPV +Y D + Y +
Sbjct: 117 PKCQRTCQASYNLSLKRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYIS 176
Query: 281 GIYKHVAGGPLGEHAIRIIGW 301
G + + G E + W
Sbjct: 177 G--QFICGNKRCEEEENLTSW 195
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 54/126 (42%), Gaps = 21/126 (16%)
Query: 327 WGENGLFRIG-------CRPYEIPCERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSY 377
W +GL G C PY I S C P P+C R CQ Y++S
Sbjct: 82 WQRSGLVTGGPYGEKACCLPYSI----------SPCTMCRPYMLAPKCQRTCQASYNLSL 131
Query: 378 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 437
+ D +G+ Y + +E IM+EI++ GPV +Y D + Y +G + + G E
Sbjct: 132 KRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYISG--QFICGNKRCEEE 189
Query: 438 IRIIGW 443
+ W
Sbjct: 190 ENLTSW 195
>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
Length = 340
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 109/276 (39%), Positives = 158/276 (57%), Gaps = 25/276 (9%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
+L E+ MGV S + P + + ++LPE FDA +WP C TI EIRDQ +C
Sbjct: 66 SLEEVRKLMGVTDMST--EAVPPRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNC 123
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA+ AVEA+SDR C G R+S+ +L+SCC CG GC GG AW +WV G
Sbjct: 124 GSCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWVG 182
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNG-SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
I + + C+PY PC + N + C + +TP+C C+ S D +
Sbjct: 183 ITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEK----SEMDLVK 231
Query: 241 F-GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
+ G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G+YKHV+G LG HA++++
Sbjct: 232 YKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLV 290
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GW GT V YW +ANS+NT+WG+ G F I
Sbjct: 291 GW-------GTQGGVPYWKIANSWNTDWGDKGYFLI 319
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 99/172 (57%), Gaps = 15/172 (8%)
Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNF- 383
W G+ C+PY PC + N + C +TP+C C+ S D + +
Sbjct: 178 WVWVGITTEVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEK----SEMDLVKYK 233
Query: 384 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 443
G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G+YKHV+G LG HA++++GW
Sbjct: 234 GGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGW 292
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT V YW +ANS+NT+WG+ G F I RG NECGIE+ AG P
Sbjct: 293 -------GTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337
>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
Length = 342
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 140/241 (58%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FDAR W C TI ++RDQG+CGS WAL A +DR+CIA+ + + LS+++L
Sbjct: 90 IPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELT 149
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG C GG+ KAW Y+ GIV+GG Y S +GC PY + PC +G+++
Sbjct: 150 FCCHLCGFACHGGYPIKAWSYFRRHGIVTGGDYQSGEGCAPYRVPPCFSEEDGNNTCRGQ 209
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C ++ Y+DD F R Y L +I +++ +GP+E SM +Y D
Sbjct: 210 PMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYL--TYASIQKDVMTYGPIEASMEVYDDF 267
Query: 276 ILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y+ LG HA+++IGWG+E V YWL+ NS++ WG+ GLF+
Sbjct: 268 PSYKSGVYEKSENATYLGGHAVKLIGWGEE-------DGVPYWLMVNSWSEMWGDKGLFK 320
Query: 335 I 335
I
Sbjct: 321 I 321
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 90/164 (54%), Gaps = 11/164 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C +G+ + C R C ++ Y+DD F R Y L
Sbjct: 187 GCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYL--TY 244
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
+I +++ +GP+E SM +Y D YK+G+Y+ LG HA+++IGWG+E
Sbjct: 245 ASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWGEE------- 297
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ NS++ WG+ GLF+I RG NEC ++ +TAG+P +
Sbjct: 298 DGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVDNSMTAGVPVV 341
>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 337
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/243 (41%), Positives = 141/243 (58%), Gaps = 16/243 (6%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+PE FDAR W YC TI +RDQG+CGS WA+ A +DR+C+A+ G + LS++++
Sbjct: 87 RIPEHFDARNKWVYCDTIGRVRDQGNCGSCWAVATSSAFADRLCVATTGDFNELLSAEEI 146
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
CC CG GC GG+ KAWK + T G+V+GG Y S +GC PY +P N S S
Sbjct: 147 TFCCHTCGFGCHGGYPIKAWKRFSTHGLVTGGDYNSGEGCEPYRVPPSNDGNSSSSD--- 203
Query: 216 NEPNTPE--CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
+P C R C + + DD + R Y L +I +++ +GP+E S +Y
Sbjct: 204 -QPLAINHICRRHCYGNQSIDFNDDHRYTRDYYYLTYG--SIQKDVLTYGPIEASFDVYD 260
Query: 274 DMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
D YK+G+Y K LG HA+++IGWG+E +GT YWL+ NS+NT WG+NG
Sbjct: 261 DFPSYKSGVYVKSDNASYLGGHAVKLIGWGEE---DGT----PYWLMVNSWNTQWGDNGF 313
Query: 333 FRI 335
F+I
Sbjct: 314 FKI 316
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/161 (39%), Positives = 89/161 (55%), Gaps = 12/161 (7%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
GC PY +P S SS Q N C R C + + DD + R Y L
Sbjct: 185 GCEPYRVP-PSNDGNSSSSDQPLAINHI-CRRHCYGNQSIDFNDDHRYTRDYYYLTYG-- 240
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+I +++ +GP+E S +Y D YK+G+Y K LG HA+++IGWG+E +GT
Sbjct: 241 SIQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNASYLGGHAVKLIGWGEE---DGT-- 295
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWL+ NS+NT WG+NG F+I RG NECG++ TAG+P
Sbjct: 296 --PYWLMVNSWNTQWGDNGFFKIRRGTNECGVDNSTTAGVP 334
>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 340
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 94/242 (38%), Positives = 141/242 (58%), Gaps = 11/242 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
E+P FDAR +W C TI++I D+ C + WA+ V+++SDR+CI S G+ V+LS+ D
Sbjct: 88 EIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRISVQLSARDA 147
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+SC GC G + YW+T GIV+GG+Y + GC+PY +P C + C
Sbjct: 148 ISC--GFSPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFLDCN 205
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+N P+C +CQ GY+ +Y+DD +G Y++ +E I +EI +GPV S+++ D
Sbjct: 206 NNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIASISVNTD 265
Query: 275 MILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
++YK+G+Y LG +RIIGWG E + YWL ANS+N WG+NG
Sbjct: 266 FLVYKSGVYLPTPRSRNLGWITLRIIGWGYE-------GKIPYWLCANSWNEEWGDNGYV 318
Query: 334 RI 335
+I
Sbjct: 319 KI 320
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 91/164 (55%), Gaps = 9/164 (5%)
Query: 336 GCRPYEIPCERYMNGSRS-SCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P Y SR C N P+C +CQ GY+ +Y+DD +G Y++ +
Sbjct: 184 GCQPYPLPKCSYHPESRFLDCNNNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQ 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
E I +EI +GPV S+++ D ++YK+G+Y LG +RIIGWG E
Sbjct: 244 EDIQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYE------- 296
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL ANS+N WG+NG +I RG IE+ + A +PK+
Sbjct: 297 GKIPYWLCANSWNEEWGDNGYVKIQRGVQAGYIESYVRAPIPKM 340
>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
Length = 557
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 109/284 (38%), Positives = 157/284 (55%), Gaps = 33/284 (11%)
Query: 79 LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTI-QEIRDQGSCGSGWALGAVEAMSDR 137
+P R V S E++P FDAR +P C +I +RDQ CGS WA + EA +DR
Sbjct: 262 VPGRRRLTPVAQSSSDEDIPANFDAREAFPECASIIGRVRDQSDCGSCWAFASTEAFNDR 321
Query: 138 VCIASRGKRH-------------VRLSSDDLVSCCK--DCG--NGCQGGFHGKAWKYWVT 180
CIA GK + LS++D +CC CG GC GG G AWK++
Sbjct: 322 RCIAGIGKEDAAGAEGEATADQLLVLSAEDTTACCHGFHCGLSMGCNGGQPGSAWKWFTK 381
Query: 181 TGIVSGGTYA---SKQGCRPYE-IPCERYMN---GSHSSCQDNEPNTPECIRKCQPG--Y 231
TG+V+GG YA + C+PYE +PC +++ + +C D E TPEC+ +C
Sbjct: 382 TGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEYPTPECLSECSETNFS 441
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 291
SY +D R AYSL A E I R++ ++G V + ++++D + Y G+Y H +G +
Sbjct: 442 GGSYGEDKKMAREAYSL-AGIENIQRDMMKYGSVTAAFSVFSDFLTYSGGVYTHESGSFM 500
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA+++IGWG + + S YWL+ANS+N +WGE GLFRI
Sbjct: 501 GGHAVKMIGWGTDEV-----SGEDYWLIANSWNPSWGEGGLFRI 539
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 68/163 (41%), Positives = 99/163 (60%), Gaps = 12/163 (7%)
Query: 337 CRPYE-IPCERYMNGSRS---SCQANEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSL 390
C+PYE +PC +++ S +C E TPEC+ +C SY +D R AYSL
Sbjct: 399 CKPYEFMPCAHHVDPGASGYPACPDGEYPTPECLSECSETNFSGGSYGEDKKMAREAYSL 458
Query: 391 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 450
A E I R++ ++G V + ++++D + Y G+Y H +G +G HA+++IGWG + +
Sbjct: 459 -AGIENIQRDMMKYGSVTAAFSVFSDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEV-- 515
Query: 451 GTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
S YWL+ANS+N +WGE GLFRI+RG NECGIE I AG
Sbjct: 516 ---SGEDYWLIANSWNPSWGEGGLFRILRGVNECGIEGQIVAG 555
>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
Length = 334
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 139/241 (57%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W C TI E+RDQG CGS WA G A +DR+CIA+ G+ + LS ++L
Sbjct: 85 IPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG KAW+ + G+V+GG Y S +GC+PY++ PC G+++
Sbjct: 145 FCCHKCGFGCSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGK 204
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C ++ +++D ++ R AY L TI ++ +GP+E S +Y D
Sbjct: 205 PAEKNHRCTRMCYGNQNLDFKEDHHYTRDAYYLTYG--TIQYDVLAYGPIEASFEVYDDF 262
Query: 276 ILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y + LG HA+++IGWG+E V YWL+ NS+N WG+ GLF+
Sbjct: 263 PSYKSGVYTKMENATYLGGHAVKLIGWGEE-------YGVPYWLLVNSWNDQWGDQGLFK 315
Query: 335 I 335
I
Sbjct: 316 I 316
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 93/164 (56%), Gaps = 15/164 (9%)
Query: 336 GCRPYEIP---CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
GC+PY++P + Y N + S A + + C R C ++ +++D ++ R AY L
Sbjct: 182 GCQPYKVPPCPLDEYGNNTCSGKPAEKNH--RCTRMCYGNQNLDFKEDHHYTRDAYYLTY 239
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEG 451
TI ++ +GP+E S +Y D YK+G+Y + LG HA+++IGWG+E
Sbjct: 240 G--TIQYDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEE----- 292
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N WG+ GLF+I RG NECGI+ T G+P
Sbjct: 293 --YGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
Extends Along The Whole Active Site Cleft
Length = 205
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 87/163 (53%), Positives = 123/163 (75%), Gaps = 9/163 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D +FG +YS+
Sbjct: 49 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVAN 107
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV+G +G HAIRI+GWG E GT
Sbjct: 108 NEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVE---NGT 164
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+NT+WG+NG F+I+RGQ+ CGIE++I AG+P
Sbjct: 165 ----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 203
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 86/167 (51%), Positives = 118/167 (70%), Gaps = 9/167 (5%)
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQ 228
F AW +W G+VSGG Y S GCRPY IP CE ++NGS C E +TP+C + C+
Sbjct: 27 FPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT-GEGDTPKCNKTCE 85
Query: 229 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG 288
PGY SY++D +FG +YS+ NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV+G
Sbjct: 86 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSG 145
Query: 289 GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G HAIRI+GWG E GT YWLV NS+NT+WG+NG F+I
Sbjct: 146 EIMGGHAIRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFKI 185
>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 110/279 (39%), Positives = 146/279 (52%), Gaps = 27/279 (9%)
Query: 59 LSKLTLSELEM--RMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ T+SE + R P S LP+ + L LPE FDA WP CPTI EI
Sbjct: 54 MQNTTVSEAKRLNRATRKPVSVLPRVNF----TEEELLAPLPETFDAAEKWPNCPTITEI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ SCGS WA+ A +M+DR C G R +R+S+ DL++CC DCG GC GG AW
Sbjct: 110 SDQSSCGSCWAVAAATSMTDRYCTI-HGVRGLRISAADLLACCGDCGYGCLGGDPDMAWA 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERYMNG-SHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+ + GI SG C+PY P C Y N ++ C TP C C D +
Sbjct: 169 YFSSEGIASG-------RCQPYPFPRCSHYTNSTTYPQCSALHLWTPTCNPACT---DST 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +YSL + EE RE++ GP + +++D+ YK G+YKHV G +G H
Sbjct: 219 ISKKKYRGLKSYSL-SGEEDFRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAH 277
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
A+RI+GWG + S V YW +ANS+N WG+ G F
Sbjct: 278 AVRIVGWGNQ-------SGVPYWKIANSWNAEWGDRGYF 309
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 65/163 (39%), Positives = 89/163 (54%), Gaps = 13/163 (7%)
Query: 337 CRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C+PY P C Y N + C A TP C C D + G +YSL + E
Sbjct: 180 CQPYPFPRCSHYTNSTTYPQCSALHLWTPTCNPACT---DSTISKKKYRGLKSYSL-SGE 235
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E RE++ GP + +++D+ YK G+YKHV G +G HA+RI+GWG + S
Sbjct: 236 EDFRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWGNQ-------S 288
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YW +ANS+N WG+ G F ++RG NECGIE +AG+P I
Sbjct: 289 GVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVPAI 331
>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
Length = 333
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 110/252 (43%), Positives = 140/252 (55%), Gaps = 29/252 (11%)
Query: 89 QLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHV 148
+L PL++ FDA WP CPTI EIRDQ SCGS WA+ A AMSDR C G R +
Sbjct: 87 ELRVPLQDR---FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLG-GVRDL 142
Query: 149 RLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMN 207
R+S+ DL+SCC CG GC GG+ AW+Y+ GIVS + C+PY P C ++N
Sbjct: 143 RISAGDLMSCCDVCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVN 195
Query: 208 GSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEG 267
S S E +TP C C D G +Y L + EE+ RE+ +GP E
Sbjct: 196 SSDLSPCSGEYDTPTCNSTCT---DKKIPLIKYRGNTSYIL-SGEESFKRELLLNGPFEV 251
Query: 268 SMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ---EPLGEGTSSVVKYWLVANSFN 324
S ++YAD + Y G+YKHV G LG HA+RI+GWG+ EP YW +ANS+N
Sbjct: 252 SFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWGELNGEP----------YWKIANSWN 301
Query: 325 TNWGENGLFRIG 336
WG NG F I
Sbjct: 302 HEWGMNGYFLIA 313
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 70/172 (40%), Positives = 94/172 (54%), Gaps = 18/172 (10%)
Query: 330 NGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+G+ C+PY P C ++N S S + E +TP C C D G +Y
Sbjct: 175 HGIVSEYCQPYPFPSCAHHVNSSDLSPCSGEYDTPTCNSTCT---DKKIPLIKYRGNTSY 231
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ--- 445
L + EE+ RE+ +GP E S ++YAD + Y G+YKHV G LG HA+RI+GWG+
Sbjct: 232 IL-SGEESFKRELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWGELNG 290
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
EP YW +ANS+N WG NG F I RG +ECGIE AG+P+I
Sbjct: 291 EP----------YWKIANSWNHEWGMNGYFLIARGVDECGIEGSGVAGIPRI 332
>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 140/241 (58%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FDAR W C TI ++RDQG+CGS WAL A +DR+CIA+ + + LS+++L
Sbjct: 90 IPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELT 149
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG C GG+ KAW Y+ GIV+GG Y S +GC PY + PC +G+++
Sbjct: 150 FCCHLCGFACHGGYPIKAWSYFRRHGIVTGGGYQSGEGCAPYRVPPCFSEEDGNNTCRGQ 209
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C ++ Y+DD F R Y L +I +++ +GP+E SM +Y D
Sbjct: 210 PMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYL--TYASIQKDVMTYGPIEASMEVYDDF 267
Query: 276 ILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y+ LG HA+++IGWG+E V YWL+ NS++ WG+ GLF+
Sbjct: 268 PSYKSGVYEKSENATYLGGHAVKLIGWGEE-------DGVPYWLMVNSWSEMWGDKGLFK 320
Query: 335 I 335
I
Sbjct: 321 I 321
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 90/164 (54%), Gaps = 11/164 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C +G+ + C R C ++ Y+DD F R Y L
Sbjct: 187 GCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYL--TY 244
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
+I +++ +GP+E SM +Y D YK+G+Y+ LG HA+++IGWG+E
Sbjct: 245 ASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWGEE------- 297
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWL+ NS++ WG+ GLF+I RG NEC ++ +TAG+P +
Sbjct: 298 DGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVDNSMTAGVPVV 341
>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 345
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 98/243 (40%), Positives = 143/243 (58%), Gaps = 27/243 (11%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LP FDAR +W +C TI +I DQG CGS WA GAVE+++DR CI V LS +DL
Sbjct: 100 DLPTEFDARKHWSHCSTIGDILDQGHCGSCWAFGAVESLTDRFCI--HLNESVSLSENDL 157
Query: 156 VSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGT--YASKQGCRPYEIPCERYMNGSHSS 212
++CC +CG+GC+GG+ +AW+Y+ TG+V+ Y ++GC H
Sbjct: 158 LACCGFECGDGCEGGYPIRAWQYFKRTGVVTSKCDPYFDQKGC-------------GHPG 204
Query: 213 CQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
C +TP+C ++C D + + G AY + E +M E+F +GP+E + ++
Sbjct: 205 CYPTY-DTPKCFKRCVD--DELWVSSKHLGVSAYEVSMEPEELMAELFTNGPIEVAFDVF 261
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
D YKTG+YKH+ GG +G HA++++GWG T V YW + NS+NTNWGE+G
Sbjct: 262 EDFAHYKTGVYKHLYGGYIGGHAVKLVGWGT------TDDGVDYWSMVNSWNTNWGEDGT 315
Query: 333 FRI 335
FRI
Sbjct: 316 FRI 318
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/135 (42%), Positives = 87/135 (64%), Gaps = 8/135 (5%)
Query: 361 NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILY 420
+TP+C ++C D + + G AY + E +M E+F +GP+E + ++ D Y
Sbjct: 210 DTPKCFKRCVD--DELWVSSKHLGVSAYEVSMEPEELMAELFTNGPIEVAFDVFEDFAHY 267
Query: 421 KTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG 480
KTG+YKH+ GG +G HA++++GWG T V YW + NS+NTNWGE+G FRI+RG
Sbjct: 268 KTGVYKHLYGGYIGGHAVKLVGWGT------TDDGVDYWSMVNSWNTNWGEDGTFRILRG 321
Query: 481 QNECGIEADITAGLP 495
++ECGIE++ AGLP
Sbjct: 322 KDECGIESNAVAGLP 336
>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 335
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 95/243 (39%), Positives = 146/243 (60%), Gaps = 13/243 (5%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
E+PE FD+R+ W YC TI +R+QG+CGS WA G A +DR+C+A+ G+ + +S+++L
Sbjct: 83 EVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEEL 142
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC-- 213
CC CG GC GG+ KAW+Y+ G+V+GG Y + GC+PY +P + H+SC
Sbjct: 143 TFCCHRCGFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSG 202
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
Q E N +C +KC + Y+ + + AY L T+ ++ +GP+E S +Y
Sbjct: 203 QPTERN-HKCSKKCYGDDTIDYKKNHYKTKDAYYL--KNTTMQKDTMVYGPIEASFDVYD 259
Query: 274 DMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
D + Y++G+Y+ LG HA+++IGWG E EGT YWL+ NS+ WG+ G+
Sbjct: 260 DFMNYESGVYQRTGNASYLGGHAVKMIGWGVE---EGTP----YWLMVNSWGEQWGDKGM 312
Query: 333 FRI 335
F+I
Sbjct: 313 FKI 315
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 58/164 (35%), Positives = 91/164 (55%), Gaps = 11/164 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C + G S +C +KC + Y+ + + AY L
Sbjct: 181 GCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYL--KN 238
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
T+ ++ +GP+E S +Y D + Y++G+Y+ LG HA+++IGWG E EGT
Sbjct: 239 TTMQKDTMVYGPIEASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVE---EGTP 295
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ NS+ WG+ G+F+I+RG +ECGIE+ TAG+P +
Sbjct: 296 ----YWLMVNSWGEQWGDKGMFKILRGTDECGIESSCTAGVPSV 335
>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
Length = 340
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 110/276 (39%), Positives = 156/276 (56%), Gaps = 25/276 (9%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
+L E+ MGV S + P + + ++LPE FDA +WP C TI EIRDQ +C
Sbjct: 66 SLGEVRKLMGVTDMST--EAVPPRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNC 123
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA+ AVEA+SDR C G R+S+ +L+SCC CG GC GG AW +WV G
Sbjct: 124 GSCWAIAAVEAISDRYCTFG-GVPDRRMSTSNLLSCCFICGLGCHGGIPTVAWLWWVWVG 182
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNG-SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
I + + C+PY PC + N + C +TP+C C+ S D +
Sbjct: 183 IAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKCNTTCER----SEMDLVK 231
Query: 241 F-GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
+ G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G+YKHV G LG HA++++
Sbjct: 232 YKGSTSYSVKGEKE-LMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGGHAVKLV 290
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GW GT V YW VANS+NT+WG+ G F I
Sbjct: 291 GW-------GTQDGVPYWKVANSWNTDWGDKGYFLI 319
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 99/172 (57%), Gaps = 15/172 (8%)
Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNF- 383
W G+ C+PY PC + N + C + +TP+C C+ S D + +
Sbjct: 178 WVWVGIATEDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKCNTTCER----SEMDLVKYK 233
Query: 384 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 443
G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G+YKHV G LG HA++++GW
Sbjct: 234 GSTSYSVKGEKE-LMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGGHAVKLVGW 292
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT V YW VANS+NT+WG+ G F I RG NEC IE+ AG+P
Sbjct: 293 -------GTQDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIP 337
>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 403
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 106/277 (38%), Positives = 154/277 (55%), Gaps = 28/277 (10%)
Query: 62 LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+ ++ + +GV P N +P+ + LP+ FDAR W C TI I DQG
Sbjct: 114 VQTAQFKHILGVKPTPHSVLNDVPVKTYPRSLM--LPKEFDARSAWSQCNTIGTILDQGH 171
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVT 180
CGS WA GAVE + DR CI ++ LS +DLV+CC CG+GC GG+ AW+Y+V
Sbjct: 172 CGSCWAFGAVECLQDRFCI--HFNMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVR 229
Query: 181 TGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
G+V+ C PY ++ C+ H C+ P TP C +KC+ V E
Sbjct: 230 NGVVT-------DECDPYFDQVGCK------HPGCEPAYP-TPVCEKKCKVQNQVWLEKK 275
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
+F AY + ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+++
Sbjct: 276 -HFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKL 334
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IGWG GE YWL+AN +N WG++G F+I
Sbjct: 335 IGWGTTDAGE------DYWLLANQWNRGWGDDGYFKI 365
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 68/169 (40%), Positives = 100/169 (59%), Gaps = 16/169 (9%)
Query: 329 ENGLFRIGCRPY--EIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
NG+ C PY ++ C+ C+ P TP C +KC+ V E +F
Sbjct: 229 RNGVVTDECDPYFDQVGCKH------PGCEPAYP-TPVCEKKCKVQNQVWLEKK-HFSVN 280
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
AY + ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG
Sbjct: 281 AYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTT 340
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GE YWL+AN +N WG++G F+I+RG NECGIE D+ AG+P
Sbjct: 341 DAGE------DYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMP 383
>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 335
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 100/244 (40%), Positives = 148/244 (60%), Gaps = 15/244 (6%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
E+PE FD+R+ W C TI E+R+QG+CGS WA G A +DR+CIA+ G+ + +S+++L
Sbjct: 83 EVPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATDGEFNELISAEEL 142
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC- 213
CC CG GC GG KAWKY+ G+V+GG Y + GC+PY + PC R G H+SC
Sbjct: 143 TFCCHTCGFGCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRVPPCVRDDEG-HNSCS 201
Query: 214 -QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
Q E N +C +KC ++Y+ + + AY L + T+ ++ +GP+E S +Y
Sbjct: 202 GQPTERN-HKCSKKCYGDETINYKKNHYKTKDAYYL--SNTTMQKDTMVYGPIEASFDVY 258
Query: 273 ADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
D Y++G+Y+ LG HA+++IGWG E EGT YWL+ NS+ WG+ G
Sbjct: 259 DDFTSYESGVYQKTENASYLGGHAVKMIGWGVE---EGTP----YWLMVNSWGEQWGDKG 311
Query: 332 LFRI 335
+F+I
Sbjct: 312 MFKI 315
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/164 (35%), Positives = 92/164 (56%), Gaps = 11/164 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C R G S +C +KC ++Y+ + + AY L +
Sbjct: 181 GCQPYRVPPCVRDDEGHNSCSGQPTERNHKCSKKCYGDETINYKKNHYKTKDAYYL--SN 238
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
T+ ++ +GP+E S +Y D Y++G+Y+ LG HA+++IGWG E EGT
Sbjct: 239 TTMQKDTMVYGPIEASFDVYDDFTSYESGVYQKTENASYLGGHAVKMIGWGVE---EGTP 295
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ NS+ WG+ G+F+I+RG +ECG+E+ TAG+P +
Sbjct: 296 ----YWLMVNSWGEQWGDKGMFKILRGTDECGVESSCTAGVPSV 335
>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
Length = 369
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 108/289 (37%), Positives = 152/289 (52%), Gaps = 42/289 (14%)
Query: 64 LSELEMRMG---VHPDSKL-PQNRLPLLVQLSDPLEELPEGFDARINWPYCP-TIQEIRD 118
++E R+G +HPD P+ + P Q +PE FDAR WP C I IR+
Sbjct: 40 FDDIESRLGFLGIHPDPNFKPEIKEPQATQ-----NVIPETFDAREYWPECADIIGNIRN 94
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
QG C S WA A E MSDR+CIA+ GK ++LS +DL+ CC CGN C+GG+ AW Y+
Sbjct: 95 QGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHYCGNQCKGGYTYYAWNYF 154
Query: 179 VTTGIVSGGTYASKQGCRPY-EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
+ TG+VSGG Y + GC+PY E+ R +++CQ+++ Y + Y
Sbjct: 155 MLTGLVSGGDYNTSTGCQPYSELNYYRITPPCNTTCQNDK-------------YPIPYVS 201
Query: 238 DLNFGRIAYSLPANEETIMREIFR-HGPVEGSMTIYADMILYKT---------GIYKHVA 287
D +FG Y +P NE I EI GPV + +Y D +Y+ G+Y + +
Sbjct: 202 DKHFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGEQHDTILEGVYIYTS 261
Query: 288 GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE-NGLFRI 335
G G A++IIGWG E + YWL ANS+ +WG G F+I
Sbjct: 262 GALFGRTAVKIIGWGTE-------NGWAYWLAANSWGKDWGALGGFFKI 303
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 73/149 (48%), Gaps = 19/149 (12%)
Query: 362 TPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHG-PVEGSMTIYADMIL 419
TP C CQ Y + Y D +FG Y +P NE I EI G PV + +Y D +
Sbjct: 183 TPPCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKI 242
Query: 420 YKTG---------IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 470
Y+ G +Y + +G G A++IIGWG E + YWL ANS+ +WG
Sbjct: 243 YRDGEQHDTILEGVYIYTSGALFGRTAVKIIGWGTE-------NGWAYWLAANSWGKDWG 295
Query: 471 E-NGLFRIVRGQNECGIEADITAGLPKIG 498
G F+I RG NECG E I AG + G
Sbjct: 296 ALGGFFKIRRGTNECGFEESIIAGQVREG 324
>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
Length = 340
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 111/277 (40%), Positives = 155/277 (55%), Gaps = 27/277 (9%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
+L E+ MGV S + P + + ++LPE FDA +WP C TI EIRDQ +C
Sbjct: 66 SLGEVRKLMGVTDMST--EAVPPRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNC 123
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA+ AVEA+SDR C G R+S+ +L+SCC CG GC GG AW +WV G
Sbjct: 124 GSCWAIAAVEAISDRYCTFG-GVPDRRMSTSNLLSCCFICGLGCHGGIPTVAWLWWVWVG 182
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNG-SHSSCQDNEPNTPECIRKCQPGYDVSYEDDL- 239
I + + C+PY PC + N + C +TP+C C+ E DL
Sbjct: 183 IAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKCNTTCERN-----EMDLV 230
Query: 240 -NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G+YKHV G LG HA+++
Sbjct: 231 KYKGSTSYSVKGEKE-LMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGDFLGGHAVKL 289
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GW GT V YW VANS+NT+WG+ G F I
Sbjct: 290 VGW-------GTQDGVPYWKVANSWNTDWGDKGYFLI 319
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/173 (39%), Positives = 98/173 (56%), Gaps = 17/173 (9%)
Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDL--N 382
W G+ C+PY PC + N + C + +TP+C C+ E DL
Sbjct: 178 WVWVGIATEDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKCNTTCERN-----EMDLVKY 232
Query: 383 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 442
G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G+YKHV G LG HA++++G
Sbjct: 233 KGSTSYSVKGEKE-LMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGDFLGGHAVKLVG 291
Query: 443 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
W GT V YW VANS+NT+WG+ G F I RG NEC IE+ AG+P
Sbjct: 292 W-------GTQDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIP 337
>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 337
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 107/276 (38%), Positives = 153/276 (55%), Gaps = 15/276 (5%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
L+ + MGV P +KL + + L +S LE LPE +D W C ++ IRDQ +CG
Sbjct: 53 LTNVSHLMGVVPWNKLSEKDILLTYDVSIDLESLPESYDITQTWSECKSVVSIRDQSNCG 112
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH-GKAWKYWVTTG 182
S WAL A SDR+CI S + LS + + SCC G H KAWKY G
Sbjct: 113 SCWALSTASAFSDRLCITSNMGVNKVLSGEYINSCCNGKCGNGCNGGHPEKAWKYIKKNG 172
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIR-KC-QPGYDVSYEDDL 239
+ +GG Y S +GC+PY I PC R N SC +TP+C + +C Y+ DL
Sbjct: 173 LCTGGEYGSNEGCQPYSIVPCPRNAN----SCSKENEDTPQCYKDQCTNNNYETPLVSDL 228
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
+ YS+ E IM E+F++GPV +M +Y D + YK GIY++ GG G+HA++I+
Sbjct: 229 YYAYKVYSVKPKPEIIMSEVFKNGPVVAAMKVYDDFLCYKGGIYQYTTGGLKGDHAVKIM 288
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GWG++ +G + YWL AN++ +WG G+F+I
Sbjct: 289 GWGED---DG----IDYWLCANTWGNSWGMGGMFKI 317
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/165 (42%), Positives = 101/165 (61%), Gaps = 14/165 (8%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIR-KC-QPGYDVSYEDDLNFGRIAYSLPA 392
GC+PY I PC R N SC +TP+C + +C Y+ DL + YS+
Sbjct: 184 GCQPYSIVPCPRNAN----SCSKENEDTPQCYKDQCTNNNYETPLVSDLYYAYKVYSVKP 239
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
E IM E+F++GPV +M +Y D + YK GIY++ GG G+HA++I+GWG++ +G
Sbjct: 240 KPEIIMSEVFKNGPVVAAMKVYDDFLCYKGGIYQYTTGGLKGDHAVKIMGWGED---DG- 295
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL AN++ +WG G+F+I RG+NECGIE IT GLPK+
Sbjct: 296 ---IDYWLCANTWGNSWGMGGMFKIRRGRNECGIENRITGGLPKV 337
>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
Length = 325
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 159/311 (51%), Gaps = 34/311 (10%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGV----HPDSKLPQNRLPLLV 88
LSKAF VD L + + + +TL E + GV + S LP+ R
Sbjct: 10 LSKAF--VDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRF---- 63
Query: 89 QLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHV 148
+ LP FD+ WP CPTI +I DQ +CGS WA+ A AMSDR C G + V
Sbjct: 64 TEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG-GVQDV 122
Query: 149 RLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMN 207
+S+ DL++CC DCG+GC GG +AW Y+ +TG+VS C+PY P C +
Sbjct: 123 HISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSK 175
Query: 208 GS--HSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP 264
+ C +TP+C C P V +N+ E+ MRE+F GP
Sbjct: 176 SKNGYPPCSQFNFDTPKCDYTCDDPTIPV-----VNYRSWTSYALQGEDDYMRELFFRGP 230
Query: 265 VEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
E + +Y D I Y +G+Y HV+G LG HA+R++GW GTS+ V YW +ANS+N
Sbjct: 231 FEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGW-------GTSNGVPYWKIANSWN 283
Query: 325 TNWGENGLFRI 335
T WG +G F I
Sbjct: 284 TEWGMDGYFLI 294
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 91/169 (53%), Gaps = 16/169 (9%)
Query: 331 GLFRIGCRPYEIPCERYMNGSRSS---CQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRI 386
GL C+PY P + + S++ C +TP+C C P V +N+
Sbjct: 156 GLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPV-----VNYRSW 210
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
E+ MRE+F GP E + +Y D I Y +G+Y HV+G LG HA+R++GW
Sbjct: 211 TSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGW--- 267
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GTS+ V YW +ANS+NT WG +G F I RG +ECGIE +AG+P
Sbjct: 268 ----GTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 312
>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
Length = 279
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 94/242 (38%), Positives = 140/242 (57%), Gaps = 11/242 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
E+P FDAR +W C TI++I D+ C + WA+ V+++SDR+CI S G+ V+LS+ D
Sbjct: 27 EIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRISVQLSARDA 86
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+SC GC G + YW+T GIV+GG+Y + GC+PY +P C + C
Sbjct: 87 ISC--GFSPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFLDCN 144
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+N P+C +CQ GY+ +Y+DD +G Y++ +E I +EI +GPV S+++ D
Sbjct: 145 NNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIASISVNTD 204
Query: 275 MILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
++YK+G+Y LG +RIIGWG E + YWL ANS+N WG NG
Sbjct: 205 FLVYKSGVYLPTPRSRNLGWITLRIIGWGYE-------GKIPYWLCANSWNEEWGANGYV 257
Query: 334 RI 335
+I
Sbjct: 258 KI 259
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 90/164 (54%), Gaps = 9/164 (5%)
Query: 336 GCRPYEIPCERYMNGSRS-SCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P Y SR C N P+C +CQ GY+ +Y+DD +G Y++ +
Sbjct: 123 GCQPYPLPKCSYHPESRFLDCNNNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQ 182
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
E I +EI +GPV S+++ D ++YK+G+Y LG +RIIGWG E
Sbjct: 183 EDIQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYE------- 235
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YWL ANS+N WG NG +I RG IE+ + A +PK+
Sbjct: 236 GKIPYWLCANSWNEEWGANGYVKIQRGVQAGYIESYVRAPIPKM 279
>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 109/279 (39%), Positives = 145/279 (51%), Gaps = 27/279 (9%)
Query: 59 LSKLTLSELEM--RMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ T+SE + R P S LP+ + L LPE FDA WP CPTI EI
Sbjct: 54 MQNTTVSEAKRLNRATRKPVSVLPRVNF----TEEELLAPLPETFDAAEKWPNCPTITEI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ SCGS WA+ A +M+DR C G R +R+S+ DL++CC DCG GC GG AW
Sbjct: 110 SDQSSCGSCWAVAAATSMTDRYCTI-HGVRGLRISAADLLACCGDCGYGCLGGDPDMAWA 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERYMNG-SHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+ + GI SG C+PY P C Y N ++ C TP C C D +
Sbjct: 169 YFSSEGIASG-------RCQPYPFPRCSHYTNSTTYPQCSALHLWTPTCNPACT---DST 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +YS + EE RE++ GP + +++D+ YK G+YKHV G +G H
Sbjct: 219 ISKKKYRGLKSYSF-SGEEDFRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAH 277
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
A+RI+GWG + S V YW +ANS+N WG+ G F
Sbjct: 278 AVRIVGWGNQ-------SGVPYWKIANSWNAEWGDRGYF 309
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 64/163 (39%), Positives = 88/163 (53%), Gaps = 13/163 (7%)
Query: 337 CRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C+PY P C Y N + C A TP C C D + G +YS + E
Sbjct: 180 CQPYPFPRCSHYTNSTTYPQCSALHLWTPTCNPACT---DSTISKKKYRGLKSYSF-SGE 235
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E RE++ GP + +++D+ YK G+YKHV G +G HA+RI+GWG + S
Sbjct: 236 EDFRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWGNQ-------S 288
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YW +ANS+N WG+ G F ++RG NECGIE +AG+P I
Sbjct: 289 GVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVPAI 331
>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 340
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 107/272 (39%), Positives = 148/272 (54%), Gaps = 23/272 (8%)
Query: 66 ELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSG 125
EL MGV S + P + + +ELP FD+ WP C TI EIRDQ +CGS
Sbjct: 69 ELRKLMGVLNMSTAALS--PRIFSAEELAQELPTSFDSSDKWPKCRTISEIRDQSNCGSC 126
Query: 126 WALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVS 185
WA+ AVEAMSDR C + G +R+S+ L+SCC CG GCQGG AW +WV G+ S
Sbjct: 127 WAIAAVEAMSDRYCTVA-GITDLRVSTGHLLSCCFVCGMGCQGGIPTMAWLWWVWVGLTS 185
Query: 186 GGTYASKQGCRPYEI-PCERYMN-GSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGR 243
+ C+PY PC + + G + +C +TP C C + + G
Sbjct: 186 -------EVCQPYPFPPCGHHTDGGKYPACPSTIYDTPTCNSTCADSHTALTKHK---GE 235
Query: 244 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 303
+YSL E M E+ +GP E + +YAD + YK+G+Y H G LG HA++++GWG
Sbjct: 236 KSYSLRGERE-YMIELMTYGPFEVAFDVYADFVSYKSGVYSHTTGERLGGHAVKLVGWGV 294
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ GT YW +ANS+N++WG+NG F I
Sbjct: 295 Q---NGT----PYWKIANSWNSDWGDNGYFLI 319
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 65/173 (37%), Positives = 95/173 (54%), Gaps = 13/173 (7%)
Query: 327 WGENGLFRIGCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFG 384
W GL C+PY P C + +G + +C + +TP C C + + G
Sbjct: 178 WVWVGLTSEVCQPYPFPPCGHHTDGGKYPACPSTIYDTPTCNSTCADSHTALTKHK---G 234
Query: 385 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 444
+YSL E M E+ +GP E + +YAD + YK+G+Y H G LG HA++++GWG
Sbjct: 235 EKSYSLRGERE-YMIELMTYGPFEVAFDVYADFVSYKSGVYSHTTGERLGGHAVKLVGWG 293
Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GT YW +ANS+N++WG+NG F I RG +ECGIE+ AGLP +
Sbjct: 294 VQ---NGT----PYWKIANSWNSDWGDNGYFLIRRGTDECGIESTGVAGLPSL 339
>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
Length = 338
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 91/241 (37%), Positives = 139/241 (57%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W +C TI +RDQG+CGS WA+ A +DR+C+A+ + LS++++
Sbjct: 86 IPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSAFADRLCVATNADFNELLSAEEIT 145
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ KAWK + G+V+GG Y S +GC PY + PC G+++
Sbjct: 146 FCCHTCGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQGNNTCAGK 205
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
+ C R C D+ +++D + R Y L +I +++ +GP+E S +Y D
Sbjct: 206 PMESNHRCTRMCYGDQDLDFDEDHRYTRDYYYLTYG--SIQKDVMTYGPIEASFDVYDDF 263
Query: 276 ILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
YK+G+Y K LG HA+++IGWG+E V YWL+ NS+N +WG++G F+
Sbjct: 264 PSYKSGVYVKSENASYLGGHAVKLIGWGEE-------YGVPYWLMVNSWNEDWGDHGFFK 316
Query: 335 I 335
I
Sbjct: 317 I 317
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 89/162 (54%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C G+ + + C R C D+ +++D + R Y L
Sbjct: 183 GCEPYRVPPCPNDDQGNNTCAGKPMESNHRCTRMCYGDQDLDFDEDHRYTRDYYYLTYG- 241
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+I +++ +GP+E S +Y D YK+G+Y K LG HA+++IGWG+E
Sbjct: 242 -SIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENASYLGGHAVKLIGWGEE------- 293
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWL+ NS+N +WG++G F+I RG NECG++ TAG+P
Sbjct: 294 YGVPYWLMVNSWNEDWGDHGFFKIQRGTNECGVDNSTTAGVP 335
>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
Length = 376
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 111/310 (35%), Positives = 165/310 (53%), Gaps = 47/310 (15%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARI 105
P + A LS T+ + + +G P P+ L + +S P +LP+ FDAR
Sbjct: 53 PDAGWEAAMNPQLSNFTVGQFKYLLGAKP---TPKKELMGVPMISHPKTLKLPKEFDART 109
Query: 106 NWPYCPTIQEIRDQ-----------------GSCGSGWALGAVEAMSDRVCIASRGKRHV 148
WP+C TI +I Q G CGS WA GAVE++SDR CI ++
Sbjct: 110 AWPHCSTIGKILGQLLSFYNIFSIFFFLFLEGHCGSCWAFGAVESLSDRFCI--HFGMNI 167
Query: 149 RLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERY 205
LS +DL++CC CG+GC GG+ AW+Y+V G+V+ + C PY I C
Sbjct: 168 SLSVNDLLACCGFLCGDGCDGGYPMYAWRYFVHHGVVT-------EECDPYFDNIGC--- 217
Query: 206 MNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPV 265
SH C+ P TP+C+RKC + + ++ AY + ++ +M E++++GPV
Sbjct: 218 ---SHPGCEPGFP-TPKCVRKCIDKNQL-WRQSKHYSVNAYRISSDPHDVMAEVYKNGPV 272
Query: 266 EGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 325
E S T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+AN +N
Sbjct: 273 EVSFTVYEDFAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGE------DYWLLANQWNR 326
Query: 326 NWGENGLFRI 335
WG++G F+I
Sbjct: 327 GWGDDGYFKI 336
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 92/154 (59%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C+RKC + + ++ AY + ++ +M E+
Sbjct: 208 CDPYFDNIGCSHPGCEPGFPTPKCVRKCIDKNQL-WRQSKHYSVNAYRISSDPHDVMAEV 266
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE S T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 267 YKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGE------DYWLL 320
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N WG++G F+I RG NECGIE D AGLP
Sbjct: 321 ANQWNRGWGDDGYFKIRRGTNECGIEDDAVAGLP 354
>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
Length = 168
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 83/162 (51%), Positives = 115/162 (70%), Gaps = 8/162 (4%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY+I PCE ++NG+R +C E TP+CI+ CQ Y V+YE D ++G +YS+P +
Sbjct: 14 GCHPYKIAPCEHHVNGTRPACNGEEGKTPKCIKHCQASYTVAYEQDKSYGAKSYSVPHHV 73
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +EI +GPVEG+ T+Y D++ YK G+Y+HV G LG HAIRI+GWG E +
Sbjct: 74 AQIQKEIMTNGPVEGAFTVYEDLVQYKDGVYQHVTGKMLGGHAIRILGWGVE-------N 126
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL+ANS+NT+WG NG F+I+RG + CGIE+ I+AG+PK
Sbjct: 127 DVPYWLIANSWNTDWGNNGFFKILRGSDHCGIESQISAGIPK 168
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 76/152 (50%), Positives = 104/152 (68%), Gaps = 8/152 (5%)
Query: 185 SGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGR 243
SGG + S QGC PY+I PCE ++NG+ +C E TP+CI+ CQ Y V+YE D ++G
Sbjct: 5 SGGPFGSNQGCHPYKIAPCEHHVNGTRPACNGEEGKTPKCIKHCQASYTVAYEQDKSYGA 64
Query: 244 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 303
+YS+P + I +EI +GPVEG+ T+Y D++ YK G+Y+HV G LG HAIRI+GWG
Sbjct: 65 KSYSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYKDGVYQHVTGKMLGGHAIRILGWGV 124
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E + V YWL+ANS+NT+WG NG F+I
Sbjct: 125 E-------NDVPYWLIANSWNTDWGNNGFFKI 149
>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
Length = 340
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 122/336 (36%), Positives = 175/336 (52%), Gaps = 25/336 (7%)
Query: 3 KSTADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKL 62
KS VA F L + S F L K+F +S + +S
Sbjct: 6 KSALCLVAVFAVLLATTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSGK 65
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
+L E+ MGV S + P + + ++LPE FDA +WP C TI EIRDQ +C
Sbjct: 66 SLEEVRKLMGVTDMST--EAVPPRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNC 123
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA+ AVEA+SDR C G R+S+ +L+SCC CG GC GG AW +WV G
Sbjct: 124 GSCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWVG 182
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNG-SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
I + + C+PY PC + N + C + +TP+C C+ S D +
Sbjct: 183 ITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEK----SEMDLVK 231
Query: 241 F-GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
+ G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G YKHV+G LG HA++++
Sbjct: 232 YKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGGYKHVSGDLLGGHAVKLV 290
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GW GT V YW +ANS+NT+WG+ G F I
Sbjct: 291 GW-------GTQGGVPYWKIANSWNTDWGDKGYFLI 319
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 98/172 (56%), Gaps = 15/172 (8%)
Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNF- 383
W G+ C+PY PC + N + C +TP+C C+ S D + +
Sbjct: 178 WVWVGITTEVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEK----SEMDLVKYK 233
Query: 384 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 443
G +YS+ +E +M E+ +GP+E +M +Y+D + YK+G YKHV+G LG HA++++GW
Sbjct: 234 GGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGGYKHVSGDLLGGHAVKLVGW 292
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT V YW +ANS+NT+WG+ G F I RG NECGIE+ AG P
Sbjct: 293 -------GTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337
>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
Length = 317
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 159/311 (51%), Gaps = 34/311 (10%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGV----HPDSKLPQNRLPLLV 88
LSKAF VD L + + + +TL E + GV + S LP+ R
Sbjct: 9 LSKAF--VDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRF---- 62
Query: 89 QLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHV 148
+ LP FD+ WP CPTI +I DQ +CGS WA+ A AMSDR C G + V
Sbjct: 63 TEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG-GVQDV 121
Query: 149 RLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMN 207
+S+ DL++CC DCG+GC GG +AW Y+ +TG+VS C+PY P C +
Sbjct: 122 HISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSK 174
Query: 208 GS--HSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP 264
+ C +TP+C C P V +N+ E+ MRE+F GP
Sbjct: 175 SKNGYPPCSQFNFDTPKCNYTCDDPTIPV-----VNYRSWTSYALQGEDDYMRELFFRGP 229
Query: 265 VEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
E + +Y D I Y +G+Y HV+G LG HA+R++GW GTS+ V YW +ANS+N
Sbjct: 230 FEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGW-------GTSNGVPYWKIANSWN 282
Query: 325 TNWGENGLFRI 335
T WG +G F I
Sbjct: 283 TEWGMDGYFLI 293
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 91/169 (53%), Gaps = 16/169 (9%)
Query: 331 GLFRIGCRPYEIPCERYMNGSRSS---CQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRI 386
GL C+PY P + + S++ C +TP+C C P V +N+
Sbjct: 155 GLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDPTIPV-----VNYRSW 209
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
E+ MRE+F GP E + +Y D I Y +G+Y HV+G LG HA+R++GW
Sbjct: 210 TSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGW--- 266
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GTS+ V YW +ANS+NT WG +G F I RG +ECGIE +AG+P
Sbjct: 267 ----GTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 311
>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
putative [Trypanosoma brucei gambiense DAL972]
Length = 340
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 159/311 (51%), Gaps = 34/311 (10%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGV----HPDSKLPQNRLPLLV 88
LSKAF VD L + + + +TL E + GV + S LP+ R
Sbjct: 32 LSKAF--VDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRF---- 85
Query: 89 QLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHV 148
+ LP FD+ WP CPTI +I DQ +CGS WA+ A AMSDR C G + V
Sbjct: 86 TEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG-GVQDV 144
Query: 149 RLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMN 207
+S+ DL++CC DCG+GC GG +AW Y+ +TG+VS C+PY P C +
Sbjct: 145 HISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSK 197
Query: 208 GS--HSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP 264
+ C +TP+C C P V +N+ E+ MRE+F GP
Sbjct: 198 SKNGYPPCSQFNFDTPKCNYTCDDPTIPV-----VNYRSWTSYALQGEDDYMRELFFRGP 252
Query: 265 VEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
E + +Y D I Y +G+Y HV+G LG HA+R++GW GTS+ V YW +ANS+N
Sbjct: 253 FEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGW-------GTSNGVPYWKIANSWN 305
Query: 325 TNWGENGLFRI 335
T WG +G F I
Sbjct: 306 TEWGMDGYFLI 316
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 91/169 (53%), Gaps = 16/169 (9%)
Query: 331 GLFRIGCRPYEIPCERYMNGSRSS---CQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRI 386
GL C+PY P + + S++ C +TP+C C P V +N+
Sbjct: 178 GLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDPTIPV-----VNYRSW 232
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
E+ MRE+F GP E + +Y D I Y +G+Y HV+G LG HA+R++GW
Sbjct: 233 TSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGW--- 289
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GTS+ V YW +ANS+NT WG +G F I RG +ECGIE +AG+P
Sbjct: 290 ----GTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 334
>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
Free-electron Laser Pulse Data By Serial Femtosecond
X-ray Crystallography
gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 340
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 159/311 (51%), Gaps = 34/311 (10%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGV----HPDSKLPQNRLPLLV 88
LSKAF VD L + + + +TL E + GV + S LP+ R
Sbjct: 32 LSKAF--VDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRF---- 85
Query: 89 QLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHV 148
+ LP FD+ WP CPTI +I DQ +CGS WA+ A AMSDR C G + V
Sbjct: 86 TEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG-GVQDV 144
Query: 149 RLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMN 207
+S+ DL++CC DCG+GC GG +AW Y+ +TG+VS C+PY P C +
Sbjct: 145 HISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSK 197
Query: 208 GS--HSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP 264
+ C +TP+C C P V +N+ E+ MRE+F GP
Sbjct: 198 SKNGYPPCSQFNFDTPKCNYTCDDPTIPV-----VNYRSWTSYALQGEDDYMRELFFRGP 252
Query: 265 VEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
E + +Y D I Y +G+Y HV+G LG HA+R++GW GTS+ V YW +ANS+N
Sbjct: 253 FEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGW-------GTSNGVPYWKIANSWN 305
Query: 325 TNWGENGLFRI 335
T WG +G F I
Sbjct: 306 TEWGMDGYFLI 316
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 91/169 (53%), Gaps = 16/169 (9%)
Query: 331 GLFRIGCRPYEIPCERYMNGSRSS---CQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRI 386
GL C+PY P + + S++ C +TP+C C P V +N+
Sbjct: 178 GLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDPTIPV-----VNYRSW 232
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
E+ MRE+F GP E + +Y D I Y +G+Y HV+G LG HA+R++GW
Sbjct: 233 TSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGW--- 289
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GTS+ V YW +ANS+NT WG +G F I RG +ECGIE +AG+P
Sbjct: 290 ----GTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 334
>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 323
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 98/245 (40%), Positives = 142/245 (57%), Gaps = 11/245 (4%)
Query: 96 ELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
++P+ FDAR + C I +++DQG+C S WA+ +DR+CIAS GK LS+ +
Sbjct: 63 DIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGKFTDNLSAQN 122
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE-IPCERYMNGSHSSC 213
L+SC D GC GG KAW++ + GIV+GG Y S +GC+PY+ PC+ Y + S ++C
Sbjct: 123 LMSCGDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYGDSSLTNC 182
Query: 214 QD-NEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMT 270
C KC Y V YEDDL + Y N + I +EI +GPV M
Sbjct: 183 SSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSWTNVKQIQQEIMTYGPVTAFMY 242
Query: 271 IYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
+Y + + YK G+YK AG +G H +++IGWG + G ++YWL NS+N+NWG +
Sbjct: 243 VYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAG------IEYWLAMNSWNSNWGND 296
Query: 331 GLFRI 335
GLF+I
Sbjct: 297 GLFKI 301
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 94/166 (56%), Gaps = 10/166 (6%)
Query: 336 GCRPYE-IPCERYMNGSRSSCQA-NEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP- 391
GC+PY+ PC+ Y + S ++C + C KC Y V YEDDL + Y
Sbjct: 162 GCQPYKNRPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW 221
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
N + I +EI +GPV M +Y + + YK G+YK AG +G H +++IGWG + G
Sbjct: 222 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAG-- 279
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
++YWL NS+N+NWG +GLF+I+RG N C IE + AGL +
Sbjct: 280 ----IEYWLAMNSWNSNWGNDGLFKILRGYNFCSIELLVMAGLVDV 321
>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
Length = 332
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/243 (39%), Positives = 141/243 (58%), Gaps = 25/243 (10%)
Query: 98 PEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVS 157
PE F R W +C +I+ IRDQ +CGS WA A E++SDR+CI + GK V +S++DL++
Sbjct: 88 PESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLA 147
Query: 158 CCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYA-----SKQGCRPYEIPCERYMNGSHSS 212
CC CG+GC G H + I+ G ++ GC+PY +P +
Sbjct: 148 CCHTCGHGCDGRCHCS------SVAILQGRRLVPEPVRTEDGCQPYSLP------PCVPN 195
Query: 213 CQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
C EP TP+C C+ GY+ SYE+D +F + Y L + I +I+++GPVE + +Y
Sbjct: 196 CTHPEP-TPKCQHVCRKGYEKSYEEDKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVY 254
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
AD YK+G+Y+ +G HAI+I+GWG E +G V YWLVANS+N WG+ G
Sbjct: 255 ADFPSYKSGVYQQHMIKFMGVHAIKILGWGTE---DG----VPYWLVANSWNVGWGDKGY 307
Query: 333 FRI 335
F+I
Sbjct: 308 FKI 310
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/160 (43%), Positives = 99/160 (61%), Gaps = 14/160 (8%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
GC+PY +P +C EP TP+C C+ GY+ SYE+D +F + Y L +
Sbjct: 183 GCQPYSLP------PCVPNCTHPEP-TPKCQHVCRKGYEKSYEEDKHFAKNVYRLLKKCD 235
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I +I+++GPVE + +YAD YK+G+Y+ +G HAI+I+GWG E +G
Sbjct: 236 AIKTDIYKNGPVESAFFVYADFPSYKSGVYQQHMIKFMGVHAIKILGWGTE---DG---- 288
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
V YWLVANS+N WG+ G F+I+RG++ECGIE I AG+P
Sbjct: 289 VPYWLVANSWNVGWGDKGYFKILRGKDECGIEEVIDAGIP 328
>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 107/281 (38%), Positives = 152/281 (54%), Gaps = 26/281 (9%)
Query: 59 LSKLTLSELEMRMGV--HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T +E G S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFAEARRLTGAFRRKTSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +CGS WA+ A+SDR C G + +R+S+ L+SCCKDCG+GC GG+ AW+
Sbjct: 110 ADQSACGSCWAVSTASAISDRHCTVG-GVQQLRISAAHLLSCCKDCGDGCDGGYPDSAWE 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERYM-NGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V+ G+ S C+PY P C + G C + +TP+C C D +
Sbjct: 169 YYVSHGLAS-------SYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCT---DKA 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +Y L E+ RE++ +GP + +Y+D + YKTG+Y+HV+G LG H
Sbjct: 219 IPLIKYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGGH 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+RI+GWG+ GT YW +ANS++T+WG NG F I
Sbjct: 279 AVRIVGWGKL---NGT----PYWKIANSWDTDWGMNGHFLI 312
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 97/171 (56%), Gaps = 12/171 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERYM-NGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
+GL C+PY P C + G + C + +TP+C C D + G
Sbjct: 172 SHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCT---DKAIPLIKYRGND 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+Y L E+ RE++ +GP + +Y+D + YKTG+Y+HV+G LG HA+RI+GWG+
Sbjct: 229 SYVLLHGEDDFKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL 288
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT YW +ANS++T+WG NG F I+RG NECGIE+ AGLP I
Sbjct: 289 ---NGT----PYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLPAI 332
>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 111/281 (39%), Positives = 152/281 (54%), Gaps = 26/281 (9%)
Query: 59 LSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T SE + G + LP R QL +LPE FDA +WP+CPTI+EI
Sbjct: 54 MQNITFSEAKRLTGARIQKSRTLPPARF-TEEQLR---TKLPETFDAAEHWPHCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ C + WA+ A+SDR C GK+ +R+S+ DL++CCK CG+GC+GGF G AW
Sbjct: 110 ADQSECRASWAVSTASAISDRYCTVGGGKQ-LRISAADLMACCKQCGDGCKGGFPGFAWL 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CE-RYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V GI S C+PY P CE R G+ + C + +TP+C C D S
Sbjct: 169 YYVEYGITS-------SQCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCT---DKS 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G Y L EE RE++ +GP +Y D+ YK+G+Y++V G LG
Sbjct: 219 IPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQ 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+RI+GWG+ GT YW VANS++T+WG NG I
Sbjct: 279 AVRIVGWGKL---NGTP----YWKVANSWDTDWGMNGYMLI 312
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 89/169 (52%), Gaps = 12/169 (7%)
Query: 329 ENGLFRIGCRPYEIP-CE-RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E G+ C+PY P CE R G+++ C + +TP+C C D S G
Sbjct: 172 EYGITSSQCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCT---DKSIPLVKYRGNA 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
Y L EE RE++ +GP +Y D+ YK+G+Y++V G LG A+RI+GWG+
Sbjct: 229 TYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL 288
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT YW VANS++T+WG NG I+RG NEC IE G P
Sbjct: 289 ---NGTP----YWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFP 330
>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 108/281 (38%), Positives = 153/281 (54%), Gaps = 27/281 (9%)
Query: 59 LSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T +E G + S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFAEARRLTGARIQKTSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +CGS WA+ A+SDR C G + +R+S+ L+SCCKDCG GC GG+ G AW+
Sbjct: 110 ADQSACGSCWAVSTASAISDRYCTVG-GVQQLRISAAHLLSCCKDCGYGCDGGYPGTAWE 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERY-MNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V+ G+ S C+PY P C + G C + +TP+C C D +
Sbjct: 169 YYVSHGLAS-------SYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCT---DKA 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +Y L E+ RE++ +GP + +Y+D + YKTG+Y+HV+G LG H
Sbjct: 219 IPLIKYRGNHSYGLDG-EDDYKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGGH 277
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+RI+GWG+ GT YW +ANS++T+WG NG F I
Sbjct: 278 AVRIVGWGKL---NGTP----YWKIANSWDTDWGMNGHFLI 311
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 67/171 (39%), Positives = 99/171 (57%), Gaps = 13/171 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERYM-NGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
+GL C+PY P C + G + C + +TP+C C D + G
Sbjct: 172 SHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCT---DKAIPLIKYRGNH 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+Y L E+ RE++ +GP + +Y+D + YKTG+Y+HV+G LG HA+RI+GWG+
Sbjct: 229 SYGLDG-EDDYKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKL 287
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT YW +ANS++T+WG NG F I+RG++ECGIE++ AGLP I
Sbjct: 288 ---NGTP----YWKIANSWDTDWGMNGHFLILRGKDECGIESEGYAGLPAI 331
>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 109/252 (43%), Positives = 140/252 (55%), Gaps = 29/252 (11%)
Query: 89 QLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHV 148
+L PL++ FDA WP CPT+ EIRDQ SCGS WA+ A A+SDR C G R +
Sbjct: 87 ELRVPLQDR---FDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDL 142
Query: 149 RLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMN 207
R+S+ DL+SCC CG GC GG+ AW+Y+ GIVS + C+PY P C ++N
Sbjct: 143 RISAGDLMSCCDVCGFGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVN 195
Query: 208 GSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEG 267
S S E +TP C C D G +Y L + EE RE+ +GP E
Sbjct: 196 SSDLSPCSGEYDTPTCNSTCT---DKKIPLIKYRGNTSYVL-SGEEPFKRELILNGPFEV 251
Query: 268 SMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ---EPLGEGTSSVVKYWLVANSFN 324
S ++YAD + Y G+YKHVAG LG HA+RI+GWG+ EP YW +ANS+N
Sbjct: 252 SFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWGELNGEP----------YWKIANSWN 301
Query: 325 TNWGENGLFRIG 336
WG NG F I
Sbjct: 302 REWGMNGYFLIA 313
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 71/172 (41%), Positives = 93/172 (54%), Gaps = 18/172 (10%)
Query: 330 NGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+G+ C+PY P C ++N S S + E +TP C C D G +Y
Sbjct: 175 HGIVSEYCQPYPFPSCAHHVNSSDLSPCSGEYDTPTCNSTCT---DKKIPLIKYRGNTSY 231
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ--- 445
L + EE RE+ +GP E S ++YAD + Y G+YKHVAG LG HA+RI+GWG+
Sbjct: 232 VL-SGEEPFKRELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWGELNG 290
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
EP YW +ANS+N WG NG F I RG +ECGIE AG P+I
Sbjct: 291 EP----------YWKIANSWNREWGMNGYFLIARGVDECGIEGSGVAGTPRI 332
>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
Length = 280
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 131/216 (60%), Gaps = 18/216 (8%)
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVT 180
CGS WA E +SDR+CIA++G + +S D+++CC + CG+GC+GG+ +A+++W +
Sbjct: 61 CGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGRSCGDGCEGGYPIQAFRWWNS 120
Query: 181 TGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
G+V+GG + GCRPY PC Y + E TP C CQ GY +Y D
Sbjct: 121 RGVVTGGDFRG-SGCRPYPFAPCNSY--------KCPEEKTPTCSLSCQFGYSTAYAKDK 171
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
FG AY++ N I EI +GPV G+ T+Y DM YK+G+Y+H AG LG HAI+II
Sbjct: 172 RFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKII 231
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GW GT + + YWL+ANS+ +WGENG ++
Sbjct: 232 GW-------GTQNGIPYWLIANSWGADWGENGFLKM 260
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 111/292 (38%), Positives = 146/292 (50%), Gaps = 70/292 (23%)
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVE S T+Y D +YK G+Y++ AG +G HAI+I+GWG E GT YWL+AN
Sbjct: 3 NGPVEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGTE---HGTD----YWLIAN 55
Query: 322 SFNTNWGENGLF--------RI---------------------------GCR-PYEIPCE 345
S+ G F RI GC Y I
Sbjct: 56 SWGAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGRSCGDGCEGGYPIQAF 115
Query: 346 RYMN-------------GSR-------SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGR 385
R+ N G R +S + E TP C CQ GY +Y D FG
Sbjct: 116 RWWNSRGVVTGGDFRGSGCRPYPFAPCNSYKCPEEKTPTCSLSCQFGYSTAYAKDKRFGV 175
Query: 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 445
AY++ N I EI +GPV G+ T+Y DM YK+G+Y+H AG LG HAI+IIGW
Sbjct: 176 SAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGW-- 233
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT + + YWL+ANS+ +WGENG ++ RG NECGIE+ + AG+PK+
Sbjct: 234 -----GTQNGIPYWLIANSWGADWGENGFLKMRRGVNECGIESAVVAGMPKV 280
>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
Length = 345
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 107/265 (40%), Positives = 148/265 (55%), Gaps = 23/265 (8%)
Query: 79 LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRV 138
+ QNR P++ D +++PE FDAR +W C +++ IRDQ +CGS WA+ A+SDR+
Sbjct: 76 VNQNRKPVVENADDEDDDIPESFDARTHWANCTSLRHIRDQANCGSCWAVSTASALSDRI 135
Query: 139 CIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY 198
CIAS+G+ + +SS D+VSCCK CG GC GG+ +A+ Y+ G V+G T SK GCRPY
Sbjct: 136 CIASKGETQLHISSIDIVSCCKLCGYGCDGGWPIEAFDYFSRQGAVTGET-TSKDGCRPY 194
Query: 199 EI-PCERYMNGS-----HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
P Y N + C+ ++ E +++ V+ G A L E
Sbjct: 195 PFHPLWTYGNDTVGRRMSGRCKHSK-TVGEGVKR------VTRNHTRRTGLTARRLRITE 247
Query: 253 ETIMREIFRH--GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
H GPV T+Y D YK GIY H+AG G HAI+IIGWG E
Sbjct: 248 FCQSHSEGDHGNGPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHAIKIIGWGVE------ 301
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
+ + YWL+ANS++ +WGE GLFRI
Sbjct: 302 -NGLPYWLIANSWHDDWGEQGLFRI 325
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 49/89 (55%), Positives = 60/89 (67%), Gaps = 7/89 (7%)
Query: 405 GPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 464
GPV T+Y D YK GIY H+AG G HAI+IIGWG E + + YWL+ANS
Sbjct: 260 GPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHAIKIIGWGVE-------NGLPYWLIANS 312
Query: 465 FNTNWGENGLFRIVRGQNECGIEADITAG 493
++ +WGE GLFRIVRG NECGIE ++ AG
Sbjct: 313 WHDDWGEQGLFRIVRGINECGIEQEVVAG 341
>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
Length = 463
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 105/266 (39%), Positives = 150/266 (56%), Gaps = 22/266 (8%)
Query: 84 LPLLVQLSDPLEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
LP + E +P FDAR +P C + +RDQG CGS WA + EA +DR+CI S
Sbjct: 155 LPAKTVFENANEPVPANFDARTAFPVCKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRS 214
Query: 143 RGKRHVRLSSDDLVSCCK--DCGN-GCQGGFHGKAWKYWVTTGIVSGG---TYASKQGCR 196
+GK + LS+ SCC C + GC GG G AW+++ G+V+GG T C
Sbjct: 215 QGKGVMPLSTQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDTLGKGTTCW 274
Query: 197 PYEIP-CERYMNGSHSSCQ-DNEP-NTPECIRKCQPG----YDVSYEDDLNFGRIAYSLP 249
PYEIP C + +C D P TP+C + C+ + + ++ D++ +YSL
Sbjct: 275 PYEIPFCAHHAKAPFPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFDKDVHKASSSYSL- 333
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
+ + + R++ HG V G+ +Y D + YK+G+YKHV GGPLG HAI+IIGWG E GE
Sbjct: 334 RSRDAVKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGTED-GE- 391
Query: 310 TSSVVKYWLVANSFNTNWGENGLFRI 335
+YW NS+NT WG++G F+I
Sbjct: 392 -----EYWHAVNSWNTYWGDSGHFKI 412
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 63/164 (38%), Positives = 96/164 (58%), Gaps = 17/164 (10%)
Query: 337 CRPYEIP-CERYMNGSRSSCQAN--EPNTPECIRKCQPG----YDVSYEDDLNFGRIAYS 389
C PYEIP C + +C + TP+C + C+ + + ++ D++ +YS
Sbjct: 273 CWPYEIPFCAHHAKAPFPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFDKDVHKASSSYS 332
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
L + + + R++ HG V G+ +Y D + YK+G+YKHV GGPLG HAI+IIGWG E G
Sbjct: 333 L-RSRDAVKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGTED-G 390
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
E +YW NS+NT WG++G F+I GQ CG++ ++ AG
Sbjct: 391 E------EYWHAVNSWNTYWGDSGHFKIEMGQ--CGVDNEMVAG 426
>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
Length = 333
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 99/249 (39%), Positives = 138/249 (55%), Gaps = 17/249 (6%)
Query: 90 LSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
L P +P FDAR W C TI IRDQG+CGS WA A +DR+CIAS G +
Sbjct: 77 LYSPTGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQL 136
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNG 208
LS++ + SCC CG GCQGG+ +AW+Y+ G+V+GG + S +GC+PY PC
Sbjct: 137 LSAEHVTSCCYRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCT----- 191
Query: 209 SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF-GRIAYSLPANEETIMREIFRHGPVEG 267
++SC +C +KC +SY D + R Y L + + +I +GP+E
Sbjct: 192 GNNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAY--DNMQNDIMTYGPIES 249
Query: 268 SMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
S +Y D I YK+G+Y K LG H+++ IGWG E V YWL+ NS+N+
Sbjct: 250 SFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVE-------RNVSYWLMMNSWNST 302
Query: 327 WGENGLFRI 335
WG+ G F+I
Sbjct: 303 WGDGGYFKI 311
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 85/164 (51%), Gaps = 17/164 (10%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNF-GRIAYSLPAN 393
GC+PY P C +SC +C +KC +SY D + R Y L
Sbjct: 181 GCQPYMFPPCT-----GNNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAY- 234
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + +I +GP+E S +Y D I YK+G+Y K LG H+++ IGWG E
Sbjct: 235 -DNMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVE------ 287
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL+ NS+N+ WG+ G F+I RG NEC +E TAG+P+
Sbjct: 288 -RNVSYWLMMNSWNSTWGDGGYFKIRRGTNECQVEDSSTAGVPE 330
>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/242 (40%), Positives = 143/242 (59%), Gaps = 11/242 (4%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+P+ FDAR W C TI +RDQG+CGS WAL A +DR+C+A+ + LS ++L
Sbjct: 87 RIPKKFDARKKWRKCKTIGAVRDQGNCGSCWALATSSAFADRLCVATDADFNEFLSPEEL 146
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
CC CG GC GG+ KAW+ + + G+V+GG Y S +GC PY +P R+ ++SC D
Sbjct: 147 TFCCHTCGYGCNGGYPIKAWERFKSHGLVTGGDYKSGEGCEPYRVPPCRHHAEGNNSCSD 206
Query: 216 NE-PNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
C R C D+ ++DD + R +Y L +I +++ +GP+E S +Y D
Sbjct: 207 KPMEKNHRCTRMCYGDQDLDFDDDHRYTRDSYYLTYG--SIQKDVMNYGPIEASFDVYDD 264
Query: 275 MILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
YK+G+Y + LG HA+++IGWG+E S V YWL+ NS+NT+WG+ GLF
Sbjct: 265 FPSYKSGVYIRSDNASYLGGHAVKLIGWGEE-------SGVPYWLMVNSWNTDWGDKGLF 317
Query: 334 RI 335
+I
Sbjct: 318 KI 319
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 92/162 (56%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C + G+ S C R C D+ ++DD + R +Y L
Sbjct: 185 GCEPYRVPPCRHHAEGNNSCSDKPMEKNHRCTRMCYGDQDLDFDDDHRYTRDSYYLTYG- 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+I +++ +GP+E S +Y D YK+G+Y + LG HA+++IGWG+E
Sbjct: 244 -SIQKDVMNYGPIEASFDVYDDFPSYKSGVYIRSDNASYLGGHAVKLIGWGEE------- 295
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
S V YWL+ NS+NT+WG+ GLF+I RG NECG++ TAG+P
Sbjct: 296 SGVPYWLMVNSWNTDWGDKGLFKIQRGTNECGVDNSTTAGVP 337
>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
Length = 260
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/242 (41%), Positives = 141/242 (58%), Gaps = 11/242 (4%)
Query: 96 ELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
++P FDAR + C I +++DQG+C S WA+ SDR+CIAS G+ LS+ +
Sbjct: 25 DIPREFDARQYFGSCADVIGDVKDQGNCASSWAVAVASTFSDRLCIASNGQFTDNLSAQN 84
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC 213
L+SC + GC GG KAW+ ++ GIV+GG + S +GC+PY+I PC Y NG+ +C
Sbjct: 85 LLSCGDEEKMGCDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPYKIRPCNHYGNGNLKNC 144
Query: 214 QD-NEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMT 270
C KC Y V YEDDL+ I Y N + I +EI +GPV M
Sbjct: 145 SSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMY 204
Query: 271 IYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
+Y + + YK GIYK AG +G H +++IGWG + G+GT +YWL NS+N+NWG N
Sbjct: 205 VYENFMGYKEGIYKSTAGELIGYHHVKLIGWGVD--GDGT----EYWLAMNSWNSNWGTN 258
Query: 331 GL 332
GL
Sbjct: 259 GL 260
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 69/176 (39%), Positives = 93/176 (52%), Gaps = 10/176 (5%)
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQA-NEP 360
+E +G S K W + S G N GC+PY+I PC Y NG+ +C +
Sbjct: 91 EEKMGCDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPYKIRPCNHYGNGNLKNCSSLRRT 150
Query: 361 NTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMTIYADMI 418
C KC Y V YEDDL+ I Y N + I +EI +GPV M +Y + +
Sbjct: 151 QMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMYVYENFM 210
Query: 419 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 474
YK GIYK AG +G H +++IGWG + G+GT +YWL NS+N+NWG NGL
Sbjct: 211 GYKEGIYKSTAGELIGYHHVKLIGWGVD--GDGT----EYWLAMNSWNSNWGTNGL 260
>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 106/280 (37%), Positives = 153/280 (54%), Gaps = 26/280 (9%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
L+ T+ + + +GV P +P ELP+ FDAR W C TI +I D
Sbjct: 59 LANYTIEQFKHILGVKPTPPGLLAGVPTKTYSRSEKAELPKEFDARSKWSGCSTIGKILD 118
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CG+ WA GAVE + DR CI +V LS +DLV+CC CG+GC GG+ AW+Y
Sbjct: 119 QGHCGACWAFGAVECLQDRFCI--HHSVNVSLSVNDLVACCGFLCGDGCDGGYPIFAWQY 176
Query: 178 WVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+V G+V+ C P+ ++ C+ H C+ P TP C +KC+ V +
Sbjct: 177 FVENGVVT-------DECDPFFDQVGCQ------HPGCEPAYP-TPVCEKKCKVQNQV-W 221
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
E+ +F AY + ++ IM E++++GPVE S IY D YK+G+YK + G +G HA
Sbjct: 222 EEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQITGRMVGGHA 281
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 282 AKLIGWGTSDAGE------DYWLLANQWNRGWGDDGYFKI 315
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/169 (39%), Positives = 99/169 (58%), Gaps = 16/169 (9%)
Query: 329 ENGLFRIGCRPY--EIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
ENG+ C P+ ++ C+ C+ P TP C +KC+ V +E+ +F
Sbjct: 179 ENGVVTDECDPFFDQVGCQH------PGCEPAYP-TPVCEKKCKVQNQV-WEEKKHFSID 230
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
AY + ++ IM E++++GPVE S IY D YK+G+YK + G +G HA ++IGWG
Sbjct: 231 AYQVNSDPHDIMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQITGRMVGGHAAKLIGWGTS 290
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GE YWL+AN +N WG++G F+I+RG NECGIE D+ AG+P
Sbjct: 291 DAGE------DYWLLANQWNRGWGDDGYFKIIRGTNECGIEGDVNAGMP 333
>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 114/307 (37%), Positives = 160/307 (52%), Gaps = 28/307 (9%)
Query: 33 LSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQL 90
L+K F VDH L + + +T SE + G + S L R QL
Sbjct: 31 LTKTF--VDHINQLNGGMWKAVYNGKMQNITFSEAKRLTGARIQKSSGLQPARFTE-EQL 87
Query: 91 SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+LPE FDA +WP+CPTI+EI DQ C + WA+ A+SDR C +GK+ +R+
Sbjct: 88 R---TKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGKQ-LRI 143
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERY-MNG 208
S+ L+SCCKDCG+GC+GGF G AW+Y+V GI S C+PY P CE G
Sbjct: 144 SAAHLLSCCKDCGDGCKGGFPGFAWRYYVEYGITS-------SSCQPYPFPRCEHQGAQG 196
Query: 209 SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
+ + C +TP+C C D + G Y L EE RE++ +GP
Sbjct: 197 NKTPCSKYNFDTPKCNATCT---DKAIPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAV 253
Query: 269 MTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
+Y D+ YK+G+Y+HV G LG A++++GWG+ GT YW +ANS++T+WG
Sbjct: 254 FYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL---NGTP----YWKLANSWDTDWG 306
Query: 329 ENGLFRI 335
G I
Sbjct: 307 MGGYLLI 313
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 88/170 (51%), Gaps = 12/170 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERY-MNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E G+ C+PY P CE G+++ C +TP+C C D + G
Sbjct: 173 EYGITSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCT---DKAIPLIKYRGNA 229
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
Y L EE RE++ +GP +Y D+ YK+G+Y+HV G LG A++++GWG+
Sbjct: 230 TYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL 289
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
GT YW +ANS++T+WG G I+RG NEC IE AG P+
Sbjct: 290 ---NGTP----YWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPE 332
>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
Length = 334
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 98/252 (38%), Positives = 146/252 (57%), Gaps = 15/252 (5%)
Query: 88 VQLSDPL---EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
++ DPL + P+ FD+R NW C I IRDQG+CGS W+ A +DR+C+++ G
Sbjct: 73 IKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGG 132
Query: 145 KRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCER 204
K + LS ++L CCKDCG GC GG+ KAWKY+ T G+ +GG Y +K+GC PY++P
Sbjct: 133 KFNQLLSPEELAFCCKDCGQGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCY 192
Query: 205 YMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP 264
G ++ +C + C Y + + + YS+ + +TI +++ +GP
Sbjct: 193 NKQGKNTCGGQPMERNHQCPKTC---YGKTTVQNRYKTKSEYSINSI-KTIEQDLKTYGP 248
Query: 265 VEGSMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
VE S +Y D +YK+GIY+ G H+I+IIGWGQE GT+ YWL NS+
Sbjct: 249 VEASFDVYDDFSVYKSGIYRKTPKAKYEGRHSIKIIGWGQE---NGTT----YWLAVNSW 301
Query: 324 NTNWGENGLFRI 335
+ WGE+G F+I
Sbjct: 302 SKFWGEHGTFKI 313
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 104/186 (55%), Gaps = 18/186 (9%)
Query: 314 VKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSC--QANEPNTPECIRKCQ 370
+K W + G + + GC PY++P C Y +++C Q E N +C + C
Sbjct: 160 IKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPC--YNKQGKNTCGGQPMERNH-QCPKTC- 215
Query: 371 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG 430
Y + + + YS+ + + TI +++ +GPVE S +Y D +YK+GIY+
Sbjct: 216 --YGKTTVQNRYKTKSEYSINSIK-TIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPK 272
Query: 431 GPL-GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEAD 489
G H+I+IIGWGQE GT+ YWL NS++ WGE+G F+I++G+NECGIE
Sbjct: 273 AKYEGRHSIKIIGWGQE---NGTT----YWLAVNSWSKFWGEHGTFKIIKGRNECGIERA 325
Query: 490 ITAGLP 495
+TAG+P
Sbjct: 326 VTAGIP 331
>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
Length = 335
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 94/243 (38%), Positives = 145/243 (59%), Gaps = 13/243 (5%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
E+PE FD+R+ W YC TI +R+QG+CGS WA G A +DR+C+A+ G+ + +S+++L
Sbjct: 83 EVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEEL 142
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC-- 213
CC C GC GG+ KAW+Y+ G+V+GG Y + GC+PY +P + H+SC
Sbjct: 143 TFCCHRCVFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSG 202
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
Q E N +C +KC + Y+ + + AY L T+ ++ +GP+E S +Y
Sbjct: 203 QPTERN-HKCSKKCYGDDTIDYKKNHYKTKDAYYL--KNTTMQKDTMVYGPIEASFDVYD 259
Query: 274 DMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
D + Y++G+Y+ LG HA+++IGWG E EGT YWL+ NS+ WG+ G+
Sbjct: 260 DFMNYESGVYQRTGNASYLGGHAVKMIGWGVE---EGTP----YWLMVNSWGEQWGDKGM 312
Query: 333 FRI 335
F+I
Sbjct: 313 FKI 315
Score = 108 bits (271), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 58/164 (35%), Positives = 91/164 (55%), Gaps = 11/164 (6%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C + G S +C +KC + Y+ + + AY L
Sbjct: 181 GCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYL--KN 238
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
T+ ++ +GP+E S +Y D + Y++G+Y+ LG HA+++IGWG E EGT
Sbjct: 239 TTMQKDTMVYGPIEASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVE---EGTP 295
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL+ NS+ WG+ G+F+I+RG +ECGIE+ TAG+P +
Sbjct: 296 ----YWLMVNSWGEQWGDKGMFKILRGTDECGIESSCTAGVPSV 335
>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
Length = 276
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 123/164 (75%), Gaps = 9/164 (5%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+
Sbjct: 113 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 171
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT
Sbjct: 172 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT 228
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 229 ----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 268
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 87/175 (49%), Positives = 122/175 (69%), Gaps = 9/175 (5%)
Query: 162 CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNT 220
C C GG+ +AW +W G+VSGG Y S GCRPY IP CE ++NGS C E +T
Sbjct: 83 CLFSCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT-GEGDT 141
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
P+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+
Sbjct: 142 PKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKS 201
Query: 281 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y+HV G +G HAIRI+GWG E GT YWLVANS+NT+WG+NG F+I
Sbjct: 202 GVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANSWNTDWGDNGFFKI 249
>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 323
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 98/245 (40%), Positives = 141/245 (57%), Gaps = 11/245 (4%)
Query: 96 ELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
++P FDAR + C I +++DQG+C S WA+ +DR+CIAS G+ LS+ +
Sbjct: 63 DIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQN 122
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE-IPCERYMNGSHSSC 213
L+SC GC GG KAW+ + GIV+GG + S +GC+PY+ PC+ Y + ++C
Sbjct: 123 LMSCGDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNC 182
Query: 214 QD-NEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMT 270
C +KC Y V YEDDL+ I Y N + I +EI HGPV M
Sbjct: 183 SSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTHGPVTAFMY 242
Query: 271 IYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
+Y + + YK GIYK G +G H +++IGWG + G+GT +YWL NS+N+NWG +
Sbjct: 243 VYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVD--GDGT----EYWLAMNSWNSNWGND 296
Query: 331 GLFRI 335
GLF+I
Sbjct: 297 GLFKI 301
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 74/198 (37%), Positives = 106/198 (53%), Gaps = 10/198 (5%)
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYE-IPCERYMNGSRSSCQA-NEPN 361
E +G S K W + + G N GC+PY+ PC+ Y + ++C +
Sbjct: 130 EKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNCSSLRRTQ 189
Query: 362 TPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMTIYADMIL 419
C +KC Y V YEDDL+ I Y N + I +EI HGPV M +Y + +
Sbjct: 190 MTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTHGPVTAFMYVYENFMG 249
Query: 420 YKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVR 479
YK GIYK G +G H +++IGWG + G+GT +YWL NS+N+NWG +GLF+I+R
Sbjct: 250 YKEGIYKSTTGELIGYHHVKLIGWGVD--GDGT----EYWLAMNSWNSNWGNDGLFKILR 303
Query: 480 GQNECGIEADITAGLPKI 497
G N C IE + AG+ +
Sbjct: 304 GYNFCSIELLVMAGIVDV 321
>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 350
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 103/292 (35%), Positives = 159/292 (54%), Gaps = 28/292 (9%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + + + + T+++ + +GV P +P + +LP+ FDAR
Sbjct: 50 PNAGWTAGQNSYFANYTIAQFKHILGVKPTPPGLLRGVP--TKTYSRSTDLPKEFDARSK 107
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNG 165
W C TI I DQG CGS WA GAVE + DR CI ++ LS +DLV+CC CG+G
Sbjct: 108 WSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCI--HLNMNISLSVNDLVACCGFMCGDG 165
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPEC 223
C GG+ AW+Y V G+V+ C PY ++ C+ H C+ P TP C
Sbjct: 166 CDGGYPISAWQYLVENGVVT-------DECDPYFDQVGCK------HPGCEPAYP-TPAC 211
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
+KC+ V +++ +F AY + ++ IM E++++GPVE + T+Y D YK+G+Y
Sbjct: 212 EKKCKVQNQV-WQEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY 270
Query: 284 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+H+ G +G HA+++IGWG G+ YWL+AN +N WG++G F+I
Sbjct: 271 EHITGEMMGGHAVKLIGWGTSADGK------DYWLLANQWNRGWGDDGYFKI 316
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 103/169 (60%), Gaps = 16/169 (9%)
Query: 329 ENGLFRIGCRPY--EIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
ENG+ C PY ++ C+ C+ P TP C +KC+ V +++ +F
Sbjct: 180 ENGVVTDECDPYFDQVGCKH------PGCEPAYP-TPACEKKCKVQNQV-WQEKKHFSIN 231
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
AY + ++ IM E++++GPVE + T+Y D YK+G+Y+H+ G +G HA+++IGWG
Sbjct: 232 AYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYEHITGEMMGGHAVKLIGWGTS 291
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
G+ YWL+AN +N WG++G F+I+RG+NECGIE D+ AG+P
Sbjct: 292 ADGK------DYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMP 334
>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 107/281 (38%), Positives = 152/281 (54%), Gaps = 27/281 (9%)
Query: 59 LSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T +E G + S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFAEARRLTGARIQKTSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +CGS WA+ A+SDR C G + +R+S+ L+SCCKDCG GC GG+ AW+
Sbjct: 110 ADQSACGSCWAVSTASAISDRHCTVG-GVQQLRISAAHLLSCCKDCGYGCDGGYPDAAWR 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERYM-NGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V+ G+ S C+PY P C+ + G C + +TP+C C D +
Sbjct: 169 YYVSHGLAS-------SYCQPYPFPHCDHHGGKGKKPPCSKYDFHTPKCNTTCT---DKA 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +Y + EE RE++ +GP + +Y+D YKTG+Y+HV+G LG H
Sbjct: 219 IPLIKYRGNHSYEV-HGEEDYKRELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGGH 277
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+RI+GWG+ GT YW +ANS++T+WG NG F I
Sbjct: 278 AVRIVGWGKL---NGTP----YWKIANSWDTDWGMNGHFLI 311
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 66/171 (38%), Positives = 96/171 (56%), Gaps = 13/171 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERYM-NGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
+GL C+PY P C+ + G + C + +TP+C C D + G
Sbjct: 172 SHGLASSYCQPYPFPHCDHHGGKGKKPPCSKYDFHTPKCNTTCT---DKAIPLIKYRGNH 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+Y + EE RE++ +GP + +Y+D YKTG+Y+HV+G LG HA+RI+GWG+
Sbjct: 229 SYEV-HGEEDYKRELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGGHAVRIVGWGKL 287
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT YW +ANS++T+WG NG F I+RG++ECGIE AG P I
Sbjct: 288 ---NGTP----YWKIANSWDTDWGMNGHFLILRGKDECGIEHQGYAGSPAI 331
>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 103/280 (36%), Positives = 155/280 (55%), Gaps = 28/280 (10%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+ T+ + + +GV P P + ++ +LP+ FDAR W C TI I D
Sbjct: 65 FANYTIEQFKHILGVKPTP--PGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNILD 122
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CG+ WA AVE++ DR CI V LS +DL++CC CG+GC GG+ AW+Y
Sbjct: 123 QGHCGACWAFAAVESLQDRFCI--HLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRY 180
Query: 178 WVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ +G+V+ + C PY + C+ H C+ P TP+C RKC+ V +
Sbjct: 181 FRRSGVVT-------EECDPYFDQTGCQ------HPGCEPAYP-TPKCHRKCKVENQV-W 225
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
+ + + AY + +N IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA
Sbjct: 226 KKNKHSSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHA 285
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+++IGWG GE YWL+AN +N WG +G F+I
Sbjct: 286 VKLIGWGTSDAGE------DYWLLANQWNRGWGGDGYFKI 319
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/171 (39%), Positives = 103/171 (60%), Gaps = 9/171 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + EP TP+C RKC+ V ++ + + AY + +N IM E+
Sbjct: 191 CDPYFDQTGCQHPGCEPAYPTPKCHRKCKVENQV-WKKNKHSSVNAYRVHSNPHDIMAEV 249
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG GE YWL+
Sbjct: 250 YKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGE------DYWLL 303
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEINLGKMM 512
AN +N WG +G F+I+RG+NECGIE D+TAG+P +N++ G +
Sbjct: 304 ANQWNRGWGGDGYFKIIRGKNECGIEEDVTAGMPSTKNMDRNNDVAFGTAI 354
>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
Length = 194
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 95/202 (47%), Positives = 128/202 (63%), Gaps = 10/202 (4%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA+ + AMSDRVCIAS G + V LS D+++CC CG GC+GG+ KAW+Y+ G+
Sbjct: 1 SCWAVSSAAAMSDRVCIASXGAKQVLLSDQDMLACCSWCGYGCEGGWPMKAWQYFXLEGV 60
Query: 184 VSGGTYASKQGCRPYEI-PCERY-MNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
V+GG Y + CRPYE PC R+ + C D+ TP+C + CQ GY Y++D +F
Sbjct: 61 VTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDSA-KTPKCQKTCQRGYLKPYKEDKHF 119
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G+ AY LP N + I R+I ++GPV +Y D YK+GIYKH AG G HA++IIGW
Sbjct: 120 GKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGW 179
Query: 302 GQEPLGEGTSSVVKYWLVANSF 323
G+E GT YWL+ANS+
Sbjct: 180 GKE---XGTP----YWLIANSW 194
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/130 (44%), Positives = 77/130 (59%), Gaps = 8/130 (6%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
CRPYE P C R+ + TP+C + CQ GY Y++D +FG+ AY LP N +
Sbjct: 72 CRPYEFPPCGRHGKEPYYGECYDSAKTPKCQKTCQRGYLKPYKEDKHFGKSAYRLPNNVK 131
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I R+I ++GPV +Y D YK+GIYKH AG G HA++IIGWG+E GT
Sbjct: 132 AIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWGKE---XGTP-- 186
Query: 456 VKYWLVANSF 465
YWL+ANS+
Sbjct: 187 --YWLIANSW 194
>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
Length = 353
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 104/282 (36%), Positives = 158/282 (56%), Gaps = 31/282 (10%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+ T+ + + +GV P +P+ + P +LP+ FDAR W C TI I D
Sbjct: 62 FANYTIEQFKHILGVKPTPPGLLAGVPIKIH---PEMDLPKEFDARTQWSSCSTIGNILD 118
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CG+ WA AVEA+ DR CI V LS +DL++CC CG+GC GG+ AW+Y
Sbjct: 119 QGHCGACWAFAAVEALQDRFCI--HLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRY 176
Query: 178 WVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ +G+V+ + C PY + C+ H C+ P TP+C RKC+ + ++
Sbjct: 177 FRRSGVVT-------EECDPYFDQTGCQ------HPGCEPAYP-TPKCQRKCKV-ENQAW 221
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI--YADMILYKTGIYKHVAGGPLGE 293
+++ +F AY + +N IM E++++GPVE + T D YK+G+YKH+ GG +G
Sbjct: 222 KENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGG 281
Query: 294 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 282 HAVKLIGWGTSDAGE------DYWLLANQWNRGWGDDGYFKI 317
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 66/173 (38%), Positives = 105/173 (60%), Gaps = 11/173 (6%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + EP TP+C RKC+ + +++++ +F AY + +N IM E+
Sbjct: 187 CDPYFDQTGCQHPGCEPAYPTPKCQRKCKV-ENQAWKENKHFSVNAYRVHSNPHDIMAEV 245
Query: 402 FRHGPVEGSMTI--YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYW 459
+++GPVE + T D YK+G+YKH+ GG +G HA+++IGWG GE YW
Sbjct: 246 YKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGE------DYW 299
Query: 460 LVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEINLGKMM 512
L+AN +N WG++G F+I+RG+NECGIE D+TAG+P +N++ G +
Sbjct: 300 LLANQWNRGWGDDGYFKIIRGENECGIEGDVTAGMPSTKNTARNNDVAFGTAI 352
>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 351
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 105/282 (37%), Positives = 156/282 (55%), Gaps = 32/282 (11%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+ T+++ + +GV P +P + +LP FDAR W C TI I D
Sbjct: 61 FANYTITQFKHILGVKPTPPALLAGVP--TKSYSRSMKLPTEFDARSQWSGCSTIGTILD 118
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CGS WA GAVE + DR CI ++ LS +DL++CC CG+GC GG+ AW+Y
Sbjct: 119 QGHCGSCWAFGAVECLQDRFCI--HLNMNISLSVNDLLACCGFLCGSGCNGGYPISAWRY 176
Query: 178 WVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEP--NTPECIRKCQPGYDV 233
+ G+V+ C PY ++ C+ H C EP TP+C +KC+ +V
Sbjct: 177 FRRKGVVT-------DECDPYFDQVGCK------HPGC---EPAYRTPKCEKKCKVQNEV 220
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 293
+++ +F AY + +N IM E++ +GPVE + T+Y D YK+G+YKH+ GG +G
Sbjct: 221 -WKEQKHFSVDAYRVHSNPHDIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGG 279
Query: 294 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 280 HAVKLIGWGTSDAGE------DYWLLANQWNRGWGDDGYFKI 315
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 97/154 (62%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + EP TP+C +KC+ +V +++ +F AY + +N IM E+
Sbjct: 187 CDPYFDQVGCKHPGCEPAYRTPKCEKKCKVQNEV-WKEQKHFSVDAYRVHSNPHDIMAEV 245
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+ +GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG GE YWL+
Sbjct: 246 YTNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGE------DYWLL 299
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N WG++G F+I+RG+NECGIE D+ AG+P
Sbjct: 300 ANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMP 333
>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 105/279 (37%), Positives = 151/279 (54%), Gaps = 26/279 (9%)
Query: 59 LSKLTLSELEMRMGV--HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T +E G S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFAEARRLTGAFRRKTSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +CGS WA+ A+SDR C G + +R+S+ L+SCCKDCG+GC GG+ AW+
Sbjct: 110 ADQSACGSCWAVSTASAISDRHCTVG-GVQQLRISAAHLLSCCKDCGDGCDGGYPDAAWR 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERYM-NGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V+ G+ S C+PY P C + G C + +TP+C C D +
Sbjct: 169 YYVSHGLAS-------SYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCT---DKA 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +Y L E+ RE++ +GP + +++D + YKTG+Y+HV+G LG H
Sbjct: 219 IPLIEYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGGH 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
A+RI+GWG+ GT YW +ANS++T+WG NG F
Sbjct: 279 AVRIVGWGKL---NGT----PYWKIANSWDTDWGMNGHF 310
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 66/171 (38%), Positives = 96/171 (56%), Gaps = 12/171 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERYM-NGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
+GL C+PY P C + G + C + +TP+C C D + G
Sbjct: 172 SHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCT---DKAIPLIEYRGND 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+Y L E+ RE++ +GP + +++D + YKTG+Y+HV+G LG HA+RI+GWG+
Sbjct: 229 SYVLLHGEDDFKRELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL 288
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT YW +ANS++T+WG NG F +RG NECGIE + AGLP I
Sbjct: 289 ---NGT----PYWKIANSWDTDWGMNGHFLFLRGNNECGIEFEGYAGLPAI 332
>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
Length = 198
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 91/201 (45%), Positives = 124/201 (61%), Gaps = 4/201 (1%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA+ AMSDR+CIAS+G V +S+ D+VSCC CG GC+GG+ +AWKY VT G+
Sbjct: 1 SCWAVSTAAAMSDRICIASKGATQVLISAQDIVSCCTWCGAGCEGGWPIEAWKYGVTEGV 60
Query: 184 VSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
V+GG + K+ CR YEI PC + N + TP C ++C+PGY SY D +G
Sbjct: 61 VTGGNFGRKECCRSYEIHPCGYHGNEPFYGHCHSMARTPPCKKRCRPGYKNSYMMDKRYG 120
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 302
AY LP + I R+I +GPV +Y D YK+GIY+H AG G HA+++IGWG
Sbjct: 121 TSAYELPNSVXAIQRDIMENGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGGHAVKVIGWG 180
Query: 303 QEPLGEGTSSVVKYWLVANSF 323
+E GT + YW++ANS+
Sbjct: 181 EEXTENGT---IPYWIIANSW 198
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 54/130 (41%), Positives = 74/130 (56%), Gaps = 4/130 (3%)
Query: 337 CRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
CR YEI PC + N + TP C ++C+PGY SY D +G AY LP +
Sbjct: 72 CRSYEIHPCGYHGNEPFYGHCHSMARTPPCKKRCRPGYKNSYMMDKRYGTSAYELPNSVX 131
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I R+I +GPV +Y D YK+GIY+H AG G HA+++IGWG+E GT
Sbjct: 132 AIQRDIMENGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGGHAVKVIGWGEEXTENGT--- 188
Query: 456 VKYWLVANSF 465
+ YW++ANS+
Sbjct: 189 IPYWIIANSW 198
>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 333
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 99/249 (39%), Positives = 137/249 (55%), Gaps = 17/249 (6%)
Query: 90 LSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
L P +P FDAR W C TI IRDQG+CGS WA A +DR+CIAS G +
Sbjct: 77 LYSPAGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQL 136
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNG 208
LS++ + SCC CG GCQGG+ +AW+Y+ G+V+GG + S +GC+PY PC
Sbjct: 137 LSAEHVTSCCYRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCT----- 191
Query: 209 SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF-GRIAYSLPANEETIMREIFRHGPVEG 267
++SC +C +KC +SY D + R Y L + + +I +GP+E
Sbjct: 192 GNNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLA--YDNMQNDIMTYGPIES 249
Query: 268 SMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
S +Y D I YK+G+Y K LG H+++ IGWG E V YWL+ NS+N
Sbjct: 250 SFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVE-------RNVSYWLMMNSWNNT 302
Query: 327 WGENGLFRI 335
WG+ G F+I
Sbjct: 303 WGDGGNFKI 311
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 84/164 (51%), Gaps = 17/164 (10%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNF-GRIAYSLPAN 393
GC+PY P C +SC +C +KC +SY D + R Y L
Sbjct: 181 GCQPYMFPPCT-----GNNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAY- 234
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ + +I +GP+E S +Y D I YK+G+Y K LG H+++ IGWG E
Sbjct: 235 -DNMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVE------ 287
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWL+ NS+N WG+ G F+I RG NEC +E TAG+P+
Sbjct: 288 -RNVSYWLMMNSWNNTWGDGGNFKIRRGTNECQVEDSSTAGMPE 330
>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
Length = 356
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 112/263 (42%), Positives = 153/263 (58%), Gaps = 38/263 (14%)
Query: 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
L +P FDAR NWP C +I+ +RDQ +CGS WA GA E +SDR+CI S GK +S++
Sbjct: 67 LPSIPTTFDARTNWPKCNSIKMVRDQSNCGSCWAFGAAEVISDRICIHSNGKEQPVISAE 126
Query: 154 DLVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHS 211
D+++CC K CGNGCQGG +A K+W T G V+GG Y GC+PY PC S
Sbjct: 127 DILTCCGKSCGNGCQGGQGLEAMKFWTTYGAVTGGDYKG-DGCKPYSFAPC--------S 177
Query: 212 SCQDNEPNTPECIRKCQPGYDVS-YEDDLNFGR---------------IAYSLPANEET- 254
+C +++ TP C KCQ Y V+ Y+ D ++G+ AY L +
Sbjct: 178 NCVESK-TTPSCQSKCQSTYTVTNYKGDKHYGKNEGKVTERHKHLECTSAYRLDTSSNAV 236
Query: 255 --IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 312
I EI+++GPVE + T+Y D YK+G+Y HV G G HA++IIGWG E
Sbjct: 237 PIIQNEIYQNGPVEVAYTVYDDFYHYKSGVYHHVTGKDTGGHAVKIIGWGTEKG------ 290
Query: 313 VVKYWLVANSFNTNWGENGLFRI 335
V YWLV NS+ T++G+ G F+I
Sbjct: 291 -VDYWLVTNSWGTSFGDKGFFKI 312
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 72/183 (39%), Positives = 99/183 (54%), Gaps = 36/183 (19%)
Query: 336 GCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YEDDLNFGR-------- 385
GC+PY PC S+C TP C KCQ Y V+ Y+ D ++G+
Sbjct: 167 GCKPYSFAPC--------SNC-VESKTTPSCQSKCQSTYTVTNYKGDKHYGKNEGKVTER 217
Query: 386 -------IAYSLPANEET---IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 435
AY L + I EI+++GPVE + T+Y D YK+G+Y HV G G
Sbjct: 218 HKHLECTSAYRLDTSSNAVPIIQNEIYQNGPVEVAYTVYDDFYHYKSGVYHHVTGKDTGG 277
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
HA++IIGWG E V YWLV NS+ T++G+ G F+I RG NECGIE+++ AG+
Sbjct: 278 HAVKIIGWGTEKG-------VDYWLVTNSWGTSFGDKGFFKIRRGTNECGIESNVVAGMA 330
Query: 496 KIG 498
K+G
Sbjct: 331 KVG 333
>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 339
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 100/243 (41%), Positives = 139/243 (57%), Gaps = 27/243 (11%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP+ FDAR +W +C TI I DQG CGS WA GA E+++DR CI V LS +DL+
Sbjct: 95 LPKEFDARKHWGHCSTIGAILDQGHCGSCWAFGAAESLTDRFCI--HMNESVSLSENDLL 152
Query: 157 SCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSC 213
+CC +CG+GC GG+ +AW+Y+ TG+V+ C PY +I C H C
Sbjct: 153 ACCGFECGDGCDGGYPIRAWRYFKRTGVVT-------SKCDPYFDQIGC------GHPGC 199
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
TP+C++ C D + + AY + E +M E++ +GP+E S ++
Sbjct: 200 YPTY-RTPKCVKHCVD--DELWVKSKHLSVNAYEVSKEPEDLMAELYTNGPIEVSFEVFE 256
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D YKTG+YKHV G +G HA+++IGWG T V YW + NS+NTNWGE+GLF
Sbjct: 257 DFAHYKTGVYKHVYGRYIGGHAVKLIGWGT------TDDGVDYWTIVNSWNTNWGEHGLF 310
Query: 334 RIG 336
RI
Sbjct: 311 RIA 313
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 82/134 (61%), Gaps = 8/134 (5%)
Query: 362 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 421
TP+C++ C D + + AY + E +M E++ +GP+E S ++ D YK
Sbjct: 205 TPKCVKHCVD--DELWVKSKHLSVNAYEVSKEPEDLMAELYTNGPIEVSFEVFEDFAHYK 262
Query: 422 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ 481
TG+YKHV G +G HA+++IGWG T V YW + NS+NTNWGE+GLFRI RG
Sbjct: 263 TGVYKHVYGRYIGGHAVKLIGWGT------TDDGVDYWTIVNSWNTNWGEHGLFRIARGG 316
Query: 482 NECGIEADITAGLP 495
NECGIE+ AGLP
Sbjct: 317 NECGIESYAVAGLP 330
>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
Length = 374
Score = 181 bits (460), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 102/276 (36%), Positives = 141/276 (51%), Gaps = 48/276 (17%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FDAR WP C +I I D C S WA A E+MSDR+CI S G + LS+ +L+SCC
Sbjct: 85 FDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCCT 144
Query: 161 ---DCGNG------------------------------------CQGGFHGKAWKYWVTT 181
CG G C GG KAW+YW
Sbjct: 145 GVFSCGEGDSEHWQFRNSKFRKPRCQKFNKEILEARRNLETREKCAGGNVFKAWQYWQKH 204
Query: 182 GIVSGGTYASKQGCRPYEI-PCERYM-NGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
G+ +GG+Y S+ GC+PY I PC+ + N + C ++ TP C +KC+ GY V + D
Sbjct: 205 GLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSCEKKCKSGYPVELDKDR 264
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
++G LP + I ++ +GP+ +M +Y D + Y TGIY H+ G G ++RI+
Sbjct: 265 HYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGIYVHLTGNKQGHLSVRIL 324
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GWG + EG V YWL+ANS+ WGENG FR+
Sbjct: 325 GWG---MYEG----VPYWLLANSWGKQWGENGTFRV 353
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 67/167 (40%), Positives = 101/167 (60%), Gaps = 9/167 (5%)
Query: 334 RIGCRPYEI-PCERYM-NGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
+ GC+PY I PC+ + N + C + TP C +KC+ GY V + D ++G LP
Sbjct: 215 QFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSCEKKCKSGYPVELDKDRHYGVSVDQLP 274
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+ I ++ +GP+ +M +Y D + Y TGIY H+ G G ++RI+GWG + EG
Sbjct: 275 NRQIEIQSDVMLNGPISATMEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWG---MYEG 331
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
V YWL+ANS+ WGENG FR++RG NECG+EA+ +G+P++G
Sbjct: 332 ----VPYWLLANSWGKQWGENGTFRVLRGVNECGLEANCVSGMPRLG 374
>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
marinkellei]
Length = 333
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 105/247 (42%), Positives = 136/247 (55%), Gaps = 30/247 (12%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
L + FDA WP CPTI EIRDQ SCGS WA+ A AMSDR C G R +R+S+ DL
Sbjct: 91 RLEDKFDAAEAWPNCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLG-GVRDLRISAGDL 149
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQ 214
+SCC CG GC GGF AW ++V G+VS + C+PY P C ++N S +
Sbjct: 150 MSCCDVCGYGCNGGFPEVAWVFYVVHGLVS-------EYCQPYPFPSCAHHVNSSDLAPC 202
Query: 215 DNEPNTPECIRKCQPGYD--VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
+ TP+C C + Y G +Y L + EE RE+ +GP E + +Y
Sbjct: 203 SGDYKTPKCNSTCTEKKIPLIRYR-----GNHSYVL-SGEEHFKRELLLNGPFEVAFEVY 256
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ---EPLGEGTSSVVKYWLVANSFNTNWGE 329
AD + Y G+YKHVAG LG HA+R++GWG+ EP YW +ANS+N WG
Sbjct: 257 ADFMAYTGGVYKHVAGDLLGGHAVRLVGWGELNGEP----------YWKIANSWNHEWGM 306
Query: 330 NGLFRIG 336
NG F I
Sbjct: 307 NGYFLIA 313
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 69/174 (39%), Positives = 95/174 (54%), Gaps = 22/174 (12%)
Query: 330 NGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD--VSYEDDLNFGRI 386
+GL C+PY P C ++N S + + + TP+C C + Y G
Sbjct: 175 HGLVSEYCQPYPFPSCAHHVNSSDLAPCSGDYKTPKCNSTCTEKKIPLIRYR-----GNH 229
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ- 445
+Y L + EE RE+ +GP E + +YAD + Y G+YKHVAG LG HA+R++GWG+
Sbjct: 230 SYVL-SGEEHFKRELLLNGPFEVAFEVYADFMAYTGGVYKHVAGDLLGGHAVRLVGWGEL 288
Query: 446 --EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
EP YW +ANS+N WG NG F I RG NECGIE++ AG P+I
Sbjct: 289 NGEP----------YWKIANSWNHEWGMNGYFLIARGVNECGIESNGVAGTPRI 332
>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
Length = 569
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 104/268 (38%), Positives = 151/268 (56%), Gaps = 23/268 (8%)
Query: 84 LPLLV-QLSDPLEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIA 141
+PL + + E +P FDAR +P C + +RDQG CGS WA + EA +DR+CI
Sbjct: 260 MPLPAKEFENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIR 319
Query: 142 SRGKRHVRLSSDDLVSCCK--DCGN-GCQGGFHGKAWKYWVTTGIVSGGTY-ASKQG--C 195
S+GKR + LS+ SCC C + GC GG G AW+++ G+V+GG + A +G C
Sbjct: 320 SQGKRLMPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTC 379
Query: 196 RPYEIP-CERYMNGSHSSCQDN--EPNTPECIRKCQPGYDVS----YEDDLNFGRIAYSL 248
PYE+P C + C TP+C + C+ ++ D + AYSL
Sbjct: 380 WPYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSL 439
Query: 249 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 308
+ ++ + R++ HGPV G+ +Y D + YK+G+YKHV+G P+G HAI+IIGWG E GE
Sbjct: 440 RSRDD-VKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTEN-GE 497
Query: 309 GTSSVVKYWLVANSFNTNWGENGLFRIG 336
+YW NS+NT WG+ G F+I
Sbjct: 498 ------EYWHAVNSWNTYWGDGGQFKIA 519
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 93/164 (56%), Gaps = 17/164 (10%)
Query: 337 CRPYEIP-CERYMNGSRSSCQAN--EPNTPECIRKCQPGYDVS----YEDDLNFGRIAYS 389
C PYE+P C + C A TP+C + C+ ++ D + AYS
Sbjct: 379 CWPYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYS 438
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
L + ++ + R++ HGPV G+ +Y D + YK+G+YKHV+G P+G HAI+IIGWG E G
Sbjct: 439 LRSRDD-VKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTEN-G 496
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
E +YW NS+NT WG+ G F+I GQ CGI+ ++ AG
Sbjct: 497 E------EYWHAVNSWNTYWGDGGQFKIAMGQ--CGIDGEMVAG 532
>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
Length = 569
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 104/268 (38%), Positives = 151/268 (56%), Gaps = 23/268 (8%)
Query: 84 LPLLV-QLSDPLEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIA 141
+PL + + E +P FDAR +P C + +RDQG CGS WA + EA +DR+CI
Sbjct: 260 MPLPAKEFENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIR 319
Query: 142 SRGKRHVRLSSDDLVSCCK--DCGN-GCQGGFHGKAWKYWVTTGIVSGGTY-ASKQG--C 195
S+GKR + LS+ SCC C + GC GG G AW+++ G+V+GG + A +G C
Sbjct: 320 SQGKRLMPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTC 379
Query: 196 RPYEIP-CERYMNGSHSSCQDN--EPNTPECIRKCQPGYDVS----YEDDLNFGRIAYSL 248
PYE+P C + C TP+C + C+ ++ D + AYSL
Sbjct: 380 WPYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSL 439
Query: 249 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 308
+ ++ + R++ HGPV G+ +Y D + YK+G+YKHV+G P+G HAI+IIGWG E GE
Sbjct: 440 RSRDD-VKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTEN-GE 497
Query: 309 GTSSVVKYWLVANSFNTNWGENGLFRIG 336
+YW NS+NT WG+ G F+I
Sbjct: 498 ------EYWHAVNSWNTYWGDGGQFKIA 519
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 93/164 (56%), Gaps = 17/164 (10%)
Query: 337 CRPYEIP-CERYMNGSRSSCQAN--EPNTPECIRKCQPGYDVS----YEDDLNFGRIAYS 389
C PYE+P C + C A TP+C + C+ ++ D + AYS
Sbjct: 379 CWPYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYS 438
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
L + ++ + R++ HGPV G+ +Y D + YK+G+YKHV+G P+G HAI+IIGWG E G
Sbjct: 439 LRSRDD-VKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTEN-G 496
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
E +YW NS+NT WG+ G F+I GQ CGI+ ++ AG
Sbjct: 497 E------EYWHAVNSWNTYWGDGGQFKIAMGQ--CGIDGEMVAG 532
>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
Length = 332
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 144/258 (55%), Gaps = 27/258 (10%)
Query: 88 VQLSDPLEE---LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
++ DPL E E FD+R NW C I IRDQG+CGS WA G A +DR+C+++ G
Sbjct: 73 IKTYDPLYEENASVEQFDSRENWKSCKQIGRIRDQGNCGSCWAFGTTGAFADRLCVSTGG 132
Query: 145 KRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCER 204
K + LS +D+ CC++CG GC+GG+ KAW+Y+ T G+ +GG Y SK+GC PY+IP
Sbjct: 133 KFNELLSPEDVAFCCQNCGKGCEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCF 192
Query: 205 YMNGSHSSCQDNEPNTPECIRKC------QPGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
G ++ +C + C Q Y V E LN + T+ ++
Sbjct: 193 DQKGKNTCAGKPLERNHQCPKTCYGSTTVQKRYKVKNEYVLN----------SPNTMEQD 242
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYW 317
+ ++GP+E S ++ D+ YK+GIY+ L H+I+IIGWG+E + V YW
Sbjct: 243 LIKYGPIEASFNLFDDLSAYKSGIYQKTPKAKFLSGHSIKIIGWGKE-------NGVPYW 295
Query: 318 LVANSFNTNWGENGLFRI 335
L NS++ WGE G FRI
Sbjct: 296 LAVNSWSKFWGEQGTFRI 313
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 86/167 (51%), Gaps = 24/167 (14%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKC------QPGYDVSYEDDLNFGRIAYS 389
GC PY+IP G + +C + C Q Y V E LN
Sbjct: 182 GCAPYKIPPCFDQKGKNTCAGKPLERNHQCPKTCYGSTTVQKRYKVKNEYVLN------- 234
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPL 448
+ T+ +++ ++GP+E S ++ D+ YK+GIY+ L H+I+IIGWG+E
Sbjct: 235 ---SPNTMEQDLIKYGPIEASFNLFDDLSAYKSGIYQKTPKAKFLSGHSIKIIGWGKE-- 289
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ V YWL NS++ WGE G FRI++G+NECGIE TAG+P
Sbjct: 290 -----NGVPYWLAVNSWSKFWGEQGTFRIIKGRNECGIERSATAGIP 331
>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
Length = 323
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 97/245 (39%), Positives = 141/245 (57%), Gaps = 11/245 (4%)
Query: 96 ELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
++P FDAR + C I +++DQG+C S WA+ +DR+CIAS G+ LS+ +
Sbjct: 63 DIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQN 122
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE-IPCERYMNGSHSSC 213
L+SC GC GG KAW+ + GIV+GG + S +GC+PY+ PC+ Y + ++C
Sbjct: 123 LMSCGDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNC 182
Query: 214 QD-NEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMT 270
C +KC Y V YEDDL+ I Y N + I +EI +GPV M
Sbjct: 183 SSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMY 242
Query: 271 IYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
+Y + + YK GIYK G +G H +++IGWG + G+GT +YWL NS+N+NWG +
Sbjct: 243 VYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVD--GDGT----EYWLAMNSWNSNWGND 296
Query: 331 GLFRI 335
GLF+I
Sbjct: 297 GLFKI 301
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 73/198 (36%), Positives = 106/198 (53%), Gaps = 10/198 (5%)
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYE-IPCERYMNGSRSSCQA-NEPN 361
E +G S K W + + G N GC+PY+ PC+ Y + ++C +
Sbjct: 130 EKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNCSSLRRTQ 189
Query: 362 TPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMTIYADMIL 419
C +KC Y V YEDDL+ I Y N + I +EI +GPV M +Y + +
Sbjct: 190 MTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMYVYENFMG 249
Query: 420 YKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVR 479
YK GIYK G +G H +++IGWG + G+GT +YWL NS+N+NWG +GLF+I+R
Sbjct: 250 YKEGIYKSTTGELIGYHHVKLIGWGVD--GDGT----EYWLAMNSWNSNWGNDGLFKILR 303
Query: 480 GQNECGIEADITAGLPKI 497
G N C IE + AG+ +
Sbjct: 304 GYNFCSIELLVMAGIVDV 321
>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
Length = 396
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/258 (38%), Positives = 145/258 (56%), Gaps = 24/258 (9%)
Query: 91 SDPLEELPEGFDARINWPYCPT-IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
+D L++LP F+A + C + I IRDQ +CGS WA EA +DR+CI S G
Sbjct: 133 ADELKDLPVSFNATEEFKECSSVIGHIRDQSACGSCWAFAPTEAFNDRLCIKSAGNFTSL 192
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQ------GCRPYEIP-C 202
LS ++ +C K +GC GG AW++ TTG+V+GG Y++++ GC PY+IP C
Sbjct: 193 LSPGNVAACSK--TSGCHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPYDIPPC 250
Query: 203 ERYMNGS-HSSCQDNEPNTPECIRKC-QPGYDVSYEDDLNF-GRIAYSLPANEETIMREI 259
Y N + + C + + P C C YD E D +F + S + + I +EI
Sbjct: 251 AHYTNSTLYPKCPKTKYDFPTCQESCPNKKYDTPMEKDRHFVEEESLSALRSIDAIKKEI 310
Query: 260 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 319
+GPV S +Y D + YK+G+YK + LG HA++IIGWG++ YWLV
Sbjct: 311 MTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGWGED-----------YWLV 359
Query: 320 ANSFNTNWGENGLFRIGC 337
NS+N NWG+NG+F+IGC
Sbjct: 360 VNSWNKNWGDNGMFKIGC 377
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 90/164 (54%), Gaps = 17/164 (10%)
Query: 336 GCRPYEIP-CERYMNGS-RSSCQANEPNTPECIRKC-QPGYDVSYEDDLNF-GRIAYSLP 391
GC PY+IP C Y N + C + + P C C YD E D +F + S
Sbjct: 241 GCWPYDIPPCAHYTNSTLYPKCPKTKYDFPTCQESCPNKKYDTPMEKDRHFVEEESLSAL 300
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+ + I +EI +GPV S +Y D + YK+G+YK + LG HA++IIGWG++
Sbjct: 301 RSIDAIKKEIMTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGWGED----- 355
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWLV NS+N NWG+NG+F+I GQ CGIE ++ AG P
Sbjct: 356 ------YWLVVNSWNKNWGDNGMFKIGCGQ--CGIEDNVLAGTP 391
>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 106/281 (37%), Positives = 150/281 (53%), Gaps = 26/281 (9%)
Query: 59 LSKLTLSELEMRMGV--HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T +E G S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFAEARRLTGAFRRKTSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +CGS WA+ A+SDR C G + +R+S+ L+SCC+DCG+GC+GG AW+
Sbjct: 110 ADQSACGSCWAVSTASAISDRYCTVG-GVQQLRISAAHLMSCCEDCGDGCKGGAPDSAWE 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERYM-NGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V+ G+ S C+PY P C + G C +TP+C C D +
Sbjct: 169 YYVSHGLAS-------SYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCT---DKA 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +Y L E+ RE++ +GP +Y+D + YKTG+Y+HV+G LG H
Sbjct: 219 IPLIKYRGNNSYMLLNGEDDYKRELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGGH 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+RI+GWG+ GT YW +ANS++T+WG NG F I
Sbjct: 279 AVRIVGWGKL---NGT----PYWKIANSWDTDWGMNGHFLI 312
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 95/171 (55%), Gaps = 12/171 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERYM-NGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
+GL C+PY P C + G + C +TP+C C D + G
Sbjct: 172 SHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCT---DKAIPLIKYRGNN 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+Y L E+ RE++ +GP +Y+D + YKTG+Y+HV+G LG HA+RI+GWG+
Sbjct: 229 SYMLLNGEDDYKRELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKL 288
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT YW +ANS++T+WG NG F I+RG NECGIE+ AGLP I
Sbjct: 289 ---NGT----PYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLPAI 332
>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
Length = 357
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 106/292 (36%), Positives = 163/292 (55%), Gaps = 30/292 (10%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + A + + T++E + +GV K +P++ D +LP+ FDAR
Sbjct: 55 PNAGWKAAFNDRFANATVAEFKRLLGVIQTPKTAYLGVPIVRH--DLSLKLPKEFDARTA 112
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNG 165
W +C +I+ I G CGS WA GAVE++SDR CI + +V LS++D+++CC CG G
Sbjct: 113 WSHCTSIRRIL--GHCGSCWAFGAVESLSDRFCI--KYNLNVSLSANDVIACCGLLCGFG 168
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPEC 223
C GGF AW Y+ G+V+ Q C PY C SH C+ P TP+C
Sbjct: 169 CNGGFPMGAWLYFKYHGVVT-------QECDPYFDNTGC------SHPGCEPTYP-TPKC 214
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
RKC + + + ++G AY + + + IM E++++GPVE + T+Y D YK+G+Y
Sbjct: 215 ERKCVSRNQL-WGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY 273
Query: 284 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
K++ G +G HA+++IGWG GE YWL+AN +N +WG++G F+I
Sbjct: 274 KYITGTKIGGHAVKLIGWGTSDDGE------DYWLLANQWNRSWGDDGYFKI 319
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 95/154 (61%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C RKC + + + ++G AY + + + IM E+
Sbjct: 191 CDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQL-WGESKHYGVGAYRINPDPQDIMAEV 249
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YK++ G +G HA+++IGWG GE YWL+
Sbjct: 250 YKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGE------DYWLL 303
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N +WG++G F+I RG NECGIE + AGLP
Sbjct: 304 ANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLP 337
>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 278
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 100/253 (39%), Positives = 138/253 (54%), Gaps = 19/253 (7%)
Query: 94 LEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
L++LP FDAR +P C I IRDQ +CGS WA G EA +DR+CI S G LS+
Sbjct: 18 LQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSHGTFTELLSA 77
Query: 153 DDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQ------GCRPYEIP-CERY 205
++ +C +GC GGF AW + GI +GG Y ++ GC PY+ P C +
Sbjct: 78 GEMNACAP--SHGCNGGFPNSAWSWVHDKGIATGGDYVAEDDMTKDDGCWPYDFPPCAHH 135
Query: 206 MNGS-HSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
+N S + C + TP C +C P Y + DD +F + + I G
Sbjct: 136 VNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFMVESSPYQYSVNDAKNAIRTDG 195
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
PV S T+Y D + YK+G+YKH +G LG HA++IIGWG+E S YWLV NS+
Sbjct: 196 PVSASFTVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEE-------SGQAYWLVVNSW 248
Query: 324 NTNWGENGLFRIG 336
N +WG++GLF+I
Sbjct: 249 NEDWGDHGLFKIA 261
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 64/165 (38%), Positives = 90/165 (54%), Gaps = 12/165 (7%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPA 392
GC PY+ P C ++N S+ C + TP C +C P Y + DD +F +
Sbjct: 123 GCWPYDFPPCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFMVESSPYQY 182
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ I GPV S T+Y D + YK+G+YKH +G LG HA++IIGWG+E
Sbjct: 183 SVNDAKNAIRTDGPVSASFTVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEE------ 236
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
S YWLV NS+N +WG++GLF+I G CGI+ + G PK+
Sbjct: 237 -SGQAYWLVVNSWNEDWGDHGLFKIALGN--CGIDDYLLGGTPKV 278
>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 210
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 95/217 (43%), Positives = 131/217 (60%), Gaps = 14/217 (6%)
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA A SDR+CIA+ G LS++ L +CC CGNGC GG AW +++
Sbjct: 1 CGSCWAASAASVFSDRLCIATGGAVARNLSAEQLNTCCYRCGNGCDGGSPEAAWYFFMRH 60
Query: 182 GIVSGGTYASKQGCRPYEIPCERYMNGS-HSSCQDNEPNTPEC-IRKC-QPGYDVSYEDD 238
GIV+GG Y S GC+PY I Y G ++C D++ +TP+C IR C Y Y D
Sbjct: 61 GIVTGGDYESGDGCQPYSI----YPRGKGRNTCIDDDIDTPDCSIRTCTNSNYTKGYRAD 116
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
L++ YSL +EE IM +I+++GPV+ + +Y D + YK+G+Y + G G HAI+I
Sbjct: 117 LHYVDTVYSLSRSEEDIMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKI 176
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GWG + KYWL ANS++ +WGENGLFRI
Sbjct: 177 LGWGVD-------DNTKYWLCANSWSRSWGENGLFRI 206
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 91/149 (61%), Gaps = 14/149 (9%)
Query: 336 GCRPYEIPCERYMNGS-RSSCQANEPNTPEC-IRKC-QPGYDVSYEDDLNFGRIAYSLPA 392
GC+PY I Y G R++C ++ +TP+C IR C Y Y DL++ YSL
Sbjct: 73 GCQPYSI----YPRGKGRNTCIDDDIDTPDCSIRTCTNSNYTKGYRADLHYVDTVYSLSR 128
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+EE IM +I+++GPV+ + +Y D + YK+G+Y + G G HAI+I+GWG +
Sbjct: 129 SEEDIMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGVD------ 182
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQ 481
KYWL ANS++ +WGENGLFRI+RG
Sbjct: 183 -DNTKYWLCANSWSRSWGENGLFRILRGN 210
>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
Length = 512
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 101/265 (38%), Positives = 148/265 (55%), Gaps = 22/265 (8%)
Query: 85 PLLVQLSDPLEELPE--GFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIA 141
PL V++ +++ E FDAR +P C I +RDQG CGS WA + EA++DR CI
Sbjct: 223 PLPVKVFAETQQVLETDKFDAREAFPQCAEVIGHVRDQGDCGSCWAFASTEALNDRFCIK 282
Query: 142 SRGKRHVRLSSDDLVSCCK--DCGN-GCQGGFHGKAWKYWVTTGIVSGGTY---ASKQGC 195
S G+ LS SCC C + GC GG AW+++ G+V+GG Y + + C
Sbjct: 283 SGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSC 342
Query: 196 RPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS----YEDDLNFGRIAYSLPA 250
PYEIP C + G + C+ P P+C + C+ S ++DDL+F AYS+
Sbjct: 343 WPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVEG 402
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
++ I RE+ +G + G+ +Y D +LYK G+Y HV G P+G HA+++IG+G E +G
Sbjct: 403 RDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNE---DGR 458
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
YWL NS+N WG+ G F+I
Sbjct: 459 D----YWLAVNSWNEYWGDKGTFKI 479
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 93/166 (56%), Gaps = 15/166 (9%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVS----YEDDLNFGRIAYSLP 391
C PYEIP C + G C+ P P+C + C+ S ++DDL+F AYS+
Sbjct: 342 CWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVE 401
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
++ I RE+ +G + G+ +Y D +LYK G+Y HV G P+G HA+++IG+G E +G
Sbjct: 402 GRDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNE---DG 457
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL NS+N WG+ G F+I G E GI+ + G PK+
Sbjct: 458 RD----YWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKV 497
>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
Length = 334
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 96/253 (37%), Positives = 146/253 (57%), Gaps = 17/253 (6%)
Query: 88 VQLSDPL---EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
++ DPL + P+ FD+R NW C I IRDQG+CGS W+ A +DR+C+++ G
Sbjct: 73 IKKYDPLYVENDSPQQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGG 132
Query: 145 KRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCE 203
K + LS ++L CCKDCGNGC+GG+ KAW+Y+ T G+ +GG Y +K+GC+PY++ PC
Sbjct: 133 KFNELLSPEELAFCCKDCGNGCEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPCY 192
Query: 204 RYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
+ + E N +C + C D + + + + +TI ++I +G
Sbjct: 193 NKQGKNTCGGKPMERNH-QCPKTCYG----KTTDQKRYKTKSEYVINSIKTIEQDIKTYG 247
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGE-HAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
PVE S +Y D +YK+GIY+ H+++IIGWGQE GT YWL NS
Sbjct: 248 PVEASFDVYDDFSVYKSGIYRKTPNAKYQNGHSVKIIGWGQE---NGTP----YWLAVNS 300
Query: 323 FNTNWGENGLFRI 335
++ WG++G F+I
Sbjct: 301 WSKFWGDHGTFKI 313
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 97/185 (52%), Gaps = 16/185 (8%)
Query: 314 VKYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANE-PNTPECIRKCQP 371
+K W + G + + GC+PY++ PC Y +++C +C + C
Sbjct: 160 IKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPC--YNKQGKNTCGGKPMERNHQCPKTCYG 217
Query: 372 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
D + + + + +TI ++I +GPVE S +Y D +YK+GIY+
Sbjct: 218 ----KTTDQKRYKTKSEYVINSIKTIEQDIKTYGPVEASFDVYDDFSVYKSGIYRKTPNA 273
Query: 432 PLGE-HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADI 490
H+++IIGWGQE GT YWL NS++ WG++G F+I++G+NECGIE +
Sbjct: 274 KYQNGHSVKIIGWGQE---NGTP----YWLAVNSWSKFWGDHGTFKIIKGKNECGIERAV 326
Query: 491 TAGLP 495
TAG+P
Sbjct: 327 TAGIP 331
>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
Length = 182
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 84/162 (51%), Positives = 114/162 (70%), Gaps = 8/162 (4%)
Query: 335 IGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
+GC PYEI PCE ++NG+R C+ TP C++KC+ GY V Y DL+ G+ AYS+ +
Sbjct: 27 MGCIPYEIAPCEHHVNGTRGPCKEG-GKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRND 85
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+ I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG LG HAIRI+GWG + +
Sbjct: 86 VDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ------N 139
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YWLVANS+NT+WG +G F+I+RG +ECGIE I AGLP
Sbjct: 140 GEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 181
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 81/160 (50%), Positives = 109/160 (68%), Gaps = 8/160 (5%)
Query: 177 YWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
Y T GIVSGG Y S GC PYEI PCE ++NG+ C++ TP C++KC+ GY V Y
Sbjct: 11 YCKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPTCVKKCEEGYKVPY 69
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
DL+ G+ AYS+ + + I +EI+ +GPVEG+ T+Y D I Y+ G+YKHVAG LG HA
Sbjct: 70 AQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHA 129
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IRI+GWG + + + YWLVANS+NT+WG +G F+I
Sbjct: 130 IRILGWGVQ------NGEIPYWLVANSWNTDWGSDGFFKI 163
>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
Length = 512
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 101/265 (38%), Positives = 148/265 (55%), Gaps = 22/265 (8%)
Query: 85 PLLVQLSDPLEELPE--GFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIA 141
PL V++ +++ E FDAR +P C I +RDQG CGS WA + EA++DR CI
Sbjct: 223 PLPVKVFAETQQVLETDKFDAREAFPQCAEVIGHVRDQGDCGSCWAFASTEALNDRFCIK 282
Query: 142 SRGKRHVRLSSDDLVSCCK--DCGN-GCQGGFHGKAWKYWVTTGIVSGGTY---ASKQGC 195
S G+ LS SCC C + GC GG AW+++ G+V+GG Y + + C
Sbjct: 283 SGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSC 342
Query: 196 RPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS----YEDDLNFGRIAYSLPA 250
PYEIP C + G + C+ P P+C + C+ S ++DDL+F AYS+
Sbjct: 343 WPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVEG 402
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
++ I RE+ +G + G+ +Y D +LYK G+Y HV G P+G HA+++IG+G E +G
Sbjct: 403 RDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNE---DGR 458
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
YWL NS+N WG+ G F+I
Sbjct: 459 D----YWLAVNSWNEYWGDKGTFKI 479
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 93/166 (56%), Gaps = 15/166 (9%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVS----YEDDLNFGRIAYSLP 391
C PYEIP C + G C+ P P+C + C+ S ++DDL+F AYS+
Sbjct: 342 CWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVE 401
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
++ I RE+ +G + G+ +Y D +LYK G+Y HV G P+G HA+++IG+G E +G
Sbjct: 402 GRDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNE---DG 457
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL NS+N WG+ G F+I G E GI+ + G PK+
Sbjct: 458 RD----YWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKV 497
>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
Length = 572
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 103/268 (38%), Positives = 150/268 (55%), Gaps = 23/268 (8%)
Query: 84 LPLLV-QLSDPLEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIA 141
+PL + + E +P FDAR +P C + +RDQG CGS WA + EA +DR+CI
Sbjct: 263 MPLPAKEFENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIR 322
Query: 142 SRGKRHVRLSSDDLVSCCK--DCGN-GCQGGFHGKAWKYWVTTGIVSGGTY-ASKQG--C 195
S+GK + LS+ SCC C + GC GG G AW+++ G+V+GG + A +G C
Sbjct: 323 SQGKGLMPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTC 382
Query: 196 RPYEIP-CERYMNGSHSSCQDN--EPNTPECIRKCQPGYDVS----YEDDLNFGRIAYSL 248
PYE+P C + C TP+C + C+ ++ D + AYSL
Sbjct: 383 WPYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSL 442
Query: 249 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 308
+ ++ + R++ HGPV G+ +Y D + YK+G+YKHV+G P+G HAI+IIGWG E GE
Sbjct: 443 RSRDD-VKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTEN-GE 500
Query: 309 GTSSVVKYWLVANSFNTNWGENGLFRIG 336
+YW NS+NT WG+ G F+I
Sbjct: 501 ------EYWHAVNSWNTYWGDGGQFKIA 522
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 93/164 (56%), Gaps = 17/164 (10%)
Query: 337 CRPYEIP-CERYMNGSRSSCQAN--EPNTPECIRKCQPGYDVS----YEDDLNFGRIAYS 389
C PYE+P C + C A TP+C + C+ ++ D + AYS
Sbjct: 382 CWPYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYS 441
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
L + ++ + R++ HGPV G+ +Y D + YK+G+YKHV+G P+G HAI+IIGWG E G
Sbjct: 442 LRSRDD-VKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTEN-G 499
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
E +YW NS+NT WG+ G F+I GQ CGI+ ++ AG
Sbjct: 500 E------EYWHAVNSWNTYWGDGGQFKIAMGQ--CGIDGEMVAG 535
>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
Length = 350
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 108/285 (37%), Positives = 160/285 (56%), Gaps = 29/285 (10%)
Query: 59 LSKLTLSELEMRMGVHPD-SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIR 117
K+ +S L G P SKLPQ R+ ++ LPE FD WP P +EIR
Sbjct: 60 FHKMXISYLRRPCGTFPGRSKLPQ-RVKFAXDIN-----LPESFDPXEQWPDXPX-REIR 112
Query: 118 DQGSCGSGWALGAVEAMSDRVCI-----ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHG 172
DQGS G WALGA+EA+SD +CI ++G HV +S++D ++C CG+GC GG
Sbjct: 113 DQGSYGFCWALGALEAISDWICIHPNVGGAQGGNHVEVSAEDKLTCL--CGDGCNGGXPN 170
Query: 173 KAWKYWVTTGIVSGGTYASKQGCR--PYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
+ W +W G+VSGG Y S GCR P +PC+ +++G ++P+C C+PG
Sbjct: 171 EGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKHHIHGXPYV---XTGDSPKCSMTCEPG 227
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
+Y+ D ++G +YS+ + + IM I+++ VE + ++Y D ++YK Y+ V G
Sbjct: 228 Q--TYKXDKHYGCSSYSISDSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEM 285
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HAI I+G E + YWLVAN +N +WG+NG F+I
Sbjct: 286 XGGHAICILGCKVE-------NSTSYWLVANXWNRDWGDNGFFKI 323
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 96/164 (58%), Gaps = 14/164 (8%)
Query: 334 RIGCR--PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
+GCR P +PC+ +++G ++P+C C+PG +Y+ D ++G +YS+
Sbjct: 190 HVGCRLFPSLLPCKHHIHGXP---YVXTGDSPKCSMTCEPGQ--TYKXDKHYGCSSYSIS 244
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+ + IM I+++ VE + ++Y D ++YK Y+ V G G HAI I+G E
Sbjct: 245 DSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEMXGGHAICILGCKVE----- 299
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YWLVAN +N +WG+NG F+I+RGQ+ GIE+++ A +P
Sbjct: 300 --NSTSYWLVANXWNRDWGDNGFFKILRGQDHYGIESEVVAEIP 341
>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
Length = 261
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 138/241 (57%), Gaps = 11/241 (4%)
Query: 96 ELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
++P+ FDAR + C I +++DQG+C S WA+ +DR+CIAS GK LS+ +
Sbjct: 27 DIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGKFTDNLSAQN 86
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE-IPCERYMNGSHSSC 213
L+SC D GC GG KAW++ + GIV+GG Y S +GC+PY+ PC+ Y + S ++C
Sbjct: 87 LMSCGDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYGDSSLTNC 146
Query: 214 QD-NEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMT 270
C KC Y V YEDDL + Y N + I +EI +GPV M
Sbjct: 147 SSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSWTNVKQIQQEIMTYGPVTAFMY 206
Query: 271 IYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
+Y + + YK G+YK AG +G H +++IGWG + G ++YWL NS+N+NWG N
Sbjct: 207 VYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAG------IEYWLAMNSWNSNWGTN 260
Query: 331 G 331
G
Sbjct: 261 G 261
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 78/142 (54%), Gaps = 10/142 (7%)
Query: 336 GCRPYE-IPCERYMNGSRSSCQA-NEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP- 391
GC+PY+ PC+ Y + S ++C + C KC Y V YEDDL + Y
Sbjct: 126 GCQPYKNRPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW 185
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
N + I +EI +GPV M +Y + + YK G+YK AG +G H +++IGWG + G
Sbjct: 186 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAG-- 243
Query: 452 TSSVVKYWLVANSFNTNWGENG 473
++YWL NS+N+NWG NG
Sbjct: 244 ----IEYWLAMNSWNSNWGTNG 261
>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
Length = 334
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 144/258 (55%), Gaps = 27/258 (10%)
Query: 88 VQLSDPL---EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
++ DPL + P+ FD+R NW C I IRDQG+CGS W+ A +DR+C+++ G
Sbjct: 73 IKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGG 132
Query: 145 KRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCER 204
K + LS ++L CCKDCG GC GG KAW+Y+ T G+ +GG Y +K+GC PY++P R
Sbjct: 133 KFNQLLSPEELTFCCKDCGQGCGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCR 192
Query: 205 YMNGSHSSCQDNEPNTPECIRKC------QPGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
G + + +C + C Q Y E +N + +TI ++
Sbjct: 193 NKQGENICDEQPMERNHQCPKTCYGKTTVQNRYKTKSEYYIN----------SIKTIEQD 242
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKYW 317
I +GPVE S Y D+ +YK+GIY+ G H+I+IIGWGQE +GT YW
Sbjct: 243 IKTYGPVEASFDCYDDLSVYKSGIYRKSPNAKYKGGHSIKIIGWGQE---DGTP----YW 295
Query: 318 LVANSFNTNWGENGLFRI 335
L NS++ WG++G F+I
Sbjct: 296 LAVNSWSKFWGDHGTFKI 313
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 89/167 (53%), Gaps = 24/167 (14%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKC------QPGYDVSYEDDLNFGRIAYS 389
GC PY++P R G + +C + C Q Y E +N
Sbjct: 182 GCMPYKVPPCRNKQGENICDEQPMERNHQCPKTCYGKTTVQNRYKTKSEYYIN------- 234
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPL 448
+ +TI ++I +GPVE S Y D+ +YK+GIY+ G H+I+IIGWGQE
Sbjct: 235 ---SIKTIEQDIKTYGPVEASFDCYDDLSVYKSGIYRKSPNAKYKGGHSIKIIGWGQE-- 289
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+GT YWL NS++ WG++G F+I++G+NECGIE +TAG+P
Sbjct: 290 -DGTP----YWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIP 331
>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 340
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 92/241 (38%), Positives = 141/241 (58%), Gaps = 11/241 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FDAR W +C TI +RDQG+CGS WA+ A +DR+C+A+ + LS++++
Sbjct: 88 IPKKFDARKKWRHCTTIGAVRDQGNCGSCWAIATSSAFADRLCVATNADFNQLLSAEEIT 147
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDN 216
CC CG GC GG+ KAW+ + G+V+GG Y S +GC PY +P Y +++C
Sbjct: 148 FCCHKCGYGCNGGYPIKAWERFKKHGLVTGGEYKSGEGCEPYRVPPCPYDESGNNTCSGK 207
Query: 217 E-PNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C D+ ++DD R +Y L +I +++ +GP+E S +Y D
Sbjct: 208 PMEQNHRCTRMCYGDQDLDFDDDHRHTRDSYYLTIG--SIQKDVMTYGPIEASFDVYDDF 265
Query: 276 ILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+G+Y + LG HA+++IGWG+E GT YWL+ NS+N +WG+ GLF+
Sbjct: 266 LSYKSGVYVRSENASYLGGHAVKLIGWGEE---YGTP----YWLMMNSWNADWGDEGLFK 318
Query: 335 I 335
I
Sbjct: 319 I 319
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 90/162 (55%), Gaps = 11/162 (6%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANE-PNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P Y ++C C R C D+ ++DD R +Y L
Sbjct: 185 GCEPYRVPPCPYDESGNNTCSGKPMEQNHRCTRMCYGDQDLDFDDDHRHTRDSYYLTIG- 243
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
+I +++ +GP+E S +Y D + YK+G+Y + LG HA+++IGWG+E GT
Sbjct: 244 -SIQKDVMTYGPIEASFDVYDDFLSYKSGVYVRSENASYLGGHAVKLIGWGEE---YGTP 299
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWL+ NS+N +WG+ GLF+I RG NECG++ TAG+P
Sbjct: 300 ----YWLMMNSWNADWGDEGLFKIRRGTNECGVDNSTTAGVP 337
>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 174 bits (442), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 103/277 (37%), Positives = 150/277 (54%), Gaps = 27/277 (9%)
Query: 59 LSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T +E G + S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFAEARRLTGARIQKTSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +CGS WA+ A+SDR C G + +R+S+ L+SCC+DCG+GC GG+ G +W+
Sbjct: 110 ADQSACGSCWAVSTASAISDRHCTVG-GVQQLRISAAHLMSCCEDCGDGCDGGYPGTSWE 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERY-MNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V+ G+ S C+PY P C + G C +TP+C C D +
Sbjct: 169 YYVSHGLAS-------SYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCT---DKA 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +Y + E+ RE++ +GP +Y+D + YKTG+Y+HV+G LG H
Sbjct: 219 IPLIKYRGNHSYEV-HGEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGH 277
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
A+RI+GWG+ GT YW +ANS++T+WG NG
Sbjct: 278 AVRIVGWGKL---NGT----PYWKIANSWDTDWGMNG 307
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/171 (38%), Positives = 92/171 (53%), Gaps = 13/171 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERYM-NGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
+GL C+PY P C + G + C +TP+C C D + G
Sbjct: 172 SHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCT---DKAIPLIKYRGNH 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+Y + E+ RE++ +GP +Y+D + YKTG+Y+HV+G LG HA+RI+GWG+
Sbjct: 229 SYEV-HGEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL 287
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT YW +ANS++T+WG NG +RG NECGIEA AG P I
Sbjct: 288 ---NGT----PYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSPAI 331
>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
Length = 349
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 96/252 (38%), Positives = 142/252 (56%), Gaps = 15/252 (5%)
Query: 88 VQLSDPL---EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
++ DPL P+ FD+R NW C I IRDQG+CGS W+ A +DR+C+++ G
Sbjct: 73 IKKYDPLYVENNSPKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGG 132
Query: 145 KRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCER 204
K + LS ++L CC DCG GC GG+ KAWKY+ T G+ +GG Y +K+GC PY++P
Sbjct: 133 KFNQLLSPEELAFCCMDCGKGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCY 192
Query: 205 YMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP 264
G ++ +C + C Y + D + Y + + ETI +++ +GP
Sbjct: 193 DEQGKNTCGGKPMERNHQCPKTC---YGKTTVQDRYKTKNEYVINS-IETIEQDLMTYGP 248
Query: 265 VEGSMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
VE S +Y D +YK+GIY+ G H+I+IIGWG+E GT YWL NS+
Sbjct: 249 VEASFDVYDDFSVYKSGIYRKTPKAKYEGGHSIKIIGWGEE---NGTP----YWLAVNSW 301
Query: 324 NTNWGENGLFRI 335
+ WG++G F+I
Sbjct: 302 SKFWGDHGTFKI 313
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 99/185 (53%), Gaps = 16/185 (8%)
Query: 314 VKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANE-PNTPECIRKCQP 371
+K W + G + + GC PY++P C Y +++C +C + C
Sbjct: 160 IKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPC--YDEQGKNTCGGKPMERNHQCPKTC-- 215
Query: 372 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
Y + D + Y + + E TI +++ +GPVE S +Y D +YK+GIY+
Sbjct: 216 -YGKTTVQDRYKTKNEYVINSIE-TIEQDLMTYGPVEASFDVYDDFSVYKSGIYRKTPKA 273
Query: 432 PL-GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADI 490
G H+I+IIGWG+E GT YWL NS++ WG++G F+I++G+NECGIE +
Sbjct: 274 KYEGGHSIKIIGWGEE---NGTP----YWLAVNSWSKFWGDHGTFKIIKGRNECGIERAV 326
Query: 491 TAGLP 495
TAG+P
Sbjct: 327 TAGIP 331
>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
Length = 315
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 141/246 (57%), Gaps = 17/246 (6%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP FD+R WP CP+I I +QG+C S +A+ A A SDR+CI S G ++ +S+ ++
Sbjct: 61 LPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSAQQII 120
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGS---HSSC 213
SCC CG+GC GG ++W Y+ G VSGG Y S QGC+PY IP + MN HS
Sbjct: 121 SCCYLCGHGCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGHSCT 180
Query: 214 QDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
+ TP C +KC P Y S+ D+ G+ P M++IF +GP+ +Y
Sbjct: 181 TYHREETPICEKKCYNPNYYTSFRTDIYKGKYYKLSPY---MAMKDIFDNGPITTQFYMY 237
Query: 273 ADMILYKTGIYKHVAGGPLG---EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 329
D++ YK+G+Y++ H+++I GWG+E + V YWLVANSF T+WG
Sbjct: 238 RDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWGEE-------NGVPYWLVANSFGTDWGY 290
Query: 330 NGLFRI 335
NG F+I
Sbjct: 291 NGTFKI 296
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/167 (35%), Positives = 88/167 (52%), Gaps = 17/167 (10%)
Query: 336 GCRPYEIPCERYMNGS---RSSCQANEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP 391
GC+PY IP + MN S + TP C +KC P Y S+ D+ G+ P
Sbjct: 158 GCQPYTIPPCKLMNEKPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYKGKYYKLSP 217
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLG---EHAIRIIGWGQEPL 448
M++IF +GP+ +Y D++ YK+G+Y++ H+++I GWG+E
Sbjct: 218 Y---MAMKDIFDNGPITTQFYMYRDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWGEE-- 272
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ V YWLVANSF T+WG NG F+I RG + C + + AGLP
Sbjct: 273 -----NGVPYWLVANSFGTDWGYNGTFKISRGNDGCFFQEKMYAGLP 314
>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
Length = 335
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 100/254 (39%), Positives = 146/254 (57%), Gaps = 20/254 (7%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
PL + +D ++ FDAR +W C I +RDQG+CGS WA G A +DR+C+A+ G
Sbjct: 78 PLYTKNNDTIKH----FDAREDWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGG 133
Query: 145 KRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCE 203
+ +LS++ L CC CG GCQGG KAWKY+ GI +GG Y S +GC PY++ PC
Sbjct: 134 GFNEQLSAEKLTFCCWTCGLGCQGGNPIKAWKYFKRHGITTGGDYGSNEGCAPYKVPPC- 192
Query: 204 RYMNGSHSSCQDN-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
Y + CQ + +C R C Y S ++ + Y L ++ +TI ++I ++
Sbjct: 193 -YDDQGEFLCQGKPTEHNHKCPRAC---YGNSTVENRYKVKSIYVLDSS-KTIEQDIRKY 247
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
GPVE S +Y D I YK+GIY+ +G H++++IGWG+E + YWL+ N
Sbjct: 248 GPVEASFDVYDDFITYKSGIYQKTPNAFYVGGHSVKLIGWGEE-------DGIPYWLLVN 300
Query: 322 SFNTNWGENGLFRI 335
S++ WGE G FRI
Sbjct: 301 SWSKFWGEQGTFRI 314
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/163 (38%), Positives = 94/163 (57%), Gaps = 16/163 (9%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQAN-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC PY++P C Y + CQ + +C R C Y S ++ + Y L ++
Sbjct: 183 GCAPYKVPPC--YDDQGEFLCQGKPTEHNHKCPRAC---YGNSTVENRYKVKSIYVLDSS 237
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGT 452
+ TI ++I ++GPVE S +Y D I YK+GIY+ +G H++++IGWG+E
Sbjct: 238 K-TIEQDIRKYGPVEASFDVYDDFITYKSGIYQKTPNAFYVGGHSVKLIGWGEE------ 290
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YWL+ NS++ WGE G FRI++G+NECGIE TAG+P
Sbjct: 291 -DGIPYWLLVNSWSKFWGEQGTFRIIKGRNECGIERSATAGVP 332
>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 103/277 (37%), Positives = 149/277 (53%), Gaps = 27/277 (9%)
Query: 59 LSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T +E G + S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFAEARRLTGARIQKTSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +CGS WA+ A+SDR C G + +R+S+ L+SCC+DCG GC GG+ G +W+
Sbjct: 110 ADQSACGSCWAVSTASAISDRHCTVG-GVQQLRISAAHLMSCCEDCGYGCDGGYPGTSWE 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERY-MNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V+ G+ S C+PY P C + G C +TP+C C D +
Sbjct: 169 YYVSHGLAS-------SYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCT---DKA 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +Y + E+ RE++ +GP +Y+D + YKTG+Y+HV+G LG H
Sbjct: 219 IPLIKYRGNHSYEV-HGEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGH 277
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
A+RI+GWG+ GT YW +ANS++T+WG NG
Sbjct: 278 AVRIVGWGKL---NGT----PYWKIANSWDTDWGMNG 307
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/171 (38%), Positives = 92/171 (53%), Gaps = 13/171 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERYM-NGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
+GL C+PY P C + G + C +TP+C C D + G
Sbjct: 172 SHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCT---DKAIPLIKYRGNH 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+Y + E+ RE++ +GP +Y+D + YKTG+Y+HV+G LG HA+RI+GWG+
Sbjct: 229 SYEV-HGEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL 287
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT YW +ANS++T+WG NG +RG NECGIEA AG P I
Sbjct: 288 ---NGT----PYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSPAI 331
>gi|197129222|gb|ACH45720.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 236
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 83/157 (52%), Positives = 107/157 (68%), Gaps = 3/157 (1%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V + +E LP+ FD+R WP CPTI EIRDQGSCGS WA GAVEA+SDR+C+ +
Sbjct: 67 KLPERVDFAADVE-LPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI- 200
K V +S++DL+SCC +CG GC GG+ AW+YW G+VSGG Y S GCRPY I
Sbjct: 126 NAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYSIP 185
Query: 201 PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
PCE ++NG+ C +TP C R C+PGY SY++
Sbjct: 186 PCEHHVNGTRPPCTGEGGSTPRCSRHCEPGYSPSYKE 222
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 35/61 (57%), Gaps = 8/61 (13%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E GL +GCRPY IP CE ++NG+R C +TP C R C+PGY SY+
Sbjct: 162 WTERGLVSGGLYDSHVGCRPYSIPPCEHHVNGTRPPCTGEGGSTPRCSRHCEPGYSPSYK 221
Query: 379 D 379
+
Sbjct: 222 E 222
>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
Length = 207
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 102/226 (45%), Positives = 131/226 (57%), Gaps = 23/226 (10%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FDA WP CPTI EIRDQ CGS WA+ A AMSDR C G R +R+S+ DL+SCC
Sbjct: 1 FDAGEAWPNCPTITEIRDQSGCGSCWAVAARSAMSDRYCTRG-GVRDLRISAGDLLSCCN 59
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPN 219
CG GC GG AW Y+V TGIVS + C+PY P C ++N +H + E +
Sbjct: 60 ACGLGCNGGDPDWAWLYYVETGIVS-------EFCQPYPFPPCAHHVNSTHYTPCSVEYD 112
Query: 220 TPECIRKCQPGYD-VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILY 278
TP C C + Y+ GRI+YSL + EE RE+F +GP E + T+Y D + Y
Sbjct: 113 TPFCNITCTNTIPPIKYK-----GRISYSL-SGEEDYKRELFLYGPFEVAFTVYEDFVAY 166
Query: 279 KTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
G+YKH +G LG HA+R++GWG GT YW +ANS+N
Sbjct: 167 SDGVYKHFSGNALGGHAVRLVGWGNL---NGT----PYWKIANSWN 205
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 77/140 (55%), Gaps = 15/140 (10%)
Query: 329 ENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYD-VSYEDDLNFGRI 386
E G+ C+PY P C ++N + + + E +TP C C + Y+ GRI
Sbjct: 79 ETGIVSEFCQPYPFPPCAHHVNSTHYTPCSVEYDTPFCNITCTNTIPPIKYK-----GRI 133
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+YSL + EE RE+F +GP E + T+Y D + Y G+YKH +G LG HA+R++GWG
Sbjct: 134 SYSL-SGEEDYKRELFLYGPFEVAFTVYEDFVAYSDGVYKHFSGNALGGHAVRLVGWGNL 192
Query: 447 PLGEGTSSVVKYWLVANSFN 466
GT YW +ANS+N
Sbjct: 193 ---NGT----PYWKIANSWN 205
>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
Length = 260
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/242 (39%), Positives = 138/242 (57%), Gaps = 11/242 (4%)
Query: 96 ELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
++P FDAR + C I +++DQG+C S WA+ +DR+CIAS G+ LS+ +
Sbjct: 25 DIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQN 84
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE-IPCERYMNGSHSSC 213
L+SC GC GG KAW+ + GIV+GG + S +GC+PY+ PC+ Y + ++C
Sbjct: 85 LMSCGDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNC 144
Query: 214 QD-NEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMT 270
C +KC Y V YEDDL+ I Y N + I +EI +GPV M
Sbjct: 145 SSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMY 204
Query: 271 IYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
+Y + + YK GIYK G +G H +++IGWG + G+GT +YWL NS+N+NWG +
Sbjct: 205 VYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVD--GDGT----EYWLAMNSWNSNWGND 258
Query: 331 GL 332
GL
Sbjct: 259 GL 260
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 91/175 (52%), Gaps = 10/175 (5%)
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYE-IPCERYMNGSRSSCQA-NEPN 361
E +G S K W + + G N GC+PY+ PC+ Y + ++C +
Sbjct: 92 EKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNCSSLRRTQ 151
Query: 362 TPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMTIYADMIL 419
C +KC Y V YEDDL+ I Y N + I +EI +GPV M +Y + +
Sbjct: 152 MTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMYVYENFMG 211
Query: 420 YKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 474
YK GIYK G +G H +++IGWG + G+GT +YWL NS+N+NWG +GL
Sbjct: 212 YKEGIYKSTTGELIGYHHVKLIGWGVD--GDGT----EYWLAMNSWNSNWGNDGL 260
>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
Length = 310
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 101/275 (36%), Positives = 152/275 (55%), Gaps = 31/275 (11%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+ T+ + + +GV P +P+ + P +LP+ FDAR W C TI I D
Sbjct: 62 FANYTIEQFKHILGVKPTPPGLLAGVPIKIH---PEMDLPKEFDARTQWSSCSTIGNILD 118
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CG+ WA AVEA+ DR CI V LS +DL++CC CG+GC GG+ AW+Y
Sbjct: 119 QGHCGACWAFAAVEALQDRFCI--HLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRY 176
Query: 178 WVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ +G+V+ + C PY + C+ H C+ P TP+C RKC+ + ++
Sbjct: 177 FRRSGVVT-------EECDPYFDQTGCQ------HPGCEPAYP-TPKCQRKCKV-ENQAW 221
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI--YADMILYKTGIYKHVAGGPLGE 293
+++ +F AY + +N IM E++++GPVE + T D YK+G+YKH+ GG +G
Sbjct: 222 KENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGG 281
Query: 294 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
HA+++IGWG GE YWL+AN +N WG
Sbjct: 282 HAVKLIGWGTSDAGE------DYWLLANQWNRGWG 310
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 76/131 (58%), Gaps = 11/131 (8%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + EP TP+C RKC+ + +++++ +F AY + +N IM E+
Sbjct: 187 CDPYFDQTGCQHPGCEPAYPTPKCQRKCKV-ENQAWKENKHFSVNAYRVHSNPHDIMAEV 245
Query: 402 FRHGPVEGSMTI--YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYW 459
+++GPVE + T D YK+G+YKH+ GG +G HA+++IGWG GE YW
Sbjct: 246 YKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGE------DYW 299
Query: 460 LVANSFNTNWG 470
L+AN +N WG
Sbjct: 300 LLANQWNRGWG 310
>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 96/253 (37%), Positives = 138/253 (54%), Gaps = 19/253 (7%)
Query: 94 LEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
L++LP FDAR +P C I IRDQ +CGS WA G EA +DR+C+ S G LS+
Sbjct: 57 LQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSA 116
Query: 153 DDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASK------QGCRPYEIP-CERY 205
++ +C GC GG+ AW + GI +GG Y ++ GC PY+ P C +
Sbjct: 117 GEMNACAP--SYGCDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGDGCWPYDFPPCAHH 174
Query: 206 MNGS-HSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
+N + + C TP C+ +C P Y S ++D ++ + + I G
Sbjct: 175 INDTKYPKCPKGSYETPNCVEQCHNPKYSTSLKNDRHYMLESSPYQYSVNNAKNAIRTDG 234
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
PV S +Y D + YK+G+YKH +G LG HA++IIGWG+E GE YWLV NS+
Sbjct: 235 PVSASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEEN-GEA------YWLVVNSW 287
Query: 324 NTNWGENGLFRIG 336
N +WG++GLF+I
Sbjct: 288 NEDWGDHGLFKIA 300
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 91/165 (55%), Gaps = 12/165 (7%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPA 392
GC PY+ P C ++N ++ C TP C+ +C P Y S ++D ++ +
Sbjct: 162 GCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYSTSLKNDRHYMLESSPYQY 221
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ I GPV S +Y D + YK+G+YKH +G LG HA++IIGWG+E GE
Sbjct: 222 SVNNAKNAIRTDGPVSASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEEN-GEA- 279
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLV NS+N +WG++GLF+I G C I+ D+ G PK+
Sbjct: 280 -----YWLVVNSWNEDWGDHGLFKIALGN--CQIDDDLLGGTPKV 317
>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
Length = 197
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 85/182 (46%), Positives = 118/182 (64%), Gaps = 3/182 (1%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA GAVEA+SDR+CIAS+GK V LS+ DL+SCC+ CG GC GG AWK+WV GI
Sbjct: 1 SCWAFGAVEAISDRICIASKGKTQVTLSAADLLSCCRSCGFGCNGGDPLSAWKFWVKEGI 60
Query: 184 VSGGTYASKQGCRPYEIP-CERYMNGSH-SSCQDNEPNTPECIRKCQPGY-DVSYEDDLN 240
V+G +++ GC+PY P CE + N +H C+ + TP+C + CQ + + +Y++D
Sbjct: 61 VTGSNHSTNAGCKPYPFPACEHHSNKTHYDPCKHDLFPTPKCEKSCQATFGERTYKEDKY 120
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
FGR AY + + E I +EI +GPVE + +Y D + Y GIY H G G HA+++IG
Sbjct: 121 FGRSAYGVKNHMEAIQKEIITYGPVEVAFEVYEDFLNYAGGIYVHQGGALGGGHAVKMIG 180
Query: 301 WG 302
WG
Sbjct: 181 WG 182
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 77/146 (52%), Gaps = 11/146 (7%)
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSR-SSCQANE 359
G +PL S K+W+ G N GC+PY P CE + N + C+ +
Sbjct: 45 GGDPL-----SAWKFWVKEGIVT---GSNHSTNAGCKPYPFPACEHHSNKTHYDPCKHDL 96
Query: 360 PNTPECIRKCQPGY-DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 418
TP+C + CQ + + +Y++D FGR AY + + E I +EI +GPVE + +Y D +
Sbjct: 97 FPTPKCEKSCQATFGERTYKEDKYFGRSAYGVKNHMEAIQKEIITYGPVEVAFEVYEDFL 156
Query: 419 LYKTGIYKHVAGGPLGEHAIRIIGWG 444
Y GIY H G G HA+++IGWG
Sbjct: 157 NYAGGIYVHQGGALGGGHAVKMIGWG 182
>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
Length = 209
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 79/190 (41%), Positives = 128/190 (67%), Gaps = 10/190 (5%)
Query: 147 HVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERY 205
H +S+++L++CC+ CG+GC GG+ AW+ + G+V+GG Y SKQGC+PY I C+ +
Sbjct: 9 HAHVSANELLACCESCGDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAACDHH 68
Query: 206 MNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPV 265
+ G C+ + TP C +KC+ GY+V+++DD ++G+ +YS+ + + IM E+ GPV
Sbjct: 69 VVGKLKPCK-GDGKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSSVND-IMEELVTRGPV 126
Query: 266 EGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 325
E + T+Y+D + Y +G+Y+H G LG HA++I+G+G E + KYWLVANS+N
Sbjct: 127 EAAFTVYSDFLQYHSGVYRHTTGSALGGHAVKILGYGVE-------NGDKYWLVANSWNP 179
Query: 326 NWGENGLFRI 335
+WG+ G F+I
Sbjct: 180 DWGDQGFFKI 189
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 70/165 (42%), Positives = 110/165 (66%), Gaps = 10/165 (6%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+ GC+PY I C+ ++ G C+ + TP C +KC+ GY+V+++DD ++G+ +YS+ +
Sbjct: 54 KQGCQPYLIAACDHHVVGKLKPCKG-DGKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSS 112
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ IM E+ GPVE + T+Y+D + Y +G+Y+H G LG HA++I+G+G E
Sbjct: 113 VND-IMEELVTRGPVEAAFTVYSDFLQYHSGVYRHTTGSALGGHAVKILGYGVE------ 165
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ KYWLVANS+N +WG+ G F+I+RG +ECGIE I AG PK+
Sbjct: 166 -NGDKYWLVANSWNPDWGDQGFFKILRGVDECGIEGQIVAGEPKV 209
>gi|358341865|dbj|GAA49436.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 515
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 91/221 (41%), Positives = 122/221 (55%), Gaps = 3/221 (1%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
+ + R G ++ + R P + ++ +P FDAR W CP+I+EIR Q SCG
Sbjct: 49 MENVRWRFGAKRETTEQKARRPTVNNRFSNVD-IPMQFDARKYWLKCPSIREIRGQSSCG 107
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA GAVEAMSDR+CI S K LS+ DL+SCC CG GC GGF +AW YW T GI
Sbjct: 108 SCWAFGAVEAMSDRLCIHSGAKYQKGLSAVDLLSCCWKCGYGCDGGFPAQAWNYWSTDGI 167
Query: 184 VSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
V+GG+ + GCR Y P C G H C +TP C +KC + Y +L
Sbjct: 168 VTGGSKENPSGCRSYPFPSCSHDERGRHPLCPSEIYHTPRCTKKCDTD-KLHYSAELTKA 226
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
+Y++ ++ IM EI +GPVE +Y D + Y+ GIY
Sbjct: 227 NSSYNVLDSDREIMMEIMNNGPVEAVFDVYEDFLQYEKGIY 267
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCR Y P C G C + +TP C +KC + Y +L +Y++ ++
Sbjct: 178 GCRSYPFPSCSHDERGRHPLCPSEIYHTPRCTKKCDTD-KLHYSAELTKANSSYNVLDSD 236
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY 425
IM EI +GPVE +Y D + Y+ GIY
Sbjct: 237 REIMMEIMNNGPVEAVFDVYEDFLQYEKGIY 267
>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 162/326 (49%), Gaps = 60/326 (18%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
PK + A S ++ + +GV P + +P++ +LP+ FDAR
Sbjct: 51 PKAGWKAAMNPRFSNYSVGQFMHLLGVKPTLQKDLEGVPVITH--PKTLKLPKHFDARTA 108
Query: 107 WPYCPTIQEI------------------------------------RDQGSCGSGWALGA 130
WP C TI +I +DQG CGS WA GA
Sbjct: 109 WPQCSTIGKILGRLLDSFSSYFDDFFCFGCTDALYFSYHLLVPFYIKDQGHCGSCWAFGA 168
Query: 131 VEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTY 189
VE++SDR CI ++ LS +DL++CC CG+GC GG+ AW+Y++ G+V+
Sbjct: 169 VESLSDRFCI--HFGMNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYFIHHGVVT---- 222
Query: 190 ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 249
+ C PY SH C+ P TP+C+RKC + + +G+ AY +
Sbjct: 223 ---EECDPYF----DATGCSHPGCEPGYP-TPKCVRKCTDENQL-WRKAKRYGQSAYRIS 273
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
++ IM E++++GPVE + T+Y D Y++G+Y++ G +G HA+++IGWG GE
Sbjct: 274 SDPYQIMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGE- 332
Query: 310 TSSVVKYWLVANSFNTNWGENGLFRI 335
YW++AN +N NWG++G F I
Sbjct: 333 -----DYWILANQWNRNWGDDGYFMI 353
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 93/154 (60%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C+RKC + + +G+ AY + ++ IM E+
Sbjct: 225 CDPYFDATGCSHPGCEPGYPTPKCVRKCTDENQL-WRKAKRYGQSAYRISSDPYQIMAEV 283
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D Y++G+Y++ G +G HA+++IGWG GE YW++
Sbjct: 284 YKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGE------DYWIL 337
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N NWG++G F I RG NECGIE + AGLP
Sbjct: 338 ANQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLP 371
>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 354
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 85/176 (48%), Positives = 111/176 (63%), Gaps = 9/176 (5%)
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPN 219
+C + C GGF G AW+Y+ TGIV+GG + S QGC+PY+I C+ ++NG+ CQ P
Sbjct: 168 ECKHKCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQGEGP- 226
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
TPEC KC+ Y YE D ++ S+ N E EI +GPVE T+Y D YK
Sbjct: 227 TPECKHKCEASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYK 286
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G+Y+H GG LG HAI+I+GWG E EGT KYWLVANS+N WG+NG F+I
Sbjct: 287 SGVYQHTTGGVLGGHAIKILGWGVE---EGT----KYWLVANSWNNEWGDNGFFKI 335
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 81/162 (50%), Positives = 105/162 (64%), Gaps = 9/162 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY+I C+ ++NG++ CQ P TPEC KC+ Y YE D ++ S+ N
Sbjct: 201 GCQPYQIKSCDHHVNGTKGPCQGEGP-TPECKHKCEASYSTPYEQDKHYALSVNSISNNP 259
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E EI +GPVE T+Y D YK+G+Y+H GG LG HAI+I+GWG E EGT
Sbjct: 260 EATQTEIMTNGPVEADFTVYEDFPTYKSGVYQHTTGGVLGGHAIKILGWGVE---EGT-- 314
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
KYWLVANS+N WG+NG F+I+RG NECGIE+DI G+PK
Sbjct: 315 --KYWLVANSWNNEWGDNGFFKILRGSNECGIESDINFGIPK 354
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 41/61 (67%), Gaps = 2/61 (3%)
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPG 230
G AW+Y+ TGIV+GG + S QGC+PY+I C+ ++NG+ CQ P TPEC KC G
Sbjct: 118 GSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQGEGP-TPECKHKCNGG 176
Query: 231 Y 231
+
Sbjct: 177 F 177
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 26/39 (66%), Gaps = 2/39 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGY 373
GC+PY+I C+ ++NG++ CQ P TPEC KC G+
Sbjct: 140 GCQPYQIKSCDHHVNGTKGPCQGEGP-TPECKHKCNGGF 177
>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
Length = 272
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 103/281 (36%), Positives = 155/281 (55%), Gaps = 38/281 (13%)
Query: 57 NALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE-ELPEGFDARINWPYCPTIQE 115
N + ++S+L++ G D + L L V+ D + E+P+ FDAR+ W C +
Sbjct: 5 NDFGEASMSDLKVLCGTILDDP---DLLNLPVKQHDLTDMEIPKSFDARMEWSTCVRSHK 61
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQ-GGFHGKA 174
I DQG CGS WA + E +SDR+CI +RG ++ LSS+DL+SC K G GC GG +A
Sbjct: 62 IHDQGHCGSCWAFASTEVLSDRLCIQTRGSTNIILSSEDLLSCDK-AGRGCSDGGRLSEA 120
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W+Y G+V+ C+PY ++ PEC+ KC G +
Sbjct: 121 WRYMQKKGVVA-------NRCKPYTSGATGFI--------------PECMSKC-TGEGHA 158
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
Y+ +G Y++ + E I EI +GPVE + T+Y+D++ YK+G+Y H +GG LG H
Sbjct: 159 YQK--FYGLYLYTV-SGENQIKVEIMTNGPVEAAFTVYSDIVHYKSGVYHHTSGGKLGGH 215
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A++++GWG E E YWLVANS+ +WG+ G F+I
Sbjct: 216 AVKVLGWGVEDEEE-------YWLVANSWGPDWGDQGFFKI 249
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/135 (41%), Positives = 85/135 (62%), Gaps = 11/135 (8%)
Query: 363 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 422
PEC+ KC G +Y+ +G Y++ + E I EI +GPVE + T+Y+D++ YK+
Sbjct: 146 PECMSKC-TGEGHAYQK--FYGLYLYTV-SGENQIKVEIMTNGPVEAAFTVYSDIVHYKS 201
Query: 423 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN 482
G+Y H +GG LG HA++++GWG E E YWLVANS+ +WG+ G F+I RG +
Sbjct: 202 GVYHHTSGGKLGGHAVKVLGWGVEDEEE-------YWLVANSWGPDWGDQGFFKIKRGSD 254
Query: 483 ECGIEADITAGLPKI 497
ECGIE+ + G ++
Sbjct: 255 ECGIESRVLTGTARL 269
>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 171 bits (434), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 102/225 (45%), Positives = 129/225 (57%), Gaps = 20/225 (8%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FDA WP CPTI EIRDQ SCGS WA+ A A+SDR C G R +R+S+ DL+SCC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDLRISAGDLMSCCD 59
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPN 219
CG GC GG+ AW+Y+ GIVS + C+PY P C ++N S S E +
Sbjct: 60 VCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSGEYD 112
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
TP C C D G +Y L + EE+ RE+ +GP E S ++YAD + Y
Sbjct: 113 TPTCNSTCT---DKKVPLIKYRGNTSYLL-SGEESFKRELLLNGPFEVSFSVYADFLAYT 168
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
G+YKHVAG LG HA+RI+GWG E GE YW +ANS+N
Sbjct: 169 GGVYKHVAGTFLGGHAVRIVGWG-ELNGE------PYWKIANSWN 206
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 75/138 (54%), Gaps = 12/138 (8%)
Query: 330 NGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+G+ C+PY P C ++N S S + E +TP C C D G +Y
Sbjct: 80 HGIVSEYCQPYPFPSCAHHVNSSDLSPCSGEYDTPTCNSTCT---DKKVPLIKYRGNTSY 136
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
L + EE+ RE+ +GP E S ++YAD + Y G+YKHVAG LG HA+RI+GWG E
Sbjct: 137 LL-SGEESFKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWG-ELN 194
Query: 449 GEGTSSVVKYWLVANSFN 466
GE YW +ANS+N
Sbjct: 195 GE------PYWKIANSWN 206
>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
Length = 255
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/236 (39%), Positives = 133/236 (56%), Gaps = 11/236 (4%)
Query: 97 LPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+P FDAR + C I +++DQG+C S WA+ +DR+CIAS G+ LS+ +L
Sbjct: 26 IPRTFDARQYFVSCSDVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNL 85
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE-IPCERYMNGSHSSCQ 214
+SC + GC GG KAW+ ++ GIV+GG Y S +GC+PY+ PC+ Y + S ++C
Sbjct: 86 MSCGNEEKMGCDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKNRPCDHYGDSSLTNCS 145
Query: 215 D-NEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMTI 271
C KC Y V YEDDL+ I Y N + I +EI +GPV M +
Sbjct: 146 SLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTALMYV 205
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW 327
Y + + YK GIYK AG +G H +++IGWG + G +YWL NS+N+NW
Sbjct: 206 YENFMGYKKGIYKSTAGELIGYHHVKLIGWGVDEDG------TEYWLAMNSWNSNW 255
Score = 98.6 bits (244), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 86/171 (50%), Gaps = 10/171 (5%)
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYE-IPCERYMNGSRSSCQA-NEP 360
+E +G S K W + S G N GC+PY+ PC+ Y + S ++C +
Sbjct: 91 EEKMGCDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKNRPCDHYGDSSLTNCSSLRRT 150
Query: 361 NTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSMTIYADMI 418
C KC Y V YEDDL+ I Y N + I +EI +GPV M +Y + +
Sbjct: 151 QMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTALMYVYENFM 210
Query: 419 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW 469
YK GIYK AG +G H +++IGWG + G +YWL NS+N+NW
Sbjct: 211 GYKKGIYKSTAGELIGYHHVKLIGWGVDEDG------TEYWLAMNSWNSNW 255
>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 379
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 163/312 (52%), Gaps = 48/312 (15%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + A + + T++E + +GV K +P++ D +LP+ FDAR
Sbjct: 55 PNAGWKAAFNDRFANATVAEFKRLLGVIQTPKTAYLGVPIVRH--DLSLKLPKEFDARTA 112
Query: 107 WPYCPTIQEIRD--------------------QGSCGSGWALGAVEAMSDRVCIASRGKR 146
W +C +I+ I G CGS WA GAVE++SDR CI +
Sbjct: 113 WSHCTSIRRILVGYILNNVLLWSTITLWFWFLLGHCGSCWAFGAVESLSDRFCI--KYNL 170
Query: 147 HVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCE 203
+V LS++D+++CC CG GC GGF AW Y+ G+V+ Q C PY C
Sbjct: 171 NVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVT-------QECDPYFDNTGC- 222
Query: 204 RYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
SH C+ P TP+C RKC + + + ++G AY + + + IM E++++G
Sbjct: 223 -----SHPGCEPTYP-TPKCERKCVSRNQL-WGESKHYGVGAYRINPDPQDIMAEVYKNG 275
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
PVE + T+Y D YK+G+YK++ G +G HA+++IGWG GE YWL+AN +
Sbjct: 276 PVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGE------DYWLLANQW 329
Query: 324 NTNWGENGLFRI 335
N +WG++G F+I
Sbjct: 330 NRSWGDDGYFKI 341
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 95/154 (61%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C RKC + + + ++G AY + + + IM E+
Sbjct: 213 CDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQL-WGESKHYGVGAYRINPDPQDIMAEV 271
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YK++ G +G HA+++IGWG GE YWL+
Sbjct: 272 YKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGE------DYWLL 325
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N +WG++G F+I RG NECGIE + AGLP
Sbjct: 326 ANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLP 359
>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
Length = 335
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 98/254 (38%), Positives = 140/254 (55%), Gaps = 20/254 (7%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
PL + ++ ++ FDAR NW C I +RDQG+CGS WA G A +DR+C+A+ G
Sbjct: 78 PLYTKNNNKIKH----FDARENWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGG 133
Query: 145 KRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCE 203
+ +LS++ L CC CG GCQGG KAWKY+ GI +GG Y S +GC PY++ PC
Sbjct: 134 GFNEQLSAEKLTFCCWTCGLGCQGGNPIKAWKYFKRRGITTGGDYGSNEGCAPYKVPPC- 192
Query: 204 RYMNGSHSSCQDN-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
Y + CQ + +C R C V + + S +TI ++I +
Sbjct: 193 -YDDQGEFLCQGKPTEHNHKCPRACYGNSTVENRYKVESIYVLDSF----KTIEQDIRTY 247
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
GPVE S +Y D I YK+GIY+ +G H++++IGWG+E + YWL+ N
Sbjct: 248 GPVEASFDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWGEE-------DGIPYWLLVN 300
Query: 322 SFNTNWGENGLFRI 335
S++ WGE G FRI
Sbjct: 301 SWSKFWGEQGTFRI 314
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 88/163 (53%), Gaps = 16/163 (9%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQAN-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC PY++P C Y + CQ + +C R C V + + S
Sbjct: 183 GCAPYKVPPC--YDDQGEFLCQGKPTEHNHKCPRACYGNSTVENRYKVESIYVLDSF--- 237
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGT 452
+TI ++I +GPVE S +Y D I YK+GIY+ +G H++++IGWG+E
Sbjct: 238 -KTIEQDIRTYGPVEASFDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWGEE------ 290
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YWL+ NS++ WGE G FRI++G+NECGIE TAG+P
Sbjct: 291 -DGIPYWLLVNSWSKFWGEQGTFRIIKGRNECGIERSATAGIP 332
>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 102/225 (45%), Positives = 129/225 (57%), Gaps = 20/225 (8%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FDA WP CPTI EIRDQ SCGS WA+ A A+SDR C G R +R+S+ DL+SCC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDLRISAGDLMSCCD 59
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPN 219
CG GC GG+ AW+Y+ GIVS + C+PY P C ++N S S E +
Sbjct: 60 VCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSGEYD 112
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
TP C C D G +Y L + EE+ RE+ +GP E S ++YAD + Y
Sbjct: 113 TPTCNSTCT---DKKVPLIKYRGNTSYLL-SGEESFKRELLLNGPFEVSFSVYADFLAYT 168
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
G+YKHVAG LG HA+RI+GWG E GE YW +ANS+N
Sbjct: 169 GGVYKHVAGIFLGGHAVRIVGWG-ELNGE------PYWKIANSWN 206
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 75/138 (54%), Gaps = 12/138 (8%)
Query: 330 NGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+G+ C+PY P C ++N S S + E +TP C C D G +Y
Sbjct: 80 HGIVSEYCQPYPFPSCAHHVNSSDLSPCSGEYDTPTCNSTCT---DKKVPLIKYRGNTSY 136
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
L + EE+ RE+ +GP E S ++YAD + Y G+YKHVAG LG HA+RI+GWG E
Sbjct: 137 LL-SGEESFKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGIFLGGHAVRIVGWG-ELN 194
Query: 449 GEGTSSVVKYWLVANSFN 466
GE YW +ANS+N
Sbjct: 195 GE------PYWKIANSWN 206
>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
Length = 358
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 103/255 (40%), Positives = 136/255 (53%), Gaps = 18/255 (7%)
Query: 91 SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
S EE+P FDAR WP C I +RDQ CGS L A E SDR CI S G + L
Sbjct: 88 SQVFEEIPNSFDARQKWPSCSQIGAVRDQSDCGSAAHLVAAEIASDRTCIFSNGTFNWPL 147
Query: 151 SSDDLVSCC----KDCGN--GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCE 203
S+ D +SCC CG+ GC G + K+W T G+ +GG Y + GC+PY I PC+
Sbjct: 148 SAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTIYPCD 207
Query: 204 -RYMNGSHSSCQDNEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPANEETIMREIF 260
+Y NG+ +S +TP C +C + +SY+ D +FG+ Y++ I EI
Sbjct: 208 KKYPNGT-TSVPCPGYHTPVCEERCTSNITWPISYKQDKHFGKAHYNVGKKMTDIQTEIM 266
Query: 261 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
R+GPV S IY D YK+GIY H AG G +IIGW G + V YWL
Sbjct: 267 RNGPVIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGW-------GVDNGVPYWLCV 319
Query: 321 NSFNTNWGENGLFRI 335
+ + T++GENG RI
Sbjct: 320 HQWGTDFGENGFVRI 334
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 100/201 (49%), Gaps = 15/201 (7%)
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQAN 358
GWG + G ++K+W + G N + GC+PY I PC++ +S
Sbjct: 166 GWGCD--GSWPKDILKWW---QTHGLCTGGNYDDQFGCKPYTIYPCDKKYPNGTTSVPCP 220
Query: 359 EPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 416
+TP C +C + +SY+ D +FG+ Y++ I EI R+GPV S IY D
Sbjct: 221 GYHTPVCEERCTSNITWPISYKQDKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDD 280
Query: 417 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 476
YK+GIY H AG G +IIGW G + V YWL + + T++GENG R
Sbjct: 281 FWDYKSGIYVHTAGDQEGGMDTKIIGW-------GVDNGVPYWLCVHQWGTDFGENGFVR 333
Query: 477 IVRGQNECGIEADITAGLPKI 497
I+RG NE IE + A P +
Sbjct: 334 ILRGVNEVNIEHQVLAAQPDL 354
>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 365
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 99/264 (37%), Positives = 145/264 (54%), Gaps = 24/264 (9%)
Query: 88 VQLSDPLEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKR 146
V ++ L ++P+ FDAR + C I +RDQ +CGS WA G VEA + RVCI S GK
Sbjct: 91 VYPAEELVDIPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNARVCIKSGGKL 150
Query: 147 HVRLSSDDLVSCCKDCGN-----GCQGGFHGKAWKYWVTTGIVSGGTYASKQ------GC 195
+ LS+ D+++CC + G+ GC GG +W + T GIVSGG + ++ GC
Sbjct: 151 NQLLSAADMLACC-NIGHFCLSFGCSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGC 209
Query: 196 RPYEIP-CERYMNGS-HSSCQDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAY-SLPAN 251
PY P C + S + C +TP C C Y +++ D ++ + S +
Sbjct: 210 WPYNFPKCAHHQKESDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGS 269
Query: 252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 311
+I +EI +GP + ++Y D + YK+G+YKH +GG LG HA+ IIGW GT
Sbjct: 270 TSSIKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGW-------GTE 322
Query: 312 SVVKYWLVANSFNTNWGENGLFRI 335
V YWLV NS+N WG++G F+I
Sbjct: 323 KGVDYWLVMNSWNEEWGDHGTFKI 346
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 90/166 (54%), Gaps = 13/166 (7%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKC-QPGYDVSYEDDLNFGRIAY-SLP 391
GC PY P C + S C +TP C C Y +++ D ++ + S
Sbjct: 208 GCWPYNFPKCAHHQKESDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRF 267
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+ +I +EI +GP + ++Y D + YK+G+YKH +GG LG HA+ IIGW G
Sbjct: 268 GSTSSIKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGW-------G 320
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
T V YWLV NS+N WG++G F+IV+G +CGI+ I AG P I
Sbjct: 321 TEKGVDYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDMILAGTPAI 364
>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
Length = 215
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 87/216 (40%), Positives = 126/216 (58%), Gaps = 6/216 (2%)
Query: 92 DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLS 151
D +E+P FDAR W C TI E+RDQG+C SGWAL A +DR+C+A+ G + LS
Sbjct: 1 DNYQEIPRKFDARKKWLRCKTIGEVRDQGNCASGWALSTSSAFADRLCVATNGDFNQLLS 60
Query: 152 SDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHS 211
++++ CC CGNGC GG+ +AWK + G+V+GG Y S +GC PY +P Y ++
Sbjct: 61 AEEITFCCHTCGNGCYGGYPIRAWKSFKKHGLVTGGNYKSGEGCEPYRVPPCPYDEYGNN 120
Query: 212 SC--QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
+C Q E N C R C D+ ++ D + R Y L I +++ +GP+E S
Sbjct: 121 TCSGQPMESNH-RCTRMCYGNQDLDFDQDHRYTRDHYYL--TYRGIQKDVINYGPIEASF 177
Query: 270 TIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQE 304
+Y D YK+GIY K LG H++++IGWG+E
Sbjct: 178 DVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEE 213
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 58/114 (50%), Gaps = 6/114 (5%)
Query: 336 GCRPYEIPCERYMNGSRSSC--QANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC PY +P Y ++C Q E N C R C D+ ++ D + R Y L
Sbjct: 103 GCEPYRVPPCPYDEYGNNTCSGQPMESNH-RCTRMCYGNQDLDFDQDHRYTRDHYYLTY- 160
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQE 446
I +++ +GP+E S +Y D YK+GIY K LG H++++IGWG+E
Sbjct: 161 -RGIQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEE 213
>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 250
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 132/228 (57%), Gaps = 13/228 (5%)
Query: 126 WALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVS 185
WA+ + ++SDR CI + G V+LS+ +L+SC K+ GCQ GF +W YW+ G+V+
Sbjct: 30 WAVASAASISDRTCIQTNGTMKVQLSAIELISCSKN-KLGCQIGFSEFSWDYWLKNGLVT 88
Query: 186 GGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRI 244
G GC PY P C+ + S+ C P C + C+ GY + Y+ D ++GR+
Sbjct: 89 G----DPTGCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIPYKADKHYGRV 144
Query: 245 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
YSL NE I +EI +GPVE + +++D + YK+G+Y+H+ G + H++RIIGWG E
Sbjct: 145 IYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIE 204
Query: 305 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMNGSR 352
+ + YWL ANS+N +WG NG F+I E E ++N +
Sbjct: 205 -------NDIPYWLCANSWNEDWGLNGYFKILRGSNECEIESFVNAGK 245
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 102/171 (59%), Gaps = 11/171 (6%)
Query: 327 WGENGLFR---IGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLN 382
W +NGL GC PY P C+ + S C P C + C+ GY + Y+ D +
Sbjct: 81 WLKNGLVTGDPTGCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIPYKADKH 140
Query: 383 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 442
+GR+ YSL NE I +EI +GPVE + +++D + YK+G+Y+H+ G + H++RIIG
Sbjct: 141 YGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIG 200
Query: 443 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
WG E + + YWL ANS+N +WG NG F+I+RG NEC IE+ + AG
Sbjct: 201 WGIE-------NDIPYWLCANSWNEDWGLNGYFKILRGSNECEIESFVNAG 244
>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
Length = 253
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 92/235 (39%), Positives = 133/235 (56%), Gaps = 17/235 (7%)
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
CP+++EIRDQ +CGS WA G+ EAM+DR+CIAS G LS+ D+ SC K GC GG
Sbjct: 1 CPSLKEIRDQANCGSCWAFGSTEAMTDRMCIASNGTVTTHLSAQDVTSCDKLGDMGCNGG 60
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQ 228
+ YW +GIV GG Y K GC Y++ PC ++N S +E P+C RKC+
Sbjct: 61 IPSSVYSYWALSGIVDGGNYGDKSGCWSYQLEPCAHHVNSSKYPACPDEVRAPKCARKCE 120
Query: 229 PGYDVSYEDDLNFGRIAYSLPANEE-------TIMREIFRHGPVEGSMTIYADMILYKTG 281
D + G YS+ E + +I+++GP+ G + D + YK+G
Sbjct: 121 -SEDKDWTKAKVKGEKGYSVCQQGELEGTCAIKMAADIYQNGPITGMFFVKQDFLAYKSG 179
Query: 282 IYK-HVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+Y+ + PLG HAI+I+G+G E + YWLVANS+N +WG++G F+I
Sbjct: 180 VYEPKLLSPPLGGHAIKIMGFGTEDGKD-------YWLVANSWNEDWGDDGYFKI 227
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/197 (33%), Positives = 104/197 (52%), Gaps = 20/197 (10%)
Query: 311 SSVVKYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKC 369
SSV YW ++ + G N + GC Y++ PC ++N S+ +E P+C RKC
Sbjct: 63 SSVYSYWALSGIVD---GGNYGDKSGCWSYQLEPCAHHVNSSKYPACPDEVRAPKCARKC 119
Query: 370 QPGYDVSYEDDLNFGRIAYSLPANEE-------TIMREIFRHGPVEGSMTIYADMILYKT 422
+ D + G YS+ E + +I+++GP+ G + D + YK+
Sbjct: 120 E-SEDKDWTKAKVKGEKGYSVCQQGELEGTCAIKMAADIYQNGPITGMFFVKQDFLAYKS 178
Query: 423 GIYK-HVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ 481
G+Y+ + PLG HAI+I+G+G E + YWLVANS+N +WG++G F+I+RG+
Sbjct: 179 GVYEPKLLSPPLGGHAIKIMGFGTEDGKD-------YWLVANSWNEDWGDDGYFKIIRGK 231
Query: 482 NECGIEADITAGLPKIG 498
N C IE + G P G
Sbjct: 232 NACQIEDPVINGGPVAG 248
>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 101/230 (43%), Positives = 127/230 (55%), Gaps = 30/230 (13%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FDA WP CPTI EIRDQ SCGS WA+ A AMSDR C G R +R+S+ DL+SCC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLG-GVRDLRISAGDLMSCCD 59
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPN 219
CG GC GG+ AW+Y+ GIVS + C+PY P C ++N S S E +
Sbjct: 60 VCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSGEYD 112
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAY-----SLPANEETIMREIFRHGPVEGSMTIYAD 274
TP C C D I Y + + EE+ RE+ +GP E S ++YAD
Sbjct: 113 TPTCNSTCT---------DKKIPLIKYRGNTSCILSGEESFKRELLLNGPFEVSFSVYAD 163
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
+ Y G+YKHV G LG HA+RI+GWG E GE YW +ANS+N
Sbjct: 164 FVAYTGGVYKHVTGVFLGGHAVRIVGWG-ELNGE------PYWKIANSWN 206
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 73/143 (51%), Gaps = 22/143 (15%)
Query: 330 NGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+G+ C+PY P C ++N S S + E +TP C C D I Y
Sbjct: 80 HGIVSEYCQPYPFPSCAHHVNSSDLSPCSGEYDTPTCNSTCT---------DKKIPLIKY 130
Query: 389 -----SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 443
+ + EE+ RE+ +GP E S ++YAD + Y G+YKHV G LG HA+RI+GW
Sbjct: 131 RGNTSCILSGEESFKRELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGW 190
Query: 444 GQEPLGEGTSSVVKYWLVANSFN 466
G E GE YW +ANS+N
Sbjct: 191 G-ELNGE------PYWKIANSWN 206
>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 405
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 96/247 (38%), Positives = 132/247 (53%), Gaps = 15/247 (6%)
Query: 92 DPLEELPEGFDARINWPYCPTI-QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
D EE+PE FDA WP C + IRDQ +CGS WA+ + MSDR+C+A+ GK V +
Sbjct: 67 DLSEEIPESFDAAEKWPECAEVFNNIRDQSNCGSCWAVSSAGVMSDRICVATNGKVKVSI 126
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGS 209
S SC G+GC GG A++ ++ G +G QGC+PY C ++N +
Sbjct: 127 SGIATASCVG--GDGCNGGLEEVAFEKFIENGFPTGSEVDKHQGCQPYPFKHCAHHVNST 184
Query: 210 HSSCQDNEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEG 267
D+ P C +CQ YD YE+DL +G+ Y ++E I REI +GPV
Sbjct: 185 EYPPCDSVPEYKADTCSHECQKDYDRKYEEDLYYGKEQYGF-SDEAPIQREIMTNGPVAV 243
Query: 268 SMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
S T+Y + Y GIY+ G + G HA+R++GWG E GT KYW +ANS+N
Sbjct: 244 SFTVYESFLYYSGGIYRSTPGERIKGYHAVRVVGWGVE---NGT----KYWKIANSWNEQ 296
Query: 327 WGENGLF 333
WG L
Sbjct: 297 WGRERLL 303
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/157 (37%), Positives = 81/157 (51%), Gaps = 13/157 (8%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQA-NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
GC+PY C ++N + C + E C +CQ YD YE+DL +G+ Y +
Sbjct: 168 GCQPYPFKHCAHHVNSTEYPPCDSVPEYKADTCSHECQKDYDRKYEEDLYYGKEQYGF-S 226
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEG 451
+E I REI +GPV S T+Y + Y GIY+ G + G HA+R++GWG E G
Sbjct: 227 DEAPIQREIMTNGPVAVSFTVYESFLYYSGGIYRSTPGERIKGYHAVRVVGWGVE---NG 283
Query: 452 TSSVVKYWLVANSFNTNWGENGLF-RIVRGQNECGIE 487
T KYW +ANS+N WG L G +E IE
Sbjct: 284 T----KYWKIANSWNEQWGRERLLPHTPAGVDESDIE 316
>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
Length = 343
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 97/279 (34%), Positives = 134/279 (48%), Gaps = 55/279 (19%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGW------------------------------ 126
L E FDAR WP C I I+DQ +C W
Sbjct: 60 LEEHFDAREKWPECKYIGFIKDQSTCSCCWVSGDFLYHYDQWKIILLFDFSSSSSHWLFI 119
Query: 127 ----ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
A+ + M+DR CIA +G++ LS ++L SCC CG GC GGF A+KYW G
Sbjct: 120 STFKAMSSASVMTDRTCIAYKGEQQPFLSDEELTSCCTSCGYGCNGGFPLLAFKYWNEIG 179
Query: 183 IVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
+ +GG Y SK GC+P+ I + + TP C KC Y + D +G
Sbjct: 180 VPTGGPYGSKSGCKPFSI--------APPTSSSTAAQTPLCQLKCISDYKRKLDKDRYYG 231
Query: 243 RIAYSLPANEE---TIMREIFRHGPVEGSMTIYADMILYKTGIY---KHVAGGPLGEHAI 296
Y + ++ + TI REI HGPV +M I+ + YK+G+Y K LG HA+
Sbjct: 232 ESYYLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLYYKSGVYSANKRNDDPSLGLHAV 291
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++IGWG++ + YWLV NS+NT +GE GLF+I
Sbjct: 292 KLIGWGEQ-------KRIPYWLVVNSWNTTFGEQGLFKI 323
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/168 (37%), Positives = 90/168 (53%), Gaps = 22/168 (13%)
Query: 334 RIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
+ GC+P+ I + + + TP C KC Y + D +G Y + ++
Sbjct: 189 KSGCKPFSI--------APPTSSSTAAQTPLCQLKCISDYKRKLDKDRYYGESYYLITSS 240
Query: 394 EE---TIMREIFRHGPVEGSMTIYADMILYKTGIY---KHVAGGPLGEHAIRIIGWGQEP 447
+ TI REI HGPV +M I+ + YK+G+Y K LG HA+++IGWG++
Sbjct: 241 NQPVKTIQREIMDHGPVVAAMEIFESFLYYKSGVYSANKRNDDPSLGLHAVKLIGWGEQ- 299
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIE-ADITAGL 494
+ YWLV NS+NT +GE GLF+I RG NECGIE +TAGL
Sbjct: 300 ------KRIPYWLVVNSWNTTFGEQGLFKIRRGTNECGIENLHVTAGL 341
>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 104/292 (35%), Positives = 160/292 (54%), Gaps = 28/292 (9%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARIN 106
P + + + + T++E + +GV P K +P++ D +LP+ FDAR
Sbjct: 55 PNAGWKASLNDRFANATVAEFKRLLGVKPTPKTAYLGVPIVRH--DLSLKLPKEFDARTA 112
Query: 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGC 166
W C +I I DQG CGS WA GAVE++SDR CI + +V LS++D+V+CC
Sbjct: 113 WSQCTSIPRILDQGHCGSCWAFGAVESLSDRFCI--KYNLNVSLSANDVVACCGLLCGLG 170
Query: 167 QGG-FHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPEC 223
G F AW Y+ G+V+ + C PY C SH C+ P TP+C
Sbjct: 171 CNGGFPMGAWLYFKYHGVVT-------EECDPYFDNTGC------SHPGCEPGYP-TPKC 216
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
+RKC + + + ++G AY + + + IM E++++GPVE + T+Y D YK+G+Y
Sbjct: 217 VRKCVSENQL-WGESKHYGVSAYRINHDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY 275
Query: 284 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
KH+ G +G HA+++IGWG GE YWL+AN +N +WG++G F+I
Sbjct: 276 KHITGTKIGGHAVKLIGWGTSDDGE------DYWLLANQWNRSWGDDGYFKI 321
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 96/154 (62%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C+RKC + + + ++G AY + + + IM E+
Sbjct: 193 CDPYFDNTGCSHPGCEPGYPTPKCVRKCVSENQL-WGESKHYGVSAYRINHDPQDIMAEV 251
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+
Sbjct: 252 YKNGPVEVAFTVYEDFAHYKSGVYKHITGTKIGGHAVKLIGWGTSDDGE------DYWLL 305
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N +WG++G F+I RG NECGIE + AGLP
Sbjct: 306 ANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLP 339
>gi|741376|prf||2007265A cathepsin B
Length = 153
Score = 169 bits (428), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 77/153 (50%), Positives = 115/153 (75%), Gaps = 8/153 (5%)
Query: 344 CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 403
CE ++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI++
Sbjct: 1 CEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYK 59
Query: 404 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 463
+GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVAN
Sbjct: 60 NGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGTP----YWLVAN 112
Query: 464 SFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
S+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 113 SWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 145
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 66/134 (49%), Positives = 97/134 (72%), Gaps = 8/134 (5%)
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
CE ++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI++
Sbjct: 1 CEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYK 59
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVAN
Sbjct: 60 NGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGTP----YWLVAN 112
Query: 322 SFNTNWGENGLFRI 335
S+NT+WG+NG F+I
Sbjct: 113 SWNTDWGDNGFFKI 126
>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 169 bits (427), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 101/225 (44%), Positives = 128/225 (56%), Gaps = 20/225 (8%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FDA WP CPT+ EIRDQ SCGS WA+ A A+SDR C G R +R+S+ DL+SCC
Sbjct: 1 FDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDLRISAGDLMSCCD 59
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPN 219
CG GC GG+ AW+Y+ GIVS + C+PY P C ++N S S E +
Sbjct: 60 VCGFGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSGEYD 112
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
TP C C D G +Y L + EE RE+ +GP E S ++YAD + Y
Sbjct: 113 TPTCNSTCT---DKKIPLIKYRGNTSYVL-SGEEPFKRELILNGPFEVSFSVYADFVAYT 168
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
G+YKHVAG LG HA+RI+GWG E GE YW +ANS+N
Sbjct: 169 GGVYKHVAGIFLGGHAVRIVGWG-ELNGE------PYWKIANSWN 206
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 74/138 (53%), Gaps = 12/138 (8%)
Query: 330 NGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+G+ C+PY P C ++N S S + E +TP C C D G +Y
Sbjct: 80 HGIVSEYCQPYPFPSCAHHVNSSDLSPCSGEYDTPTCNSTCT---DKKIPLIKYRGNTSY 136
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
L + EE RE+ +GP E S ++YAD + Y G+YKHVAG LG HA+RI+GWG E
Sbjct: 137 VL-SGEEPFKRELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWG-ELN 194
Query: 449 GEGTSSVVKYWLVANSFN 466
GE YW +ANS+N
Sbjct: 195 GE------PYWKIANSWN 206
>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
Length = 180
Score = 168 bits (426), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 81/176 (46%), Positives = 113/176 (64%), Gaps = 2/176 (1%)
Query: 129 GAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGT 188
GAVEAMSDR+CI S G + LS+ DL+SCC++CG GC+GG+ AW YW T GIV+GG+
Sbjct: 1 GAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCENCGFGCRGGYPAVAWDYWKTHGIVTGGS 60
Query: 189 YASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS 247
GCR Y P CE ++ G + C TPEC+++C DV Y +D ++Y+
Sbjct: 61 KEDPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDTP-DVGYLEDKTRANMSYN 119
Query: 248 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 303
+ A+E +IM+EI GPVE T+Y D + Y +G+Y H G P+ HA+RI+GWG+
Sbjct: 120 IYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGE 175
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/111 (40%), Positives = 66/111 (59%), Gaps = 2/111 (1%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCR Y P CE ++ G C TPEC+++C DV Y +D ++Y++ A+E
Sbjct: 66 GCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDTP-DVGYLEDKTRANMSYNIYASE 124
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 445
+IM+EI GPVE T+Y D + Y +G+Y H G P+ HA+RI+GWG+
Sbjct: 125 ISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGE 175
>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 414
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 102/274 (37%), Positives = 142/274 (51%), Gaps = 34/274 (12%)
Query: 94 LEELPEGFDARINWPYC-PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
L++LP FDAR +P C I+ IRDQ CGS WA G EA +DR+CI S G LS+
Sbjct: 139 LQDLPTDFDARTAFPNCSKVIRHIRDQSDCGSCWAFGVTEAFNDRLCIKSNGTFTELLSA 198
Query: 153 DDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQ------GCRPYEIP-CERY 205
++ +C GC GG AW + GI +GG Y ++ GC PY+ P C +
Sbjct: 199 GEMNACAP--SFGCDGGIPSLAWSWVHNKGIATGGDYLAEDDMTKDDGCWPYDFPPCAHH 256
Query: 206 MNGS-HSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNF--GRIAYSLPANE-ETIMREIF 260
+N S + C + TP C +C P Y + DD +F + Y N+ + +R
Sbjct: 257 VNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEYSVNDAKNAIRTDG 316
Query: 261 RHGP------------VEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 308
GP V S +Y D + Y++G+YKH +G LG HA++IIGWG+E G+
Sbjct: 317 PVGPIYFCDPSVNFDQVSASFIVYEDFLAYRSGVYKHTSGKELGGHAVKIIGWGEE-TGQ 375
Query: 309 GTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI 342
YWLV NS+N +WG+NGLF+I EI
Sbjct: 376 A------YWLVVNSWNEDWGDNGLFKIALGNCEI 403
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/180 (36%), Positives = 94/180 (52%), Gaps = 27/180 (15%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQ-PGYDVSYEDDLNF--GRIAYSL 390
GC PY+ P C ++N S+ C + TP C +C P Y + DD +F + Y
Sbjct: 244 GCWPYDFPPCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEY 303
Query: 391 PANE-ETIMREIFRHGP------------VEGSMTIYADMILYKTGIYKHVAGGPLGEHA 437
N+ + +R GP V S +Y D + Y++G+YKH +G LG HA
Sbjct: 304 SVNDAKNAIRTDGPVGPIYFCDPSVNFDQVSASFIVYEDFLAYRSGVYKHTSGKELGGHA 363
Query: 438 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
++IIGWG+E G+ YWLV NS+N +WG+NGLF+I G C I+ D+ G PK+
Sbjct: 364 VKIIGWGEE-TGQA------YWLVVNSWNEDWGDNGLFKIALGN--CEIDDDLLGGTPKV 414
>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
Length = 194
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 81/175 (46%), Positives = 108/175 (61%), Gaps = 1/175 (0%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGI 183
S WA+ + AMSDR+CIAS+G + V +S+ D+VSCC CG GC GG+ KAW+++ G+
Sbjct: 1 SCWAVSSAAAMSDRICIASKGVKQVLISAQDMVSCCSYCGYGCDGGWPIKAWQFFAREGV 60
Query: 184 VSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
V+GG Y + CRPYEI PC + + ++ TP C RKCQ GY +Y+ D +G
Sbjct: 61 VTGGNYGRQGCCRPYEITPCGHHGREPYYGECYDDAQTPRCKRKCQSGYKTTYKKDKRYG 120
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 297
R AY LP + + I REI HGPV T+Y D Y GIYKH AG G HA++
Sbjct: 121 RKAYQLPNSVKAIQREIMMHGPVVAGYTVYEDFSYYTKGIYKHTAGRETGGHAVK 175
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 62/115 (53%), Gaps = 5/115 (4%)
Query: 326 NWGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFG 384
N+G G CRPYEI PC + ++ TP C RKCQ GY +Y+ D +G
Sbjct: 65 NYGRQGC----CRPYEITPCGHHGREPYYGECYDDAQTPRCKRKCQSGYKTTYKKDKRYG 120
Query: 385 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 439
R AY LP + + I REI HGPV T+Y D Y GIYKH AG G HA++
Sbjct: 121 RKAYQLPNSVKAIQREIMMHGPVVAGYTVYEDFSYYTKGIYKHTAGRETGGHAVK 175
>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
Length = 348
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 102/265 (38%), Positives = 146/265 (55%), Gaps = 35/265 (13%)
Query: 71 MGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALG 129
+G+HPD P ++ + + +PE FDAR WP C I +IRDQG+CGS WA
Sbjct: 55 LGLHPD---PDYKIQ--TKHHKIAKSIPESFDAREKWPECKDVIGKIRDQGTCGSCWAFA 109
Query: 130 AVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY 189
+ E M+DR+CI ++G+ S ++L++CC+DC C GG+ KAW Y++ GIVSGG Y
Sbjct: 110 STEVMTDRLCIGTKGETKFVFSPENLLTCCEDCRLECVGGYTAKAWDYYINEGIVSGGDY 169
Query: 190 ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSL 248
S +GC+PY +Y S +C++ CQ YDV Y+DD ++G Y+L
Sbjct: 170 NSSEGCQPYSKASFQYAVAS------------KCVKACQNDKYDVKYDDDKHYGDSFYTL 217
Query: 249 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 308
N I EI +GPV + ++ D+I YK+GI + I+ WG E E
Sbjct: 218 ETNVTQIQTEILTNGPVMATFNVFEDIIYYKSGIQL---------SNVSILRWGTE---E 265
Query: 309 GTSSVVKYWLVANSFNTNWGENGLF 333
G V YWL+ANS+ T WG+ G F
Sbjct: 266 G----VPYWLIANSWGTWWGDLGGF 286
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 83/165 (50%), Gaps = 30/165 (18%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +Y S+ C++ CQ YDV Y+DD ++G Y+L N
Sbjct: 174 GCQPYSKASFQYAVASK------------CVKACQNDKYDVKYDDDKHYGDSFYTLETNV 221
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI +GPV + ++ D+I YK+GI + I+ WG E EG
Sbjct: 222 TQIQTEILTNGPVMATFNVFEDIIYYKSGIQL---------SNVSILRWGTE---EG--- 266
Query: 455 VVKYWLVANSFNTNWGE-NGLFRIVRGQNECGIEADITAGLPKIG 498
V YWL+ANS+ T WG+ G +I RG NEC IE ++ AG IG
Sbjct: 267 -VPYWLIANSWGTWWGDLGGFIKIKRGTNECAIEQEMAAGNVHIG 310
>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
Length = 356
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/252 (39%), Positives = 135/252 (53%), Gaps = 18/252 (7%)
Query: 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
L ++P FD+R WP C I +RDQ CGS L AVE SDR CIAS G + LS+
Sbjct: 89 LVDIPSSFDSRQKWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNGTFNWPLSAQ 148
Query: 154 DLVSCC----KDCGN--GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCE-RY 205
D +SCC CG+ GC G + K+W T G+ +GG Y + GC+PY I PC+ +Y
Sbjct: 149 DPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKPYSIYPCDKKY 208
Query: 206 MNGSHSSCQDNEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
NG+ +S +TP C C + ++Y+ D +FG+ Y++ I EI +G
Sbjct: 209 ANGT-TSVPCPGYHTPTCEEHCTSNITWPIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNG 267
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
PV S IY D YKTGIY H AG G +IIGWG + + V YWL + +
Sbjct: 268 PVIASFIIYDDFWDYKTGIYVHTAGDQEGGMDTKIIGWGVD-------NGVPYWLCVHQW 320
Query: 324 NTNWGENGLFRI 335
T++GENG R
Sbjct: 321 GTDFGENGFVRF 332
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 72/200 (36%), Positives = 102/200 (51%), Gaps = 17/200 (8%)
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI-PCER-YMNGSRSSCQA 357
GWG + G ++K+W + G N + GC+PY I PC++ Y NG+ +S
Sbjct: 164 GWGCD--GSWPKDILKWW---QTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGT-TSVPC 217
Query: 358 NEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 415
+TP C C + ++Y+ D +FG+ Y++ I EI +GPV S IY
Sbjct: 218 PGYHTPTCEEHCTSNITWPIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNGPVIASFIIYD 277
Query: 416 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 475
D YKTGIY H AG G +IIGWG + + V YWL + + T++GENG
Sbjct: 278 DFWDYKTGIYVHTAGDQEGGMDTKIIGWGVD-------NGVPYWLCVHQWGTDFGENGFV 330
Query: 476 RIVRGQNECGIEADITAGLP 495
R +RG NE IE + A LP
Sbjct: 331 RFLRGVNEVNIEHQVLAALP 350
>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 393
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 106/270 (39%), Positives = 144/270 (53%), Gaps = 27/270 (10%)
Query: 77 SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPT-IQEIRDQGSCGSGWALGAVEAMS 135
SKLP+ S L LP+ FDAR ++ C T I +RDQ +CGS WA EA S
Sbjct: 111 SKLPKKP----ASESTALSNLPDRFDAREHFKNCATVIGHVRDQSTCGSCWAFATSEAFS 166
Query: 136 DRVCIASRGKRH-VRLSSDDLVSCCKDC----GNGCQGGFHGKAWKYWVTTGIVSGGTYA 190
DR+CI S G+ V LS+ +CC + GC GG AW+++ G+VS
Sbjct: 167 DRLCIRSSGEFDLVPLSAGHTAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVS----E 222
Query: 191 SKQGCRPYEIP-CERYMNGS-HSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRI-AY 246
GC PY P C ++ C+ N P +P C C+ + S+E D +F Y
Sbjct: 223 LDSGCWPYNFPECSHHVETKGMEPCKGNSP-SPVCSTTCRNHHFKPSFESDRHFTEDEGY 281
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 306
SL +E I +EI +GPV + T+Y D + YK+G+YKHV G LG HA++IIGW
Sbjct: 282 SLDEVDE-IKKEIIDNGPVAAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGW----- 335
Query: 307 GEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
GT +YWLV NS+N NWG+ G+F+I
Sbjct: 336 --GTDQNEQYWLVMNSWNVNWGDQGIFKIA 363
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 70/165 (42%), Positives = 98/165 (59%), Gaps = 15/165 (9%)
Query: 336 GCRPYEIP-CERYMNGS-RSSCQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRI-AYSLP 391
GC PY P C ++ C+ N P +P C C+ + S+E D +F YSL
Sbjct: 226 GCWPYNFPECSHHVETKGMEPCKGNSP-SPVCSTTCRNHHFKPSFESDRHFTEDEGYSLD 284
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+E I +EI +GPV + T+Y D + YK+G+YKHV G LG HA++IIGW G
Sbjct: 285 EVDE-IKKEIIDNGPVAAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGW-------G 336
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
T +YWLV NS+N NWG+ G+F+I G ECGI++++TAG+PK
Sbjct: 337 TDQNEQYWLVMNSWNVNWGDQGIFKIAIG--ECGIDSEVTAGIPK 379
>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 1308
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 95/239 (39%), Positives = 131/239 (54%), Gaps = 22/239 (9%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP FDA WP CPTI I++Q CGS WA GA+E++SDR CI V+LS DL+
Sbjct: 70 LPTNFDAAQQWPQCPTIGAIQNQAECGSCWAFGAIESISDRFCI--HKNESVQLSFQDLI 127
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDN 216
+C + NGC+GG A+KY G+V+ C+PY IP + C N
Sbjct: 128 TC-DNQDNGCEGGDPYTAYKYVQKNGVVTSN-------CQPYTIPT---CPPAQQPCM-N 175
Query: 217 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 276
NTP C KC V+++ DL+ + Y++ N I EI +GPVE +Y D +
Sbjct: 176 FVNTPPCSAKCANS-SVNFQQDLHHLKTVYAVKPNVAAIQNEIVTNGPVEACFEVYEDFL 234
Query: 277 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
YK+G+Y H +G LG H I+I+G+G + GT YW+ NS+ T+WG NG+F I
Sbjct: 235 GYKSGVYTHKSGKDLGGHCIKIVGFG---VSNGTP----YWICNNSWTTSWGNNGIFWI 286
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 86/156 (55%), Gaps = 12/156 (7%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+NG+ C+PY IP ++ C N NTP C KC V+++ DL+ + Y
Sbjct: 150 KNGVVTSNCQPYTIPT---CPPAQQPCM-NFVNTPPCSAKCANS-SVNFQQDLHHLKTVY 204
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
++ N I EI +GPVE +Y D + YK+G+Y H +G LG H I+I+G+G +
Sbjct: 205 AVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSGVYTHKSGKDLGGHCIKIVGFG---V 261
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNEC 484
GT YW+ NS+ T+WG NG+F I G+NEC
Sbjct: 262 SNGTP----YWICNNSWTTSWGNNGIFWIEAGKNEC 293
>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
Precursor
gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
Length = 311
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 97/268 (36%), Positives = 144/268 (53%), Gaps = 23/268 (8%)
Query: 69 MRMGVHPDSKLPQNRLPLLVQLSDPL-EELPEGFDARINWPYCPTIQEIRDQGSCGSGWA 127
+++G K NR L ++ DPL ++P F+A+ NWP C TI +I++Q CGS WA
Sbjct: 50 IKVGQLLGFKRSPNRPKLQIKSYDPLGVQIPTSFNAQTNWPNCTTISQIQNQARCGSCWA 109
Query: 128 LGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGG 187
GA E+ +DR+CI +V+LS D+V+ C + NGC+GG AW + G VS
Sbjct: 110 FGATESATDRLCI--HNNENVQLSFMDMVT-CDETDNGCEGGDAFSAWNWLRKQGAVS-- 164
Query: 188 TYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS 247
+ C PY IP + C N NTP C ++CQ + Y D + YS
Sbjct: 165 -----EECLPYTIP---TCPPAQQPCL-NFVNTPSCTKECQSNSSLIYSQDKHKMAKIYS 215
Query: 248 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 307
+ +E IM+EI +GPVE T++ D + YK+G+Y H G LG H ++++G+
Sbjct: 216 FDS-DEAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVKLVGF------ 268
Query: 308 EGTSSVVKYWLVANSFNTNWGENGLFRI 335
GT + V Y+ N + T+WG+NG F I
Sbjct: 269 -GTLNGVDYYAANNQWTTSWGDNGTFLI 295
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 63/171 (36%), Positives = 91/171 (53%), Gaps = 15/171 (8%)
Query: 326 NW-GENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFG 384
NW + G C PY IP ++ C N NTP C ++CQ + Y D +
Sbjct: 155 NWLRKQGAVSEECLPYTIP---TCPPAQQPC-LNFVNTPSCTKECQSNSSLIYSQDKHKM 210
Query: 385 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 444
YS + +E IM+EI +GPVE T++ D + YK+G+Y H G LG H ++++G+
Sbjct: 211 AKIYSFDS-DEAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVKLVGF- 268
Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT + V Y+ N + T+WG+NG F I RG +CGI D+ AGLP
Sbjct: 269 ------GTLNGVDYYAANNQWTTSWGDNGTFLIKRG--DCGISDDVVAGLP 311
>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
Length = 313
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 100/267 (37%), Positives = 143/267 (53%), Gaps = 22/267 (8%)
Query: 69 MRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWAL 128
+++G K NR + V +DP + P FD+R W C TI I +Q CGS WA
Sbjct: 50 IKIGSLLGFKKSLNRPSIPVLNADPNIKAPASFDSRTAWSNCTTIGYIENQARCGSCWAF 109
Query: 129 GAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGT 188
GAVE+ DR+CI V+LS DLV+ C +GC+GG AW + G+V+
Sbjct: 110 GAVESAQDRICI--HKGLDVQLSFLDLVT-CDQSDDGCEGGDDVSAWNFLKKQGVVT--- 163
Query: 189 YASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 248
Q C+PY IP + C N NTP C+++C+ + Y D + YS+
Sbjct: 164 ----QECKPYTIP---TCPPAQQPCL-NFVNTPNCVKQCESNSTLIYSQDKHKMAKIYSI 215
Query: 249 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 308
+ E IM+EI +GPVE ++Y D + YK+G+Y+H G LG H ++I G+
Sbjct: 216 NS-VEAIMQEISTNGPVEACFSVYEDFLGYKSGVYQHTTGKFLGGHCVKIFGY------- 267
Query: 309 GTSSVVKYWLVANSFNTNWGENGLFRI 335
GT + V YW VANS+ T+WG+NG+F I
Sbjct: 268 GTLNGVNYWSVANSWTTSWGDNGIFLI 294
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 98/167 (58%), Gaps = 12/167 (7%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+ G+ C+PY IP ++ C N NTP C+++C+ + Y D + Y
Sbjct: 158 KQGVVTQECKPYTIP---TCPPAQQPC-LNFVNTPNCVKQCESNSTLIYSQDKHKMAKIY 213
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
S+ + E IM+EI +GPVE ++Y D + YK+G+Y+H G LG H ++I G+G
Sbjct: 214 SINS-VEAIMQEISTNGPVEACFSVYEDFLGYKSGVYQHTTGKFLGGHCVKIFGYG---- 268
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
T + V YW VANS+ T+WG+NG+F I RG +ECGIE ++ AG+P
Sbjct: 269 ---TLNGVNYWSVANSWTTSWGDNGIFLIKRGSDECGIEDEVVAGIP 312
>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
Length = 179
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 82/177 (46%), Positives = 110/177 (62%), Gaps = 2/177 (1%)
Query: 129 GAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGT 188
GAVEAMSDR+CI S G + LS+ DL+SCCKDCG GC GGF AW +W T GIV+GG+
Sbjct: 1 GAVEAMSDRLCIHSSGAFNKSLSAVDLLSCCKDCGYGCDGGFPPMAWDFWKTHGIVTGGS 60
Query: 189 YASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS 247
GCRPY P C+ + G + C TP+C++ C + Y+ D +Y+
Sbjct: 61 KEEPAGCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHCDTP-KIDYQKDKTRANTSYN 119
Query: 248 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
+ +E IM+EI +GPVE + ++ D YK+GIY H GG +G HAIRI+GWG+E
Sbjct: 120 VHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEE 176
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 64/112 (57%), Gaps = 2/112 (1%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GCRPY P C+ + G C TP+C++ C + Y+ D +Y++ +E
Sbjct: 66 GCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHCDTP-KIDYQKDKTRANTSYNVHQSE 124
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
IM+EI +GPVE + ++ D YK+GIY H GG +G HAIRI+GWG+E
Sbjct: 125 VAIMKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEE 176
>gi|221219800|gb|ACM08561.1| Cathepsin B precursor [Salmo salar]
gi|221222296|gb|ACM09809.1| Cathepsin B precursor [Salmo salar]
Length = 205
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 78/141 (55%), Positives = 98/141 (69%), Gaps = 2/141 (1%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP +VQ + +E LP+ FD R WP CPT++EIRDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 KLPTMVQYAGDVE-LPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
K V +SS+DL+SCC CG GC GG+ AW +W T G+V+GG Y S GCRPY I P
Sbjct: 125 NAKVSVEISSEDLLSCCDSCGMGCNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPP 184
Query: 202 CERYMNGSHSSCQDNEPNTPE 222
CE ++NG+ C E +TP+
Sbjct: 185 CEHHVNGTRPPCTGEEGDTPQ 205
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 25/37 (67%), Gaps = 3/37 (8%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPE 364
GL+ +GCRPY IP CE ++NG+R C E +TP+
Sbjct: 169 GLYDSHVGCRPYSIPPCEHHVNGTRPPCTGEEGDTPQ 205
>gi|221221056|gb|ACM09189.1| Cathepsin B precursor [Salmo salar]
gi|221222300|gb|ACM09811.1| Cathepsin B precursor [Salmo salar]
Length = 207
Score = 165 bits (417), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 78/141 (55%), Positives = 97/141 (68%), Gaps = 2/141 (1%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP +VQ + +E LP+ FD R WP CPT++EIRDQGSCGS WA GA EA+SDRVCI S
Sbjct: 66 KLPTMVQYAGDVE-LPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 143 RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-P 201
K V +SS+DL+SCC CG GC GG+ AW +W T G+V+GG Y S GCRPY I P
Sbjct: 125 NAKVSVEISSEDLLSCCDSCGMGCNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPP 184
Query: 202 CERYMNGSHSSCQDNEPNTPE 222
CE ++NG+ C E +TP
Sbjct: 185 CEHHVNGTRPPCTGEEGDTPH 205
Score = 38.5 bits (88), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 3/37 (8%)
Query: 331 GLF--RIGCRPYEIP-CERYMNGSRSSCQANEPNTPE 364
GL+ +GCRPY IP CE ++NG+R C E +TP
Sbjct: 169 GLYDSHVGCRPYSIPPCEHHVNGTRPPCTGEEGDTPH 205
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 75/150 (50%), Positives = 113/150 (75%), Gaps = 8/150 (5%)
Query: 347 YMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP 406
++NGSR C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GP
Sbjct: 124 HVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGP 182
Query: 407 VEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 466
VEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS+N
Sbjct: 183 VEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGTP----YWLVANSWN 235
Query: 467 TNWGENGLFRIVRGQNECGIEADITAGLPK 496
T+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 236 TDWGDNGFFKILRGQDHCGIESEVVAGIPR 265
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 64/131 (48%), Positives = 95/131 (72%), Gaps = 8/131 (6%)
Query: 205 YMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP 264
++NGS C E +TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GP
Sbjct: 124 HVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGP 182
Query: 265 VEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
VEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS+N
Sbjct: 183 VEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGTP----YWLVANSWN 235
Query: 325 TNWGENGLFRI 335
T+WG+NG F+I
Sbjct: 236 TDWGDNGFFKI 246
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/81 (54%), Positives = 50/81 (61%), Gaps = 1/81 (1%)
Query: 85 PLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
P V ++ L+ LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI G
Sbjct: 69 PQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHVNG 127
Query: 145 KRHVRLSSDDLVSCCKDCGNG 165
R D C K C G
Sbjct: 128 SRPPCTGEGDTPKCSKICEPG 148
>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 164 bits (416), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 78/177 (44%), Positives = 109/177 (61%), Gaps = 2/177 (1%)
Query: 129 GAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGT 188
GAVEAM+DR+CI S +S+ DL+SCC+ CG GC GGF +AW +W+ G+V+GG+
Sbjct: 1 GAVEAMTDRLCIHSNATIKKHISATDLLSCCESCGFGCHGGFPPRAWDFWMENGLVTGGS 60
Query: 189 YASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS 247
+ GCR Y P C + G + C +TP C+ C D+ Y D + +Y+
Sbjct: 61 KENPSGCRSYPFPRCSHHGKGKYPPCPKTIFDTPNCVDHCDKP-DIDYAADKTHAKSSYN 119
Query: 248 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
+ +NE IM+EI R+GPVE + +Y D I YK+GIY H G LG HAIR++GWG+E
Sbjct: 120 VQSNERVIMKEIMRNGPVEAAFMVYEDFIEYKSGIYFHSHGKLLGGHAIRMLGWGEE 176
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 52/128 (40%), Positives = 69/128 (53%), Gaps = 9/128 (7%)
Query: 327 WGENGLFR-------IGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W ENGL GCR Y P C + G C +TP C+ C D+ Y
Sbjct: 50 WMENGLVTGGSKENPSGCRSYPFPRCSHHGKGKYPPCPKTIFDTPNCVDHCDKP-DIDYA 108
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D + +Y++ +NE IM+EI R+GPVE + +Y D I YK+GIY H G LG HAI
Sbjct: 109 ADKTHAKSSYNVQSNERVIMKEIMRNGPVEAAFMVYEDFIEYKSGIYFHSHGKLLGGHAI 168
Query: 439 RIIGWGQE 446
R++GWG+E
Sbjct: 169 RMLGWGEE 176
>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
Length = 358
Score = 164 bits (416), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 105/290 (36%), Positives = 148/290 (51%), Gaps = 18/290 (6%)
Query: 56 KNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
K S++T E R+ K ++++ + + L ++P FD+R WP C I
Sbjct: 53 KAETSRMTFQEKMARVKDIKFIKSHEDQMVGDSENNQVLLDIPTYFDSRQKWPECTQIGA 112
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC----KDCGN--GCQGG 169
+RDQ CGS L AVE SDR CI S G + LS+ D +SCC CG+ GC G
Sbjct: 113 VRDQSDCGSAAHLVAVELASDRTCIFSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGS 172
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCE-RYMNGSHSSCQDNEPNTPECIRKC 227
+ K+W T G+ +GG Y + GC+PY I PC+ +Y NG+ +S +TP C C
Sbjct: 173 WPKDILKWWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYPNGT-TSVPCPGYHTPTCEEHC 231
Query: 228 QPG--YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 285
+ ++Y+ D +FG+ Y++ I EI +GPV S IY D YK+GIY H
Sbjct: 232 TSNITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFVIYDDFWDYKSGIYVH 291
Query: 286 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
AG G +IIGW G S V YWL + + T++GENG R
Sbjct: 292 TAGDQEGGMDTKIIGW-------GVDSGVPYWLCVHQWGTDFGENGFVRF 334
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 97/201 (48%), Gaps = 15/201 (7%)
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQAN 358
GWG + G ++K+W G N + GC+PY I PC++ +S
Sbjct: 166 GWGCD--GSWPKDILKWWQTHGLCT---GGNYEDQFGCKPYSIYPCDKKYPNGTTSVPCP 220
Query: 359 EPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 416
+TP C C + ++Y+ D +FG+ Y++ I EI +GPV S IY D
Sbjct: 221 GYHTPTCEEHCTSNITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFVIYDD 280
Query: 417 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 476
YK+GIY H AG G +IIGW G S V YWL + + T++GENG R
Sbjct: 281 FWDYKSGIYVHTAGDQEGGMDTKIIGW-------GVDSGVPYWLCVHQWGTDFGENGFVR 333
Query: 477 IVRGQNECGIEADITAGLPKI 497
+RG NE IE + A LP I
Sbjct: 334 FLRGVNEVNIEHQVLAALPDI 354
>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 104/261 (39%), Positives = 134/261 (51%), Gaps = 41/261 (15%)
Query: 80 PQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVC 139
P+ LP +++ E +PE FDAR WP +I IR+QG CGS WA GA E +SDR
Sbjct: 67 PEGSLPPEIEVR-VAENIPENFDARKQWP--GSIHPIRNQGQCGSCWAFGASEVLSDRFA 123
Query: 140 IASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVS----GGTYASKQGC 195
IAS+ + +V LS+ LV C D +GC GG+ AW Y V TG+++ G YA + C
Sbjct: 124 IASKNQIYVTLSAQQLVDCDLD-NSGCSGGWPINAWNYMVKTGLLTEQCYGPYYAKQYTC 182
Query: 196 RPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA-NEET 254
R NT +C QPG + + AY LPA N E
Sbjct: 183 RL-------------------TANTTDC--PWQPGVKARFYH----AKSAYKLPAKNVEA 217
Query: 255 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 314
I +I +GPVE TI+ D Y++GIY H G LG HAI+I+GW GT V
Sbjct: 218 IQTDIMNNGPVEADFTIFQDFYAYRSGIYVHATGKQLGGHAIKILGW-------GTEDNV 270
Query: 315 KYWLVANSFNTNWGENGLFRI 335
YWL ANS+ NWG G F+I
Sbjct: 271 DYWLCANSWGANWGIQGYFKI 291
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 62/138 (44%), Positives = 77/138 (55%), Gaps = 14/138 (10%)
Query: 361 NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA-NEETIMREIFRHGPVEGSMTIYADMIL 419
NT +C QPG + + AY LPA N E I +I +GPVE TI+ D
Sbjct: 187 NTTDC--PWQPGVKARFYH----AKSAYKLPAKNVEAIQTDIMNNGPVEADFTIFQDFYA 240
Query: 420 YKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVR 479
Y++GIY H G LG HAI+I+GW GT V YWL ANS+ NWG G F+I R
Sbjct: 241 YRSGIYVHATGKQLGGHAIKILGW-------GTEDNVDYWLCANSWGANWGIQGYFKIRR 293
Query: 480 GQNECGIEADITAGLPKI 497
G +ECGIE + AGLP +
Sbjct: 294 GTDECGIEDGLAAGLPLL 311
>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
Length = 355
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/252 (39%), Positives = 134/252 (53%), Gaps = 18/252 (7%)
Query: 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
L +P FD+R WP C I +RDQ CGS L AVE SDR CI+S G + LS+
Sbjct: 88 LINIPASFDSRQQWPECTQIGAVRDQSDCGSAAHLVAVEMASDRTCISSNGTFNWPLSAQ 147
Query: 154 DLVSCC----KDCGN--GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCER-Y 205
D +SCC CG+ GC G + K+W T G+ +GG Y + GC+PY I PC++ Y
Sbjct: 148 DPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYSIYPCDKNY 207
Query: 206 MNGSHSSCQDNEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
NG+ +S +TP C C + ++Y+ D +FG+ Y++ I EI +G
Sbjct: 208 PNGT-TSVPCPGYHTPPCEDHCTSNITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNG 266
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
PV S IY D YK+GIY H AG G +IIGW G + V YWL + +
Sbjct: 267 PVIASFIIYEDFWDYKSGIYVHTAGDQEGGMDTKIIGW-------GVDNGVPYWLCVHQW 319
Query: 324 NTNWGENGLFRI 335
T++GENG RI
Sbjct: 320 GTDFGENGFVRI 331
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 99/201 (49%), Gaps = 15/201 (7%)
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI-PCERYMNGSRSSCQAN 358
GWG + G ++K+W + G N + GC+PY I PC++ +S
Sbjct: 163 GWGCD--GSWPKDILKWW---QTHGLCTGGNYDDQFGCKPYSIYPCDKNYPNGTTSVPCP 217
Query: 359 EPNTPECIRKCQPG--YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 416
+TP C C + ++Y+ D +FG+ Y++ I EI +GPV S IY D
Sbjct: 218 GYHTPPCEDHCTSNITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFIIYED 277
Query: 417 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 476
YK+GIY H AG G +IIGW G + V YWL + + T++GENG R
Sbjct: 278 FWDYKSGIYVHTAGDQEGGMDTKIIGW-------GVDNGVPYWLCVHQWGTDFGENGFVR 330
Query: 477 IVRGQNECGIEADITAGLPKI 497
I+RG NE IE + A LP +
Sbjct: 331 ILRGVNEVNIEHQVLAALPDV 351
>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 88/251 (35%), Positives = 134/251 (53%), Gaps = 15/251 (5%)
Query: 94 LEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
L ++P FDAR + C I + DQ +C S WA+ VEA + R+CI S GK + LS+
Sbjct: 56 LADIPNSFDARDAFKECKDVIGHVWDQSACASCWAIAPVEAFNARLCIKSGGKFNQLLSA 115
Query: 153 DDLVSCCKDCGN----GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMN 207
++++CC + GC+GG AW + T GI + G+ ++ GC PY P C +
Sbjct: 116 GEMIACCNSTHSWQPRGCKGGMILNAWSFLKTHGIATEGSMSAADGCWPYNFPKCAHHQK 175
Query: 208 GS-HSSCQDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPV 265
S + C +TP C+ +C Y + + D +F + L + I +EI +GP
Sbjct: 176 KSKYEPCSKKLYDTPSCLDRCPNEKYGIPLDKDRHFTAHSPDLFEGTDNIKKEIMTNGPT 235
Query: 266 EGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 325
+ ++Y D + YK+G+YKH G +G H++ IIGWG E V YWLV NS+N
Sbjct: 236 SATFSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGTE-------KGVDYWLVMNSWNE 288
Query: 326 NWGENGLFRIG 336
WG++G F+I
Sbjct: 289 GWGDHGTFKIA 299
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 86/165 (52%), Gaps = 12/165 (7%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPA 392
GC PY P C + S+ C +TP C+ +C Y + + D +F + L
Sbjct: 161 GCWPYNFPKCAHHQKKSKYEPCSKKLYDTPSCLDRCPNEKYGIPLDKDRHFTAHSPDLFE 220
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ I +EI +GP + ++Y D + YK+G+YKH G +G H++ IIGWG E
Sbjct: 221 GTDNIKKEIMTNGPTSATFSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGTE------ 274
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWLV NS+N WG++G F+I +G +CGI+ + P +
Sbjct: 275 -KGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVLGSPPAM 316
>gi|146386348|gb|ABQ23962.1| cathepsin B [Oryctolagus cuniculus]
Length = 228
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 93/185 (50%), Positives = 131/185 (70%), Gaps = 4/185 (2%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V+ +D ++ LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 46 KLPRRVEFADDIK-LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHT 104
Query: 143 RGKRHVRLSSDDLVS-CCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI- 200
G +V +S++D+++ C CG+GC GG+ AW +W G+VSGG Y S GC+PY I
Sbjct: 105 NGHVNVEVSAEDMLTCCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPYSIP 164
Query: 201 PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 260
PCE ++NGS +C E +TP C + C+PGY SY++D ++G +YS+ ++E I EI+
Sbjct: 165 PCEHHVNGSRPACT-GEGDTPRCSKTCEPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIY 223
Query: 261 RHGPV 265
++GPV
Sbjct: 224 KNGPV 228
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 57/89 (64%), Gaps = 9/89 (10%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W + GL +GC+PY IP CE ++NGSR +C E +TP C + C+PGY SY+
Sbjct: 141 WTKKGLVSGGLYDSHVGCKPYSIPPCEHHVNGSRPACTG-EGDTPRCSKTCEPGYSPSYK 199
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPV 407
+D ++G +YS+ ++E I EI+++GPV
Sbjct: 200 EDKHYGYSSYSVSSDENEIKAEIYKNGPV 228
>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
Length = 196
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 116/203 (57%), Gaps = 10/203 (4%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTG 182
S WA+ A E MSDR+C+ + G++ LS D+++CC D CG GC GG+ +AW Y +G
Sbjct: 1 SCWAVSAAETMSDRLCVQTNGRKKTLLSDTDILACCGDFCGYGCNGGYSARAWLYARNSG 60
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
+ SGG Y K C+PY PC + N ++ C + TP C + CQ GY YE D
Sbjct: 61 VCSGGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYCQYGYGKRYEKDKI 120
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+ AY + ++E I EIF GPV+ S Y D YK+GIY H AG G HA++IIG
Sbjct: 121 YAXDAYRVSSDEAAIRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGGHAVKIIG 180
Query: 301 WGQEPLGEGTSSVVKYWLVANSF 323
WG E GT K W+VANS+
Sbjct: 181 WGVE---NGT----KXWIVANSW 196
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/131 (41%), Positives = 70/131 (53%), Gaps = 9/131 (6%)
Query: 337 CRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C+PY PC + N + C + TP C + CQ GY YE D + AY + ++E
Sbjct: 73 CKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYCQYGYGKRYEKDKIYAXDAYRVSSDE 132
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EIF GPV+ S Y D YK+GIY H AG G HA++IIGWG E GT
Sbjct: 133 AAIRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGGHAVKIIGWGVE---NGT-- 187
Query: 455 VVKYWLVANSF 465
K W+VANS+
Sbjct: 188 --KXWIVANSW 196
>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
Length = 181
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 78/162 (48%), Positives = 103/162 (63%), Gaps = 8/162 (4%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE G +C TP+C +KCQ GY YE D N+G Y++ +N
Sbjct: 26 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKNYGDQRYNVISNA 85
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I +EI +GPVE + +Y D + YK+GIY+HV G +G HAIRIIGWG E
Sbjct: 86 KAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVE-------K 138
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
YWL+ANS+N +WGE GLFRIVRG++EC IE+++ AGL K
Sbjct: 139 RTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGLIK 180
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 74/160 (46%), Positives = 98/160 (61%), Gaps = 8/160 (5%)
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
Y V GIV+GG+ + GC+PY P CE G + +C TP+C +KCQ GY Y
Sbjct: 9 YLVKRGIVTGGSKENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQKCQKGYKTPY 68
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
E D N+G Y++ +N + I +EI +GPVE + +Y D + YK+GIY+HV G +G HA
Sbjct: 69 EQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHA 128
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IRIIGWG E YWL+ANS+N +WGE GLFRI
Sbjct: 129 IRIIGWGVE-------KRTPYWLIANSWNEDWGEKGLFRI 161
>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
Length = 312
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 95/245 (38%), Positives = 138/245 (56%), Gaps = 27/245 (11%)
Query: 93 PLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
P+ LP+ FD+R NWP C I +I DQG CGS WA+ + E + DR CI S GK+ LS
Sbjct: 72 PVANLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSP 131
Query: 153 DDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHS 211
L SC C +GC GG+ A+ + + GI+ + C PY++ C+ H
Sbjct: 132 QHLTSCTPGC-SGCNGGWMSTAFGFMQSNGILG-------EDCIPYQMGKCK------HP 177
Query: 212 SCQDNEPNTPECIR-KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
C + TP+C + KC P S E L +YS+ +NE I +EI+ +GPV S
Sbjct: 178 GC--STWPTPKCNKTKCYPNDTKSTE--LWHAASSYSVRSNEADIQKEIYENGPVTASFA 233
Query: 271 IYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
+Y D+ +Y++G+Y+HV GG G HAI+++GWG + +G VKYW + NS+ +WG +
Sbjct: 234 VYEDLSVYQSGVYQHVTGGFEGLHAIKVVGWG---ILDG----VKYWTIVNSWAEDWGFD 286
Query: 331 GLFRI 335
GL I
Sbjct: 287 GLLLI 291
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/169 (39%), Positives = 99/169 (58%), Gaps = 17/169 (10%)
Query: 330 NGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIR-KCQPGYDVSYEDDLNFGRIAY 388
NG+ C PY+ M + + P TP+C + KC P S E L +Y
Sbjct: 159 NGILGEDCIPYQ------MGKCKHPGCSTWP-TPKCNKTKCYPNDTKSTE--LWHAASSY 209
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
S+ +NE I +EI+ +GPV S +Y D+ +Y++G+Y+HV GG G HAI+++GWG +
Sbjct: 210 SVRSNEADIQKEIYENGPVTASFAVYEDLSVYQSGVYQHVTGGFEGLHAIKVVGWG---I 266
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+G VKYW + NS+ +WG +GL I RG +ECGIE+D+ AG PK+
Sbjct: 267 LDG----VKYWTIVNSWAEDWGFDGLLLIRRGVDECGIESDVVAGQPKL 311
>gi|312266|emb|CAA51531.1| cathepsin B-like enzyme [Gallus gallus]
Length = 156
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 78/154 (50%), Positives = 101/154 (65%), Gaps = 2/154 (1%)
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGN 164
WP CPTI EIRDQGSCGS WA G+VE +SDR+C+ + K V +S++DL+SCC +CG
Sbjct: 2 QWPNCPTISEIRDQGSCGSCWAFGSVEVISDRICVHTNAKVSVEVSAEDLLSCCGFECGM 61
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPEC 223
GC GG+ AW+YW G+VSGG Y S GC Y I PCE ++NGS C TP C
Sbjct: 62 GCNGGYPSGAWRYWTERGLVSGGLYDSHVGCAGYTIPPCEHHVNGSRPPCTGEGGETPRC 121
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMR 257
R C+PGY SY++D ++G Y +P +E+ I R
Sbjct: 122 SRHCEPGYSPSYKEDKHYGSHIYGVPRSEKEIYR 155
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 44/81 (54%), Gaps = 8/81 (9%)
Query: 327 WGENGLF-------RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W E GL +GC Y IP CE ++NGSR C TP C R C+PGY SY+
Sbjct: 75 WTERGLVSGGLYDSHVGCAGYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYK 134
Query: 379 DDLNFGRIAYSLPANEETIMR 399
+D ++G Y +P +E+ I R
Sbjct: 135 EDKHYGSHIYGVPRSEKEIYR 155
>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 344
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 101/289 (34%), Positives = 144/289 (49%), Gaps = 46/289 (15%)
Query: 84 LPLLVQLSDPLEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
L V ++ L ++P FDAR + C I + DQ +CGS WA+ VEA + R+CI S
Sbjct: 46 LEKKVYPTEELADIPSSFDARDAFKECKDVIGHVWDQSACGSCWAIAPVEAFNARLCIKS 105
Query: 143 RGKRHVRLSSDDLVSCCKDC----GNGCQGGFHGKAWKYWVTTGIVSGGTYASK------ 192
GK + LS+ ++++CC +GCQGG AW + GIV+GG + K
Sbjct: 106 GGKFNQLLSAGEMLACCNSVHSCNSHGCQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAA 165
Query: 193 QGCRPYEIP-------------C---------ERYMNGSHSSCQDNEPNTPECIRKC-QP 229
GC PY P C ER+ G+ +S +TP C+ +C
Sbjct: 166 DGCWPYSFPKCAHDQEDSKYEPCPEVRVPPLGERHQRGAGASIHQKLYDTPSCLDRCPNE 225
Query: 230 GYDVSYEDDLNFGRIAYSLP---ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV 286
Y + D +F A +LP + I +EI +GP S + Y D YK+G+YKH
Sbjct: 226 KYGTPRDKDRHF--TARALPYLFEGTDNIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHT 283
Query: 287 AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+GG LG+H++ IIGW GT V YWLV NS+N WG++G F+I
Sbjct: 284 SGGYLGDHSVEIIGW-------GTEKGVDYWLVMNSWNEGWGDHGTFKI 325
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 86/157 (54%), Gaps = 15/157 (9%)
Query: 345 ERYMNGSRSSCQANEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP---ANEETIMRE 400
ER+ G+ +S +TP C+ +C Y + D +F A +LP + I +E
Sbjct: 198 ERHQRGAGASIHQKLYDTPSCLDRCPNEKYGTPRDKDRHF--TARALPYLFEGTDNIKKE 255
Query: 401 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWL 460
I +GP S + Y D YK+G+YKH +GG LG+H++ IIGW GT V YWL
Sbjct: 256 IMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGW-------GTEKGVDYWL 308
Query: 461 VANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V NS+N WG++G F+I +G +CGI+ + LP +
Sbjct: 309 VMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLPAM 343
>gi|257215762|emb|CAX83033.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 233
Score = 162 bits (409), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 78/168 (46%), Positives = 106/168 (63%), Gaps = 3/168 (1%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ + R P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 56 SLDDARILMGARKEDAEMKRKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 114
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G + LS+ DL+SCCKDCG+GC+GGF G+AW YWV
Sbjct: 115 CGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCKGGFPGQAWDYWVKR 174
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQ 228
GIV+GG+ + GC+PY P CE G + +C TP+C + C
Sbjct: 175 GIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCH 222
>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 830
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 102/291 (35%), Positives = 143/291 (49%), Gaps = 55/291 (18%)
Query: 92 DPLEELPEGFDARINWPYC-PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+ L++LP FDAR +P C I IRDQ +CGS WA G EA +DR+CI S G L
Sbjct: 532 EELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGTFTELL 591
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQ------GCRPYEIP-CE 203
S+ ++ +C +GC GGF AW + GI +GG Y +K GC PY+ P C
Sbjct: 592 SAGEMNACAP--SHGCNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCA 649
Query: 204 RYMNGSH------SSCQDNEP----------------NTPECIRKCQ-PGYDVSYEDDLN 240
++N + SC P TP C +C P Y + DD +
Sbjct: 650 HHINDTKYPECPKVSCSGESPPATAETATVIAYQNSYETPNCAEQCHNPKYTTTLRDDRH 709
Query: 241 F----GRIAYSLPANEETIMRE-----IFRHGP------VEGSMTIYADMILYKTGIYKH 285
F YS+ + I + I+ P V S ++Y D + YK+G+YKH
Sbjct: 710 FMLESSPYQYSVNDAKNAIRTDGPVGPIYFCDPNVNFDQVSASFSVYEDFLAYKSGVYKH 769
Query: 286 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
+G LG HA++IIGWG+E S YW+V NS+N +WG++GLF+I
Sbjct: 770 TSGEYLGGHAVKIIGWGEE-------SGQAYWIVVNSWNEDWGDHGLFKIA 813
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 96/201 (47%), Gaps = 48/201 (23%)
Query: 336 GCRPYEIP-CERYMNGSR------SSCQANEP----------------NTPECIRKCQ-P 371
GC PY+ P C ++N ++ SC P TP C +C P
Sbjct: 639 GCWPYDFPPCAHHINDTKYPECPKVSCSGESPPATAETATVIAYQNSYETPNCAEQCHNP 698
Query: 372 GYDVSYEDDLNF----GRIAYSLPANEETIMRE-----IFRHGP------VEGSMTIYAD 416
Y + DD +F YS+ + I + I+ P V S ++Y D
Sbjct: 699 KYTTTLRDDRHFMLESSPYQYSVNDAKNAIRTDGPVGPIYFCDPNVNFDQVSASFSVYED 758
Query: 417 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 476
+ YK+G+YKH +G LG HA++IIGWG+E S YW+V NS+N +WG++GLF+
Sbjct: 759 FLAYKSGVYKHTSGEYLGGHAVKIIGWGEE-------SGQAYWIVVNSWNEDWGDHGLFK 811
Query: 477 IVRGQNECGIEADITAGLPKI 497
I G CGI+ ++ G PK+
Sbjct: 812 IALGN--CGIDDNLLGGTPKV 830
>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
Length = 330
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 105/288 (36%), Positives = 139/288 (48%), Gaps = 40/288 (13%)
Query: 53 GAEKNALSKLTLSELEMRM-GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
E +T+ E M G+ D + +P+ V S L++LPE F+ NWP
Sbjct: 51 AGETEIFKGMTMKEFRSSMLGLRLDRDYSE--VPVKVHSSTALKDLPESFNCYENWP--N 106
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGF 170
+ IRDQ CGS WA A E +SDR IAS G + LS +DLVSC D G+ GCQGG+
Sbjct: 107 YMHPIRDQARCGSCWAFAASEVLSDRFAIASNGTVNKILSPEDLVSC--DKGDMGCQGGY 164
Query: 171 HGKAWKYWVTTGIVSGGT--YASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQ 228
KAW Y T GIV+ YA+++G P SC D EP
Sbjct: 165 LDKAWDYLKTNGIVTESCFPYAAQKGVAP----------SCRISCVDGEPYK-------- 206
Query: 229 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH-VA 287
+ Y EE IM+EI+ +GPVE +Y + YK+G+Y H +
Sbjct: 207 -----------KYKASDYYQLTTEEDIMKEIYLNGPVEAGFRVYTSFMSYKSGVYHHRIL 255
Query: 288 GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HAI+I+GWG EP KYW+ ANS+ +WG NG F+I
Sbjct: 256 DIMEGGHAIKIVGWGVEPPKRFWQKPTKYWICANSWTADWGMNGFFKI 303
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 82/174 (47%), Gaps = 26/174 (14%)
Query: 330 NGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS 389
NG+ C PY A + P C C G E + Y
Sbjct: 175 NGIVTESCFPY---------------AAQKGVAPSCRISCVDG-----EPYKKYKASDYY 214
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH-VAGGPLGEHAIRIIGWGQEPL 448
EE IM+EI+ +GPVE +Y + YK+G+Y H + G HAI+I+GWG EP
Sbjct: 215 QLTTEEDIMKEIYLNGPVEAGFRVYTSFMSYKSGVYHHRILDIMEGGHAIKIVGWGVEPP 274
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN-----ECGIEADITAGLPKI 497
KYW+ ANS+ +WG NG F+I RG+N ECGIE + AG PK+
Sbjct: 275 KRFWQKPTKYWICANSWTADWGMNGFFKIRRGKNRFGQSECGIEDQVFAGHPKL 328
>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 398
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 148/294 (50%), Gaps = 22/294 (7%)
Query: 54 AEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYC-PT 112
AE+ +L + +M G ++ +++ V + L++LP FDAR +P C
Sbjct: 99 AEQEKFKTSSLRDAKMLCGTL--TRDSNDKVVEKVYAIEELKDLPTDFDARTAFPKCSKV 156
Query: 113 IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFH 171
I +RDQ +CG WA G EA +DR+CI S G LS+ ++ +C + GC+GGF
Sbjct: 157 IGHVRDQSACGDCWAFGVTEAFNDRLCIKSNGTFTKLLSAGEMNACAPSLKDPGCRGGFP 216
Query: 172 GKAWKYWVTTGIVSGGTYASKQ------GCRPYEIP-CERYM-NGSHSSCQDNEPNTPEC 223
AW + GI +GG Y + GC PY+ P C + + + +C C
Sbjct: 217 YSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPYDFPPCAHFFKDPKYPACPKFARVNLRC 276
Query: 224 IRKCQPGYDVSYEDD-LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
+ K + V + D + Y A++ I GPV + +Y D + YK+G+
Sbjct: 277 VSKLRHMMVVYFSDRYFMVESVPYHFSADDAK--NAIRTDGPVSATFYVYEDFLAYKSGV 334
Query: 283 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
YKH +G LG HA++IIGWG++ GE YWLV NS+N WG++GLF+I
Sbjct: 335 YKHTSGSLLGAHAVKIIGWGEDG-GEA------YWLVVNSWNEGWGDHGLFKIA 381
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 86/165 (52%), Gaps = 14/165 (8%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDD-LNFGRIAYSLPA 392
GC PY+ P C + + +C C+ K + V + D + Y A
Sbjct: 245 GCWPYDFPPCAHFFKDPKYPACPKFARVNLRCVSKLRHMMVVYFSDRYFMVESVPYHFSA 304
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
++ I GPV + +Y D + YK+G+YKH +G LG HA++IIGWG++ GE
Sbjct: 305 DDAK--NAIRTDGPVSATFYVYEDFLAYKSGVYKHTSGSLLGAHAVKIIGWGEDG-GEA- 360
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLV NS+N WG++GLF+I G +CGI+ ++ G PK+
Sbjct: 361 -----YWLVVNSWNEGWGDHGLFKIALG--DCGIDNELLGGTPKV 398
>gi|12330244|gb|AAG52659.1| cysteine proteinase [Metagonimus yokogawai]
Length = 183
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 83/182 (45%), Positives = 106/182 (58%), Gaps = 4/182 (2%)
Query: 130 AVEAMSDRVCIAS-RGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGT 188
AV +MSDRVCI S + K +V+LS+ DL+SCC CG GC GG+ G AW YW GIV+GG
Sbjct: 1 AVTSMSDRVCIHSNQNKTNVQLSARDLLSCCTSCGFGCVGGWIGDAWDYWRDNGIVTGGD 60
Query: 189 YASKQGCRPYEIPCERYMNGSHSSCQDNEPN---TPECIRKCQPGYDVSYEDDLNFGRIA 245
Y K C PY P ++ + + TP C+ KCQ GY YE D F +
Sbjct: 61 YQDKSTCLPYPFPPSHHLVSKGTPFEIYPQTLYPTPPCVSKCQEGYPGEYEKDKIFALSS 120
Query: 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 305
Y + N I +EI +GPVE M +YAD YKTG+Y+H G LG HAIR++GWG+
Sbjct: 121 YKIDRNATEIQKEILINGPVEAGMNVYADFPNYKTGVYQHTTGEILGGHAIRLLGWGKTK 180
Query: 306 LG 307
G
Sbjct: 181 DG 182
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 67/133 (50%), Gaps = 10/133 (7%)
Query: 327 WGENGLFRIG-------CRPYEIPCERYMNGSRSSCQANEPN---TPECIRKCQPGYDVS 376
W +NG+ G C PY P ++ + + TP C+ KCQ GY
Sbjct: 50 WRDNGIVTGGDYQDKSTCLPYPFPPSHHLVSKGTPFEIYPQTLYPTPPCVSKCQEGYPGE 109
Query: 377 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 436
YE D F +Y + N I +EI +GPVE M +YAD YKTG+Y+H G LG H
Sbjct: 110 YEKDKIFALSSYKIDRNATEIQKEILINGPVEAGMNVYADFPNYKTGVYQHTTGEILGGH 169
Query: 437 AIRIIGWGQEPLG 449
AIR++GWG+ G
Sbjct: 170 AIRLLGWGKTKDG 182
>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
rotundata]
Length = 442
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 105/316 (33%), Positives = 153/316 (48%), Gaps = 34/316 (10%)
Query: 29 VFCDLSKAFDRVDHSILLPKLPFYGAEKN--ALSKLTLSE-LEMRMGVHPDSKLPQNRLP 85
V C+ ++ D + +P +N + TL + + +R+G S+ R+
Sbjct: 131 VLCEQNRCLQEPDLIDEVNAMPLNWRARNYSEFNGRTLKDGMRLRLGTLNPSR-SVYRMN 189
Query: 86 LLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGK 145
+ ++ DP E LP FD+R WP I +I DQG CG+ WA+ + + SDR I S+G
Sbjct: 190 AVRRIYDP-ESLPREFDSRTRWPR--DISKITDQGWCGASWAISSAQVASDRFAIMSKGT 246
Query: 146 RHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY---ASKQGCRPYEIPC 202
V LS+ L+SC GC GG +AW + G+V Y AS + CR
Sbjct: 247 DAVELSAQHLLSCNNRGQQGCSGGHLDRAWMFMRRFGLVDENCYPWKASTETCR-----L 301
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
+ + + C + K P Y + ANE IM+EI
Sbjct: 302 RKRTDLRSAGCAPPPNPLRTELYKVGPAYRL----------------ANETDIMQEILTS 345
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEPLGEGTSSVVKYWLV 319
GPV+ +M +Y D Y++G+YKH L E H++RIIGWG+EP ++ +KYWLV
Sbjct: 346 GPVQATMRVYQDFFSYESGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSRNTPLKYWLV 405
Query: 320 ANSFNTNWGENGLFRI 335
ANS+ WGENGLFRI
Sbjct: 406 ANSWGQQWGENGLFRI 421
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/107 (51%), Positives = 72/107 (67%), Gaps = 4/107 (3%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE---HAIRIIGW 443
AY L ANE IM+EI GPV+ +M +Y D Y++G+YKH L E H++RIIGW
Sbjct: 329 AYRL-ANETDIMQEILTSGPVQATMRVYQDFFSYESGVYKHSVTAELYESDYHSVRIIGW 387
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADI 490
G+EP ++ +KYWLVANS+ WGENGLFRI +G NEC IE+ +
Sbjct: 388 GEEPPTYSRNTPLKYWLVANSWGQQWGENGLFRIQKGTNECEIESFV 434
>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 159/307 (51%), Gaps = 44/307 (14%)
Query: 50 PFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPE-GFDARINWP 108
P Y L + + V P SKL N L + + E P G++A +N
Sbjct: 4 PLYLGTLFLLVAALFTFRSQVIAVEPVSKLKLNSRILQDSIVQKVNENPNAGWEATMNPQ 63
Query: 109 YC--------------PTI-QEIRD--QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLS 151
+ PT +E+R G CGS WA GAVE++SDR CI ++ LS
Sbjct: 64 FSNYSVGEFKYLLGVKPTPGKELRGVPLGHCGSCWAFGAVESLSDRFCI--HYGMNLSLS 121
Query: 152 SDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNG 208
+DL++CC CG+GC GG+ AW+Y+V +G+V+ + C PY +I C
Sbjct: 122 VNDLLACCGWMCGDGCDGGYPIDAWRYFVQSGVVT-------EECDPYFDDIGC------ 168
Query: 209 SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
SH C+ P TP+C RKC + + + +F AY + ++ +IM E+ +GPVE +
Sbjct: 169 SHPGCEPGFP-TPKCERKCADKNKL-WAESKHFSVNAYRIDSDPHSIMAEVSMNGPVEVA 226
Query: 269 MTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
T+Y D YK+G+YKH+ G +G HA+++IGWG GE YWL+AN +N WG
Sbjct: 227 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE------DYWLLANQWNRGWG 280
Query: 329 ENGLFRI 335
++G F+I
Sbjct: 281 DDGYFKI 287
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 68/169 (40%), Positives = 99/169 (58%), Gaps = 16/169 (9%)
Query: 329 ENGLFRIGCRPY--EIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
++G+ C PY +I C S C+ P TP+C RKC + + + +F
Sbjct: 151 QSGVVTEECDPYFDDIGC------SHPGCEPGFP-TPKCERKCADKNKL-WAESKHFSVN 202
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
AY + ++ +IM E+ +GPVE + T+Y D YK+G+YKH+ G +G HA+++IGWG
Sbjct: 203 AYRIDSDPHSIMAEVSMNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 262
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GE YWL+AN +N WG++G F+I RG NECGIE D+ AGLP
Sbjct: 263 DDGE------DYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLP 305
>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 111/281 (39%), Positives = 151/281 (53%), Gaps = 26/281 (9%)
Query: 59 LSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T SE + G + +S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFSEAKRLTGAWIQKNSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +C + WA+ A+SDR C GK+ +R+S+ L+SCCK CG GC+GGF G AW
Sbjct: 110 ADQSACRASWAVSTASAISDRYCTVGGGKQ-LRISAAHLLSCCKQCGGGCKGGFPGFAWL 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CE-RYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V GI S GC+PY P CE R G+ + C + +TP+C C D S
Sbjct: 169 YYVEYGIAS-------SGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCT---DKS 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G Y L EE RE++ +GP +Y D+ YK+G+Y++V G LG
Sbjct: 219 IPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQ 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+RI+GWG+ GT YW VANS++T+WG NG I
Sbjct: 279 AVRIVGWGKL---NGTP----YWKVANSWDTDWGMNGYMLI 312
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 89/169 (52%), Gaps = 12/169 (7%)
Query: 329 ENGLFRIGCRPYEIP-CE-RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E G+ GC+PY P CE R G+++ C + +TP+C C D S G
Sbjct: 172 EYGIASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCT---DKSIPLVKYRGNA 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
Y L EE RE++ +GP +Y D+ YK+G+Y++V G LG A+RI+GWG+
Sbjct: 229 TYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL 288
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT YW VANS++T+WG NG I+ G NEC IE G P
Sbjct: 289 ---NGTP----YWKVANSWDTDWGMNGYMLILGGNNECNIEHLGFTGFP 330
>gi|157058733|gb|ABV03124.1| cathepsin B-16a [Acyrthosiphon pisum]
Length = 274
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 76/189 (40%), Positives = 108/189 (57%), Gaps = 3/189 (1%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W +C TI E+RDQG CGS WA+ A +DR+C+A+ G + LS++++
Sbjct: 84 IPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 143
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ KAWKY+ + GIV+GG Y S +GC PY + PC + G S
Sbjct: 144 FCCHTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQDEEGKSSCAGK 203
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C D+ Y +D F R Y L +I +++ +GP+E S +Y D
Sbjct: 204 PIEKNHRCTRMCYGNQDLDYNEDHRFTRDYYYLTYG--SIQKDVMNYGPIEASFDVYDDF 261
Query: 276 ILYKTGIYK 284
YK+G+Y+
Sbjct: 262 PSYKSGVYQ 270
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 7/116 (6%)
Query: 314 VKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEP--NTPECIRKCQ 370
+K W +S G N GC PY +P C + G +SSC A +P C R C
Sbjct: 159 IKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQDEEG-KSSC-AGKPIEKNHRCTRMCY 216
Query: 371 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK 426
D+ Y +D F R Y L +I +++ +GP+E S +Y D YK+G+Y+
Sbjct: 217 GNQDLDYNEDHRFTRDYYYLTYG--SIQKDVMNYGPIEASFDVYDDFPSYKSGVYQ 270
>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 306
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 92/242 (38%), Positives = 122/242 (50%), Gaps = 33/242 (13%)
Query: 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
++ +P FDAR WP +I IRDQ CGS WA GA EA+SDR+ IAS +V LS
Sbjct: 78 VDAIPTSFDARTQWP--ASIHPIRDQQQCGSCWAFGATEALSDRLAIASNNSINVVLSPQ 135
Query: 154 DLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC 213
DLVSC GC GG+ AW Y + G+V+ C PY NG +C
Sbjct: 136 DLVSC-DSTDYGCDGGYPINAWHYMQSLGVVT-------DTCYPYT-----SGNGDSGTC 182
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
Q TP C + AY + N I EI +GPVE + ++Y
Sbjct: 183 QITGKKTPACATAT-----------FYKAKTAYQVANNMAAIQSEILANGPVEAAFSVYD 231
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D Y +G+Y H +G G HA++I+GWG +GT+ YW+VANS+ T+WG+ G F
Sbjct: 232 DFFSYTSGVYSHQSGALDGGHAVKIVGWGV----DGTT---PYWIVANSWGTSWGQAGFF 284
Query: 334 RI 335
I
Sbjct: 285 WI 286
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 60/167 (35%), Positives = 82/167 (49%), Gaps = 23/167 (13%)
Query: 331 GLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 390
G+ C PY NG +CQ TP C + AY +
Sbjct: 163 GVVTDTCYPYT-----SGNGDSGTCQITGKKTPACATAT-----------FYKAKTAYQV 206
Query: 391 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE 450
N I EI +GPVE + ++Y D Y +G+Y H +G G HA++I+GWG +
Sbjct: 207 ANNMAAIQSEILANGPVEAAFSVYDDFFSYTSGVYSHQSGALDGGHAVKIVGWGV----D 262
Query: 451 GTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT+ YW+VANS+ T+WG+ G F I RG +ECGIE I AGL +
Sbjct: 263 GTT---PYWIVANSWGTSWGQAGFFWIKRGNDECGIEDGIVAGLAAV 306
>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 110/281 (39%), Positives = 150/281 (53%), Gaps = 27/281 (9%)
Query: 59 LSKLTLSELEMRMGV--HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ LT SE + G S LP P+ ELPE FDA WP+CPTI+EI
Sbjct: 55 MQNLTFSEAKRLTGAFSRKTSTLP----PVRFTEEQLRTELPESFDAAEKWPHCPTIREI 110
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +C + WA+ A+SDR C GK+ +R+S+ DL++CC CG GC+GG+ AW+
Sbjct: 111 PDQSACRASWAVATASAISDRYCTVGNGKQ-LRISAADLMACCTGCGGGCEGGYPDAAWE 169
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CE-RYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V+ GI S C+PY P CE R G C +TP C C D S
Sbjct: 170 YYVSNGITSS-------QCQPYPFPRCEHRGAQGKKPPCSKYNFDTPTCNATCT---DKS 219
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G +Y + EE RE++ +GP +++D + YK+G+Y+HVAG LG
Sbjct: 220 VPLIKYRGNHSYEV-RGEEDYKRELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGK 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+RI+GWG+ GT YW VANS++T+WG NG F I
Sbjct: 279 AVRIVGWGKM---NGTP----YWKVANSWDTDWGMNGYFLI 312
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/168 (39%), Positives = 90/168 (53%), Gaps = 13/168 (7%)
Query: 330 NGLFRIGCRPYEIP-CE-RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
NG+ C+PY P CE R G + C +TP C C D S G +
Sbjct: 174 NGITSSQCQPYPFPRCEHRGAQGKKPPCSKYNFDTPTCNATCT---DKSVPLIKYRGNHS 230
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
Y + EE RE++ +GP +++D + YK+G+Y+HVAG LG A+RI+GWG+
Sbjct: 231 YEV-RGEEDYKRELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKM- 288
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT YW VANS++T+WG NG F I+RG NEC IE AG P
Sbjct: 289 --NGTP----YWKVANSWDTDWGMNGYFLILRGNNECNIEHLGFAGTP 330
>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
Length = 432
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/310 (34%), Positives = 156/310 (50%), Gaps = 36/310 (11%)
Query: 32 DLSKAFDRVDHSI-LLPKLPFYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQ 89
DL D + HS+ + +L + + + SE L +R+G +K P R+ + +
Sbjct: 124 DLCLTDDELVHSVNSIHRLGWSARKYDEWWGHKYSEGLRLRLG----TKEPTYRVKAMTR 179
Query: 90 LSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
L++P ++LP F+A W I E+ DQG CGS W L SDR I S+GK V+
Sbjct: 180 LTNPSDDLPRKFNAVEKWS--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQ 237
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGS 209
LS+ +++SC + GC+GG AW+Y G++ + C PY
Sbjct: 238 LSAQNILSCTRR-QQGCEGGHLDAAWRYLHKKGVLD-------EKCYPY--------TQH 281
Query: 210 HSSCQDNEPNTPEC-IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
SC+ N+ CQP Y V+ D L AYSL + E IM EI+ GPV+ +
Sbjct: 282 RDSCKIQRHNSRSLKANGCQPAYGVN-RDSLYTVGPAYSL-SREADIMAEIYHSGPVQAT 339
Query: 269 MTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 325
M IY D Y GIY+ A G P G H+++++GWG+E G VKYW+ ANS+
Sbjct: 340 MRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDG------VKYWIAANSWGP 393
Query: 326 NWGENGLFRI 335
WGE+G FRI
Sbjct: 394 WWGEHGYFRI 403
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 69/173 (39%), Positives = 91/173 (52%), Gaps = 20/173 (11%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPEC-IRKCQPGYDVSYEDDLNFGRIA 387
+ G+ C PY R SC+ N+ CQP Y V+ D L A
Sbjct: 267 KKGVLDEKCYPY--------TQHRDSCKIQRHNSRSLKANGCQPAYGVN-RDSLYTVGPA 317
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWG 444
YSL + E IM EI+ GPV+ +M IY D Y GIY+ A G P G H+++++GWG
Sbjct: 318 YSL-SREADIMAEIYHSGPVQATMRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWG 376
Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+E G VKYW+ ANS+ WGE+G FRI+RG NECGIE + A P +
Sbjct: 377 EEHDG------VKYWIAANSWGPWWGEHGYFRILRGSNECGIEEYVLASWPYV 423
>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 192
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 77/175 (44%), Positives = 110/175 (62%), Gaps = 9/175 (5%)
Query: 162 CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNT 220
CG+GC GG+ AW+++ IV+GG Y ++ GC+PY P CE + G +C +P T
Sbjct: 3 CGSGCNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPPCEHHTVGPLPNCTGIKP-T 61
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 280
PEC + C+ GY SY D +FG+ YS+ ++E I EI+++GPVE ++YAD YK+
Sbjct: 62 PECAKTCREGYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADFPSYKS 121
Query: 281 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y+ + LG HAIRI+GW GT V YWLVANS+N +WG+ G F+I
Sbjct: 122 GVYQRHSEEMLGGHAIRILGW-------GTEDGVPYWLVANSWNEDWGDKGYFKI 169
Score = 154 bits (390), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 77/162 (47%), Positives = 104/162 (64%), Gaps = 9/162 (5%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY P CE + G +C +P TPEC + C+ GY SY D +FG+ YS+ ++E
Sbjct: 35 GCQPYYFPPCEHHTVGPLPNCTGIKP-TPECAKTCREGYQKSYTRDKHFGKKVYSISSDE 93
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I EI+++GPVE ++YAD YK+G+Y+ + LG HAIRI+GW GT
Sbjct: 94 TQIKTEIYKNGPVEADFSVYADFPSYKSGVYQRHSEEMLGGHAIRILGW-------GTED 146
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWLVANS+N +WG+ G F+I RG +ECGIE DI AG+PK
Sbjct: 147 GVPYWLVANSWNEDWGDKGYFKIRRGNDECGIEDDINAGIPK 188
>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
Length = 234
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 127/219 (57%), Gaps = 26/219 (11%)
Query: 120 GSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYW 178
G CGS WA GAVE + DR CI ++ LS +DLV+CC CG+GC GG+ AW+Y+
Sbjct: 1 GHCGSCWAFGAVECLQDRFCI--HFNMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYF 58
Query: 179 VTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
V G+V+ C PY ++ C+ H C+ P TP C +KC+ V E
Sbjct: 59 VRNGVVT-------DECDPYFDQVGCK------HPGCEPAYP-TPVCEKKCKVQNQVWLE 104
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
+F AY + ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+
Sbjct: 105 KK-HFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAV 163
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 164 KLIGWGTTDAGE------DYWLLANQWNRGWGDDGYFKI 196
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 100/168 (59%), Gaps = 16/168 (9%)
Query: 330 NGLFRIGCRPY--EIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
NG+ C PY ++ C+ C+ P TP C +KC+ V E +F A
Sbjct: 61 NGVVTDECDPYFDQVGCKH------PGCEPAYP-TPVCEKKCKVQNQVWLEKK-HFSVNA 112
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
Y + ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG
Sbjct: 113 YRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTD 172
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GE YWL+AN +N WG++G F+I+RG NECGIE D+ AG+P
Sbjct: 173 AGE------DYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMP 214
>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 94/254 (37%), Positives = 126/254 (49%), Gaps = 42/254 (16%)
Query: 84 LPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASR 143
+P Q++ +P+ FD+R W C + IRDQ CGS WA A E++SDR CIAS+
Sbjct: 68 IPAFTQIN---AAVPDSFDSRTQWQGC--VHPIRDQAQCGSCWAFAASESLSDRFCIASQ 122
Query: 144 GKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPC 202
GK +V LS D+VSC D N GC GG+ AW+Y G+ S C PY
Sbjct: 123 GKVNVVLSPQDMVSC--DTNNYGCDGGYLNLAWQYLEKKGVAS-------DSCEPY---- 169
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETIMREIFR 261
+ P C KC G + Y+ + A A + I +
Sbjct: 170 -----------KSASGTAPSCPSKCSNGQAIKKYKCKAGSTKQANGAAATKSLIQQS--- 215
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
GPVE T+YAD YK+GIY HV+GG G HA++I+GWG++ YW+VAN
Sbjct: 216 -GPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQ-------GSENYWIVAN 267
Query: 322 SFNTNWGENGLFRI 335
S+ +WGE G F I
Sbjct: 268 SWGESWGEKGFFNI 281
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/160 (33%), Positives = 79/160 (49%), Gaps = 22/160 (13%)
Query: 344 CERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETIMREIF 402
CE Y + S ++ P C KC G + Y+ + A A + I +
Sbjct: 166 CEPYKSASGTA--------PSCPSKCSNGQAIKKYKCKAGSTKQANGAAATKSLIQQS-- 215
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 462
GPVE T+YAD YK+GIY HV+GG G HA++I+GWG++ YW+VA
Sbjct: 216 --GPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQ-------GSENYWIVA 266
Query: 463 NSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEID 502
NS+ +WGE G F I +G + GI+ +P + ++
Sbjct: 267 NSWGESWGEKGFFNIRQG--DSGIDQATFGCIPDLSSALE 304
>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 94/254 (37%), Positives = 126/254 (49%), Gaps = 42/254 (16%)
Query: 84 LPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASR 143
+P Q++ +P+ FD+R W C + IRDQ CGS WA A E++SDR CIAS+
Sbjct: 68 IPAFTQIN---AAVPDSFDSRTQWQGC--VHPIRDQAQCGSCWAFAASESLSDRFCIASQ 122
Query: 144 GKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPC 202
GK +V LS D+VSC D N GC GG+ AW+Y G+ S C PY
Sbjct: 123 GKVNVVLSPQDMVSC--DTNNYGCDGGYLNLAWQYLEKKGVAS-------DSCEPY---- 169
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETIMREIFR 261
+ P C KC G + Y+ + A A + I +
Sbjct: 170 -----------KSASGTAPSCPSKCANGQAIKKYKCQAGSTKQANGAAATKSLIQQS--- 215
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
GPVE T+YAD YK+GIY HV+GG G HA++I+GWG++ YW+VAN
Sbjct: 216 -GPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQ-------GSENYWIVAN 267
Query: 322 SFNTNWGENGLFRI 335
S+ +WGE G F I
Sbjct: 268 SWGESWGEKGFFNI 281
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/160 (33%), Positives = 79/160 (49%), Gaps = 22/160 (13%)
Query: 344 CERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETIMREIF 402
CE Y + S ++ P C KC G + Y+ + A A + I +
Sbjct: 166 CEPYKSASGTA--------PSCPSKCANGQAIKKYKCQAGSTKQANGAAATKSLIQQS-- 215
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 462
GPVE T+YAD YK+GIY HV+GG G HA++I+GWG++ YW+VA
Sbjct: 216 --GPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQ-------GSENYWIVA 266
Query: 463 NSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEID 502
NS+ +WGE G F I +G + GI+ +P + ++
Sbjct: 267 NSWGESWGEKGFFNIRQG--DSGIDQATFGCIPDLSSALE 304
>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
Length = 326
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 105/291 (36%), Positives = 155/291 (53%), Gaps = 43/291 (14%)
Query: 71 MGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALG 129
+G+HPD P ++ + +PE FDAR WP C I +IR+QG+CGS WA
Sbjct: 53 LGLHPD---PNYKIQTKQHKISRIISIPESFDAREKWPECKDVIGKIRNQGNCGSCWAFA 109
Query: 130 AVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY 189
+ E M+DR+CI+S+GK S ++L++CCKDCG GC+GG+ AW Y++ GI SGG Y
Sbjct: 110 STEVMTDRLCISSKGKIKFVFSPENLLTCCKDCGCGCKGGYIKNAWDYYINEGIASGGDY 169
Query: 190 ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 249
S +GC+PY S SS Q E + EC++ Y+L
Sbjct: 170 NSSEGCQPY----------SESSFQYAEAS--ECVK-------------------FYTLE 198
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
N I EI +GPV ++ D +K+G+Y + +G +G H++++IGWG E EG
Sbjct: 199 TNVAQIQMEILTNGPVMAYYNVFEDFACHKSGVYYYKSGKFVGRHSVKVIGWGTE---EG 255
Query: 310 TSSVVKYWLVANSFNTNWGE-NGLFRIGCRPYEIPCERYMNGSRSSCQANE 359
+ YWL+ANS+ + WGE G F++ E E+ M + + NE
Sbjct: 256 ----IPYWLIANSWGSEWGELGGFFKMRRGTNECWIEQEMTAGKVHIEGNE 302
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 80/139 (57%), Gaps = 13/139 (9%)
Query: 369 CQPGYDVSYE-DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 427
CQP + S++ + + Y+L N I EI +GPV ++ D +K+G+Y +
Sbjct: 175 CQPYSESSFQYAEASECVKFYTLETNVAQIQMEILTNGPVMAYYNVFEDFACHKSGVYYY 234
Query: 428 VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE-NGLFRIVRGQNECGI 486
+G +G H++++IGWG E EG + YWL+ANS+ + WGE G F++ RG NEC I
Sbjct: 235 KSGKFVGRHSVKVIGWGTE---EG----IPYWLIANSWGSEWGELGGFFKMRRGTNECWI 287
Query: 487 EADITAGLPKIGLEIDSNE 505
E ++TAG + I+ NE
Sbjct: 288 EQEMTAG----KVHIEGNE 302
>gi|157058735|gb|ABV03125.1| cathepsin B-16 [Aulacorthum solani]
Length = 246
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 74/180 (41%), Positives = 104/180 (57%), Gaps = 3/180 (1%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W +C TI E+RDQG+CGS WA G A +DR+C+A+ G + LS +++
Sbjct: 68 IPRTFDARKRWRHCKTIGEVRDQGNCGSCWAFGTSSAFADRLCVATDGDFNELLSPEEIA 127
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ KAWKY+ T G+V+GG Y S +GC PY + PC+ + G++S
Sbjct: 128 FCCHTCGFGCHGGYPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCQHHHQGNNSCSDK 187
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C R C D+ Y DD F R Y L +I +++ +GP+E S +Y D
Sbjct: 188 PMEKNHRCTRMCYGDQDLDYNDDHRFTRDYYYLTYG--SIQKDVMNYGPIEASFDVYDDF 245
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 45/105 (42%), Gaps = 3/105 (2%)
Query: 314 VKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPG 372
+K W ++ G N GC PY +P C+ + G+ S C R C
Sbjct: 143 IKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCQHHHQGNNSCSDKPMEKNHRCTRMCYGD 202
Query: 373 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 417
D+ Y DD F R Y L +I +++ +GP+E S +Y D
Sbjct: 203 QDLDYNDDHRFTRDYYYLTYG--SIQKDVMNYGPIEASFDVYDDF 245
>gi|161343829|tpg|DAA06095.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 280
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 83/215 (38%), Positives = 119/215 (55%), Gaps = 10/215 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP FDAR WP CP+I I +QG+C S +A+ A++DR+CI S ++ +S+ ++
Sbjct: 63 LPINFDARKRWPNCPSIGHIYNQGNCRSSYAISVASAVTDRICIHSNETKNPIMSAQQII 122
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGS---HSSC 213
SCC CG GC GG ++W ++ G VSGG Y S QGC+PY IP + +N HS
Sbjct: 123 SCCYLCGYGCDGGSQFESWDFYRRHGFVSGGDYNSNQGCQPYMIPPCKLINEKSPRHSCT 182
Query: 214 QDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
N TP C KC P Y S++ D+ G+ P M+EIF +GP+ +Y
Sbjct: 183 TYNREETPACEIKCNNPNYYSSFKTDIYKGKYYQVYPF---MAMKEIFDNGPITTQFYMY 239
Query: 273 ADMILYKTGIYKH---VAGGPLGEHAIRIIGWGQE 304
D+I YK+G+Y++ G +IIGWG+E
Sbjct: 240 RDLIDYKSGVYQYDEGFYGDFFTVQGXKIIGWGEE 274
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 60/118 (50%), Gaps = 10/118 (8%)
Query: 336 GCRPYEIPCERYMN--GSRSSCQA-NEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLP 391
GC+PY IP + +N R SC N TP C KC P Y S++ D+ G+ P
Sbjct: 160 GCQPYMIPPCKLINEKSPRHSCTTYNREETPACEIKCNNPNYYSSFKTDIYKGKYYQVYP 219
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH---VAGGPLGEHAIRIIGWGQE 446
M+EIF +GP+ +Y D+I YK+G+Y++ G +IIGWG+E
Sbjct: 220 F---MAMKEIFDNGPITTQFYMYRDLIDYKSGVYQYDEGFYGDFFTVQGXKIIGWGEE 274
>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 74/177 (41%), Positives = 111/177 (62%), Gaps = 2/177 (1%)
Query: 129 GAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGT 188
GAVEAM+DR+CI S +SS DL+SCC+ CG GC GGF +AW +W+ G+V+GG+
Sbjct: 1 GAVEAMTDRLCIHSNATIKKHISSTDLLSCCESCGFGCHGGFPPRAWDFWMENGLVTGGS 60
Query: 189 YASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS 247
+ GCR Y P C + G + C + TP C + C +V+Y D + +Y+
Sbjct: 61 KENPSGCRSYPFPKCNHHGKGPDAPCPEKIFPTPACNKTCDTP-EVNYILDKTKAKSSYN 119
Query: 248 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
+P +E+ IM+EI ++GPVE + +Y D + Y++G+Y H G +G HAIR++GWG+E
Sbjct: 120 VPNSEKAIMKEIMQNGPVEAAFEVYEDFLHYESGVYFHSFGRMIGGHAIRMLGWGEE 176
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 71/128 (55%), Gaps = 9/128 (7%)
Query: 327 WGENGLFR-------IGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
W ENGL GCR Y P C + G + C TP C + C +V+Y
Sbjct: 50 WMENGLVTGGSKENPSGCRSYPFPKCNHHGKGPDAPCPEKIFPTPACNKTCDTP-EVNYI 108
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 438
D + +Y++P +E+ IM+EI ++GPVE + +Y D + Y++G+Y H G +G HAI
Sbjct: 109 LDKTKAKSSYNVPNSEKAIMKEIMQNGPVEAAFEVYEDFLHYESGVYFHSFGRMIGGHAI 168
Query: 439 RIIGWGQE 446
R++GWG+E
Sbjct: 169 RMLGWGEE 176
>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
Length = 121
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 67/126 (53%), Positives = 98/126 (77%), Gaps = 7/126 (5%)
Query: 372 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
GY+VSYE+D +G++ Y + +N+E IM+E+ +HGPVE +YAD YK+G+Y+HV+G
Sbjct: 1 GYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGA 60
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
LG HA+R++GWG+E + V YWL+ANS+NT+WG+NG F+I+RG+NECGIE+D+
Sbjct: 61 LLGGHAVRLLGWGEE-------NNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVN 113
Query: 492 AGLPKI 497
AG+PKI
Sbjct: 114 AGIPKI 119
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 53/106 (50%), Positives = 79/106 (74%), Gaps = 7/106 (6%)
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
GY+VSYE+D +G++ Y + +N+E IM+E+ +HGPVE +YAD YK+G+Y+HV+G
Sbjct: 1 GYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGA 60
Query: 290 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HA+R++GWG+E + V YWL+ANS+NT+WG+NG F+I
Sbjct: 61 LLGGHAVRLLGWGEE-------NNVPYWLIANSWNTDWGDNGYFKI 99
>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 382
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 93/247 (37%), Positives = 127/247 (51%), Gaps = 20/247 (8%)
Query: 92 DPLEELPEGFDARINWPYC-PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+ L++LP FDAR +P C I IRDQ +CGS WA G EA +DR+CI S G L
Sbjct: 137 EELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGAFTELL 196
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSH 210
S+ ++ +C GC GG AW + GI +G K+ IP Y
Sbjct: 197 SAGEMNACT--LFFGCGGGDPYSAWSWVHDKGIATGEGSRPKRVSESEAIPVIAY----- 249
Query: 211 SSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
QD P TP C+ +C+ P Y + DD +F + + I GPV S
Sbjct: 250 ---QDIYP-TPNCVEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRTDGPVSASF 305
Query: 270 TIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 329
T+Y D + YK+G+YKH +G LG HA++IIGWG++ S YWL NS+N +WG+
Sbjct: 306 TVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEK-------SGQAYWLAVNSWNEDWGD 358
Query: 330 NGLFRIG 336
GLF+I
Sbjct: 359 KGLFKIA 365
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/137 (39%), Positives = 77/137 (56%), Gaps = 10/137 (7%)
Query: 362 TPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILY 420
TP C+ +C+ P Y + DD +F + + I GPV S T+Y D + Y
Sbjct: 255 TPNCVEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRTDGPVSASFTVYEDFLAY 314
Query: 421 KTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG 480
K+G+YKH +G LG HA++IIGWG++ S YWL NS+N +WG+ GLF+I G
Sbjct: 315 KSGVYKHTSGSYLGGHAVKIIGWGEK-------SGQAYWLAVNSWNEDWGDKGLFKIALG 367
Query: 481 QNECGIEADITAGLPKI 497
CGI+ D+ G PK+
Sbjct: 368 N--CGIDDDLLGGTPKV 382
>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
floridanus]
Length = 443
Score = 155 bits (391), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 103/297 (34%), Positives = 148/297 (49%), Gaps = 36/297 (12%)
Query: 47 PKLPFYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARI 105
P L + + TL + +++R+G S+ P+ ++ DP + LP F++R
Sbjct: 154 PTLGWQAGNYSEFWGRTLKDGVQLRLGTLNPSQSVYKMNPVR-RIYDP-DALPREFNSRT 211
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNG 165
WP I +I DQG CG+ WA+ + SDR I S+G V LS+ L+SC G
Sbjct: 212 RWPR--DISDIHDQGWCGASWAVSTADVASDRFAIMSKGAETVELSAQHLLSCNNRGQQG 269
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTY---ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPE 222
C+GG+ +AW + G+V Y CR + N + CQ N PN+
Sbjct: 270 CKGGYLDRAWLFMRKFGLVDEECYPWTGRNDQCR-----LRKRSNLKTAGCQ-NPPNSLR 323
Query: 223 C-IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 281
+ K P Y + NE IM+EI GPV+ +M +Y D +Y++G
Sbjct: 324 TELYKVGPAYRL----------------GNETDIMQEILTSGPVQATMRVYQDFFVYQSG 367
Query: 282 IYKHVAGGPL---GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+Y+H L G H++RIIGWG+EP G +KYWLVANS+ NWGENGLFRI
Sbjct: 368 VYRHSRSAELHDSGYHSVRIIGWGEEPSYRGPP--LKYWLVANSWGHNWGENGLFRI 422
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 74/113 (65%), Gaps = 6/113 (5%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---GEHAIRIIGW 443
AY L NE IM+EI GPV+ +M +Y D +Y++G+Y+H L G H++RIIGW
Sbjct: 332 AYRL-GNETDIMQEILTSGPVQATMRVYQDFFVYQSGVYRHSRSAELHDSGYHSVRIIGW 390
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
G+EP G +KYWLVANS+ NWGENGLFRI +G NEC IE+ + A K
Sbjct: 391 GEEPSYRGPP--LKYWLVANSWGHNWGENGLFRIQKGTNECEIESYVLAVWAK 441
>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
Length = 205
Score = 154 bits (390), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 79/174 (45%), Positives = 107/174 (61%), Gaps = 11/174 (6%)
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNG-SHSSCQDNEPNTPEC 223
C+GG+ +AWK+WV G+V+GG+Y S+ GC+PY I PC + +NG + C ++ TP+C
Sbjct: 14 CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 73
Query: 224 IRKCQPG--YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 281
+ C Y Y D +FG AY++ E I EI HGP+E + T+Y D Y TG
Sbjct: 74 VEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTTG 133
Query: 282 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+Y H AG LG HA++I+GWG + GT YWLVANS+N NWGE G FRI
Sbjct: 134 VYVHTAGKSLGGHAVKILGWG---VDNGT----PYWLVANSWNVNWGEKGYFRI 180
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 77/168 (45%), Positives = 99/168 (58%), Gaps = 11/168 (6%)
Query: 334 RIGCRPYEI-PCERYMNG-SRSSCQANEPNTPECIRKCQPG--YDVSYEDDLNFGRIAYS 389
+ GC+PY I PC + +NG + C + TP+C+ C Y Y D +FG AY+
Sbjct: 40 QFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVEACTSNNTYPTGYLQDKHFGATAYA 99
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
+ E I EI HGP+E + T+Y D Y TG+Y H AG LG HA++I+GWG +
Sbjct: 100 VGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTTGVYVHTAGKSLGGHAVKILGWG---VD 156
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT YWLVANS+N NWGE G FRI+RG NECGIE AGLP +
Sbjct: 157 NGT----PYWLVANSWNVNWGEKGYFRIIRGLNECGIEHSAVAGLPDL 200
>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 244
Score = 154 bits (390), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 89/233 (38%), Positives = 130/233 (55%), Gaps = 23/233 (9%)
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-----GCQGGFHG 172
DQ +CGS WA G VEA + RVCI S GK + LS+ ++++CC + G+ GC GG
Sbjct: 1 DQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAANMLACC-NIGHFCLSFGCSGGNPI 59
Query: 173 KAWKYWVTTGIVSGGTYASKQ------GCRPYEIP-CERYMNGS-HSSCQDNEPNTPECI 224
+W + T GIVSGG + ++ GC PY P C + +GS + C +TP C
Sbjct: 60 TSWTFLHTNGIVSGGGFVPEKNMKAADGCWPYSFPKCAHHQDGSDYKPCAKEIYDTPSCS 119
Query: 225 RKC-QPGYDVSYEDDLNFGRIAY-SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 282
C Y +++ D ++ + S + +I +EI +GP + ++Y D + YK+G+
Sbjct: 120 SSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDFLSYKSGV 179
Query: 283 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
YKH +GG LG HA+ IIGWG E V YWLV NS+N WG++G F+I
Sbjct: 180 YKHTSGGFLGGHAVEIIGWGTE-------KGVDYWLVMNSWNEEWGDHGTFKI 225
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 92/166 (55%), Gaps = 13/166 (7%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKC-QPGYDVSYEDDLNFGRIAY-SLP 391
GC PY P C + +GS C +TP C C Y +++ D ++ + S
Sbjct: 87 GCWPYSFPKCAHHQDGSDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRF 146
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+ +I +EI +GP + ++Y D + YK+G+YKH +GG LG HA+ IIGWG E
Sbjct: 147 GSTSSIKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTE----- 201
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
V YWLV NS+N WG++G F+IV+G +CGI+ I AG P +
Sbjct: 202 --KGVDYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDTILAGTPAM 243
>gi|350540002|ref|NP_001232104.1| putative cathepsin B variant 2 precursor [Taeniopygia guttata]
gi|197129221|gb|ACH45719.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 261
Score = 154 bits (390), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 93/133 (69%), Gaps = 3/133 (2%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V + +E LP+ FD+R WP CPTI EIRDQGSCGS WA GAVEA+SDR+C+ +
Sbjct: 67 KLPERVDFAADVE-LPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI- 200
K V +S++DL+SCC +CG GC GG+ AW+YW G+VSGG Y S GCRPY I
Sbjct: 126 NAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYSIP 185
Query: 201 PCERYMNGSHSSC 213
PCE ++NG+ C
Sbjct: 186 PCEHHVNGTRPPC 198
>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
Length = 422
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/286 (33%), Positives = 135/286 (47%), Gaps = 64/286 (22%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LP+ FDAR WP CP+I + +QG CGS +A+ A SDR CI S G LS +D+
Sbjct: 138 DLPKAFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKALLSEEDI 197
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE------IPCERYMNGS 209
+ CC CGN C GG KA YWV G+V+GG + GCRPY +PC
Sbjct: 198 IGCCSVCGN-CYGGDPLKALTYWVNQGLVTGG----RDGCRPYSFDLSCGVPC-----SP 247
Query: 210 HSSCQDNEPNTPECIRKCQP-GYDVSYEDDLNFGRIAYSL--------PANEE------- 253
+ + E T C+R+CQ Y YE+D +F AYSL P +E
Sbjct: 248 ATFFEAEEKRT--CMRRCQNIYYQQRYEEDKHFATFAYSLYPRSMTVSPDGKERVKVPTI 305
Query: 254 -------------------TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE- 293
I +EI +GP + + + + Y +G+++ +
Sbjct: 306 IGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDR 365
Query: 294 ----HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H +R+IGWGQ G YWL NSF ++WG+NGLF+I
Sbjct: 366 IVYWHVVRLIGWGQSEDG------THYWLAVNSFGSHWGDNGLFKI 405
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/202 (28%), Positives = 83/202 (41%), Gaps = 62/202 (30%)
Query: 325 TNWGENGLF---RIGCRPYE------IPCERYMNGSRSSCQANEPNTPECIRKCQP-GYD 374
T W GL R GCRPY +PC + +A E T C+R+CQ Y
Sbjct: 217 TYWVNQGLVTGGRDGCRPYSFDLSCGVPC-----SPATFFEAEEKRT--CMRRCQNIYYQ 269
Query: 375 VSYEDDLNFGRIAYSL--------PANEE--------------------------TIMRE 400
YE+D +F AYSL P +E I +E
Sbjct: 270 QRYEEDKHFATFAYSLYPRSMTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKE 329
Query: 401 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-----HAIRIIGWGQEPLGEGTSSV 455
I +GP + + + + Y +G+++ + H +R+IGWGQ G
Sbjct: 330 ILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDRIVYWHVVRLIGWGQSEDG------ 383
Query: 456 VKYWLVANSFNTNWGENGLFRI 477
YWL NSF ++WG+NGLF+I
Sbjct: 384 THYWLAVNSFGSHWGDNGLFKI 405
>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
Length = 432
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 99/272 (36%), Positives = 136/272 (50%), Gaps = 33/272 (12%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
L +R+G +K P R+ + +LS+P LP F+A W I E+ DQG CGS W
Sbjct: 161 LRLRLG----TKEPTYRVKAMTRLSNPSSGLPRKFNAVERWS--SYISEVPDQGWCGSSW 214
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
L SDR I S+GK V+LS +++SC + GC+GG AW+Y G+V
Sbjct: 215 VLSTTSVASDRFAIQSQGKEVVQLSPQNILSCTRR-QQGCEGGHLDAAWRYLHKKGVVD- 272
Query: 187 GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
+ C PY SC+ + C+P Y V+ D L AY
Sbjct: 273 ------ETCYPY--------TQRRDSCKIRHNSRSLKANGCRPAYGVN-RDSLYTVGPAY 317
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQ 303
SL E IM EI+ GPV+ +M +Y D Y G+Y+ A G P G H+++I+GWG+
Sbjct: 318 SLKG-ETDIMAEIYHSGPVQATMRVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGE 376
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E G VKYW+ ANS+ WGE+G FRI
Sbjct: 377 EHDG------VKYWIAANSWGPWWGEHGYFRI 402
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 62/149 (41%), Positives = 83/149 (55%), Gaps = 11/149 (7%)
Query: 352 RSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 411
R SC+ + C+P Y V+ D L AYSL E IM EI+ GPV+ +M
Sbjct: 282 RDSCKIRHNSRSLKANGCRPAYGVN-RDSLYTVGPAYSLKG-ETDIMAEIYHSGPVQATM 339
Query: 412 TIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 468
+Y D Y G+Y+ A G P G H+++I+GWG+E G VKYW+ ANS+
Sbjct: 340 RVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDG------VKYWIAANSWGPW 393
Query: 469 WGENGLFRIVRGQNECGIEADITAGLPKI 497
WGE+G FRI+RG NECGIE + A P +
Sbjct: 394 WGEHGYFRILRGSNECGIEEYVLASWPNV 422
>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/281 (37%), Positives = 153/281 (54%), Gaps = 26/281 (9%)
Query: 59 LSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T SE + G + S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFSEAKRLTGAWIQKTSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +C + WA+ A+SDR C GK+ +R+S+ L+SCCK CG GC+GGF G AW+
Sbjct: 110 ADQSACRASWAVSTASAISDRYCTVGGGKQ-LRISAAHLLSCCKQCGGGCKGGFPGFAWR 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERY-MNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V GI S C+PY P CE + G+ + C + + TP+C C D +
Sbjct: 169 YYVEYGIAS-------SYCQPYPFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCT---DKT 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G+ AY L EE RE++ +GP + +Y D+ YK+G+Y++V G +G
Sbjct: 219 IPLIKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVT 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A++++GWG+ GT YW VAN+++T+WG +G I
Sbjct: 279 AVKVVGWGKL---NGTP----YWKVANTWDTDWGMDGYLLI 312
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/169 (35%), Positives = 92/169 (54%), Gaps = 12/169 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERY-MNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E G+ C+PY P CE + G+++ C + TP+C C D + G+
Sbjct: 172 EYGIASSYCQPYPFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCT---DKTIPLIKYRGKD 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
AY L EE RE++ +GP + +Y D+ YK+G+Y++V G +G A++++GWG+
Sbjct: 229 AYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVTAVKVVGWGKL 288
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT YW VAN+++T+WG +G I+RG NEC IE AG P
Sbjct: 289 ---NGTP----YWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTP 330
>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
niloticus]
Length = 499
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 101/294 (34%), Positives = 142/294 (48%), Gaps = 24/294 (8%)
Query: 53 GAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
A + L +TL+E + R+G S+ N + + + + LP F++ WP
Sbjct: 182 AANYSELYGMTLNEGIRYRLGTQRPSRTVMNMNEIQMNMDPQTDNLPPYFNSAEKWP--G 239
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
I E DQG+C + WA SDR+ I S G RLS +L+SC GC GG
Sbjct: 240 KIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPRLSPQNLISCDTRNQGGCAGGRI 299
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEIPCE--RYMNGSHSSCQDNEPNTPECIRKCQP 229
AW Y G+V+ Y + P++ P E R M S S + T C
Sbjct: 300 DGAWWYLRRRGVVTEDCYPYQP---PHQTPAEVGRCMMQSRSVGRGKRQATQRCPNT--- 353
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
+Y +D+ Y L +NE+ IM+EI +GPV+ M ++ D +YKTGIYKH
Sbjct: 354 ---QNYHNDIYQSTPPYRLSSNEKEIMKEIMDNGPVQAIMEVHEDFFVYKTGIYKHTDVS 410
Query: 290 --------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H++RI GWG++ +GTS KYW+ ANS+ NWGENG FRI
Sbjct: 411 FTKPPQYRKHGTHSVRITGWGEDRNVDGTSR--KYWIAANSWGKNWGENGYFRI 462
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 70/184 (38%), Positives = 96/184 (52%), Gaps = 22/184 (11%)
Query: 331 GLFRIGCRPYEIPCE------RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFG 384
G+ C PY+ P + R M SRS + T C +Y +D+
Sbjct: 310 GVVTEDCYPYQPPHQTPAEVGRCMMQSRSVGRGKRQATQRCPNT------QNYHNDIYQS 363
Query: 385 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEH 436
Y L +NE+ IM+EI +GPV+ M ++ D +YKTGIYKH G H
Sbjct: 364 TPPYRLSSNEKEIMKEIMDNGPVQAIMEVHEDFFVYKTGIYKHTDVSFTKPPQYRKHGTH 423
Query: 437 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
++RI GWG++ +GTS KYW+ ANS+ NWGENG FRIVRG+NEC IE + +
Sbjct: 424 SVRITGWGEDRNVDGTSR--KYWIAANSWGKNWGENGYFRIVRGENECEIETFVIGVWGR 481
Query: 497 IGLE 500
I +E
Sbjct: 482 ISME 485
>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
terrestris]
Length = 445
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 110/327 (33%), Positives = 154/327 (47%), Gaps = 56/327 (17%)
Query: 13 LKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMG 72
+ LDLS +RN+S F+G + KL L L
Sbjct: 150 INSLDLSWRARNYSE-----------------------FWGRTLDEGVKLRLGTLNPSRS 186
Query: 73 VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVE 132
V+ R+ + ++ DP E LP FDARI WP I +I DQG CG+ WA+ A
Sbjct: 187 VY--------RMNSVRRIYDP-ESLPREFDARIRWP--REISDIDDQGWCGASWAISATR 235
Query: 133 AMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASK 192
SDR + S+G V LS+ L+SC C GG+ +AW Y G+V
Sbjct: 236 VASDRFALMSKGADSVLLSAQHLLSCNNRGQQACSGGYLDRAWLYMRKFGLVD------- 288
Query: 193 QGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
+ C P+E G+++ C+ + T C+P + + G AY L NE
Sbjct: 289 EDCYPWE--------GTNAQCKLRK-RTDLKTAGCRPPVNPLRTELYKVGP-AYRL-GNE 337
Query: 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG---PLGEHAIRIIGWGQEPLGEG 309
IM EI GPV+ +M +Y D Y++GIYKH A G H++RIIGWG++
Sbjct: 338 TDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHR 397
Query: 310 TSSV-VKYWLVANSFNTNWGENGLFRI 335
++ +KYWLV NS+ WGE+GLFRI
Sbjct: 398 HHNLPIKYWLVVNSWGQQWGESGLFRI 424
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 70/114 (61%), Gaps = 5/114 (4%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG---PLGEHAIRIIGW 443
AY L NE IM EI GPV+ +M +Y D Y++GIYKH A G H++RIIGW
Sbjct: 331 AYRL-GNETDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGW 389
Query: 444 GQEPLGEGTSSV-VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
G++ ++ +KYWLV NS+ WGE+GLFRI RG NEC IE+ + A K
Sbjct: 390 GEDTSAHRHHNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVAVWAK 443
>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/281 (37%), Positives = 153/281 (54%), Gaps = 26/281 (9%)
Query: 59 LSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T SE + G + +S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFSEAKRLTGAWIQKNSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +C + WA+ A+SDR C GK+ +R+S+ L+SCCK CG GC+GGF G AW+
Sbjct: 110 ADQSACRASWAVSTASAISDRYCTVGGGKQ-LRISAAHLLSCCKQCGGGCKGGFPGFAWR 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CERY-MNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V GI S C+PY P CE G+ + C + + TP+C C D +
Sbjct: 169 YYVEYGIAS-------SYCQPYPFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCT---DKT 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G+ AY L EE RE++ +GP + +Y D+ YK+G+Y++V G +G
Sbjct: 219 IPLIKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVT 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A++++GWG+ GT YW VAN+++T+WG +G I
Sbjct: 279 AVKVVGWGKL---NGTP----YWKVANTWDTDWGMDGYLLI 312
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 60/169 (35%), Positives = 91/169 (53%), Gaps = 12/169 (7%)
Query: 329 ENGLFRIGCRPYEIP-CERY-MNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E G+ C+PY P CE G+++ C + TP+C C D + G+
Sbjct: 172 EYGIASSYCQPYPFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCT---DKTIPLIKYRGKD 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
AY L EE RE++ +GP + +Y D+ YK+G+Y++V G +G A++++GWG+
Sbjct: 229 AYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVTAVKVVGWGKL 288
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GT YW VAN+++T+WG +G I+RG NEC IE AG P
Sbjct: 289 ---NGTP----YWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTP 330
>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
Length = 334
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 143/253 (56%), Gaps = 17/253 (6%)
Query: 88 VQLSDPL---EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
++ DPL + P+ FD+R NW C I IRDQG+CGS W+ A +DR+C+++ G
Sbjct: 73 IKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGG 132
Query: 145 KRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CE 203
K + LS ++L CCKDCG GC GG+ KAWKY+ T G+ +GG Y +K+GC PY++P C
Sbjct: 133 KFNQLLSPEELAFCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPCY 192
Query: 204 RYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
+ Q E N +C + C V + S+ +TI R+I +G
Sbjct: 193 NKQGKNTCGGQPMERNH-QCPKTCYGKTTVQNRYKTKSEYVINSI----KTIERDIMTYG 247
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
PVE S +Y D+ YK+GIY+ G H+I+IIGWGQ+ GT YWL NS
Sbjct: 248 PVEASFDVYDDLSAYKSGIYRKTPKAKYQGGHSIKIIGWGQQ---NGTP----YWLAVNS 300
Query: 323 FNTNWGENGLFRI 335
++ WGE+G F+I
Sbjct: 301 WSKFWGEHGTFKI 313
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 92/164 (56%), Gaps = 18/164 (10%)
Query: 336 GCRPYEIP-CERYMNGSRSSC--QANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
GC PY++P C Y +++C Q E N +C + C V + S+
Sbjct: 182 GCMPYKVPPC--YNKQGKNTCGGQPMERNH-QCPKTCYGKTTVQNRYKTKSEYVINSI-- 236
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEG 451
+TI R+I +GPVE S +Y D+ YK+GIY+ G H+I+IIGWGQ+ G
Sbjct: 237 --KTIERDIMTYGPVEASFDVYDDLSAYKSGIYRKTPKAKYQGGHSIKIIGWGQQ---NG 291
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
T YWL NS++ WGE+G F+I++G+NECGIE +TAG+P
Sbjct: 292 TP----YWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIP 331
>gi|149030259|gb|EDL85315.1| rCG52258, isoform CRA_b [Rattus norvegicus]
Length = 210
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 77/143 (53%), Positives = 98/143 (68%), Gaps = 4/143 (2%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V S+ + LPE FDAR W CPTI +IRDQGSCGS WA GAVEAMSDR+CI +
Sbjct: 67 KLPERVGFSEDIN-LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHT 125
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI- 200
G+ +V +S++DL++CC CG+GC GG+ AW +W G+VSGG Y S GC PY I
Sbjct: 126 NGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIP 185
Query: 201 PCERYMNGSHSSCQDNEPNTPEC 223
PCE ++NGS C E +TP+C
Sbjct: 186 PCEHHVNGSRPPCT-GEGDTPKC 207
>gi|60600065|gb|AAX26576.1| unknown [Schistosoma japonicum]
Length = 190
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 81/162 (50%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 63 TLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSC 122
T+S++ +G PD Q L L ELP+ FDAR W +CP+I EIRDQ SC
Sbjct: 31 TVSDIRRMLGALPDPNGEQLET-LCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSC 89
Query: 123 GSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTG 182
GS WA GAVEAMSDR+CI S+GK LS+++LVSCC CG GC GGF AW YW G
Sbjct: 90 GSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKNQG 149
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPEC 223
IV+G Y + GC+PYE PCE G C D + TP C
Sbjct: 150 IVTGDLYNTTNGCQPYEFPPCEHNTLGPLPVC-DGDVETPPC 190
>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 101/246 (41%), Positives = 129/246 (52%), Gaps = 42/246 (17%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWA-----LGAVEAMSDRVCIASRGKRHVRLS 151
LPE FD+R WP C I IR+Q CGS WA + + E +SDR CIAS GK +V LS
Sbjct: 2 LPESFDSREKWPTC--IHPIRNQEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLS 59
Query: 152 SDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGT--YASKQGCRPYEIPCERYMNGS 209
DLVSC GC GG AW Y TGIV+ Y+S G P C +Y NG+
Sbjct: 60 PQDLVSC-NWYNAGCDGGILWAAWIYLKHTGIVTDQCLPYSSGNGVAP---SCPKYCNGT 115
Query: 210 HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
+ P V Y+ + Y + + E IM EI +GPV+
Sbjct: 116 ST-----------------PIDSVKYK-----AKDWYEVGSIAEKIMNEIATNGPVQSGF 153
Query: 270 TIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 329
++Y D + YK+G+Y H G LG HAI+I+GWG E + VKYWLVANS+ +WG
Sbjct: 154 SVYQDFMSYKSGVYTHQTGSFLGGHAIKIVGWGVE-------NNVKYWLVANSWGPDWGL 206
Query: 330 NGLFRI 335
NGLF+I
Sbjct: 207 NGLFKI 212
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/103 (51%), Positives = 69/103 (66%), Gaps = 7/103 (6%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447
Y + + E IM EI +GPV+ ++Y D + YK+G+Y H G LG HAI+I+GWG E
Sbjct: 130 YEVGSIAEKIMNEIATNGPVQSGFSVYQDFMSYKSGVYTHQTGSFLGGHAIKIVGWGVE- 188
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADI 490
+ VKYWLVANS+ +WG NGLF+I RG NECGIEAD+
Sbjct: 189 ------NNVKYWLVANSWGPDWGLNGLFKIKRGDNECGIEADV 225
>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
Length = 352
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 91/242 (37%), Positives = 122/242 (50%), Gaps = 22/242 (9%)
Query: 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153
+ +P F++ W C I I++Q CGS WA GAVE++SDR CI V LS
Sbjct: 67 FQAVPANFNSAQQWSNCSYISAIQNQARCGSCWAFGAVESVSDRFCI--HKGEDVLLSFQ 124
Query: 154 DLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC 213
DLV+C NGCQGG A K+ GIVS C PY IP + C
Sbjct: 125 DLVTC-DQSDNGCQGGDAYTAMKFIQKKGIVSND-------CLPYTIP---TCAPAQQPC 173
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
N +TP+C+ KC +Y DL+F YS+ I +EI +GPVE +Y
Sbjct: 174 L-NFVDTPQCVEKCSNA-SYTYAQDLHFIDGVYSMNPTVNAIQQEIMTNGPVEACFEVYE 231
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D + YK+G+Y+H G LG H +++IGW GT + YW+ NS+ T WG G+F
Sbjct: 232 DFLGYKSGVYQHTTGKDLGGHCVKMIGW-------GTQNNELYWICNNSWTTYWGNQGVF 284
Query: 334 RI 335
I
Sbjct: 285 WI 286
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 96/181 (53%), Gaps = 13/181 (7%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+ G+ C PY IP ++ C N +TP+C+ KC +Y DL+F Y
Sbjct: 150 KKGIVSNDCLPYTIP---TCAPAQQPC-LNFVDTPQCVEKCSNA-SYTYAQDLHFIDGVY 204
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
S+ I +EI +GPVE +Y D + YK+G+Y+H G LG H +++IGW
Sbjct: 205 SMNPTVNAIQQEIMTNGPVEACFEVYEDFLGYKSGVYQHTTGKDLGGHCVKMIGW----- 259
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG-LPKIGLEIDSNEIN 507
GT + YW+ NS+ T WG G+F I G NECGIE+D+ A ++ L+++ ++
Sbjct: 260 --GTQNNELYWICNNSWTTYWGNQGVFWIKAGVNECGIESDVVAAKFNELWLDLEGEDVL 317
Query: 508 L 508
L
Sbjct: 318 L 318
>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
saltator]
Length = 443
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 101/275 (36%), Positives = 140/275 (50%), Gaps = 33/275 (12%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
+E+R+G S+ P+ ++ DP + LP FDAR WP I I DQG CG+ W
Sbjct: 175 VELRLGTLNPSQSMYKMNPVR-RIYDP-DALPREFDARTRWPR--DISGIHDQGWCGASW 230
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A+ + SDR I S+G V LS+ L+SC GC+GG+ +AW + G+V
Sbjct: 231 AVSTADVASDRFAIMSKGAEDVELSAQHLLSCNNRGQQGCRGGYLDRAWLFMRKFGLVD- 289
Query: 187 GTYASKQGCRPYEIPCERYMNGSHSSC---QDNEPNTPECIRKCQPGYDVSYEDDLNFGR 243
+ C P+ G + C + + N C + P +L
Sbjct: 290 ------KECYPW--------TGRNDQCRLRKRSNLNVAGCRKPPNP-----LRQELYKVG 330
Query: 244 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---GEHAIRIIG 300
AY L NE IM+EI GPV+ +M +Y D +YK G+Y+H L G H++RIIG
Sbjct: 331 PAYRL-GNETDIMQEILTSGPVQATMRVYQDFFVYKNGVYRHSRSAELHDSGYHSMRIIG 389
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
WG+EP G +KYWLVANS+ +WGENGLFRI
Sbjct: 390 WGEEPSYRGPP--LKYWLVANSWGRHWGENGLFRI 422
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 57/109 (52%), Positives = 72/109 (66%), Gaps = 6/109 (5%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---GEHAIRIIGW 443
AY L NE IM+EI GPV+ +M +Y D +YK G+Y+H L G H++RIIGW
Sbjct: 332 AYRL-GNETDIMQEILTSGPVQATMRVYQDFFVYKNGVYRHSRSAELHDSGYHSMRIIGW 390
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITA 492
G+EP G +KYWLVANS+ +WGENGLFRI RG NEC IE+ + A
Sbjct: 391 GEEPSYRGPP--LKYWLVANSWGRHWGENGLFRIQRGTNECEIESYVLA 437
>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
Length = 202
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 119/204 (58%), Gaps = 6/204 (2%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTG 182
S WA+ A M+DR+C+ S+G+ +S D++SCC + CG GC+GG + +AWK+ + G
Sbjct: 1 SCWAVSAASVMTDRLCVQSKGRIKRFISDTDILSCCGRFCGYGCRGGANIRAWKHVMRNG 60
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQDNEPNTPECIRKCQPG-YDVSYEDDL 239
+ +GG K GCRPY PC + + + C +TPEC + CQ G + Y D
Sbjct: 61 VCTGGPCGYKYGCRPYAFHPCGVHKDQVYYGECPRKSYDTPECRKICQRGCIQLQYGKDR 120
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
+ AY + + + IMREI R GPV G+ Y D LYK G+Y+H AG G H+I+I+
Sbjct: 121 YYAASAYFVKNDTKAIMREIMRGGPVHGAYDTYTDFRLYKGGVYEHTAGERTGGHSIKIM 180
Query: 300 GWGQEPLGEGTSSVVKYWLVANSF 323
GWG GT V+ YWLVANS+
Sbjct: 181 GWGNYKHPNGT--VIPYWLVANSW 202
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 56/136 (41%), Positives = 75/136 (55%), Gaps = 5/136 (3%)
Query: 333 FRIGCRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPG-YDVSYEDDLNFGRIAYS 389
++ GCRPY PC + + C +TPEC + CQ G + Y D + AY
Sbjct: 69 YKYGCRPYAFHPCGVHKDQVYYGECPRKSYDTPECRKICQRGCIQLQYGKDRYYAASAYF 128
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
+ + + IMREI R GPV G+ Y D LYK G+Y+H AG G H+I+I+GWG
Sbjct: 129 VKNDTKAIMREIMRGGPVHGAYDTYTDFRLYKGGVYEHTAGERTGGHSIKIMGWGNYKHP 188
Query: 450 EGTSSVVKYWLVANSF 465
GT V+ YWLVANS+
Sbjct: 189 NGT--VIPYWLVANSW 202
>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
Length = 432
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 94/272 (34%), Positives = 141/272 (51%), Gaps = 32/272 (11%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
L +R+G +K P R+ + +L++P +LP F+A W I E+ DQG CG+ W
Sbjct: 161 LRLRLG----TKEPTFRVKSMTRLTNPSNDLPRSFNAVEKWS--TFISEVPDQGWCGASW 214
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
L SDR I S+GK V+LS+ +++SC + GC GG AW+Y G++
Sbjct: 215 VLSTTSVASDRFAIQSQGKEVVQLSAQNILSCTRR-QQGCDGGHLDAAWRYMHKNGVLDA 273
Query: 187 GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
Y Q ++ +R+ S + CQP + V+ ++ G AY
Sbjct: 274 NCYPYIQQRDTCKV--QRHRGRSLKA------------YGCQPAHGVNRDNFYTVGP-AY 318
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQ 303
SL + E IM EI+ GPV+ +MT+Y D Y +G+Y+H A G G H+++++GWG+
Sbjct: 319 SL-SREADIMAEIYHSGPVQATMTVYRDFFSYSSGVYQHTAANRGAATGFHSVKLVGWGE 377
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E G VKYW+ ANS+ WGE G FRI
Sbjct: 378 EHNG------VKYWIAANSWGPWWGERGYFRI 403
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/141 (41%), Positives = 84/141 (59%), Gaps = 11/141 (7%)
Query: 369 CQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV 428
CQP + V+ ++ G AYSL + E IM EI+ GPV+ +MT+Y D Y +G+Y+H
Sbjct: 300 CQPAHGVNRDNFYTVGP-AYSL-SREADIMAEIYHSGPVQATMTVYRDFFSYSSGVYQHT 357
Query: 429 A---GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECG 485
A G G H+++++GWG+E G VKYW+ ANS+ WGE G FRI+RG NECG
Sbjct: 358 AANRGAATGFHSVKLVGWGEEHNG------VKYWIAANSWGPWWGERGYFRILRGSNECG 411
Query: 486 IEADITAGLPKIGLEIDSNEI 506
IE + A P + ++ I
Sbjct: 412 IEEYVLASWPHVYNYFNTKSI 432
>gi|56758470|gb|AAW27375.1| unknown [Schistosoma japonicum]
Length = 217
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 74/151 (49%), Positives = 95/151 (62%), Gaps = 2/151 (1%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L + R P V D E+P FD+R WP C +I +IRDQ CGS WA+ AV
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYAS 191
AMSDR+CI S GK+ V LS+ DL+SCCK CG+GC GGF G +W YWV GIV+GG+ +
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKEN 184
Query: 192 KQGCRPYEIP-CERYMNGSHSSCQDNEPNTP 221
GCRPY P C+ ++ G + +C D +P
Sbjct: 185 HTGCRPYPFPKCDHFVKGKYRACGDKLYKSP 215
>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
Length = 431
Score = 151 bits (381), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 94/272 (34%), Positives = 135/272 (49%), Gaps = 33/272 (12%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
L +R+G +K P R+ + +L++P + LP F+A WP I E+ DQG CGS W
Sbjct: 160 LRLRLG----TKEPTYRVKAMTRLTNPTDGLPSSFNAVERWP--SYISEVPDQGWCGSSW 213
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
L SDR I S+GK VRLS+ +++SC + GC GG AW++ G+V
Sbjct: 214 VLSTTSVASDRFAIQSKGKEAVRLSAQNILSCTRR-QQGCDGGHLDAAWRFLHKKGVVD- 271
Query: 187 GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
C PY +C+ + C+P +V + G AY
Sbjct: 272 ------DSCYPY--------TQQRDTCKIRHNSRSLKANGCRPSPNVDRDSFYTVG-PAY 316
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQ 303
+L E IM EI+ GPV+ +M +Y D Y GIY+ A G P G H+++++GWG+
Sbjct: 317 TL-NREGDIMAEIYHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVKLVGWGE 375
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E G+ KYW+ ANS+ WGE G FRI
Sbjct: 376 EHNGD------KYWIAANSWGPWWGERGYFRI 401
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/172 (35%), Positives = 87/172 (50%), Gaps = 19/172 (11%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+ G+ C PY R +C+ + C+P +V + G AY
Sbjct: 266 KKGVVDDSCYPY--------TQQRDTCKIRHNSRSLKANGCRPSPNVDRDSFYTVG-PAY 316
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQ 445
+L E IM EI+ GPV+ +M +Y D Y GIY+ A G P G H+++++GWG+
Sbjct: 317 TL-NREGDIMAEIYHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVKLVGWGE 375
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
E G+ KYW+ ANS+ WGE G FRI+RG NECGIE + A P +
Sbjct: 376 EHNGD------KYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWPYV 421
>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 151 bits (381), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 99/282 (35%), Positives = 140/282 (49%), Gaps = 45/282 (15%)
Query: 56 KNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
+N + T ++L+ +G Q+ + Q++ LP+ FD+R W C +
Sbjct: 43 QNKFANYTEAQLKGLLGTVLSH---QSGISAFTQIN---AALPDSFDSRTQWKDC--VHP 94
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKA 174
IRDQ CGS WA AVE++SDR CIAS+GK ++ LS D++SC D N C GG+ A
Sbjct: 95 IRDQAKCGSCWAFAAVESLSDRFCIASQGKVNLVLSPQDMLSC--DASNFCCFGGYLDTA 152
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W+Y G+ S C PY + NG + P C KC G +
Sbjct: 153 WQYLEQQGVGS-------DSCEPY-----KSGNG----------DQPSCPSKCSNGQAIK 190
Query: 235 -YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 293
Y+ + A A + I + GPVE TIY D + Y +GIY HV GG +G
Sbjct: 191 KYKCKAGSTKQAKGAEATKSLIQQS----GPVETGFTIYEDFLNYNSGIYHHVTGGNMGG 246
Query: 294 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA++I+GWG++ L YW+VANS+ +WGE G F I
Sbjct: 247 HAVKILGWGKQGL-------ENYWIVANSWGEDWGEKGYFNI 281
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 81/164 (49%), Gaps = 22/164 (13%)
Query: 344 CERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETIMREIF 402
CE Y +G+ + P C KC G + Y+ + A A + I +
Sbjct: 166 CEPYKSGN--------GDQPSCPSKCSNGQAIKKYKCKAGSTKQAKGAEATKSLIQQS-- 215
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 462
GPVE TIY D + Y +GIY HV GG +G HA++I+GWG++ L YW+VA
Sbjct: 216 --GPVETGFTIYEDFLNYNSGIYHHVTGGNMGGHAVKILGWGKQGL-------ENYWIVA 266
Query: 463 NSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEI 506
NS+ +WGE G F I +G + GI+ +P + +++ I
Sbjct: 267 NSWGEDWGEKGYFNIRQG--DSGIDEATFGCIPDVSSALENEFI 308
>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
impatiens]
Length = 445
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/289 (35%), Positives = 142/289 (49%), Gaps = 33/289 (11%)
Query: 51 FYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYC 110
F+G + KL L L V+ R+ + ++ DP E LP FDARI WP
Sbjct: 165 FWGRTLDEGVKLRLGTLNPSRSVY--------RMNSVQRIYDP-ESLPREFDARIRWPR- 214
Query: 111 PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF 170
I +I DQG CG+ WA+ SDR + S+G V LS+ L+SC C GG+
Sbjct: 215 -EISDIDDQGWCGASWAISTTRVASDRFALMSKGADSVLLSAQHLLSCNNRGQQACSGGY 273
Query: 171 HGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
+AW Y G+V + C P+E G++ C+ + T C+P
Sbjct: 274 LDRAWLYMRKFGLVD-------EDCYPWE--------GTNVQCKLRK-RTDLKTAGCRPP 317
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG- 289
+ + G AY L NE IM EI GPV+ +M +Y D Y++GIYKH A
Sbjct: 318 VNPLRTELYKVGP-AYRL-GNETDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTE 375
Query: 290 --PLGEHAIRIIGWGQEPLGEGTSSV-VKYWLVANSFNTNWGENGLFRI 335
G H++RIIGWG++ ++ +KYWLV NS+ WGE+GLFRI
Sbjct: 376 HYAFGYHSVRIIGWGEDTSAHRYRNLPIKYWLVVNSWGQQWGESGLFRI 424
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 70/114 (61%), Gaps = 5/114 (4%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG---PLGEHAIRIIGW 443
AY L NE IM EI GPV+ +M +Y D Y++GIYKH A G H++RIIGW
Sbjct: 331 AYRL-GNETDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGW 389
Query: 444 GQEPLGEGTSSV-VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
G++ ++ +KYWLV NS+ WGE+GLFRI RG NEC IE+ + A K
Sbjct: 390 GEDTSAHRYRNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVAVWAK 443
>gi|148704124|gb|EDL36071.1| cathepsin B, isoform CRA_b [Mus musculus]
Length = 237
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 3/133 (2%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LPE FDAR W CPTI +IRDQGSCGS WA GAVEA+SDR CI + G+ +V +S++DL
Sbjct: 73 DLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDL 132
Query: 156 VSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC 213
++CC CG+GC GG+ AW +W G+VSGG Y S GC PY I PCE ++NGS C
Sbjct: 133 LTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPC 192
Query: 214 QDNEPNTPECIRK 226
E +TP C +K
Sbjct: 193 T-GEGDTPRCNKK 204
>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
Length = 433
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 92/272 (33%), Positives = 136/272 (50%), Gaps = 33/272 (12%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
L +R+G +K P R+ + +L++P LP F+A W I E+ DQG CGS W
Sbjct: 163 LRLRLG----TKEPTYRVKAMSRLTNPTAGLPAAFNAVEKWS--SYISEVPDQGWCGSSW 216
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
L SDR I S+GK V+LS+ +++SC + GC+GG AW+Y G+V
Sbjct: 217 VLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRR-QQGCEGGHLDAAWRYLHKKGVVD- 274
Query: 187 GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
+ C PY +C+ + C+P +V + G AY
Sbjct: 275 ------ESCYPY--------TQHRDTCKIRHNSRSLKANGCRPSANVDRDSFYTVG-PAY 319
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQ 303
+L E IM EI+ GPV+ +M +Y D Y +G+Y+ A G P G H+++++GWG+
Sbjct: 320 TL-NKESDIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGE 378
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E G+ KYW+ ANS+ WGE G FRI
Sbjct: 379 EHNGD------KYWIAANSWGPWWGERGYFRI 404
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 88/172 (51%), Gaps = 19/172 (11%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+ G+ C PY R +C+ + C+P +V + G AY
Sbjct: 269 KKGVVDESCYPY--------TQHRDTCKIRHNSRSLKANGCRPSANVDRDSFYTVG-PAY 319
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQ 445
+L E IM EI+ GPV+ +M +Y D Y +G+Y+ A G P G H+++++GWG+
Sbjct: 320 TL-NKESDIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGE 378
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
E G+ KYW+ ANS+ WGE G FRI+RG NECGIE + A P +
Sbjct: 379 EHNGD------KYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWPYV 424
>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
Length = 433
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 92/272 (33%), Positives = 136/272 (50%), Gaps = 33/272 (12%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
L +R+G +K P R+ + +L++P LP F+A W I E+ DQG CGS W
Sbjct: 163 LRLRLG----TKEPTYRVKAMSRLTNPTAGLPAAFNAVEKWS--SYISEVPDQGWCGSSW 216
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
L SDR I S+GK V+LS+ +++SC + GC+GG AW+Y G+V
Sbjct: 217 VLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRR-QQGCEGGHLDAAWRYLHKKGVVD- 274
Query: 187 GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
+ C PY +C+ + C+P +V + G AY
Sbjct: 275 ------ESCYPY--------TQHRDTCKIRHNSRSLKANGCRPSANVDRDSFYTVG-PAY 319
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQ 303
+L E IM EI+ GPV+ +M +Y D Y +G+Y+ A G P G H+++++GWG+
Sbjct: 320 TL-NKESDIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGE 378
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E G+ KYW+ ANS+ WGE G FRI
Sbjct: 379 EHNGD------KYWIAANSWGPWWGERGYFRI 404
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 88/172 (51%), Gaps = 19/172 (11%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+ G+ C PY R +C+ + C+P +V + G AY
Sbjct: 269 KKGVVDESCYPY--------TQHRDTCKIRHNSRSLKANGCRPSANVDRDSFYTVG-PAY 319
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQ 445
+L E IM EI+ GPV+ +M +Y D Y +G+Y+ A G P G H+++++GWG+
Sbjct: 320 TL-NKESDIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGE 378
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
E G+ KYW+ ANS+ WGE G FRI+RG NECGIE + A P +
Sbjct: 379 EHNGD------KYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWPYV 424
>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
Length = 334
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 99/253 (39%), Positives = 143/253 (56%), Gaps = 17/253 (6%)
Query: 88 VQLSDPL---EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
++ DPL + P+ FD+R NW C I IRDQG+CGS W+ A +DR+C+++ G
Sbjct: 73 IKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGG 132
Query: 145 KRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CE 203
K + LS ++L CCKDCG GC GG+ KAWKY+ T G+ +GG Y +K+GC PY++P C
Sbjct: 133 KFNQLLSPEELAFCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPCY 192
Query: 204 RYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
+ Q E N +C + C V + S+ +TI +++ +G
Sbjct: 193 NKQGKNTCGGQPMERNH-QCPKTCYGKTTVQNRYKTKSEYVMNSI----KTIEQDLKTYG 247
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
PVE S +Y D +YK+GIY+ G H+I+IIGWGQ+ GT YWL NS
Sbjct: 248 PVEASFDVYDDFSVYKSGIYRKTPKAKYQGGHSIKIIGWGQQ---NGTP----YWLAVNS 300
Query: 323 FNTNWGENGLFRI 335
++ WGE+G F+I
Sbjct: 301 WSKFWGEHGTFKI 313
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 92/164 (56%), Gaps = 18/164 (10%)
Query: 336 GCRPYEIP-CERYMNGSRSSC--QANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
GC PY++P C Y +++C Q E N +C + C V + S+
Sbjct: 182 GCMPYKVPPC--YNKQGKNTCGGQPMERNH-QCPKTCYGKTTVQNRYKTKSEYVMNSI-- 236
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEG 451
+TI +++ +GPVE S +Y D +YK+GIY+ G H+I+IIGWGQ+ G
Sbjct: 237 --KTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYQGGHSIKIIGWGQQ---NG 291
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
T YWL NS++ WGE+G F+I++G+NECGIE +TAG+P
Sbjct: 292 TP----YWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIP 331
>gi|157058743|gb|ABV03129.1| cathepsin B-2744 [Pterocomma populeum]
Length = 244
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 82/218 (37%), Positives = 123/218 (56%), Gaps = 6/218 (2%)
Query: 96 ELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
++P+ FDAR ++ C I +++DQG+C S WA+ +DR+CIA+ GK LS+ +
Sbjct: 24 DVPKEFDARRHFVSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIATGGKFTDNLSAQN 83
Query: 155 LVSCC-KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE-IPCERYMNGSHSS 212
L+SC + GC GG KAW++ + GIV+GG + S +GC+PY+ PC+ Y + S ++
Sbjct: 84 LMSCGDSEKFVGCHGGSAFKAWEFTMGNGIVTGGNFNSNEGCQPYKNRPCDHYGDSSMTN 143
Query: 213 CQD-NEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP-ANEETIMREIFRHGPVEGSM 269
C C KC Y V YEDDL+ + Y N I +EI +GPV M
Sbjct: 144 CSSFRRTQMSICREKCVNKNYKVKYEDDLHKTSVVYMTSWTNVTQIQQEIMTYGPVTALM 203
Query: 270 TIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 307
+Y + + YK GIYK G +G H +++IGWG + G
Sbjct: 204 YVYENFMGYKEGIYKSTVGDLVGYHHVKLIGWGVDDDG 241
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 43/118 (36%), Positives = 62/118 (52%), Gaps = 4/118 (3%)
Query: 336 GCRPYE-IPCERYMNGSRSSCQA-NEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP- 391
GC+PY+ PC+ Y + S ++C + C KC Y V YEDDL+ + Y
Sbjct: 124 GCQPYKNRPCDHYGDSSMTNCSSFRRTQMSICREKCVNKNYKVKYEDDLHKTSVVYMTSW 183
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
N I +EI +GPV M +Y + + YK GIYK G +G H +++IGWG + G
Sbjct: 184 TNVTQIQQEIMTYGPVTALMYVYENFMGYKEGIYKSTVGDLVGYHHVKLIGWGVDDDG 241
>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 104/281 (37%), Positives = 147/281 (52%), Gaps = 26/281 (9%)
Query: 59 LSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T +E + G + S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFAEAKRLTGAWIQKTSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +C + WA+ +SDR C G + +R+S+ L+SCCK CG GC+GGF G AW+
Sbjct: 110 ADQSACRASWAVSTASVISDRYCTVG-GVQQLRISAAHLLSCCKQCGGGCKGGFPGFAWR 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CE-RYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V GI S C+PY P CE R G+ + C +TP+C C D S
Sbjct: 169 YYVEYGIAS-------SYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCT---DKS 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G Y L EE RE++ +GP +Y D+ YK+G+Y+HV G LG
Sbjct: 219 IPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGT 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A++++GWG+ GT YW VAN+++T+WG +G I
Sbjct: 279 AVKVVGWGKL---NGTP----YWKVANTWDTDWGMDGYLLI 312
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 90/170 (52%), Gaps = 12/170 (7%)
Query: 329 ENGLFRIGCRPYEIP-CE-RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E G+ C+PY P CE R G+++ C +TP+C C D S G
Sbjct: 172 EYGIASSYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCT---DKSIPLVKYRGNA 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
Y L EE RE++ +GP +Y D+ YK+G+Y+HV G LG A++++GWG+
Sbjct: 229 TYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL 288
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
GT YW VAN+++T+WG +G I+RG NEC IE AG P+
Sbjct: 289 ---NGTP----YWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 331
>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
Length = 426
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 95/285 (33%), Positives = 134/285 (47%), Gaps = 64/285 (22%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP+ FDAR WP CP+I + +QG CGS +A+ A SDR CI S G LS +D++
Sbjct: 143 LPKHFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEEDII 202
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE------IPCERYMNGSH 210
CC CGN C GG KA YWV G+V+GG + GCRPY +PC
Sbjct: 203 GCCSVCGN-CYGGDPLKALTYWVNQGLVTGG----RDGCRPYSFDLSCGVPC-----SPA 252
Query: 211 SSCQDNEPNTPECIRKCQP-GYDVSYEDDLNFGRIAYSL--------PANEE-------- 253
+ + E T C+R+CQ Y YE+D +F AYSL P +E
Sbjct: 253 TFFEAEEKRT--CMRRCQNIYYQQKYEEDKHFATFAYSLYPRSMTVSPDGKERVKVPTII 310
Query: 254 ------------------TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-- 293
I +EI +GP + + + + Y +G+++ +
Sbjct: 311 GHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDRI 370
Query: 294 ---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H +R+IGWG+ G+ YWL NSF +WG+NG+F+I
Sbjct: 371 VYWHVVRLIGWGESDDGQ------HYWLAVNSFGNHWGDNGIFKI 409
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 83/202 (41%), Gaps = 62/202 (30%)
Query: 325 TNWGENGLF---RIGCRPYE------IPCERYMNGSRSSCQANEPNTPECIRKCQP-GYD 374
T W GL R GCRPY +PC + +A E T C+R+CQ Y
Sbjct: 221 TYWVNQGLVTGGRDGCRPYSFDLSCGVPC-----SPATFFEAEEKRT--CMRRCQNIYYQ 273
Query: 375 VSYEDDLNFGRIAYSL--------PANEE--------------------------TIMRE 400
YE+D +F AYSL P +E I +E
Sbjct: 274 QKYEEDKHFATFAYSLYPRSMTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKE 333
Query: 401 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-----HAIRIIGWGQEPLGEGTSSV 455
I +GP + + + + Y +G+++ + H +R+IGWG+ G+
Sbjct: 334 ILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDRIVYWHVVRLIGWGESDDGQ----- 388
Query: 456 VKYWLVANSFNTNWGENGLFRI 477
YWL NSF +WG+NG+F+I
Sbjct: 389 -HYWLAVNSFGNHWGDNGIFKI 409
>gi|157058753|gb|ABV03134.1| cathepsin B-84 [Acyrthosiphon pisum]
Length = 230
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 73/191 (38%), Positives = 116/191 (60%), Gaps = 5/191 (2%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
E+PE FD+R+ W YC TI +R+QG+CGS WA G A +DR+C+A+ G+ + +S+++L
Sbjct: 43 EVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEEL 102
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC-- 213
CC CG GC GG+ KAW+Y+ G+V+GG Y + GC+PY +P + H+SC
Sbjct: 103 TFCCHTCGFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSG 162
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
Q E N +C +KC + Y+ + + AY L T+ ++ +GP+E S +Y
Sbjct: 163 QPTERNH-KCSKKCYGDDTIDYKKNHYKTKDAYYL--KNTTMQKDTMVYGPIEASFDVYD 219
Query: 274 DMILYKTGIYK 284
D + Y++G+Y+
Sbjct: 220 DFMNYESGVYQ 230
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 45/92 (48%), Gaps = 3/92 (3%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C + G S +C +KC + Y+ + + AY L
Sbjct: 141 GCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYL--KN 198
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYK 426
T+ ++ +GP+E S +Y D + Y++G+Y+
Sbjct: 199 TTMQKDTMVYGPIEASFDVYDDFMNYESGVYQ 230
>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 105/281 (37%), Positives = 147/281 (52%), Gaps = 26/281 (9%)
Query: 59 LSKLTLSELEMRMG--VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
+ +T +E + G + S LP P+ ELPE FD+ WP CPTI+EI
Sbjct: 54 MQNITFAEAKRLTGAWIQKTSSLP----PVRFTEEQLRTELPESFDSAEKWPNCPTIREI 109
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQ +C + WA+ +SDR C G + +R+S+ L+SCCK CG GC+GGF G AW+
Sbjct: 110 ADQSACRASWAVSTASVISDRYCTVG-GVQQLRISAAHLLSCCKQCGGGCKGGFPGFAWR 168
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIP-CE-RYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
Y+V GI S C+PY P CE R G+ + C +TP+C C D S
Sbjct: 169 YYVEYGIAS-------SYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCT---DKS 218
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
G Y L EE RE++ +GP +Y D+ YK+G+Y++V G LG
Sbjct: 219 IPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGGQ 278
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
A+RI+GWG+ GT YW VAN+++T+WG +G I
Sbjct: 279 AVRIVGWGKL---NGT----PYWKVANTWDTDWGMDGYLLI 312
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 90/170 (52%), Gaps = 12/170 (7%)
Query: 329 ENGLFRIGCRPYEIP-CE-RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
E G+ C+PY P CE R G+++ C +TP+C C D S G
Sbjct: 172 EYGIASSYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCT---DKSIPLVKYRGNA 228
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
Y L EE RE++ +GP +Y D+ YK+G+Y++V G LG A+RI+GWG+
Sbjct: 229 TYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGGQAVRIVGWGKL 288
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
GT YW VAN+++T+WG +G I+RG NEC IE AG P+
Sbjct: 289 ---NGT----PYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 331
>gi|308485822|ref|XP_003105109.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
gi|308257054|gb|EFP01007.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
Length = 410
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 95/286 (33%), Positives = 135/286 (47%), Gaps = 64/286 (22%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LP+ FDAR WP CP+I + +QG CGS +A+ A SDR CI S G LS +D+
Sbjct: 126 DLPKHFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKALLSEEDI 185
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE------IPCERYMNGS 209
+ CC CGN C GG KA YWV G+V+GG + GCRPY +PC
Sbjct: 186 IGCCSVCGN-CYGGDPLKALTYWVNQGLVTGG----RDGCRPYSFDLSCGVPC-----SP 235
Query: 210 HSSCQDNEPNTPECIRKCQP-GYDVSYEDDLNFGRIAYSL--------PANEE------- 253
+ + E T C+R+CQ Y YE+D +F AYS+ P +E
Sbjct: 236 ATFFEAEEKRT--CMRRCQNIYYQQKYEEDKHFATFAYSMYPRSMTVSPDGKERVKVPTI 293
Query: 254 -------------------TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE- 293
I +EI +GP + + + + Y +G+++ +
Sbjct: 294 IGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDR 353
Query: 294 ----HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H +R+IGWG+ G+ YWL NSF +WG+NGLF+I
Sbjct: 354 IVYWHVVRLIGWGESGDGQ------HYWLAINSFGNHWGDNGLFKI 393
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 83/202 (41%), Gaps = 62/202 (30%)
Query: 325 TNWGENGLF---RIGCRPYE------IPCERYMNGSRSSCQANEPNTPECIRKCQP-GYD 374
T W GL R GCRPY +PC + +A E T C+R+CQ Y
Sbjct: 205 TYWVNQGLVTGGRDGCRPYSFDLSCGVPC-----SPATFFEAEEKRT--CMRRCQNIYYQ 257
Query: 375 VSYEDDLNFGRIAYSL--------PANEE--------------------------TIMRE 400
YE+D +F AYS+ P +E I +E
Sbjct: 258 QKYEEDKHFATFAYSMYPRSMTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKE 317
Query: 401 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-----HAIRIIGWGQEPLGEGTSSV 455
I +GP + + + + Y +G+++ + H +R+IGWG+ G+
Sbjct: 318 ILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDRIVYWHVVRLIGWGESGDGQ----- 372
Query: 456 VKYWLVANSFNTNWGENGLFRI 477
YWL NSF +WG+NGLF+I
Sbjct: 373 -HYWLAINSFGNHWGDNGLFKI 393
>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 97/282 (34%), Positives = 137/282 (48%), Gaps = 45/282 (15%)
Query: 56 KNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
+N + T ++L+ +G Q+ + Q++ LP+ FD+R W C +
Sbjct: 43 QNKFANYTEAQLKGLLGTVLSH---QSGISAFTQIN---AALPDSFDSRTQWKDC--VHP 94
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKA 174
IRDQ CGS WA A E++SDR CIAS+GK ++ LS D+VSC D N GC GG+ +A
Sbjct: 95 IRDQAQCGSCWAFAAAESLSDRFCIASQGKVNLVLSPQDMVSC--DTSNFGCFGGYLDQA 152
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV- 233
W+Y G+ S C PY + NG + P C KC G +
Sbjct: 153 WQYLEQQGV-------SSDSCEPY-----KSGNG----------DQPSCPTKCSNGQAIK 190
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 293
Y+ + A A + I GPVE T+Y D Y +G+Y HV G G
Sbjct: 191 KYKCKAGSTKQAKGAEATKSLIQES----GPVETGFTVYQDFYNYNSGVYHHVTGDAEGG 246
Query: 294 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA++I+GWG++ L YW+VANS+ +WGE G F I
Sbjct: 247 HAVKILGWGKQGL-------ENYWIVANSWGEDWGEKGYFNI 281
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 77/164 (46%), Gaps = 22/164 (13%)
Query: 344 CERYMNGSRSSCQANEPNTPECIRKCQPGYDVS-YEDDLNFGRIAYSLPANEETIMREIF 402
CE Y +G+ + P C KC G + Y+ + A A + I
Sbjct: 166 CEPYKSGN--------GDQPSCPTKCSNGQAIKKYKCKAGSTKQAKGAEATKSLIQES-- 215
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 462
GPVE T+Y D Y +G+Y HV G G HA++I+GWG++ L YW+VA
Sbjct: 216 --GPVETGFTVYQDFYNYNSGVYHHVTGDAEGGHAVKILGWGKQGL-------ENYWIVA 266
Query: 463 NSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEI 506
NS+ +WGE G F I +G + GI+ +P + +++ I
Sbjct: 267 NSWGEDWGEKGYFNIRQG--DSGIDEATFGCIPDVSSALENEFI 308
>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
Length = 462
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 98/272 (36%), Positives = 137/272 (50%), Gaps = 33/272 (12%)
Query: 69 MRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWAL 128
MR+G P+ ++ + +L++ L+ LP FDA WP I ++RDQG CGS WA+
Sbjct: 161 MRLGTF----YPKIKVKSMSRLTNGLDHLPTHFDATNYWP--GFIGKVRDQGWCGSSWAV 214
Query: 129 GAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGT 188
SDR I S+G+ V+L+ +VSC + GC GG AW Y G V+
Sbjct: 215 STASVASDRFAILSKGRETVQLAPQQIVSCVRR-SQGCSGGHLDTAWSYLRKVGTVN--- 270
Query: 189 YASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 248
+ C PY +H+ C+ P+ C+ V + G A+SL
Sbjct: 271 ----EECYPYI--------SAHNVCKI-RPSDTLITANCELPMKVDRTNMYKMGP-AFSL 316
Query: 249 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-----LGEHAIRIIGWGQ 303
NE IM EI +HGPV+ M ++ D YK+GIY+H A G H++R+IGWG+
Sbjct: 317 -NNETDIMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHSAASTSADQRAGYHSVRLIGWGE 375
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E G V KYW+ NS+ T WGENG FRI
Sbjct: 376 ERHG---YEVTKYWIAVNSWGTWWGENGRFRI 404
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/120 (46%), Positives = 75/120 (62%), Gaps = 9/120 (7%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-----LGEHAIRII 441
A+SL NE IM EI +HGPV+ M ++ D YK+GIY+H A G H++R+I
Sbjct: 313 AFSL-NNETDIMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHSAASTSADQRAGYHSVRLI 371
Query: 442 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEI 501
GWG+E G V KYW+ NS+ T WGENG FRI+RG NEC IE+ + A LP + ++
Sbjct: 372 GWGEERHG---YEVTKYWIAVNSWGTWWGENGRFRILRGSNECEIESYVLASLPYVHQQV 428
>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
Length = 130
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 65/128 (50%), Positives = 97/128 (75%), Gaps = 7/128 (5%)
Query: 369 CQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV 428
C+ GY SY++D ++G +YS+ +E+ IM EI+++GPVEG+ T+++D + YK+G+YKH
Sbjct: 2 CEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHE 61
Query: 429 AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEA 488
AG +G HAIRI+GWG E + V YWLVANS+N +WG+NG F+I+RG+N CGIE+
Sbjct: 62 AGDVMGGHAIRILGWGIE-------NGVPYWLVANSWNVDWGDNGFFKILRGENHCGIES 114
Query: 489 DITAGLPK 496
+I AG+P+
Sbjct: 115 EIVAGIPR 122
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 54/109 (49%), Positives = 80/109 (73%), Gaps = 7/109 (6%)
Query: 227 CQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV 286
C+ GY SY++D ++G +YS+ +E+ IM EI+++GPVEG+ T+++D + YK+G+YKH
Sbjct: 2 CEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHE 61
Query: 287 AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
AG +G HAIRI+GWG E + V YWLVANS+N +WG+NG F+I
Sbjct: 62 AGDVMGGHAIRILGWGIE-------NGVPYWLVANSWNVDWGDNGFFKI 103
>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
Length = 289
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 93/280 (33%), Positives = 135/280 (48%), Gaps = 42/280 (15%)
Query: 59 LSKLTLSELEMRMG---VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
+ +TL + +G VHP + LP+ +P ++ + FDAR W C +
Sbjct: 49 FAGMTLRDARKLLGTVLVHPINNLPKKTMPANLKAA-------SSFDARTKWGKC--VHP 99
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
IRDQ CGS WA A E +SDR CIAS G V LS + ++ C GC GG+ AW
Sbjct: 100 IRDQQQCGSCWAFSASEVLSDRFCIASNGSVDVVLSPEYMLQC-DSTDYGCDGGYLNNAW 158
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ TGI S C+ Y +G+ + C C G +
Sbjct: 159 AFLAGTGIPSD--------------KCDPYTSGNG--------DVGSCPTSCTDGSAIKL 196
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
+ S + + I ++I +GPV+ + ++Y D YK+G+Y+HV+G G HA
Sbjct: 197 YKAKSSSVAQLS---SIDDIQKDIQANGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHA 253
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
I+I+GWG G+ T YW+VANS+NTNWG+ G F I
Sbjct: 254 IKIVGWGVTSDGKDT----PYWIVANSWNTNWGQEGFFWI 289
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 56/83 (67%), Gaps = 4/83 (4%)
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ I ++I +GPV+ + ++Y D YK+G+Y+HV+G G HAI+I+GWG G+ T
Sbjct: 211 DDIQKDIQANGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIKIVGWGVTSDGKDT-- 268
Query: 455 VVKYWLVANSFNTNWGENGLFRI 477
YW+VANS+NTNWG+ G F I
Sbjct: 269 --PYWIVANSWNTNWGQEGFFWI 289
>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
Length = 431
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 93/272 (34%), Positives = 134/272 (49%), Gaps = 33/272 (12%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
L++R+G +K P R+ + +L +P + LP F+A W I E+ DQG CG+ W
Sbjct: 161 LKLRLG----TKEPTYRVKAMTRLKNPTDGLPSSFNALDKWS--SYISEVPDQGWCGASW 214
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
L SDR I S+GK V+LS+ +++SC + GC+GG AW+Y G+V
Sbjct: 215 VLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRR-QQGCEGGHLDAAWRYLHKKGVVD- 272
Query: 187 GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
+ C PY +C+ + CQ Y+V + G AY
Sbjct: 273 ------ESCYPY--------TQQRDTCKIRHNSRSLRANGCQTPYNVDRDTFYTVG-PAY 317
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG---GPLGEHAIRIIGWGQ 303
SL E IM EIF GPV+ +M + D Y G+Y+ A P G H+++++GWG+
Sbjct: 318 SL-NREADIMAEIFHSGPVQATMRVNRDFFAYAGGVYRQTAANRMAPTGFHSVKLVGWGE 376
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E GE KYW+ ANS+ WGE G FRI
Sbjct: 377 EHNGE------KYWIAANSWGPWWGERGYFRI 402
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 85/172 (49%), Gaps = 19/172 (11%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+ G+ C PY R +C+ + CQ Y+V + G AY
Sbjct: 267 KKGVVDESCYPY--------TQQRDTCKIRHNSRSLRANGCQTPYNVDRDTFYTVG-PAY 317
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG---GPLGEHAIRIIGWGQ 445
SL E IM EIF GPV+ +M + D Y G+Y+ A P G H+++++GWG+
Sbjct: 318 SL-NREADIMAEIFHSGPVQATMRVNRDFFAYAGGVYRQTAANRMAPTGFHSVKLVGWGE 376
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
E GE KYW+ ANS+ WGE G FRI+RG NECGIE + A P +
Sbjct: 377 EHNGE------KYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWPYV 422
>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
Length = 431
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 101/309 (32%), Positives = 151/309 (48%), Gaps = 35/309 (11%)
Query: 32 DLSKAFDRVDHSI-LLPKLPFYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQ 89
DL D + HS+ + +L + + + SE L++R+G +K P R+ + +
Sbjct: 124 DLCLTDDAIVHSVNSIHRLGWSARKYDQWWGRKYSEGLKLRLG----TKEPTYRVKAMTR 179
Query: 90 LSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
L +P + LP F+A W I E+ DQG CG+ W L SDR I S+GK +V+
Sbjct: 180 LKNPTDGLPSSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQ 237
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGS 209
LS+ +++SC + GC+GG AW+Y G+V + C PY
Sbjct: 238 LSAQNILSCTRR-QQGCEGGHLDAAWRYLHKKGVVD-------ENCYPY--------TQH 281
Query: 210 HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
+C+ + CQ +V D L AYSL E IM EIF GPV+ +M
Sbjct: 282 RDTCKIRHNSRSLRANGCQKPVNVD-RDSLYTVGPAYSL-NREADIMAEIFHSGPVQATM 339
Query: 270 TIYADMILYKTGIYKHVAG---GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
+ D Y G+Y+ A P G H+++++GWG+E GE KYW+ ANS+ +
Sbjct: 340 RVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGE------KYWIAANSWGSW 393
Query: 327 WGENGLFRI 335
WGE+G FRI
Sbjct: 394 WGEHGYFRI 402
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 86/172 (50%), Gaps = 19/172 (11%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+ G+ C PY R +C+ + CQ +V D L AY
Sbjct: 267 KKGVVDENCYPY--------TQHRDTCKIRHNSRSLRANGCQKPVNVD-RDSLYTVGPAY 317
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG---GPLGEHAIRIIGWGQ 445
SL E IM EIF GPV+ +M + D Y G+Y+ A P G H+++++GWG+
Sbjct: 318 SL-NREADIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGE 376
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
E GE KYW+ ANS+ + WGE+G FRI+RG NECGIE + A P +
Sbjct: 377 EHNGE------KYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLASWPYV 422
>gi|17510377|ref|NP_490763.1| Protein Y65B4A.2 [Caenorhabditis elegans]
gi|373220066|emb|CCD71920.1| Protein Y65B4A.2 [Caenorhabditis elegans]
Length = 421
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 134/286 (46%), Gaps = 64/286 (22%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++P+ FDAR WP CP+I + +QG CGS +A+ A SDR CI S G LS +D+
Sbjct: 137 DVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEEDI 196
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE------IPCERYMNGS 209
+ CC CGN C GG KA YWV G+V+GG + GCRPY +PC
Sbjct: 197 IGCCSVCGN-CYGGDPLKALTYWVNQGLVTGG----RDGCRPYSFDLSCGVPC-----SP 246
Query: 210 HSSCQDNEPNTPECIRKCQP-GYDVSYEDDLNFGRIAYSL--------PANEE------- 253
+ + E T C+++CQ Y YE+D +F AYS+ P +E
Sbjct: 247 ATFFEAEEKRT--CMKRCQNIYYQQKYEEDKHFATFAYSMYPRSMTVSPDGKERVKVPTI 304
Query: 254 -------------------TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE- 293
I +EI +GP + + + + Y +G+++ +
Sbjct: 305 IGHFNDKKTEKLNVTEYRDIIKKEILLYGPTTMAFPVPEEFLHYSSGVFRPYPTDGFDDR 364
Query: 294 ----HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H +R+IGWG+ G YWL NSF +WG+NGLF+I
Sbjct: 365 IVYWHVVRLIGWGESDDG------THYWLAVNSFGNHWGDNGLFKI 404
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 82/202 (40%), Gaps = 62/202 (30%)
Query: 325 TNWGENGLF---RIGCRPYE------IPCERYMNGSRSSCQANEPNTPECIRKCQP-GYD 374
T W GL R GCRPY +PC + +A E T C+++CQ Y
Sbjct: 216 TYWVNQGLVTGGRDGCRPYSFDLSCGVPC-----SPATFFEAEEKRT--CMKRCQNIYYQ 268
Query: 375 VSYEDDLNFGRIAYSL--------PANEE--------------------------TIMRE 400
YE+D +F AYS+ P +E I +E
Sbjct: 269 QKYEEDKHFATFAYSMYPRSMTVSPDGKERVKVPTIIGHFNDKKTEKLNVTEYRDIIKKE 328
Query: 401 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-----HAIRIIGWGQEPLGEGTSSV 455
I +GP + + + + Y +G+++ + H +R+IGWG+ G
Sbjct: 329 ILLYGPTTMAFPVPEEFLHYSSGVFRPYPTDGFDDRIVYWHVVRLIGWGESDDG------ 382
Query: 456 VKYWLVANSFNTNWGENGLFRI 477
YWL NSF +WG+NGLF+I
Sbjct: 383 THYWLAVNSFGNHWGDNGLFKI 404
>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
Length = 327
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 102/294 (34%), Positives = 139/294 (47%), Gaps = 38/294 (12%)
Query: 52 YGAEKNALSKLTLSELE----MRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW 107
YG + SK +LE +R+G + + P+ ++ DP LP FD+ W
Sbjct: 32 YGWSASNYSKFWGHKLEEGIKLRLGTLQPQRFVMHMNPVR-RIYDP-NSLPREFDSEFKW 89
Query: 108 PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQ 167
P + EI+DQG CGS WA+ SDR I S+G+ V LS+ L+SC + C
Sbjct: 90 P--GWMSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLLSCDRRGQQSCN 147
Query: 168 GGFHGKAWKYWVTTGIVSGGTY---ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECI 224
GG+ +AW Y G+V + A+ + CR IP R + ++CQ
Sbjct: 148 GGYLDRAWSYIRKIGLVDEQCFPYSATNEKCR---IP--RRGDLVTANCQLPTNVDRRSK 202
Query: 225 RKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK 284
K P Y V NE IM EI GPV+ +M +Y D YK GIY+
Sbjct: 203 YKVAPAYRV----------------GNETDIMYEILHSGPVQATMKVYHDFFTYKRGIYR 246
Query: 285 H---VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H G H++RI+GWG+E EG + KYW VANS+ WGENG FRI
Sbjct: 247 HSPISTNDRTGYHSVRIVGWGEEYSPEG---LKKYWKVANSWGPEWGENGYFRI 297
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 65/108 (60%), Gaps = 6/108 (5%)
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV---AGGPLGEHAIRIIGWGQEPLG 449
NE IM EI GPV+ +M +Y D YK GIY+H G H++RI+GWG+E
Sbjct: 213 NETDIMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSP 272
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
EG + KYW VANS+ WGENG FRI+RG NEC IE+ + ++
Sbjct: 273 EG---LKKYWKVANSWGPEWGENGYFRILRGSNECEIESFVLGTWAEV 317
>gi|219565128|dbj|BAH04068.1| cathepsin B [Equus caballus]
Length = 162
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 69/120 (57%), Positives = 87/120 (72%), Gaps = 2/120 (1%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V ++ + LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +
Sbjct: 43 KLPQRVWFAEDVV-LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRT 101
Query: 143 RGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
G V +S++D+++CC D CG+GC GGF +AW +W G+VSGG Y S GCRPY IP
Sbjct: 102 NGHVSVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIP 161
>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
Length = 450
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 95/281 (33%), Positives = 136/281 (48%), Gaps = 38/281 (13%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
+ R+G K QN +L++ ELP FDAR WP I +RDQG C S W
Sbjct: 158 MRYRLGTLFPDKSVQNMNEILMKP----RELPSSFDAREKWPL--YIHPVRDQGDCASSW 211
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
+ +DR+ I + G+ ++ LS+ L+SC + GC+GG+ +AW Y G+VS
Sbjct: 212 SHSTTATSADRLSIITDGRVNIPLSAQQLLSCNQHRQRGCEGGYLDRAWWYIRKLGVVSE 271
Query: 187 GTYASKQGC--RPYE--IPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
Y + G +P E IP Y G+H C + P R P
Sbjct: 272 LCYPYESGATQQPGECRIPKSAYRTGAHIDCPSGAAD-PSVYRMTPP------------- 317
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--------AGGPLGEH 294
Y + + E+ IM EI +GPV+ + +Y D +Y G+Y+H+ G H
Sbjct: 318 ---YRVSSREQDIMTEIITNGPVQATFLVYEDFFMYSGGVYQHLDLHEHKEEERKVQGYH 374
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++RIIGWG++ T VKYWL ANS+ WGE+GLFRI
Sbjct: 375 SVRIIGWGED---YSTGPQVKYWLAANSWGNEWGEDGLFRI 412
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 68/117 (58%), Gaps = 11/117 (9%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--------AGGPLGEHAIR 439
Y + + E+ IM EI +GPV+ + +Y D +Y G+Y+H+ G H++R
Sbjct: 318 YRVSSREQDIMTEIITNGPVQATFLVYEDFFMYSGGVYQHLDLHEHKEEERKVQGYHSVR 377
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
IIGWG++ T VKYWL ANS+ WGE+GLFRI+RG+N C IE+ + K
Sbjct: 378 IIGWGED---YSTGPQVKYWLAANSWGNEWGEDGLFRILRGENHCEIESFVIGAWGK 431
>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
Length = 430
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 101/309 (32%), Positives = 150/309 (48%), Gaps = 36/309 (11%)
Query: 32 DLSKAFDRVDHSI-LLPKLPFYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQ 89
DL D + HS+ + +L + + + SE L++R+G +K P R+ + +
Sbjct: 124 DLCLTDDAIVHSVNSIHRLGWSARKYDQWWGRKYSEGLKLRLG----TKEPTYRVKAMTR 179
Query: 90 LSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
L +P + LP F+A W I E+ DQG CG+ W L SDR I S+GK +V+
Sbjct: 180 LKNPTDGLPNSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQ 237
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGS 209
LS+ +++SC + GC+GG AW+Y G+V Y Q +I R + +
Sbjct: 238 LSAQNILSCTRR-QQGCEGGHLDAAWRYLHKKGVVDENCYPYTQHRDTCKIRHSRSLKAN 296
Query: 210 HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
CQ +V D L AYSL E IM EIF GPV+ +M
Sbjct: 297 ----------------GCQKPVNVD-RDSLYTVGPAYSL-NREADIMAEIFHSGPVQATM 338
Query: 270 TIYADMILYKTGIYKHVAG---GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
+ D Y G+Y+ A P G H+++++GWG+E GE KYW+ ANS+ +
Sbjct: 339 RVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGE------KYWIAANSWGSW 392
Query: 327 WGENGLFRI 335
WGE+G FRI
Sbjct: 393 WGEHGYFRI 401
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 69/114 (60%), Gaps = 10/114 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG---GPLGEHAIRIIGW 443
AYSL E IM EIF GPV+ +M + D Y G+Y+ A P G H+++++GW
Sbjct: 315 AYSL-NREADIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGW 373
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
G+E GE KYW+ ANS+ + WGE+G FRI+RG NECGIE + A P +
Sbjct: 374 GEEHNGE------KYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLASWPYV 421
>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 483
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/302 (34%), Positives = 141/302 (46%), Gaps = 55/302 (18%)
Query: 51 FYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYC 110
FYG KL + R+G H + L ++ E+LPE FDARI W
Sbjct: 164 FYG-------KLLEDGIRYRLGTHQPERPTAEMNELHLKKR---EQLPEEFDARIRWS-- 211
Query: 111 PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNG----- 165
+ +RDQG C + WA SDR+ I SRG V LS DL+SC NG
Sbjct: 212 GLVHGVRDQGDCANSWAFSTAAVASDRLSIQSRGVDKVELSPQDLMSCL----NGGRRVV 267
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSS----CQDNEPNTP 221
CQGG + W++ + G VS + C PYE G HSS C+ P
Sbjct: 268 CQGGHPDRGWRFLLNYGGVS-------EECYPYE--------GVHSSANATCRIPRRRDP 312
Query: 222 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 281
+C G + +F Y +PANEE IM+EI+ +GPV+ + + D LY++G
Sbjct: 313 IEDARCPTGRT----EQKHFSTPPYRVPANEEDIMQEIYANGPVQALILVKEDFFLYRSG 368
Query: 282 IYKHVAGGP--------LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
+Y+H G H++RI+GWG + +KYWL ANS+ WGENG F
Sbjct: 369 VYRHTRIAESLRPQYSRSGWHSVRILGWG---VDRSQYRPIKYWLCANSWGHGWGENGYF 425
Query: 334 RI 335
RI
Sbjct: 426 RI 427
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 49/119 (41%), Positives = 71/119 (59%), Gaps = 11/119 (9%)
Query: 382 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-------- 433
+F Y +PANEE IM+EI+ +GPV+ + + D LY++G+Y+H
Sbjct: 327 HFSTPPYRVPANEEDIMQEIYANGPVQALILVKEDFFLYRSGVYRHTRIAESLRPQYSRS 386
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITA 492
G H++RI+GWG + +KYWL ANS+ WGENG FRIVRG++E IE+ + A
Sbjct: 387 GWHSVRILGWG---VDRSQYRPIKYWLCANSWGHGWGENGYFRIVRGEDESQIESFVLA 442
>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
Length = 443
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 111/341 (32%), Positives = 167/341 (48%), Gaps = 43/341 (12%)
Query: 14 KDLDLSQSSRNHSNGVFCDLSKAFDRV----DHSILLPKL--------PFYGAEKNALSK 61
K+ D Q+ + + N C L V + ++ P+L P G + S+
Sbjct: 106 KNYDQEQTFKVNCNTCKCTLVDKRAEVLCEENRCLIEPELLEEVNQQEPILGWQVGNYSE 165
Query: 62 L---TLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIR 117
TL + +E+R+G S+ P+ ++ DP + LP FD+R W I I
Sbjct: 166 FWGRTLRDGVELRLGTLNPSQSVYKMNPV-KRIYDP-DALPREFDSRTRWSR--DISGIH 221
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQG CG+ WA+ + SDR I S+G LS+ L+SC GC+GG+ +AW +
Sbjct: 222 DQGWCGASWAVSTADVASDRYSIMSKGAEAPELSAQQLLSCNNRGQQGCRGGYLDRAWLF 281
Query: 178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
G+V + C P+ +G + C+ + +T + +P + + E
Sbjct: 282 MRKFGLVD-------KECYPW--------SGKNDQCKLRKRSTLKAAGCRKPSHPLRTE- 325
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---GEH 294
L AY L NE IM+EI GPV+ +M +Y D +YK+GIY+H L G H
Sbjct: 326 -LYKVGPAYRL-GNETDIMQEILTSGPVQATMRVYQDFFIYKSGIYRHSRSAELHDSGYH 383
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++RIIGWG+E G +KYWLVANS+ NWG+NGLF+I
Sbjct: 384 SVRIIGWGEERSYRGPP--LKYWLVANSWGYNWGDNGLFKI 422
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 56/113 (49%), Positives = 73/113 (64%), Gaps = 6/113 (5%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---GEHAIRIIGW 443
AY L NE IM+EI GPV+ +M +Y D +YK+GIY+H L G H++RIIGW
Sbjct: 332 AYRL-GNETDIMQEILTSGPVQATMRVYQDFFIYKSGIYRHSRSAELHDSGYHSVRIIGW 390
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
G+E G +KYWLVANS+ NWG+NGLF+I +G NEC IE+ + A K
Sbjct: 391 GEERSYRGPP--LKYWLVANSWGYNWGDNGLFKIQKGTNECEIESYVLAVWAK 441
>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
latipes]
Length = 474
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/308 (31%), Positives = 143/308 (46%), Gaps = 24/308 (7%)
Query: 40 VDHSILLPKLPFYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELP 98
+ H++ + A + ++L E + R+G S+ N + +++ + LP
Sbjct: 144 IIHAVNRGNYGWKAANYSQFFGMSLDEGIRYRLGTQRPSRTVMNMNEIQMKMDPQNDHLP 203
Query: 99 EGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSC 158
F++ WP I E DQG+C + WA SDR+ I S G +LS +L+SC
Sbjct: 204 RYFNSSEKWP--NKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISC 261
Query: 159 CKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCE--RYMNGSHSSCQDN 216
GC GG AW Y G+V+ Y + P + P E R M S + +
Sbjct: 262 DTRNQGGCAGGRIDGAWWYLRRRGVVTENCYPYQP---PQQAPAEVGRCMMQSRAVGRGK 318
Query: 217 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 276
T C +Y +D+ Y L +NE+ IM+EI +GPV+ M ++ D
Sbjct: 319 RQATQRCPNT------YNYHNDIYQSTPPYKLSSNEKEIMKEIMENGPVQAIMEVHEDFF 372
Query: 277 LYKTGIYKHVAGGPL--------GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
+YK GIYKH G H++RI GWG++ +GT KYW+ ANS+ NWG
Sbjct: 373 VYKNGIYKHTDVSSTKPPQYRKHGTHSVRITGWGEDKDYDGTPR--KYWIAANSWGKNWG 430
Query: 329 ENGLFRIG 336
ENG FRI
Sbjct: 431 ENGFFRIA 438
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 68/184 (36%), Positives = 93/184 (50%), Gaps = 22/184 (11%)
Query: 331 GLFRIGCRPYEIPCE------RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFG 384
G+ C PY+ P + R M SR+ + T C +Y +D+
Sbjct: 285 GVVTENCYPYQPPQQAPAEVGRCMMQSRAVGRGKRQATQRCPNT------YNYHNDIYQS 338
Query: 385 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL--------GEH 436
Y L +NE+ IM+EI +GPV+ M ++ D +YK GIYKH G H
Sbjct: 339 TPPYKLSSNEKEIMKEIMENGPVQAIMEVHEDFFVYKNGIYKHTDVSSTKPPQYRKHGTH 398
Query: 437 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
++RI GWG++ +GT KYW+ ANS+ NWGENG FRI RG NEC IEA + +
Sbjct: 399 SVRITGWGEDKDYDGTPR--KYWIAANSWGKNWGENGFFRIARGANECEIEAFVIGVWGR 456
Query: 497 IGLE 500
I LE
Sbjct: 457 ISLE 460
>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
Length = 387
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 97/274 (35%), Positives = 139/274 (50%), Gaps = 37/274 (13%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
L R+G +K P R+ + +L + ++ LP F++ W I ++ DQG CGS W
Sbjct: 116 LTKRLG----TKEPTYRVKAMSRLHNIVDHLPRSFNSIDKWA--SYISDVLDQGWCGSSW 169
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
+ SDR I SRGK ++LS +++SC + GC GG AW+Y G+V
Sbjct: 170 VISTASVASDRFAIQSRGKEVIQLSPQNILSCTRR-QQGCNGGHLDAAWRYLHKQGVVD- 227
Query: 187 GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRK--CQPGYDVSYEDDLNFGRI 244
+ C PY G +C+ P+ +R C+ Y D+L
Sbjct: 228 ------ESCYPYV--------GYRDACK--IPHNSRSLRNNGCR-SYSGVDRDELYTVGP 270
Query: 245 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGW 301
AYSL NE IM EIF GPV+ ++T+Y D Y GIY+H A G P+G H++++IGW
Sbjct: 271 AYSL-NNETDIMAEIFMSGPVQATLTVYRDFFSYSGGIYRHTAASRGSPVGFHSVKLIGW 329
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+E G KYW+ NS+ T WGE+G FRI
Sbjct: 330 GEEHDGN------KYWIATNSWGTWWGEHGNFRI 357
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 69/174 (39%), Positives = 94/174 (54%), Gaps = 23/174 (13%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRK--CQPGYDVSYEDDLNFGRI 386
+ G+ C PY G R +C+ P+ +R C+ Y D+L
Sbjct: 222 KQGVVDESCYPYV--------GYRDACKI--PHNSRSLRNNGCR-SYSGVDRDELYTVGP 270
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGW 443
AYSL NE IM EIF GPV+ ++T+Y D Y GIY+H A G P+G H++++IGW
Sbjct: 271 AYSL-NNETDIMAEIFMSGPVQATLTVYRDFFSYSGGIYRHTAASRGSPVGFHSVKLIGW 329
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
G+E G KYW+ NS+ T WGE+G FRI+RG NECGIE + A P +
Sbjct: 330 GEEHDGN------KYWIATNSWGTWWGEHGNFRILRGSNECGIEEYVLAAWPNV 377
>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
Length = 484
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 101/309 (32%), Positives = 150/309 (48%), Gaps = 35/309 (11%)
Query: 32 DLSKAFDRVDHSI-LLPKLPFYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQ 89
DL D + HS+ + +L + + + SE L++R+G +K P R+ + +
Sbjct: 124 DLCLTDDAIVHSVNSINRLGWSARKYDQWWGRKYSEGLKLRLG----TKEPTYRVKAMTR 179
Query: 90 LSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
L +P + LP F+A W I E+ DQG CG+ W L SDR I S+GK V+
Sbjct: 180 LRNPTDGLPSSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQ 237
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGS 209
LS+ +++SC + GC+GG AW+Y G+V + C PY
Sbjct: 238 LSAQNILSCTRR-QQGCEGGHLDAAWRYLHKKGVVD-------ENCYPY--------TQH 281
Query: 210 HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
+C+ + CQ +V D L AYSL E IM EIF GPV+ +M
Sbjct: 282 RDTCKIRHNSRSLRANGCQTPVNVD-RDTLYTVGPAYSL-NREADIMAEIFHSGPVQATM 339
Query: 270 TIYADMILYKTGIYKHVAG---GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
+ D Y G+Y+ A P G H+++++GWG+E GE KYW+ ANS+ +
Sbjct: 340 RVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGE------KYWIAANSWGSW 393
Query: 327 WGENGLFRI 335
WGE+G FRI
Sbjct: 394 WGEHGYFRI 402
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 69/114 (60%), Gaps = 10/114 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG---GPLGEHAIRIIGW 443
AYSL E IM EIF GPV+ +M + D Y G+Y+ A P G H+++++GW
Sbjct: 316 AYSL-NREADIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGW 374
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
G+E GE KYW+ ANS+ + WGE+G FRI+RG NECGIE + A P +
Sbjct: 375 GEEHNGE------KYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLASWPYV 422
>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 348
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 89/258 (34%), Positives = 124/258 (48%), Gaps = 22/258 (8%)
Query: 94 LEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
L ++P FDAR + C I + DQ +C S WA+ V+A S R+CI S GK + LS+
Sbjct: 80 LADIPSSFDARDAFKECKDVIGHVWDQSACASCWAIAPVQAFSARLCIKSGGKFNQLLSA 139
Query: 153 DDLVSCCK---DC-GNGCQGGFHGKAWKYWVTTGIVSGGTYASKQ------GCRPYEIP- 201
+L++CC C GC+GG AW + GI +GG + K GC PY P
Sbjct: 140 GELLACCNLAHSCEARGCKGGVARDAWVFLNKHGIATGGDFVPKSSMEAVDGCWPYNFPR 199
Query: 202 CERYMNGS-HSSCQDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANE-ETIMRE 258
C Y S + C TP C+ +C Y + D +F A N +I +E
Sbjct: 200 CAHYQKKSKYGPCPKKSYETPSCLDRCPNEKYGTPLDKDRHFTARAVPYWFNGIRSIKKE 259
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWL 318
I +HGP S Y D YK+G+YK+ +G + H + +IGW GT V YWL
Sbjct: 260 IMKHGPTSASFFTYEDFFSYKSGVYKYTSGAYVEFHTVELIGW-------GTEKGVDYWL 312
Query: 319 VANSFNTNWGENGLFRIG 336
N +N W + G F+I
Sbjct: 313 AKNDWNEEWADLGTFKIA 330
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 77/164 (46%), Gaps = 14/164 (8%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPA 392
GC PY P C Y S+ C TP C+ +C Y + D +F A
Sbjct: 191 GCWPYNFPRCAHYQKKSKYGPCPKKSYETPSCLDRCPNEKYGTPLDKDRHFTARAVPYWF 250
Query: 393 NE-ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
N +I +EI +HGP S Y D YK+G+YK+ +G + H + +IGW G
Sbjct: 251 NGIRSIKKEIMKHGPTSASFFTYEDFFSYKSGVYKYTSGAYVEFHTVELIGW-------G 303
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
T V YWL N +N W + G F+I +G +CGI D+ G P
Sbjct: 304 TEKGVDYWLAKNDWNEEWADLGTFKIAQG--DCGIN-DLVLGAP 344
>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
echinatior]
Length = 501
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 139/278 (50%), Gaps = 27/278 (9%)
Query: 61 KLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+ L +E+R+G S+ P+ ++ DP + LP FD+R W I + DQG
Sbjct: 227 RTLLEGVELRLGTLNPSQSVYKMNPVR-RIYDP-DALPREFDSRTRWSR--DISNVHDQG 282
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
CG+ WA+ + +DR I S+G LS+ L+SC GC+GG+ +AW +
Sbjct: 283 WCGASWAISTADVATDRFSIMSKGAEDAELSAQHLLSCNNRGQQGCRGGYLDRAWLFMRK 342
Query: 181 TGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
G+V + C P+ G + C+ + N + +P + E L
Sbjct: 343 FGLVD-------KDCYPW--------TGKNGQCKLRKRNNLQAAGCRKPPNPLRTE--LY 385
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---GEHAIR 297
AY L NE IM+EI GPV+ +M +Y D +YK GIY+H L G H++R
Sbjct: 386 KVGPAYRL-GNETDIMQEILTSGPVQATMRVYQDFFVYKNGIYRHSQSAELHDSGYHSVR 444
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IIGWG+E G +KYWLV NS+ NWGENGLF+I
Sbjct: 445 IIGWGEERSYRGPP--LKYWLVVNSWGYNWGENGLFKI 480
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 71/113 (62%), Gaps = 6/113 (5%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---GEHAIRIIGW 443
AY L NE IM+EI GPV+ +M +Y D +YK GIY+H L G H++RIIGW
Sbjct: 390 AYRL-GNETDIMQEILTSGPVQATMRVYQDFFVYKNGIYRHSQSAELHDSGYHSVRIIGW 448
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
G+E G +KYWLV NS+ NWGENGLF+I RG NEC IE+ + A K
Sbjct: 449 GEERSYRGPP--LKYWLVVNSWGYNWGENGLFKIQRGTNECEIESYVLAVWAK 499
>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
Length = 432
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 95/272 (34%), Positives = 136/272 (50%), Gaps = 34/272 (12%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
L +R+G +K P R+ + +L++P + LP F+A W I E+ DQG CGS W
Sbjct: 163 LRLRLG----TKEPTYRVKTMTRLTNPTDGLPASFNAVDKWS--RYISEVPDQGWCGSSW 216
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
L SDR I S+GK V+LS +++SC + GC+GG AW+Y G++
Sbjct: 217 VLSTTSVASDRFAIQSQGKEVVQLSPQNILSCTRR-QQGCEGGHLDAAWRYLHKKGVLD- 274
Query: 187 GTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246
+ C PY S +C+ + + C+P V D L AY
Sbjct: 275 ------ESCYPY--------TQSRGTCKVRHSGSLK-AHGCRPAPGVD-RDSLYTVGPAY 318
Query: 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQ 303
SL + E I EIF GPV+ +M +Y D Y GIY+ A G P G H+++++GWG+
Sbjct: 319 SL-SREADIKAEIFHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGE 377
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
E G+ KYW+ ANS+ WGE G FRI
Sbjct: 378 EHNGD------KYWIAANSWGPWWGERGYFRI 403
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/172 (37%), Positives = 88/172 (51%), Gaps = 20/172 (11%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+ G+ C PY SR +C+ + + C+P V D L AY
Sbjct: 269 KKGVLDESCYPY--------TQSRGTCKVRHSGSLK-AHGCRPAPGVD-RDSLYTVGPAY 318
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQ 445
SL + E I EIF GPV+ +M +Y D Y GIY+ A G P G H+++++GWG+
Sbjct: 319 SL-SREADIKAEIFHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGE 377
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
E G+ KYW+ ANS+ WGE G FRI+RG NECGIE + A P +
Sbjct: 378 EHNGD------KYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWPYV 423
>gi|38048307|gb|AAR10056.1| similar to Drosophila melanogaster CG10992, partial [Drosophila
yakuba]
Length = 174
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 74/144 (51%), Positives = 89/144 (61%), Gaps = 3/144 (2%)
Query: 44 ILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDS---KLPQNRLPLLVQLSDPLEELPEG 100
++ K + +N + +T + MGVHPD+ L R L + ++E+PE
Sbjct: 31 LVRSKAKTWTVGRNFDASVTEGHIRRLMGVHPDAHKFALADKREVLGDLYMNSVDEIPEE 90
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FD+R WP CPTI EIRDQGSCGS WA GAVEAMSDRVCI S GK + S+DDLVSCC
Sbjct: 91 FDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCCH 150
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIV 184
CG GC GGF G AW YW GIV
Sbjct: 151 TCGFGCNGGFPGAAWSYWTRKGIV 174
>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
[Tribolium castaneum]
Length = 453
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 102/294 (34%), Positives = 139/294 (47%), Gaps = 38/294 (12%)
Query: 52 YGAEKNALSKLTLSELE----MRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW 107
YG + SK +LE +R+G + + P+ ++ DP LP FD+ W
Sbjct: 158 YGWSASNYSKFWGHKLEEGIKLRLGTLQPQRFVMHMNPVR-RIYDP-NSLPREFDSEFKW 215
Query: 108 PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQ 167
P + EI+DQG CGS WA+ SDR I S+G+ V LS+ L+SC + C
Sbjct: 216 P--GWMSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLLSCDRRGQQSCN 273
Query: 168 GGFHGKAWKYWVTTGIVSGGTY---ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECI 224
GG+ +AW Y G+V + A+ + CR IP R + ++CQ
Sbjct: 274 GGYLDRAWSYIRKIGLVDEQCFPYSATNEKCR---IP--RRGDLVTANCQLPTNVDRRSK 328
Query: 225 RKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK 284
K P Y V NE IM EI GPV+ +M +Y D YK GIY+
Sbjct: 329 YKVAPAYRV----------------GNETDIMYEILHSGPVQATMKVYHDFFTYKRGIYR 372
Query: 285 H---VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H G H++RI+GWG+E EG + KYW VANS+ WGENG FRI
Sbjct: 373 HSPISTNDRTGYHSVRIVGWGEEYSPEG---LKKYWKVANSWGPEWGENGYFRI 423
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 65/108 (60%), Gaps = 6/108 (5%)
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV---AGGPLGEHAIRIIGWGQEPLG 449
NE IM EI GPV+ +M +Y D YK GIY+H G H++RI+GWG+E
Sbjct: 339 NETDIMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSP 398
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
EG + KYW VANS+ WGENG FRI+RG NEC IE+ + ++
Sbjct: 399 EG---LKKYWKVANSWGPEWGENGYFRILRGSNECEIESFVLGTWAEV 443
>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
Length = 576
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 141/289 (48%), Gaps = 51/289 (17%)
Query: 63 TLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
TL E R+G K +N +L+++S+ LPE FDAR WP I +RDQG
Sbjct: 279 TLDEGFSYRLGTLLPEKSVKNMNEILIEMSN---FLPESFDARERWP--SFIHPVRDQGD 333
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
C S WA +DR+ I S GK + LS L+SC + GC GG+ +AW
Sbjct: 334 CASSWAFSTTAVSADRLAIQSGGKFYNPLSVQQLLSCNQARQRGCNGGYLDRAW------ 387
Query: 182 GIVSGG--TYASKQGCRPYE--IPCERYMNGS---HSSCQDNEPNTPECIRKCQPGYDVS 234
+VS TY S Q +P E IP Y++G S DN + K P Y +S
Sbjct: 388 CVVSDECYTYTSGQTNQPGECHIPRTAYLDGEIRCPSGSADNR------VYKMTPPYRIS 441
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA----GGP 290
NE IM EI +GPV+ + ++ D +YK+G+Y+H+ GP
Sbjct: 442 ---------------TNEREIMTEIMANGPVQATFLVHEDFFMYKSGVYQHLPYANDKGP 486
Query: 291 L----GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H++RI+GWG + T +KYWL ANS+ WGENGLFRI
Sbjct: 487 AYARSGYHSVRILGWG---VDHSTGVPIKYWLCANSWGEEWGENGLFRI 532
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 70/117 (59%), Gaps = 11/117 (9%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA----GGPL----GEHAIR 439
Y + NE IM EI +GPV+ + ++ D +YK+G+Y+H+ GP G H++R
Sbjct: 438 YRISTNEREIMTEIMANGPVQATFLVHEDFFMYKSGVYQHLPYANDKGPAYARSGYHSVR 497
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
I+GWG + T +KYWL ANS+ WGENGLFRI+RG+N C IE+ I K
Sbjct: 498 ILGWG---VDHSTGVPIKYWLCANSWGEEWGENGLFRILRGENHCDIESFIIGAWGK 551
>gi|325303156|tpg|DAA34330.1| TPA_inf: cysteine proteinase cathepsin L [Amblyomma variegatum]
Length = 207
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 75/129 (58%), Positives = 88/129 (68%), Gaps = 6/129 (4%)
Query: 71 MGVHPDSKLPQNRLPLLVQLSDPL-EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALG 129
+GV P++ L RLP + L LPE FDAR WP CPTI EIRDQGSCGS WA G
Sbjct: 80 LGVRPENSL--YRLPERTLDVNALPTALPENFDAREQWPDCPTIGEIRDQGSCGSCWAFG 137
Query: 130 AVEAMSDRVCIASRGKR---HVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
AVEAMSDR CI S ++ +V L++DD++SCCKDCG GC GGF G AW YWV GIV G
Sbjct: 138 AVEAMSDRTCIHSPARKPRVNVHLAADDVLSCCKDCGAGCNGGFPGAAWSYWVHHGIVDG 197
Query: 187 GTYASKQGC 195
G Y + +GC
Sbjct: 198 GHYDTDEGC 206
>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
Length = 311
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 91/260 (35%), Positives = 128/260 (49%), Gaps = 36/260 (13%)
Query: 77 SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSD 136
+K P+N P V +S +P FD+R NWP C + + +QG CGS WA A E++SD
Sbjct: 64 TKKPRNT-PEEVSVSK--VAVPNSFDSRTNWPGC--VHAVLNQGQCGSCWAFAASESLSD 118
Query: 137 RVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCR 196
R+CIAS+G +V LS LVSC + GC GG AW+Y GI + C
Sbjct: 119 RLCIASQGAINVTLSPQALVSCDIEFNQGCNGGIPQMAWEYLELHGIPT-------DSCF 171
Query: 197 PYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 256
PY P+C ++C G F S A I
Sbjct: 172 PYT---------------SGNGTAPDCQKECSDGSKYQLYKGKTFTLKTCSSVA---AIQ 213
Query: 257 REIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVK 315
+F +GP+EG+M +Y D + Y +G+Y G LG HAI+I+GWG + ++S +
Sbjct: 214 ANVFAYGPIEGTMDVYQDFMSYTSGVYVMTPGSKLLGGHAIKIVGWGTD-----STSGLD 268
Query: 316 YWLVANSFNTNWGENGLFRI 335
YW+V NS+ ++WG NG F I
Sbjct: 269 YWIVQNSWGSDWGMNGFFWI 288
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 74/136 (54%), Gaps = 9/136 (6%)
Query: 363 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 422
P+C ++C G F S A I +F +GP+EG+M +Y D + Y +
Sbjct: 181 PDCQKECSDGSKYQLYKGKTFTLKTCSSVA---AIQANVFAYGPIEGTMDVYQDFMSYTS 237
Query: 423 GIYKHVAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ 481
G+Y G L G HAI+I+GWG + ++S + YW+V NS+ ++WG NG F I RG
Sbjct: 238 GVYVMTPGSKLLGGHAIKIVGWGTD-----STSGLDYWIVQNSWGSDWGMNGFFWIQRGT 292
Query: 482 NECGIEADITAGLPKI 497
N CGI+ D +AG I
Sbjct: 293 NMCGIDRDASAGQADI 308
>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Apis mellifera]
Length = 439
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 101/288 (35%), Positives = 138/288 (47%), Gaps = 33/288 (11%)
Query: 51 FYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYC 110
F+G + KL L L V+ R+ + ++ DP E LP FDAR W
Sbjct: 160 FWGKRLSEGVKLRLGTLNPSNSVY--------RMNSVRRVYDP-ESLPREFDARTRWRR- 209
Query: 111 PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF 170
I + DQG CG+ WA+ + SDR + S+G V LS+ L+SC K GC GG+
Sbjct: 210 -QISGVDDQGWCGASWAISTAQVASDRFAVMSKGTDSVLLSAQHLLSCNKKGQRGCDGGY 268
Query: 171 HGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
+AW + G+V + C P++ G + C+ + E P
Sbjct: 269 LDRAWLFMRKFGLVD-------EQCYPWK--------GVYEQCKLQKRTNLEAAGCRAPA 313
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
+ E L AY L NE IMREI GPV+ +M +Y D Y++GIY H
Sbjct: 314 NPLRKE--LYKVGPAYRL-GNETDIMREILTSGPVQATMKVYQDFFSYESGIYMHTPIAE 370
Query: 291 L---GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L G H++RIIGWG E + + +KYWLV NS+ WGENGLFRI
Sbjct: 371 LYESGYHSVRIIGWG-EDISTDSGLPIKYWLVVNSWGQEWGENGLFRI 417
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 70/116 (60%), Gaps = 5/116 (4%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---GEHAIRIIGW 443
AY L NE IMREI GPV+ +M +Y D Y++GIY H L G H++RIIGW
Sbjct: 326 AYRL-GNETDIMREILTSGPVQATMKVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGW 384
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGL 499
G E + + +KYWLV NS+ WGENGLFRI RG NEC IE+ + A K +
Sbjct: 385 G-EDISTDSGLPIKYWLVVNSWGQEWGENGLFRIRRGINECDIESFVVAVWAKTNV 439
>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
Length = 431
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 99/309 (32%), Positives = 148/309 (47%), Gaps = 35/309 (11%)
Query: 32 DLSKAFDRVDHSI-LLPKLPFYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQ 89
DL D + HS+ + +L + + + SE L++R+G +K P R+ + +
Sbjct: 124 DLCLTDDAIIHSVNSISRLGWSAHKYDQWWGRKYSEGLKLRLG----TKEPTYRVKAMTR 179
Query: 90 LSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
L +P + LP F+A W I E+ DQG CG+ W L SDR I S+GK V+
Sbjct: 180 LRNPTDGLPRSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKETVQ 237
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGS 209
LS+ +++SC + GC GG AW+Y G+V + C PY
Sbjct: 238 LSAQNILSCTRR-QQGCDGGHLDAAWRYLHKKGVVD-------ESCYPY--------TQH 281
Query: 210 HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
+C+ + C+ +V + G AYSL E IM EIF GPV+ +M
Sbjct: 282 RDTCKIRHNSRSLRANGCETPVNVDRDTFYTVG-PAYSL-NREADIMAEIFNSGPVQATM 339
Query: 270 TIYADMILYKTGIYKHVAG---GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
+ D Y G+Y+ A P G H+++++GWG+E GE KYW+ ANS+ +
Sbjct: 340 RVNRDFFSYSRGVYRQTAANREAPTGFHSVKLVGWGEEHNGE------KYWIAANSWGSW 393
Query: 327 WGENGLFRI 335
WGE G FRI
Sbjct: 394 WGEKGYFRI 402
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 68/114 (59%), Gaps = 10/114 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG---GPLGEHAIRIIGW 443
AYSL E IM EIF GPV+ +M + D Y G+Y+ A P G H+++++GW
Sbjct: 316 AYSL-NREADIMAEIFNSGPVQATMRVNRDFFSYSRGVYRQTAANREAPTGFHSVKLVGW 374
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
G+E GE KYW+ ANS+ + WGE G FRI+RG NECGIE + A P +
Sbjct: 375 GEEHNGE------KYWIAANSWGSWWGEKGYFRILRGSNECGIEEYVLASWPYV 422
>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 198
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 123/206 (59%), Gaps = 14/206 (6%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGCQGGFHGKAWKYWVTTG 182
S WA+ A E +SDR+CIAS K + +S+DD+ +CC CGNGC GG+ +AW+++V G
Sbjct: 1 SCWAVSAAETISDRICIASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKG 60
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQDN-EPNTPECIRKCQPGYDVSYEDDL 239
V+GG+Y K GC+PY PCE ++NG+H C N P + ++Y DL
Sbjct: 61 YVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQNANALGKLDIALTYHKDL 120
Query: 240 NFGRIAYSLPANEET--IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 297
+F I ++ PA++E I + I HG + G +T++ D Y G+Y H AG LG HA++
Sbjct: 121 HFRTILHT-PASKEAAGIPKGIKTHGQLRGGITVFEDFEHYSGGVYVHTAGASLGGHAVK 179
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSF 323
++GWG + GT YWL+ANS+
Sbjct: 180 MLGWGVD---NGT----PYWLIANSW 198
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 75/137 (54%), Gaps = 13/137 (9%)
Query: 334 RIGCRPYEIP-CERYMNGSR-SSCQAN-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 390
+ GC+PY P CE ++NG+ C +N P + ++Y DL+F I ++
Sbjct: 70 KTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQNANALGKLDIALTYHKDLHFRTILHT- 128
Query: 391 PANEET--IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
PA++E I + I HG + G +T++ D Y G+Y H AG LG HA++++GWG +
Sbjct: 129 PASKEAAGIPKGIKTHGQLRGGITVFEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVD-- 186
Query: 449 GEGTSSVVKYWLVANSF 465
GT YWL+ANS+
Sbjct: 187 -NGT----PYWLIANSW 198
>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
Length = 122
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 67/124 (54%), Positives = 96/124 (77%), Gaps = 7/124 (5%)
Query: 372 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 431
GY SY++D +FG +YS+ NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV+G
Sbjct: 1 GYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGE 60
Query: 432 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
+G HAIRI+GWG E GT YWLV NS+NT+WG+NG F+I+RGQ+ CGIE++I
Sbjct: 61 IMGGHAIRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIV 113
Query: 492 AGLP 495
AG+P
Sbjct: 114 AGMP 117
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 56/106 (52%), Positives = 80/106 (75%), Gaps = 7/106 (6%)
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
GY SY++D +FG +YS+ NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV+G
Sbjct: 1 GYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGE 60
Query: 290 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G HAIRI+GWG E GT YWLV NS+NT+WG+NG F+I
Sbjct: 61 IMGGHAIRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFKI 99
>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 288
Score = 145 bits (365), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 91/263 (34%), Positives = 127/263 (48%), Gaps = 30/263 (11%)
Query: 86 LLVQLSDPLEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
LL L++LP FDAR + C I +RDQ +C + W + + ++DRVCI S G
Sbjct: 26 LLGPTKPELKDLPSNFDARQKFASCAGVIGHVRDQSACHNCWTVSSTGMLNDRVCIKSGG 85
Query: 145 KRHVRLSSDDLVSCCKDC-----GNGCQGGFHGKAWKYWVTTGIVSG------GTYASKQ 193
LS SCC GCQGG + + GIV+G G +S
Sbjct: 86 TFRDILSVGYFTSCCNPANGCPKAKGCQGGNLLEGLNFLKNHGIVTGDEFKPAGQLSSAD 145
Query: 194 GCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANE 252
GC PY P C+ ++P C KC Y S + DL+ + LPA
Sbjct: 146 GCWPYPFP----------KCKHAGYSSPACQTKCTNKAYKTSLQQDLHRAKSFGRLPAIP 195
Query: 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 312
+ I +EIF +GPV G ++IY D+ +YK G+Y H G G H ++IIGWG E S
Sbjct: 196 QNIKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVE-------S 248
Query: 313 VVKYWLVANSFNTNWGENGLFRI 335
YWL NS+N WG++G+ ++
Sbjct: 249 GQDYWLAVNSWNEEWGDHGMIKL 271
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 81/155 (52%), Gaps = 18/155 (11%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
GC PY P ++ S +CQ +C K Y S + DL+ + LPA +
Sbjct: 146 GCWPYPFPKCKHAGYSSPACQT------KCTNK---AYKTSLQQDLHRAKSFGRLPAIPQ 196
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I +EIF +GPV G ++IY D+ +YK G+Y H G G H ++IIGWG E S
Sbjct: 197 NIKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVE-------SG 249
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADI 490
YWL NS+N WG++G+ ++ G+ GIE +
Sbjct: 250 QDYWLAVNSWNEEWGDHGMIKLAVGRT--GIENSV 282
>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 174
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/164 (44%), Positives = 103/164 (62%), Gaps = 10/164 (6%)
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIP-CERY-MNGSHSSCQDNEPNTPECIRKCQPGY 231
AW+Y+ G+V+GG Y + CRPYE P C R+ + C D TP+C + CQ GY
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDT-AKTPKCQKTCQRGY 59
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 291
+Y++D +FG+ AY LP N + I R+I ++GPV +Y D YK+GIYKH AG
Sbjct: 60 LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA++IIGWG+E +GT YWL+ANS++ +WGE G +R+
Sbjct: 120 GGHAVKIIGWGKE---KGTP----YWLIANSWHDDWGEKGFYRM 156
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 71/159 (44%), Positives = 99/159 (62%), Gaps = 8/159 (5%)
Query: 337 CRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
CRPYE P C R+ + TP+C + CQ GY +Y++D +FG+ AY LP N +
Sbjct: 22 CRPYEFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGYLKAYKEDKHFGKSAYRLPNNVK 81
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I R+I ++GPV +Y D YK+GIYKH AG G HA++IIGWG+E +GT
Sbjct: 82 AIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWGKE---KGTP-- 136
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
YWL+ANS++ +WGE G +R++RG N C IE + AG+
Sbjct: 137 --YWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173
>gi|403340695|gb|EJY69640.1| Cathepsin B [Oxytricha trifallax]
Length = 247
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 94/274 (34%), Positives = 132/274 (48%), Gaps = 45/274 (16%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
+ E++ R+G P +P Q +D + P+ FD+R W C + IRDQ CG
Sbjct: 1 MEEIKARLGTIVQG--PVEGIPEPAQHNDIV---PKTFDSREQWGNC--VHPIRDQAQCG 53
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKAWKYWVTTG 182
S WA GA E +SDR+CIAS K V LS +DLV+C D N GC GG AW Y TG
Sbjct: 54 SCWAFGASETLSDRICIASDKKTDVILSPEDLVAC--DGWNMGCNGGILPWAWSYLTNTG 111
Query: 183 IVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
V + C PY ++ P C +KCQ D +
Sbjct: 112 AV-------EDSCFPY---------------SSDKGAVPTCAKKCQNDKDSFTKYKCKKN 149
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 302
+ + + + I EI ++GP+E T+Y D + Y++G+Y H G LG HA++I+G+G
Sbjct: 150 SVVQA--SGVDKIKAEISKNGPMETGFTVYEDFMNYESGVYHHTTGNQLGGHAVKIVGYG 207
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
YW+ ANS++ WGE G F IG
Sbjct: 208 D-----------GYWICANSWSEKWGEKGFFNIG 230
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/136 (33%), Positives = 69/136 (50%), Gaps = 15/136 (11%)
Query: 362 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 421
P C +KCQ D + + + + + I EI ++GP+E T+Y D + Y+
Sbjct: 127 VPTCAKKCQNDKDSFTKYKCKKNSVVQA--SGVDKIKAEISKNGPMETGFTVYEDFMNYE 184
Query: 422 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ 481
+G+Y H G LG HA++I+G+G YW+ ANS++ WGE G F I G
Sbjct: 185 SGVYHHTTGNQLGGHAVKIVGYGD-----------GYWICANSWSEKWGEKGFFNI--GF 231
Query: 482 NECGIEADITAGLPKI 497
ECGI++ A P +
Sbjct: 232 GECGIDSAAYACTPDL 247
>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
Length = 321
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 95/244 (38%), Positives = 124/244 (50%), Gaps = 38/244 (15%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP FD+R W C I IR+Q CGS WA A E++SDR CIAS GK V LS D+V
Sbjct: 86 LPTNFDSRQQWGKC--IHPIRNQEQCGSCWAFSASESLSDRFCIASNGKVDVILSPQDMV 143
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDN 216
SC + GC GG AW + GIV C PY ++G
Sbjct: 144 SCDYN-DMGCDGGNLDNAWWWMKNKGIV-------PDSCMPY-------VSGGG------ 182
Query: 217 EPNTPECIRKCQPGYDVSYEDDL----NFGRIA-YSLPANEETIMREIFRHGPVEGSMTI 271
N P C C G ++ L +F I+ + I +EI+ +GPV+G ++
Sbjct: 183 --NVPACPSNCN-GTNIPISSQLYYAKSFSHISPWMFWERVADIQQEIYTNGPVQGGFSV 239
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
Y D + YK+G+Y H G LG HAI+IIGWG E V YWLVANS++T+WG +G
Sbjct: 240 YQDFMNYKSGVYSHKTGSFLGGHAIKIIGWGVE-------GGVDYWLVANSWSTDWGIDG 292
Query: 332 LFRI 335
F+I
Sbjct: 293 TFKI 296
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/138 (42%), Positives = 81/138 (58%), Gaps = 13/138 (9%)
Query: 361 NTPECIRKCQPGYDVSYEDDL----NFGRIA-YSLPANEETIMREIFRHGPVEGSMTIYA 415
N P C C G ++ L +F I+ + I +EI+ +GPV+G ++Y
Sbjct: 183 NVPACPSNCN-GTNIPISSQLYYAKSFSHISPWMFWERVADIQQEIYTNGPVQGGFSVYQ 241
Query: 416 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 475
D + YK+G+Y H G LG HAI+IIGWG E V YWLVANS++T+WG +G F
Sbjct: 242 DFMNYKSGVYSHKTGSFLGGHAIKIIGWGVE-------GGVDYWLVANSWSTDWGIDGTF 294
Query: 476 RIVRGQNECGIEADITAG 493
+I+RG NECGIE D+ AG
Sbjct: 295 KILRGHNECGIEDDVYAG 312
>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
Length = 431
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 100/309 (32%), Positives = 149/309 (48%), Gaps = 35/309 (11%)
Query: 32 DLSKAFDRVDHSI-LLPKLPFYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQ 89
DL D + HS+ + +L + + + SE L++R+G +K P R+ + +
Sbjct: 124 DLCLTDDAIVHSVNSINRLGWSARKYDQWWGRKYSEGLKLRLG----TKEPTYRVKAMTR 179
Query: 90 LSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
L +P + LP F+A W I E+ DQG CG+ W L SDR I S+GK V+
Sbjct: 180 LRNPTDGLPSSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQ 237
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGS 209
LS+ +++SC + GC+GG AW+Y G+V + C PY
Sbjct: 238 LSAQNILSCTRR-QQGCEGGHLDAAWRYLHKKGVVD-------ENCYPY--------TQH 281
Query: 210 HSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
+C+ + CQ +V D L AYSL E IM EIF GPV+ +M
Sbjct: 282 RDTCKIRHNSRSLRANGCQTPVNVD-RDTLYTVGPAYSL-NREADIMAEIFHSGPVQATM 339
Query: 270 TIYADMILYKTGIYKHVAGGP---LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
+ D Y G+Y+ A G H+++++GWG+E GE KYW+ ANS+ +
Sbjct: 340 RVNRDFFAYSGGVYRETAANRKALTGFHSVKLVGWGEEHNGE------KYWIAANSWGSW 393
Query: 327 WGENGLFRI 335
WGE+G FRI
Sbjct: 394 WGEHGYFRI 402
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 50/114 (43%), Positives = 68/114 (59%), Gaps = 10/114 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP---LGEHAIRIIGW 443
AYSL E IM EIF GPV+ +M + D Y G+Y+ A G H+++++GW
Sbjct: 316 AYSL-NREADIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKALTGFHSVKLVGW 374
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
G+E GE KYW+ ANS+ + WGE+G FRI+RG NECGIE + A P +
Sbjct: 375 GEEHNGE------KYWIAANSWGSWWGEHGYFRILRGSNECGIEDYVLASWPYV 422
>gi|324514184|gb|ADY45787.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 476
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 138/295 (46%), Gaps = 64/295 (21%)
Query: 93 PLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
P + LP FDAR W YC ++ + +QG CG+ +A+ AV SDR CIAS G S
Sbjct: 189 PADSLPSEFDARRKWSYCSSLHNVPNQGGCGACYAVAAVGVASDRACIASNGTLQSMFSE 248
Query: 153 DDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE------IPCERYM 206
+D++ CC CGN C GG KA YWV G+V+GG + GCRPY +PC
Sbjct: 249 EDVLGCCAVCGN-CYGGDPLKALVYWVDEGLVTGG----RDGCRPYSVDLSCGVPCS--- 300
Query: 207 NGSHSSCQDNEPNTPECIRKCQPGY-DVSYEDDLNFGRIAYS-----------------L 248
+ +C R+CQ Y +YE D ++G +AYS L
Sbjct: 301 ----PAVYPLAEYRRKCYRQCQDIYFQYNYESDKHYGSMAYSMFPRTMSLDNKGSERVKL 356
Query: 249 PA-----NE------------ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 291
P NE + IM+E++ GP+ + + + + Y +G++
Sbjct: 357 PTVIGYLNETSDEPLTDKEIRQIIMKELYLWGPMTMAFPVTEEFLHYSSGVFSPFPAANF 416
Query: 292 GE-----HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYE 341
+ H R+IGWG+ +G + YWL NSF +WG++G+FRI + E
Sbjct: 417 SDRIVYWHVARLIGWGKY---DGDN---HYWLAVNSFGRHWGDDGVFRIDTQLLE 465
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 84/194 (43%), Gaps = 50/194 (25%)
Query: 327 WGENGLF---RIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGY-DVSYEDDLN 382
W + GL R GCRPY + + S + E +C R+CQ Y +YE D +
Sbjct: 273 WVDEGLVTGGRDGCRPYSVDLSCGVPCSPAVYPLAEYRR-KCYRQCQDIYFQYNYESDKH 331
Query: 383 FGRIAYS-----------------LPA-----NE------------ETIMREIFRHGPVE 408
+G +AYS LP NE + IM+E++ GP+
Sbjct: 332 YGSMAYSMFPRTMSLDNKGSERVKLPTVIGYLNETSDEPLTDKEIRQIIMKELYLWGPMT 391
Query: 409 GSMTIYADMILYKTGIYKHVAGGPLGE-----HAIRIIGWGQEPLGEGTSSVVKYWLVAN 463
+ + + + Y +G++ + H R+IGWG+ +G + YWL N
Sbjct: 392 MAFPVTEEFLHYSSGVFSPFPAANFSDRIVYWHVARLIGWGKY---DGDN---HYWLAVN 445
Query: 464 SFNTNWGENGLFRI 477
SF +WG++G+FRI
Sbjct: 446 SFGRHWGDDGVFRI 459
>gi|347546077|gb|AEP03186.1| cathepsin B [Diuraphis noxia]
Length = 239
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/204 (37%), Positives = 113/204 (55%), Gaps = 12/204 (5%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++P+ FDAR W C TI +RDQG CGS WA+ A +DR+CIA+ G + LS+D++
Sbjct: 46 KIPKTFDARKKWVQCDTIGRVRDQGQCGSCWAVSTSSAFADRLCIATDGDFNELLSADEI 105
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
CC CG GC GG+ KAWK + G+V+GG + S +GC PY +P S
Sbjct: 106 TFCCYTCGFGCDGGYPIKAWKQFSRHGLVTGGDFDSGEGCEPYRVP--------PSGSNS 157
Query: 216 NEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+ C KC ++SY +D + R Y L N I +++ +GP+E S +Y D
Sbjct: 158 SNSYNHFCRGKCYGDNQNISYSEDHRYTRDYYYLSYN--AIQKDVLLYGPIEASFEVYDD 215
Query: 275 MILYKTGIY-KHVAGGPLGEHAIR 297
++YK+G+Y K LG HA++
Sbjct: 216 FMIYKSGVYVKSENATHLGGHAVK 239
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 50/111 (45%), Gaps = 22/111 (19%)
Query: 336 GCRPYEIP------CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS 389
GC PY +P Y + R C + N +SY +D + R Y
Sbjct: 144 GCEPYRVPPSGSNSSNSYNHFCRGKCYGDNQN-------------ISYSEDHRYTRDYYY 190
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIR 439
L N I +++ +GP+E S +Y D ++YK+G+Y K LG HA++
Sbjct: 191 LSYN--AIQKDVLLYGPIEASFEVYDDFMIYKSGVYVKSENATHLGGHAVK 239
>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
Length = 236
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 87/236 (36%), Positives = 123/236 (52%), Gaps = 35/236 (14%)
Query: 100 GFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC 159
FD+R WP+C + IR+Q CGS WA A E +SDR CIAS GK V LS +VSC
Sbjct: 16 AFDSRTKWPHC--VHPIRNQEQCGSCWAFSASEVLSDRFCIASGGKVDVVLSPQYMVSC- 72
Query: 160 KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPN 219
GC GG+ AW + TGI S C PY NG +
Sbjct: 73 DSTDYGCDGGYLNNAWAFLAGTGIPS-------DKCAPYT-----SQNG----------D 110
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
C KCQ G V N ++ +P +IM ++ ++GPV+ + ++Y D + YK
Sbjct: 111 VAACPSKCQDGSSVKLYKAKNPQQLN-DIP----SIMEDMQQNGPVQAAFSVYRDFMSYK 165
Query: 280 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+G+Y HV+G LG HAI+++GWG + +++ YW++ANS+ +WG NG F I
Sbjct: 166 SGVYHHVSGSLLGGHAIKMVGWGVD-----SATNKPYWIIANSWGPSWGLNGFFWI 216
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/137 (35%), Positives = 84/137 (61%), Gaps = 10/137 (7%)
Query: 361 NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILY 420
+ C KCQ G V N ++ +P +IM ++ ++GPV+ + ++Y D + Y
Sbjct: 110 DVAACPSKCQDGSSVKLYKAKNPQQLN-DIP----SIMEDMQQNGPVQAAFSVYRDFMSY 164
Query: 421 KTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG 480
K+G+Y HV+G LG HAI+++GWG + +++ YW++ANS+ +WG NG F I+RG
Sbjct: 165 KSGVYHHVSGSLLGGHAIKMVGWGVD-----SATNKPYWIIANSWGPSWGLNGFFWILRG 219
Query: 481 QNECGIEADITAGLPKI 497
+ECGIE ++ +G ++
Sbjct: 220 SDECGIEDNVWSGQAQL 236
>gi|161343859|tpg|DAA06110.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 260
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 69/173 (39%), Positives = 97/173 (56%), Gaps = 3/173 (1%)
Query: 98 PEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVS 157
P FDAR W +C TI E+RDQG CGS WA G A +DR+C+A+ G + LS++++
Sbjct: 89 PRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEITF 148
Query: 158 CCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDN 216
CC CG GC GG KAWKY+ T G+V+GG Y S +GC PY + PC R G ++
Sbjct: 149 CCHTCGFGCNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGKNTCAGKP 208
Query: 217 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
C R C D+ Y +D + R Y L +I +++ +GP+E +
Sbjct: 209 REKNHRCTRMCYGNQDLDYREDHRYTRDFYYLTYG--SIQKDVMTYGPIEATF 259
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 40/99 (40%), Gaps = 3/99 (3%)
Query: 314 VKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPG 372
+K W ++ G N GC PY +P C R G + C R C
Sbjct: 163 IKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGKNTCAGKPREKNHRCTRMCYGN 222
Query: 373 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 411
D+ Y +D + R Y L +I +++ +GP+E +
Sbjct: 223 QDLDYREDHRYTRDFYYLTYG--SIQKDVMTYGPIEATF 259
>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
gallus]
Length = 464
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 101/301 (33%), Positives = 144/301 (47%), Gaps = 28/301 (9%)
Query: 62 LTLSE-LEMRMGVH--PDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+TL + + R+G P + + N + + + D E LP FDA WP I E D
Sbjct: 167 MTLEDGMRYRLGTFRPPPTVMNMNEMHMAM---DSNEVLPRHFDAATKWP--GMIHEPLD 221
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
QG+C WA SDR+ I S G LS +L+SC GC GG AW Y
Sbjct: 222 QGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQNLLSCDTRNQRGCSGGRLDGAWWYL 281
Query: 179 VTTGIVSGGTYA-SKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
G+V+ Y + Q +P PC M S S+ + T C ++ +
Sbjct: 282 RRRGVVTDECYPFTSQDSQPAAQPC---MMHSRSTGRGKRQATARCPNP------QTHAN 332
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA----GGPL-- 291
D+ AY L +E+ IM+E+ +GPV+ + ++ D LYK+GIY+H A GP
Sbjct: 333 DIYQSTPAYRLAPSEKEIMKELMENGPVQAILEVHEDFFLYKSGIYRHTAVAEGKGPKHQ 392
Query: 292 --GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMN 349
G H+++I GWG+E L +G V KYW ANS+ WGE+G FRI E E ++
Sbjct: 393 QHGTHSVKITGWGEEQLPDG--QVQKYWTAANSWGRAWGEDGHFRIARGVNECEVESFVV 450
Query: 350 G 350
G
Sbjct: 451 G 451
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 92/171 (53%), Gaps = 19/171 (11%)
Query: 338 RPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETI 397
+P PC M SRS+ + T C ++ +D+ AY L +E+ I
Sbjct: 300 QPAAQPC---MMHSRSTGRGKRQATARCPNP------QTHANDIYQSTPAYRLAPSEKEI 350
Query: 398 MREIFRHGPVEGSMTIYADMILYKTGIYKHVA----GGPL----GEHAIRIIGWGQEPLG 449
M+E+ +GPV+ + ++ D LYK+GIY+H A GP G H+++I GWG+E L
Sbjct: 351 MKELMENGPVQAILEVHEDFFLYKSGIYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQLP 410
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+G V KYW ANS+ WGE+G FRI RG NEC +E+ + ++ +E
Sbjct: 411 DG--QVQKYWTAANSWGRAWGEDGHFRIARGVNECEVESFVVGVWGRVSVE 459
>gi|149941230|emb|CAO02547.1| putative cathepsin B-like cysteine protease [Vigna unguiculata]
Length = 201
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 82/206 (39%), Positives = 117/206 (56%), Gaps = 21/206 (10%)
Query: 87 LVQLSDPLE-ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGK 145
L +S P +LP FDAR W C TI I DQG CGS WA GAVE++SDR CI
Sbjct: 7 LTVISHPKSLKLPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HFD 64
Query: 146 RHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPC 202
++ LS +DL++CC CG+GC GG+ AW+Y G+V+ + C PY + C
Sbjct: 65 VNISLSVNDLLACCGFLCGSGCNGGYPLSAWRYLSNHGVVT-------EECDPYFDQTGC 117
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRH 262
SH C+ TP+C++KC G + ++ ++ AY + +N IM E++++
Sbjct: 118 ------SHPGCEPAY-RTPKCVKKCVSGNQL-WKKSKHYSVSAYKVKSNPHDIMAEVYKN 169
Query: 263 GPVEGSMTIYADMILYKTGIYKHVAG 288
GPVE + T+Y D YK+G+YKHV G
Sbjct: 170 GPVEVAFTVYEDFAHYKSGVYKHVTG 195
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 53/89 (59%), Gaps = 3/89 (3%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C++KC G + ++ ++ AY + +N IM E+
Sbjct: 108 CDPYFDQTGCSHPGCEPAYRTPKCVKKCVSGNQL-WKKSKHYSVSAYKVKSNPHDIMAEV 166
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAG 430
+++GPVE + T+Y D YK+G+YKHV G
Sbjct: 167 YKNGPVEVAFTVYEDFAHYKSGVYKHVTG 195
>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 455
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 91/269 (33%), Positives = 125/269 (46%), Gaps = 24/269 (8%)
Query: 94 LEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
L LP FDAR + C I +R+QG C + WA AV +DRVCI S G+ LS
Sbjct: 142 LTTLPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILSL 201
Query: 153 DDLVSCCKDC-----GNGCQGGFHGKAWKYWVTTGIVSGGTY------ASKQGCRPYEIP 201
L SCC NGC G + + G+V+GG Y + GC PY P
Sbjct: 202 GYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEYKPPEELGNDDGCWPYPFP 261
Query: 202 CERYMNGSHSS---CQDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMR 257
++ G S C + P C C Y S + D + + LP E I +
Sbjct: 262 KCNHVPGLESKYPRCAQVR-DLPACATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQ 320
Query: 258 EIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYW 317
EIF +GPV MT+Y D YK+G+Y H G L H +++IGWG E S +YW
Sbjct: 321 EIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGVE-------SGQEYW 373
Query: 318 LVANSFNTNWGENGLFRIGCRPYEIPCER 346
L N++N WG++G+ ++ Y + R
Sbjct: 374 LAVNAWNEEWGDHGMIKLASSVYFVRMTR 402
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 72/146 (49%), Gaps = 10/146 (6%)
Query: 336 GCRPYEIPCERYMNGSRSSCQ--ANEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPA 392
GC PY P ++ G S A + P C C Y S + D + + LP
Sbjct: 254 GCWPYPFPKCNHVPGLESKYPRCAQVRDLPACATTCPNKAYGTSMQKDTHRAKSWGRLPI 313
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
E I +EIF +GPV MT+Y D YK+G+Y H G L H +++IGWG E
Sbjct: 314 GPEKIKQEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGVE------ 367
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIV 478
S +YWL N++N WG++G+ ++
Sbjct: 368 -SGQEYWLAVNAWNEEWGDHGMIKLA 392
>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Nasonia vitripennis]
Length = 481
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 92/273 (33%), Positives = 134/273 (49%), Gaps = 26/273 (9%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
L++R+G S+ + LP+ + +LP FD+RI W I ++DQG CG+ W
Sbjct: 206 LKLRLGTINPSQSTRQMLPVTRHYNP--NDLPREFDSRIQWG--NDITPVQDQGWCGASW 261
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A+ V+ SDR I S+G V+LS L+SC GC+GG+ +AW + G+V
Sbjct: 262 AISTVDVASDRFAIMSKGIEKVQLSGQHLISCNNRGQRGCKGGYLDRAWLFMRKFGVVDE 321
Query: 187 GTYASKQGCRPYEIPCERYMNGSHSSCQ-DNEPNTPECIRKCQPGYDVSYEDDLNFGRIA 245
Y G R + R S + CQ N N + K P Y +
Sbjct: 322 DCYPWLSG-RSDKCRIPRRGKLSDAGCQRRNSYNLRNEMYKVGPAYRL------------ 368
Query: 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH---VAGGPLGEHAIRIIGWG 302
NE IM+EI GPV+ +M ++ D Y++GIY H G H++RI+GWG
Sbjct: 369 ----GNETDIMQEILTSGPVQATMRVHRDFFHYESGIYVHSRPFDTRQSGYHSVRIVGWG 424
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+EP +K+W VANS+ +WGE+G FRI
Sbjct: 425 EEP-SPYNGKPIKFWRVANSWGRDWGEDGYFRI 456
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 66/105 (62%), Gaps = 5/105 (4%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH---VAGGPLGEHAIRIIGW 443
AY L NE IM+EI GPV+ +M ++ D Y++GIY H G H++RI+GW
Sbjct: 365 AYRL-GNETDIMQEILTSGPVQATMRVHRDFFHYESGIYVHSRPFDTRQSGYHSVRIVGW 423
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEA 488
G+EP +K+W VANS+ +WGE+G FRIVRG NEC IE+
Sbjct: 424 GEEP-SPYNGKPIKFWRVANSWGRDWGEDGYFRIVRGNNECEIES 467
>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
Length = 346
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 100/289 (34%), Positives = 139/289 (48%), Gaps = 22/289 (7%)
Query: 57 NALSKLTLSE-LEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQ 114
+A +TL E + R+G + P S + + + + P E LP F+A WP I
Sbjct: 42 SAFWGMTLDEGIRYRLGTIRPSSSVAS--MNEIHTVLGPGEVLPTAFEASEKWPN--LIH 97
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
E DQG+C WA SDRV I S G LS +L+SC K GCQGG A
Sbjct: 98 EPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDKRNQQGCQGGHLDSA 157
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W + G+VS Y R P R M S + + T C P + V
Sbjct: 158 WWFLRRRGVVSDHCYPFSGQGRTETGPAPRCMMHSRAMGRGKRQATARC-----PNHQV- 211
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP- 290
+ +D+ AY L ++E+ IM+E+ +GPV+ M ++ D LY+ GIY H G P
Sbjct: 212 HANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPE 271
Query: 291 ----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 272 RYRRHGTHSVKITGWGEESLPDGRT--LKYWTAANSWGPAWGERGHFRI 318
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/166 (37%), Positives = 91/166 (54%), Gaps = 16/166 (9%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
P R M SR+ + T C P + V + +D+ AY L ++E+ IM+E+
Sbjct: 184 PAPRCMMHSRAMGRGKRQATARC-----PNHQV-HANDIYQVTPAYRLGSSEKEIMKELM 237
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LY+ GIY H G P G H+++I GWG+E L +G +
Sbjct: 238 ENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESLPDGRT- 296
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+KYW ANS+ WGE G FRIVRG NEC IE+ + ++G+E
Sbjct: 297 -LKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRVGME 341
>gi|157058737|gb|ABV03126.1| cathepsin B-16 [Myzus persicae]
Length = 238
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 109/212 (51%), Gaps = 4/212 (1%)
Query: 60 SKLTLSELEMRMGVHPDSKLPQNRLPLL-VQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
S+ + +L GV K N + V + P FDAR W +C TI E+RD
Sbjct: 28 SQEDIVKLLGSTGVESAMKASANEFKMDDVAYNKLYGYTPRTFDARKKWRHCKTIGEVRD 87
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
QG CGS WA G A +DR+C+A+ G + LS++++ CC CG GC GG KAWKY+
Sbjct: 88 QGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEITFCCHTCGFGCNGGDPIKAWKYF 147
Query: 179 VTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
T G+V+GG Y S +GC PY + PC R G ++ C R C D+ Y +
Sbjct: 148 STHGLVTGGNYKSGEGCEPYRVPPCPRDDKGKNTCAGKPREKNHRCTRMCYGNQDLDYRE 207
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269
D + R Y L +I +++ +GP+E +
Sbjct: 208 DHRYTRDFYYLTYG--SIQKDVMTYGPIEATF 237
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 40/99 (40%), Gaps = 3/99 (3%)
Query: 314 VKYWLVANSFNTNWGENGLFRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPG 372
+K W ++ G N GC PY +P C R G + C R C
Sbjct: 141 IKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGKNTCAGKPREKNHRCTRMCYGN 200
Query: 373 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 411
D+ Y +D + R Y L +I +++ +GP+E +
Sbjct: 201 QDLDYREDHRYTRDFYYLTYG--SIQKDVMTYGPIEATF 237
>gi|157058757|gb|ABV03136.1| cathepsin B-84 [Pterocomma populeum]
Length = 218
Score = 141 bits (355), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 105/176 (59%), Gaps = 2/176 (1%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FDAR+ W YC TI ++RDQG+CGS WA G A +DR+CIA++G + +S+++L
Sbjct: 44 IPKAFDARLEWKYCKTIGQVRDQGNCGSCWAHGTSGAFADRLCIATKGDFNELISAEELT 103
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG +AW+Y+ G+V+GG Y + GC+PY + PC G +S
Sbjct: 104 FCCHLCGIGCNGGNPLRAWQYFKRHGVVTGGNYNTTNGCQPYRVPPCTNGDKGHYSCSGQ 163
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
+ +C++ C V Y+ D + AY L +N T+ +++ +GP+E S +
Sbjct: 164 QKERNHKCLKTCYGDKTVDYKRDHYKTKDAYYL-SNTTTMQKDVILYGPIEASFDV 218
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 38/79 (48%), Gaps = 2/79 (2%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+PY +P C G S + +C++ C V Y+ D + AY L +N
Sbjct: 141 GCQPYRVPPCTNGDKGHYSCSGQQKERNHKCLKTCYGDKTVDYKRDHYKTKDAYYL-SNT 199
Query: 395 ETIMREIFRHGPVEGSMTI 413
T+ +++ +GP+E S +
Sbjct: 200 TTMQKDVILYGPIEASFDV 218
>gi|149941232|emb|CAO02548.1| putative cathepsin B-like cysteine protease,putative [Vigna
unguiculata]
Length = 195
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 78/194 (40%), Positives = 111/194 (57%), Gaps = 16/194 (8%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LP FDAR W C TI I DQG CGS WA GAVE++SDR CI ++ LS +DL
Sbjct: 17 KLPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HFDVNISLSVNDL 74
Query: 156 VSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQ 214
++CC CG+GC GG+ AW+Y G+V+ + C PY SH C+
Sbjct: 75 LACCGFLCGSGCNGGYPLSAWRYLSNHGVVT-------EECDPY----FDQTGCSHPGCE 123
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
TP+C++KC G + ++ ++ AY + +N IM E++++GPVE + T+Y D
Sbjct: 124 PAY-RTPKCVKKCVSGNQL-WKKSKHYSVSAYKVKSNPHDIMAEVYKNGPVEVAFTVYED 181
Query: 275 MILYKTGIYKHVAG 288
YK+G+YKHV G
Sbjct: 182 FAHYKSGVYKHVTG 195
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 53/89 (59%), Gaps = 3/89 (3%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C++KC G + ++ ++ AY + +N IM E+
Sbjct: 108 CDPYFDQTGCSHPGCEPAYRTPKCVKKCVSGNQL-WKKSKHYSVSAYKVKSNPHDIMAEV 166
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAG 430
+++GPVE + T+Y D YK+G+YKHV G
Sbjct: 167 YKNGPVEVAFTVYEDFAHYKSGVYKHVTG 195
>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
Length = 339
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 89/274 (32%), Positives = 135/274 (49%), Gaps = 33/274 (12%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
+ R+G + QN ++V+ ELP FDAR WP I I+DQG C S W
Sbjct: 54 IRHRLGTLFPERSVQNMNEMIVKP----RELPTSFDARQKWP--DFIHPIQDQGDCASSW 107
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A +DR+ + + G+++V LS+ +SC + GC+GG+ +AW Y G+VS
Sbjct: 108 AQSTAATSADRLALITEGRQNVALSAQQFLSCNQHRQKGCEGGYLDRAWWYIRKFGVVSE 167
Query: 187 GTYASKQGC--RPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRI 244
Y G +P ++ + + C PN+ + + P Y VS
Sbjct: 168 ECYPYISGTTRKPEICYMQKSKHANGRQCPSGHPNSR--VYRTTPSYRVS---------- 215
Query: 245 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH---VAGGPLGEHAIRIIGW 301
+ E+ IM EI +GPV+ + ++ D + G+YKH V G H++R++GW
Sbjct: 216 -----SREQDIMSEILTNGPVQATFRVHGDFFI--AGVYKHLPTVGEEIEGYHSVRLLGW 268
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G++ T VKYW+ ANS+ TNWGENG FRI
Sbjct: 269 GED---YSTGIPVKYWIAANSWGTNWGENGTFRI 299
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 69/113 (61%), Gaps = 8/113 (7%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH---VAGGPLGEHAIRIIGW 443
+Y + + E+ IM EI +GPV+ + ++ D + G+YKH V G H++R++GW
Sbjct: 211 SYRVSSREQDIMSEILTNGPVQATFRVHGDFFI--AGVYKHLPTVGEEIEGYHSVRLLGW 268
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
G++ T VKYW+ ANS+ TNWGENG FRI+RG+N C IE+ + K
Sbjct: 269 GED---YSTGIPVKYWIAANSWGTNWGENGTFRILRGENHCEIESFVIGAWGK 318
>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
Length = 463
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 92/261 (35%), Positives = 131/261 (50%), Gaps = 29/261 (11%)
Query: 80 PQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVC 139
P+ ++ + +L++ E LP FDA WP I E++DQG CGS WAL SDR
Sbjct: 169 PKFKVKSMSRLTNGQEHLPTHFDATTYWP--GFIGEVKDQGWCGSSWALSTASVASDRFA 226
Query: 140 IASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE 199
I S+G+ V+L+ ++SC + GC GG AW Y G V+ C PY
Sbjct: 227 ILSKGREIVQLAPQQIISCVRR-SQGCSGGHLDTAWNYVRKVGTVN-------DECYPY- 277
Query: 200 IPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 259
+ ++C+ P+ C V + G A+SL NE IM EI
Sbjct: 278 -------ISAQNACK-IRPSDTLITANCDLPTKVDRTNMYKMG-PAFSL-NNETDIMIEI 327
Query: 260 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-----HAIRIIGWGQEPLGEGTSSVV 314
+HGPV+ + ++ D YK+GIY+H A G+ H++R+IGWG+E G T+
Sbjct: 328 KKHGPVQAILRVHRDFFSYKSGIYRHSAASSAGDERAGYHSVRLIGWGEERNGYETT--- 384
Query: 315 KYWLVANSFNTNWGENGLFRI 335
KYW+ NS+ WGENG FRI
Sbjct: 385 KYWVAVNSWGRWWGENGRFRI 405
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/120 (46%), Positives = 77/120 (64%), Gaps = 9/120 (7%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-----HAIRII 441
A+SL NE IM EI +HGPV+ + ++ D YK+GIY+H A G+ H++R+I
Sbjct: 314 AFSL-NNETDIMIEIKKHGPVQAILRVHRDFFSYKSGIYRHSAASSAGDERAGYHSVRLI 372
Query: 442 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEI 501
GWG+E G T+ KYW+ NS+ WGENG FRIVRGQNEC IE+ + A LP + ++
Sbjct: 373 GWGEERNGYETT---KYWVAVNSWGRWWGENGRFRIVRGQNECEIESYVLASLPYVHQQV 429
>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Ornithorhynchus anatinus]
Length = 327
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 93/262 (35%), Positives = 125/262 (47%), Gaps = 32/262 (12%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP FDA WP I E DQG+C WA SDR+ I S+G LS +L+
Sbjct: 57 LPRNFDAAQKWPGL--IHEPLDQGNCAGSWAFSTAAVASDRISIHSKGHMTPSLSPQNLL 114
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY--ASK----QGCRPYEIPCERYMNGSH 210
SC GC GG +AW + G+VS Y AS+ + CR Y P R +
Sbjct: 115 SCNTRHQQGCNGGRLDRAWSFLRRRGLVSDKCYPLASQNSIAEPCRMYSRPMGRGKRQAT 174
Query: 211 SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
C +N ++ + Y +D+ Y L +NE+ IM+EI +GPV+ M
Sbjct: 175 GPCPNNFHHSND------------YSNDIYQSTPPYRLSSNEKDIMKEIMENGPVQALME 222
Query: 271 IYADMILYKTGIYKHVA---GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
++ D LYK GIY+H G P G H+++I GWG+E G VK+W ANS
Sbjct: 223 VHEDFFLYKDGIYRHTPASNGKPPQFRRQGTHSVKITGWGEELQPNGRR--VKFWRAANS 280
Query: 323 FNTNWGENGLFRI--GCRPYEI 342
+ WGE G FRI GC +I
Sbjct: 281 WGPTWGEGGSFRILRGCNECDI 302
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 54/132 (40%), Positives = 75/132 (56%), Gaps = 10/132 (7%)
Query: 377 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP- 432
Y +D+ Y L +NE+ IM+EI +GPV+ M ++ D LYK GIY+H G P
Sbjct: 187 YSNDIYQSTPPYRLSSNEKDIMKEIMENGPVQALMEVHEDFFLYKDGIYRHTPASNGKPP 246
Query: 433 ----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEA 488
G H+++I GWG+E G VK+W ANS+ WGE G FRI+RG NEC IE+
Sbjct: 247 QFRRQGTHSVKITGWGEELQPNGRR--VKFWRAANSWGPTWGEGGSFRILRGCNECDIES 304
Query: 489 DITAGLPKIGLE 500
+ ++G E
Sbjct: 305 FVVGVWGRVGSE 316
>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
rubripes]
Length = 477
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 94/286 (32%), Positives = 137/286 (47%), Gaps = 24/286 (8%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E + R+G +K N + + + ++LP F++ WP I E DQG
Sbjct: 169 MTLDEGIRYRLGTQRPAKTIMNMNEIQMNMDPERDQLPLYFNSAEKWP--GKIHEPLDQG 226
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C + WA SDR+ I S G +LS +L+SC GC GG AW +
Sbjct: 227 NCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRNQGGCTGGRIDGAWWFLRR 286
Query: 181 TGIVSGGTYASKQGCRPYEIPCE--RYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
G+V+ Y + P + P E R M S S + T C P + +Y++D
Sbjct: 287 RGVVTEDCYPYRP---PQQTPAELGRCMMQSRSVGRGKRQATQRC-----PNTN-NYQND 337
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------P 290
+ Y L NE+ IM+EI +GPV+ M ++ D +YK+GIYKH
Sbjct: 338 IYQSTPPYRLSTNEKEIMKEIQDNGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKPPQYRK 397
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
G H+++I GWG+E +G KYW+ ANS+ NWGE G FRI
Sbjct: 398 HGTHSVKITGWGEERNVDGAKR--KYWIAANSWGKNWGEEGYFRIA 441
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 94/184 (51%), Gaps = 22/184 (11%)
Query: 331 GLFRIGCRPYEIPCE------RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFG 384
G+ C PY P + R M SRS + T C P + +Y++D+
Sbjct: 288 GVVTEDCYPYRPPQQTPAELGRCMMQSRSVGRGKRQATQRC-----PNTN-NYQNDIYQS 341
Query: 385 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEH 436
Y L NE+ IM+EI +GPV+ M ++ D +YK+GIYKH G H
Sbjct: 342 TPPYRLSTNEKEIMKEIQDNGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTH 401
Query: 437 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+++I GWG+E +G KYW+ ANS+ NWGE G FRI RG+NEC IEA + +
Sbjct: 402 SVKITGWGEERNVDGAKR--KYWIAANSWGKNWGEEGYFRIARGENECEIEAFVIGVWGR 459
Query: 497 IGLE 500
I +E
Sbjct: 460 ITME 463
>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
Length = 297
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 126/279 (45%), Gaps = 42/279 (15%)
Query: 57 NALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
N S LT +L + G + +P N+ PL P+ FDAR W I I
Sbjct: 43 NPFSDLTKEQLLAKCGTY---IVPSNKQ----YPGSPLISTPDNFDARQQWG--SKIHAI 93
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
RDQ CG+ WA GA EA+SDR IAS G V S +DLVSC + GC GG+ AW+
Sbjct: 94 RDQQQCGACWAFGATEALSDRFTIASNGSVDVVFSPEDLVSCDTN-DYGCNGGYMDMAWE 152
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
+ G+V+ C PY P C KC G S E
Sbjct: 153 FLDQHGVVA-------DSCFPYSA---------------GSGFAPACASKCADG---SAE 187
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
+ + E I EI HGPVEG+ T+Y D Y++G+Y G HAI
Sbjct: 188 KKYSCVHGSIRQSQGVEQIKSEIVAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGGHAI 247
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+I+G+G E GT YWL ANS+ +WG G F+I
Sbjct: 248 KILGFGVE---NGT----PYWLCANSWGPSWGMQGFFKI 279
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/135 (37%), Positives = 69/135 (51%), Gaps = 12/135 (8%)
Query: 363 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 422
P C KC G S E + + E I EI HGPVEG+ T+Y D Y++
Sbjct: 175 PACASKCADG---SAEKKYSCVHGSIRQSQGVEQIKSEIVAHGPVEGAFTVYTDFFNYQS 231
Query: 423 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN 482
G+Y G HAI+I+G+G E GT YWL ANS+ +WG G F+I +G
Sbjct: 232 GVYTPTTSDVAGGHAIKILGFGVE---NGT----PYWLCANSWGPSWGMQGFFKIKQG-- 282
Query: 483 ECGIEADITAGLPKI 497
ECGIE + + P++
Sbjct: 283 ECGIEDQVFSCDPQL 297
>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
Length = 257
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 91/241 (37%), Positives = 123/241 (51%), Gaps = 32/241 (13%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+P+ FDAR WP C I I +Q CGS WA A E +SDR+CIAS GK V LS L
Sbjct: 30 SIPQSFDARTQWPNC--IHPILNQEQCGSCWAFSASEVLSDRLCIASNGKTGVVLSPQAL 87
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
VSC GC GG AW+Y GI TY GC PY + +SC D
Sbjct: 88 VSCDIFGNQGCNGGIPQLAWEYMELHGIP---TY----GCFPYTSGNGTDGSCVKNSCVD 140
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
NE T + + +P + A+ E I ++I + GP++G+M +Y+D
Sbjct: 141 NEQYT---LYRAKP--------------LTLKTCASVECIQQDIMKFGPIQGTMEVYSDF 183
Query: 276 ILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ Y +G+Y G LG HAI+I+GWG + +S YW+VANS+ +WG +G F
Sbjct: 184 MSYTSGVYTMTPGSSLLGGHAIKIVGWGFD-----QASNQNYWIVANSWGPSWGIDGFFW 238
Query: 335 I 335
I
Sbjct: 239 I 239
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 65/107 (60%), Gaps = 8/107 (7%)
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGE 450
A+ E I ++I + GP++G+M +Y+D + Y +G+Y G LG HAI+I+GWG +
Sbjct: 158 ASVECIQQDIMKFGPIQGTMEVYSDFMSYTSGVYTMTPGSSLLGGHAIKIVGWGFD---- 213
Query: 451 GTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+S YW+VANS+ +WG +G F I ++CGI +D A +I
Sbjct: 214 -QASNQNYWIVANSWGPSWGIDGFFWIAF--DQCGINSDACAAQARI 257
>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
Length = 466
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/289 (34%), Positives = 139/289 (48%), Gaps = 22/289 (7%)
Query: 57 NALSKLTLSE-LEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQ 114
+A +TL E + R+G + P S + + + + P E LP F+A WP I
Sbjct: 162 SAFWGMTLDEGIRYRLGTIRPSSSVAS--MNEIHTVLGPGEVLPTAFEASEKWPN--LIH 217
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
E DQG+C WA SDRV I S G LS +L+SC K GCQGG A
Sbjct: 218 EPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDKRNQQGCQGGHLDSA 277
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W + G+VS Y R P R M S + + T C P + V
Sbjct: 278 WWFLRRRGVVSDHCYPFSGQGRTETGPAPRCMMHSRAMGRGKRQATARC-----PNHQV- 331
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP- 290
+ +D+ AY L ++E+ IM+E+ +GPV+ M ++ D LY+ GIY H G P
Sbjct: 332 HANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPE 391
Query: 291 ----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 392 RYRRHGTHSVKITGWGEESLPDGRT--LKYWTAANSWGPAWGERGHFRI 438
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/166 (37%), Positives = 91/166 (54%), Gaps = 16/166 (9%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
P R M SR+ + T C P + V + +D+ AY L ++E+ IM+E+
Sbjct: 304 PAPRCMMHSRAMGRGKRQATARC-----PNHQV-HANDIYQVTPAYRLGSSEKEIMKELM 357
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LY+ GIY H G P G H+++I GWG+E L +G +
Sbjct: 358 ENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESLPDGRT- 416
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+KYW ANS+ WGE G FRIVRG NEC IE+ + ++G+E
Sbjct: 417 -LKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRVGME 461
>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 382
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 82/224 (36%), Positives = 113/224 (50%), Gaps = 17/224 (7%)
Query: 80 PQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVC 139
P L + +E+PE FDAR NWP CPTI I DQG CGS WA+ + E + DR C
Sbjct: 53 PSFSLQFKNEFVKIEDEIPESFDARTNWPNCPTIGHIYDQGHCGSCWAMCSFEVLQDRFC 112
Query: 140 IASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE 199
I S G LS D+ S C +GC GG+ A++Y G+ + + C PY
Sbjct: 113 IHSNGSEKPWLSGQDITS-CDSRSHGCNGGWTETAFEYAKKAGVPT-------EECVPYL 164
Query: 200 I-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
+ C H C + TP C ++C + +Y + + +YS+ N E I E
Sbjct: 165 MGKCH------HPGCSSWQ--TPTCKKECSSLSNYNYSSNRYYASKSYSIQRNVEAIQLE 216
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 302
+ R+GPV T Y D+ +Y G+Y HV G G HAI+I+GWG
Sbjct: 217 LMRNGPVTAVFTTYDDLAVYWRGVYNHVMGSEQGLHAIKIVGWG 260
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 52/101 (51%)
Query: 344 CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 403
C Y+ G + TP C ++C + +Y + + +YS+ N E I E+ R
Sbjct: 160 CVPYLMGKCHHPGCSSWQTPTCKKECSSLSNYNYSSNRYYASKSYSIQRNVEAIQLELMR 219
Query: 404 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 444
+GPV T Y D+ +Y G+Y HV G G HAI+I+GWG
Sbjct: 220 NGPVTAVFTTYDDLAVYWRGVYNHVMGSEQGLHAIKIVGWG 260
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 31/42 (73%)
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ YW++ NS+ ++G +G+ I RG NECGIE+D+ G+PKI
Sbjct: 324 IPYWIIVNSWGEDFGMDGILLIKRGVNECGIESDVYTGIPKI 365
>gi|189308076|gb|ACD86922.1| cysteine protease [Caenorhabditis brenneri]
Length = 228
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 121/210 (57%), Gaps = 14/210 (6%)
Query: 45 LLPKLPF----YGAEKNALSK------LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPL 94
++PK P Y K +L K +T+ +++ R+ + + P + V+
Sbjct: 20 IVPKTPEAITEYVNSKQSLWKAEIPKHITIEQVKKRL-MRTEFVAPHSPDAEFVKHDIQE 78
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ +P FDAR WP C +I IRDQ CGS WA A EA SDR CIAS G + LS++D
Sbjct: 79 DTIPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAED 138
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PC-ERYMNGSHSS 212
++SCC +CG GC+GG+ AWKY V +G +GG+Y ++ GC+PY + PC E N + +
Sbjct: 139 VLSCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPA 198
Query: 213 CQDNEPNTPECIRKC-QPGYDVSYEDDLNF 241
C + +TP C+ KC Y+V+Y+DD +F
Sbjct: 199 CPTDGYDTPACVNKCTNSNYNVAYKDDKHF 228
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 5/70 (7%)
Query: 317 WLVANSFNTNWGENGLFRIGCRPYEI-PC-ERYMNGSRSSCQANEPNTPECIRKC-QPGY 373
+LV + F T F GC+PY + PC E N + +C + +TP C+ KC Y
Sbjct: 161 YLVKSGFCTGGSYEAQF--GCKPYSLAPCGETVGNTTWPACPTDGYDTPACVNKCTNSNY 218
Query: 374 DVSYEDDLNF 383
+V+Y+DD +F
Sbjct: 219 NVAYKDDKHF 228
>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
Length = 209
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 97/154 (62%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C+RKC G + ++ +F AYS+ ++ IM E+
Sbjct: 43 CDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQI-WKKSKHFSVNAYSVKSDPYDIMAEV 101
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YKH+ G LG HA+++IGWG GE YWL+
Sbjct: 102 YKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGE------DYWLI 155
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N +WG++G F I RG NECGIE D+TAGLP
Sbjct: 156 ANQWNRSWGDDGYFMIRRGTNECGIEEDVTAGLP 189
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 99/177 (55%), Gaps = 27/177 (15%)
Query: 163 GNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEP-- 218
G GG+ AW+Y G+V+ + C PY +I C SH C EP
Sbjct: 18 GLAVMGGYPLYAWRYLAHHGVVT-------EECDPYFDQIGC------SHPGC---EPAY 61
Query: 219 NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILY 278
TP+C+RKC G + ++ +F AYS+ ++ IM E++++GPVE + T+Y D Y
Sbjct: 62 QTPKCVRKCVKGNQI-WKKSKHFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHY 120
Query: 279 KTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
K+G+YKH+ G LG HA+++IGWG GE YWL+AN +N +WG++G F I
Sbjct: 121 KSGVYKHITGSQLGGHAVKLIGWGTTDEGE------DYWLIANQWNRSWGDDGYFMI 171
>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
domestica]
Length = 466
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 97/290 (33%), Positives = 137/290 (47%), Gaps = 24/290 (8%)
Query: 57 NALSKLTLSE-LEMRMG-VHPDSK-LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTI 113
+A +TL E ++ R+G V P S + N + +++ P E LP F+A WP I
Sbjct: 161 SAFWGMTLEEGIQYRLGTVRPASSVMNMNEIQMVMA---PQETLPLAFNASDKWP--GLI 215
Query: 114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGK 173
E DQG+C WA SDR+ I S G LS +L+SC GC+GG
Sbjct: 216 HEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLLSCDTHNQKGCRGGRLDG 275
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
AW + G+VS Y G R P M S S + T C
Sbjct: 276 AWWFLRRRGLVSNHCYPFSAGNRDATAPAAPCMMHSRSMGRGKRQATAHCPNS------R 329
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP 290
++ + + Y L ++E+ IM+E+ +GPV+ M ++ D LYK+GIYKH G P
Sbjct: 330 AHANHIYQATPPYRLSSDEKDIMKELMENGPVQALMEVHEDFFLYKSGIYKHTPASLGKP 389
Query: 291 L-----GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E +G +KYW ANS+ WGE G FRI
Sbjct: 390 ARYRQHGTHSVKITGWGEERQPDGQR--LKYWTAANSWGPTWGEKGHFRI 437
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 89/176 (50%), Gaps = 16/176 (9%)
Query: 333 FRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
F G R P M SRS + T C ++ + + Y L +
Sbjct: 293 FSAGNRDATAPAAPCMMHSRSMGRGKRQATAHCPNS------RAHANHIYQATPPYRLSS 346
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPL-----GEHAIRIIGWG 444
+E+ IM+E+ +GPV+ M ++ D LYK+GIYKH G P G H+++I GWG
Sbjct: 347 DEKDIMKELMENGPVQALMEVHEDFFLYKSGIYKHTPASLGKPARYRQHGTHSVKITGWG 406
Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+E +G +KYW ANS+ WGE G FRI+RG NEC IE+ + ++G+E
Sbjct: 407 EERQPDGQR--LKYWTAANSWGPTWGEKGHFRILRGANECDIESFVVGVWGRVGME 460
>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
Length = 231
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 88/242 (36%), Positives = 117/242 (48%), Gaps = 38/242 (15%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FD+R WP C + IRDQG+CGS ++ + E MSDR CI S G +V LS DLV+C
Sbjct: 6 FDSRQKWPNC--VHPIRDQGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTCSW 63
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNT 220
GC GG G + Y G+VS C PY Y +H C D
Sbjct: 64 -YSFGCNGGIPGLVFDYIHKDGLVS-------DACFPYL----SYDGNTHVKCPD----- 106
Query: 221 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEET-------IMREIFRHGPVEGSMTIYA 273
C S++ D +F Y + E I +EI HGPV +Y+
Sbjct: 107 -----FCYNNKTKSFKSDKHFADKVYHVGEFLEDKAKRVLEIQKEILTHGPVNADFMVYS 161
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D +YK+G+Y+H G G HA++IIGWG E + V YWL+ANS+ T +G G F
Sbjct: 162 DFTVYKSGVYRHQTGSFEGIHAVKIIGWGTE-------NGVDYWLIANSWGTTFGLQGFF 214
Query: 334 RI 335
+I
Sbjct: 215 KI 216
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 46/124 (37%), Positives = 65/124 (52%), Gaps = 14/124 (11%)
Query: 364 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEET-------IMREIFRHGPVEGSMTIYAD 416
+C C S++ D +F Y + E I +EI HGPV +Y+D
Sbjct: 103 KCPDFCYNNKTKSFKSDKHFADKVYHVGEFLEDKAKRVLEIQKEILTHGPVNADFMVYSD 162
Query: 417 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 476
+YK+G+Y+H G G HA++IIGWG E + V YWL+ANS+ T +G G F+
Sbjct: 163 FTVYKSGVYRHQTGSFEGIHAVKIIGWGTE-------NGVDYWLIANSWGTTFGLQGFFK 215
Query: 477 IVRG 480
IVRG
Sbjct: 216 IVRG 219
>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
Length = 573
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 93/262 (35%), Positives = 132/262 (50%), Gaps = 31/262 (11%)
Query: 81 QNRLPL--LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRV 138
Q ++P+ + +LS+ LP FDA +WP + E RDQG CGS WAL SDR
Sbjct: 278 QPKIPVKAMKRLSNRGGPLPSHFDAADHWPR--LVGEARDQGWCGSSWALSTTTMASDRF 335
Query: 139 CIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY 198
I S+G+ V+L+ L++C + C GG AW+Y G+V+ C PY
Sbjct: 336 AILSKGREQVQLAPQQLLACVRR-QQACSGGHLDTAWQYLRRVGVVN-------DECYPY 387
Query: 199 EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
+ + C+ N+ +T C+ +V+ G AYSL NE IM E
Sbjct: 388 I--------AAKNQCKINDGDTLVSA-NCELPANVNRTAMYRMG-PAYSL-NNETDIMTE 436
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-----HAIRIIGWGQEPLGEGTSSV 313
I G V+ + +Y D Y+ GIY+H A E H++R+IGWG+E +G +
Sbjct: 437 IKERGTVQAILRVYRDFFSYQNGIYRHSAAATPAEERSAYHSVRLIGWGEERVG---YDM 493
Query: 314 VKYWLVANSFNTNWGENGLFRI 335
VKYW+ NS+ T WGENG FRI
Sbjct: 494 VKYWIAVNSWGTWWGENGRFRI 515
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/133 (42%), Positives = 77/133 (57%), Gaps = 10/133 (7%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-----HAIRII 441
AYSL NE IM EI G V+ + +Y D Y+ GIY+H A E H++R+I
Sbjct: 424 AYSL-NNETDIMTEIKERGTVQAILRVYRDFFSYQNGIYRHSAAATPAEERSAYHSVRLI 482
Query: 442 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEI 501
GWG+E +G +VKYW+ NS+ T WGENG FRI+RG NEC IE+ + A P + +
Sbjct: 483 GWGEERVG---YDMVKYWIAVNSWGTWWGENGRFRILRGTNECEIESYVLASNPYVHQHV 539
Query: 502 DSNEINLGKMMTL 514
+ N+G + L
Sbjct: 540 QTVR-NVGDLQEL 551
>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
Length = 271
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/266 (34%), Positives = 130/266 (48%), Gaps = 23/266 (8%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
++LP F++ WP I E DQG+C + WA SDR+ I S G +LS +
Sbjct: 6 DQLPLYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQN 63
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCE--RYMNGSHSS 212
L+SC GC GG AW Y G+V+ Y + P + P E R M S S
Sbjct: 64 LISCDTRNQGGCAGGRLDGAWWYLRRRGVVTEDCYPYRP---PQQTPAELSRCMMQSRSV 120
Query: 213 CQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
+ T C P + +Y++D+ Y L +E+ IM+EI +GPV+ M ++
Sbjct: 121 GRGKRQATQRC-----PNTN-NYQNDIYQSTPPYRLSTSEKEIMKEIQDNGPVQAIMEVH 174
Query: 273 ADMILYKTGIYKHVAGG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
D +Y +GIYKH G H+++I GWG+E +GT+ KYW+ ANS+
Sbjct: 175 EDFFMYNSGIYKHTDVSFTKPPHYRKHGTHSVKITGWGEERNFDGTTR--KYWIAANSWG 232
Query: 325 TNWGENGLFRIGCRPYEIPCERYMNG 350
NWGENG FRI E E ++ G
Sbjct: 233 KNWGENGYFRIARGENECEIEAFVIG 258
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 96/184 (52%), Gaps = 22/184 (11%)
Query: 331 GLFRIGCRPYEIPCE------RYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFG 384
G+ C PY P + R M SRS + T C P + +Y++D+
Sbjct: 91 GVVTEDCYPYRPPQQTPAELSRCMMQSRSVGRGKRQATQRC-----PNTN-NYQNDIYQS 144
Query: 385 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEH 436
Y L +E+ IM+EI +GPV+ M ++ D +Y +GIYKH G H
Sbjct: 145 TPPYRLSTSEKEIMKEIQDNGPVQAIMEVHEDFFMYNSGIYKHTDVSFTKPPHYRKHGTH 204
Query: 437 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+++I GWG+E +GT+ KYW+ ANS+ NWGENG FRI RG+NEC IEA + +
Sbjct: 205 SVKITGWGEERNFDGTTR--KYWIAANSWGKNWGENGYFRIARGENECEIEAFVIGVWGR 262
Query: 497 IGLE 500
I +E
Sbjct: 263 ITME 266
>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
Length = 350
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/284 (32%), Positives = 138/284 (48%), Gaps = 42/284 (14%)
Query: 57 NALSKLTLSELEMRMGV-----HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCP 111
N L L ++E +++ V ++ L +R +++L D P FDAR WP C
Sbjct: 84 NLLGALNVNENDLKGEVMDKDNSTNTPLSDSRYLTILRLRD----FPTQFDAREQWPQC- 138
Query: 112 TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFH 171
I+ I++Q +CGS WA A ++DR CI S GK +V LS +VSC NGC GGF
Sbjct: 139 -IRSIKNQKNCGSCWAFSASSVLADRFCIKSGGKVNVDLSPQFMVSCSGQ-NNGCNGGFF 196
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
W++ V+ G VS + C PY G+ +C P + P Y
Sbjct: 197 DATWRFLVSVGTVS-------EACVPYV-----SFGGAVPACNVKSCGVPG---QKSPFY 241
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 291
L IM ++ +GP++ +M +Y D YK+G+Y HV+G +
Sbjct: 242 RAGSARKLE----------GMLDIMADLKANGPIQVAMGVYRDFYSYKSGVYHHVSGRYV 291
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA++I+GWG + ++S + YW+ ANS+ +WG G F I
Sbjct: 292 GGHAVKIVGWGYD-----SASKLPYWICANSWGEDWGIKGYFWI 330
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 66/101 (65%), Gaps = 5/101 (4%)
Query: 397 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 456
IM ++ +GP++ +M +Y D YK+G+Y HV+G +G HA++I+GWG + ++S +
Sbjct: 255 IMADLKANGPIQVAMGVYRDFYSYKSGVYHHVSGRYVGGHAVKIVGWGYD-----SASKL 309
Query: 457 KYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YW+ ANS+ +WG G F I+RG+ ECGI + +G P +
Sbjct: 310 PYWICANSWGEDWGIKGYFWILRGRGECGIGKMVWSGKPAL 350
>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
Length = 442
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/242 (35%), Positives = 120/242 (49%), Gaps = 11/242 (4%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP FD RI W T+Q++RDQG CG+ WA +DR+ I SRG LS +L+
Sbjct: 185 LPMSFDGRIEWR--DTLQDVRDQGWCGASWAFSTAAVAADRLAIQSRGHEVYPLSMQNLL 242
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDN 216
+C GC GG +AW Y G+V+ Y G C+ G+ ++ +
Sbjct: 243 ACNNRGQQGCNGGHLDRAWNYMRRFGVVNEECYPYISGRTGQVEKCKVPRRGNLATMKCQ 302
Query: 217 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 276
N E RK D L AY + E+ IM EI +HGPV+ +M ++ D
Sbjct: 303 LVNAAE--RKSDRS-DKPPRKGLFRSPPAYRIAPFEDDIMNEILQHGPVQATMRVHPDFF 359
Query: 277 LYKTGIYKHVAGGPL---GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
LY+ G+Y++ G H++RI+GWG + + KYWLVANS+ WGE+G F
Sbjct: 360 LYRGGVYRYSGTNSQQRSGYHSVRIVGWG---VDSSKRNPTKYWLVANSWGRLWGEDGYF 416
Query: 334 RI 335
RI
Sbjct: 417 RI 418
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 67/110 (60%), Gaps = 6/110 (5%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---GEHAIRIIGW 443
AY + E+ IM EI +HGPV+ +M ++ D LY+ G+Y++ G H++RI+GW
Sbjct: 328 AYRIAPFEDDIMNEILQHGPVQATMRVHPDFFLYRGGVYRYSGTNSQQRSGYHSVRIVGW 387
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
G + + KYWLVANS+ WGE+G FRIVRG+NE IE + A
Sbjct: 388 G---VDSSKRNPTKYWLVANSWGRLWGEDGYFRIVRGENESDIEKFVLAA 434
>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
Length = 362
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 138/289 (47%), Gaps = 22/289 (7%)
Query: 57 NALSKLTLSE-LEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQ 114
+A +TL E + R+G + P S + + + + P E LP F+A WP I
Sbjct: 58 SAFWGMTLDEGIRYRLGTIRPSSSVAN--MNEIHTVLGPGEVLPRAFEASEKWPN--LIH 113
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
+ DQG+C WA SDRV I S G LS +L+SC GCQGG A
Sbjct: 114 DPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQQGCQGGRLDGA 173
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W + G+VS Y R P R M S + + T C P V
Sbjct: 174 WWFLRRRGVVSDHCYPFSGHERNEAGPAPRCMMHSRAMGRGKRQATARC-----PNSYV- 227
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP- 290
+ +D+ AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H G P
Sbjct: 228 HANDIYQVTPAYRLGSNEKDIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSHGRPE 287
Query: 291 ----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G ++KYW ANS+ WGE G FRI
Sbjct: 288 RYRRHGTHSVKITGWGEETLPDG--RMLKYWTAANSWGPGWGERGHFRI 334
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 91/166 (54%), Gaps = 16/166 (9%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
P R M SR+ + T C P V + +D+ AY L +NE+ IM+E+
Sbjct: 200 PAPRCMMHSRAMGRGKRQATARC-----PNSYV-HANDIYQVTPAYRLGSNEKDIMKELM 253
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LY++GIY H G P G H+++I GWG+E L +G
Sbjct: 254 ENGPVQALMEVHEDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDG--R 311
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
++KYW ANS+ WGE G FRIVRG NEC IE+ + ++G+E
Sbjct: 312 MLKYWTAANSWGPGWGERGHFRIVRGANECDIESFVLGVWGRVGME 357
>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
scrofa]
Length = 368
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/290 (32%), Positives = 131/290 (45%), Gaps = 37/290 (12%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E + R+G P S L + + L + + LPE F A WP DQ
Sbjct: 74 MTLEEGFKYRLGTLPPSPLLLSMNEVTASLPETTD-LPEFFVASYKWP--GWTHGPLDQK 130
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C + WA +DR+ I S G+ LS +L+SCC +GC G +AW Y
Sbjct: 131 NCAASWAFSTASVAADRIAIQSEGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWYLRK 190
Query: 181 TGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
G+VS Y A+ GC R + C +N + I +C P Y V
Sbjct: 191 RGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNFEKSNR-IYQCSPPYRV 249
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG---- 289
S +NE IMREI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 250 S---------------SNETEIMREIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNEES 294
Query: 290 ----PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 295 DKYRKLRTHAVKLTGWGTLKGAQGRKE--KFWIAANSWGKSWGENGYFRI 342
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 68/118 (57%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IMREI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 247 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVK 306
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 307 LTGWGTLKGAQGRKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 362
>gi|161343873|tpg|DAA06117.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 254
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/193 (39%), Positives = 110/193 (56%), Gaps = 7/193 (3%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP FD+R WP CP+I I +QG+C S +A+ A A SDR+CI S ++ +S+ ++
Sbjct: 63 LPTNFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIHSNSTKNPIMSAQQII 122
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGS---HSSC 213
SCC CG GC GG ++W ++ G VSGG Y S QGC+PY IP + +N HS
Sbjct: 123 SCCYLCGYGCDGGSLFESWDFYRRHGFVSGGEYNSNQGCQPYTIPPCKLINEKPPGHSCT 182
Query: 214 QDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
N TP C +KC P Y S+ D+ G+ P M+EIF +GP+ +Y
Sbjct: 183 TFNREETPTCEKKCNNPNYYTSFRADIYRGKYYKVSPY---MAMKEIFDNGPITTQFYMY 239
Query: 273 ADMILYKTGIYKH 285
D++ YK+G+Y++
Sbjct: 240 RDLVDYKSGVYQY 252
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 49/96 (51%), Gaps = 7/96 (7%)
Query: 336 GCRPYEIPCERYMNGS---RSSCQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLP 391
GC+PY IP + +N S N TP C +KC P Y S+ D+ G+ P
Sbjct: 160 GCQPYTIPPCKLINEKPPGHSCTTFNREETPTCEKKCNNPNYYTSFRADIYRGKYYKVSP 219
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 427
M+EIF +GP+ +Y D++ YK+G+Y++
Sbjct: 220 Y---MAMKEIFDNGPITTQFYMYRDLVDYKSGVYQY 252
>gi|157058755|gb|ABV03135.1| cathepsin B-84 [Aulacorthum solani]
Length = 218
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/179 (38%), Positives = 107/179 (59%), Gaps = 5/179 (2%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
E+PE FD+R+ W YC TI +R+QG+CGS WA G A +DR+C+A+ G+ + +S++++
Sbjct: 43 EVPEFFDSRLEWKYCKTIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEVNQLISAEEV 102
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC-- 213
CC CG GC GG +AW+Y+ G+V+GG Y + GC+PY +P + H+SC
Sbjct: 103 TFCCHRCGFGCNGGNPLRAWQYFKRHGVVTGGDYNTTDGCQPYRVPPCVKDDKGHNSCSG 162
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
Q E N +C +KC V Y+ D + AY L + T+ ++ +GP+E S +Y
Sbjct: 163 QPTERNH-KCSKKCYGDDTVDYKSDHYKTKDAYYL--SNTTMQKDTMVYGPIEASFDVY 218
>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
Length = 208
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/189 (38%), Positives = 113/189 (59%), Gaps = 20/189 (10%)
Query: 148 VRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYM 206
+ LS +DL++CC CG+GC GG+ +AW+Y+V G+V+ C PY P +
Sbjct: 3 ILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVT-------DECDPYFDP----V 51
Query: 207 NGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVE 266
H C+ P TP+C +KC+ V +++ +F AY + ++ IM E++++GPVE
Sbjct: 52 GCKHPGCEPAYP-TPKCEKKCKEQNQV-WQEKKHFSIDAYRINSDPHDIMAEVYKNGPVE 109
Query: 267 GSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
+ T+Y D YK+G+YKH+ GG +G HA+++IGWG GE YWL+AN +N
Sbjct: 110 VAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGE------DYWLLANQWNRG 163
Query: 327 WGENGLFRI 335
WG++G F+I
Sbjct: 164 WGDDGYFKI 172
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 102/167 (61%), Gaps = 12/167 (7%)
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 388
+NG+ C PY P + C+ P TP+C +KC+ V +++ +F AY
Sbjct: 36 QNGVVTDECDPYFDP----VGCKHPGCEPAYP-TPKCEKKCKEQNQV-WQEKKHFSIDAY 89
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
+ ++ IM E++++GPVE + T+Y D YK+G+YKH+ GG +G HA+++IGWG
Sbjct: 90 RINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDA 149
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
GE YWL+AN +N WG++G F+I+RG+NECGIE + AG+P
Sbjct: 150 GE------DYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMP 190
>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
Length = 526
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 132/281 (46%), Gaps = 38/281 (13%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
++ R+G + QN +L++ ELPE FDAR W + I I DQG CGS W
Sbjct: 232 IKYRLGTLFPERSVQNMNEILIKP----RELPEHFDARDKWGH--LIHPIADQGDCGSSW 285
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A+ SDR+ I S G+ + LSS L+SC + GC+GG+ +AW Y G+V
Sbjct: 286 AVSTTGISSDRLSIISEGRINASLSSQQLLSCNQHRQKGCEGGYLDRAWWYIRKLGVVGD 345
Query: 187 GT--YASKQGCRPYE--IPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
Y S Q P IP Y N C ++ K P Y VS
Sbjct: 346 HCYPYVSGQSREPGHCLIPKRDYTNRQGLRCPSGSQDST--AFKMTPPYKVS-------- 395
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--------VAGGPLGEH 294
+ EE I E+ +GPV+ + ++ D +Y G+Y+H + G H
Sbjct: 396 -------SREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYH 448
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++R++GWG + T +KYWL ANS+ T WGE+G F+I
Sbjct: 449 SVRVLGWG---VDHSTGRPIKYWLCANSWGTQWGEDGYFKI 486
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 66/117 (56%), Gaps = 11/117 (9%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--------VAGGPLGEHAIR 439
Y + + EE I E+ +GPV+ + ++ D +Y G+Y+H + G H++R
Sbjct: 392 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVR 451
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
++GWG + T +KYWL ANS+ T WGE+G F+I+RG+N C IE+ + K
Sbjct: 452 VLGWG---VDHSTGRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGAWGK 505
>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
Length = 362
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 137/289 (47%), Gaps = 22/289 (7%)
Query: 57 NALSKLTLSE-LEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQ 114
+A +TL E + R+G V P S + + + + P E LP F+A WP I
Sbjct: 58 SAFWGMTLDEGIRYRLGTVRPSSSV--TNMNEIHTVLGPGEVLPRTFEASEKWPN--LIH 113
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
+ DQG+C WA SDRV I S G LS +L+SC GC GG A
Sbjct: 114 DPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSCDTHNQQGCHGGRLDGA 173
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W + G+VS Y R +P M S + + T C P V
Sbjct: 174 WWFLRRRGVVSDHCYPFSGHGRDEAVPAPPCMMHSRAMGRGKRQATARC-----PNSYV- 227
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP- 290
+ +D+ AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H G P
Sbjct: 228 HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPE 287
Query: 291 ----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + VKYW ANS+ WGE G FRI
Sbjct: 288 RYRRHGTHSVKITGWGEETLPDGRT--VKYWTAANSWGPAWGERGHFRI 334
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 76/122 (62%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H G P G H++
Sbjct: 238 AYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSV 297
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + VKYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 298 KITGWGEETLPDGRT--VKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRVG 355
Query: 499 LE 500
+E
Sbjct: 356 ME 357
>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
Length = 428
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 135/288 (46%), Gaps = 20/288 (6%)
Query: 57 NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
+A +TL E + R+G S N + L P E LP F+A WP I +
Sbjct: 124 SAFWGMTLDEGIRYRLGTVRPSSFVANMNEIHTVLG-PGEVLPRTFEASEKWPN--LIHD 180
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
DQG+C WA SDRV I S G LS +L+SC GC+GG AW
Sbjct: 181 PLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSCDTHNQQGCRGGRLDGAW 240
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ G+VS Y R +P M S + + T C P V +
Sbjct: 241 WFLRRRGVVSDHCYPFSGHGRDEAVPAPPCMMHSRAMGRGKRQATARC-----PNSYV-H 294
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-- 290
+D+ AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H G P
Sbjct: 295 ANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPER 354
Query: 291 ---LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 355 YRRHGTHSVKITGWGEETLPDGRT--IKYWTAANSWGPAWGERGHFRI 400
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 53/122 (43%), Positives = 76/122 (62%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H G P G H++
Sbjct: 304 AYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSV 363
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 364 KITGWGEETLPDGRT--IKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRVG 421
Query: 499 LE 500
+E
Sbjct: 422 ME 423
>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
Length = 197
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 111/205 (54%), Gaps = 13/205 (6%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTG 182
S WA+ + EAMSD +C+ S V +S D++SCC CG GCQGG+ +A+K+
Sbjct: 1 SCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGYGCQGGWSIEAYKWMQRER 60
Query: 183 IVSGGTYASKQGCRPYEIPCERYMNGS----HSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
++ C+P P R N + C TP+C + CQ Y SY++D
Sbjct: 61 CCYRWENTDRRVCKPVR-PSIRVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYYKSYQED 119
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRI 298
+F AY LP NE +I +EI+++GPV + +Y D YK GIY H GG G HA+++
Sbjct: 120 KHFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKV 179
Query: 299 IGWGQEPLGEGTSSVVKYWLVANSF 323
+GWG+E + YWL+ANS+
Sbjct: 180 VGWGRE-------NATDYWLIANSW 197
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 75/143 (52%), Gaps = 28/143 (19%)
Query: 344 CERYMNGSRSSCQ--------ANEPN-------------TPECIRKCQPGYDVSYEDDLN 382
C R+ N R C+ N PN TP+C + CQ Y SY++D +
Sbjct: 62 CYRWENTDRRVCKPVRPSIRVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYYKSYQEDKH 121
Query: 383 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 442
F AY LP NE +I +EI+++GPV + +Y D YK GIY H GG G HA++++G
Sbjct: 122 FATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVVG 181
Query: 443 WGQEPLGEGTSSVVKYWLVANSF 465
WG+E + YWL+ANS+
Sbjct: 182 WGRE-------NATDYWLIANSW 197
>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
garnettii]
Length = 464
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 136/296 (45%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL + + R+G P S PLL+ +++ +LPE F A WP
Sbjct: 170 MTLEDGFKFRLGTLPPS-------PLLLSMNEMTASLPKTTDLPEFFVASYKWP--GWTH 220
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +A
Sbjct: 221 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 280
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W Y G+VS Y A+ GC R + C +N + I +C
Sbjct: 281 WWYLRKRGLVSHACYPLFKDQHATNSGCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQC 339
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y +S +NE IM+EI ++GPV+ M ++ D YK+GIY+HVA
Sbjct: 340 SPPYRIS---------------SNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIYRHVA 384
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA++++GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 385 STHGESENYRKLRTHAVKLLGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRI 438
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 70/118 (59%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YK+GIY+HVA L HA++
Sbjct: 343 YRISSNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIYRHVASTHGESENYRKLRTHAVK 402
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
++GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 403 LLGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 458
>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
Length = 463
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/261 (34%), Positives = 130/261 (49%), Gaps = 29/261 (11%)
Query: 80 PQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVC 139
P+ R+ + +LS+ LP FDA +W + E RDQG CGS WA SDR
Sbjct: 170 PRFRVKAMKRLSNKGGHLPTRFDASEHWT--GLVAEARDQGWCGSSWAFSTATMASDRFA 227
Query: 140 IASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYE 199
I S+G+ V+L+ +++C + GC GG AW+Y TG+V+ + C PY
Sbjct: 228 ILSKGREMVQLAPQQMLACVRR-QQGCSGGHLDTAWQYLRRTGVVN-------EECYPY- 278
Query: 200 IPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 259
+ + C+ + +T C+ V+ G A+SL NE IM EI
Sbjct: 279 -------IAAQNVCKISNDDTL-ITANCELPVKVNRTLMYKMG-PAFSL-NNETDIMAEI 328
Query: 260 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-----HAIRIIGWGQEPLGEGTSSVV 314
G V+ M +Y D Y++GIY+H A E H++R+IGWG+E +G VV
Sbjct: 329 KDRGTVQAIMRVYRDFFSYRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVG---YDVV 385
Query: 315 KYWLVANSFNTNWGENGLFRI 335
KYW+ NS+ WGENG FRI
Sbjct: 386 KYWIAINSWGQWWGENGRFRI 406
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/122 (43%), Positives = 72/122 (59%), Gaps = 9/122 (7%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-----HAIRII 441
A+SL NE IM EI G V+ M +Y D Y++GIY+H A E H++R+I
Sbjct: 315 AFSL-NNETDIMAEIKDRGTVQAIMRVYRDFFSYRSGIYRHSAAATPAEERSAYHSVRLI 373
Query: 442 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEI 501
GWG+E +G VVKYW+ NS+ WGENG FRI+RG NEC IE+ + A P + +
Sbjct: 374 GWGEERVG---YDVVKYWIAINSWGQWWGENGRFRILRGSNECDIESYVLASNPYVHEHV 430
Query: 502 DS 503
+
Sbjct: 431 QA 432
>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
harrisii]
Length = 467
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 95/285 (33%), Positives = 134/285 (47%), Gaps = 24/285 (8%)
Query: 62 LTLSE-LEMRMG-VHPDSK-LPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+TL E + R+G V P S + N + +++ P E LP F A WP I E D
Sbjct: 167 MTLDEGIRYRLGTVRPTSSVMNMNEIQMVMS---PDETLPSAFSASNKWP--GLIHEPLD 221
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
QG+C WA SDR+ I S G LS +L+SC +GC+GG AW +
Sbjct: 222 QGNCAGSWAFSTAAVASDRISIHSMGHMSPALSPQNLLSCNTHNQHGCRGGRLDGAWWFL 281
Query: 179 VTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
G+VS Y +G P M S + T C ++ +
Sbjct: 282 RRRGLVSNNCYPFSEGDHNGAAPAAPCMMHSRHMGRGKRQATAHCPNS------RTHANH 335
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP----- 290
+ Y L ++E+ IM+E+ +GPV+ + ++ D LYK+GIYKH G P
Sbjct: 336 IYQATPPYRLSSHEKDIMKELMENGPVQALLEVHEDFFLYKSGIYKHTPASLGKPERYRQ 395
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E +G VKYW ANS+ WGENG FRI
Sbjct: 396 HGTHSVKITGWGEEIQPDGQK--VKYWTAANSWGPTWGENGYFRI 438
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/121 (43%), Positives = 74/121 (61%), Gaps = 10/121 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAIR 439
Y L ++E+ IM+E+ +GPV+ + ++ D LYK+GIYKH G P G H+++
Sbjct: 343 YRLSSHEKDIMKELMENGPVQALLEVHEDFFLYKSGIYKHTPASLGKPERYRQHGTHSVK 402
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGL 499
I GWG+E +G VKYW ANS+ WGENG FRIVRG NEC IE+ + ++G
Sbjct: 403 ITGWGEEIQPDGQK--VKYWTAANSWGPTWGENGYFRIVRGANECDIESFVVGVWGRVGT 460
Query: 500 E 500
E
Sbjct: 461 E 461
>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 330
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 86/256 (33%), Positives = 130/256 (50%), Gaps = 21/256 (8%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++ E FDAR WP C TI E+ D G+ GWA ++DR+CIA+ G + LS+++L
Sbjct: 85 QIHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEEL 144
Query: 156 VSC--CKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC 213
+ C K +G G W+Y + G+VSGG Y + GC+P +IP G+ +
Sbjct: 145 IFCGGIKTKQSGAVRG--DDVWEYLKSHGLVSGGKYNTNDGCQPSKIP----PIGNIPTH 198
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
N C +C + Y D Y++ +NE+ I +E+ +GPV +Y
Sbjct: 199 LYNHT----CEERCYGNNTIHYYHDHVKVSHYYNIKSNED-IQKEVQTYGPVSVKFRVYD 253
Query: 274 DMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
D LYK+G+Y K + H ++IGWG E + V YWL+ NS+ WG+NGL
Sbjct: 254 DFFLYKSGVYVKTEKSLYVRRHFAKLIGWGVE-------NGVDYWLLVNSWGNEWGQNGL 306
Query: 333 FRIGCRPYEIPCERYM 348
F+I E+ E Y+
Sbjct: 307 FKIKRGTNEVHVEDYV 322
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 9/134 (6%)
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C +C + Y D Y++ +NE+ I +E+ +GPV +Y D LYK+G+
Sbjct: 204 CEERCYGNNTIHYYHDHVKVSHYYNIKSNED-IQKEVQTYGPVSVKFRVYDDFFLYKSGV 262
Query: 425 Y-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
Y K + H ++IGWG E + V YWL+ NS+ WG+NGLF+I RG NE
Sbjct: 263 YVKTEKSLYVRRHFAKLIGWGVE-------NGVDYWLLVNSWGNEWGQNGLFKIKRGTNE 315
Query: 484 CGIEADITAGLPKI 497
+E + AG P+I
Sbjct: 316 VHVEDYVYAGEPEI 329
>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
Length = 470
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 91/281 (32%), Positives = 133/281 (47%), Gaps = 38/281 (13%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
++ R+G + QN +L++ ELPE FDAR W + I + DQG CGS W
Sbjct: 176 IKYRLGTLFPERSVQNMNEILIKP----RELPEHFDARDKWGH--LIHPVADQGDCGSSW 229
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A+ SDR+ I S G+ + LSS L+SC + GC+GG+ +AW Y G+V
Sbjct: 230 AVSTTGISSDRLSIISEGRINASLSSQQLLSCNQHRQKGCEGGYLDRAWWYIRKLGVVGD 289
Query: 187 GT--YASKQGCRPYE--IPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
Y S Q P IP Y N C + ++ K P Y VS
Sbjct: 290 HCYPYVSGQSREPGHCLIPKRDYTNRQGLRCPSGDQDST--AFKMTPPYKVS-------- 339
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--------VAGGPLGEH 294
+ EE I E+ +GPV+ + ++ D +Y G+Y+H + G H
Sbjct: 340 -------SREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYH 392
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++R++GWG + T +KYWL ANS+ T WGE+G F+I
Sbjct: 393 SVRVLGWG---VDHSTGRPIKYWLCANSWGTQWGEDGYFKI 430
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 66/117 (56%), Gaps = 11/117 (9%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--------VAGGPLGEHAIR 439
Y + + EE I E+ +GPV+ + ++ D +Y G+Y+H + G H++R
Sbjct: 336 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVR 395
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
++GWG + T +KYWL ANS+ T WGE+G F+I+RG+N C IE+ + K
Sbjct: 396 VLGWG---VDHSTGRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGAWGK 449
>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Bos taurus]
gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
Length = 534
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 96/288 (33%), Positives = 133/288 (46%), Gaps = 20/288 (6%)
Query: 57 NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
+A +TL E + R+G S N + L P E LP F+A WP I +
Sbjct: 230 SAFWGMTLDEGIRYRLGTVRPSSFVANMNEIHTVLG-PGEVLPRTFEASEKWPN--LIHD 286
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
DQG+C WA SDRV I S G LS +L+SC GC+GG AW
Sbjct: 287 PLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSCDTHNQQGCRGGRLDGAW 346
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ G+VS Y R +P M S + + T C +
Sbjct: 347 WFLRRRGVVSDHCYPFSGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNS------YVH 400
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-- 290
+D+ AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H G P
Sbjct: 401 ANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPER 460
Query: 291 ---LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 461 YRRHGTHSVKITGWGEETLPDGRT--IKYWTAANSWGPAWGERGHFRI 506
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 53/122 (43%), Positives = 76/122 (62%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H G P G H++
Sbjct: 410 AYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSV 469
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 470 KITGWGEETLPDGRT--IKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRVG 527
Query: 499 LE 500
+E
Sbjct: 528 ME 529
>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
Flags: Precursor
gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
Length = 452
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 132/281 (46%), Gaps = 38/281 (13%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
++ R+G + QN +L++ ELPE FDAR W P I + DQG CGS W
Sbjct: 158 IKYRLGTLFPERSVQNMNEILIKP----RELPEHFDARDKWG--PLIHPVADQGDCGSSW 211
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
++ SDR+ I S G+ + LSS L+SC + GC+GG+ +AW Y G+V
Sbjct: 212 SVSTTAISSDRLAIISEGRINSTLSSQQLLSCNQHRQKGCEGGYLDRAWWYIRKLGVVGD 271
Query: 187 GT--YASKQGCRPYE--IPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
Y S Q P IP Y N C ++ K P Y VS
Sbjct: 272 HCYPYVSGQSREPGHCLIPKRDYTNRQGLRCPSGSQDSTA--FKMTPPYKVS-------- 321
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--------VAGGPLGEH 294
+ EE I E+ +GPV+ + ++ D +Y G+Y+H + G H
Sbjct: 322 -------SREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYH 374
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++R++GWG + T +KYWL ANS+ T WGE+G F++
Sbjct: 375 SVRVLGWG---VDHSTGKPIKYWLCANSWGTQWGEDGYFKV 412
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 11/117 (9%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--------VAGGPLGEHAIR 439
Y + + EE I E+ +GPV+ + ++ D +Y G+Y+H + G H++R
Sbjct: 318 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVR 377
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
++GWG + T +KYWL ANS+ T WGE+G F+++RG+N C IE+ + K
Sbjct: 378 VLGWG---VDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRGENHCEIESFVIGAWGK 431
>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
Length = 283
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 90/266 (33%), Positives = 127/266 (47%), Gaps = 49/266 (18%)
Query: 79 LPQNRLPLLVQLS-DPLEE--------LPEGFDARINWPYCPTIQEIRDQGSCGSGWALG 129
+ + RL LL L+ P+E+ +PE FDAR WP I +RDQ CGS WA
Sbjct: 37 ITRARLTLLAPLAIGPVEKFTIEDSFYVPESFDARDEWP--NAILPVRDQEKCGSCWAFS 94
Query: 130 AVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY 189
E++ DR I GK H LS DL+SC + GC GG+ +W + +TTGI +
Sbjct: 95 IAESLGDRFGILGCGKGH--LSPQDLISCDSN-DLGCNGGYQENSWTWVLTTGITT---- 147
Query: 190 ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 249
+ C PY R P C +C G + N+ R+
Sbjct: 148 ---ESCWPYRSGSGR---------------IPSCPHRCVNGSVLQRNTINNYRRL----- 184
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
+ + E++ +GP++ + +Y D Y GIYKH++G +G HA+ ++GWG E
Sbjct: 185 -DSSELQDELYNNGPIQVTYVVYEDFFYYSKGIYKHLSGNKVGGHAVVLMGWGIE----- 238
Query: 310 TSSVVKYWLVANSFNTNWGENGLFRI 335
VKYWLV NS+ WGE G FRI
Sbjct: 239 --DGVKYWLVQNSWGYEWGEQGYFRI 262
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 72/131 (54%), Gaps = 13/131 (9%)
Query: 363 PECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKT 422
P C +C G + N+ R+ + + E++ +GP++ + +Y D Y
Sbjct: 161 PSCPHRCVNGSVLQRNTINNYRRL------DSSELQDELYNNGPIQVTYVVYEDFFYYSK 214
Query: 423 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN 482
GIYKH++G +G HA+ ++GWG E VKYWLV NS+ WGE G FRI+RG N
Sbjct: 215 GIYKHLSGNKVGGHAVVLMGWGIE-------DGVKYWLVQNSWGYEWGEQGYFRILRGSN 267
Query: 483 ECGIEADITAG 493
ECGIE+ AG
Sbjct: 268 ECGIESSAYAG 278
>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
Length = 467
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 132/284 (46%), Gaps = 31/284 (10%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E R+G P S N + + S P E+ PE F A WP I + DQ
Sbjct: 187 MTLEEGFRKRLGTLPPSHSLLN-MKAIPGSSVPEEKFPEFFAATYAWP--DWIHDPLDQR 243
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN--GCQGGFHGKAWKYW 178
+CG+ WA +DR+ I S G+ LS +L+SC D GN GC GG AW+Y
Sbjct: 244 NCGASWAFSTASVAADRITIHSDGQITDNLSVQNLISC--DTGNQRGCNGGSIDGAWRYL 301
Query: 179 VTTGIVSGGTYAS---KQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
T G+VS Y S P E C Y++ + N P P +
Sbjct: 302 TTHGVVSYACYPSFWKHHLDSPSENQC--YVSSEYGKNHTNGP-CPNAL----------- 347
Query: 236 EDDLNFGRIA--YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--AGGPL 291
ED R Y + + E IM EI GPV+ M +Y D LYK GIY+H AG
Sbjct: 348 EDSNRLYRCGSHYRVSSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKW 407
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H+++++GWG P G K+W+ ANS+ WGENG FRI
Sbjct: 408 KTHSVKLLGWGSLPGKNGQKQ--KFWIAANSWGKYWGENGYFRI 449
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 50/109 (45%), Positives = 64/109 (58%), Gaps = 4/109 (3%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--AGGPLGEHAIRIIGWGQ 445
Y + + E IM EI GPV+ M +Y D LYK GIY+H AG H+++++GWG
Sbjct: 360 YRVSSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWGS 419
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
P G K+W+ ANS+ WGENG FRI+RGQNEC IE I L
Sbjct: 420 LPGKNGQKQ--KFWIAANSWGKYWGENGYFRILRGQNECDIEKLILTTL 466
>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
Length = 467
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 138/289 (47%), Gaps = 22/289 (7%)
Query: 57 NALSKLTLSE-LEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQ 114
+A +TL E + R+G + P S + + + + P E LP F+A WP I
Sbjct: 163 SAFWGMTLDEGIRYRLGTIRPSSSV--TNMNEIHTVLVPGERLPTAFEASEKWPN--LIH 218
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
E DQG+C WA SDRV I S G LS +L+SC K GC+GG A
Sbjct: 219 EPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDKHNQQGCRGGRLDGA 278
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W + G+VS Y R P R M S + + C P + V
Sbjct: 279 WWFLRRRGVVSDHCYPFSGQERNEAGPEPRCMMHSRAMGRGKRQAIARC-----PNHHV- 332
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP- 290
+ +D+ AY L +NE+ IM+E+ +GPV+ M ++ D LY+ GIY H G P
Sbjct: 333 HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGKPE 392
Query: 291 ----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 393 RYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 439
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 86/145 (59%), Gaps = 12/145 (8%)
Query: 364 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 423
+ I +C P + V + +D+ AY L +NE+ IM+E+ +GPV+ M ++ D LY+ G
Sbjct: 322 QAIARC-PNHHV-HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQGG 379
Query: 424 IYKHVA---GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 475
IY H G P G H+++I GWG+E L +G + +KYW ANS+ WGE G F
Sbjct: 380 IYSHTPVSLGKPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHF 437
Query: 476 RIVRGQNECGIEADITAGLPKIGLE 500
RIVRG NEC IE+ + ++G+E
Sbjct: 438 RIVRGTNECDIESFVLGVWGRVGME 462
>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
Length = 425
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 99/336 (29%), Positives = 147/336 (43%), Gaps = 54/336 (16%)
Query: 27 NGVFCDLSKAFDRVDHSILLPKLPFYGAEK------NALSKLTLSE-LEMRMGVHPDSKL 79
N C K+F R + L P+Y ++ + +TL E + R+G P S
Sbjct: 91 NSDCCPDYKSFCRGEKEWLPHATPWYTEDRWTAQNYSQFWGMTLEEGFKYRLGTLPPS-- 148
Query: 80 PQNRLPLLVQLSD-----PLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAM 134
P+L+ +++ + +LPE F A WP DQ +C + WA
Sbjct: 149 -----PMLLSMNEVTAVPAIIDLPEFFVAYYKWP--GWTHGPLDQKNCAASWAFSTASVA 201
Query: 135 SDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY----- 189
+DR+ I S+G+ LS +L+SCC +GC G +AW Y G+VS Y
Sbjct: 202 ADRIAIQSKGRYTANLSPQNLISCCAKNRHGCSSGSIDRAWWYLRKRGLVSHACYPFLKD 261
Query: 190 --ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS 247
+ C R + C +N + I +C P Y VS
Sbjct: 262 QNTTNNACAMASRSDGRGKRHATKPCPNNIEKSNR-IYQCSPPYRVS------------- 307
Query: 248 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIRII 299
+NE IM+EI +GPV+ M ++ D YK+GIY+HV L HA+++
Sbjct: 308 --SNETEIMKEIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKSEKYQKLQTHAVKLT 365
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GWG +G K+W+VANS+ +WGENG FRI
Sbjct: 366 GWGTLRGAQGRKE--KFWIVANSWGNSWGENGYFRI 399
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 68/118 (57%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI +GPV+ M ++ D YK+GIY+HV L HA++
Sbjct: 304 YRVSSNETEIMKEIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKSEKYQKLQTHAVK 363
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+VANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 364 LTGWGTLRGAQGRKE--KFWIVANSWGNSWGENGYFRILRGVNESDIEKLIIAAWGQL 419
>gi|149436731|ref|XP_001513125.1| PREDICTED: cathepsin B-like [Ornithorhynchus anatinus]
Length = 211
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 64/113 (56%), Positives = 81/113 (71%), Gaps = 1/113 (0%)
Query: 83 RLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIAS 142
+LP V L++ +LPE FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDRVC+ +
Sbjct: 67 KLPARVGLANSDMKLPENFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRVCVHT 126
Query: 143 RGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQG 194
G+ V +S++DL++CC +CG GC GG+ AW YW G+VSGG Y S G
Sbjct: 127 NGQVSVEVSAEDLLTCCGLECGMGCNGGYPTGAWTYWTKKGLVSGGLYDSHVG 179
>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
Length = 362
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 99/294 (33%), Positives = 134/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S L N + L +P E LP F+A WP
Sbjct: 52 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSLVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 110
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 111 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 168
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 169 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATARC-----P 223
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 224 NSHVNNNDIYQVTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 282
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 283 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 334
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 238 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 297
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 298 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 355
Query: 499 LE 500
+E
Sbjct: 356 ME 357
>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
Length = 475
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 137/292 (46%), Gaps = 42/292 (14%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD-----PLEELPEGFDARINWPYCPTIQE 115
+TL E + R+G P S P+L+ +++ P +LPE F A WP
Sbjct: 182 MTLEEGFKFRLGTLPPS-------PMLLSMNEMTASYPRADLPEVFIASYKWP--GWTHG 232
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +AW
Sbjct: 233 PLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAW 292
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC----QDNEPNTPECIRKCQPGY 231
+ G+VS Y P + + +++SC + + R C +
Sbjct: 293 WFLRKRGLVSHACY-----------PLFKEQSTNNNSCAMASRSDGRGKRHATRPCPNSF 341
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-- 289
+ S + + Y + +NE IMREI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 342 EKS--NRIYQCSPPYRISSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNE 399
Query: 290 ------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 400 EPEKYRKLRTHAVKLTGWGTLRGAQGKKE--KFWIAANSWGKSWGENGYFRI 449
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 68/118 (57%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IMREI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 354 YRISSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVK 413
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 414 LTGWGTLRGAQGKKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 469
>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 322
Score = 134 bits (338), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 133/288 (46%), Gaps = 20/288 (6%)
Query: 57 NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
+A +TL E + R+G S L N + L +P E LP F+A WP I E
Sbjct: 18 SAFWGMTLDEGIRYRLGTIRPSSLVMNMHEIYTVL-NPGEVLPTAFEASEKWPNL--IHE 74
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
DQG+C WA SDRV I S G LS +L++C GC+GG AW
Sbjct: 75 PLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLACDTHHQQGCRGGRLDGAW 134
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ G+VS Y R P M S + + T C P V+
Sbjct: 135 WFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATARC-----PNSHVNN 189
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-- 290
D + Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P
Sbjct: 190 NDIYQVTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPER 248
Query: 291 ---LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 249 YRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 294
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/166 (37%), Positives = 88/166 (53%), Gaps = 19/166 (11%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
PC M SR+ + T C P V+ D + Y L +N++ IM+E+
Sbjct: 163 PC---MMHSRAMGRGKRQATARC-----PNSHVNNNDIYQVTPV-YRLGSNDKEIMKELM 213
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LYK GIY H G P G H+++I GWG+E L +G +
Sbjct: 214 ENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRT- 272
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+KYW ANS+ WGE G FRIVRG NEC IE+ + ++G+E
Sbjct: 273 -LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVGME 317
>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
guttata]
Length = 469
Score = 134 bits (338), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 94/280 (33%), Positives = 133/280 (47%), Gaps = 23/280 (8%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+T+ E + R+G P S N + + S P E+ P F A WP I + DQ
Sbjct: 187 MTVEEGFKKRLGTFPPSHSLLN-MREVPGKSLPEEKFPAIFSAIYEWP--EWIHDPLDQR 243
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+CG+ WA +DR+ I S+G+ LS+ +L+SC +GC GG AW+Y T
Sbjct: 244 NCGASWAFSTASVAADRIAIHSKGQITDNLSAQNLISCDTRNQHGCNGGSIDGAWRYLKT 303
Query: 181 TGIVSGGTYAS---KQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
G+VS Y S K E C Y++ + N P P K Y +
Sbjct: 304 HGVVSYACYPSFWNKHLGPSAENQC--YVSNEYGKNHTNGP-CPNAFEKSNRLYRCASH- 359
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--AGGPLGEHA 295
Y + + E IM+EI GPV+ M +Y D LYK GIY+H AG H+
Sbjct: 360 --------YRVSSKETDIMKEIKDRGPVQAIMKVYEDFFLYKEGIYQHSQKAGSKWKTHS 411
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++++GWG P G K+W+ ANS+ +WGENG FRI
Sbjct: 412 VKLLGWGALPDKNGQKQ--KFWIAANSWGKSWGENGYFRI 449
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 51/109 (46%), Positives = 67/109 (61%), Gaps = 4/109 (3%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--AGGPLGEHAIRIIGWGQ 445
Y + + E IM+EI GPV+ M +Y D LYK GIY+H AG H+++++GWG
Sbjct: 360 YRVSSKETDIMKEIKDRGPVQAIMKVYEDFFLYKEGIYQHSQKAGSKWKTHSVKLLGWGA 419
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
P G K+W+ ANS+ +WGENG FRI+RGQNEC IE I A L
Sbjct: 420 LPDKNGQKQ--KFWIAANSWGKSWGENGYFRILRGQNECDIEKLILATL 466
>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
Length = 476
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 131/290 (45%), Gaps = 37/290 (12%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL + + R+G P S + + + L +LPE F A WP DQ
Sbjct: 182 MTLEDGFKFRLGTLPPSLMLLSMNEMTASLP-ATTDLPEFFVASYKWP--GWTHGPLDQK 238
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C + WA +DR+ I S+G+ LS +L+SCC +GC G +AW Y
Sbjct: 239 NCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWYLRK 298
Query: 181 TGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
G+VS Y A+ GC R + C +N + I +C P Y V
Sbjct: 299 RGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNR-IYQCSPPYRV 357
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG---- 289
S +NE IM+EI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 358 S---------------SNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKES 402
Query: 290 ----PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 403 EKYRKLQTHAVKLTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRI 450
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 68/118 (57%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
Length = 476
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 131/290 (45%), Gaps = 37/290 (12%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL + + R+G P S + + + L +LPE F A WP DQ
Sbjct: 182 MTLEDGFKFRLGTLPPSLMLLSMNEMTASLP-ATTDLPEFFVASYKWP--GWTHGPLDQK 238
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C + WA +DR+ I S+G+ LS +L+SCC +GC G +AW Y
Sbjct: 239 NCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWYLRK 298
Query: 181 TGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
G+VS Y A+ GC R + C +N + I +C P Y V
Sbjct: 299 RGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNR-IYQCSPPYRV 357
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG---- 289
S +NE IM+EI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 358 S---------------SNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKES 402
Query: 290 ----PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 403 EKYRKLQTHAVKLTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRI 450
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 68/118 (57%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 450
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 142/305 (46%), Gaps = 47/305 (15%)
Query: 49 LPFYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLE--ELPEGFDARI 105
L + + + L LT +E + R+G P P L + +++ E LPE FDAR
Sbjct: 150 LGWRASNYSFLWGLTQAEGVLYRLGTFP----PGRALSEMAEVNIDTEGARLPETFDARE 205
Query: 106 NWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNG 165
NWP I E+ DQG CGS WA+ SDR+ I S G+ + RLS L+SC G
Sbjct: 206 NWP--GLIDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQHLLSCNIRGQRG 263
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIR 225
C GG+ +AW + G VS + C PY HS + +T
Sbjct: 264 CSGGYLDRAWYHLRRAGAVS-------RACYPY-----------HSGLDE---DTIMQKL 302
Query: 226 KCQPGYDVS------YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
+C+ Y S DL Y + A E IM EI+++GPV+ + + D +Y
Sbjct: 303 RCRVAYGSSQCPERGVTSDLYLSTPPYRIAAREVDIMTEIYQNGPVQATFNVKNDFFVYN 362
Query: 280 TGIYKHVA---------GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
G+Y++V G H+++I+GWG + + +KYWL NS+ NWGE
Sbjct: 363 RGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGIDR--SDWYNPIKYWLCTNSWGRNWGEQ 420
Query: 331 GLFRI 335
G+FRI
Sbjct: 421 GMFRI 425
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 46/128 (35%), Positives = 70/128 (54%), Gaps = 11/128 (8%)
Query: 380 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---------G 430
DL Y + A E IM EI+++GPV+ + + D +Y G+Y++V
Sbjct: 321 DLYLSTPPYRIAAREVDIMTEIYQNGPVQATFNVKNDFFVYNRGVYRNVKQEFTASQSDS 380
Query: 431 GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADI 490
G H+++I+GWG + + +KYWL NS+ NWGE G+FRIVRG NEC IE+ +
Sbjct: 381 DQAGWHSVKIVGWGIDR--SDWYNPIKYWLCTNSWGRNWGEQGMFRIVRGVNECEIESFV 438
Query: 491 TAGLPKIG 498
++G
Sbjct: 439 LGVWMQVG 446
>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
Length = 475
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 133/295 (45%), Gaps = 48/295 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL E R+G S P+L+ +++ +LPE F A WP
Sbjct: 182 MTLEEGFTFRLGTLAPS-------PMLLSMNEVTAALPAKTDLPEFFIASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
+ DQ +C + WA +DR+ I S G+ V LS +L+SCC GC GG +A
Sbjct: 233 DPLDQKNCAASWAFSTASVAADRIAIQSNGRYTVNLSPQNLISCCLKHRYGCSGGSIDRA 292
Query: 175 WKYWVTTGIVSGGTYA------SKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQ 228
W Y G+VS Y S GC R + + C +N + I +C
Sbjct: 293 WWYLRKRGLVSHACYPLFKDQNSTNGCAMASRSDGRGKRHATTPCPNNIEKSNR-IYQCS 351
Query: 229 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG 288
P Y VS +NE IM+EI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 352 PPYRVS---------------SNETQIMKEIMKNGPVQAIMQVHEDFFYYKTGIYRHVTS 396
Query: 289 G--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 397 TIEDSEKYQKLRTHAVKLTGWGTLRGAKGRKE--KFWIAANSWGKSWGENGYFRI 449
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 68/118 (57%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 354 YRVSSNETQIMKEIMKNGPVQAIMQVHEDFFYYKTGIYRHVTSTIEDSEKYQKLRTHAVK 413
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 414 LTGWGTLRGAKGRKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 469
>gi|552159|gb|AAA29434.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 240
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/143 (44%), Positives = 91/143 (63%), Gaps = 1/143 (0%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+PE +D RI W C ++ I DQ +CGS WA+ + AMSDR+CIAS+G + V +S+ D+V
Sbjct: 95 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 154
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
SCC CG+GC+GG+ A+++ G+V+GG Y +K CRPYEI PC + N ++
Sbjct: 155 SCCTWCGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGECV 214
Query: 216 NEPNTPECIRKCQPGYDVSYEDD 238
+TP C R+C GY SY D
Sbjct: 215 GMADTPRCKRRCLLGYPKSYPSD 237
>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
glaber]
Length = 467
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 139/288 (48%), Gaps = 20/288 (6%)
Query: 57 NALSKLTL-SELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
+A +TL S + R+G S N + L+ P E LP+ F+A WP I +
Sbjct: 163 SAFWGMTLDSGIRYRLGTIRPSSSVMNMNEIYTVLA-PGEVLPKAFEASKKWPN--MIHD 219
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
DQG+C WA SDRV I S G LS +L+SC GCQGG AW
Sbjct: 220 PLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPVLSPQNLLSCDTHHQQGCQGGRLDGAW 279
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ G+VS Y + P M S + + T R+C +D +
Sbjct: 280 WFLRRRGVVSDHCYPFSGHEQAEAGPATPCMMHSRAMGRGKRQAT----RRCPNSHDDA- 334
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV---AGGP-- 290
+++ AY L ++E+ IM+E+ +GPV+ M +Y D LYK+GIY H G P
Sbjct: 335 -NEIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVYEDFFLYKSGIYSHTLVSMGRPEQ 393
Query: 291 ---LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ +WGE G FRI
Sbjct: 394 YRRHGTHSVKITGWGEEMLPDGRT--LKYWTAANSWGPSWGERGYFRI 439
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 53/122 (43%), Positives = 77/122 (63%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV---AGGP-----LGEHAI 438
AY L ++E+ IM+E+ +GPV+ M +Y D LYK+GIY H G P G H++
Sbjct: 343 AYRLGSDEKEIMKELMENGPVQALMEVYEDFFLYKSGIYSHTLVSMGRPEQYRRHGTHSV 402
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ +WGE G FRI+RG NEC IE+ + ++G
Sbjct: 403 KITGWGEEMLPDGRT--LKYWTAANSWGPSWGERGYFRILRGSNECDIESFVLGVWGRVG 460
Query: 499 LE 500
+E
Sbjct: 461 ME 462
>gi|552158|gb|AAA29433.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 236
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/143 (44%), Positives = 91/143 (63%), Gaps = 1/143 (0%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+PE +D RI W C ++ I DQ +CGS WA+ + AMSDR+CIAS+G + V +S+ D+V
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
SCC CG+GC+GG+ A+++ G+V+GG Y +K CRPYEI PC + N ++
Sbjct: 151 SCCTWCGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGECV 210
Query: 216 NEPNTPECIRKCQPGYDVSYEDD 238
+TP C R+C GY SY D
Sbjct: 211 GMADTPRCKRRCLLGYPKSYPSD 233
>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
Length = 467
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 99/294 (33%), Positives = 134/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S L N + L +P E LP F+A WP
Sbjct: 157 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSLVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 215
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 216 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 273
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 274 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATARC-----P 328
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 329 NSHVNNNDIYQVTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 387
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 388 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 439
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 343 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 402
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 403 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 460
Query: 499 LE 500
+E
Sbjct: 461 ME 462
>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
Length = 454
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 135/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
++ +A +TL E + R+G S N + L P E LP F+A WP
Sbjct: 144 WWAGNHSAFWGMTLDEGIRYRLGTMRPSSSVTNMNEIHTVLR-PGEVLPTAFEASEKWPN 202
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC GG
Sbjct: 203 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQRGCHGG 260
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y + P R M S + + T C
Sbjct: 261 RLDGAWWFLRRRGVVSDHCYPFVGREQDEAGPAPRCMMHSRAMGRGKRQATARCPSS--- 317
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
++ +D+ AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H
Sbjct: 318 ---HAHANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVS 374
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 375 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 426
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 61/166 (36%), Positives = 90/166 (54%), Gaps = 16/166 (9%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
P R M SR+ + T C ++ +D+ AY L +NE+ IM+E+
Sbjct: 292 PAPRCMMHSRAMGRGKRQATARCPSS------HAHANDIYQVTPAYRLGSNEKEIMKELM 345
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LY++GIY H G P G H+++I GWG+E L +G +
Sbjct: 346 ENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRT- 404
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+KYW ANS+ WGE G FRIVRG NEC IE+ + ++G+E
Sbjct: 405 -LKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRVGME 449
>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
Length = 476
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/293 (32%), Positives = 133/293 (45%), Gaps = 43/293 (14%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E + R+G P S L + + L++ + LPE F A WP DQ
Sbjct: 182 MTLEEGFKYRLGTLPPSPLLLSMNEVTASLAETTD-LPEFFIASYKWP--GWTHGPLDQK 238
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C + WA +DR+ I S+G+ LS +L+SCC +GC G +AW Y
Sbjct: 239 NCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKKRHGCNSGSVDRAWWYLRK 298
Query: 181 TGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPEC---IRKCQPG 230
G+VS Y A+ GC R + + C PN+ E I +C P
Sbjct: 299 RGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPC----PNSIEKSNRIYQCSPP 354
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG- 289
Y VS +NE IMREI ++GPV+ M ++ D YKTGIY+H+
Sbjct: 355 YRVS---------------SNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTN 399
Query: 290 -------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA+++ GWG G K+W+ ANS+ +WGENG FRI
Sbjct: 400 EDSEKYRKFRTHAVKLTGWGTLRGAHGQKE--KFWIAANSWGKSWGENGYFRI 450
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 66/118 (55%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IMREI ++GPV+ M ++ D YKTGIY+H+ HA++
Sbjct: 355 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAHGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|256052325|ref|XP_002569723.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228438|emb|CCD74609.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 198
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/129 (49%), Positives = 81/129 (62%), Gaps = 1/129 (0%)
Query: 104 RINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCG 163
+ WP C +I IRDQ CGS WA GAVEAMSDR CI S GK++V LS+ DL+SCC+ CG
Sbjct: 70 KKKWPGCKSIATIRDQSRCGSSWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCEHCG 129
Query: 164 NGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPE 222
+G +GGF AW YWV GIV+G + + C+PY P CE + G + +C + TP
Sbjct: 130 DGFEGGFPALAWDYWVKEGIVTGSSKENHTVCQPYPFPKCEHHTKGKYPACGEEIYRTPN 189
Query: 223 CIRKCQPGY 231
C CQ Y
Sbjct: 190 CENTCQKSY 198
>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
Length = 476
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 133/296 (44%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL + + R+G P S P+L+ +++ +LPE F A WP
Sbjct: 182 MTLEDGFKFRLGTLPPS-------PMLLSMNEMTASLPATTDLPEFFVASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +A
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 292
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W Y G+VS Y A+ GC R + C +N + I +C
Sbjct: 293 WWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNR-IYQC 351
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS +NE IM+EI ++GPV+ M + D YKTGIY+HV
Sbjct: 352 SPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVT 396
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 397 STNKESEKYRKLQTHAVKLTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRI 450
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 67/118 (56%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M + D YKTGIY+HV L HA++
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
Length = 476
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 133/296 (44%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL + + R+G P S P+L+ +++ +LPE F A WP
Sbjct: 182 MTLEDGFKFRLGTLPPS-------PMLLSMNEMTASLPATTDLPEFFIASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +A
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 292
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W Y G+VS Y A+ GC R + C +N + I +C
Sbjct: 293 WWYLRKRGLVSHACYPLFKDHNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNR-IYQC 351
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS +NE IM+EI ++GPV+ M + D YKTGIY+HV
Sbjct: 352 SPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVT 396
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 397 STNKESEKYRKLQTHAVKLTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRI 450
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 67/118 (56%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M + D YKTGIY+HV L HA++
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|161343845|tpg|DAA06103.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 261
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 99/177 (55%), Gaps = 3/177 (1%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+P FDAR W C TI +RDQG+CGS WA+ A +DR+C+A+ + LS++++
Sbjct: 87 RIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 146
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
CC CG GC GG+ KAW+ + G+V+GG Y S +GC PY +P Y H++C
Sbjct: 147 TFCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAG 206
Query: 216 N-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
+ C R C D+ +++D + R +Y L +I +++ +GP+E S +
Sbjct: 207 KPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG--SIQKDVMTYGPIEASFDV 261
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 20/79 (25%), Positives = 36/79 (45%), Gaps = 3/79 (3%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY +P C G + + C R C D+ +++D + R +Y L
Sbjct: 185 GCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG- 243
Query: 395 ETIMREIFRHGPVEGSMTI 413
+I +++ +GP+E S +
Sbjct: 244 -SIQKDVMTYGPIEASFDV 261
>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
gorilla]
Length = 476
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 133/296 (44%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL + + R+G P S P+L+ +++ +LPE F A WP
Sbjct: 182 MTLEDGFKFRLGTLPPS-------PMLLSMNEMTASLPATTDLPEFFVASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +A
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 292
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W Y G+VS Y A+ GC R + C +N + I +C
Sbjct: 293 WWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNR-IYQC 351
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS +NE IM+EI ++GPV+ M + D YKTGIY+HV
Sbjct: 352 SPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVT 396
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 397 STNKESEKYRKLQTHAVKLTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRI 450
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 67/118 (56%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M + D YKTGIY+HV L HA++
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
Length = 476
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 133/296 (44%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL + + R+G P S P+L+ +++ +LPE F A WP
Sbjct: 182 MTLEDGFKFRLGTLPPS-------PMLLSMNEMTASLPATTDLPEFFVASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +A
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 292
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W Y G+VS Y A+ GC R + C +N + I +C
Sbjct: 293 WWYLRKRGLVSHACYPLFKDHNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNR-IYQC 351
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS +NE IM+EI ++GPV+ M + D YKTGIY+HV
Sbjct: 352 SPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVT 396
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 397 STNKESEKYRKLQTHAVKLTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRI 450
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 67/118 (56%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M + D YKTGIY+HV L HA++
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
Length = 476
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/293 (32%), Positives = 133/293 (45%), Gaps = 43/293 (14%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E + R+G P S L + + L+ + LPE F A WP DQ
Sbjct: 182 MTLEEGFKYRLGTLPPSPLLLSMNEVTASLTKTTD-LPEFFIASYKWP--GWTHGPLDQK 238
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C + WA +DR+ I S+G+ LS +L+SCC +GC G +AW Y
Sbjct: 239 NCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKKRHGCNSGSVDRAWWYLRK 298
Query: 181 TGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPEC---IRKCQPG 230
G+VS Y A+ GC R + + C PN+ E I +C P
Sbjct: 299 RGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPC----PNSIEKSNRIYQCSPP 354
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG- 289
Y VS +NE IMREI ++GPV+ M ++ D YKTGIY+H+
Sbjct: 355 YRVS---------------SNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTN 399
Query: 290 -------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 400 EDSEKYRKFRTHAVKLTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRI 450
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 67/118 (56%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IMREI ++GPV+ M ++ D YKTGIY+H+ HA++
Sbjct: 355 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
Length = 330
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 85/256 (33%), Positives = 129/256 (50%), Gaps = 21/256 (8%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++ E FDAR WP C TI E+ D G+ GWA ++DR+CIA+ G + LS+++L
Sbjct: 85 QIHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEEL 144
Query: 156 VSC--CKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC 213
+ C K +G G W+Y + G+VSGG Y + GC+P +IP G+ +
Sbjct: 145 IFCGGIKTKQSGAVRG--DDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPI----GNIPTH 198
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
N C +C + Y D Y++ +NE+ I +E+ +GPV +Y
Sbjct: 199 LYNHT----CEERCYGNNTIHYYHDHVKVSHYYNIKSNED-IQKEVQTYGPVSVKFRVYD 253
Query: 274 DMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
D LYK+G+Y K + H ++IGWG E + V YWL+ N + WG+NGL
Sbjct: 254 DFFLYKSGVYVKTEKSLYVRRHFAKLIGWGVE-------NGVDYWLLVNFWGNEWGQNGL 306
Query: 333 FRIGCRPYEIPCERYM 348
F+I E+ E Y+
Sbjct: 307 FKIKRGTNEVHVEDYV 322
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 70/134 (52%), Gaps = 9/134 (6%)
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C +C + Y D Y++ +NE+ I +E+ +GPV +Y D LYK+G+
Sbjct: 204 CEERCYGNNTIHYYHDHVKVSHYYNIKSNED-IQKEVQTYGPVSVKFRVYDDFFLYKSGV 262
Query: 425 Y-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
Y K + H ++IGWG E + V YWL+ N + WG+NGLF+I RG NE
Sbjct: 263 YVKTEKSLYVRRHFAKLIGWGVE-------NGVDYWLLVNFWGNEWGQNGLFKIKRGTNE 315
Query: 484 CGIEADITAGLPKI 497
+E + AG P+I
Sbjct: 316 VHVEDYVYAGEPEI 329
>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
Length = 476
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/293 (31%), Positives = 133/293 (45%), Gaps = 43/293 (14%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPL---EELPEGFDARINWPYCPTIQEIR 117
+TL E + R+G P P RL + +++ L +LPE F A WP
Sbjct: 182 MTLEEGFKYRLGTLP----PSPRLLSMNEMTASLPATTDLPEFFIASYKWP--GWTHGPL 235
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +AW +
Sbjct: 236 DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWF 295
Query: 178 WVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
G+VS Y A+ GC R + C +N + I +C P
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNDGCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQCSPP 354
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG- 289
Y VS +NE IM+EI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 355 YRVS---------------SNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTN 399
Query: 290 -------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 400 EEASKYRKFQTHAVKLTGWGTLKGAQGQKE--KFWIAANSWGKSWGENGYFRI 450
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 65/114 (57%), Gaps = 10/114 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YKTGIY+HV HA++
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEEASKYRKFQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A
Sbjct: 415 LTGWGTLKGAQGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
Length = 474
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 91/297 (30%), Positives = 134/297 (45%), Gaps = 50/297 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL E + R+G P S P+L+ +++ +LPE F A WP
Sbjct: 179 MTLEEGFKYRLGTLPPS-------PMLLSMNEVTASLPATTDLPEFFIASYKWP--GWTH 229
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +A
Sbjct: 230 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCPKNRHGCNSGSIDRA 289
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W + G+VS Y A+ GC R + C +N + I +C
Sbjct: 290 WWFLRKRGLVSHACYPLFKNQNATNHGCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQC 348
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS +NE IM+EI ++GPV+ M ++ D YKTGIY+H+
Sbjct: 349 SPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHIT 393
Query: 288 GGP---------LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 394 KKANEESGKYRKLQTHAVKLTGWGTLKGAQGRKE--KFWIAANSWGKSWGENGYFRI 448
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 68/119 (57%), Gaps = 11/119 (9%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP---------LGEHAI 438
Y + +NE IM+EI ++GPV+ M ++ D YKTGIY+H+ L HA+
Sbjct: 352 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHITKKANEESGKYRKLQTHAV 411
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
++ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 412 KLTGWGTLKGAQGRKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 468
>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
Length = 351
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 78/240 (32%), Positives = 118/240 (49%), Gaps = 40/240 (16%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ +P FD R WP C +++IRDQ +CG+ WA ++DR+CI + G + LS D
Sbjct: 118 DSIPLEFDFRTKWPQC--LRKIRDQANCGACWAFTGSGMLADRICILTNGTINEELSPQD 175
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQ 214
+V C D GC+GG+ A Y + G+ +K+ C PY+ + CQ
Sbjct: 176 MVDCSHD-NFGCEGGYLMNALDYLMNEGV-------TKESCTPYK--------DKTNKCQ 219
Query: 215 DNEPNTPECIRK--CQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
N E K C+PG + NEE I R++ ++GP+ +T+Y
Sbjct: 220 YTCQNKTEEFHKHYCKPG--------------TLRVLTNEEQIKRDLMQNGPLMVGLTVY 265
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
D I Y TG YK VAG +G HA++++GW G+ + WL+ N +N +WGE G
Sbjct: 266 EDFINYATGDYKFVAGEIVGGHAVKLMGWRTTQKGQTS------WLIQNQWNDDWGEQGF 319
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 79/150 (52%), Gaps = 10/150 (6%)
Query: 351 SRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 410
++ SC + T +C CQ + ++ G + + NEE I R++ ++GP+
Sbjct: 204 TKESCTPYKDKTNKCQYTCQNKTEEFHKHYCKPGTL--RVLTNEEQIKRDLMQNGPLMVG 261
Query: 411 MTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 470
+T+Y D I Y TG YK VAG +G HA++++GW G+ + WL+ N +N +WG
Sbjct: 262 LTVYEDFINYATGDYKFVAGEIVGGHAVKLMGWRTTQKGQTS------WLIQNQWNDDWG 315
Query: 471 ENGLFRIVRGQNECGIEADITAGLPKIGLE 500
E G I+ +NE GI++ P I LE
Sbjct: 316 EQGFGYIL--ENEVGIDSIGVGCTPDIDLE 343
>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
Length = 475
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/291 (30%), Positives = 133/291 (45%), Gaps = 39/291 (13%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL E + R+G P S P+L+ +++ P +LPE F A WP
Sbjct: 181 MTLEEGFKFRLGTLPPS-------PMLLSMNEMTASFPPRADLPEIFIASYKWP--GWTH 231
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +A
Sbjct: 232 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 291
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W + G+VS Y P + N +++ C + R S
Sbjct: 292 WWFLRKRGLVSHACY-----------PLFKDQNTTNNICAMASRSDGRGKRHATKPCPNS 340
Query: 235 YEDDLNFGRIA--YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG---- 288
+E + + Y + +NE IMREI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 341 FEKSNRIYQCSPPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEE 400
Query: 289 ----GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG G K+W+ ANS+ +WGENG FRI
Sbjct: 401 PEKYKKLRTHAVKLTGWGTLRGARGKKE--KFWIAANSWGKSWGENGYFRI 449
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 68/123 (55%), Gaps = 10/123 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG--------GPLGEHAIR 439
Y + +NE IMREI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 354 YRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVK 413
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGL 499
+ GWG G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 414 LTGWGTLRGARGKKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQLTS 471
Query: 500 EID 502
D
Sbjct: 472 SDD 474
>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
Length = 134
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 64/133 (48%), Positives = 94/133 (70%), Gaps = 10/133 (7%)
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLP--ANEETIMREIFRHGPVEGSMTIYADMILYKT 422
C + C+PGY SY++D ++G +YS+ A + R ++GPVE + T+Y+D + YK+
Sbjct: 1 CSKICEPGYSPSYKEDKHYGCSSYSVSRGARRRSWQRSS-KNGPVEAAFTVYSDFLQYKS 59
Query: 423 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN 482
G+Y+HVAG +G HA+RI+GWG E GT YWLV NS+NT+WG+NG F+I+RGQ+
Sbjct: 60 GVYQHVAGDMMGGHAVRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFKILRGQD 112
Query: 483 ECGIEADITAGLP 495
CGIE++I AG+P
Sbjct: 113 HCGIESEIVAGIP 125
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 53/115 (46%), Positives = 78/115 (67%), Gaps = 10/115 (8%)
Query: 223 CIRKCQPGYDVSYEDDLNFGRIAYSLP--ANEETIMREIFRHGPVEGSMTIYADMILYKT 280
C + C+PGY SY++D ++G +YS+ A + R ++GPVE + T+Y+D + YK+
Sbjct: 1 CSKICEPGYSPSYKEDKHYGCSSYSVSRGARRRSWQRSS-KNGPVEAAFTVYSDFLQYKS 59
Query: 281 GIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G+Y+HVAG +G HA+RI+GWG E GT YWLV NS+NT+WG+NG F+I
Sbjct: 60 GVYQHVAGDMMGGHAVRILGWGVE---NGT----PYWLVGNSWNTDWGDNGFFKI 107
>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
Length = 475
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/291 (30%), Positives = 133/291 (45%), Gaps = 39/291 (13%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL E + R+G P S P+L+ +++ P +LPE F A WP
Sbjct: 181 MTLEEGFKFRLGTLPPS-------PMLLSMNEMTASFPPRADLPEIFIASYKWP--GWTH 231
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +A
Sbjct: 232 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 291
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W + G+VS Y P + N +++ C + R S
Sbjct: 292 WWFLRKRGLVSHACY-----------PLFKDQNTTNNICAMASRSDGRGKRHATKPCPNS 340
Query: 235 YEDDLNFGRIA--YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG---- 288
+E + + Y + +NE IMREI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 341 FEKSNRIYQCSPPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEE 400
Query: 289 ----GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG G K+W+ ANS+ +WGENG FRI
Sbjct: 401 PEKYKKLRTHAVKLTGWGTLRGARGKKE--KFWIAANSWGKSWGENGYFRI 449
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 68/123 (55%), Gaps = 10/123 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG--------GPLGEHAIR 439
Y + +NE IMREI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 354 YRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVK 413
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGL 499
+ GWG G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 414 LTGWGTLRGARGKKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQLTS 471
Query: 500 EID 502
D
Sbjct: 472 SDD 474
>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
[Nomascus leucogenys]
Length = 362
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 52 WQAGNHSAFWGMTLDEGIRYRLGTMRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 110
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 111 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 168
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 169 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHC-----P 223
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ +M+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 224 NSHVNNNDIYQVTPV-YRLGSNDKEVMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 282
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 283 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 334
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/122 (41%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ +M+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 238 VYRLGSNDKEVMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 297
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 298 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 355
Query: 499 LE 500
+E
Sbjct: 356 ME 357
>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
boliviensis boliviensis]
Length = 476
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 134/296 (45%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL + + R+G P S P+L+ +++ +LPE F A WP
Sbjct: 182 MTLEDGFKFRLGTLPPS-------PMLLSMNEMTASLPATTDLPEFFVASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +A
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 292
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W Y G+VS Y A+ GC R + C +N + I +C
Sbjct: 293 WWYLRKRGLVSHACYPLFKDQNATNSGCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQC 351
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS ++E IM+EI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 352 SPPYRVS---------------SSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVT 396
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 397 STNKESEKFLKLQTHAVKLTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRI 450
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 68/119 (57%), Gaps = 10/119 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + ++E IM+EI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFLKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQLA 471
>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
africana]
Length = 476
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 93/296 (31%), Positives = 134/296 (45%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL E L+ R+G P S P+L+ +++ +LPE F A WP
Sbjct: 182 MTLEEGLKFRLGTLPPS-------PMLLSMNEVTPSLPATTDLPEFFVASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S G+ LS +L+SCC +GC G +A
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCTKNRHGCNSGSVDRA 292
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W Y G+VS Y A+ GC R + C +N + I +C
Sbjct: 293 WWYLRKRGLVSHACYPLFKDQNANNNGCAMASRSDGRGKRHATKPCPNNIEKS-NVIYQC 351
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS +NE IM+EI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 352 SPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVI 396
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGE+G FRI
Sbjct: 397 RTSEESEKYQKLRTHAVKLTGWGMMKGAKGRKE--KFWVAANSWGKSWGEDGYFRI 450
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 66/114 (57%), Gaps = 10/114 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVIRTSEESEKYQKLRTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
+ GWG +G K+W+ ANS+ +WGE+G FRI+RG NE IE I A
Sbjct: 415 LTGWGMMKGAKGRKE--KFWVAANSWGKSWGEDGYFRILRGVNESDIEKLIIAA 466
>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
jacchus]
Length = 476
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 134/296 (45%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL + + R+G P S P+L+ +++ +LPE F A WP
Sbjct: 182 MTLEDGFKFRLGTLPPS-------PMLLSMNEMTASLPATTDLPEFFVASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +A
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 292
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W Y G+VS Y A+ GC R + C +N + I +C
Sbjct: 293 WWYLRKRGLVSHACYPLFKDQNATNSGCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQC 351
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS ++E IM+EI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 352 SPPYRVS---------------SSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVT 396
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 397 STNKESEKFQKLQTHAVKLTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRI 450
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 68/118 (57%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + ++E IM+EI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFQKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|157058761|gb|ABV03138.1| cathepsin B-84 [Myzus persicae]
Length = 220
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 72/179 (40%), Positives = 108/179 (60%), Gaps = 7/179 (3%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
E+PE FD+R+ W C TI E+R+QG+CGS WA G A +DR+CIA+ G+ + +S+++L
Sbjct: 46 EVPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATDGEFNELISAEEL 105
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC- 213
CC CG GC GG KAWKY+ G+V+GG Y + GC+P + PC R G H+SC
Sbjct: 106 TFCCHTCGFGCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPSRVPPCVRDDEG-HNSCS 164
Query: 214 -QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
Q E N +C +KC ++Y+ + + AY L + T+ ++ +GP+E S +
Sbjct: 165 GQPTERNH-KCSKKCYGDETINYKKNHYKTKDAYYL--SNTTMQKDTMVYGPIEASFDV 220
>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
Length = 469
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 95/287 (33%), Positives = 134/287 (46%), Gaps = 37/287 (12%)
Query: 62 LTLSE-LEMRMGVHPDS-------KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTI 113
+T+ E + R+G P S + P N LP E+ P F A WP I
Sbjct: 187 MTVEEAFKKRLGTFPPSHSLLNMRESPGNSLPE--------EKFPVFFAATYAWP--EWI 236
Query: 114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGK 173
+ DQ +CG+ WA +DR+ I S G+ LS +L+SC +GC GG
Sbjct: 237 HDPLDQRNCGASWAFSTASVAADRIAIHSEGQITDNLSVQNLISCDTRNQHGCNGGNIDS 296
Query: 174 AWKYWVTTGIVSGGTYAS--KQGCRPY-EIPCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
AW+Y T G+VS Y S K+ P E C Y++ + N P P + K
Sbjct: 297 AWRYLKTHGVVSYACYPSFWKKHLEPSGENHC--YVSSEYGKNYTNGP-CPNALEKSNRL 353
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--AG 288
Y + Y + + E IM+EI GPV+ M +Y D LYK GIY+H AG
Sbjct: 354 YRCASH---------YRVSSKETNIMKEIMDKGPVQAIMKVYEDFFLYKEGIYRHSQKAG 404
Query: 289 GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H+++++GWG L + K+W+ ANS+ +WGENG FRI
Sbjct: 405 SKWKTHSVKLLGWG--ALADKNGQKQKFWIAANSWGKSWGENGYFRI 449
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 51/112 (45%), Positives = 69/112 (61%), Gaps = 6/112 (5%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--AGGPLGEHAIRIIGWGQ 445
Y + + E IM+EI GPV+ M +Y D LYK GIY+H AG H+++++GWG
Sbjct: 360 YRVSSKETNIMKEIMDKGPVQAIMKVYEDFFLYKEGIYRHSQKAGSKWKTHSVKLLGWG- 418
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADI--TAGLP 495
L + K+W+ ANS+ +WGENG FRI+RGQNEC IE I T+G P
Sbjct: 419 -ALADKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILATSGQP 469
>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Equus caballus]
Length = 436
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 137/289 (47%), Gaps = 22/289 (7%)
Query: 57 NALSKLTLSE-LEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQ 114
+A +TL E + R+G + P S + + + + P E LP F+A WP I
Sbjct: 132 SAFWGMTLDEGIRYRLGTIRPSSSV--TSMNEIHTVLGPGEVLPTAFEASEKWPN--LIH 187
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
E DQG+C WA SDRV I S G LS +L+SC GC+GG A
Sbjct: 188 EPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQQGCRGGHLDGA 247
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W + G+VS Y R P R M S + + T C P V
Sbjct: 248 WWFLRRRGVVSDHCYPFSGRERDEAGPAPRCMMHSRAMGRGKRQATAHC-----PNSRV- 301
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP- 290
+ +D+ AY L ++E+ IM+E+ +GPV+ M ++ D LY+ G+Y H G P
Sbjct: 302 HTNDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPE 361
Query: 291 ----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 362 RYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 408
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/166 (36%), Positives = 90/166 (54%), Gaps = 16/166 (9%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
P R M SR+ + T C P V + +D+ AY L ++E+ IM+E+
Sbjct: 274 PAPRCMMHSRAMGRGKRQATAHC-----PNSRV-HTNDIYQVTPAYRLGSSEKEIMKELM 327
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LY+ G+Y H G P G H+++I GWG+E L +G +
Sbjct: 328 ENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRT- 386
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+KYW ANS+ WGE G FRIVRG NEC IE+ + ++G+E
Sbjct: 387 -LKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRVGME 431
>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
abelii]
Length = 362
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 98/294 (33%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 52 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 110
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 111 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 168
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 169 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPTPPCMMHSRAMGRGKRQATASC-----P 223
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 224 NSHVNNNDIYQVTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 282
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 283 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 334
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 238 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 297
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 298 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 355
Query: 499 LE 500
+E
Sbjct: 356 ME 357
>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
[Equus caballus]
Length = 467
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 97/295 (32%), Positives = 138/295 (46%), Gaps = 22/295 (7%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWP 108
+ +A +TL E + R+G + P S + + + + P E LP F+A WP
Sbjct: 157 WRAGNHSAFWGMTLDEGIRYRLGTIRPSSSV--TSMNEIHTVLGPGEVLPTAFEASEKWP 214
Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQG 168
I E DQG+C WA SDRV I S G LS +L+SC GC+G
Sbjct: 215 N--LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQQGCRG 272
Query: 169 GFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQ 228
G AW + G+VS Y R P R M S + + T C
Sbjct: 273 GHLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPRCMMHSRAMGRGKRQATAHC----- 327
Query: 229 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA- 287
P V + +D+ AY L ++E+ IM+E+ +GPV+ M ++ D LY+ G+Y H
Sbjct: 328 PNSRV-HTNDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQGGVYSHTPV 386
Query: 288 --GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 387 SHGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 439
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/166 (36%), Positives = 90/166 (54%), Gaps = 16/166 (9%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
P R M SR+ + T C P V + +D+ AY L ++E+ IM+E+
Sbjct: 305 PAPRCMMHSRAMGRGKRQATAHC-----PNSRV-HTNDIYQVTPAYRLGSSEKEIMKELM 358
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LY+ G+Y H G P G H+++I GWG+E L +G +
Sbjct: 359 ENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRT- 417
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+KYW ANS+ WGE G FRIVRG NEC IE+ + ++G+E
Sbjct: 418 -LKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRVGME 462
>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
Length = 473
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 137/301 (45%), Gaps = 34/301 (11%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+LP FDAR WP I DQG CG+ WA+ SDR I S+G V LS L
Sbjct: 189 KLPNSFDARNKWP--GWISGPADQGWCGASWAVSTASVASDRYAIMSKGLTKVDLSPQHL 246
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQ- 214
+SC K GCQGG +AW + G+V C P+ G+ + C+
Sbjct: 247 LSCNKG-QRGCQGGHLSRAWTFIRKFGLVD-------DYCYPW--------TGTPTKCKI 290
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
PN C P + +L AY + +E+ IM EI + GPV+ +M +Y D
Sbjct: 291 PKRPNFDALSSICPPSLGSNLRSELYRVGPAYKI-QDEKDIMEEIMQSGPVQATMKVYQD 349
Query: 275 MILYKTGIYKHV----AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
YK+G+Y G H+++I+GWG+E G +KYWL ANS+ WGEN
Sbjct: 350 FFSYKSGVYTKSNTERESSNFGYHSVKILGWGEETNIYGQP--IKYWLAANSWGQQWGEN 407
Query: 331 GLFRIGCRPYEIPCERYMNGSRSSCQANEPNT------PECIRKCQPGYDVSYEDDLNFG 384
G F+I E E ++ + + + N+P+ + + QP Y D L F
Sbjct: 408 GFFKIRRGTNECEIEEFVLAAWA--ETNDPSREIITKFQQAFMQGQPLYVNKINDTLYFD 465
Query: 385 R 385
R
Sbjct: 466 R 466
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 77/150 (51%), Gaps = 8/150 (5%)
Query: 349 NGSRSSCQ-ANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPV 407
G+ + C+ PN C P + +L AY + +E+ IM EI + GPV
Sbjct: 282 TGTPTKCKIPKRPNFDALSSICPPSLGSNLRSELYRVGPAYKI-QDEKDIMEEIMQSGPV 340
Query: 408 EGSMTIYADMILYKTGIYKHV----AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 463
+ +M +Y D YK+G+Y G H+++I+GWG+E G +KYWL AN
Sbjct: 341 QATMKVYQDFFSYKSGVYTKSNTERESSNFGYHSVKILGWGEETNIYGQP--IKYWLAAN 398
Query: 464 SFNTNWGENGLFRIVRGQNECGIEADITAG 493
S+ WGENG F+I RG NEC IE + A
Sbjct: 399 SWGQQWGENGFFKIRRGTNECEIEEFVLAA 428
>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
Length = 331
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 80/253 (31%), Positives = 126/253 (49%), Gaps = 25/253 (9%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSC-- 158
FDAR WP C TI E+ ++G+ WA +DR+CIA+ G + LS+++L+SC
Sbjct: 91 FDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISCSG 150
Query: 159 CKDCGNGCQGGFHGKAWKYWVTTGIVSGGT-YASKQGCRPYEIPCERYMNGSHSSCQ-DN 216
K NG G AW+Y+ T G+VSGG+ Y + GC+P +IP C
Sbjct: 151 IKASANGWVRD--GLAWEYFKTHGLVSGGSIYNTNDGCQPSKIP---------PVCNLPT 199
Query: 217 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 276
+ N C+ C + Y D ++ Y + I +E+ +GPV ++ +Y D+
Sbjct: 200 KINKRTCVDYCYGNDTIKYNHD--HVKVRYYYHVKPKDIQKEVQTYGPVTAALNLYDDIF 257
Query: 277 LYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L+K+G+Y + +++IGWG E + V YWL+ NS+ WG+NGL +I
Sbjct: 258 LHKSGVYTLTKNAKYVRLQYVKLIGWGVE-------NGVDYWLLVNSWGNEWGQNGLLKI 310
Query: 336 GCRPYEIPCERYM 348
Y E ++
Sbjct: 311 KRGKYGCAVESFV 323
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 80/164 (48%), Gaps = 20/164 (12%)
Query: 336 GCRPYEIPCERYMNGSRSSCQA-NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+P +IP C + N C+ C + Y D ++ Y
Sbjct: 185 GCQPSKIP---------PVCNLPTKINKRTCVDYCYGNDTIKYNHD--HVKVRYYYHVKP 233
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 453
+ I +E+ +GPV ++ +Y D+ L+K+G+Y + +++IGWG E
Sbjct: 234 KDIQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVE------- 286
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ V YWL+ NS+ WG+NGL +I RG+ C +E+ + A +PKI
Sbjct: 287 NGVDYWLLVNSWGNEWGQNGLLKIKRGKYGCAVESFVYAAVPKI 330
>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 298
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 133/272 (48%), Gaps = 33/272 (12%)
Query: 87 LVQLSDPL-EELPEGFDARINWPYC-PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
LV+ + P+ E LP FDAR + YC I +RDQG CG+ WA+ E ++DR+CI S G
Sbjct: 22 LVESTKPVVENLPPEFDARQKFNYCRDVIGHVRDQGRCGNCWAVCPTEVLNDRLCIKSSG 81
Query: 145 KRHVRLSSDDLVSCCKDC-----GNGCQGGFHGKAWKYWVTTGIVSGGTYASK------Q 193
K LS+ + SCC GC GG +A + G+V+G + +
Sbjct: 82 KIQEILSAGYVTSCCNPAHGCLHAKGCNGGRLVEAMSFLRDHGVVTGNDFKPQDQLREAD 141
Query: 194 GCRPY------EIPCERYMNGSHSSCQD--NEPNTPECIRKC-QPGYDVSYEDDLNFGRI 244
GC PY +P E + C+D +P P C C Y S E D++ +
Sbjct: 142 GCWPYPFQKCNHVPTE---GTGYPKCKDVVQQP-VPPCRTTCTNKAYKKSLEKDVHRAKS 197
Query: 245 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304
+ + ++I +EIF +GPV + +Y D YK+G+Y H I+IIGWG +
Sbjct: 198 WRKVLNDAQSIKQEIFDNGPVFSAFEMYKDFRYYKSGVYVPTTKEVDCLHVIKIIGWGAD 257
Query: 305 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
SV +YWL N++N WG++GL ++
Sbjct: 258 -------SVREYWLAMNAWNEEWGDHGLIKMA 282
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 63/122 (51%), Gaps = 8/122 (6%)
Query: 362 TPECIRKC-QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILY 420
P C C Y S E D++ + + + ++I +EIF +GPV + +Y D Y
Sbjct: 172 VPPCRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEIFDNGPVFSAFEMYKDFRYY 231
Query: 421 KTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG 480
K+G+Y H I+IIGWG + SV +YWL N++N WG++GL ++ G
Sbjct: 232 KSGVYVPTTKEVDCLHVIKIIGWGAD-------SVREYWLAMNAWNEEWGDHGLIKMAFG 284
Query: 481 QN 482
+N
Sbjct: 285 KN 286
>gi|115605092|gb|ABJ15785.1| cathepsin B [Bos taurus]
Length = 118
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 60/119 (50%), Positives = 84/119 (70%), Gaps = 2/119 (1%)
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CERYMNGSHSSCQDNEPNTPECI 224
C GGF AW +W G+VSGG Y S GCRPY IP CE ++NGS C E +TP+C
Sbjct: 1 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT-GEGDTPKCS 59
Query: 225 RKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
+ C+PGY SY++D +FG +YS+ NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y
Sbjct: 60 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY 118
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 70/93 (75%), Gaps = 2/93 (2%)
Query: 334 RIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
+GCRPY IP CE ++NGSR C E +TP+C + C+PGY SY++D +FG +YS+
Sbjct: 27 HVGCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVAN 85
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIY 425
NE+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y
Sbjct: 86 NEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY 118
>gi|10803439|emb|CAC13132.1| putative cathepsin B.6 [Ostertagia ostertagi]
Length = 197
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 74/184 (40%), Positives = 105/184 (57%), Gaps = 3/184 (1%)
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG-FHGKAWKYWVTTG 182
S WA+ A AMSDR+CI + GK V LS D+++CC + G + +AW++ G
Sbjct: 1 SCWAVSAASAMSDRLCIQTNGKNKVILSDTDILACCGEXCGXGCEGGYPSQAWEFAXRNG 60
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
+ SGG Y K C+PY + PC ++ N ++ C D+ TP C + CQ GYD Y +D
Sbjct: 61 VCSGGWYGEKGVCKPYPLHPCGKHXNQTYYGECPDHXYXTPACKKYCQYGYDKRYXNDKV 120
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
AY + ++E I EI GPV+ + T+Y D +LY GIY H AG +G H ++IIG
Sbjct: 121 XVTSAYQVXSDEAAIRAEIMSRGPVQAAFTVYGDFMLYTXGIYVHTAGKLMGGHGVKIIG 180
Query: 301 WGQE 304
WG E
Sbjct: 181 WGVE 184
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/122 (39%), Positives = 68/122 (55%), Gaps = 6/122 (4%)
Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSR-SSCQANEPNTPECIRKCQPGYDVSYEDDLNFG 384
+GE G+ C+PY + PC ++ N + C + TP C + CQ GYD Y +D
Sbjct: 67 YGEKGV----CKPYPLHPCGKHXNQTYYGECPDHXYXTPACKKYCQYGYDKRYXNDKVXV 122
Query: 385 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 444
AY + ++E I EI GPV+ + T+Y D +LY GIY H AG +G H ++IIGWG
Sbjct: 123 TSAYQVXSDEAAIRAEIMSRGPVQAAFTVYGDFMLYTXGIYVHTAGKLMGGHGVKIIGWG 182
Query: 445 QE 446
E
Sbjct: 183 VE 184
>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
Length = 174
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 67/158 (42%), Positives = 98/158 (62%), Gaps = 11/158 (6%)
Query: 344 CERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + S EP TP+C+RKC G V ++ ++ Y + ++ + IM E+
Sbjct: 23 CDPYFDQIGCSHPGCEPGYQTPKCVRKCVKGNQV-WKKSKHYSVKPYKVNSDPQNIMEEV 81
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + ++Y D YK+G+YKH+ G LG HA+++ GWG GE YWL+
Sbjct: 82 YKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKLNGWGTSDEGE------DYWLL 135
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAG--LPKI 497
AN +NTNWG++G F+I RG NECGIE D+TA LPKI
Sbjct: 136 ANQWNTNWGDDGYFKIKRGTNECGIEEDVTAVCLLPKI 173
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/172 (37%), Positives = 99/172 (57%), Gaps = 23/172 (13%)
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPEC 223
C GG+ AWKY+ G+V+ + C PY +I C SH C+ TP+C
Sbjct: 1 CDGGYPISAWKYFAHHGVVT-------EECDPYFDQIGC------SHPGCEPGY-QTPKC 46
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
+RKC G V ++ ++ Y + ++ + IM E++++GPVE + ++Y D YK+G+Y
Sbjct: 47 VRKCVKGNQV-WKKSKHYSVKPYKVNSDPQNIMEEVYKNGPVEVAFSVYEDFAHYKSGVY 105
Query: 284 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
KH+ G LG HA+++ GWG GE YWL+AN +NTNWG++G F+I
Sbjct: 106 KHITGSALGGHAVKLNGWGTSDEGE------DYWLLANQWNTNWGDDGYFKI 151
>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Otolemur garnettii]
Length = 436
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 99/291 (34%), Positives = 132/291 (45%), Gaps = 26/291 (8%)
Query: 57 NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
+A +TL E + R+G S N + LS P E LP F+A WP I E
Sbjct: 132 SAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLS-PGEVLPTAFEASEKWPN--LIHE 188
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
DQG+C WA SDRV I S G LS +L+SC GC GG AW
Sbjct: 189 PLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHHQQGCHGGRLDGAW 248
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ G+VS Y R P M S + T C P V
Sbjct: 249 WFLRRRGVVSDHCYPFSGQERDKAGPAPLCMMHSRPMGRGKRQATARC-----PNNQVQ- 302
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---- 291
+D+ AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H P+
Sbjct: 303 ANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHT---PVSLQR 359
Query: 292 -------GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 360 PEGYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 408
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 52/125 (41%), Positives = 76/125 (60%), Gaps = 16/125 (12%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-----------GE 435
AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H P+ G
Sbjct: 312 AYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHT---PVSLQRPEGYRRHGT 368
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
H+++I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ +
Sbjct: 369 HSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWG 426
Query: 496 KIGLE 500
++G+E
Sbjct: 427 RVGME 431
>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
Length = 476
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 132/296 (44%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL + + R+G P S P+L+ +++ +LPE F A WP
Sbjct: 182 MTLEDGFKFRLGTLPPS-------PMLLSMNEMTASLPATTDLPEFFVASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +A
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 292
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W Y G+VS Y A+ GC R + C +N + I +C
Sbjct: 293 WWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRDATKPCPNNVEKSNR-IYQC 351
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS +NE IM+EI ++GPV+ M + D YKTGIY+HV
Sbjct: 352 SPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVT 396
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ AN + +WGENG FRI
Sbjct: 397 STNKESEKYRKLQTHAVKLTGWGTLRGAQGQKE--KFWIAANFWGKSWGENGYFRI 450
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 66/118 (55%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M + D YKTGIY+HV L HA++
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ AN + +WGENG FRI+RG NE IE + A ++
Sbjct: 415 LTGWGTLRGAQGQKE--KFWIAANFWGKSWGENGYFRILRGVNESDIEKLVIAAWGQL 470
>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
familiaris]
Length = 476
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 133/296 (44%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL E + R+G P S P+L+ +++ +LPE F A WP
Sbjct: 182 MTLEEGFKYRLGTLPPS-------PMLLSMNEMTASLPATTDLPEFFIASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S G+ LS +L+SCC +GC G +A
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 292
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W + G+VS Y A+ GC R + C +N + I +C
Sbjct: 293 WWFLRKRGLVSHACYPLFKDQNATNYGCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQC 351
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS +NE IM+EI ++GPV+ M ++ D YKTGIY+H+
Sbjct: 352 SPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHIT 396
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 397 RTNEESRKYQKLQTHAVKLTGWGTLKGAQGQKE--KFWIAANSWGISWGENGYFRI 450
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 66/114 (57%), Gaps = 10/114 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YKTGIY+H+ L HA++
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A
Sbjct: 415 LTGWGTLKGAQGQKE--KFWIAANSWGISWGENGYFRILRGVNESDIEKLIIAA 466
>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
sapiens]
Length = 362
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 98/294 (33%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 52 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 110
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 111 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 168
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 169 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHC-----P 223
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 224 NSYVNNNDIYQVTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 282
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 283 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 334
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 238 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 297
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 298 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 355
Query: 499 LE 500
+E
Sbjct: 356 ME 357
>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Otolemur garnettii]
Length = 467
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 99/291 (34%), Positives = 132/291 (45%), Gaps = 26/291 (8%)
Query: 57 NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
+A +TL E + R+G S N + LS P E LP F+A WP I E
Sbjct: 163 SAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLS-PGEVLPTAFEASEKWPN--LIHE 219
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
DQG+C WA SDRV I S G LS +L+SC GC GG AW
Sbjct: 220 PLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHHQQGCHGGRLDGAW 279
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
+ G+VS Y R P M S + T C P V
Sbjct: 280 WFLRRRGVVSDHCYPFSGQERDKAGPAPLCMMHSRPMGRGKRQATARC-----PNNQVQ- 333
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---- 291
+D+ AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H P+
Sbjct: 334 ANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHT---PVSLQR 390
Query: 292 -------GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 391 PEGYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 439
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 52/125 (41%), Positives = 76/125 (60%), Gaps = 16/125 (12%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-----------GE 435
AY L +NE+ IM+E+ +GPV+ M ++ D LY++GIY H P+ G
Sbjct: 343 AYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHT---PVSLQRPEGYRRHGT 399
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
H+++I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ +
Sbjct: 400 HSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWG 457
Query: 496 KIGLE 500
++G+E
Sbjct: 458 RVGME 462
>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
Length = 466
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 132/281 (46%), Gaps = 38/281 (13%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
++ R+G + QN +L++ ELPE FD+R W + I + DQG CGS W
Sbjct: 172 IKYRLGTLFPERSVQNMNEILIKP----RELPEHFDSRDKWGH--LINPVVDQGDCGSSW 225
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A+ SDR+ I S G+ + LSS L+SC + GC+GG+ +AW Y G+V
Sbjct: 226 AVSTTGISSDRLAIISEGRINASLSSQQLLSCNQHRQKGCEGGYLDRAWWYIRKLGVVGD 285
Query: 187 GT--YASKQGCRPYE--IPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
Y S Q P IP Y + C ++ K P Y VS
Sbjct: 286 HCYPYVSGQSREPGHCLIPKRDYTDRRGLRCPSGSQDST--AFKMTPPYKVS-------- 335
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--------VAGGPLGEH 294
+ EE I E+ +GPV+ + ++ D +Y G+Y+H + G H
Sbjct: 336 -------SREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYH 388
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++R++GWG + T +KYWL ANS+ T WGE+G F+I
Sbjct: 389 SVRVLGWG---VDHSTGRPIKYWLCANSWGTQWGEDGYFKI 426
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 65/117 (55%), Gaps = 11/117 (9%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--------VAGGPLGEHAIR 439
Y + + EE I E+ +GPV+ + ++ D +Y G+Y+H + G H++R
Sbjct: 332 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVR 391
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
++GWG + T +KYWL ANS+ T WGE+G F+I+RG N C IE+ + K
Sbjct: 392 VLGWG---VDHSTGRPIKYWLCANSWGTQWGEDGYFKILRGDNHCEIESFVIGAWGK 445
>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
Length = 354
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 81/245 (33%), Positives = 115/245 (46%), Gaps = 34/245 (13%)
Query: 92 DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLS 151
+P +LP FDAR W C I +RDQ +CG+ WA A ++ R+CIA+ GK +V LS
Sbjct: 127 EPRVDLPMNFDARTQWRGC--IPAVRDQQTCGACWAFSATYVLAHRLCIATNGKTNVVLS 184
Query: 152 SDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHS 211
+ V C CQGG+ AW + TG + C PY + +G+
Sbjct: 185 PEYQVQC-DTMNKACQGGYLKYAWSFLERTG-------TTVDSCIPYASGRATFSSGT-- 234
Query: 212 SCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
C KC+ VS + + + I I +G V+ TI
Sbjct: 235 -----------CPAKCK----VSTQSMTMYKAKNSRYISGVNNIKAAIMSYGSVQSGFTI 279
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
Y D + Y++G+YKHV+ LG HA+ +IGWG E S YWL NS+ +NWG +G
Sbjct: 280 YRDFMSYRSGVYKHVSTTTLGGHAVALIGWGVE-------SGTNYWLAVNSWGSNWGMSG 332
Query: 332 LFRIG 336
F+I
Sbjct: 333 YFKIA 337
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 59/99 (59%), Gaps = 9/99 (9%)
Query: 397 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 456
I I +G V+ TIY D + Y++G+YKHV+ LG HA+ +IGWG E S
Sbjct: 263 IKAAIMSYGSVQSGFTIYRDFMSYRSGVYKHVSTTTLGGHAVALIGWGVE-------SGT 315
Query: 457 KYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWL NS+ +NWG +G F+I +G ECGIE + AG P
Sbjct: 316 NYWLAVNSWGSNWGMSGYFKIAQG--ECGIENQVYAGEP 352
>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
Length = 476
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 93/302 (30%), Positives = 131/302 (43%), Gaps = 38/302 (12%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPL---EELPEGFDARINWP 108
YG S+ LE H + P L + +++ L +LPE F A WP
Sbjct: 169 YGWTAQNYSQFWGMTLEDGFKFHLGTLPPSPMLLSMNEMTASLPATTDLPEFFVASYKWP 228
Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQG 168
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC
Sbjct: 229 --GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNS 286
Query: 169 GFHGKAWKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTP 221
G +AW Y G+VS Y A+ GC R + C +N +
Sbjct: 287 GSIDRAWWYLRKRGLVSHACYPLSKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSN 346
Query: 222 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 281
I +C P Y VS +NE IM+EI ++GPV+ M + D YKTG
Sbjct: 347 R-IYQCSPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVREDFFHYKTG 390
Query: 282 IYKHVAGG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
IY+HV L HA+++ GWG +G K+W+ ANS+ +WGENG F
Sbjct: 391 IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKE--KFWVAANSWGKSWGENGYF 448
Query: 334 RI 335
RI
Sbjct: 449 RI 450
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 67/118 (56%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M + D YKTGIY+HV L HA++
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGQKE--KFWVAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 463
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 133/292 (45%), Gaps = 42/292 (14%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPL---EELPEGFDARINWPYCPTIQEIR 117
+TL + + R+G P S + L + +++ PL +LPE F A WP
Sbjct: 170 MTLEDGFKFRLGTLPPSPM----LLSMNEMTAPLPATTDLPEFFVASYKWP--GWTHGPL 223
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +AW Y
Sbjct: 224 DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWY 283
Query: 178 WVTTGIVSGGTYA------SKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
G+VS Y + GC R + C +N + I +C P Y
Sbjct: 284 LRKRGLVSHACYPLFKDQNANNGCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQCSPPY 342
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-- 289
VS ++E IM+EI ++GPV+ M + D YKTGIY+HV
Sbjct: 343 RVS---------------SSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNK 387
Query: 290 ------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 388 ESEKYRKLQTHAVKLTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRI 437
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 65/114 (57%), Gaps = 10/114 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + ++E IM+EI ++GPV+ M + D YKTGIY+HV L HA++
Sbjct: 342 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 401
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A
Sbjct: 402 LTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 453
>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
familiaris]
Length = 467
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 98/289 (33%), Positives = 136/289 (47%), Gaps = 22/289 (7%)
Query: 57 NALSKLTLSE-LEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQ 114
+A +TL E + R+G + P S + + + + P E LP F+A WP I
Sbjct: 163 SAFWGMTLDEGIRYRLGTIRPSSSV--TNMNEIHTVLRPGEVLPTAFEAAEKWPN--LIH 218
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
E DQG+C WA SDRV I S G LS +L+SC GC+GG A
Sbjct: 219 EPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQQGCRGGRLDGA 278
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W + G+VS Y + P R M S + + T C P V
Sbjct: 279 WWFLRRRGVVSDHCYPFVGREQDEAGPAPRCMMHSRAMGRGKRQATARC-----PSSHV- 332
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP- 290
+ +D+ AY L NE+ IM+E+ +GPV+ M ++ D LY+ GIY H G P
Sbjct: 333 HANDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGRPE 392
Query: 291 ----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 393 RYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 439
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 89/166 (53%), Gaps = 16/166 (9%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
P R M SR+ + T C P V + +D+ AY L NE+ IM+E+
Sbjct: 305 PAPRCMMHSRAMGRGKRQATARC-----PSSHV-HANDIYQVTPAYRLGTNEKEIMKELM 358
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LY+ GIY H G P G H+++I GWG+E L +G +
Sbjct: 359 ENGPVQALMEVHEDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRT- 417
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+KYW ANS+ WGE G FRIVRG NEC IE+ + ++G+E
Sbjct: 418 -LKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRVGME 462
>gi|332254558|ref|XP_003276396.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Nomascus leucogenys]
Length = 467
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 157 WQAGNHSAFWGMTLDEGIRYRLGTMRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 215
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 216 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 273
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 274 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHC-----P 328
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ +M+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 329 NSHVNNNDIYQVTPV-YRLGSNDKEVMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 387
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 388 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 439
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/122 (41%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ +M+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 343 VYRLGSNDKEVMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 402
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 403 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 460
Query: 499 LE 500
+E
Sbjct: 461 ME 462
>gi|332254560|ref|XP_003276397.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Nomascus leucogenys]
Length = 436
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 126 WQAGNHSAFWGMTLDEGIRYRLGTMRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 184
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 185 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 242
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHC-----P 297
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ +M+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 298 NSHVNNNDIYQVTPV-YRLGSNDKEVMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 356
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 357 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 408
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/122 (41%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ +M+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 312 VYRLGSNDKEVMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 371
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 372 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 429
Query: 499 LE 500
+E
Sbjct: 430 ME 431
>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 282
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 88/265 (33%), Positives = 116/265 (43%), Gaps = 51/265 (19%)
Query: 82 NRLPLLVQLSDPLEE-----------LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGA 130
R L +L +EE LPE FDAR WP I +RDQ SCGS WA
Sbjct: 37 TRAKFLARLGTHVEEYEERTYESDNALPENFDAREQWP--EQILPVRDQASCGSCWAFSV 94
Query: 131 VEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYA 190
E M DR+ I G+ H +S DLVSC GC GG+ KAW + + G+
Sbjct: 95 AETMGDRLSIIGCGRGH--MSPQDLVSC-DTTDMGCNGGYMDKAWAWTKSHGV------- 144
Query: 191 SKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 250
+ + C PY Q P C KC G + +F S
Sbjct: 145 TNEECMPY---------------QSGGGRVPACPAKCVNGSTIVRTKSQSFTHFTAS--- 186
Query: 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 310
+ +E++ +GP+ + T+Y D + YK+G+Y H GG G HA+ IGWG E
Sbjct: 187 ---QMQQELYENGPLSVAFTVYYDFMNYKSGVYVHKTGGVAGGHAVLCIGWGVE------ 237
Query: 311 SSVVKYWLVANSFNTNWGENGLFRI 335
YWL NS+ WGE G F+I
Sbjct: 238 -DNTPYWLCQNSWGPAWGEKGHFKI 261
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 66/135 (48%), Gaps = 13/135 (9%)
Query: 356 QANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 415
Q+ P C KC G + +F S + +E++ +GP+ + T+Y
Sbjct: 153 QSGGGRVPACPAKCVNGSTIVRTKSQSFTHFTAS------QMQQELYENGPLSVAFTVYY 206
Query: 416 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 475
D + YK+G+Y H GG G HA+ IGWG E YWL NS+ WGE G F
Sbjct: 207 DFMNYKSGVYVHKTGGVAGGHAVLCIGWGVE-------DNTPYWLCQNSWGPAWGEKGHF 259
Query: 476 RIVRGQNECGIEADI 490
+I+RG N CGIE +
Sbjct: 260 KILRGSNHCGIENQV 274
>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
[Pongo abelii]
Length = 436
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/294 (33%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 126 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 184
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 185 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 242
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPTPPCMMHSRAMGRGKRQATASC-----P 297
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 298 NSHVNNNDIYQVTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 356
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 357 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 408
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 312 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 371
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 372 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 429
Query: 499 LE 500
+E
Sbjct: 430 ME 431
>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
leucogenys]
Length = 476
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 132/296 (44%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL + + R+G P S P+L+ +++ +LPE F A WP
Sbjct: 182 MTLEDGFKFRLGTLPPS-------PMLLSMNEMTASLPATTDLPEFFVASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S+G+ LS +L+SCC GC G +A
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCSKNRPGCNSGSIDRA 292
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W Y G+VS Y A+ GC R + C +N + I +C
Sbjct: 293 WWYLRKRGLVSHACYPLFKDQNATSNGCAMASRSDGRGKRHATKPCPNNVEKSNR-IYQC 351
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS ++E IM+EI ++GPV+ M + D YKTGIY+HV
Sbjct: 352 SPPYRVS---------------SSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVT 396
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 397 SANKESEKYRKLQTHAVKLTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRI 450
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 67/118 (56%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + ++E IM+EI ++GPV+ M + D YKTGIY+HV L HA++
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSANKESEKYRKLQTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
gallopavo]
Length = 467
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/282 (34%), Positives = 131/282 (46%), Gaps = 27/282 (9%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E R+G P S N + + S E+ PE F A WP I + DQ
Sbjct: 187 MTLEEGFRKRLGTLPPSHSLLN-MEAIPGSSLLEEKFPEFFAATYAWP--DWIHDPLDQR 243
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+CG+ WA +DR+ I S G+ LS +L+SC +GC GG AW+Y T
Sbjct: 244 NCGASWAFSTASVAADRIAIHSDGQITDNLSVQNLISCDTKNQHGCGGGNIEGAWRYLKT 303
Query: 181 TGIVSGGTYAS--KQGC-RPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
G+VS Y S K P E C Y++ + N P P + ED
Sbjct: 304 HGVVSYACYPSFWKHSLDSPSENHC--YVSSEYGKNHTNGP-CPNAL-----------ED 349
Query: 238 DLNFGRIA--YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--AGGPLGE 293
R A Y + + E IM EI GPV+ M +Y D LYK GIY+H AG
Sbjct: 350 SNRLYRCASHYRISSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKWKT 409
Query: 294 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H+++++GWG P G K+W+ ANS+ WGENG FRI
Sbjct: 410 HSVKLLGWGSLPGKNGQKQ--KFWIAANSWGKYWGENGYFRI 449
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/121 (44%), Positives = 68/121 (56%), Gaps = 6/121 (4%)
Query: 378 EDDLNFGRIA--YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--AGGPL 433
ED R A Y + + E IM EI GPV+ M +Y D LYK GIY+H AG
Sbjct: 348 EDSNRLYRCASHYRISSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKW 407
Query: 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
H+++++GWG P G K+W+ ANS+ WGENG FRI+RGQNEC IE I
Sbjct: 408 KTHSVKLLGWGSLPGKNGQKQ--KFWIAANSWGKYWGENGYFRILRGQNECDIEKLILTT 465
Query: 494 L 494
L
Sbjct: 466 L 466
>gi|290973645|ref|XP_002669558.1| predicted protein [Naegleria gruberi]
gi|284083107|gb|EFC36814.1| predicted protein [Naegleria gruberi]
Length = 343
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 139/296 (46%), Gaps = 52/296 (17%)
Query: 47 PKLPFYGAEKNALSKLTLSELEMRM--GVHPDSKLPQNRLPLLVQLSDPLEELP-EGFDA 103
PK + + +T+ E + + +H ++ P ++ + P P FD+
Sbjct: 31 PKSSWKAKVYEKFANMTVGEFKQKYLGAIHEEAITPSSKSRFSIVTGPPTAYTPPTNFDS 90
Query: 104 RINWPYCPTIQEIRDQGSCGSGWA--------LGAVEAMSDRVCIASRGKRHVRLSSDDL 155
R WP C + +R+Q CGS WA + A + +SDR CIAS G +V +S
Sbjct: 91 RQKWPQC--VHTVRNQLDCGSCWAFWIEFNDLVSATKVLSDRFCIASNGSVNVIMSPQYQ 148
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
+ C D GC GG K W + G VS + CRPY ++
Sbjct: 149 IDCNMD-NLGCSGGSLPKTWNFLTNVGSVS-------EQCRPY---------------KN 185
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
N+ + +C KC G S+ + +Y+ ++IM EI +GPV S+T+Y D+
Sbjct: 186 NDDD--DCPSKCVDGKAPSF-----YKAKSYASIKGLDSIMYEIQNYGPVHASLTVYKDL 238
Query: 276 ILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
+ Y++G+Y H+ G +G HAI IIG+G + L + YW++ANS WGENG
Sbjct: 239 MSYQSGVYSHLTGNEIGGHAIVIIGFGMDSLSKK-----PYWIIANS----WGENG 285
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 63/110 (57%), Gaps = 14/110 (12%)
Query: 364 ECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 423
+C KC G S+ + +Y+ ++IM EI +GPV S+T+Y D++ Y++G
Sbjct: 190 DCPSKCVDGKAPSF-----YKAKSYASIKGLDSIMYEIQNYGPVHASLTVYKDLMSYQSG 244
Query: 424 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 473
+Y H+ G +G HAI IIG+G + L + YW++ANS WGENG
Sbjct: 245 VYSHLTGNEIGGHAIVIIGFGMDSLSKK-----PYWIIANS----WGENG 285
>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
Length = 349
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 85/245 (34%), Positives = 121/245 (49%), Gaps = 36/245 (14%)
Query: 94 LEELPEGFDAR--INWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLS 151
++ LP+ ++A N+ C + IR+Q CGS WA E ++DR CI +RGK + +S
Sbjct: 118 MDALPDSYNAANDSNYYMCQQLHRIRNQEQCGSCWAFSISEMVADRFCIGTRGKINTIMS 177
Query: 152 SDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHS 211
+VS C NGC GG A+++ TTG+VS GC PY
Sbjct: 178 PQWMVS-CDTADNGCNGGEFPTAFQFVETTGLVS-------DGCVPY------------- 216
Query: 212 SCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE-ETIMREIFRHGPVEGSMT 270
Q P C C G D++ R + N+ +++ I +GPV
Sbjct: 217 --QSGNGFVPPCPNSCANGEDINVRYRTKNSR---NFDVNDMKSVQASILANGPVISGFK 271
Query: 271 IYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
+Y D Y++G YKHVAGG +G HAI+++GWG T S V YW+VANS++ WG N
Sbjct: 272 VYRDFYNYRSG-YKHVAGGLVGGHAIKVVGWGV------TQSNVPYWIVANSWSDEWGMN 324
Query: 331 GLFRI 335
G F I
Sbjct: 325 GYFWI 329
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/168 (35%), Positives = 83/168 (49%), Gaps = 26/168 (15%)
Query: 331 GLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 390
GL GC PY Q+ P C C G D++ R +
Sbjct: 207 GLVSDGCVPY---------------QSGNGFVPPCPNSCANGEDINVRYRTKNSR---NF 248
Query: 391 PANE-ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
N+ +++ I +GPV +Y D Y++G YKHVAGG +G HAI+++GWG
Sbjct: 249 DVNDMKSVQASILANGPVISGFKVYRDFYNYRSG-YKHVAGGLVGGHAIKVVGWGV---- 303
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
T S V YW+VANS++ WG NG F I+RG NEC IE ++ +P +
Sbjct: 304 --TQSNVPYWIVANSWSDEWGMNGYFWILRGTNECSIEENMWETIPAL 349
>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
Length = 475
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 133/292 (45%), Gaps = 42/292 (14%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPL---EELPEGFDARINWPYCPTIQEIR 117
+TL + + R+G P S + L + +++ PL +LPE F A WP
Sbjct: 182 MTLEDGFKFRLGTLPPSPM----LLSMNEMTXPLPATTDLPEFFVASYKWP--GWTHGPL 235
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +AW Y
Sbjct: 236 DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWY 295
Query: 178 WVTTGIVSGGTYA------SKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
G+VS Y + GC R + C +N + I +C P Y
Sbjct: 296 LRKRGLVSHACYPLFKDQNANNGCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQCSPPY 354
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-- 289
VS ++E IM+EI ++GPV+ M + D YKTGIY+HV
Sbjct: 355 RVS---------------SSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNK 399
Query: 290 ------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 400 ESEKYRKLQTHAVKLTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRI 449
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 65/114 (57%), Gaps = 10/114 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + ++E IM+EI ++GPV+ M + D YKTGIY+HV L HA++
Sbjct: 354 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 413
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A
Sbjct: 414 LTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|157058759|gb|ABV03137.1| cathepsin B-84 [Rhopalosiphum padi]
Length = 219
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/179 (40%), Positives = 104/179 (58%), Gaps = 7/179 (3%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
E+P+ FDARI W YC TI E+R+QG+CGS WA G A +DR+C+A+ G + +S+++L
Sbjct: 45 EVPDFFDARIEWKYCKTIGEVRNQGNCGSCWAHGTTGAFADRLCVATNGDFNELISAEEL 104
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSC- 213
CC CG GC GG +AW Y+ G+V+GG Y + GC+PY++ PC R G H+SC
Sbjct: 105 TFCCHTCGFGCNGGNPIRAWLYFKRHGVVTGGNYNTTDGCQPYKVPPCIRDEEG-HNSCS 163
Query: 214 -QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
Q E N C + C Y++ + AY L N T+ + +GP+E S +
Sbjct: 164 GQRTERN-HRCSKSCYGNTTSDYKNGHYKTKDAYYLTNN--TMQIDTMIYGPIESSFDV 219
>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Pongo abelii]
Length = 467
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/294 (33%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 157 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 215
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 216 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 273
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 274 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPTPPCMMHSRAMGRGKRQATASC-----P 328
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 329 NSHVNNNDIYQVTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 387
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 388 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 439
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 343 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 402
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 403 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 460
Query: 499 LE 500
+E
Sbjct: 461 ME 462
>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 520
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 90/285 (31%), Positives = 137/285 (48%), Gaps = 24/285 (8%)
Query: 62 LTLSE-LEMRMGV--HPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+TL E ++ R+G P S + N L + + +D LP F+A W I E D
Sbjct: 219 MTLDEGIQYRLGTIKPPTSVMNMNELQMNMDEND---VLPSYFNAADKWS--GMIHEPLD 273
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
QG+C WA SDR+ I S G LS +L+SC GC GG AW +
Sbjct: 274 QGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLLSCNTRHQQGCNGGRIDGAWWFL 333
Query: 179 VTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
G+V+ Y + M S S+ + C S+ ++
Sbjct: 334 RRRGVVTDECYPFSNQETNHSPNAPACMMHSRSTGRGKRQAIARCPNP------RSHANE 387
Query: 239 LNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV---AGGP----- 290
+ AY L +NE+ IM+E+ +GPV+ + ++ D +Y+TGIY+H AG P
Sbjct: 388 IYQSTPAYRLSSNEKEIMKELMENGPVQAILEVHEDFFMYRTGIYRHTAVAAGKPEQYRR 447
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E + +G++ KYW+ ANS+ +WGE+G FRI
Sbjct: 448 HGTHSVKITGWGEEQMPDGSNQ--KYWIAANSWGKDWGEHGYFRI 490
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 52/133 (39%), Positives = 85/133 (63%), Gaps = 10/133 (7%)
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP 432
S+ +++ AY L +NE+ IM+E+ +GPV+ + ++ D +Y+TGIY+H A G P
Sbjct: 383 SHANEIYQSTPAYRLSSNEKEIMKELMENGPVQAILEVHEDFFMYRTGIYRHTAVAAGKP 442
Query: 433 -----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIE 487
G H+++I GWG+E + +G++ KYW+ ANS+ +WGE+G FRI RG+NEC IE
Sbjct: 443 EQYRRHGTHSVKITGWGEEQMPDGSNQ--KYWIAANSWGKDWGEHGYFRITRGENECEIE 500
Query: 488 ADITAGLPKIGLE 500
+ ++G+E
Sbjct: 501 TFVVGVWGRVGME 513
>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
gorilla gorilla]
Length = 462
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/294 (33%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 152 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 210
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 211 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 268
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 269 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSQAMGRGKRQATAHC-----P 323
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 324 NSYVNNNDIYQVTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 382
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 383 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 434
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 338 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 397
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 398 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 455
Query: 499 LE 500
+E
Sbjct: 456 ME 457
>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 324
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 80/264 (30%), Positives = 126/264 (47%), Gaps = 41/264 (15%)
Query: 85 PLLVQLSDP----LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
LL + DP ++ + FDAR +W C TI E+ + G+ WA A +DR+C+
Sbjct: 69 KLLYKTRDPRYVAYGKISKEFDARKHWSQCKTIGEVYNDGNSDLSWAYATTGAFADRMCV 128
Query: 141 ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI 200
A+ G + LS++ L+SC N +AWK++ G+VSGG Y + GC+P +I
Sbjct: 129 ATNGSYNQLLSTEQLISCSGIKSNAMA---DDQAWKFFKKQGLVSGGKYNTNDGCQPSKI 185
Query: 201 PCERYMNGSHSSCQDNEP--NTPE------CIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
P P N P+ C C + Y D +++Y+
Sbjct: 186 P----------------PIFNLPKKIYNRTCDNFCYGNSLIDYNHD--HVKVSYTYHVLY 227
Query: 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH-AIRIIGWGQEPLGEGTS 311
+ I RE+ +GPV ++Y D+ LY +G+Y + + ++IGWG E
Sbjct: 228 KNIQREVQTYGPVSAYFSLYDDLFLYTSGVYARTEKSKFVRYQSAKLIGWGVE------- 280
Query: 312 SVVKYWLVANSFNTNWGENGLFRI 335
+ V YWL+ NS+ WG+NGLF+I
Sbjct: 281 NGVDYWLLVNSWGNEWGQNGLFKI 304
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 63/114 (55%), Gaps = 8/114 (7%)
Query: 385 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH-AIRIIGW 443
+++Y+ + I RE+ +GPV ++Y D+ LY +G+Y + + ++IGW
Sbjct: 218 KVSYTYHVLYKNIQREVQTYGPVSAYFSLYDDLFLYTSGVYARTEKSKFVRYQSAKLIGW 277
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
G E + V YWL+ NS+ WG+NGLF+I RG +EC AG+PK+
Sbjct: 278 GVE-------NGVDYWLLVNSWGNEWGQNGLFKIKRGTDECQFGRHTYAGVPKM 324
>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
Length = 475
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 133/292 (45%), Gaps = 42/292 (14%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPL---EELPEGFDARINWPYCPTIQEIR 117
+TL + + R+G P S + L + +++ PL +LPE F A WP
Sbjct: 182 MTLEDGFKFRLGTLPPSPM----LLSMNEMTAPLPATTDLPEFFVASYKWP--GWTHGPL 235
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +AW Y
Sbjct: 236 DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWY 295
Query: 178 WVTTGIVSGGTYA------SKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
G+VS Y + GC R + C +N + I +C P Y
Sbjct: 296 LRKRGLVSHACYPLFKDQNANNGCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQCSPPY 354
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-- 289
VS ++E IM+EI ++GPV+ M + D YKTGIY+HV
Sbjct: 355 RVS---------------SSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNK 399
Query: 290 ------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 400 ESEKYRKLQTHAVKLTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRI 449
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 65/114 (57%), Gaps = 10/114 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + ++E IM+EI ++GPV+ M + D YKTGIY+HV L HA++
Sbjct: 354 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 413
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A
Sbjct: 414 LTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
Length = 475
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 133/292 (45%), Gaps = 42/292 (14%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPL---EELPEGFDARINWPYCPTIQEIR 117
+TL + + R+G P S + L + +++ PL +LPE F A WP
Sbjct: 182 MTLEDGFKFRLGTLPPSPM----LLSMNEMTAPLPATTDLPEFFVASYKWP--GWTHGPL 235
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQ +C + WA +DR+ I S+G+ LS +L+SCC +GC G +AW Y
Sbjct: 236 DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWY 295
Query: 178 WVTTGIVSGGTYA------SKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
G+VS Y + GC R + C +N + I +C P Y
Sbjct: 296 LRKRGLVSHACYPLFKDQNANNGCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQCSPPY 354
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-- 289
VS ++E IM+EI ++GPV+ M + D YKTGIY+HV
Sbjct: 355 RVS---------------SSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNK 399
Query: 290 ------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 400 ESEKYRKLQTHAVKLTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRI 449
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 65/114 (57%), Gaps = 10/114 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + ++E IM+EI ++GPV+ M + D YKTGIY+HV L HA++
Sbjct: 354 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 413
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A
Sbjct: 414 LTGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
sapiens]
gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
Length = 436
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/294 (33%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 126 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 184
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 185 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 242
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHC-----P 297
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 298 NSYVNNNDIYQVTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 356
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 357 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 408
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 312 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 371
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 372 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 429
Query: 499 LE 500
+E
Sbjct: 430 ME 431
>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
Length = 415
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 101/313 (32%), Positives = 141/313 (45%), Gaps = 27/313 (8%)
Query: 32 DLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQL 90
D+ KA +R ++ + +A +TL E + R+G S N + L
Sbjct: 93 DMIKAINRGNYG-------WQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVL 145
Query: 91 SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
E LP F+A WP I E DQG+C WA SDRV I S G L
Sbjct: 146 GQG-EVLPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPIL 202
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSH 210
S +L+SC GC+GG AW + G+VS Y + P R M S
Sbjct: 203 SPQNLLSCDTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASPTPRCMMHSR 262
Query: 211 SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
+ + T C P V +D+ AY L ++E+ IM+E+ +GPV+ M
Sbjct: 263 AMGRGKRQATSRC-----PNGQVD-SNDIYQVTPAYRLGSDEKEIMKELMENGPVQALME 316
Query: 271 IYADMILYKTGIYKHV---AGGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
++ D LY+ GIY H G P G H+++I GWG+E L +G + +KYW ANS
Sbjct: 317 VHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRT--IKYWTAANS 374
Query: 323 FNTNWGENGLFRI 335
+ WGE G FRI
Sbjct: 375 WGPWWGERGHFRI 387
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 62/166 (37%), Positives = 88/166 (53%), Gaps = 16/166 (9%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
P R M SR+ + T C P V +D+ AY L ++E+ IM+E+
Sbjct: 253 PTPRCMMHSRAMGRGKRQATSRC-----PNGQVD-SNDIYQVTPAYRLGSDEKEIMKELM 306
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHV---AGGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LY+ GIY H G P G H+++I GWG+E L +G +
Sbjct: 307 ENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRT- 365
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+KYW ANS+ WGE G FRIVRG NEC IE + ++G+E
Sbjct: 366 -IKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGVWGRVGME 410
>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
Length = 476
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 93/293 (31%), Positives = 131/293 (44%), Gaps = 43/293 (14%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E + R+G P S L + + L+ + LPE F A WP DQ
Sbjct: 182 MTLEEGFKYRLGTLPPSPLLLSMNEVTASLTKTTD-LPEFFIASYKWP--GWTHGPLDQK 238
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C + WA +DR+ I S+G+ LS +L+SCC GC +AW Y
Sbjct: 239 NCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKKRRGCNSESVDRAWWYLRK 298
Query: 181 TGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPEC---IRKCQPG 230
G+VS Y A+ GC R + + C PN+ E I +C P
Sbjct: 299 RGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPC----PNSIEKSNRIYQCSPP 354
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG- 289
Y VS +NE IMREI ++GPV+ M ++ D YKTGIY+H+
Sbjct: 355 YRVS---------------SNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTN 399
Query: 290 -------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 400 EDSEKYRKFRTHAVKLTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRI 450
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 67/118 (56%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IMREI ++GPV+ M ++ D YKTGIY+H+ HA++
Sbjct: 355 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVK 414
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 415 LTGWGTLRGAQGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
Length = 484
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 92/302 (30%), Positives = 140/302 (46%), Gaps = 31/302 (10%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E ++ R+G S N + V +++ + LP F+A WP + E DQG
Sbjct: 187 MTLDEGIQYRLGTAKPSSSVMNMNEIHVNMNNDI--LPSHFNAAEKWP--GLVHEPLDQG 242
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C WA SDR+ I S G LS +L+SC +GC+GG AW Y
Sbjct: 243 NCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLLSCDTRNQHGCRGGRVDGAWWYLRR 302
Query: 181 TGIVSGGTYA-SKQGCRPYEIPC---ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
G+VS Y + + PC R M +N PN
Sbjct: 303 RGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPN------------QYYSS 350
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL----- 291
+++ AY L ++E+ IM+E++ +GPV+ M ++ D +YK+GIY+
Sbjct: 351 NEIYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHH 410
Query: 292 ---GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYM 348
G H+++I GWG+E +G + KYWL ANS+ +WGE+G FRI E E ++
Sbjct: 411 RRHGTHSVKITGWGEERGRDGQTH--KYWLAANSWGRDWGEDGYFRIARGENECEIETFI 468
Query: 349 NG 350
G
Sbjct: 469 VG 470
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/122 (39%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL--------GEHAI 438
AY L ++E+ IM+E++ +GPV+ M ++ D +YK+GIY+ G H++
Sbjct: 359 AYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHHRRHGTHSV 418
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E +G + KYWL ANS+ +WGE+G FRI RG+NEC IE I ++
Sbjct: 419 KITGWGEERGRDGQTH--KYWLAANSWGRDWGEDGYFRIARGENECEIETFIVGVWGRVS 476
Query: 499 LE 500
+E
Sbjct: 477 ME 478
>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
sapiens]
gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; AltName:
Full=Oxidized LDL-responsive gene 2 protein;
Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TIN Ag-related protein;
Short=TIN-Ag-RP; Flags: Precursor
gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
[Homo sapiens]
gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
Length = 467
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/294 (33%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 157 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 215
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 216 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 273
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 274 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHC-----P 328
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 329 NSYVNNNDIYQVTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 387
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 388 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 439
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 343 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 402
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 403 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 460
Query: 499 LE 500
+E
Sbjct: 461 ME 462
>gi|412985820|emb|CCO17020.1| cathepsin B-like cysteine proteinase [Bathycoccus prasinos]
Length = 541
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 89/263 (33%), Positives = 128/263 (48%), Gaps = 33/263 (12%)
Query: 92 DPLEELPEGFDARINWPYCPT-IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+P +LPE FDAR WP C I E DQG CGS WA+ + MSDR+CIAS GK RL
Sbjct: 271 EPPSDLPESFDAREKWPECSEFIGEAWDQGECGSCWAIAPTKVMSDRLCIASGGKVQERL 330
Query: 151 SSDDLVSCCKDCGN----GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYM 206
++ +++SC + C+GG A+++ G+ SGG Y ++GC Y P
Sbjct: 331 AASEILSCGQLVSEFSFGSCEGGMPDDAYEFAKEFGVASGGKYGDEKGCAAYPFP----- 385
Query: 207 NGSHSSCQDNEPNTPECIRK-----CQPGYDVSYEDDL--NFGRIAYSLPANEETIMREI 259
H C +P TP C K CQ D +++ + ++ + + + + REI
Sbjct: 386 -PCHHPCH-VQP-TPACPLKSDTAQCQGDLDEHTRNEVAQHIDKLIHCPDGDYDCMAREI 442
Query: 260 FRHGPVEG-SMTIYADMILYKTGIYKHVA-----GGPLGEHAIRIIGWGQEPLGEGTSSV 313
+ GPV + TIY + YK G Y+ A G G H I +IGW +E G + +
Sbjct: 443 YNSGPVSSYAGTIYDEFYAYKDGAYRTSADSETRGRSHGGHVIEVIGWHKESDGTYSWKI 502
Query: 314 VKYWLVANSFNTNWGENGLFRIG 336
+ WL NWG+ G RI
Sbjct: 503 INSWL-------NWGKKGHGRIA 518
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 60/133 (45%), Gaps = 20/133 (15%)
Query: 362 TPECIRK-----CQPGYDVSYEDDL--NFGRIAYSLPANEETIMREIFRHGPVEG-SMTI 413
TP C K CQ D +++ + ++ + + + + REI+ GPV + TI
Sbjct: 396 TPACPLKSDTAQCQGDLDEHTRNEVAQHIDKLIHCPDGDYDCMAREIYNSGPVSSYAGTI 455
Query: 414 YADMILYKTGIYKHVA-----GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 468
Y + YK G Y+ A G G H I +IGW +E G + ++ WL N
Sbjct: 456 YDEFYAYKDGAYRTSADSETRGRSHGGHVIEVIGWHKESDGTYSWKIINSWL-------N 508
Query: 469 WGENGLFRIVRGQ 481
WG+ G RI G+
Sbjct: 509 WGKKGHGRIAVGE 521
>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/239 (33%), Positives = 111/239 (46%), Gaps = 40/239 (16%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LPE FD+R WP I +RDQ SCGS WA E M DR+ I +G +S DLV
Sbjct: 63 LPENFDSREQWP--GKILPVRDQASCGSCWAFSVAETMGDRLSI--KGCDFGDMSPQDLV 118
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDN 216
SC GC GG+ AW + + GI + + C PY Q
Sbjct: 119 SC-DTTDMGCNGGYMDHAWAWTKSHGITT-------EKCMPY---------------QSG 155
Query: 217 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 276
P C KC G + +++ ++ N + +M E++ +GP+ + T+Y D +
Sbjct: 156 SGRVPACPAKCVNGSAIVRNKSVSYKKL------NAQQMMEELYENGPISVAFTVYYDFM 209
Query: 277 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
YK+G+Y H GG G HA+ +GWG E YWL NS+ WGE G F+I
Sbjct: 210 NYKSGVYVHKTGGIAGGHAVLCVGWGVE-------DNTPYWLCQNSWGPAWGEKGHFKI 261
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 69/132 (52%), Gaps = 13/132 (9%)
Query: 356 QANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 415
Q+ P C KC G + +++ ++ N + +M E++ +GP+ + T+Y
Sbjct: 153 QSGSGRVPACPAKCVNGSAIVRNKSVSYKKL------NAQQMMEELYENGPISVAFTVYY 206
Query: 416 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 475
D + YK+G+Y H GG G HA+ +GWG E YWL NS+ WGE G F
Sbjct: 207 DFMNYKSGVYVHKTGGIAGGHAVLCVGWGVE-------DNTPYWLCQNSWGPAWGEKGHF 259
Query: 476 RIVRGQNECGIE 487
+I+RG N CGIE
Sbjct: 260 KILRGSNHCGIE 271
>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Adrenocortical zonation factor 1; Short=AZ-1;
AltName: Full=Androgen-regulated gene 1 protein;
AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TARP; Flags: Precursor
gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
musculus]
gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
Length = 466
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 101/313 (32%), Positives = 141/313 (45%), Gaps = 27/313 (8%)
Query: 32 DLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQL 90
D+ KA +R ++ + +A +TL E + R+G S N + L
Sbjct: 144 DMIKAINRGNYG-------WQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVL 196
Query: 91 SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
E LP F+A WP I E DQG+C WA SDRV I S G L
Sbjct: 197 GQG-EVLPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPIL 253
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSH 210
S +L+SC GC+GG AW + G+VS Y + P R M S
Sbjct: 254 SPQNLLSCDTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASPTPRCMMHSR 313
Query: 211 SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
+ + T C P V +D+ AY L ++E+ IM+E+ +GPV+ M
Sbjct: 314 AMGRGKRQATSRC-----PNGQVD-SNDIYQVTPAYRLGSDEKEIMKELMENGPVQALME 367
Query: 271 IYADMILYKTGIYKHV---AGGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
++ D LY+ GIY H G P G H+++I GWG+E L +G + +KYW ANS
Sbjct: 368 VHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRT--IKYWTAANS 425
Query: 323 FNTNWGENGLFRI 335
+ WGE G FRI
Sbjct: 426 WGPWWGERGHFRI 438
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 62/166 (37%), Positives = 88/166 (53%), Gaps = 16/166 (9%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
P R M SR+ + T C P V +D+ AY L ++E+ IM+E+
Sbjct: 304 PTPRCMMHSRAMGRGKRQATSRC-----PNGQVD-SNDIYQVTPAYRLGSDEKEIMKELM 357
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHV---AGGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LY+ GIY H G P G H+++I GWG+E L +G +
Sbjct: 358 ENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRT- 416
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+KYW ANS+ WGE G FRIVRG NEC IE + ++G+E
Sbjct: 417 -IKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGVWGRVGME 461
>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 200
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 81/224 (36%), Positives = 115/224 (51%), Gaps = 31/224 (13%)
Query: 118 DQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
DQ +CGS WA G EA +DR+CI S G LS+ ++ +C GC GG AW +
Sbjct: 1 DQSACGSCWAFGVTEAFNDRLCIKSDGAFTELLSAGEMNACT--LFFGCGGGDPYSAWSW 58
Query: 178 WVTTGIVSGGTYASKQ------GCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPG 230
GI +GG Y +K GC PY+ P C ++N + P+C + G
Sbjct: 59 VHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHIN---------DTKYPKCPKVSCSG 109
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP 290
D + L YS+ + I + GPV S T+Y D + Y++G+YKH +G
Sbjct: 110 DDRHFM--LESSPYHYSVNDAKNAIRTD----GPVSASFTVYEDFLAYRSGVYKHTSGSY 163
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
LG HA++IIGWG++ S YWL NS+N +WG++GLFR
Sbjct: 164 LGGHAVKIIGWGEK-------SGQAYWLAVNSWNEDWGDHGLFR 200
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 75/142 (52%), Gaps = 23/142 (16%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC PY+ P C ++N ++ P+C + G D + L YS+ +
Sbjct: 81 GCWPYDFPPCAHHINDTK---------YPKCPKVSCSGDDRHFM--LESSPYHYSVNDAK 129
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I + GPV S T+Y D + Y++G+YKH +G LG HA++IIGWG++ S
Sbjct: 130 NAIRTD----GPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEK-------S 178
Query: 455 VVKYWLVANSFNTNWGENGLFR 476
YWL NS+N +WG++GLFR
Sbjct: 179 GQAYWLAVNSWNEDWGDHGLFR 200
>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like 1 [Pan troglodytes]
Length = 472
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/294 (33%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 162 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 220
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 221 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGG 278
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 279 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHC-----P 333
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ D + Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 334 NSYVNNNDIYQVTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 392
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 393 LGRPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 444
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 348 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 407
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 408 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 465
Query: 499 LE 500
+E
Sbjct: 466 ME 467
>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
domestica]
Length = 468
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 90/290 (31%), Positives = 133/290 (45%), Gaps = 37/290 (12%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E + R+G P S + + V L + LPE F + WP + DQ
Sbjct: 174 MTLEEGYKFRLGTLPPSPTLLSMNEMTVTLPSQTD-LPEFFISSYKWP--GWTHDPLDQK 230
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C + WA +DR+ I S+G+ LS +L+SCC +GC+GG +AW Y
Sbjct: 231 NCAASWAFSTASVAADRIAIQSKGRYTDNLSPQNLISCCVKNRHGCKGGSIDRAWWYLRK 290
Query: 181 TGIVSGGTYA-------SKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
G+VS Y + GC R + C +N + I +C P Y V
Sbjct: 291 RGLVSHACYPLFKDQIFNNNGCDMASRSDGRGKRHATKPCPNNIEKSNR-IYQCSPPYRV 349
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG----- 288
S +NE IM+EI ++GPV+ M ++ D YK+GIY+H+
Sbjct: 350 S---------------SNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIYRHINNLKDES 394
Query: 289 ---GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 395 EKYRNLRTHAVKLTGWGVLRGAQGKKE--KFWIAANSWGKSWGENGYFRI 442
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 66/114 (57%), Gaps = 10/114 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG--------GPLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YK+GIY+H+ L HA++
Sbjct: 347 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIYRHINNLKDESEKYRNLRTHAVK 406
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A
Sbjct: 407 LTGWGVLRGAQGKKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 458
>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
Length = 276
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/253 (31%), Positives = 126/253 (49%), Gaps = 25/253 (9%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSC-- 158
FDAR WP C TI E+ ++G+ WA +DR+CIA+ G + LS+++L+SC
Sbjct: 36 FDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISCSG 95
Query: 159 CKDCGNGCQGGFHGKAWKYWVTTGIVSGGT-YASKQGCRPYEIPCERYMNGSHSSCQ-DN 216
K NG G AW+Y+ T G+VSGG+ Y + GC+P +IP C
Sbjct: 96 IKASANGWVRD--GLAWEYFKTHGLVSGGSIYNTNDGCQPSKIP---------PVCNLPT 144
Query: 217 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 276
+ N C+ C + Y D ++ Y + I +E+ +GPV ++ +Y D+
Sbjct: 145 KINKRTCVDYCYGNDTIKYNHD--HVKVRYYYHVKPKDIQKEVQTYGPVTAALNLYDDIF 202
Query: 277 LYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L+K+G+Y + +++IGWG E + V YWL+ NS+ WG+NGL +I
Sbjct: 203 LHKSGVYTLTKNAKYVRLQYVKLIGWGVE-------NGVDYWLLVNSWGNEWGQNGLLKI 255
Query: 336 GCRPYEIPCERYM 348
Y E ++
Sbjct: 256 KRGKYGCAVESFV 268
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 79/164 (48%), Gaps = 20/164 (12%)
Query: 336 GCRPYEIPCERYMNGSRSSCQA-NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+P +IP C + N C+ C + Y D ++ Y
Sbjct: 130 GCQPSKIP---------PVCNLPTKINKRTCVDYCYGNDTIKYNHD--HVKVRYYYHVKP 178
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE-HAIRIIGWGQEPLGEGTS 453
+ I +E+ +GPV ++ +Y D+ L+K+G+Y +++IGWG E
Sbjct: 179 KDIQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVE------- 231
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ V YWL+ NS+ WG+NGL +I RG+ C +E+ + A +PKI
Sbjct: 232 NGVDYWLLVNSWGNEWGQNGLLKIKRGKYGCAVESFVYAAVPKI 275
>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Equus caballus]
Length = 480
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 131/296 (44%), Gaps = 49/296 (16%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL E + R+G P S P+L+ +++ +LPE F A WP
Sbjct: 186 MTLEEGFKYRLGTLPPS-------PMLLSMNEVTPSLPATTDLPEFFIASYKWP--GWTH 236
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I S G+ LS +L+SCC +GC G +A
Sbjct: 237 GPLDQKNCAASWAFSTASVAADRIAIQSNGRFTANLSPQNLISCCAKNRHGCNSGSIDRA 296
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W Y G+VS Y A+ C R + C +N + I +C
Sbjct: 297 WWYLRKRGLVSHACYPLFKDQNATNNDCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQC 355
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS +NE IM+EI ++GPV+ M ++ D YK GIY+HV
Sbjct: 356 SPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVHDDFFHYKKGIYRHVT 400
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HAI++ GWG +G K+W+ ANS+ +WGENG FRI
Sbjct: 401 STHEEPEKYRKLRTHAIKLAGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRI 454
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 67/118 (56%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YK GIY+HV L HAI+
Sbjct: 359 YRVSSNETEIMKEIMQNGPVQAIMQVHDDFFHYKKGIYRHVTSTHEEPEKYRKLRTHAIK 418
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG +G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 419 LAGWGTLRGAQGRKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 474
>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
griseus]
Length = 475
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 89/291 (30%), Positives = 131/291 (45%), Gaps = 39/291 (13%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL E + R+G P S P L+ +++ +LPE F + WP
Sbjct: 181 MTLEEGFKFRLGTLPPS-------PTLLSMNEMTATFPARADLPEVFISSYKWP--GWTH 231
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ I SRG+ LS +L+SCC +GC G +A
Sbjct: 232 GPLDQKNCAASWAFSTASVAADRIAIQSRGRYTANLSPQNLISCCAKKRHGCNSGSIDRA 291
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
W + G+VS Y P + N +++ C + R S
Sbjct: 292 WWFLRKRGLVSHACY-----------PLFKDQNTTNNICAMASRSDGRGKRHATKPCPNS 340
Query: 235 YEDDLNFGRIA--YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--- 289
+E + + Y + +NE IMREI R+GPV+ M ++ D YKTGIY+HV
Sbjct: 341 FEKSNRIYQCSPPYRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTGIYRHVISTNEE 400
Query: 290 -----PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+++ GWG L K+W+ ANS+ +WGENG FRI
Sbjct: 401 SEKYRKLRSHAVKLTGWGT--LRGAGGKKEKFWIAANSWGKSWGENGYFRI 449
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/123 (41%), Positives = 68/123 (55%), Gaps = 10/123 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IMREI R+GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 354 YRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTGIYRHVISTNEESEKYRKLRSHAVK 413
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGL 499
+ GWG L K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 414 LTGWGT--LRGAGGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQLTS 471
Query: 500 EID 502
D
Sbjct: 472 SDD 474
>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
paniscus]
Length = 436
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 97/283 (34%), Positives = 130/283 (45%), Gaps = 20/283 (7%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E + R+G S N + L +P E LP F+A WP I E DQG
Sbjct: 137 MTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN--LIHEPLDQG 193
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C WA SDRV I S G LS +L+SC GC+GG AW +
Sbjct: 194 NCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGGRLDGAWWFLRR 253
Query: 181 TGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
G+VS Y R P M S + + T C P V+ D
Sbjct: 254 RGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHC-----PNSYVNNNDIYQ 308
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LG 292
+ Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G
Sbjct: 309 VTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHG 367
Query: 293 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 368 THSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 408
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 312 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 371
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 372 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 429
Query: 499 LE 500
+E
Sbjct: 430 ME 431
>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 78/239 (32%), Positives = 112/239 (46%), Gaps = 40/239 (16%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LPE FD+R WP I +RDQ SCGS WA E M DR+ I +G + ++ DLV
Sbjct: 63 LPENFDSREQWP--GKILPVRDQASCGSCWAFSVAETMGDRLSI--KGCDYGDMAPQDLV 118
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDN 216
SC GC GG+ AW + + G+ + + C PY Q
Sbjct: 119 SC-DTTDMGCNGGYMDHAWAWTKSHGVTT-------EKCMPY---------------QSG 155
Query: 217 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 276
P C KC G + +++ ++ N + +M E++ +GP+ + T+Y D +
Sbjct: 156 SGRVPACPAKCVNGSAIVRNKSVSYKKL------NAQQMMEELYENGPISVAFTVYYDFM 209
Query: 277 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
YK+G+Y H GG G HA+ +GWG E YWL NS+ WGE G F+I
Sbjct: 210 NYKSGVYVHKTGGIAGGHAVLCVGWGVE-------DNTPYWLCQNSWGPAWGEKGHFKI 261
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 69/132 (52%), Gaps = 13/132 (9%)
Query: 356 QANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 415
Q+ P C KC G + +++ ++ N + +M E++ +GP+ + T+Y
Sbjct: 153 QSGSGRVPACPAKCVNGSAIVRNKSVSYKKL------NAQQMMEELYENGPISVAFTVYY 206
Query: 416 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 475
D + YK+G+Y H GG G HA+ +GWG E YWL NS+ WGE G F
Sbjct: 207 DFMNYKSGVYVHKTGGIAGGHAVLCVGWGVE-------DNTPYWLCQNSWGPAWGEKGHF 259
Query: 476 RIVRGQNECGIE 487
+I+RG N CGIE
Sbjct: 260 KILRGSNHCGIE 271
>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
paniscus]
Length = 467
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 97/283 (34%), Positives = 130/283 (45%), Gaps = 20/283 (7%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E + R+G S N + L +P E LP F+A WP I E DQG
Sbjct: 168 MTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN--LIHEPLDQG 224
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C WA SDRV I S G LS +L+SC GC+GG AW +
Sbjct: 225 NCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGGRLDGAWWFLRR 284
Query: 181 TGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
G+VS Y R P M S + + T C P V+ D
Sbjct: 285 RGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHC-----PNSYVNNNDIYQ 339
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LG 292
+ Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G
Sbjct: 340 VTPV-YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHG 398
Query: 293 EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 399 THSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 439
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 74/122 (60%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
Y L +N++ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 343 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 402
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 403 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 460
Query: 499 LE 500
+E
Sbjct: 461 ME 462
>gi|255087666|ref|XP_002505756.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
gi|226521026|gb|ACO67014.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
Length = 273
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 85/254 (33%), Positives = 129/254 (50%), Gaps = 31/254 (12%)
Query: 97 LPEGFDARINWPYCPTIQEI-RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
LPE FDAR WP C + + RDQG+CGS WA+ E MSDR CI S G+ LS L
Sbjct: 18 LPESFDARTKWPTCAHLIGVARDQGNCGSCWAMAPAEVMSDRACIQSGGEIDAELSPFQL 77
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQ 214
++C + GC+GG A+++ + G+V+GG + + C PY PC H C+
Sbjct: 78 LACAQG-SFGCEGGESADAYEFAKSNGVVTGGGFDDQNTCAPYPFAPC-------HHPCE 129
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA-------NEETIMREIFRHGPVEG 267
TP C C G + D + G+ ++ + A + + EI+ +GPV
Sbjct: 130 VFP--TPACPATCVGGSN----DGVQNGKASFKVKAIVDCPSFDYGCVANEIYHNGPVSS 183
Query: 268 -SMTIYADMILYKTGIYKHVA-----GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
+ IY + YK+G+++ G G H +++IGWG+ +G YW+V N
Sbjct: 184 YAGDIYEEFYAYKSGVFRESPSVAQRGANHGGHVVKVIGWGKADPAKGEGEGY-YWIVVN 242
Query: 322 SFNTNWGENGLFRI 335
S+ NWG++G+ RI
Sbjct: 243 SW-LNWGDDGVGRI 255
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 78/176 (44%), Gaps = 39/176 (22%)
Query: 337 CRPYEI-----PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
C PY PCE + TP C C G + D + G+ ++ +
Sbjct: 116 CAPYPFAPCHHPCEVFP-------------TPACPATCVGGSN----DGVQNGKASFKVK 158
Query: 392 A-------NEETIMREIFRHGPVEG-SMTIYADMILYKTGIYKHVA-----GGPLGEHAI 438
A + + EI+ +GPV + IY + YK+G+++ G G H +
Sbjct: 159 AIVDCPSFDYGCVANEIYHNGPVSSYAGDIYEEFYAYKSGVFRESPSVAQRGANHGGHVV 218
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
++IGWG+ +G YW+V NS+ NWG++G+ RI G E GI A + + +
Sbjct: 219 KVIGWGKADPAKGEGEGY-YWIVVNSW-LNWGDDGVGRIAVG--EVGIGAGVESAV 270
>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
Length = 323
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 96/286 (33%), Positives = 140/286 (48%), Gaps = 42/286 (14%)
Query: 54 AEKNA-LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPT 112
A++NA T+ ++ MG K+ N +++ D +P FDAR WP C
Sbjct: 56 AKRNARFEGHTIGQVMAMMGT---KKVINNNAAPSIKIVDA--SIPSTFDAREQWPGC-- 108
Query: 113 IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFH 171
+ + +Q CGS WA + EA+SDR+CIAS+G+ +V LS LV+ C D GN GC GG
Sbjct: 109 VHAVLNQEQCGSCWAFSSSEALSDRLCIASKGQVNVTLSPQALVA-CDDIGNQGCNGGVP 167
Query: 172 GKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
AW+Y G+ + C PY NG+ +CQ R+C G
Sbjct: 168 QLAWEYMEWKGLPT-------FECYPYTAG-----NGTDGTCQ----------RQCADGS 205
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP- 290
++Y F + A I EI +GPV G+M +Y D + Y +G+Y +
Sbjct: 206 AMTYYRAKPFSMTTCNSVA---CIQNEIITYGPVVGTMMVYQDFMSYSSGVYVYDGTAEL 262
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE-NGLFRI 335
LG HAI I+GWG + +S + YW+V NS++ WG +G F I
Sbjct: 263 LGGHAIEIVGWGTD-----ATSKLDYWIVKNSWSAAWGGLDGYFWI 303
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/169 (33%), Positives = 81/169 (47%), Gaps = 25/169 (14%)
Query: 331 GLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 390
GL C PY NG+ +CQ R+C G ++Y F +
Sbjct: 178 GLPTFECYPYTAG-----NGTDGTCQ----------RQCADGSAMTYYRAKPFSMTTCNS 222
Query: 391 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLG 449
A I EI +GPV G+M +Y D + Y +G+Y + LG HAI I+GWG +
Sbjct: 223 VA---CIQNEIITYGPVVGTMMVYQDFMSYSSGVYVYDGTAELLGGHAIEIVGWGTD--- 276
Query: 450 EGTSSVVKYWLVANSFNTNWGE-NGLFRIVRGQNECGIEADITAGLPKI 497
+S + YW+V NS++ WG +G F I RG N CGI+ D +A K+
Sbjct: 277 --ATSKLDYWIVKNSWSAAWGGLDGYFWIQRGTNMCGIDHDASASQAKL 323
>gi|339242631|ref|XP_003377241.1| cathepsin B [Trichinella spiralis]
gi|316973973|gb|EFV57514.1| cathepsin B [Trichinella spiralis]
Length = 199
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 65/147 (44%), Positives = 89/147 (60%), Gaps = 11/147 (7%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP+ +D R +P+C I I+DQ +CGS WA+ + MSDR CIA+ G LS ++L+
Sbjct: 54 LPKEYDVRKAYPHCKYINFIKDQSNCGSCWAVSSASVMSDRHCIATNGTEQPFLSEEELI 113
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
SCCK CG GC GG+ A++YWV G+ SGG Y K GC+PY I PC ++C +
Sbjct: 114 SCCKTCGLGCDGGYVSHAFEYWVEKGLPSGGAYGWKTGCKPYSIAPC--------NNCDE 165
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFG 242
E TP+C C P Y ++ +DD FG
Sbjct: 166 AE--TPKCKNTCIPEYPLTPKDDKYFG 190
>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 468
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 97/297 (32%), Positives = 137/297 (46%), Gaps = 26/297 (8%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L+ P E LP F+A WP
Sbjct: 158 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLA-PGEVLPTAFEASEKWPN 216
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 217 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPLLSPQNLLSCDTLHQQGCRGG 274
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y + P M S + + T R+C
Sbjct: 275 HLDGAWWFLRRRGVVSDHCYPFSGREQAEAGPAPPCMMHSRAMGRGKRQAT----RRCPN 330
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
+ + +D+ AY L ++E+ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 331 SH--TDANDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHT--- 385
Query: 290 PL-----------GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
PL G H+++I GWG+E L +G + +KYW ANS+ +WGE G FRI
Sbjct: 386 PLSMARPEQYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPSWGERGHFRI 440
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 52/125 (41%), Positives = 76/125 (60%), Gaps = 16/125 (12%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-----------GE 435
AY L ++E+ IM+E+ +GPV+ M ++ D LYK GIY H PL G
Sbjct: 344 AYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHT---PLSMARPEQYRRHGT 400
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
H+++I GWG+E L +G + +KYW ANS+ +WGE G FRI+RG NEC IE+ +
Sbjct: 401 HSVKITGWGEETLPDGRT--LKYWTAANSWGPSWGERGHFRILRGSNECDIESFVLGVWG 458
Query: 496 KIGLE 500
++G+E
Sbjct: 459 RVGME 463
>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 475
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 131/300 (43%), Gaps = 58/300 (19%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E + R+G P S + L ++ LPE F A WP DQ
Sbjct: 182 MTLEEGFKFRLGTLPPSPALLGMNEVTAALPAKID-LPEFFIASYKWP--GWTHGPLDQK 238
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C + WA +DR+ I S G+ LS +L+SCC +GC GG +AW Y
Sbjct: 239 NCAASWAFSTASVAADRIAIQSSGRYTANLSPQNLISCCARKRHGCGGGSVDRAWWYLRK 298
Query: 181 TGIVSGGTY--------------ASK---QGCRPYEIPCERYMNGSHSSCQDNEPNTPEC 223
G+VS Y AS+ +G R PC ++ S+
Sbjct: 299 RGLVSHACYPLFKDQNATNGCAMASRSDGRGKRHATTPCPNHIEKSNR------------ 346
Query: 224 IRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY 283
I +C P Y VS +NE IM+EI ++GPV+ M ++ D YKTGIY
Sbjct: 347 IYQCSPPYRVS---------------SNETQIMKEIMQNGPVQAIMKVHEDFFSYKTGIY 391
Query: 284 KHVAGG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+HV L HA+++ GWG G K+W+ ANS+ +WGENG F+I
Sbjct: 392 RHVTSTSEDSEKYQKLRTHAVKLTGWGTLKGARGKKE--KFWIAANSWGKSWGENGYFKI 449
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 67/118 (56%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 354 YRVSSNETQIMKEIMQNGPVQAIMKVHEDFFSYKTGIYRHVTSTSEDSEKYQKLRTHAVK 413
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG G K+W+ ANS+ +WGENG F+I+RG NE IE I A ++
Sbjct: 414 LTGWGTLKGARGKKE--KFWIAANSWGKSWGENGYFKILRGVNESDIEKLIIAAWGQL 469
>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
Length = 673
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 86/300 (28%), Positives = 135/300 (45%), Gaps = 33/300 (11%)
Query: 38 DRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEEL 97
D +D P + + A + + + +EL +G + + +
Sbjct: 29 DMIDSLNQDPSVKWEAANYDQFAGKSFAELRKLLGGKRGEESSSEEARYNTRDVKSTVAI 88
Query: 98 PEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVS 157
P+ FD+R WP C I IR+QG CGS WA SDR+CI + +V +S + L+
Sbjct: 89 PDTFDSRTKWPQC--IHGIRNQGQCGSCWAFATTGVFSDRLCITTNNVSNVVISPEFLIE 146
Query: 158 CCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNE 217
C K CQGG+ +WK+++ TGI + C PY Y N +++ C+
Sbjct: 147 CDKT-SFACQGGYGYYSWKFFMNTGI-------PLESCVPYTKDSLVYGNTTNAQCRSTC 198
Query: 218 PN-TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 276
+ +P + K Y + YS N +T EI +GPVE +Y+D
Sbjct: 199 TDGSPLKLYKAASAYYI------------YSPITNYQT---EIMTNGPVEADFDVYSDFY 243
Query: 277 LYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
YK+GIY+ AG +G HA++++GW + G YW+ N + T+WG G F I
Sbjct: 244 SYKSGIYQKTAGSTYVGGHAVKVLGWASDSNG------TPYWIAQNQWGTSWGMGGYFYI 297
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 52/96 (54%), Gaps = 10/96 (10%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQE 446
YS N +T EI +GPVE +Y+D YK+GIY+ AG +G HA++++GW +
Sbjct: 216 YSPITNYQT---EIMTNGPVEADFDVYSDFYSYKSGIYQKTAGSTYVGGHAVKVLGWASD 272
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN 482
G YW+ N + T+WG G F I RG +
Sbjct: 273 SNG------TPYWIAQNQWGTSWGMGGYFYIYRGNS 302
>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
Length = 471
Score = 128 bits (322), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 85/256 (33%), Positives = 118/256 (46%), Gaps = 31/256 (12%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ LP F+A WP I E DQG+C + WA SDR+ I S G +LS +
Sbjct: 198 DHLPSYFNAVDKWP--GKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQN 255
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP------CERYMNG 208
L+SC +GC GG AW + G+V+ Q C P+ P R M
Sbjct: 256 LISCDTRHQDGCAGGRIDGAWWFMRRRGVVT-------QDCYPFSPPEQSAVEVARCMMQ 308
Query: 209 SHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
S + + T C SY +D+ Y L NE IM+EI +GPV+
Sbjct: 309 SRAVGRGKRQATAHCPNS------HSYHNDIYQSTPPYRLSTNENEIMKEIMDNGPVQAI 362
Query: 269 MTIYADMILYKTGIYKHVAGG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVA 320
M ++ D +YK+GI++H H++RI GWG+E G + KYW+ A
Sbjct: 363 MEVHEDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRITGWGEERDYSGRTR--KYWIGA 420
Query: 321 NSFNTNWGENGLFRIG 336
NS+ NWGE+G FRI
Sbjct: 421 NSWGKNWGEDGYFRIA 436
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 51/133 (38%), Positives = 73/133 (54%), Gaps = 10/133 (7%)
Query: 376 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG---- 431
SY +D+ Y L NE IM+EI +GPV+ M ++ D +YK+GI++H
Sbjct: 328 SYHNDIYQSTPPYRLSTNENEIMKEIMDNGPVQAIMEVHEDFFVYKSGIFRHTDVNYHKP 387
Query: 432 ----PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIE 487
H++RI GWG+E G + KYW+ ANS+ NWGE+G FRI RG NEC IE
Sbjct: 388 SQYRKHATHSVRITGWGEERDYSGRTR--KYWIGANSWGKNWGEDGYFRIARGVNECDIE 445
Query: 488 ADITAGLPKIGLE 500
+ ++ +E
Sbjct: 446 TFVIGVWGRVTME 458
>gi|161343837|tpg|DAA06099.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 255
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 96/171 (56%), Gaps = 3/171 (1%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+P+ FDAR W C TI +RDQG+CGS WA+ A +DR+C+A+ + LS++++
Sbjct: 87 RIPKHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 146
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
CC CG GC GG+ KAW+ + G+V+GG Y S +GC PY +P Y H++C
Sbjct: 147 TFCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAG 206
Query: 216 N-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPV 265
+ C R C D+ +++D + R Y L +I +++ +GP+
Sbjct: 207 KPRESNHRCTRMCYGNXDLDFDEDHRYTRDFYYLTYG--SIQKDVMTYGPI 255
>gi|344287520|ref|XP_003415501.1| PREDICTED: tubulointerstitial nephritis antigen isoform 2
[Loxodonta africana]
Length = 437
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 130/294 (44%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L P E LP F+A WP
Sbjct: 127 WRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIHTVLG-PGEVLPMAFEASKKWPN 185
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 186 --LIHEPLDQGDCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHNQQGCRGG 243
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 244 RLDGAWWFLRRRGVVSDHCYPFSGHERDKAGPVPPCMMHSRAMGRGKRQATSRC-----P 298
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
V + +D+ AY L NE+ IM+E+ +GPV+ M ++ D LY+ GIY H
Sbjct: 299 NSHV-HGNDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVS 357
Query: 290 P--------LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 358 QERPEQYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 409
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 51/122 (41%), Positives = 72/122 (59%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP--------LGEHAI 438
AY L NE+ IM+E+ +GPV+ M ++ D LY+ GIY H G H++
Sbjct: 313 AYRLGTNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSQERPEQYRRHGTHSV 372
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 373 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRVG 430
Query: 499 LE 500
+E
Sbjct: 431 ME 432
>gi|344287518|ref|XP_003415500.1| PREDICTED: tubulointerstitial nephritis antigen isoform 1
[Loxodonta africana]
Length = 468
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 130/294 (44%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L P E LP F+A WP
Sbjct: 158 WRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIHTVLG-PGEVLPMAFEASKKWPN 216
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 217 --LIHEPLDQGDCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHNQQGCRGG 274
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 275 RLDGAWWFLRRRGVVSDHCYPFSGHERDKAGPVPPCMMHSRAMGRGKRQATSRC-----P 329
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
V + +D+ AY L NE+ IM+E+ +GPV+ M ++ D LY+ GIY H
Sbjct: 330 NSHV-HGNDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVS 388
Query: 290 P--------LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 389 QERPEQYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 440
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 51/122 (41%), Positives = 72/122 (59%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP--------LGEHAI 438
AY L NE+ IM+E+ +GPV+ M ++ D LY+ GIY H G H++
Sbjct: 344 AYRLGTNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSQERPEQYRRHGTHSV 403
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 404 KITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRVG 461
Query: 499 LE 500
+E
Sbjct: 462 ME 463
>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
jacchus]
Length = 467
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 133/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 157 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEVLPTAFEASEKWPN 215
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 216 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCNTHHQQGCRGG 273
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S ++ + T C P
Sbjct: 274 HLDGAWWFLRRRGVVSDHCYPFLGRERDKAGPVPPCMMHSRATGRGKRQATAHC-----P 328
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ +++ AY L +N+ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 329 NGHVN-NNNIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVN 387
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E +G +KYW ANS+ WGE G FRI
Sbjct: 388 LGRPERYRRHGTHSVKITGWGEETWPDGRK--LKYWTAANSWGPAWGERGHFRI 439
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 72/122 (59%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
AY L +N+ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 343 AYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSV 402
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E +G +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 403 KITGWGEETWPDGRK--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 460
Query: 499 LE 500
+E
Sbjct: 461 ME 462
>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Saimiri boliviensis boliviensis]
Length = 436
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 132/294 (44%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 126 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEALPTAFEASEKWPN 184
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 185 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCNTHHQQGCRGG 242
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHC-----P 297
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ +++ AY L +N+ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 298 NGHVN-NNNIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVN 356
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E +G +KYW ANS+ WGE G FRI
Sbjct: 357 LGRPERYRRHGTHSVKITGWGEETRPDGRK--LKYWTAANSWGPAWGERGHFRI 408
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 72/122 (59%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
AY L +N+ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 312 AYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSV 371
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E +G +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 372 KITGWGEETRPDGRK--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 429
Query: 499 LE 500
+E
Sbjct: 430 ME 431
>gi|66805843|ref|XP_636643.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
gi|60465035|gb|EAL63141.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
Length = 314
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 87/253 (34%), Positives = 122/253 (48%), Gaps = 38/253 (15%)
Query: 89 QLSDPLEEL----PEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG 144
+L++ EEL P FD+R+ WP C I I +Q CGS WA + E +SDR+CIAS
Sbjct: 76 KLTENGEELKGSIPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSSSEVLSDRLCIASNN 133
Query: 145 KRHV-RLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCE 203
K + LS LV+C +GC GG AW+Y G+ + C PY
Sbjct: 134 KTNPGALSPQTLVACDVYGNDGCSGGIPQLAWEYMELKGLPT-------DSCVPYTAG-- 184
Query: 204 RYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHG 263
NG+ SCQ R C D S F ++ + I I +G
Sbjct: 185 ---NGTVYSCQ----------RSCSDSEDYSLYRAKPF---TLKTCSSVQCIQENILAYG 228
Query: 264 PVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANS 322
P+ G+M +Y D + Y +G+Y G LG HAI+I+GWG + +S + YW+VANS
Sbjct: 229 PIVGTMEVYEDFMSYSSGVYVMTPGSSLLGGHAIKIVGWGFD-----QTSQLNYWIVANS 283
Query: 323 FNTNWGENGLFRI 335
+ +WG+ G F I
Sbjct: 284 WGADWGQQGFFFI 296
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 76/168 (45%), Gaps = 26/168 (15%)
Query: 331 GLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 390
GL C PY NG+ SCQ R C D S F
Sbjct: 172 GLPTDSCVPYTAG-----NGTVYSCQ----------RSCSDSEDYSLYRAKPF---TLKT 213
Query: 391 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLG 449
++ + I I +GP+ G+M +Y D + Y +G+Y G LG HAI+I+GWG +
Sbjct: 214 CSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPGSSLLGGHAIKIVGWGFD--- 270
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+S + YW+VANS+ +WG+ G F I C I +D +A ++
Sbjct: 271 --QTSQLNYWIVANSWGADWGQQGFFFI--SMETCSISSDASAAEARV 314
>gi|354472325|ref|XP_003498390.1| PREDICTED: tubulointerstitial nephritis antigen [Cricetulus
griseus]
gi|344245030|gb|EGW01134.1| Tubulointerstitial nephritis antigen-like [Cricetulus griseus]
Length = 465
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 134/294 (45%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L E LP F+A WP
Sbjct: 156 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTALGRG-EVLPRAFEASEKWPN 214
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
IQE DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 215 --LIQEPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPILSPQNLLSCDTHHQQGCRGG 272
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y + R M S + + T C P
Sbjct: 273 RLDGAWWFLRRRGVVSDNCYPFVGREQNEAGTSSRCMMHSRAMGRGKRQATSRC-----P 327
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--- 286
V +D+ AY L ++E+ IM+E+ +GPV+ M ++ D LY++GIY H
Sbjct: 328 NGQVD-SNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPIS 386
Query: 287 AGGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 387 QGRPEQYRRHGTHSVKITGWGEEKLPDGRT--IKYWTAANSWGPWWGERGHFRI 438
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 76/122 (62%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV---AGGP-----LGEHAI 438
AY L ++E+ IM+E+ +GPV+ M ++ D LY++GIY H G P G H++
Sbjct: 342 AYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPISQGRPEQYRRHGTHSV 401
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E L +G + +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 402 KITGWGEEKLPDGRT--IKYWTAANSWGPWWGERGHFRIVRGTNECDIESFVLGVWGRVG 459
Query: 499 LE 500
+E
Sbjct: 460 ME 461
>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Saimiri boliviensis boliviensis]
Length = 467
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 132/294 (44%), Gaps = 20/294 (6%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L +P E LP F+A WP
Sbjct: 157 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVL-NPGEALPTAFEASEKWPN 215
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 216 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCNTHHQQGCRGG 273
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y R P M S + + T C P
Sbjct: 274 RLDGAWWFLRRRGVVSDHCYPFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHC-----P 328
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA-- 287
V+ +++ AY L +N+ IM+E+ +GPV+ M ++ D LYK GIY H
Sbjct: 329 NGHVN-NNNIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVN 387
Query: 288 -GGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E +G +KYW ANS+ WGE G FRI
Sbjct: 388 LGRPERYRRHGTHSVKITGWGEETRPDGRK--LKYWTAANSWGPAWGERGHFRI 439
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 72/122 (59%), Gaps = 10/122 (8%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGP-----LGEHAI 438
AY L +N+ IM+E+ +GPV+ M ++ D LYK GIY H G P G H++
Sbjct: 343 AYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSV 402
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I GWG+E +G +KYW ANS+ WGE G FRIVRG NEC IE+ + ++G
Sbjct: 403 KITGWGEETRPDGRK--LKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRVG 460
Query: 499 LE 500
+E
Sbjct: 461 ME 462
>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 196
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/184 (35%), Positives = 103/184 (55%), Gaps = 11/184 (5%)
Query: 154 DLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSS 212
+L CC CG GC GG+ +AWK + G+V+GG Y S +GC PY + PC G+++
Sbjct: 1 ELTFCCHTCGFGCHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNNTC 60
Query: 213 CQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
C R C ++ +++D + R Y L +I +++ +GP+E S +Y
Sbjct: 61 AGKPMEKNHRCTRICYGDQELDFDEDHRYTRDYYYLTYG--SIQKDVMTYGPIEASFDVY 118
Query: 273 ADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
+D YK+GIY+ LG HA+++IGWG++ + YWL+ NS+N +WG+NG
Sbjct: 119 SDFPSYKSGIYERTENATYLGGHAVKLIGWGEQ-------YGIPYWLMVNSWNEDWGDNG 171
Query: 332 LFRI 335
LF+I
Sbjct: 172 LFKI 175
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/163 (36%), Positives = 93/163 (57%), Gaps = 13/163 (7%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEP--NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 393
GC PY +P Y ++C A +P C R C ++ +++D + R Y L
Sbjct: 41 GCEPYRVPPCPYDEQGNNTC-AGKPMEKNHRCTRICYGDQELDFDEDHRYTRDYYYLTYG 99
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGT 452
+I +++ +GP+E S +Y+D YK+GIY+ LG HA+++IGWG++
Sbjct: 100 --SIQKDVMTYGPIEASFDVYSDFPSYKSGIYERTENATYLGGHAVKLIGWGEQ------ 151
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+ YWL+ NS+N +WG+NGLF+I RG NECG++ TAG+P
Sbjct: 152 -YGIPYWLMVNSWNEDWGDNGLFKIRRGTNECGVDNSTTAGVP 193
>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
Length = 298
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/254 (34%), Positives = 111/254 (43%), Gaps = 25/254 (9%)
Query: 98 PEGFDARINWPYCPT-IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
PE FD+ WP C I +IRDQ +CG WA EA SDR CIA+ G V LS+ D+
Sbjct: 25 PEAFDSAARWPECAKLIGDIRDQSNCGCCWAFAGAEAASDRQCIATGGAVAVPLSAQDV- 83
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQG-----CRPYEIP-CERY----- 205
C +GC GG W Y G V+GG Y C + P C +
Sbjct: 84 -CFNANVDGCDGGQIITPWTYVAKAGAVTGGQYNGTGPFGAGLCADWFAPHCHHHGPRGD 142
Query: 206 ----MNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 261
G + P P+ ++ D + + E IM I
Sbjct: 143 DPYPAEGDAGCPSEKSPEGPKACDATAAAGHDAFAADKHTFAGDVQTASGEAAIMAMIAE 202
Query: 262 HGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVAN 321
GPVE + T+Y D Y GIY HV G G HA++ +GWG E + KYW VAN
Sbjct: 203 GGPVETAFTVYEDFENYAGGIYHHVTGEEAGGHAVKFVGWGVE-------NGTKYWKVAN 255
Query: 322 SFNTNWGENGLFRI 335
S+N WGE G FRI
Sbjct: 256 SWNPYWGEAGYFRI 269
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 58/100 (58%), Gaps = 7/100 (7%)
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+ E IM I GPVE + T+Y D Y GIY HV G G HA++ +GWG E
Sbjct: 191 SGEAAIMAMIAEGGPVETAFTVYEDFENYAGGIYHHVTGEEAGGHAVKFVGWGVE----- 245
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADIT 491
+ KYW VANS+N WGE G FRI+RG NE GIE +T
Sbjct: 246 --NGTKYWKVANSWNPYWGEAGYFRILRGSNEGGIEDQVT 283
>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
norvegicus]
gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; Flags:
Precursor
gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
Length = 467
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/295 (32%), Positives = 132/295 (44%), Gaps = 21/295 (7%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L E LP F+A WP
Sbjct: 156 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQG-EVLPTAFEASEKWPN 214
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 215 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHHQKGCRGG 272
Query: 170 FHGKAWKYWVTTGIVSGGTYA-SKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQ 228
AW + G+VS Y S + P R M S + + T C
Sbjct: 273 RLDGAWWFLRRRGVVSDNCYPFSGREQNDEASPTPRCMMHSRAMGRGKRQATSRC----- 327
Query: 229 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV-- 286
P V D + Y L ++E+ IM+E+ +GPV+ M ++ D LY+ GIY H
Sbjct: 328 PNSQVDSNDIYQVTPV-YRLASDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPV 386
Query: 287 -AGGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 387 SQGRPEQYRRHGTHSVKITGWGEETLPDGRT--IKYWTAANSWGPWWGERGHFRI 439
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 61/166 (36%), Positives = 86/166 (51%), Gaps = 16/166 (9%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
P R M SR+ + T C P V D + Y L ++E+ IM+E+
Sbjct: 305 PTPRCMMHSRAMGRGKRQATSRC-----PNSQVDSNDIYQVTPV-YRLASDEKEIMKELM 358
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHV---AGGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LY+ GIY H G P G H+++I GWG+E L +G +
Sbjct: 359 ENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRT- 417
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+KYW ANS+ WGE G FRIVRG NEC IE + ++G+E
Sbjct: 418 -IKYWTAANSWGPWWGERGHFRIVRGINECDIETFVLGVWGRVGME 462
>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 463
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/276 (32%), Positives = 136/276 (49%), Gaps = 31/276 (11%)
Query: 67 LEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSG 125
L++R+G +H K+ Q + PL +L +DAR W I DQG CG+
Sbjct: 167 LQLRLGTLHSKRKILQMK-PLKAAFQRG--KLRRSYDAREVWG--NYISSPIDQGWCGAS 221
Query: 126 WALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVS 185
WA+ V+ +DR I S+ LS L+SC GCQGG +AW + G+++
Sbjct: 222 WAITTVQVTTDRFGIMSKRAISDVLSPQHLLSCNNLNQQGCQGGHLTRAWNWIRKFGLIT 281
Query: 186 GGTYASKQGCRPYEIPCERYMNGSHSSC---QDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
+ C P++ G S+C + + +C + + D + + L+
Sbjct: 282 -------EECYPWQ--------GRMSTCAVPKKKKETMAQCPSRVRSNNDRTTKTRLHRV 326
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK---HVAGGPLGEHAIRII 299
Y + A EE IM EI GPV+ M + D +YK+G+YK +G G H++RI+
Sbjct: 327 GPVYRV-ATEEGIMHEILTSGPVQAVMKVSRDFFMYKSGVYKCSNLASGSRTGYHSVRIV 385
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GWG+E G +VKYW+ +NS+ + WGENG FRI
Sbjct: 386 GWGEEYQG---GKIVKYWIASNSWGSWWGENGYFRI 418
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 90/173 (52%), Gaps = 18/173 (10%)
Query: 331 GLFRIGCRPYEIPCERYMNGSRSSC---QANEPNTPECIRKCQPGYDVSYEDDLNFGRIA 387
GL C P++ G S+C + + +C + + D + + L+
Sbjct: 278 GLITEECYPWQ--------GRMSTCAVPKKKKETMAQCPSRVRSNNDRTTKTRLHRVGPV 329
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK---HVAGGPLGEHAIRIIGWG 444
Y + A EE IM EI GPV+ M + D +YK+G+YK +G G H++RI+GWG
Sbjct: 330 YRV-ATEEGIMHEILTSGPVQAVMKVSRDFFMYKSGVYKCSNLASGSRTGYHSVRIVGWG 388
Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+E G +VKYW+ +NS+ + WGENG FRI++G +EC IE + A I
Sbjct: 389 EEYQG---GKIVKYWIASNSWGSWWGENGYFRILKGVDECEIEDFVIAAWADI 438
>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
Length = 812
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/249 (34%), Positives = 117/249 (46%), Gaps = 42/249 (16%)
Query: 92 DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLS 151
D + ++P F+A W +Q IRDQ CGS WA A E +SDR I K LS
Sbjct: 335 DNITDVPSEFNAVTQWKGL--VQPIRDQQQCGSCWAFSAAEVLSDRNAI-QHNKAEPVLS 391
Query: 152 SDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHS 211
+DLVSC + GC GG G AW Y TGIV+ C PY
Sbjct: 392 PEDLVSCDR-VDQGCNGGNLGTAWTYLKNTGIVT-------DACFPYTA----------- 432
Query: 212 SCQDNEPNTPECIRKCQPGYD-VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
+ P+C C+ G Y+ AY++ E + +EI HGP++ +
Sbjct: 433 ----GGGDAPKCETSCKDGSSWTKYK-----AASAYAV-NGVENMQKEIMTHGPIQVAFN 482
Query: 271 IYADMILYKTGIY--KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
+Y + YK+G+Y K P G HA++I+GWG T YWLVANS+NT+WG
Sbjct: 483 VYKSFMSYKSGVYAKKWYELMPEGGHAVKIVGWG-------TEGGKDYWLVANSWNTSWG 535
Query: 329 ENGLFRIGC 337
+ G F+I
Sbjct: 536 DEGYFKIAV 544
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 54/95 (56%), Gaps = 9/95 (9%)
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY--KHVAGGPLGEHAIRIIGWGQEPLGEGT 452
E + +EI HGP++ + +Y + YK+G+Y K P G HA++I+GWG T
Sbjct: 465 ENMQKEIMTHGPIQVAFNVYKSFMSYKSGVYAKKWYELMPEGGHAVKIVGWG-------T 517
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIE 487
YWLVANS+NT+WG+ G F+I G ++
Sbjct: 518 EGGKDYWLVANSWNTSWGDEGYFKIAVGAESISLD 552
>gi|66911417|gb|AAH97299.1| Tubulointerstitial nephritis antigen-like 1 [Rattus norvegicus]
gi|149024087|gb|EDL80584.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024088|gb|EDL80585.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024089|gb|EDL80586.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
Length = 467
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/295 (32%), Positives = 132/295 (44%), Gaps = 21/295 (7%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G S N + L E LP F+A WP
Sbjct: 156 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQG-EVLPTAFEASEKWPN 214
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 215 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHHQKGCRGG 272
Query: 170 FHGKAWKYWVTTGIVSGGTYA-SKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQ 228
AW + G+VS Y S + P R M S + + T C
Sbjct: 273 RLDGAWWFLRCRGVVSDNCYPFSGREQNDEASPTPRCMMHSRAMGRGKRQATSRC----- 327
Query: 229 PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV-- 286
P V D + Y L ++E+ IM+E+ +GPV+ M ++ D LY+ GIY H
Sbjct: 328 PNSHVDSNDIYQVTPV-YRLASDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPV 386
Query: 287 -AGGP-----LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G P G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 387 SQGRPEQYRRHGTHSVKITGWGEETLPDGRT--IKYWTAANSWGPWWGERGHFRI 439
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/166 (36%), Positives = 86/166 (51%), Gaps = 16/166 (9%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIF 402
P R M SR+ + T C P V D + Y L ++E+ IM+E+
Sbjct: 305 PTPRCMMHSRAMGRGKRQATSRC-----PNSHVDSNDIYQVTPV-YRLASDEKEIMKELM 358
Query: 403 RHGPVEGSMTIYADMILYKTGIYKHV---AGGP-----LGEHAIRIIGWGQEPLGEGTSS 454
+GPV+ M ++ D LY+ GIY H G P G H+++I GWG+E L +G +
Sbjct: 359 ENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRT- 417
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500
+KYW ANS+ WGE G FRIVRG NEC IE + ++G+E
Sbjct: 418 -IKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGVWGRVGME 462
>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
Length = 260
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/138 (46%), Positives = 88/138 (63%), Gaps = 8/138 (5%)
Query: 361 NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYADMIL 419
+ P C ++C G + YE+D ++ + AY + + E I EI ++GPV S T+YAD I
Sbjct: 129 DAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASFTVYADFIH 188
Query: 420 YKTGIYKHVAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIV 478
Y +G+YK L G HA+RIIGWG E + YWLV+NS+N WG+ GLF+I
Sbjct: 189 YLSGVYKFDGESKLLGGHAVRIIGWGIE------NGTYPYWLVSNSWNERWGDQGLFKIW 242
Query: 479 RGQNECGIEADITAGLPK 496
RG+NECGIE +ITAGLP+
Sbjct: 243 RGKNECGIEEEITAGLPR 260
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 79/144 (54%), Gaps = 15/144 (10%)
Query: 194 GCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN-E 252
GC Y +P N S + D P C ++C G + YE+D ++ + AY + + E
Sbjct: 111 GCMSYPLP---RCNPSCKTLYD----APTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVE 163
Query: 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTS 311
I EI ++GPV S T+YAD I Y +G+YK LG HA+RIIGWG E +
Sbjct: 164 RQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIE------N 217
Query: 312 SVVKYWLVANSFNTNWGENGLFRI 335
YWLV+NS+N WG+ GLF+I
Sbjct: 218 GTYPYWLVSNSWNERWGDQGLFKI 241
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 24/35 (68%)
Query: 92 DPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
D ++LPE FDAR W C +I+EIRDQ CGS W
Sbjct: 76 DDGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCW 110
>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 342
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/263 (33%), Positives = 122/263 (46%), Gaps = 34/263 (12%)
Query: 94 LEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
L LP F+A+I + C I IRDQ C + WA +V +DRVCI S G+ LS
Sbjct: 36 LTTLPSNFNAQIKFASCADVIGHIRDQAECHNCWASASVGMFNDRVCIQSGGRITDILSL 95
Query: 153 DDLVSCCKDC-----GNGCQGGFHGKAWKYWVTTGIVSGGTY------ASKQGCRPYEIP 201
L SCC +GC+ G + + GIV+GG Y + GC PY P
Sbjct: 96 AYLTSCCNHANGCPKSDGCRRGSVAEGLIFMKNHGIVTGGEYKPPKKLGNDDGCWPYPFP 155
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY--------SLPANEE 253
++ G + P C K S+ D L+ R LP + E
Sbjct: 156 KCNHVPGM-------KVKYPRCGSKVGRLAAPSHCDGLHCRRAGDVHRAKSWGRLPISPE 208
Query: 254 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 313
I +EIF +GPV MTI+ D LYK+G+Y++ G +G H +++IGWG E E
Sbjct: 209 KIKQEIFDNGPVAAIMTIHEDFRLYKSGVYEYKTGAMVGAHTLKLIGWGVEAGQE----- 263
Query: 314 VKYWLVANSFNTNWGENGLFRIG 336
YWL NS+N WG+ G ++
Sbjct: 264 --YWLAVNSWNEEWGDQGKIKLA 284
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 69/124 (55%), Gaps = 13/124 (10%)
Query: 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
LP + E I +EIF +GPV MTI+ D LYK+G+Y++ G +G H +++IGWG E
Sbjct: 203 LPISPEKIKQEIFDNGPVAAIMTIHEDFRLYKSGVYEYKTGAMVGAHTLKLIGWGVEAGQ 262
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEINLG 509
E YWL NS+N WG+ G ++ G+N ++ + +P+ + NE++
Sbjct: 263 E-------YWLAVNSWNEEWGDQGKIKLAVGKN--ALDEESRQQVPRRAV----NELDED 309
Query: 510 KMMT 513
MM
Sbjct: 310 AMMA 313
>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 326
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 79/258 (30%), Positives = 130/258 (50%), Gaps = 26/258 (10%)
Query: 85 PLLVQLSDPL----EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
+L + DP ++ + FDAR WP C TI E+ ++G+ WA A +DR+CI
Sbjct: 70 KMLYKTRDPRYIIDNQIHKEFDARKRWPQCKTIGEVHNEGNELLSWAYAATGVFADRMCI 129
Query: 141 ASRGKRHVRLSSDDLVSC--CKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY 198
A+ G + LS+++L+SC K+ +G W+Y+ T G+VSGG Y + +GC+P
Sbjct: 130 ATNGNYNQLLSTEELISCSGIKEREDGYVNRV--LVWEYFKTHGLVSGGKYNTNEGCQPS 187
Query: 199 EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMRE 258
++P + S + C+ C ++Y D +++ + I +E
Sbjct: 188 KVPT---VYNSQTKIYKR-----TCVEYCYGKDTINYNHD--HVKVSNHYFIRIKDIQKE 237
Query: 259 IFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYW 317
+ +GPV ++ D+ LYK+G+Y K H ++IGWG E + V YW
Sbjct: 238 VQTYGPVSVFFDLHDDLFLYKSGVYAKTEKSKDKRYHHAKLIGWGVE-------NGVDYW 290
Query: 318 LVANSFNTNWGENGLFRI 335
L+ NS+ WG+NGLF+I
Sbjct: 291 LLVNSWGYEWGQNGLFKI 308
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 80/160 (50%), Gaps = 18/160 (11%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 395
GC+P ++P + S++ C+ C ++Y D +++ +
Sbjct: 183 GCQPSKVPT---VYNSQTKIYKRT-----CVEYCYGKDTINYNHD--HVKVSNHYFIRIK 232
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
I +E+ +GPV ++ D+ LYK+G+Y K H ++IGWG E +
Sbjct: 233 DIQKEVQTYGPVSVFFDLHDDLFLYKSGVYAKTEKSKDKRYHHAKLIGWGVE-------N 285
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
V YWL+ NS+ WG+NGLF+I RG +EC +E+ + AGL
Sbjct: 286 GVDYWLLVNSWGYEWGQNGLFKIKRGTDECSVESHVYAGL 325
>gi|170028894|ref|XP_001842329.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879379|gb|EDS42762.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 355
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 82/247 (33%), Positives = 111/247 (44%), Gaps = 35/247 (14%)
Query: 91 SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
S + +P FDAR WP CP+I I +QG C S ++DR CI S G
Sbjct: 14 STSAQLIPTSFDARTRWPNCPSIALIPNQGCCNSSAFQIPAAVITDRACIRSNGTSTRTY 73
Query: 151 SSDDLVSCCKDCGNG----CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERY 205
S+ D ++CC DC C GG K W YW TTG+VS C P+ + P
Sbjct: 74 SAYDALACCTDCPFSQLFKCAGGDPLKVWNYWATTGLVS-------DSCMPFSLSPLCLG 126
Query: 206 MNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPV 265
N C C PGY S D G ++ + I EI +GPV
Sbjct: 127 FN---------------CPLLCAPGYAGSIVGDRKKGLKVVTVAPYVDAIQSEIILNGPV 171
Query: 266 EGSMTIYADMI-LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
E S +Y D + L ++ +Y +G LG +++IIGWG E GT +YWL+ ++F
Sbjct: 172 EASFDLYLDFVHLKQSQVYNSRSGPNLGRQSVKIIGWGVE---NGT----EYWLITSTFG 224
Query: 325 TNWGENG 331
WG G
Sbjct: 225 IGWGNQG 231
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 68/159 (42%), Gaps = 24/159 (15%)
Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGR 385
W GL C P+ + P N C C PGY S D G
Sbjct: 105 WATTGLVSDSCMPFSLSPLCLGFN---------------CPLLCAPGYAGSIVGDRKKGL 149
Query: 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMI-LYKTGIYKHVAGGPLGEHAIRIIGWG 444
++ + I EI +GPVE S +Y D + L ++ +Y +G LG +++IIGWG
Sbjct: 150 KVVTVAPYVDAIQSEIILNGPVEASFDLYLDFVHLKQSQVYNSRSGPNLGRQSVKIIGWG 209
Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
E GT +YWL+ ++F WG G +RG N
Sbjct: 210 VE---NGT----EYWLITSTFGIGWGNQGTAMFLRGVNH 241
>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 334
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 125/264 (47%), Gaps = 36/264 (13%)
Query: 81 QNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCI 140
+NR V++ ++ + FDAR WP+C TI E+ + G+ WA +DR+CI
Sbjct: 77 RNRRCFRVEID---HQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWAYVPTGVFADRMCI 133
Query: 141 ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI 200
A+ G + LS+++L+SC + W+Y G+VSGG Y + GC+P +I
Sbjct: 134 ATNGTYNQLLSTEELISCSGIKEDEFGSVNDDYVWEYLKNHGLVSGGKYNTNNGCQPSKI 193
Query: 201 ------PCERYMNGSHSSCQ-DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEE 253
P Y N C +N N + K + YD+ YED
Sbjct: 194 PPIGNLPTGLYENTCEKRCYGNNTINYNQDHVKIKNHYDIEYED---------------- 237
Query: 254 TIMREIFRHGPVEGSMTIY-ADMILYKTGIYKHVAGGPLGE-HAIRIIGWGQEPLGEGTS 311
I RE+ +GPV + ++ D LYK+G+Y+ + ++IGWG E
Sbjct: 238 -IQREVQNYGPVSMAFRVFDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVE------- 289
Query: 312 SVVKYWLVANSFNTNWGENGLFRI 335
+ V YWL+ NS+ WG+NGLF+I
Sbjct: 290 NGVDYWLLVNSWGYEWGQNGLFKI 313
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/171 (32%), Positives = 80/171 (46%), Gaps = 33/171 (19%)
Query: 336 GCRPYEIP------CERYMNGSRSSCQANEP-NTPECIRKCQPGYDVSYEDDLNFGRIAY 388
GC+P +IP Y N C N N + K + YD+ YED
Sbjct: 187 GCQPSKIPPIGNLPTGLYENTCEKRCYGNNTINYNQDHVKIKNHYDIEYED--------- 237
Query: 389 SLPANEETIMREIFRHGPVEGSMTIY-ADMILYKTGIYKHVAGGPLGE-HAIRIIGWGQE 446
I RE+ +GPV + ++ D LYK+G+Y+ + ++IGWG E
Sbjct: 238 --------IQREVQNYGPVSMAFRVFDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVE 289
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ V YWL+ NS+ WG+NGLF+I RG +EC IE + AG P++
Sbjct: 290 -------NGVDYWLLVNSWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQL 333
>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 313
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 92/292 (31%), Positives = 141/292 (48%), Gaps = 38/292 (13%)
Query: 54 AEKN--ALSKLTLSELEMRMGVHPDSK---LPQNRLPLLVQLSDPLEELPEGFDARINWP 108
EKN ++ L+ LE R GV +K + + R P V + +E FDAR WP
Sbjct: 29 VEKNRGIVTDLSKIFLETR-GVEAATKSNMMYKTRNPKYVIDNRDYKE----FDARKRWP 83
Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC---KDCGNG 165
C TI E+ ++G+ GWA A ++DR CIA+ G + LS+++L+SC + GN
Sbjct: 84 KCKTIGEVHNEGNFAFGWAYAAAGVLADRTCIATNGGYNKLLSTEELISCSGIKETNGNV 143
Query: 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECI 224
+ W+Y + G+VSGG Y S GC+P++ P + +C D+
Sbjct: 144 NERSI----WEYLKSHGVVSGGKYNSNDGCQPFKFPPIANILTHLQHTCDDH-------- 191
Query: 225 RKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIY- 283
C ++Y D R Y++ I +E+ +GPV + D +LYK+G+Y
Sbjct: 192 --CYGNTSINYNHDHVRVRNYYTIRTG--YIQKEVQTYGPVAVQFKVCDDFLLYKSGVYV 247
Query: 284 KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
K + ++IGWG E + V YWLV NS+ WG+ GLF+I
Sbjct: 248 KSDNAKVIRTQYAKLIGWGVE-------NGVDYWLVINSWGHEWGQKGLFKI 292
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 81/164 (49%), Gaps = 21/164 (12%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+P++ P + + +C + C ++Y D R Y++
Sbjct: 168 GCQPFKFPPIANILTHLQHTCDDH----------CYGNTSINYNHDHVRVRNYYTIRTG- 216
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
I +E+ +GPV + D +LYK+G+Y K + ++IGWG E
Sbjct: 217 -YIQKEVQTYGPVAVQFKVCDDFLLYKSGVYVKSDNAKVIRTQYAKLIGWGVE------- 268
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ V YWLV NS+ WG+ GLF+I RG N+CG+E+ + AG+P+I
Sbjct: 269 NGVDYWLVINSWGHEWGQKGLFKIKRGTNQCGVESVVYAGVPEI 312
>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
Length = 142
Score = 125 bits (315), Expect = 4e-26, Method: Composition-based stats.
Identities = 57/129 (44%), Positives = 84/129 (65%), Gaps = 7/129 (5%)
Query: 367 RKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK 426
+KC+ V E +F AY + ++ IM E++++GPVE + T+Y D YK+G+YK
Sbjct: 1 KKCKVQNQVWLEKK-HFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYK 59
Query: 427 HVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
H+ GG +G HA+++IGWG GE YWL+AN +N WG++G F+I+RG NECGI
Sbjct: 60 HITGGMMGGHAVKLIGWGTTDAGE------DYWLLANQWNRGWGDDGYFKIIRGTNECGI 113
Query: 487 EADITAGLP 495
E D+ AG+P
Sbjct: 114 EEDVVAGMP 122
Score = 96.3 bits (238), Expect = 3e-17, Method: Composition-based stats.
Identities = 45/111 (40%), Positives = 69/111 (62%), Gaps = 7/111 (6%)
Query: 225 RKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK 284
+KC+ V E +F AY + ++ IM E++++GPVE + T+Y D YK+G+YK
Sbjct: 1 KKCKVQNQVWLEKK-HFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYK 59
Query: 285 HVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H+ GG +G HA+++IGWG GE YWL+AN +N WG++G F+I
Sbjct: 60 HITGGMMGGHAVKLIGWGTTDAGE------DYWLLANQWNRGWGDDGYFKI 104
>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
cuniculus]
Length = 467
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 95/297 (31%), Positives = 133/297 (44%), Gaps = 26/297 (8%)
Query: 51 FYGAEKNALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
+ +A +TL E + R+G + N + L E LP F+A WP
Sbjct: 157 WQAGNHSAFWGMTLEEGIRYRLGTNRPPSSVMNMNEIYTGLGSG-EVLPTAFEASEKWPN 215
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
I E DQG+C WA SDRV I S G LS +L+SC GC+GG
Sbjct: 216 --LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHHQQGCRGG 273
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
AW + G+VS Y + P M S + + T C P
Sbjct: 274 RLDGAWWFLRRRGVVSDHCYPFSGHEQDEAGPAPPCMMHSRAMGRGKRQATARC-----P 328
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
V + +D+ AY L +NE+ IM+E+ +GPV+ M ++ D LY+ GIY H
Sbjct: 329 NSHV-HANDIYQVTPAYRLGSNEKEIMKELLENGPVQALMEVHEDFFLYQGGIYSHT--- 384
Query: 290 PL-----------GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
P+ G H+++I GWG+E L +G + +KYW ANS+ WGE G FRI
Sbjct: 385 PVSLERPERYRRHGTHSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRI 439
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/125 (40%), Positives = 75/125 (60%), Gaps = 16/125 (12%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL-----------GE 435
AY L +NE+ IM+E+ +GPV+ M ++ D LY+ GIY H P+ G
Sbjct: 343 AYRLGSNEKEIMKELLENGPVQALMEVHEDFFLYQGGIYSHT---PVSLERPERYRRHGT 399
Query: 436 HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
H+++I GWG+E L +G + +KYW ANS+ WGE G FRI+RG NEC IE+ +
Sbjct: 400 HSVKITGWGEETLPDGRT--LKYWTAANSWGPAWGERGHFRILRGTNECDIESFVLGVWG 457
Query: 496 KIGLE 500
++G+E
Sbjct: 458 RVGME 462
>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 334
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 79/251 (31%), Positives = 122/251 (48%), Gaps = 37/251 (14%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++ + FDAR WP+C TI E+ + G+ WA +DR+CIA+ G + LS+++L
Sbjct: 89 QIDQEFDARKRWPHCKTIGEVHNDGNSLLSWAYVPTGVFADRMCIATNGTYNQLLSTEEL 148
Query: 156 VSC--CKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI------PCERYMN 207
+SC K+ G ++ W+Y G+VSGG Y + GC+P +I P Y N
Sbjct: 149 ISCSGIKEDEFGSVNDYY--VWEYLKNHGLVSGGKYNTNNGCQPSKIPPIGNLPTGLYEN 206
Query: 208 GSHSSCQ-DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVE 266
C +N N + K + YD+ YED I RE+ +GPV
Sbjct: 207 TCEKRCYGNNTINYNQDHVKIKNHYDIEYED-----------------IQREVQNYGPVS 249
Query: 267 GSMTIY-ADMILYKTGIYKHVAGGPLGE-HAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324
+ ++ D LYK+G+Y+ + ++IGWG E + V YWL+ N +
Sbjct: 250 MAFKVFDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVE-------NGVDYWLLVNFWG 302
Query: 325 TNWGENGLFRI 335
WG+NGLF+I
Sbjct: 303 YEWGQNGLFKI 313
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 79/171 (46%), Gaps = 33/171 (19%)
Query: 336 GCRPYEIP------CERYMNGSRSSCQANEP-NTPECIRKCQPGYDVSYEDDLNFGRIAY 388
GC+P +IP Y N C N N + K + YD+ YED
Sbjct: 187 GCQPSKIPPIGNLPTGLYENTCEKRCYGNNTINYNQDHVKIKNHYDIEYED--------- 237
Query: 389 SLPANEETIMREIFRHGPVEGSMTIY-ADMILYKTGIYKHVAGGPLGE-HAIRIIGWGQE 446
I RE+ +GPV + ++ D LYK+G+Y+ + ++IGWG E
Sbjct: 238 --------IQREVQNYGPVSMAFKVFDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVE 289
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ V YWL+ N + WG+NGLF+I RG +EC IE + AG P++
Sbjct: 290 -------NGVDYWLLVNFWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQL 333
>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 300
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 132/288 (45%), Gaps = 49/288 (17%)
Query: 51 FYGAEKNALSKLTL--SELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWP 108
F G K+ +S L + S L+ G P +++PE FD R +P
Sbjct: 39 FEGLTKDEISSLLMPVSFLKSAKGAAPRGTFADK------------DDVPESFDFREEYP 86
Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQ 167
+C I E+ DQG CGS WA +V DR CIA K+ V+ S +VSC D GN C
Sbjct: 87 HC--IPEVVDQGGCGSCWAFSSVATFGDRRCIAGLDKKPVKYSPQYVVSC--DHGNMACN 142
Query: 168 GGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
GG+ AWK+ TG + C PY+ + C D K
Sbjct: 143 GGWLPNAWKFLTKTGTTT-------DECVPYQSGSTTLRGTCPTKCADGS-------SKV 188
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
SY+D +PA +M+ + GP++ + +Y+D + Y++G+Y+H
Sbjct: 189 HLTTATSYKD------YGLDIPA----MMKALSTTGPLQVAFLVYSDFMYYESGVYQHTY 238
Query: 288 GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G G HA+ ++G+G + G V YW++ NS+ +WGE+G FR+
Sbjct: 239 GYMEGGHAVEMVGYGTDDDG------VDYWIIRNSWGPDWGEDGYFRM 280
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 71/131 (54%), Gaps = 11/131 (8%)
Query: 365 CIRKCQPGYD-VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 423
C KC G V ++ +PA +M+ + GP++ + +Y+D + Y++G
Sbjct: 177 CPTKCADGSSKVHLTTATSYKDYGLDIPA----MMKALSTTGPLQVAFLVYSDFMYYESG 232
Query: 424 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
+Y+H G G HA+ ++G+G + G V YW++ NS+ +WGE+G FR++RG N+
Sbjct: 233 VYQHTYGYMEGGHAVEMVGYGTDDDG------VDYWIIRNSWGPDWGEDGYFRMIRGIND 286
Query: 484 CGIEADITAGL 494
C IE AG
Sbjct: 287 CSIEEQAYAGF 297
>gi|255076333|ref|XP_002501841.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226517105|gb|ACO63099.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 359
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 100/262 (38%), Positives = 128/262 (48%), Gaps = 33/262 (12%)
Query: 92 DPLE-ELPEGFDARINWPYC-PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
DP + LP FDAR WP C I +RDQG CGS WA+ E M+DR+CIAS G
Sbjct: 99 DPADWNLPLNFDARQKWPQCRAIIGTVRDQGKCGSCWAVATAEVMNDRLCIASGGAEQRE 158
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY-ASKQGCRPYEI-PCERYMN 207
LS +S C D G+GCQGG A T G+V GG SK C PYE PCE
Sbjct: 159 LSPQYPLS-CYDGGSGCQGGDVAVAMHEATTKGMVFGGMLNRSKTACLPYEFEPCEH--- 214
Query: 208 GSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRI-----AYSLPANE-ETIMREIFR 261
CQ EC G + L ++ Y+ P N+ I +EI
Sbjct: 215 ----PCQVQGVIPHECPAHVDDGTCLGNTFKLADQKVFPKSDVYTCPPNDWACIAQEIMT 270
Query: 262 HGPVEGSM-TIYADMILYKTGIY------KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 314
+GPV + T+++D Y G+Y K+ G LG HA ++IGWG E T
Sbjct: 271 YGPVAVTFGTVHSDFYGYHAGVYTVREEDKNEEG--LGMHATKLIGWG---FDEATGH-- 323
Query: 315 KYWLVANSFNTNWGENGLFRIG 336
YWL+ NS++ NWG +GL R+G
Sbjct: 324 PYWLMMNSWD-NWGIHGLGRVG 344
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 73/162 (45%), Gaps = 29/162 (17%)
Query: 334 RIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI-----A 387
+ C PYE PCE CQ EC G + L ++
Sbjct: 201 KTACLPYEFEPCEH-------PCQVQGVIPHECPAHVDDGTCLGNTFKLADQKVFPKSDV 253
Query: 388 YSLPANE-ETIMREIFRHGPVEGSM-TIYADMILYKTGIY------KHVAGGPLGEHAIR 439
Y+ P N+ I +EI +GPV + T+++D Y G+Y K+ G LG HA +
Sbjct: 254 YTCPPNDWACIAQEIMTYGPVAVTFGTVHSDFYGYHAGVYTVREEDKNEEG--LGMHATK 311
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ 481
+IGWG E T YWL+ NS++ NWG +GL R+ G+
Sbjct: 312 LIGWG---FDEATGH--PYWLMMNSWD-NWGIHGLGRVGVGE 347
>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
thaliana]
Length = 183
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 95/154 (61%), Gaps = 9/154 (5%)
Query: 344 CERYMNGSRSSCQANEPN--TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREI 401
C+ Y + + S EP TP+C RKC + + + ++G AY + + + IM E+
Sbjct: 17 CDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQL-WGESKHYGVGAYRINPDPQDIMAEV 75
Query: 402 FRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
+++GPVE + T+Y D YK+G+YK++ G +G HA+++IGWG GE YWL+
Sbjct: 76 YKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGE------DYWLL 129
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AN +N +WG++G F+I RG NECGIE + AGLP
Sbjct: 130 ANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLP 163
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 92/164 (56%), Gaps = 23/164 (14%)
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPY--EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
AW Y+ G+V+ Q C PY C SH C+ P TP+C RKC
Sbjct: 3 AWLYFKYHGVVT-------QECDPYFDNTGC------SHPGCEPTYP-TPKCERKCVSRN 48
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL 291
+ + + ++G AY + + + IM E++++GPVE + T+Y D YK+G+YK++ G +
Sbjct: 49 QL-WGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKI 107
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA+++IGWG GE YWL+AN +N +WG++G F+I
Sbjct: 108 GGHAVKLIGWGTSDDGE------DYWLLANQWNRSWGDDGYFKI 145
>gi|268572247|ref|XP_002648914.1| Hypothetical protein CBG17827 [Caenorhabditis briggsae]
Length = 150
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 92/190 (48%), Gaps = 47/190 (24%)
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKA 174
IR+Q +CGS WA GA E +SDR+CI ++G R +S D++ CC + CG GC G
Sbjct: 2 IRNQTNCGSCWAFGAAEVISDRICIVTKGARQPIISPTDMLDCCGEYCGYGCDG------ 55
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
C + + TP+C CQ Y+
Sbjct: 56 ---------------------------CPKAV-------------TPKCALSCQSKYNTE 75
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
Y D NFG AY + N I EI +GPVE S T+Y D +YK G+Y++ AG LG H
Sbjct: 76 YAKDKNFGSSAYYVGRNFSVIQTEIMTNGPVEASFTVYEDFYIYKKGVYQYTAGEVLGGH 135
Query: 295 AIRIIGWGQE 304
AI+IIGWG E
Sbjct: 136 AIKIIGWGTE 145
Score = 91.7 bits (226), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 52/85 (61%)
Query: 362 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 421
TP+C CQ Y+ Y D NFG AY + N I EI +GPVE S T+Y D +YK
Sbjct: 61 TPKCALSCQSKYNTEYAKDKNFGSSAYYVGRNFSVIQTEIMTNGPVEASFTVYEDFYIYK 120
Query: 422 TGIYKHVAGGPLGEHAIRIIGWGQE 446
G+Y++ AG LG HAI+IIGWG E
Sbjct: 121 KGVYQYTAGEVLGGHAIKIIGWGTE 145
>gi|390367767|ref|XP_787947.3| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 146
Score = 125 bits (313), Expect = 6e-26, Method: Composition-based stats.
Identities = 54/88 (61%), Positives = 69/88 (78%), Gaps = 1/88 (1%)
Query: 78 KLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDR 137
K P RLP L + +++LPE FDAR NWP CPTI+E+RDQGSCGS WA GAVEA+SDR
Sbjct: 60 KNPNGRLPKL-ENQTRIKDLPENFDARENWPNCPTIKEVRDQGSCGSCWAFGAVEAISDR 118
Query: 138 VCIASRGKRHVRLSSDDLVSCCKDCGNG 165
+CI S+G+ V +S++DL++CCK CGNG
Sbjct: 119 ICIKSKGQTQVHISAEDLMTCCKTCGNG 146
>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 298
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 84/274 (30%), Positives = 136/274 (49%), Gaps = 39/274 (14%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
L++ E+ + P S L ++R + + + ++P+ FD R +P+C I E+ DQG CG
Sbjct: 42 LTKDEISSLLMPVSFLKRDRAAV-PRGTVSATQVPDSFDFREEYPHC--IPEVVDQGGCG 98
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKAWKYWVTTG 182
S WA +V ++ DR C+A K+ VR S +VSC D G+ C GG+ W++ V TG
Sbjct: 99 SCWAFSSVASVGDRRCVAGLDKKAVRYSPQYVVSC--DRGDMACDGGWLPSVWRFLVKTG 156
Query: 183 IVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPEC-IRKCQPGYDVSYEDDLNF 241
+ C PY+ G+ +C + E I K D + DL
Sbjct: 157 TTT-------DECVPYQ----SGSTGARGTCPTKCADGSELPIYKATKAVDYGLDCDL-- 203
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
IM+ + GP++ + T+Y+D + Y+ G+Y+HV G G HA+ ++G+
Sbjct: 204 -------------IMKALATGGPLQTAFTVYSDFMYYQGGVYQHVYGRAEGGHAVEMVGY 250
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G + V YW++ NS+ +WGE+G FRI
Sbjct: 251 GTDEYD------VDYWIIRNSWGPDWGEDGYFRI 278
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 62/100 (62%), Gaps = 6/100 (6%)
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
+ IM+ + GP++ + T+Y+D + Y+ G+Y+HV G G HA+ ++G+G +
Sbjct: 202 DLIMKALATGGPLQTAFTVYSDFMYYQGGVYQHVYGRAEGGHAVEMVGYGTDEYD----- 256
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
V YW++ NS+ +WGE+G FRI+R NECGIE + G
Sbjct: 257 -VDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 295
>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
pisum]
Length = 169
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 65/165 (39%), Positives = 93/165 (56%), Gaps = 11/165 (6%)
Query: 333 FRIGCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP 391
F +GC PY +P C R +G+ S C R C D+ Y DD F R Y L
Sbjct: 11 FAVGCEPYRVPPCPRNEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLT 70
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGE 450
+I +++ +GP+E S +Y D YK+G+Y+ LG HA+++IGWG E E
Sbjct: 71 YG--SIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGVE---E 125
Query: 451 GTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
G + YWL+ NS++ WG+NGLF+I RG +ECGI++ TAG+P
Sbjct: 126 G----IPYWLMVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVP 166
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/144 (37%), Positives = 77/144 (53%), Gaps = 11/144 (7%)
Query: 194 GCRPYEIP-CERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
GC PY +P C R +G+ S C R C D+ Y DD F R Y L
Sbjct: 14 GCEPYRVPPCPRNEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLTYG- 72
Query: 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTS 311
+I +++ +GP+E S +Y D YK+G+Y+ LG HA+++IGWG E EG
Sbjct: 73 -SIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGVE---EG-- 126
Query: 312 SVVKYWLVANSFNTNWGENGLFRI 335
+ YWL+ NS++ WG+NGLF+I
Sbjct: 127 --IPYWLMVNSWSAQWGDNGLFKI 148
>gi|189308104|gb|ACD86936.1| cysteine protease [Caenorhabditis brenneri]
Length = 210
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 73/192 (38%), Positives = 109/192 (56%), Gaps = 13/192 (6%)
Query: 45 LLPKLPF----YGAEKNALSK------LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPL 94
++PK P Y K +L K +T+ +++ R+ + + P + V+
Sbjct: 20 IVPKTPEAITEYVNSKQSLWKAEIPKHITIEQVKKRL-MRTEFVAPHSPDAEFVKHDIQE 78
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ +P FDAR WP C +I IRDQ CGS WA A EA SDR CIAS G + LS++D
Sbjct: 79 DTIPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAED 138
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PC-ERYMNGSHSS 212
++SCC +CG GC+GG+ AWKY V +G +GG+Y ++ GC+PY + PC E N + +
Sbjct: 139 VLSCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPA 198
Query: 213 CQDNEPNTPECI 224
C + +TP C+
Sbjct: 199 CPTDGYDTPACV 210
>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
Length = 294
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 97/281 (34%), Positives = 128/281 (45%), Gaps = 49/281 (17%)
Query: 57 NALSKLTLSELEMRMGVHPDSKLPQNR-LPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
N + +T +L + G + +P N+ P + +PE FDAR W I
Sbjct: 43 NPFNNMTKEQLLAKCGTY---IVPANKEYP-----GSKIMTVPENFDARQQWG--SKIHA 92
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
IRDQ CGS WA GA EA SDR I + V LS +DLVSC + GC GG+ AW
Sbjct: 93 IRDQQQCGSCWAFGATEAFSDRFAI---NGKDVILSPEDLVSCDTN-DYGCNGGYMDVAW 148
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIR-KCQPGYDVS 234
+Y G A+ C PY +G +C D + R KC P
Sbjct: 149 EYLADHG-------AATDSCFPYSAG-----SGFAPACSDKCADGSAMQRFKCAP----- 191
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
N R + + I EI HGPVEG+ T+Y D Y++G+Y G H
Sbjct: 192 -----NSVRQSKGV----AQIQSEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGGH 242
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
AI+I+G+G E GT YWL ANS+ WG +G F+I
Sbjct: 243 AIKILGYGVE---NGT----PYWLCANSWGPAWGMSGFFKI 276
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 59/101 (58%), Gaps = 9/101 (8%)
Query: 397 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 456
I EI HGPVEG+ T+Y D Y++G+Y G HAI+I+G+G E GT
Sbjct: 203 IQSEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGGHAIKILGYGVE---NGT---- 255
Query: 457 KYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWL ANS+ WG +G F+I +G ECGIE + + P++
Sbjct: 256 PYWLCANSWGPAWGMSGFFKIKQG--ECGIEDQVFSCDPQL 294
>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 308
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/241 (32%), Positives = 121/241 (50%), Gaps = 28/241 (11%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FDAR WP C TI E+ ++G+ GWA ++DR CIA+ G + LS+++L+SC
Sbjct: 69 FDARKRWPKCKTIGEVHNEGNFALGWAYAVAGVLADRTCIATNGGYNKLLSTEELISCSG 128
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP----CERYMNGSHSSCQDN 216
N W+Y + G+VSGG Y S GC+P++ P ++++ +C D+
Sbjct: 129 IKENNGSVPSERSIWEYLKSHGVVSGGKYNSNDGCQPFKFPPIANIPKHLH--KHTCDDH 186
Query: 217 EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 276
C ++Y D R Y++ + I +E+ +GPV + D
Sbjct: 187 ----------CYGNSTINYNHDHVRVRNYYTIRTRD--IQKEVQTYGPVVVRFMVCDDFF 234
Query: 277 LYKTGIYKHV--AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
LYK+G+Y A G ++A ++IGWG E + V YWLV NS+ WG+ GLF+
Sbjct: 235 LYKSGVYAKSDKAKGIRTQYA-KLIGWGVE-------NGVDYWLVINSWGHEWGQKGLFK 286
Query: 335 I 335
I
Sbjct: 287 I 287
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 83/167 (49%), Gaps = 26/167 (15%)
Query: 336 GCRPYEIPCERYMNGSRSSCQANEP---NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA 392
GC+P++ P AN P + C C ++Y D R Y++
Sbjct: 162 GCQPFKFPP-----------IANIPKHLHKHTCDDHCYGNSTINYNHDHVRVRNYYTIRT 210
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--AGGPLGEHAIRIIGWGQEPLGE 450
+ I +E+ +GPV + D LYK+G+Y A G ++A ++IGWG E
Sbjct: 211 RD--IQKEVQTYGPVVVRFMVCDDFFLYKSGVYAKSDKAKGIRTQYA-KLIGWGVE---- 263
Query: 451 GTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ V YWLV NS+ WG+ GLF+I G N+CG+E+ + AGLP+I
Sbjct: 264 ---NGVDYWLVINSWGHEWGQKGLFKIKSGTNQCGVESFVYAGLPEI 307
>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 130/278 (46%), Gaps = 41/278 (14%)
Query: 64 LSELEMR-MGVHPDS-KLPQNRLPL--LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQ 119
++E E R M ++PD K +P L +++DP + LP FD R +P+C + + DQ
Sbjct: 42 VTEDEFRGMLINPDRLKARSGSMPSAPLKEINDPTDPLPAQFDFRDEYPHC--VSPVFDQ 99
Query: 120 GSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWV 179
GSCG WA A+ R C K V S L+SC + GC GG W +
Sbjct: 100 GSCGGCWAFSAIGMFGSRRCAVGIDKAAVLYSQQHLISCSTE-NFGCSGGDFFPTWSFLT 158
Query: 180 TTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
TG + C +Y++ S C C G + +
Sbjct: 159 QTGATTA--------------ECVKYVDYGSSV-------AAACPTTCDDGSQIQFYKAH 197
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL--GEHAIR 297
+G+++ S+PA IM+ + GPV+ + +YAD++ Y G+Y+H GP+ G HA+
Sbjct: 198 GYGQVSKSVPA----IMQMLVSGGPVQTMIVVYADLLYYAGGVYRHTY-GPISNGLHALE 252
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++G+G T YW + NS+ ++WGE+G FRI
Sbjct: 253 MVGYGT------TDDGTDYWTIKNSWGSDWGEDGYFRI 284
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 82/152 (53%), Gaps = 20/152 (13%)
Query: 344 CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 403
C +Y++ S A C C G + + +G+++ S+PA IM+ +
Sbjct: 167 CVKYVDYGSSVAAA-------CPTTCDDGSQIQFYKAHGYGQVSKSVPA----IMQMLVS 215
Query: 404 HGPVEGSMTIYADMILYKTGIYKHVAGGPL--GEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
GPV+ + +YAD++ Y G+Y+H G P+ G HA+ ++G+G T YW +
Sbjct: 216 GGPVQTMIVVYADLLYYAGGVYRHTYG-PISNGLHALEMVGYGT------TDDGTDYWTI 268
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
NS+ ++WGE+G FRIVRG NEC IE +I A
Sbjct: 269 KNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300
>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Ailuropoda melanoleuca]
Length = 472
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 130/296 (43%), Gaps = 53/296 (17%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSD------PLEELPEGFDARINWPYCPTIQ 114
+TL E + R+G P S PLL+ +++ +LPE F A WP
Sbjct: 182 MTLEEGFKYRLGTLPPS-------PLLLSMNEMTASLPATTDLPEFFIASYKWP--GWTH 232
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
DQ +C + WA +DR+ G+ LS +L+SCC +GC G +A
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRI----XGRYTANLSPQNLISCCAKNRHGCNSGSIDRA 288
Query: 175 WKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
W + G+VS Y A+ GC R + C +N + I +C
Sbjct: 289 WWFLRKRGLVSHACYPLFKDQNATNYGCAMASRSDGRGKRHATKPCPNNIEKSNR-IYQC 347
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA 287
P Y VS +NE IM+EI ++GPV+ M ++ D YKTGIY+HV
Sbjct: 348 SPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVT 392
Query: 288 GG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HAI++ GWG G K+W+ ANS+ +WGENG FRI
Sbjct: 393 RTNEESSKYRKLQTHAIKLTGWGTLKGARGQKE--KFWIAANSWGKSWGENGYFRI 446
Score = 95.5 bits (236), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 65/114 (57%), Gaps = 10/114 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YKTGIY+HV L HAI+
Sbjct: 351 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEESSKYRKLQTHAIK 410
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
+ GWG G K+W+ ANS+ +WGENG FRI+RG NE IE I A
Sbjct: 411 LTGWGTLKGARGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 462
>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
protease B3; Flags: Precursor
gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
Length = 299
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 83/273 (30%), Positives = 136/273 (49%), Gaps = 36/273 (13%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
L++ E+ + P S L ++R + + + + P+ FD R +P+C I E+ DQG CG
Sbjct: 42 LTKDEISSLLMPVSFLKRDRA-AVPRGTVSATQAPDSFDFREEYPHC--IPEVVDQGGCG 98
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKAWKYWVTTG 182
S WA +V ++ DR C A K+ V+ S +VSC D G+ C GG+ W++ TG
Sbjct: 99 SCWAFSSVASVGDRRCFAGLDKKAVKYSPQYVVSC--DRGDMACDGGWLPSVWRFLTKTG 156
Query: 183 IVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
+ C PY+ +GS + C KC G D+ + L
Sbjct: 157 TTT-------DECVPYQ-------SGSTGA-------RGTCPTKCADGSDLPH---LYKA 192
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 302
A + IM+ + GP++ + T+Y+D + Y++G+Y+H G G HA+ ++G+G
Sbjct: 193 TKAVDYGLDAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDMVGYG 252
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ G V YW++ NS+ +WGE+G FRI
Sbjct: 253 TDDDG------VDYWIIKNSWGPDWGEDGYFRI 279
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 72/130 (55%), Gaps = 9/130 (6%)
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C KC G D+ + L A + IM+ + GP++ + T+Y+D + Y++G+
Sbjct: 176 CPTKCADGSDLPH---LYKATKAVDYGLDAPAIMKALATGGPLQTAFTVYSDFMYYESGV 232
Query: 425 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNEC 484
Y+H G G HA+ ++G+G + G V YW++ NS+ +WGE+G FRI+R NEC
Sbjct: 233 YQHTYGRVEGGHAVDMVGYGTDDDG------VDYWIIKNSWGPDWGEDGYFRIIRMTNEC 286
Query: 485 GIEADITAGL 494
GIE + G
Sbjct: 287 GIEEQVIGGF 296
>gi|161343827|tpg|DAA06094.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 207
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 65/171 (38%), Positives = 93/171 (54%), Gaps = 21/171 (12%)
Query: 52 YGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLL-----VQL---------------- 90
Y EK+ ++K+ + GV+ D K P+ + L VQ+
Sbjct: 22 YFLEKDYINKINEQATTWKAGVNFDPKTPKEHILKLLGSKGVQIPSKVNYKMYKSEDENY 81
Query: 91 SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
+ L +P FDAR W C TI IRDQG+CGS WAL A +DR+C+AS G + L
Sbjct: 82 DNLLGRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAFADRLCVASNGNFNQLL 141
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
S+++L CC CG GC GG+ KAW+ ++ G+V+GG Y S++GC PY +P
Sbjct: 142 SAEELTFCCHKCGFGCNGGYPIKAWERFMKHGLVTGGDYKSREGCEPYRVP 192
>gi|402585445|gb|EJW79385.1| hypothetical protein WUBG_09708, partial [Wuchereria bancrofti]
Length = 190
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 98/178 (55%), Gaps = 30/178 (16%)
Query: 81 QNRLPLLVQLSDPLEEL----------PEGFDARINWPYCPTIQEIRDQGSCGSGWALGA 130
+N++ L D +E + PE FDAR+ WP C ++ ++ +QG CGS WA+ A
Sbjct: 22 KNKISQEYLLGDTIEHIKLLKSKKLHLPEQFDARLQWPLCWSVHQVANQGGCGSCWAISA 81
Query: 131 VEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG-FHGKAWKYWVTTGIVSGGTY 189
MSDR+CIA+ ++S++DL+SCC +CG GCQG + A+ YW GIV+GG Y
Sbjct: 82 ASVMSDRLCIATNYSNQKQISAEDLISCCAECG-GCQGSNWALSAFIYWRNHGIVTGGDY 140
Query: 190 ASKQGCRPYEI------PC--ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
S +GC+PY PC E Y + P C + CQP Y +SYE+DL
Sbjct: 141 GSFEGCKPYATAPNCGSPCSFEYY----------RKKAAPICQKTCQPLYGLSYEEDL 188
>gi|393902164|gb|EFO13452.2| hypothetical protein LOAG_15077, partial [Loa loa]
Length = 186
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/165 (40%), Positives = 94/165 (56%), Gaps = 20/165 (12%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP+ FDAR+ WP C ++ + +QG CGS WA+ A MSDR+CIA+ ++S++DL+
Sbjct: 11 LPKHFDARLRWPLCWSVHVVANQGGCGSCWAISAASVMSDRLCIATNYSNQKQISAEDLI 70
Query: 157 SCCKDCGNGCQGGFHG-KAWKYWVTTGIVSGGTYASKQGCRPYEI------PC--ERYMN 207
SCC +CG GCQG A+ YW G+V+GG Y S +GC+PY PC E Y
Sbjct: 71 SCCTECG-GCQGSHWALSAFIYWRNHGVVTGGDYGSFEGCKPYTTAPNCGSPCSFEYY-- 127
Query: 208 GSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
+P C + CQP Y +SYE+DL + AY + A +
Sbjct: 128 --------RRKISPACQKTCQPLYGLSYEEDLISSQKAYWIRAQK 164
>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
Length = 283
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 131/294 (44%), Gaps = 58/294 (19%)
Query: 77 SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSD 136
SKL P V+ ++P+ FDAR WP I +RDQG CGS WA E + D
Sbjct: 43 SKLGARFTPHRVRPYRDSNKVPDTFDAREKWP--DAILPVRDQGECGSCWAFSIAETIGD 100
Query: 137 RVCI--ASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQG 194
R+ + SRG ++ +DLVS C +GC GGF AW + G+ + +
Sbjct: 101 RLGVLGCSRGD----IAPEDLVS-CDIFDDGCDGGFIDMAWDWCQENGLTT-------EE 148
Query: 195 CRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEET 254
C PY + G S C + + R Y DD
Sbjct: 149 CIPY-----KAGEGVPSPCPETCEDGSAIYRTPIESYRYIDADD---------------- 187
Query: 255 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 314
I EI+ +GPV +Y+D + YK+G+Y H AG G HA+ I+GWG E V
Sbjct: 188 IQGEIYEYGPVSMGFIVYSDFMSYKSGVYVHQAGYIEGGHAVLIVGWGVE-------DEV 240
Query: 315 KYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMNGS-RSSCQAN-EPNTPECI 366
YWLV NS+ T+WGENG F+I + GS C++N PECI
Sbjct: 241 PYWLVQNSWGTDWGENGFFKI------------LRGSDHCECESNVTAGYPECI 282
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/170 (35%), Positives = 89/170 (52%), Gaps = 15/170 (8%)
Query: 335 IGCRPYEIPCER-YMNGSRSSCQANEPNTPECI-RKCQPGYDVSYEDDLNFGRIAYSLPA 392
+ C ++ C+ +++ + CQ N T ECI K G + G Y P
Sbjct: 118 VSCDIFDDGCDGGFIDMAWDWCQENGLTTEECIPYKAGEGVPSPCPETCEDGSAIYRTPI 177
Query: 393 ------NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+ + I EI+ +GPV +Y+D + YK+G+Y H AG G HA+ I+GWG E
Sbjct: 178 ESYRYIDADDIQGEIYEYGPVSMGFIVYSDFMSYKSGVYVHQAGYIEGGHAVLIVGWGVE 237
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
V YWLV NS+ T+WGENG F+I+RG + C E+++TAG P+
Sbjct: 238 -------DEVPYWLVQNSWGTDWGENGFFKILRGSDHCECESNVTAGYPE 280
>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 130/278 (46%), Gaps = 41/278 (14%)
Query: 64 LSELEMR-MGVHPDS-KLPQNRLPL--LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQ 119
++E E R M ++PD K +P L +++DP + LP FD R +P+C + + DQ
Sbjct: 42 VTEDEFRGMLINPDRLKARSGSMPSAPLKEINDPTDPLPAQFDFRDEYPHC--VSPVFDQ 99
Query: 120 GSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWV 179
GSCG WA A+ R C K V S L+SC + GC GG W +
Sbjct: 100 GSCGGCWAFSAIGMFGSRRCAVGIDKAAVLYSQQHLISCSTE-NFGCSGGDFFPTWSFLT 158
Query: 180 TTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
TG + C +Y++ S C C G + +
Sbjct: 159 QTGATTA--------------ECVKYVDYGSSV-------AAACPTTCDDGSQIQFYKAH 197
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL--GEHAIR 297
+G+++ S+PA IM+ + GPV+ + +YAD++ Y G+Y+H GP+ G HA+
Sbjct: 198 GYGQLSKSVPA----IMQMLVSGGPVQTMIVVYADLLYYAGGVYRHTY-GPISNGLHALE 252
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++G+G T YW + NS+ ++WGE+G FRI
Sbjct: 253 MVGYGT------TDDGTDYWTIKNSWGSDWGEDGYFRI 284
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 82/152 (53%), Gaps = 20/152 (13%)
Query: 344 CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFR 403
C +Y++ S A C C G + + +G+++ S+PA IM+ +
Sbjct: 167 CVKYVDYGSSVAAA-------CPTTCDDGSQIQFYKAHGYGQLSKSVPA----IMQMLVS 215
Query: 404 HGPVEGSMTIYADMILYKTGIYKHVAGGPL--GEHAIRIIGWGQEPLGEGTSSVVKYWLV 461
GPV+ + +YAD++ Y G+Y+H G P+ G HA+ ++G+G T YW +
Sbjct: 216 GGPVQTMIVVYADLLYYAGGVYRHTYG-PISNGLHALEMVGYGT------TDDGTDYWTI 268
Query: 462 ANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
NS+ ++WGE+G FRIVRG NEC IE +I A
Sbjct: 269 KNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300
>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/274 (31%), Positives = 127/274 (46%), Gaps = 48/274 (17%)
Query: 62 LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+T ++L R+G ++ P N +P LP+ FDAR WP I +R+Q
Sbjct: 36 ITTAKLRARLGAIDLNEGPSNYVPDT--------SLPDNFDAREQWP--GKILPVRNQEQ 85
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA E +R+ I G+ +S DLVSC K +GC GG +W++ +
Sbjct: 86 CGSCWAFAVAETTGNRLNILGCGRGD--MSPQDLVSCDK-VDHGCNGGSPLFSWEWVKHS 142
Query: 182 GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNF 241
GI + + C PY R P C +KC G + +
Sbjct: 143 GITT-------EECIPYVSGGGR---------------VPSCPKKCTNGSAIVRTKAKSV 180
Query: 242 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 301
G + + + E++ GP E + ++Y D YK+G+Y H+ G LG HA+ ++GW
Sbjct: 181 GLV------KGDKMQNELYSRGPFEAAFSVYEDFKSYKSGVYHHITGKMLGGHAVMVVGW 234
Query: 302 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G E +GT YWL+ NS+ T WGE G F+I
Sbjct: 235 GVE---DGT----PYWLIQNSWGTTWGEQGFFKI 261
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/132 (37%), Positives = 72/132 (54%), Gaps = 13/132 (9%)
Query: 362 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 421
P C +KC G + + G + + + E++ GP E + ++Y D YK
Sbjct: 159 VPSCPKKCTNGSAIVRTKAKSVGLV------KGDKMQNELYSRGPFEAAFSVYEDFKSYK 212
Query: 422 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ 481
+G+Y H+ G LG HA+ ++GWG E +GT YWL+ NS+ T WGE G F+I+RG+
Sbjct: 213 SGVYHHITGKMLGGHAVMVVGWGVE---DGT----PYWLIQNSWGTTWGEQGFFKILRGK 265
Query: 482 NECGIEADITAG 493
NECGIE G
Sbjct: 266 NECGIETTCFQG 277
>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 388
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 70/165 (42%), Positives = 97/165 (58%), Gaps = 15/165 (9%)
Query: 336 GCRPYEIP-CERYMNGS-RSSCQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRI-AYSLP 391
GC PY P C +++ C+ N P +P C C+ + S+E D +F YSL
Sbjct: 221 GCWPYNFPECSHHVDTKGMEPCKGNSP-SPVCSTTCRNHHFKPSFESDRHFTEDEGYSLD 279
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 451
+E I REI +GPV + T+Y D YK+G+YKHV G LG HA++IIGW G
Sbjct: 280 EVDE-IKREIIDNGPVAAAFTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGW-------G 331
Query: 452 TSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
+YWLV NS+N NWG+ G+F+I G ECGI++++TAG+PK
Sbjct: 332 IDQNEQYWLVMNSWNVNWGDQGIFKIAIG--ECGIDSEVTAGIPK 374
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 125/269 (46%), Gaps = 30/269 (11%)
Query: 77 SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSD 136
SKLP+ S L LP+ FDAR ++ C T+ A G + +
Sbjct: 111 SKLPKKP----ASESTVLSNLPDRFDAREHFKNCATVIGHVSPPVV----AAGLLRRLKH 162
Query: 137 RVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVS---GGTYASK- 192
+ + + L++ + + FH A + +++ G T+A +
Sbjct: 163 SAIVCASARVDSLTWYHFLLATLRHVAQKKKVAFHLVA----MAVNLIAHGGGSTFAPEL 218
Query: 193 -QGCRPYEIP-CERYMNGS-HSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRI-AYS 247
GC PY P C +++ C+ N P +P C C+ + S+E D +F YS
Sbjct: 219 DSGCWPYNFPECSHHVDTKGMEPCKGNSP-SPVCSTTCRNHHFKPSFESDRHFTEDEGYS 277
Query: 248 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 307
L +E I REI +GPV + T+Y D YK+G+YKHV G LG HA++IIGW
Sbjct: 278 LDEVDE-IKREIIDNGPVAAAFTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGW------ 330
Query: 308 EGTSSVVKYWLVANSFNTNWGENGLFRIG 336
G +YWLV NS+N NWG+ G+F+I
Sbjct: 331 -GIDQNEQYWLVMNSWNVNWGDQGIFKIA 358
>gi|157058731|gb|ABV03123.1| cathepsin B-16D1 [Acyrthosiphon pisum]
Length = 243
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 87/154 (56%), Gaps = 1/154 (0%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+P FDAR W C TI +RDQG+CGS WA+ A +DR+C+A+ + LS++++
Sbjct: 85 RIPRHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 144
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
CC CG GC GG+ KAW+ + G+V+GG Y S +GC PY +P Y H++C
Sbjct: 145 TFCCYSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAG 204
Query: 216 N-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 248
+ C R C D+ +++D + R +Y L
Sbjct: 205 KPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYL 238
>gi|157058775|gb|ABV03145.1| cathepsin B-16D [Myzus persicae]
Length = 236
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 87/153 (56%), Gaps = 1/153 (0%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W +C TI +RDQG+CGS WA+ A +DR+C+A+ + LS++++
Sbjct: 81 IPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSAFADRLCVATNADFNELLSAEEIT 140
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ KAWK + G+V+GG Y S +GC PY + PC G+++
Sbjct: 141 FCCHTCGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQGNNTCAGK 200
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL 248
+ C R C D+ +++D + R Y L
Sbjct: 201 PMESNHRCTRMCYGDQDLDFDEDHRYTRDYYYL 233
>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
protease B1; Flags: Precursor
Length = 303
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 86/280 (30%), Positives = 127/280 (45%), Gaps = 45/280 (16%)
Query: 64 LSELEMR-MGVHPD------SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
++E E R M + PD LP + + +L DP+ P FD R +P C ++
Sbjct: 42 VTEDEFRSMLIRPDRLRARSGSLPPISITEVQELVDPI---PPQFDFRDEYPQC--VKPA 96
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQGSCGS WA A+ DR C K V S L+SC + GC GG W
Sbjct: 97 LDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLE-NFGCDGGDFQPTWS 155
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
+ TG + C +Y++ H+ C C G +
Sbjct: 156 FLTFTGATTA--------------ECVKYVDYGHTVAS-------PCPAVCDDGSPIQLY 194
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-PLGEHA 295
+G+++ S+PA IM + GP++ + +YAD+ Y++G+YKH G LG HA
Sbjct: 195 KAHGYGQVSKSVPA----IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHA 250
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ I+G+G T YW++ NS+ +WGENG FRI
Sbjct: 251 LEIVGYGT------TDDGTDYWIIKNSWGPDWGENGYFRI 284
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/129 (38%), Positives = 73/129 (56%), Gaps = 11/129 (8%)
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C C G + +G+++ S+PA IM + GP++ + +YAD+ Y++G+
Sbjct: 181 CPAVCDDGSPIQLYKAHGYGQVSKSVPA----IMGMLVAGGPLQTMIVVYADLSYYESGV 236
Query: 425 YKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
YKH G LG HA+ I+G+G T YW++ NS+ +WGENG FRIVRG NE
Sbjct: 237 YKHTYGTINLGFHALEIVGYGT------TDDGTDYWIIKNSWGPDWGENGYFRIVRGVNE 290
Query: 484 CGIEADITA 492
C IE +I A
Sbjct: 291 CRIEDEIYA 299
>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
Length = 487
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 127/273 (46%), Gaps = 29/273 (10%)
Query: 67 LEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSG 125
L+ R+G + P K+ Q +PL + LP FD R + I + DQG CG+
Sbjct: 193 LKWRLGTLQPPEKILQ-VVPLKAVFHQDYQ-LPSSFDLRK--VFGDKITDPIDQGWCGAS 248
Query: 126 WALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVS 185
WA+ + +DR I ++G LS L+SC D GCQGG AW + +T G+V+
Sbjct: 249 WAISTAQVTTDRFVIMTKGLMRDALSPKHLLSCNNDLQRGCQGGHLTSAWNWVMTFGLVT 308
Query: 186 GGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA 245
Y + +R N +C + +P +R+ Y V
Sbjct: 309 EECYPWDGRATDCAVSNQRSNNNLIVTCPRSAKTSP--LRRVGLMYRV------------ 354
Query: 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWG 302
A EE IM EI G V+ M + + +Y++G+YK G G H +RI+GWG
Sbjct: 355 ----ATEEGIMYEIMNWGSVQAMMKVSKEFFMYESGVYKCSKLDLGSKTGYHTVRIVGWG 410
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+E + VKYW+V+NS+ WGE+G FRI
Sbjct: 411 EE---QQNGRTVKYWIVSNSWGLWWGESGYFRI 440
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 46/109 (42%), Positives = 65/109 (59%), Gaps = 6/109 (5%)
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---GGPLGEHAIRIIGWGQEPL 448
A EE IM EI G V+ M + + +Y++G+YK G G H +RI+GWG+E
Sbjct: 355 ATEEGIMYEIMNWGSVQAMMKVSKEFFMYESGVYKCSKLDLGSKTGYHTVRIVGWGEE-- 412
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ VKYW+V+NS+ WGE+G FRI++G NEC IE + A +P I
Sbjct: 413 -QQNGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVAAMPDI 460
>gi|328712827|ref|XP_003244913.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 487
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 85/273 (31%), Positives = 127/273 (46%), Gaps = 29/273 (10%)
Query: 67 LEMRMG-VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSG 125
L+ R+G + P K+ Q +PL + LP FD R + I + DQG CG+
Sbjct: 193 LKWRLGTLQPPEKILQ-VVPLKAVFHQDYQ-LPSSFDLRK--VFGDKITDPIDQGWCGAS 248
Query: 126 WALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVS 185
WA+ + +DR I ++G LS L+SC D GCQGG AW + +T G+V+
Sbjct: 249 WAISTAQVTTDRFVIMTKGLMRDALSPKHLLSCNNDLQRGCQGGHLTSAWNWVMTFGLVT 308
Query: 186 GGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA 245
Y + +R N +C + +P +R+ Y V
Sbjct: 309 EECYPWDGRATDCAVSNQRSNNNLIVTCPRSAKTSP--LRRVGLMYRV------------ 354
Query: 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK---HVAGGPLGEHAIRIIGWG 302
A EE IM EI G V+ M + + +Y++G+Y+ G G H +RI+GWG
Sbjct: 355 ----ATEEGIMYEIMNWGSVQAMMKVSKEFFMYESGVYRCSNLALGSKTGYHTVRIVGWG 410
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+E + VKYW+V+NS+ WGE+G FRI
Sbjct: 411 EE---QQNGRTVKYWIVSNSWGLWWGESGYFRI 440
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 65/110 (59%), Gaps = 6/110 (5%)
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK---HVAGGPLGEHAIRIIGWGQEPL 448
A EE IM EI G V+ M + + +Y++G+Y+ G G H +RI+GWG+E
Sbjct: 355 ATEEGIMYEIMNWGSVQAMMKVSKEFFMYESGVYRCSNLALGSKTGYHTVRIVGWGEE-- 412
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+ VKYW+V+NS+ WGE+G FRI++G NEC IE + A + IG
Sbjct: 413 -QQNGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVAAMADIG 461
>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 300
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 118/242 (48%), Gaps = 35/242 (14%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+++PE FD R +P+C I E+ DQG CGS WA +V DR C+A K+ V+ S
Sbjct: 73 DDVPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQY 130
Query: 155 LVSCCKDCGN-GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC 213
+VSC D G+ C GG+ WK+ TG + C PY+ + C
Sbjct: 131 VVSC--DHGDMACNGGWLPNVWKFLTKTGTTT-------DECVPYKSGSTTLRGTCPTKC 181
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
D K SY+D +PA +M+ + GP++ + +Y+
Sbjct: 182 ADGS-------SKVHLATATSYKD------YGLDIPA----MMKALSTSGPLQVAFLVYS 224
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D + Y++G+Y+H G G HA+ ++G+G + G V YW++ NS+ +WGE+G F
Sbjct: 225 DFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDG------VDYWIIRNSWGPDWGEDGYF 278
Query: 334 RI 335
R+
Sbjct: 279 RM 280
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 71/131 (54%), Gaps = 11/131 (8%)
Query: 365 CIRKCQPGYD-VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 423
C KC G V ++ +PA +M+ + GP++ + +Y+D + Y++G
Sbjct: 177 CPTKCADGSSKVHLATATSYKDYGLDIPA----MMKALSTSGPLQVAFLVYSDFMYYESG 232
Query: 424 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
+Y+H G G HA+ ++G+G + G V YW++ NS+ +WGE+G FR++RG N+
Sbjct: 233 VYQHTYGYMEGGHAVEMVGYGTDDDG------VDYWIIRNSWGPDWGEDGYFRMIRGIND 286
Query: 484 CGIEADITAGL 494
C IE AG
Sbjct: 287 CSIEEQAYAGF 297
>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
Length = 495
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 90/290 (31%), Positives = 131/290 (45%), Gaps = 34/290 (11%)
Query: 59 LSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLS-DPLEELPEGFDARINWPYCPTIQEI 116
L +TL + ++ R+G PQ + + L D E +P+ FDAR WP I +
Sbjct: 177 LWGMTLKDGIKYRLGTFK----PQGMIEEMSSLKVDADEVMPDEFDAREEWP--SFIHPV 230
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKAW 175
+DQG+CG+ +A +DR+ I S G+ LS+ L+SC D GC+GG +AW
Sbjct: 231 QDQGNCGASYAFSTSTVAADRLSIHSGGELKDMLSAQYLISCTTDHHQKGCEGGHVDRAW 290
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
G VS Y G C ++ P+ +C G ++
Sbjct: 291 WQLRRVGTVSKDCYPYTSG-----------DTNDPGKCLMSKYKLPKKNIECPVGQGIT- 338
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG------- 288
L Y + A E IM EI +GPV+ M + D Y+ G+YKH
Sbjct: 339 -SKLYQASPPYRIAAKEREIMNEIILNGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYP 397
Query: 289 --GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
G H++RIIGWG + G+ +KYWL AN++ +WGE G FRI
Sbjct: 398 HLGKEAYHSVRIIGWGTDYTGD---DPIKYWLAANTWGRHWGEGGFFRIA 444
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 63/119 (52%), Gaps = 12/119 (10%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG---------GPLGEHAI 438
Y + A E IM EI +GPV+ M + D Y+ G+YKH G H++
Sbjct: 348 YRIAAKEREIMNEIILNGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHLGKEAYHSV 407
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
RIIGWG + G+ +KYWL AN++ +WGE G FRI RG +E IE+ + K+
Sbjct: 408 RIIGWGTDYTGD---DPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVVGVWGKV 463
>gi|227018340|gb|ACP18836.1| cysteine proteinase 3 [Chrysomela tremula]
Length = 190
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 59/107 (55%), Positives = 73/107 (68%), Gaps = 1/107 (0%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+PE FDAR NWP C +I+ IRDQ CGS WA+ A A+SDR+CI S G +S +DL+
Sbjct: 83 IPENFDARENWPECESIRMIRDQSDCGSCWAVAAAAAVSDRICIYSYGANQTIVSDEDLL 142
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PC 202
SCC DCG GC GG+ +AW YW GIVSGG Y S +GC+ Y + PC
Sbjct: 143 SCCDDCGFGCDGGYSWEAWNYWKNDGIVSGGPYNSTRGCKAYSMQPC 189
>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 298
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 75/241 (31%), Positives = 122/241 (50%), Gaps = 36/241 (14%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++P+ FD R +P+C I E+ DQGSCGS WA +V ++ DR C A K+ V S +
Sbjct: 73 KVPDSFDFREEYPHC--IPEVVDQGSCGSCWAFSSVASLGDRRCFAGLDKKAVTYSPQYV 130
Query: 156 VSCCKDCGN-GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQ 214
VSC D G+ C GG+ W++ TG + C PY+ G+ +C
Sbjct: 131 VSC--DHGDMACDGGWLQSVWRFLTKTGTTT-------NECVPYQ----SGTTGARGTCP 177
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
KC G ++S + A + + IM+ + GP++ + T+Y+D
Sbjct: 178 ----------TKCADGGELSTVK----AKKAVDYGLDCDLIMKALVTGGPLQTAFTVYSD 223
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ Y+ G+Y+H++G G HA+ ++G+G + V YW++ NS+ +WGE+G FR
Sbjct: 224 FMYYEGGVYQHMSGRVEGGHAVEMVGYGTDEYD------VDYWIIRNSWGPDWGEDGYFR 277
Query: 335 I 335
I
Sbjct: 278 I 278
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 74/130 (56%), Gaps = 10/130 (7%)
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C KC G ++S + A + + IM+ + GP++ + T+Y+D + Y+ G+
Sbjct: 176 CPTKCADGGELSTVK----AKKAVDYGLDCDLIMKALVTGGPLQTAFTVYSDFMYYEGGV 231
Query: 425 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNEC 484
Y+H++G G HA+ ++G+G + V YW++ NS+ +WGE+G FRI+R NEC
Sbjct: 232 YQHMSGRVEGGHAVEMVGYGTDEYD------VDYWIIRNSWGPDWGEDGYFRIIRMTNEC 285
Query: 485 GIEADITAGL 494
GIE + G+
Sbjct: 286 GIEEQVMGGI 295
>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
Length = 404
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 87/244 (35%), Positives = 118/244 (48%), Gaps = 46/244 (18%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+ P+ FDAR W I I DQ CGS WA+ + DR I S G +VR+SS L
Sbjct: 184 QYPDEFDARREWY--GYISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQTL 241
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
+SC GC GG A+ + T G+VS + C PYE G+ + C+
Sbjct: 242 LSCHLKGQRGCNGGNLDIAFDFVKTHGLVS-------EQCFPYE--------GAVTQCRI 286
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
+C R Y V + +S+ + EE IM +I GP G MT+Y D
Sbjct: 287 GN----DCRR-----YRVG---------VPFSI-SKEEDIMYDIMTSGPALGIMTVYQDF 327
Query: 276 ILYKTGIYKHVAGGPL---GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
Y+ GIY+H G G H++RI+GWG++ KYW+VANS+ T+WGE G
Sbjct: 328 FHYREGIYRHTRHGDQLMRGLHSVRIVGWGEDAED-------KYWIVANSWGTSWGEKGY 380
Query: 333 FRIG 336
FRI
Sbjct: 381 FRIA 384
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 47/109 (43%), Positives = 64/109 (58%), Gaps = 10/109 (9%)
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL---GEHAIRIIGWGQEPL 448
+ EE IM +I GP G MT+Y D Y+ GIY+H G G H++RI+GWG++
Sbjct: 302 SKEEDIMYDIMTSGPALGIMTVYQDFFHYREGIYRHTRHGDQLMRGLHSVRIVGWGEDAE 361
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
KYW+VANS+ T+WGE G FRI RG + GIE+ + LP +
Sbjct: 362 D-------KYWIVANSWGTSWGEKGYFRIARGHSGTGIESSVLTVLPYV 403
>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
Length = 269
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 126/280 (45%), Gaps = 45/280 (16%)
Query: 64 LSELEMR-MGVHPD------SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
++E E R M + PD LP + + +L DP+ P FD R +P C ++
Sbjct: 8 VTEDEFRSMLIRPDRLRARSGSLPPISITEVQELVDPI---PPQFDFRDEYPQC--VKPA 62
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQGSCG WA A+ DR C K V S L+SC + GC GG W
Sbjct: 63 LDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLE-NFGCDGGDFQPTWS 121
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
+ TG + C +Y++ H+ C C G +
Sbjct: 122 FLTFTGATTA--------------ECVKYVDYGHTVAS-------PCPAVCDDGSPIQLY 160
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-PLGEHA 295
+G+++ S+PA IM + GP++ + +YAD+ Y++G+YKH G LG HA
Sbjct: 161 KAHGYGQVSKSVPA----IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHA 216
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ I+G+G T YW++ NS+ +WGENG FRI
Sbjct: 217 LEIVGYGT------TDDGTDYWIIKNSWGPDWGENGYFRI 250
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/129 (38%), Positives = 73/129 (56%), Gaps = 11/129 (8%)
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C C G + +G+++ S+PA IM + GP++ + +YAD+ Y++G+
Sbjct: 147 CPAVCDDGSPIQLYKAHGYGQVSKSVPA----IMGMLVAGGPLQTMIVVYADLSYYESGV 202
Query: 425 YKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
YKH G LG HA+ I+G+G T YW++ NS+ +WGENG FRIVRG NE
Sbjct: 203 YKHTYGTINLGFHALEIVGYGT------TDDGTDYWIIKNSWGPDWGENGYFRIVRGVNE 256
Query: 484 CGIEADITA 492
C IE +I A
Sbjct: 257 CRIEDEIYA 265
>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
(Silurana) tropicalis]
Length = 494
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 137/302 (45%), Gaps = 36/302 (11%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL E ++ R+G S N + V +++ + LP F+A WP + E DQG
Sbjct: 202 MTLDEGIQYRLGTAKPSSSVMNMNEIHVNMNNDI--LPSHFNAAEKWP--GLVHEPLDQG 257
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
+C WA SDR+ I S G LS +L+SC +GC+GG AW Y
Sbjct: 258 NCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLLSCDTRNQHGCRGGRVDGAWWYLRR 317
Query: 181 TGIVSGGTYA-SKQGCRPYEIPC---ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
G+VS Y + + PC R M +N PN
Sbjct: 318 RGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPN------------QYYSS 365
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL----- 291
+++ AY L ++E+ IM+E++ +GPV+ M ++ D +YK+GIY+H
Sbjct: 366 NEIYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRHTPVTEREPEHH 425
Query: 292 ---GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYM 348
G H+++I G G++ KYWL ANS+ +WGE+G FRI E E ++
Sbjct: 426 RRHGTHSVKITG-GRD------GQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFI 478
Query: 349 NG 350
G
Sbjct: 479 VG 480
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 71/122 (58%), Gaps = 15/122 (12%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL--------GEHAI 438
AY L ++E+ IM+E++ +GPV+ M ++ D +YK+GIY+H G H++
Sbjct: 374 AYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRHTPVTEREPEHHRRHGTHSV 433
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+I G G++ KYWL ANS+ +WGE+G FRI RG+NEC IE I ++
Sbjct: 434 KITG-GRD------GQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIVGVWGRVS 486
Query: 499 LE 500
+E
Sbjct: 487 ME 488
>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 96/290 (33%), Positives = 133/290 (45%), Gaps = 58/290 (20%)
Query: 62 LTLSELEMRMGVHPDSKLPQNRLPLL-VQLSDPLEELPEGFDARINWPYCPTIQEIRDQG 120
+TL+++ +G + LPL V+ +P +PE FDAR WP I +RDQ
Sbjct: 36 ITLAKMRAMLG--------EEVLPLEDVEYVEP-NNVPENFDAREQWP--GKIYPVRDQA 84
Query: 121 SCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVT 180
SCGS WA A EA+ +R I GK LS DLVSC K +GC GG + K+ V+
Sbjct: 85 SCGSCWAHAASEAIGNRFSIKGCGKG--MLSVQDLVSCDKG-DSGCNGGSGPLSSKWLVS 141
Query: 181 TGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLN 240
G+ + + C PY P C KC G +
Sbjct: 142 NGVTT-------EECLPY---------------VSGNGRVPACAAKCSNGSQI------- 172
Query: 241 FGRIAYSLPANE----ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAI 296
I Y E + I E+ ++GPV T+Y+D + YK+G+Y+H +G G HA+
Sbjct: 173 ---IRYKYEKAETYTVQNIQEELMKNGPVYFRFTVYSDFMNYKSGVYQHKSGYQEGGHAV 229
Query: 297 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCER 346
+IGWG E V YWL+ NS+ WGE G F+I E CE+
Sbjct: 230 LLIGWGVE-------DGVPYWLLQNSWGPAWGEKGHFKIIRGKNECGCEQ 272
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 70/136 (51%), Gaps = 21/136 (15%)
Query: 362 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE----ETIMREIFRHGPVEGSMTIYADM 417
P C KC G + I Y E + I E+ ++GPV T+Y+D
Sbjct: 159 VPACAAKCSNGSQI----------IRYKYEKAETYTVQNIQEELMKNGPVYFRFTVYSDF 208
Query: 418 ILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 477
+ YK+G+Y+H +G G HA+ +IGWG E V YWL+ NS+ WGE G F+I
Sbjct: 209 MNYKSGVYQHKSGYQEGGHAVLLIGWGVE-------DGVPYWLLQNSWGPAWGEKGHFKI 261
Query: 478 VRGQNECGIEADITAG 493
+RG+NECG E AG
Sbjct: 262 IRGKNECGCEQGFYAG 277
>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
Length = 112
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 60/119 (50%), Positives = 79/119 (66%), Gaps = 7/119 (5%)
Query: 377 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 436
YE D G+ +Y++ E IM EI ++GPV+G ++ D ++YK+GIY + G +G H
Sbjct: 1 YEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGH 60
Query: 437 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
AIR+IGWG E + VKYWL+ANS+N WGE G FR+ RG NECGIEA I AGLP
Sbjct: 61 AIRVIGWGVE-------NGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 112
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 65/101 (64%), Gaps = 7/101 (6%)
Query: 235 YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH 294
YE D G+ +Y++ E IM EI ++GPV+G ++ D ++YK+GIY + G +G H
Sbjct: 1 YEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGH 60
Query: 295 AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
AIR+IGWG E + VKYWL+ANS+N WGE G FR+
Sbjct: 61 AIRVIGWGVE-------NGVKYWLIANSWNEGWGEKGYFRM 94
>gi|290977636|ref|XP_002671543.1| predicted protein [Naegleria gruberi]
gi|284085113|gb|EFC38799.1| predicted protein [Naegleria gruberi]
Length = 268
Score = 121 bits (304), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 82/247 (33%), Positives = 117/247 (47%), Gaps = 45/247 (18%)
Query: 62 LTLSELEMRMG---VHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
+TL E +G + P + LP+ ++P ++ + FDAR W C I EIR+
Sbjct: 58 MTLREARKYLGTVIISPINNLPKKKMPKNLKAA-------SHFDAREKWEDC--IHEIRN 108
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
Q CGS WA A EA SDR+CIA+ G ++ LS +VS C GC GG+ AW +
Sbjct: 109 QEECGSCWAFSASEAFSDRLCIATNGSVNIVLSPQYMVS-CDATDYGCDGGYLNNAWNFL 167
Query: 179 VTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPG-----YDV 233
TGI S C PY+ S + P+ + +KCQ G Y V
Sbjct: 168 ANTGIPS-------DECVPYQ------------SGSGHVPSCSKLNKKCQDGSDIKLYKV 208
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGE 293
S + N I E I ++I +G ++ ++Y D YK+G+Y HV G G
Sbjct: 209 SKKSIANLDSI--------EDIQKDIQENGSIQSGFSVYKDFFSYKSGVYHHVTGSLAGG 260
Query: 294 HAIRIIG 300
HAI++IG
Sbjct: 261 HAIKVIG 267
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 50/104 (48%), Gaps = 18/104 (17%)
Query: 344 CERYMNGSRSSCQANEPNTPECIRKCQPG-----YDVSYEDDLNFGRIAYSLPANEETIM 398
C Y +GS + P+ + +KCQ G Y VS + N I E I
Sbjct: 177 CVPYQSGS-----GHVPSCSKLNKKCQDGSDIKLYKVSKKSIANLDSI--------EDIQ 223
Query: 399 REIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 442
++I +G ++ ++Y D YK+G+Y HV G G HAI++IG
Sbjct: 224 KDIQENGSIQSGFSVYKDFFSYKSGVYHHVTGSLAGGHAIKVIG 267
>gi|161343831|tpg|DAA06096.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 194
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 67/164 (40%), Positives = 88/164 (53%), Gaps = 6/164 (3%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
L+ + MGV P +KL + L E LPE +D W C ++ IRDQ +CG
Sbjct: 32 LTNVSRLMGVLPRNKLSEKDTLLTYDSPAGSEPLPESYDVTQTWSECKSVVSIRDQSNCG 91
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK-DCGNGCQGGFHGKAWKYWVTTG 182
S WAL A S R+CIAS ++ LS + + SCC CG+GC GG KAWKY G
Sbjct: 92 SCWALSTASAFSGRLCIASNMDFNIVLSGEYINSCCNGKCGDGCNGGHPEKAWKYIKKNG 151
Query: 183 IVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIR 225
+ +GG Y S +GC+PY I PC R N SC +TP+C +
Sbjct: 152 LCTGGEYNSNEGCQPYSIFPCPRNSN----SCSKENEDTPQCYK 191
>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
Length = 303
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 126/280 (45%), Gaps = 45/280 (16%)
Query: 64 LSELEMR-MGVHPD------SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
++E E R M + PD LP + + +L DP+ P FD R +P C ++
Sbjct: 42 VTEDEFRSMLIRPDRLRARSGSLPPISITEVQELVDPI---PPQFDFRDEYPQC--VKPA 96
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQGSCG WA A+ DR C K V S L+SC + GC GG W
Sbjct: 97 LDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLE-NFGCDGGDFQPTWS 155
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
+ TG + C +Y++ H+ C C G +
Sbjct: 156 FLTFTGATTA--------------ECVKYVDYGHTVAS-------PCPAVCDDGSPIQLY 194
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-PLGEHA 295
+G+++ S+PA IM + GP++ + +YAD+ Y++G+YKH G LG HA
Sbjct: 195 KAHGYGQVSKSVPA----IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHA 250
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ I+G+G T YW++ NS+ +WGENG FRI
Sbjct: 251 LEIVGYGT------TDDGTDYWIIKNSWGPDWGENGYFRI 284
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/129 (38%), Positives = 73/129 (56%), Gaps = 11/129 (8%)
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C C G + +G+++ S+PA IM + GP++ + +YAD+ Y++G+
Sbjct: 181 CPAVCDDGSPIQLYKAHGYGQVSKSVPA----IMGMLVAGGPLQTMIVVYADLSYYESGV 236
Query: 425 YKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
YKH G LG HA+ I+G+G T YW++ NS+ +WGENG FRIVRG NE
Sbjct: 237 YKHTYGTINLGFHALEIVGYGT------TDDGTDYWIIKNSWGPDWGENGYFRIVRGVNE 290
Query: 484 CGIEADITA 492
C IE +I A
Sbjct: 291 CRIEDEIYA 299
>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 451
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/293 (30%), Positives = 137/293 (46%), Gaps = 44/293 (15%)
Query: 51 FYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYC 110
F+G + K L L++ VH +++ +++ +++P+ FDAR W
Sbjct: 143 FWGMKLTDAVKHKLGTLKVERDVHTMTEID-------IKMK---KKIPKSFDARDKWG-- 190
Query: 111 PTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF 170
I I DQG+C S WA V SDR+ I S G+ + LS L+SC GC GG
Sbjct: 191 SMITGILDQGNCASSWAFSTVGVASDRLAIQSSGETGMTLSPQHLLSCNTRGQRGCSGGH 250
Query: 171 HGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
+AW + G+VS Y G + + C M G S C G
Sbjct: 251 IDRAWWFMRKRGVVSNDCYPYTSGDQDKKGVC--MMPGKLPS-------------DCPTG 295
Query: 231 YDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--VAG 288
+ + ++L+ Y + ANE I EI +GPV+ S + D +Y +G+Y+H +A
Sbjct: 296 RERN--NELHHSTPPYRIAANEREIQVEIMENGPVQASFEVKEDFFMYGSGVYRHTPIAS 353
Query: 289 GPLGE------HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ H+++++GWG E + +KYWL ANS+ T WGE+G F+I
Sbjct: 354 NDAEQYHASEWHSVKLLGWGVE-------NGIKYWLGANSWGTKWGEDGYFKI 399
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 74/127 (58%), Gaps = 15/127 (11%)
Query: 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--VAGGPLGE- 435
++L+ Y + ANE I EI +GPV+ S + D +Y +G+Y+H +A +
Sbjct: 300 NELHHSTPPYRIAANEREIQVEIMENGPVQASFEVKEDFFMYGSGVYRHTPIASNDAEQY 359
Query: 436 -----HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADI 490
H+++++GWG E + +KYWL ANS+ T WGE+G F+I+RG+NEC IE+ +
Sbjct: 360 HASEWHSVKLLGWGVE-------NGIKYWLGANSWGTKWGEDGYFKILRGENECNIESYV 412
Query: 491 TAGLPKI 497
A K+
Sbjct: 413 VAVWGKV 419
>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 303
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 126/280 (45%), Gaps = 45/280 (16%)
Query: 64 LSELEMR-MGVHPD------SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
++E E R M + PD LP + + +L DP+ P FD R +P C ++
Sbjct: 42 VTEDEFRSMLIRPDRLRARSGSLPPISITEVQELVDPI---PPQFDFRDEYPQC--VKPA 96
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQGSCG WA A+ DR C K V S L+SC + GC GG W
Sbjct: 97 LDQGSCGGCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLE-NFGCDGGDFQPTWS 155
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
+ TG + C +Y++ H+ C C G +
Sbjct: 156 FLTFTGATTA--------------ECVKYVDYGHTVAS-------PCPAVCDDGSPIQLY 194
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-PLGEHA 295
+G+++ S+PA IM + GP++ + +YAD+ Y++G+YKH G LG HA
Sbjct: 195 KAHGYGQVSKSVPA----IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHA 250
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ I+G+G T YW++ NS+ +WGENG FRI
Sbjct: 251 LEIVGYGT------TDDGTDYWIIKNSWGPDWGENGYFRI 284
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/129 (38%), Positives = 73/129 (56%), Gaps = 11/129 (8%)
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C C G + +G+++ S+PA IM + GP++ + +YAD+ Y++G+
Sbjct: 181 CPAVCDDGSPIQLYKAHGYGQVSKSVPA----IMGMLVAGGPLQTMIVVYADLSYYESGV 236
Query: 425 YKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
YKH G LG HA+ I+G+G T YW++ NS+ +WGENG FRIVRG NE
Sbjct: 237 YKHTYGTINLGFHALEIVGYGT------TDDGTDYWIIKNSWGPDWGENGYFRIVRGVNE 290
Query: 484 CGIEADITA 492
C IE +I A
Sbjct: 291 CRIEDEIYA 299
>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
protease B2; Flags: Precursor
gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
Length = 300
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 131/278 (47%), Gaps = 40/278 (14%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
L+K +S L M + ++K R + +++PE FD R +P+C I E+ D
Sbjct: 42 LTKDEISSLLMPVSFLKNAKGAAPRGTFTDK-----DDVPESFDFREEYPHC--IPEVVD 94
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKAWKY 177
QG CGS WA +V DR C+A K+ V+ S +VSC D G+ C GG+ WK+
Sbjct: 95 QGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVVSC--DHGDMACNGGWLPNVWKF 152
Query: 178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYED 237
TG + C PY+ + C D K SY+D
Sbjct: 153 LTKTGTTT-------DECVPYKSGSTTLRGTCPTKCADGS-------SKVHLATATSYKD 198
Query: 238 DLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIR 297
+PA +M+ + GP++ + +++D + Y++G+Y+H G G HA+
Sbjct: 199 ------YGLDIPA----MMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVE 248
Query: 298 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
++G+G + G V YW++ NS+ +WGE+G FR+
Sbjct: 249 MVGYGTDDDG------VDYWIIKNSWGPDWGEDGYFRM 280
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 71/131 (54%), Gaps = 11/131 (8%)
Query: 365 CIRKCQPGYD-VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 423
C KC G V ++ +PA +M+ + GP++ + +++D + Y++G
Sbjct: 177 CPTKCADGSSKVHLATATSYKDYGLDIPA----MMKALSTSGPLQVAFLVHSDFMYYESG 232
Query: 424 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
+Y+H G G HA+ ++G+G + G V YW++ NS+ +WGE+G FR++RG N+
Sbjct: 233 VYQHTYGYMEGGHAVEMVGYGTDDDG------VDYWIIKNSWGPDWGEDGYFRMIRGIND 286
Query: 484 CGIEADITAGL 494
C IE AG
Sbjct: 287 CSIEEQAYAGF 297
>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 303
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/280 (29%), Positives = 129/280 (46%), Gaps = 45/280 (16%)
Query: 64 LSELEMR-MGVHPD------SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
++E E R M + PD LP + + ++ +P + +P FD R +P C + +
Sbjct: 42 ITEDEFRGMLIRPDILGAGSGSLPPSSV---TEIQEPADPIPSQFDFRDEYPQC--VTPV 96
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176
DQGSCG WA A+ DR C+A K V S L+SC + +GC GG W
Sbjct: 97 MDQGSCGGCWAFSAIGVFGDRRCVAGIDKEGVPYSQQYLISCSTE-NHGCDGGDFWPTWS 155
Query: 177 YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYE 236
+ TG + C +Y++ N +P C C G +
Sbjct: 156 FLTLTGATTA--------------ECVKYID------YPNIVASP-CPAVCDDGSQIQLY 194
Query: 237 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG-PLGEHA 295
+G+++ N + IM + GPV+ + +Y+D+ Y++G+YKH G LG HA
Sbjct: 195 KAHGYGQVS----KNVQAIMHMLATGGPVQTMIVVYSDLSYYESGVYKHTYGTISLGLHA 250
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ ++G+G T YW++ NS+ +WGENG FRI
Sbjct: 251 LEMVGYGT------TDDGTDYWIIRNSWGADWGENGYFRI 284
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/130 (36%), Positives = 71/130 (54%), Gaps = 11/130 (8%)
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C C G + +G+++ N + IM + GPV+ + +Y+D+ Y++G+
Sbjct: 181 CPAVCDDGSQIQLYKAHGYGQVS----KNVQAIMHMLATGGPVQTMIVVYSDLSYYESGV 236
Query: 425 YKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE 483
YKH G LG HA+ ++G+G T YW++ NS+ +WGENG FRIVRG NE
Sbjct: 237 YKHTYGTISLGLHALEMVGYGT------TDDGTDYWIIRNSWGADWGENGYFRIVRGVNE 290
Query: 484 CGIEADITAG 493
C IE +I A
Sbjct: 291 CRIEDEIYAA 300
>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
Length = 474
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 127/298 (42%), Gaps = 53/298 (17%)
Query: 62 LTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDA--------RINWPYCPT 112
+TL E R+G P S P+L+ +++ LPE D ++ W
Sbjct: 180 MTLEEGFRFRLGTLPPS-------PVLLSMNEMRATLPETTDLPEFFIAFLQMAWMDSWA 232
Query: 113 IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHG 172
I +C + WA +DR+ I S G+ LS +L+SCC +GC G
Sbjct: 233 I----GSKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCAKNRHGCNSGSID 288
Query: 173 KAWKYWVTTGIVSGGTY-------ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIR 225
+AW Y G+VS Y S C R + C +N + I
Sbjct: 289 RAWWYLRKRGLVSHACYPLFKDQNISNNTCAMTSKADGRGKRHATRPCPNNIEKSNR-IY 347
Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH 285
+C P Y VS +NE IM+EI ++GPV+ M ++ D YKTGIY+H
Sbjct: 348 QCSPPYRVS---------------SNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRH 392
Query: 286 VAGG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
V L HA+++ GWG G K+W+ ANS+ +WGENG FRI
Sbjct: 393 VISTNEESEKYRKLQTHAVKLTGWGTLKGARGQKE--KFWIAANSWGKSWGENGYFRI 448
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 67/118 (56%), Gaps = 10/118 (8%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIR 439
Y + +NE IM+EI ++GPV+ M ++ D YKTGIY+HV L HA++
Sbjct: 353 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEESEKYRKLQTHAVK 412
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
+ GWG G K+W+ ANS+ +WGENG FRI+RG NE IE I A ++
Sbjct: 413 LTGWGTLKGARGQKE--KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQL 468
>gi|157058773|gb|ABV03144.1| cathepsin B-16D [Sitobion avenae]
Length = 215
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 86/146 (58%), Gaps = 1/146 (0%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W +C TI E+RDQG+CGS WA+ A +DR+C+A+ G + LS++++
Sbjct: 70 IPRHFDARRKWRHCQTIGEVRDQGNCGSCWAVATSSAFADRLCVATDGDFNQLLSAEEIT 129
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQD 215
CC CG GC GG+ KAW+ + G+V+GG Y S++GC PY + PC +G+++
Sbjct: 130 FCCHTCGFGCNGGYPIKAWERFKKHGLVTGGDYKSEEGCEPYRVPPCPYDESGNNTCAGK 189
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNF 241
C R C D+ ++ D +
Sbjct: 190 PMEKNHRCTRMCYGDQDLDFDQDHRY 215
>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
Length = 349
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 83/245 (33%), Positives = 117/245 (47%), Gaps = 36/245 (14%)
Query: 91 SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRL 150
D E +PE FD+R WP C I IRDQ CGS WA + +SDR CI S G+ + L
Sbjct: 119 QDLNETIPESFDSRDKWPNC--IHGIRDQQLCGSCWAFASSAFLSDRFCIHSEGQINEDL 176
Query: 151 SSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSH 210
S DLVSC + GC GG ++ + + GIVS + C+PY MN
Sbjct: 177 SPQDLVSCSYE-NFGCSGGQLTESVDFLIYEGIVS-------EKCKPY-------MN--- 218
Query: 211 SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
QD C KCQ D + + + ++ E I E+ +GP+ ++
Sbjct: 219 ---QDTY-----CKFKCQN--DKQPYTKYFCEQKSMLILSDIEEIQLELMTNGPMMVGLS 268
Query: 271 IYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330
+Y D++ YK G+Y++ G +G HAI+IIGWG GE +W N + +WG
Sbjct: 269 VYEDLMNYKEGVYEYTTGNQVGGHAIKIIGWGHTEKGE------LFWKCQNQWGKDWGMG 322
Query: 331 GLFRI 335
G I
Sbjct: 323 GYINI 327
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 55/105 (52%), Gaps = 8/105 (7%)
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E I E+ +GP+ +++Y D++ YK G+Y++ G +G HAI+IIGWG GE
Sbjct: 251 EEIQLELMTNGPMMVGLSVYEDLMNYKEGVYEYTTGNQVGGHAIKIIGWGHTEKGE---- 306
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGL 499
+W N + +WG G I G E G++ + +P I +
Sbjct: 307 --LFWKCQNQWGKDWGMGGYINIKAG--ELGMDTMVLGCMPDISV 347
>gi|312105965|ref|XP_003150617.1| hypothetical protein LOAG_15077 [Loa loa]
Length = 150
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 88/152 (57%), Gaps = 20/152 (13%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP+ FDAR+ WP C ++ + +QG CGS WA+ A MSDR+CIA+ ++S++DL+
Sbjct: 8 LPKHFDARLRWPLCWSVHVVANQGGCGSCWAISAASVMSDRLCIATNYSNQKQISAEDLI 67
Query: 157 SCCKDCGNGCQGGFHG-KAWKYWVTTGIVSGGTYASKQGCRPYEI------PC--ERYMN 207
SCC +CG GCQG A+ YW G+V+GG Y S +GC+PY PC E Y
Sbjct: 68 SCCTECG-GCQGSHWALSAFIYWRNHGVVTGGDYGSFEGCKPYTTAPNCGSPCSFEYY-- 124
Query: 208 GSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
+P C + CQP Y +SYE+DL
Sbjct: 125 --------RRKISPACQKTCQPLYGLSYEEDL 148
>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 275
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/241 (33%), Positives = 115/241 (47%), Gaps = 40/241 (16%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
E P FD R WP + +R+Q SCGS WA A E M R+ I RG +S D
Sbjct: 55 ENAPASFDCRQKWP--GKAEPVRNQASCGSCWAHAASETMGFRMGI--RGCYKGVMSPQD 110
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQ 214
LVSC + GC+GG+ + W + GI + + C PY ++GS
Sbjct: 111 LVSC-ESNNMGCEGGYADRVWNWIQKKGITT-------EQCLPY-------VSGSG---- 151
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
P C KC+ G ++ ++G N +T+M E+ +GPV ++ D
Sbjct: 152 ----RVPTCPSKCKNGSNIVRSFVSSWGSF------NSKTVMDEVANNGPVYACFEVFED 201
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+GIY+H G G H + ++GWG E + V YWL+ NS+ + WGE G FR
Sbjct: 202 FLNYKSGIYQHKTGKSKGWHHVMLMGWGTE-------NGVPYWLLQNSWGSGWGEKGFFR 254
Query: 335 I 335
I
Sbjct: 255 I 255
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 73/135 (54%), Gaps = 13/135 (9%)
Query: 362 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 421
P C KC+ G ++ ++G N +T+M E+ +GPV ++ D + YK
Sbjct: 153 VPTCPSKCKNGSNIVRSFVSSWGSF------NSKTVMDEVANNGPVYACFEVFEDFLNYK 206
Query: 422 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ 481
+GIY+H G G H + ++GWG E + V YWL+ NS+ + WGE G FRI RG
Sbjct: 207 SGIYQHKTGKSKGWHHVMLMGWGTE-------NGVPYWLLQNSWGSGWGEKGFFRIRRGT 259
Query: 482 NECGIEADITAGLPK 496
N+C I+ +GLPK
Sbjct: 260 NDCHIDEIFYSGLPK 274
>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 476
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/249 (32%), Positives = 119/249 (47%), Gaps = 22/249 (8%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
++ PE F A WP I + DQ +C + WA +DR+ I S+G+ LS
Sbjct: 222 DDFPEFFVAWHEWP--GWIHDPLDQRNCAASWAFSTASVAADRIAIHSKGRFTDNLSPQH 279
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQ 214
L+SC GC+GG AW Y G+VS Y ++ CE SS
Sbjct: 280 LISCDTRNQYGCKGGSITGAWSYLKKYGLVSHACYPLFWN-NLHQTSCEM------SSVF 332
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
D E + I+ C ++ S + + + Y + + + IM+EI +GPV+ M +Y D
Sbjct: 333 DAE-GKRQAIQPCPNRWEPS--NHIYQCGLPYRISSQDADIMKEIKENGPVQAVMQVYDD 389
Query: 275 MILYKTGIYKHVAG--------GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326
LYK+GIYKH+ H+I+I+GWG EG K+W+ ANS+ +
Sbjct: 390 FFLYKSGIYKHIWSLEGKTQNRHQKKPHSIKIVGWGTLRDAEGQRQ--KFWIAANSWGNS 447
Query: 327 WGENGLFRI 335
WGENG FRI
Sbjct: 448 WGENGYFRI 456
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/115 (42%), Positives = 68/115 (59%), Gaps = 10/115 (8%)
Query: 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG--------GPLGEHA 437
+ Y + + + IM+EI +GPV+ M +Y D LYK+GIYKH+ H+
Sbjct: 359 LPYRISSQDADIMKEIKENGPVQAVMQVYDDFFLYKSGIYKHIWSLEGKTQNRHQKKPHS 418
Query: 438 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITA 492
I+I+GWG EG K+W+ ANS+ +WGENG FRI+RGQNEC IE + A
Sbjct: 419 IKIVGWGTLRDAEGQRQ--KFWIAANSWGNSWGENGYFRILRGQNECDIEKTVIA 471
>gi|145356617|ref|XP_001422524.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582767|gb|ABP00841.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 245
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 89/264 (33%), Positives = 127/264 (48%), Gaps = 48/264 (18%)
Query: 97 LPEGFDARINWPYCPT-IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
LP+ FD R WP C + E DQG CGS WA+ + M+DR+CIA+ G LS+ L
Sbjct: 2 LPKDFDVREKWPKCAALVSEALDQGECGSCWAVAPAKVMADRLCIATNGAVASHLSAMQL 61
Query: 156 VSCCK------DCGN----GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCER 204
+SC K D G+ C GGF +A++ T+GIVSGG + + C PY PC+
Sbjct: 62 LSCGKLENGTFDAGSTYSGSCDGGFPNEAYEKARTSGIVSGGLFGDDKTCMPYAFAPCQH 121
Query: 205 YMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA------NEETIMR- 257
N +H +C C+ ++N Y + + N+ M
Sbjct: 122 PCNPNH---------VAQCPTTCR-------NKNVNLSSQRYEVTSLVTCGTNDFNCMAL 165
Query: 258 EIFRHGPVEGSM-TIYADMILYKTGIY---KHVA--GGPLGEHAIRIIGWGQEPLGEGTS 311
E+F HGPV + ++ + YK+G+Y K VA G G H + +IGWG T
Sbjct: 166 ELFYHGPVSSYVGDVFDEFYKYKSGVYSLSKDVAARGENHGGHVMEVIGWGT------TE 219
Query: 312 SVVKYWLVANSFNTNWGENGLFRI 335
S +YW V NS+ NWG+ G +I
Sbjct: 220 SGTRYWKVYNSW-LNWGDQGYGKI 242
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 46/87 (52%), Gaps = 13/87 (14%)
Query: 400 EIFRHGPVEGSM-TIYADMILYKTGIY---KHVA--GGPLGEHAIRIIGWGQEPLGEGTS 453
E+F HGPV + ++ + YK+G+Y K VA G G H + +IGWG T
Sbjct: 166 ELFYHGPVSSYVGDVFDEFYKYKSGVYSLSKDVAARGENHGGHVMEVIGWGT------TE 219
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRG 480
S +YW V NS+ NWG+ G +I G
Sbjct: 220 SGTRYWKVYNSW-LNWGDQGYGKIAVG 245
>gi|412992960|emb|CCO16493.1| cysteine proteinase, putative [Bathycoccus prasinos]
Length = 396
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 92/255 (36%), Positives = 123/255 (48%), Gaps = 16/255 (6%)
Query: 97 LPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
LP FDAR W C I +RDQG CGS WA+ A E M+DRVCIA GK LS
Sbjct: 146 LPRQFDARKEWAECKGLIGTVRDQGKCGSCWAVAATEVMNDRVCIA-HGKTE-ELSPQYA 203
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYA-SKQGCRPYEIPCERYMNGSHSSCQ 214
+S C G GC+GG + + G+ +GG + S C PYE CQ
Sbjct: 204 LS-CYSAGAGCEGGNVIDTLQEAIEKGVPTGGMFGDSSSACLPYEF------EACDHPCQ 256
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE-ETIMREIFRHGPVEGSM-TIY 272
EC C G +S + + Y P + + I +E+ ++G + + +
Sbjct: 257 VPGTIAEECPTTCADGTPISETEMMRPTSEPYECPPGDWKCITQELHKYGSMAVTFGPVC 316
Query: 273 ADMILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVK-YWLVANSFNTNWGEN 330
D +K G+Y+ GG PLG HA +IIGWG E E T K YW++ NS+ NWGE+
Sbjct: 317 DDFYGHKHGVYEQPEGGKPLGLHATKIIGWGFEGDDEETGKGGKPYWIMINSWQ-NWGEH 375
Query: 331 GLFRIGCRPYEIPCE 345
G+ RIG I E
Sbjct: 376 GVGRIGIGEMSIESE 390
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 76/160 (47%), Gaps = 13/160 (8%)
Query: 337 CRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE-E 395
C PYE CQ EC C G +S + + Y P + +
Sbjct: 243 CLPYEF------EACDHPCQVPGTIAEECPTTCADGTPISETEMMRPTSEPYECPPGDWK 296
Query: 396 TIMREIFRHGPVEGSM-TIYADMILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTS 453
I +E+ ++G + + + D +K G+Y+ GG PLG HA +IIGWG E E T
Sbjct: 297 CITQELHKYGSMAVTFGPVCDDFYGHKHGVYEQPEGGKPLGLHATKIIGWGFEGDDEETG 356
Query: 454 SVVK-YWLVANSFNTNWGENGLFRIVRGQNECGIEADITA 492
K YW++ NS+ NWGE+G+ RI G E IE++ T+
Sbjct: 357 KGGKPYWIMINSWQ-NWGEHGVGRI--GIGEMSIESEATS 393
>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 298
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 82/273 (30%), Positives = 134/273 (49%), Gaps = 37/273 (13%)
Query: 64 LSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCG 123
L++ E+ + P S L ++R + + + + P+ FD R +P+C I E+ DQG CG
Sbjct: 42 LTKDEISSLLMPVSFLKRDRAAV-PRGTVSATQAPDSFDFREEYPHC--IPEVVDQGGCG 98
Query: 124 SGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKAWKYWVTTG 182
S WA +V ++ DR C A K+ V+ S +VSC D G+ C GG+ W++ TG
Sbjct: 99 SCWAFSSVASVGDRRCFAGLDKKAVKYSPQYVVSC--DRGDMACDGGWLPSVWRFLTKTG 156
Query: 183 IVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFG 242
+ C PY+ +GS + C KC G D+
Sbjct: 157 TTT-------DECVPYQ-------SGSTGA-------RGTCPTKCADGSDLPIYKATK-- 193
Query: 243 RIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 302
+ Y L + IM+ + GP++ + T+Y+D + Y+ G+Y+H G G HA+ ++G+G
Sbjct: 194 AVDYGLDCD--LIMKALATGGPLQTAFTVYSDFMYYEGGVYQHTYGRVEGGHAVEMVGYG 251
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ V YW++ NS+ +WGE+G FRI
Sbjct: 252 TDEYD------VDYWIIRNSWGPDWGEDGYFRI 278
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 365 CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGI 424
C KC G D+ + Y L + IM+ + GP++ + T+Y+D + Y+ G+
Sbjct: 176 CPTKCADGSDLPIYKATK--AVDYGLDCD--LIMKALATGGPLQTAFTVYSDFMYYEGGV 231
Query: 425 YKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNEC 484
Y+H G G HA+ ++G+G + V YW++ NS+ +WGE+G FRI+R NEC
Sbjct: 232 YQHTYGRVEGGHAVEMVGYGTDEYD------VDYWIIRNSWGPDWGEDGYFRIIRMTNEC 285
Query: 485 GIEADITAGL 494
GIE + G
Sbjct: 286 GIEEQVIGGF 295
>gi|328702238|ref|XP_001943280.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 328
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 77/241 (31%), Positives = 118/241 (48%), Gaps = 28/241 (11%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
FDAR WP C TI E R++G+ WA A ++DR+CIA+ G + +S+++L+SC
Sbjct: 88 FDARKRWPQCKTIGEFRNEGNFALSWAYAAAGVLADRMCIATNGSYNQLISTEELISC-- 145
Query: 161 DCGNGCQGGFHG-----KAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQ 214
+G GG+HG + W+Y + G+VSGG Y + GC+P +I P E YM S
Sbjct: 146 ---SGVSGGYHGIVSEREVWEYLKSHGLVSGGKYNTSDGCQPSKIPPIEEYMEYS----- 197
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
E C C ++Y DD +++ E I E+ +GPV I D
Sbjct: 198 --EIKNYTCNDHCYGNKTINYNDD--HVKVSNYYQVQYEDIQEEVQNYGPVSVEFYIRDD 253
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ I + +++IGWG E GE YWL+ +S+ G+NG+F+
Sbjct: 254 IFTPFLSINPRFQRRKYKGY-VKLIGWGVEN-GED------YWLLVDSWGYERGQNGVFK 305
Query: 335 I 335
+
Sbjct: 306 V 306
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 74/167 (44%), Gaps = 25/167 (14%)
Query: 336 GCRPYEIP-CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
GC+P +IP E YM S E C C ++Y DD +++
Sbjct: 181 GCQPSKIPPIEEYMEYS-------EIKNYTCNDHCYGNKTINYNDD--HVKVSNYYQVQY 231
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
E I E+ +GPV I D+ I + +++IGWG E GE
Sbjct: 232 EDIQEEVQNYGPVSVEFYIRDDIFTPFLSINPRFQRRKYKGY-VKLIGWGVEN-GED--- 286
Query: 455 VVKYWLVANSFNTNWGENGLFRIVRGQNECGIEAD-IT---AGLPKI 497
YWL+ +S+ G+NG+F++ R + ++AD IT AG+P+I
Sbjct: 287 ---YWLLVDSWGYERGQNGVFKVERFKT---VKADSITQAYAGVPEI 327
>gi|405963121|gb|EKC28721.1| Tubulointerstitial nephritis antigen-like protein [Crassostrea
gigas]
Length = 464
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 82/244 (33%), Positives = 120/244 (49%), Gaps = 31/244 (12%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
++LP FDARINW I +RDQ +C S WA V+ +DR+ I S G +LS
Sbjct: 191 DQLPIHFDARINWT--SWIHPVRDQKNCASSWAFSTVDVAADRLAIESEGLLTNQLSPQH 248
Query: 155 LVSCCKDCGNGCQGGFHG-KAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC 213
LVSC G G KAW + GI++ + C PY ++G ++C
Sbjct: 249 LVSCNTGRGQRGCRGGSTEKAWWFVKRRGIIT-------EECYPYTASDGECLDG-ETTC 300
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
+ +T + + P Y V +EE I EI+R+GPV+ + + +
Sbjct: 301 PNANSSTAKIVLYVTPPYRVR---------------QDEEDIKAEIYRNGPVQATFRVSS 345
Query: 274 DMILYKTGIYKHVAGGPLGEH--AIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
D +Y++G+Y+H G LGE ++RIIGWG++ KYW+ NS+ T WGE G
Sbjct: 346 DFFMYRSGVYRHT-GADLGESRLSVRIIGWGEK--TNKKGKKRKYWICLNSWGTKWGEKG 402
Query: 332 LFRI 335
FRI
Sbjct: 403 AFRI 406
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/107 (42%), Positives = 68/107 (63%), Gaps = 5/107 (4%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEH--AIRIIGWGQ 445
Y + +EE I EI+R+GPV+ + + +D +Y++G+Y+H G LGE ++RIIGWG+
Sbjct: 318 YRVRQDEEDIKAEIYRNGPVQATFRVSSDFFMYRSGVYRHT-GADLGESRLSVRIIGWGE 376
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITA 492
+ KYW+ NS+ T WGE G FRIVRG+N GIE ++ A
Sbjct: 377 K--TNKKGKKRKYWICLNSWGTKWGEKGAFRIVRGENHLGIEENVLA 421
>gi|118379711|ref|XP_001023021.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89304788|gb|EAS02776.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 382
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 134/273 (49%), Gaps = 42/273 (15%)
Query: 101 FDARINWPYCPTIQEIRDQGS-CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC 159
F+ +P C ++ I +QG C + +++ AV +++DR+C+AS G + LS+ +SC
Sbjct: 129 FNFHTKYPQC--VRPIANQGKDCSASYSIAAVSSVADRLCMASEGDFNFGLSAQPTISCY 186
Query: 160 KDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPN 219
++ C+GG+ K ++ TTG V K+ C PY + S+ C
Sbjct: 187 ENQSYKCEGGYVSKTFQKGKTTGFV-------KEECLPY------HGTDSNEGC------ 227
Query: 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 279
I KC+ +F Y + A +E+I REI +GPV M +++D ++YK
Sbjct: 228 --SLIDKCE-----------HFKIYDYCVSAGQESIKREIMLNGPVVSLMNVFSDFLVYK 274
Query: 280 TGIYKHV--AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGC 337
+G+Y+ + A G+ A++IIGW +PL + YW++ NS+ WG NGL +
Sbjct: 275 SGVYRVLENAAKLKGQQAVKIIGWDIDPLTKDY-----YWIIENSWGEEWGLNGLAYVAM 329
Query: 338 RPYEIPCERYMNGSRSSCQANEPNTPECIRKCQ 370
E+ E Y + + QA ++ + + Q
Sbjct: 330 GQEELRLEEYALAAITLQQAETASSQQAKAQSQ 362
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 69/126 (54%), Gaps = 7/126 (5%)
Query: 382 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV--AGGPLGEHAIR 439
+F Y + A +E+I REI +GPV M +++D ++YK+G+Y+ + A G+ A++
Sbjct: 235 HFKIYDYCVSAGQESIKREIMLNGPVVSLMNVFSDFLVYKSGVYRVLENAAKLKGQQAVK 294
Query: 440 IIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGL 499
IIGW +PL + YW++ NS+ WG NGL + GQ E +E A +
Sbjct: 295 IIGWDIDPLTKDY-----YWIIENSWGEEWGLNGLAYVAMGQEELRLEEYALAAITLQQA 349
Query: 500 EIDSNE 505
E S++
Sbjct: 350 ETASSQ 355
>gi|161343825|tpg|DAA06093.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 199
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 50/105 (47%), Positives = 71/105 (67%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P+ FDAR W +C TI ++RDQG+CGS WAL A +DR+C+A+ G + LS+++L
Sbjct: 88 IPKKFDARKKWRHCTTIGKVRDQGNCGSCWALSTSSAFADRLCVATNGDFNQLLSAEELT 147
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP 201
CC CG GC GG+ KAW+ + G+V+GG Y S +GC PY +P
Sbjct: 148 FCCHKCGYGCNGGYPIKAWERFKKHGLVTGGEYKSGEGCEPYRVP 192
>gi|38639319|gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 218
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/173 (41%), Positives = 97/173 (56%), Gaps = 17/173 (9%)
Query: 59 LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRD 118
LS T+S+ + +GV P + +P+L L+ELP+ FDAR WP C TI +I D
Sbjct: 62 LSNFTVSQFKRLLGVKPAREGDLEGIPVLTH--PRLKELPKEFDARKAWPQCSTIGKILD 119
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKY 177
QG CGS WA GAVE++SDR CI + LS +DL++CC CG+GC GG+ AW+Y
Sbjct: 120 QGHCGSCWAFGAVESLSDRFCI--HYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRY 177
Query: 178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPG 230
+ +G+V+ + C PY SH C+ P TP+C RKC G
Sbjct: 178 FKRSGVVT-------EECDPY----FDTTGCSHPGCEPLYP-TPKCHRKCVKG 218
>gi|2330009|gb|AAB66719.1| cysteine protease [Giardia muris]
Length = 301
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 73/242 (30%), Positives = 116/242 (47%), Gaps = 36/242 (14%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+ELP+ +D R+ +C + E+ DQ SCGS WA AV +DR C + V S
Sbjct: 75 KELPKDYDPRVERAHC--LPEVADQASCGSCWAFSAVATFADRRCAYGLDSKQVHYSEQY 132
Query: 155 LVSCCKDCGNG-CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC 213
+VSC D G+G C GG+ WK+ TG+ ++ C +Y +G
Sbjct: 133 VVSC--DFGDGACNGGWLSNVWKFLTKTGVP--------------KLDCLKYFSGMTG-- 174
Query: 214 QDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA 273
+ CI C G V +L + + + +M + GP++ + +Y+
Sbjct: 175 -----DRESCITHCTDGSPV----ELYQASHVINYGMDLDRMMEALVYDGPLQVAFVVYS 225
Query: 274 DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLF 333
D Y +G+Y+HV G G HA+ ++G+G + G +KYW++ NS+ +WGE G F
Sbjct: 226 DFGYYSSGVYQHVNGMMEGGHAVEMVGYGIDESG------LKYWIIRNSWGPDWGEGGYF 279
Query: 334 RI 335
RI
Sbjct: 280 RI 281
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 79/151 (52%), Gaps = 25/151 (16%)
Query: 341 EIPCERY---MNGSRSSCQAN-EPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEET 396
++ C +Y M G R SC + +P + Q + ++Y DL+
Sbjct: 162 KLDCLKYFSGMTGDRESCITHCTDGSP--VELYQASHVINYGMDLD-------------R 206
Query: 397 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 456
+M + GP++ + +Y+D Y +G+Y+HV G G HA+ ++G+G + G +
Sbjct: 207 MMEALVYDGPLQVAFVVYSDFGYYSSGVYQHVNGMMEGGHAVEMVGYGIDESG------L 260
Query: 457 KYWLVANSFNTNWGENGLFRIVRGQNECGIE 487
KYW++ NS+ +WGE G FRI+R NECGIE
Sbjct: 261 KYWIIRNSWGPDWGEGGYFRIIRRVNECGIE 291
>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 157
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 90/165 (54%), Gaps = 12/165 (7%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPA 392
GC PY+ P C ++N ++ C TP C+ +C P Y + DD +F +
Sbjct: 2 GCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPYHY 61
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ I GPV S T+Y D + Y++G+YKH +G LG HA++IIGWG++
Sbjct: 62 SVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEK------ 115
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
S YWL NS+N +WG++GLF+I G CGI+ D+ G PK+
Sbjct: 116 -SGQAYWLAVNSWNEDWGDHGLFKIALGN--CGIDDDLLGGTPKV 157
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 79/147 (53%), Gaps = 10/147 (6%)
Query: 193 QGCRPYEIP-CERYMNGS-HSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLP 249
GC PY+ P C ++N + + C TP C+ +C P Y + DD +F +
Sbjct: 1 DGCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPYH 60
Query: 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEG 309
+ I GPV S T+Y D + Y++G+YKH +G LG HA++IIGWG++
Sbjct: 61 YSVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEK----- 115
Query: 310 TSSVVKYWLVANSFNTNWGENGLFRIG 336
S YWL NS+N +WG++GLF+I
Sbjct: 116 --SGQAYWLAVNSWNEDWGDHGLFKIA 140
>gi|157058771|gb|ABV03143.1| cathepsin B-16D [Aulacorthum solani]
Length = 201
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 76/117 (64%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
+P FDAR W +C TI ++RDQG+CGS WA+ A +DR+C+A+ G + LS++++
Sbjct: 72 IPRHFDARRKWRHCQTIGKVRDQGNCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 131
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC 213
CC CG GC GG+ KAWK + G+V+GG Y S +GC PY +P Y + +++C
Sbjct: 132 FCCHTCGFGCHGGYPIKAWKRFNKHGLVTGGNYNSGEGCEPYRVPPCPYDDQGNNTC 188
>gi|60598652|gb|AAX25875.1| unknown [Schistosoma japonicum]
Length = 195
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 97/149 (65%), Gaps = 3/149 (2%)
Query: 63 TLSELEMRMGVHP-DSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+L + + MG D+++ + R P V D E+P FD+R WP+C +I +IRDQ
Sbjct: 23 SLDDARILMGARKEDAEMKRKRRPT-VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSR 81
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTT 181
CGS WA GAVEAM+DR+CI S G++ LS+ DL+SCC+DCG GC+GGF G+AW YWV
Sbjct: 82 CGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYWVKR 141
Query: 182 GIVSGGTYASKQGCRPYEIP-CERYMNGS 209
GIV+GG+ + GC+PY P CE S
Sbjct: 142 GIVTGGSKENHTGCQPYPFPKCEHLTKES 170
>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 91/280 (32%), Positives = 128/280 (45%), Gaps = 56/280 (20%)
Query: 60 SKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQ 119
S + +++ +G +P + PL LS+P E FDAR WP I +RDQ
Sbjct: 34 SVINVAKFRAMLGAELGPHMPYVQ-PL--SLSEPTE-----FDAREQWP--GKILPVRDQ 83
Query: 120 GSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWV 179
SCGS WA EAM D IA G +S DLVSC K + C GG KA +Y V
Sbjct: 84 ASCGSCWAHSVAEAMGDAQNIA--GCPRGAMSVQDLVSCDKT-DSACNGGDMKKAQEYLV 140
Query: 180 TTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
TGI + C +Y++GS P C KC G +
Sbjct: 141 KTGITTEA--------------CVKYVSGSG--------RVPACPSKCDNGSQI------ 172
Query: 240 NFGRIAYSLPANEET----IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHA 295
I Y L + + IM+ + +GP+ +Y+D + Y++G+Y+H +G G HA
Sbjct: 173 ----IRYKLQSWKSVEPSEIMQALMEYGPLSCGFMVYSDFMNYRSGVYQHKSGYFEGGHA 228
Query: 296 IRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ + GWG E + + YWLV NS+ WGE G F+I
Sbjct: 229 VLLCGWGVE-------NGLPYWLVQNSWGPAWGEKGFFKI 261
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 80/157 (50%), Gaps = 29/157 (18%)
Query: 344 CERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEET----IMR 399
C +Y++GS P C KC G + I Y L + + IM+
Sbjct: 149 CVKYVSGSG--------RVPACPSKCDNGSQI----------IRYKLQSWKSVEPSEIMQ 190
Query: 400 EIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYW 459
+ +GP+ +Y+D + Y++G+Y+H +G G HA+ + GWG E + + YW
Sbjct: 191 ALMEYGPLSCGFMVYSDFMNYRSGVYQHKSGYFEGGHAVLLCGWGVE-------NGLPYW 243
Query: 460 LVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496
LV NS+ WGE G F+I+RG N C IE+ +T G+PK
Sbjct: 244 LVQNSWGPAWGEKGFFKILRGSNHCEIESYVTLGVPK 280
>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 288
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 87/294 (29%), Positives = 134/294 (45%), Gaps = 54/294 (18%)
Query: 49 LPFYGAEKNALSKLTLSELEMRMG----VHPDSKLPQNRLPLLVQLSDPLEELPEGFDAR 104
LP+ E +T + + G + PD+ +P R P + +S +P ++
Sbjct: 23 LPWVAGENERFKGMTFKDASVISGNAHKLRPDT-IPLARPPK-INIS-----IPMSYNFT 75
Query: 105 INWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN 164
+P C + DQG CGS W+ ++ S R C + + V S LV+C + +
Sbjct: 76 ERFPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYC--RKYNKPVLFSQSHLVACDRR-NS 130
Query: 165 GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECI 224
GC GG AW+Y G+ C+PY+ +Y C
Sbjct: 131 GCGGGIEVNAWRYIDLRGL-------PLDSCQPYDGNITKY----------------NCS 167
Query: 225 RKC---QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTG 281
+KC Y+ + + + R A S+ + IM E GPV S+ +Y+D++ YK+G
Sbjct: 168 KKCTNESETYEAQFTEYWSVARYA-SIEEMQIGIMTE----GPVTTSLKVYSDLMYYKSG 222
Query: 282 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
IY H G LG HA+ IIGW GT + + YW+++NS+NT WG NGLF I
Sbjct: 223 IYTHTKGEFLGHHAVEIIGW-------GTKNGIDYWIISNSWNTTWGMNGLFLI 269
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/148 (40%), Positives = 83/148 (56%), Gaps = 17/148 (11%)
Query: 354 SCQANEPNTPE--CIRKC---QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVE 408
SCQ + N + C +KC Y+ + + + R A S+ + IM E GPV
Sbjct: 153 SCQPYDGNITKYNCSKKCTNESETYEAQFTEYWSVARYA-SIEEMQIGIMTE----GPVT 207
Query: 409 GSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 468
S+ +Y+D++ YK+GIY H G LG HA+ IIGWG T + + YW+++NS+NT
Sbjct: 208 TSLKVYSDLMYYKSGIYTHTKGEFLGHHAVEIIGWG-------TKNGIDYWIISNSWNTT 260
Query: 469 WGENGLFRIVRGQNECGIEADITAGLPK 496
WG NGLF I RG NEC IE + AG K
Sbjct: 261 WGMNGLFLIKRGVNECHIEDYVCAGKVK 288
>gi|294891889|ref|XP_002773789.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878993|gb|EER05605.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 422
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 119/273 (43%), Gaps = 39/273 (14%)
Query: 94 LEELPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSS 152
L LP FDAR + C I +R+QG C + WA AV +DRVCI S G+ LS
Sbjct: 142 LTTLPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILSL 201
Query: 153 DDLVSCCKDC-----GNGCQGGFHGKAWKYWVTTGIVSGGTY-----------------A 190
L SCC NGC G + + G+V+G +
Sbjct: 202 GYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGRNFRFESFKLSGEYKPPEELG 261
Query: 191 SKQGCRPYEIPCERYMNGSHSS---CQDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAY 246
+ GC PY P ++ G S C + P C C Y S + D + +
Sbjct: 262 NDDGCWPYPFPKCNHVPGLESKYPRCAQVR-DLPACATTCPNKAYGTSMQKDTHRAKSWG 320
Query: 247 SLPANEETIMREIFRHGPVE---GSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 303
LP E I +EIF +GP+ MT+Y D L + +Y H G L H +++IGWG
Sbjct: 321 RLPIGPEKIKQEIFDNGPLRXXAAMMTLYEDFDL-QVCVYVHKTGQMLAAHTLKLIGWGV 379
Query: 304 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
E S +YWL N++N WG++G+ ++
Sbjct: 380 E-------SGQEYWLAVNAWNEEWGDHGMIKLA 405
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 76/161 (47%), Gaps = 16/161 (9%)
Query: 336 GCRPYEIPCERYMNGSRSSCQ--ANEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLPA 392
GC PY P ++ G S A + P C C Y S + D + + LP
Sbjct: 265 GCWPYPFPKCNHVPGLESKYPRCAQVRDLPACATTCPNKAYGTSMQKDTHRAKSWGRLPI 324
Query: 393 NEETIMREIFRHGPVE---GSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
E I +EIF +GP+ MT+Y D L + +Y H G L H +++IGWG E
Sbjct: 325 GPEKIKQEIFDNGPLRXXAAMMTLYEDFDL-QVCVYVHKTGQMLAAHTLKLIGWGVE--- 380
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADI 490
S +YWL N++N WG++G+ ++ G+ G+E +
Sbjct: 381 ----SGQEYWLAVNAWNEEWGDHGMIKLAVGKT--GLEHQV 415
>gi|452264|emb|CAA80449.1| cathepsin B-like protease [Fasciola hepatica]
Length = 166
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 64/178 (35%), Positives = 97/178 (54%), Gaps = 24/178 (13%)
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
QG CG WA GA MSDR+CIAS+GK LS++++V CC CG GC GG +
Sbjct: 1 QGQCGWCWAFGA-STMSDRICIASQGKHTPVLSAENMVDCCTSCGMGCNGG------GFP 53
Query: 179 VTTGIVSGGTYASKQGCRPYEIPCERYM------------NGSHSSCQDNEPNTPECIRK 226
+ G + T + C + + ER+M + + + C + + TP C
Sbjct: 54 LKLGSIGKKT----RSCHRWFVRIERWMPTILVPSLRTSRDWTETPC-NQDVTTPACKHT 108
Query: 227 CQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK 284
C+PGY+++Y+ D + R Y +PA+E IMRE+ +GP+E S +Y D YK+G+Y+
Sbjct: 109 CRPGYNMTYQKDKWYARTVYKVPADEHRIMRELLTNGPMEVSFEVYGDFPSYKSGVYQ 166
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 43/66 (65%)
Query: 361 NTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILY 420
TP C C+PGY+++Y+ D + R Y +PA+E IMRE+ +GP+E S +Y D Y
Sbjct: 101 TTPACKHTCRPGYNMTYQKDKWYARTVYKVPADEHRIMRELLTNGPMEVSFEVYGDFPSY 160
Query: 421 KTGIYK 426
K+G+Y+
Sbjct: 161 KSGVYQ 166
>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 273
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 79/241 (32%), Positives = 115/241 (47%), Gaps = 40/241 (16%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
E P FD R WP + +R+QGSCGS WA A E M R+ I R + V +S D
Sbjct: 53 ENAPASFDCRQKWP--GKAEPVRNQGSCGSCWAHAASETMGFRMGI-RRCSKGV-MSPQD 108
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQ 214
LVSC + GC GG+ + W + GI + + C PY ++GS
Sbjct: 109 LVSC-ESNNMGCNGGYADRVWNWIQKKGITT-------EQCIPY-------VSGSG---- 149
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
P C KC+ G ++ ++G N +T+M E+ +GPV ++ D
Sbjct: 150 ----RVPTCPSKCKNGSNIVRSFVSSWGSF------NSKTVMDEVANNGPVYACFEVFED 199
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
Y++G+Y+H G G H + ++GWG E + V YWL+ NS+ + WGE G FR
Sbjct: 200 FYNYRSGVYQHKTGRSQGWHHVMLMGWGTE-------NGVPYWLLQNSWGSGWGEKGFFR 252
Query: 335 I 335
I
Sbjct: 253 I 253
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 72/135 (53%), Gaps = 13/135 (9%)
Query: 362 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYK 421
P C KC+ G ++ ++G N +T+M E+ +GPV ++ D Y+
Sbjct: 151 VPTCPSKCKNGSNIVRSFVSSWGSF------NSKTVMDEVANNGPVYACFEVFEDFYNYR 204
Query: 422 TGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ 481
+G+Y+H G G H + ++GWG E + V YWL+ NS+ + WGE G FRI RG
Sbjct: 205 SGVYQHKTGRSQGWHHVMLMGWGTE-------NGVPYWLLQNSWGSGWGEKGFFRIRRGT 257
Query: 482 NECGIEADITAGLPK 496
N+C I+ +GLPK
Sbjct: 258 NDCHIDEIFYSGLPK 272
>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
Length = 392
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 69/213 (32%), Positives = 104/213 (48%), Gaps = 23/213 (10%)
Query: 150 LSSDDLVSCCKD-----CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-CE 203
+ D CKD C +GC G AW + GI + G+ ++ GC PY P C
Sbjct: 138 FDARDAFKECKDVIGHVCCDGCTKGRPDAAWSFLNVYGIATEGSMSAADGCWPYNFPKCG 197
Query: 204 RYMNGS-HSSCQDNEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP---ANEETIMRE 258
+ S + C + +TP C+ +C Y + D +F A+ P + I +E
Sbjct: 198 HHQQDSKYQPCPEKNYDTPPCLDRCPNKNYGTPLDKDRHF--TAHFSPYQLKGTDNIKKE 255
Query: 259 IFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWL 318
I +GP + ++Y D + Y++G+YKH +G +GEH + IIGW GT V YWL
Sbjct: 256 IMTNGPTSAAFSMYDDFLSYESGVYKHTSGTLMGEHGVEIIGW-------GTKQGVDYWL 308
Query: 319 VANSFNTNWGENGLFRIG---CRPYEIPCERYM 348
V NS+N WG +G F+I C ++ ER+M
Sbjct: 309 VMNSWNEGWGVHGTFKIAQGDCGINDMAIERFM 341
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 82/157 (52%), Gaps = 17/157 (10%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKC-QPGYDVSYEDDLNFGRIAYSLP- 391
GC PY P C + S+ C +TP C+ +C Y + D +F A+ P
Sbjct: 187 GCWPYNFPKCGHHQQDSKYQPCPEKNYDTPPCLDRCPNKNYGTPLDKDRHF--TAHFSPY 244
Query: 392 --ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
+ I +EI +GP + ++Y D + Y++G+YKH +G +GEH + IIGW
Sbjct: 245 QLKGTDNIKKEIMTNGPTSAAFSMYDDFLSYESGVYKHTSGTLMGEHGVEIIGW------ 298
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
GT V YWLV NS+N WG +G F+I +G +CGI
Sbjct: 299 -GTKQGVDYWLVMNSWNEGWGVHGTFKIAQG--DCGI 332
>gi|303289014|ref|XP_003063795.1| cathepsin B-like cysteine proteinase [Micromonas pusilla CCMP1545]
gi|226454863|gb|EEH52168.1| cathepsin B-like cysteine proteinase [Micromonas pusilla CCMP1545]
Length = 390
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 88/269 (32%), Positives = 128/269 (47%), Gaps = 42/269 (15%)
Query: 92 DPLEE-LPEGFDARINWPYCP-TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
DP+ + LPE FDAR WP C + DQG CGS WA+ ++DR CIA+ G
Sbjct: 110 DPVADGLPELFDARERWPRCARVVGTALDQGKCGSCWAVATAAVLTDRACIATNGALGGG 169
Query: 150 ------LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPY----- 198
LS+ L+SC +GC+GG A++Y T G+V+GG Y + C PY
Sbjct: 170 GGGGEFLSASQLLSC--GAADGCEGGDERDAFEYAKTHGVVTGGAYGDESTCAPYLFDAC 227
Query: 199 EIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA--YSLPANEET-I 255
+ PCE+ TPEC C ED + R+ S P + + +
Sbjct: 228 QHPCEKS-------------PTPECPLSCVRPKGTRVEDAPAY-RVKEIVSCPERDYSCV 273
Query: 256 MREIFRHGPVEG-SMTIYADMILYK-TGIY-----KHVAGGPLGEHAIRIIGWGQEPLGE 308
+EI GPV + TI+ + LY G++ + V G G H +++IGWG++
Sbjct: 274 AKEIATRGPVTSYAGTIWGEFYLYDGRGVFASSGDERVRGENHGGHVVKLIGWGRDEKAR 333
Query: 309 GTSSVVK--YWLVANSFNTNWGENGLFRI 335
G + +WLV NS+ NWG +G R+
Sbjct: 334 GKPATAGGYHWLVVNSWR-NWGNDGFGRV 361
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 62/132 (46%), Gaps = 14/132 (10%)
Query: 362 TPECIRKCQPGYDVSYEDDLNFGRIA--YSLPANEET-IMREIFRHGPVEG-SMTIYADM 417
TPEC C ED + R+ S P + + + +EI GPV + TI+ +
Sbjct: 236 TPECPLSCVRPKGTRVEDAPAY-RVKEIVSCPERDYSCVAKEIATRGPVTSYAGTIWGEF 294
Query: 418 ILYK-TGIY-----KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK--YWLVANSFNTNW 469
LY G++ + V G G H +++IGWG++ G + +WLV NS+ NW
Sbjct: 295 YLYDGRGVFASSGDERVRGENHGGHVVKLIGWGRDEKARGKPATAGGYHWLVVNSWR-NW 353
Query: 470 GENGLFRIVRGQ 481
G +G R+ G
Sbjct: 354 GNDGFGRVAVGD 365
>gi|239793652|dbj|BAH72931.1| ACYPI000018 [Acyrthosiphon pisum]
Length = 239
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 54/132 (40%), Positives = 76/132 (57%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
+P FDAR W C TI +RDQG+CGS WA+ A +DR+C+A+ + LS++++
Sbjct: 87 RIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNTDFNELLSAEEI 146
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
CC CG GC GG+ KAW+ + G+V+GG Y S +GC PY +P Y H Q
Sbjct: 147 TFCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHIHAQV 206
Query: 216 NEPNTPECIRKC 227
N+ N +R C
Sbjct: 207 NQGNRITDVRGC 218
>gi|161343847|tpg|DAA06104.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 187
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 69/108 (63%)
Query: 90 LSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVR 149
L D ++PE FDAR W C +I I +QG+C + WA+ A++DR+CI S+
Sbjct: 80 LDDGSNDMPETFDARNKWFECVSIAHIWNQGNCAADWAISVTSAINDRICIKSKKNITAF 139
Query: 150 LSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRP 197
S ++SCC DCG+GC GG+ G AW+YW+ G+V+GG Y S +GC+P
Sbjct: 140 YSPQKMLSCCDDCGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQP 187
>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 332
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 120/286 (41%), Gaps = 36/286 (12%)
Query: 50 PFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPY 109
P+ + L L+ +L H P P ++P +P+ FD R +P
Sbjct: 61 PWTAGISDRLVGLSEDDLRAMFPRHGQPTRPSAECPR----AEPSGPIPDAFDLREEYPQ 116
Query: 110 CPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGG 169
C I + DQG CG+ WA A A DR C+ V S VSC D GC GG
Sbjct: 117 C--ITPVYDQGYCGACWAFSATGAFGDRRCMQWLDPVGVPYSQQYTVSC-DDLDLGCAGG 173
Query: 170 FHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
W + G + + C RY + D + ++P C C
Sbjct: 174 TSFNVWTFLTEHGTTT--------------LECVRYTDA------DKDLSSP-CPALCDD 212
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
G ++ G + YS N IM+ + GPV+ M++Y D + Y+ G+YKHV G
Sbjct: 213 GSEIQLVK--ADGCLDYS--GNVTAIMQTLANDGPVQAVMSVYRDFLYYRGGVYKHVYGI 268
Query: 290 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ HA+ IIG+G E + YW+V NS NWGE G F I
Sbjct: 269 QISSHAVEIIGYGTTDDEER----IPYWIVKNSLGPNWGEEGYFNI 310
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 61/102 (59%), Gaps = 4/102 (3%)
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
N IM+ + GPV+ M++Y D + Y+ G+YKHV G + HA+ IIG+G E
Sbjct: 230 NVTAIMQTLANDGPVQAVMSVYRDFLYYRGGVYKHVYGIQISSHAVEIIGYGTTDDEER- 288
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGL 494
+ YW+V NS NWGE G F IVRG NEC IE+ + +GL
Sbjct: 289 ---IPYWIVKNSLGPNWGEEGYFNIVRGSNECDIESAVYSGL 327
>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
Length = 429
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 91/277 (32%), Positives = 120/277 (43%), Gaps = 42/277 (15%)
Query: 67 LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGW 126
L ++G P + + PL P P FDAR WP I I DQG CGS W
Sbjct: 163 LIYKLGTFPLNAETRRMGPLRYDKDVPY---PTQFDARTRWP--GFISPIVDQGWCGSDW 217
Query: 127 ALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSG 186
A+ SDR I S G ++ LS L+SC GC GG AW + G+V
Sbjct: 218 AVSLAGVASDRFAIQSNGAENMVLSPQTLLSCNVRAQQGCHGGHIDVAWNFARGHGLVD- 276
Query: 187 GTYASKQGCRPYEIPCERY-MNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA 245
+ C PY+ R + QD C P R
Sbjct: 277 ------EKCFPYKASVTRCPFRPRGNLIQDG----------CMPLV------KRRTSRYK 314
Query: 246 YSLPA---NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP---LGEHAIRII 299
PA +E+ IM +I GPV+ MT+Y D Y+ G+Y+ G G H++RII
Sbjct: 315 LGPPAKLSHEKDIMYDIMESGPVQAVMTVYQDFFHYRDGVYRRSYHGNNELKGFHSVRII 374
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336
GWG++ G+ +YW+VANS+ WGENG FRI
Sbjct: 375 GWGED-RGD------RYWVVANSWGRQWGENGYFRIA 404
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 46/109 (42%), Positives = 66/109 (60%), Gaps = 10/109 (9%)
Query: 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP---LGEHAIRIIGWGQEPL 448
++E+ IM +I GPV+ MT+Y D Y+ G+Y+ G G H++RIIGWG++
Sbjct: 322 SHEKDIMYDIMESGPVQAVMTVYQDFFHYRDGVYRRSYHGNNELKGFHSVRIIGWGED-R 380
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
G+ +YW+VANS+ WGENG FRI RG NE IE+ + GL +
Sbjct: 381 GD------RYWVVANSWGRQWGENGYFRIARGSNEADIESFVVTGLSDV 423
>gi|56757237|gb|AAW26790.1| unknown [Schistosoma japonicum]
Length = 170
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 56/106 (52%), Positives = 68/106 (64%), Gaps = 1/106 (0%)
Query: 72 GVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAV 131
G D L Q R P V D E+P FD+R WP C +I +IRDQ C S WA+ AV
Sbjct: 66 GRREDPNLRQKRRPT-VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAV 124
Query: 132 EAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177
AMSDR+CI S GK+ V LS+ DL+SCC++CG+GC GGF G AW Y
Sbjct: 125 GAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDY 170
>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
kowalevskii]
Length = 93
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 54/100 (54%), Positives = 72/100 (72%), Gaps = 7/100 (7%)
Query: 398 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 457
M EI ++GPVEG+ T+YAD YK+G+Y+H G LG HAI+I+GWG E +
Sbjct: 1 MAEIQKYGPVEGAFTVYADFPSYKSGVYQHETGEALGGHAIKILGWGNEDGHD------- 53
Query: 458 YWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLVANS+N +WG+ G F+I+RG +ECGIE+ ITAG PK+
Sbjct: 54 YWLVANSWNEDWGDQGFFKILRGVDECGIESQITAGSPKL 93
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/80 (51%), Positives = 55/80 (68%), Gaps = 7/80 (8%)
Query: 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 315
M EI ++GPVEG+ T+YAD YK+G+Y+H G LG HAI+I+GWG E +
Sbjct: 1 MAEIQKYGPVEGAFTVYADFPSYKSGVYQHETGEALGGHAIKILGWGNEDGHD------- 53
Query: 316 YWLVANSFNTNWGENGLFRI 335
YWLVANS+N +WG+ G F+I
Sbjct: 54 YWLVANSWNEDWGDQGFFKI 73
>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 122
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 53/120 (44%), Positives = 78/120 (65%), Gaps = 9/120 (7%)
Query: 396 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
+IM E++++GPVE + T+Y D YK+G+YKHV G LG HA+++IGWG GE
Sbjct: 9 SIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSEDGE----- 63
Query: 456 VKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP---KIGLEIDSNEINLGKMM 512
YWL+AN +N WG++G F+I RG NEC IE ++ AG+P + +E+D ++ L M
Sbjct: 64 -DYWLLANQWNRGWGDDGYFKIRRGTNECDIEDEVVAGMPSPKNLNMELDVSDAFLDASM 122
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 56/82 (68%), Gaps = 6/82 (7%)
Query: 254 TIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 313
+IM E++++GPVE + T+Y D YK+G+YKHV G LG HA+++IGWG GE
Sbjct: 9 SIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSEDGE----- 63
Query: 314 VKYWLVANSFNTNWGENGLFRI 335
YWL+AN +N WG++G F+I
Sbjct: 64 -DYWLLANQWNRGWGDDGYFKI 84
>gi|403354695|gb|EJY76909.1| Cathepsin B [Oxytricha trifallax]
Length = 311
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 72/258 (27%), Positives = 115/258 (44%), Gaps = 44/258 (17%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+++P +D R +P C I+DQ CGS WA + R C+A++GK++ LS +
Sbjct: 85 KQMPSSYDVRTVYPMCEN--RIKDQAQCGSCWAFATTNVLEYRYCMATKGKKYPELSPQN 142
Query: 155 LVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQ 214
L+SC GC GG+ + + Y G+ + + C PY+ + M S C
Sbjct: 143 LISCFNSASWGCDGGYIDQTFLYLEMMGV-------NTEQCMPYK-SGDGNMTACPSKCA 194
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
+ E N C+PG + E+ +F GP+ ++ D
Sbjct: 195 NGE-NLYMNKYYCRPG--------------STQYMRGEQQFKNYLFNKGPMVAVFDVFED 239
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
I Y GIY V+G LG+HA++++G+G E + Y++ N + +WGE+G FR
Sbjct: 240 FINYGGGIYNKVSGDKLGKHAVKLLGYGVE-------NSTNYYIGVNQWGKDWGEDGYFR 292
Query: 335 I------------GCRPY 340
I GC PY
Sbjct: 293 IKAGEVLIDNEGYGCDPY 310
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 48/88 (54%), Gaps = 7/88 (7%)
Query: 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTS 453
E+ +F GP+ ++ D I Y GIY V+G LG+HA++++G+G E
Sbjct: 217 EQQFKNYLFNKGPMVAVFDVFEDFINYGGGIYNKVSGDKLGKHAVKLLGYGVE------- 269
Query: 454 SVVKYWLVANSFNTNWGENGLFRIVRGQ 481
+ Y++ N + +WGE+G FRI G+
Sbjct: 270 NSTNYYIGVNQWGKDWGEDGYFRIKAGE 297
>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
Length = 541
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 83/252 (32%), Positives = 121/252 (48%), Gaps = 38/252 (15%)
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
LP+ FD+R WP ++ RDQ + G+ WA +SDR+ I S+ V LS LV
Sbjct: 288 LPKTFDSRTKWP--GSLSLPRDQENEGTSWAFSTTSVLSDRLAIQSKNFTVVELSPQHLV 345
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY-----ASKQGCRPYEIPCERYMNGSHS 211
SC + +G + W Y G+VS Y S QG + + +G+H
Sbjct: 346 SCFSS--HEGRGERLDRTWWYLRKKGVVSTVCYPESRSKSTQGIGSCGLVA--HSSGAHI 401
Query: 212 SCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
N ++ E I K P Y VS +NEE IM+EIF +GPV+ M +
Sbjct: 402 CPNGNVISSNE-IYKTSPVYRVS---------------SNEENIMKEIFENGPVQAVMRV 445
Query: 272 YADMILYKTGIYKHVAGGPL--------GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323
D +YK+G+Y A + H+++IIGWG++ + ++ KYW+V NS+
Sbjct: 446 QPDFFVYKSGVYSSTAIDNIVVEQVKDNTYHSVKIIGWGEK---KSKTNSGKYWIVQNSW 502
Query: 324 NTNWGENGLFRI 335
NWGE G FRI
Sbjct: 503 GANWGEGGYFRI 514
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/121 (42%), Positives = 73/121 (60%), Gaps = 11/121 (9%)
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPL--------GEHAI 438
Y + +NEE IM+EIF +GPV+ M + D +YK+G+Y A + H++
Sbjct: 419 VYRVSSNEENIMKEIFENGPVQAVMRVQPDFFVYKSGVYSSTAIDNIVVEQVKDNTYHSV 478
Query: 439 RIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498
+IIGWG++ + ++ KYW+V NS+ NWGE G FRI +G NECGIE I A P+I
Sbjct: 479 KIIGWGEK---KSKTNSGKYWIVQNSWGANWGEGGYFRIRKGVNECGIEEMILAAWPQIP 535
Query: 499 L 499
L
Sbjct: 536 L 536
>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
Length = 527
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 90/165 (54%), Gaps = 12/165 (7%)
Query: 336 GCRPYEIP-CERYMNGSR-SSCQANEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPA 392
GC PY+ P C ++N ++ C TP C+ +C P Y S ++D ++ +
Sbjct: 372 GCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKNDRHYMLESSPYQY 431
Query: 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGT 452
+ I GP+ S +Y D + YK+G+YKH +G LG HA++IIGWG+E GE
Sbjct: 432 SVNNAKNAIRTDGPISASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEEN-GEA- 489
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
YWLV NS+N +WG+ GLF+I G C I+ D+ G PK+
Sbjct: 490 -----YWLVVNSWNEDWGDQGLFKIALGN--CEIDDDLLGGTPKV 527
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 85/162 (52%), Gaps = 10/162 (6%)
Query: 184 VSGGTYASKQGCRPYEIP-CERYMNGS-HSSCQDNEPNTPECIRKCQ-PGYDVSYEDDLN 240
V+ G GC PY+ P C ++N + + C TP C+ +C P Y S ++D +
Sbjct: 362 VARGNLTKGDGCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKNDRH 421
Query: 241 FGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIG 300
+ + + I GP+ S +Y D + YK+G+YKH +G LG HA++IIG
Sbjct: 422 YMLESSPYQYSVNNAKNAIRTDGPISASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIG 481
Query: 301 WGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEI 342
WG+E GE YWLV NS+N +WG+ GLF+I EI
Sbjct: 482 WGEEN-GEA------YWLVVNSWNEDWGDQGLFKIALGNCEI 516
>gi|403331769|gb|EJY64852.1| hypothetical protein OXYTRI_15000 [Oxytricha trifallax]
Length = 259
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 80/247 (32%), Positives = 116/247 (46%), Gaps = 42/247 (17%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
LP FD+ + WP C I R+QGSCGS +A A MSDR+CI S G+ ++ LS +
Sbjct: 31 SNLPLSFDSTVEWPDC--IHATRNQGSCGSCYAFAASGMMSDRLCIKSNGQINLVLSPQE 88
Query: 155 LVSCCKDCGN-GCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSC 213
LVSC D N GC GG+ Y ++ GI S + C PY++ N +C
Sbjct: 89 LVSC--DYQNYGCSGGWMTNTLYYLMSYGIPS-------ETCLPYDM-----FNSETKAC 134
Query: 214 QD--NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
+ PN KC+ G + ++ ETIMR+I +GP +
Sbjct: 135 SGRCDSPNYEYTRHKCKKG--------------TSKIMSDPETIMRDIMENGPSIVAFQA 180
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW---G 328
+ D + + GIYK+ +G L HA ++ GWG + G YW+ N F W G
Sbjct: 181 FEDFLNFGGGIYKYTSGKFLVGHATKLTGWGLDSAGR------LYWIGQNQFGLGWGGRG 234
Query: 329 ENGLFRI 335
+ G ++I
Sbjct: 235 DYGFYKI 241
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 32/166 (19%)
Query: 337 CRPYEIPCERYMNGSRSSC--QANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 394
C PY++ N +C + + PN KC+ G + ++
Sbjct: 121 CLPYDM-----FNSETKACSGRCDSPNYEYTRHKCKKG--------------TSKIMSDP 161
Query: 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 454
ETIMR+I +GP + + D + + GIYK+ +G L HA ++ GWG + G
Sbjct: 162 ETIMRDIMENGPSIVAFQAFEDFLNFGGGIYKYTSGKFLVGHATKLTGWGLDSAGR---- 217
Query: 455 VVKYWLVANSFNTNW---GENGLFRIVRGQNECGIEADITAGLPKI 497
YW+ N F W G+ G ++I G E G + + + +P +
Sbjct: 218 --LYWIGQNQFGLGWGGRGDYGFYKIYDG--EVGFGSAVWSCIPDV 259
>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 79/242 (32%), Positives = 113/242 (46%), Gaps = 36/242 (14%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++P FDAR W C + IRDQ +CG+ WA A ++ R+CIA+ GK +V LS +
Sbjct: 87 DIPMNFDARTQWKGC--VPAIRDQQTCGACWAFSANYVLAHRLCIATNGKTNVVLSPEYQ 144
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
V C CQGG+ +W + TG C PY + +G+
Sbjct: 145 VQC-DTMNKACQGGYLKYSWTFLENTG-------TPLDTCIPYASGRGTFSSGT------ 190
Query: 216 NEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
C +C+ +S N I+ I I +G V+ T+Y D
Sbjct: 191 -------CPTQCKIASMSMSKYKAKNTVYIS-----GINNIKTAIMTYGSVQAGFTVYRD 238
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+G+YKHV LG HA+ +IG+G EG S+ YWL ANS+ NWG +G F+
Sbjct: 239 LTGYKSGVYKHVVSTVLGGHAVALIGFGV----EGGSN---YWLAANSWGPNWGMSGYFK 291
Query: 335 IG 336
I
Sbjct: 292 IA 293
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 9/97 (9%)
Query: 397 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 456
I I +G V+ T+Y D+ YK+G+YKHV LG HA+ +IG+G EG S+
Sbjct: 219 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV----EGGSN-- 272
Query: 457 KYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
YWL ANS+ NWG +G F+I +G E GIE + AG
Sbjct: 273 -YWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 306
>gi|308804940|ref|XP_003079782.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116058239|emb|CAL53428.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
Length = 498
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 132/284 (46%), Gaps = 30/284 (10%)
Query: 61 KLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEE-LPEGFDARINWPYCP-TIQEIRD 118
KLTLS H + + L D L+ LP FDAR +P C I +RD
Sbjct: 220 KLTLSPYASSDETHGAHPFDRKAVGLGRVKWDALKHSLPRHFDARDEYPKCARLIGTVRD 279
Query: 119 QGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYW 178
QG CGS WA+ A E M+DR+CI+S GK LS +S C + G GC+GG
Sbjct: 280 QGKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQFALS-CYNSGAGCEGGDVVDTLTLA 338
Query: 179 VTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPG--YDVSY 235
+ G+ GG K C PY+ PC+ C + C C G + + Y
Sbjct: 339 LAKGVPHGGML-DKGACLPYQFEPCDH-------PCMIPGTSPEACPATCADGSKFQLVY 390
Query: 236 EDDLNFGRIAYSLPANE-ETIMREIFRHGPVEGSM-TIYADMILYKTGIYK--HVAGGPL 291
+L Y+ P ++ I +EI G V + ++ D +K G+YK +G L
Sbjct: 391 PKNL-----PYTCPPDDIACIAKEIKNRGSVAVTFGPVHEDFYGHKEGVYKVTESSGREL 445
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA ++IGWG G+ YW++ NS+ NWGENG+ ++
Sbjct: 446 GNHATKLIGWGVTQEGD------HYWIMVNSWR-NWGENGVGKV 482
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 77/168 (45%), Gaps = 28/168 (16%)
Query: 332 LFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPG--YDVSYEDDLNFGRIAY 388
L + C PY+ PC+ C + C C G + + Y +L Y
Sbjct: 349 LDKGACLPYQFEPCDH-------PCMIPGTSPEACPATCADGSKFQLVYPKNL-----PY 396
Query: 389 SLPANE-ETIMREIFRHGPVEGSM-TIYADMILYKTGIYK--HVAGGPLGEHAIRIIGWG 444
+ P ++ I +EI G V + ++ D +K G+YK +G LG HA ++IGWG
Sbjct: 397 TCPPDDIACIAKEIKNRGSVAVTFGPVHEDFYGHKEGVYKVTESSGRELGNHATKLIGWG 456
Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITA 492
G+ YW++ NS+ NWGENG+ ++ G E IE+ + A
Sbjct: 457 VTQEGD------HYWIMVNSWR-NWGENGVGKVRMG--EMSIESGVAA 495
>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
Length = 362
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 77/242 (31%), Positives = 113/242 (46%), Gaps = 36/242 (14%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++P FDAR W C + IRDQ +CG+ WA A ++ R+CIA+ G+ +V LS +
Sbjct: 139 DIPMNFDARTQWKGC--VPAIRDQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQ 196
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
V C CQGG+ +W + TG C PY + +G+
Sbjct: 197 VQC-DTMNKACQGGYLKYSWTFLENTG-------TPLDSCIPYASGRGTFSSGT------ 242
Query: 216 NEPNTPECIRKCQ-PGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
C +C+ +S N I+ I I +G V+ T+Y D
Sbjct: 243 -------CPTQCKIASMSMSKYKAKNTVYIS-----GINNIKTAIMTYGSVQAGFTVYRD 290
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+G+YKH+ LG HA+ +IG+G EG S+ YWL ANS+ NWG +G F+
Sbjct: 291 LTGYKSGVYKHIENTVLGGHAVALIGFGV----EGGSN---YWLAANSWGPNWGMSGYFK 343
Query: 335 IG 336
I
Sbjct: 344 IA 345
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 58/97 (59%), Gaps = 9/97 (9%)
Query: 397 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 456
I I +G V+ T+Y D+ YK+G+YKH+ LG HA+ +IG+G EG S+
Sbjct: 271 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHIENTVLGGHAVALIGFGV----EGGSN-- 324
Query: 457 KYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
YWL ANS+ NWG +G F+I +G E GIE + AG
Sbjct: 325 -YWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 358
>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
Length = 325
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 76/241 (31%), Positives = 111/241 (46%), Gaps = 34/241 (14%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
++P FDAR W C + IRDQ +CG+ WA A ++ R+CIA+ G+ +V LS +
Sbjct: 102 DIPMNFDARTQWRGC--VPAIRDQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQ 159
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQD 215
V C CQGG+ +W + TG C PY + +G+
Sbjct: 160 VQC-DTMNKACQGGYLKYSWTFLENTG-------TPLDTCIPYASGRGTFSSGT------ 205
Query: 216 NEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 275
C +C+ + R + I I +G V+ T+Y D+
Sbjct: 206 -------CPTQCKIASMSMSKYKAKNTRYITGI----NNIKTAIMTYGSVQAGFTVYRDL 254
Query: 276 ILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
YK+G+YKHV LG HA+ +IG+G EG S+ YWL ANS+ NWG +G F+I
Sbjct: 255 TGYKSGVYKHVVSTVLGGHAVALIGFGV----EGGSN---YWLAANSWGANWGMSGYFKI 307
Query: 336 G 336
Sbjct: 308 A 308
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 9/97 (9%)
Query: 397 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVV 456
I I +G V+ T+Y D+ YK+G+YKHV LG HA+ +IG+G EG S+
Sbjct: 234 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV----EGGSN-- 287
Query: 457 KYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493
YWL ANS+ NWG +G F+I +G E GIE + AG
Sbjct: 288 -YWLAANSWGANWGMSGYFKIAQG--EGGIENQVYAG 321
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.137 0.435
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,107,801,684
Number of Sequences: 23463169
Number of extensions: 418509154
Number of successful extensions: 777607
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3198
Number of HSP's successfully gapped in prelim test: 3436
Number of HSP's that attempted gapping in prelim test: 751687
Number of HSP's gapped (non-prelim): 16802
length of query: 524
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 377
effective length of database: 8,910,109,524
effective search space: 3359111290548
effective search space used: 3359111290548
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)