BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy8713
(309 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 120/268 (44%), Positives = 160/268 (59%), Gaps = 47/268 (17%)
Query: 44 NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
++ N+ ++LK G L + P+ + ++E D LPA+FD+R +WP CPTI+EI
Sbjct: 45 HNFYNVDMSYLKRLCGTF----LGGPKPPQRVMFTE-DLKLPASFDAREQWPQCPTIKEI 99
Query: 104 RDQGSCGSCWGCRPYEIAPCEH--HVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLN 161
RDQGSCGSCW E HVNG+RP C +G TPKC + C+ Y YK+D +
Sbjct: 100 RDQGSCGSCWAFGAVEAISDRICIHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKH 158
Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
+G SYSVS++EK IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 159 YGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------- 201
Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWG
Sbjct: 202 --------------------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWG 239
Query: 282 DNGLFKILRGKDECGIESSITAGVPKLD 309
DNG FKILRG+D CGIES + AG+P+ D
Sbjct: 240 DNGFFKILRGQDHCGIESEVVAGIPRTD 267
>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
Length = 344
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 113/210 (53%), Positives = 133/210 (63%), Gaps = 39/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I G GS GCRPYEIAPCEHHVNGTRP CD G TP C ECQ++YDV YK
Sbjct: 174 TRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCDGEHGKTPSCRHECQKSYDVDYKT 233
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +FG+KSYSV N K I KEI ++GPVEGAFTV++DLILYK G +
Sbjct: 234 DKHFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYKDGVY-------------- 279
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ G+ LGGHAIRILGWG + K+ YWLIANSWNT
Sbjct: 280 -----------------------QHVHGRELGGHAIRILGWGVENKT--PYWLIANSWNT 314
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWG+NG FK+LRG+D CGIES+I AG+PK+
Sbjct: 315 DWGNNGFFKMLRGEDHCGIESAIAAGLPKV 344
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 38/75 (50%), Positives = 47/75 (62%), Gaps = 3/75 (4%)
Query: 43 KNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEV---DEDLPANFDSRTKWPNCPT 99
+N ++PR+H + MGVHPD + L+ EV D D+P FD+R WPNCPT
Sbjct: 48 RNYDKSVPRSHFRRLMGVHPDAHKFTLHEKSLVLGEEVGLADSDVPEEFDARKAWPNCPT 107
Query: 100 IREIRDQGSCGSCWG 114
I EIRDQGSCGSCW
Sbjct: 108 IGEIRDQGSCGSCWA 122
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/32 (78%), Positives = 26/32 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 157 CGFGCNGGFPGAAWAYWTRKGIVSGGPYGSSQ 188
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
Length = 337
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 110/202 (54%), Positives = 133/202 (65%), Gaps = 39/202 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GC+PY IAPCEHHVNGTRPSC+ G TPKCV++CQE+Y+VPY+KD FGA S
Sbjct: 175 GPFGSNLGCQPYAIAPCEHHVNGTRPSCEGEGGKTPKCVKKCQESYNVPYQKDKRFGASS 234
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YS++ +E I KEI +GPVEGAFTV++DL+ YK G +
Sbjct: 235 YSIARHEAQIQKEIMTNGPVEGAFTVYEDLLHYKEGVY---------------------- 272
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ +GK LGGHAIRILGWG + + KYWLIANSWN+DWGDNG F
Sbjct: 273 ---------------QHVTGKMLGGHAIRILGWGVENGT--KYWLIANSWNSDWGDNGFF 315
Query: 287 KILRGKDECGIESSITAGVPKL 308
KILRG+D GIESSI+AG+PKL
Sbjct: 316 KILRGEDHLGIESSISAGLPKL 337
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 41/83 (49%), Gaps = 3/83 (3%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CGFGCNGGFPG AW YWV+ G+VSGG +GS + H+ G P
Sbjct: 150 CGFGCNGGFPGAAWSYWVRKGLVSGGPFGSNLGCQPYAIAPCEHHVN---GTRPSCEGEG 206
Query: 69 NRLPELIGYSEVDEDLPANFDSR 91
+ P+ + + ++P D R
Sbjct: 207 GKTPKCVKKCQESYNVPYQKDKR 229
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
Length = 351
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 124/299 (41%), Positives = 149/299 (49%), Gaps = 91/299 (30%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG GCNGG+P AW +WV G+VSGG Y S + I + + V D+ P
Sbjct: 144 CGMGCNGGYPSSAWNFWVSDGLVSGGLYDSH------IGRIQVSLCVLLLAVDRDFVSP- 196
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
GCRPY I PCEHHVN
Sbjct: 197 ---------------------------------------------GCRPYTIPPCEHHVN 211
Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
G+RPSC G TP+C+ C+ Y YK+D +FG SYSVSS E I +EIY++GPVEG
Sbjct: 212 GSRPSCSGEGGDTPECIFRCEAGYSPSYKQDKHFGKTSYSVSSEEDEIKQEIYKNGPVEG 271
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
AFTV++D +LYKSG + + SG A
Sbjct: 272 AFTVYEDFVLYKSGVY-------------------------------------QHVSGSA 294
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
LGGHAI++LGWGE+ + YWL ANSWNTDWGDNG FKILRG D CGIES I AG PK
Sbjct: 295 LGGHAIKMLGWGEE--NGVPYWLCANSWNTDWGDNGFFKILRGADHCGIESEIVAGNPK 351
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 45/71 (63%), Gaps = 5/71 (7%)
Query: 44 NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
++ N+ +++K G L +LP +I Y+ D LP FDSR +WPNCPT++EI
Sbjct: 44 HNFHNVDYSYVKKLCGTL----LKGPKLPLMIRYAG-DIKLPKEFDSREQWPNCPTLKEI 98
Query: 104 RDQGSCGSCWG 114
RDQGSCGSCW
Sbjct: 99 RDQGSCGSCWA 109
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
Length = 338
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 115/210 (54%), Positives = 131/210 (62%), Gaps = 39/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I G GS GCRPYEIAPCEHHVNGTRP C S G TP C +CQ +Y V Y K
Sbjct: 168 TRKGIVSGGPYGSTQGCRPYEIAPCEHHVNGTRPPC--SHGSTPSCQHKCQASYSVEYAK 225
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D NFG+KSYSV N I +EI +GPVEGAFTV++DLILYKSG +
Sbjct: 226 DKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYEDLILYKSGVY-------------- 271
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
++ GK LGGHAIRILGWG +SK YWLI NSWNT
Sbjct: 272 -----------------------QHEHGKELGGHAIRILGWGVWGESKVPYWLIGNSWNT 308
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGDNG F+ILRG+D CGIESSI+AG+PKL
Sbjct: 309 DWGDNGFFRILRGQDHCGIESSISAGLPKL 338
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 3/77 (3%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
Q +N ++ +++ MGVHPD + LP R+ Y++ D+P FD+R WPN
Sbjct: 39 QVGRNFKESVSEEYIRGLMGVHPDAHKFALPEKRIVLGDLYADDGVDIPEEFDARKAWPN 98
Query: 97 CPTIREIRDQGSCGSCW 113
CPTI EIRDQGSCGSCW
Sbjct: 99 CPTIGEIRDQGSCGSCW 115
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 27/34 (79%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
+CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 149 HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQ 182
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
Length = 335
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/202 (53%), Positives = 130/202 (64%), Gaps = 39/202 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GC+PY IAPCEHHVNGTRPSC+ G TPKCV++CQ++Y VPY KD +G+KS
Sbjct: 173 GPFGSNLGCQPYAIAPCEHHVNGTRPSCEGEGGKTPKCVKKCQDSYTVPYAKDKRYGSKS 232
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YS+ +E I KEI +GPVEGAFTV++DL+ YK G +
Sbjct: 233 YSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEGVY---------------------- 270
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ +GK LGGHAIRILGWG + + KYWLIANSWN+DWGDNG F
Sbjct: 271 ---------------QHVTGKMLGGHAIRILGWGVENNT--KYWLIANSWNSDWGDNGFF 313
Query: 287 KILRGKDECGIESSITAGVPKL 308
KILRG+D GIESSI AG+PKL
Sbjct: 314 KILRGEDHLGIESSIAAGLPKL 335
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/30 (76%), Positives = 25/30 (83%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGCNGGFPG AW YWV G+VSGG +GS
Sbjct: 148 CGFGCNGGFPGAAWSYWVHKGLVSGGPFGS 177
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
Length = 340
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 114/210 (54%), Positives = 131/210 (62%), Gaps = 39/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I G+ GS GCRPYEI PCEHHVNGTRP C S G TP+C C+ +Y V YKK
Sbjct: 170 TRKGIVSGGNFGSQQGCRPYEIEPCEHHVNGTRPPC--SSGSTPRCQHVCESSYKVDYKK 227
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D NFG+KSYS+ +N I KEI +GPVEGAFTV++DLILYKSG +
Sbjct: 228 DKNFGSKSYSIKNNVLDIQKEIMNNGPVEGAFTVYEDLILYKSGVY-------------- 273
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ GK LGGHAIRILGWG K YWLIANSWNT
Sbjct: 274 -----------------------EHVHGKELGGHAIRILGWGVWGDEKIPYWLIANSWNT 310
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGDNG F+I+RGKD CGIESSI+AG+PKL
Sbjct: 311 DWGDNGFFRIVRGKDHCGIESSISAGLPKL 340
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 7/77 (9%)
Query: 43 KNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIG--YSEVDEDLPANFDSRTKWPNC 97
+N ++ +++ MGVHPD + LP E++G + D D+P FD+R KW NC
Sbjct: 44 RNFHESVSEKYIRGLMGVHPDADKFALPDKM--EVLGKLVEDSDSDIPTEFDAREKWSNC 101
Query: 98 PTIREIRDQGSCGSCWG 114
PTI EIRDQGSCGSCW
Sbjct: 102 PTIGEIRDQGSCGSCWA 118
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/33 (72%), Positives = 27/33 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGCNGGFPG AW YW + GIVSGG +GS+Q
Sbjct: 153 CGFGCNGGFPGAAWSYWTRKGIVSGGNFGSQQG 185
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
Length = 340
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 132/334 (39%), Positives = 166/334 (49%), Gaps = 107/334 (32%)
Query: 43 KNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+N + + H+++ MGVHPD + LP R EL+G D+DLP FDS WPNCPT
Sbjct: 46 RNFDAAVSEHHIRALMGVHPDSHKFTLPEKR--ELLGADGEDKDLPEEFDSSKNWPNCPT 103
Query: 100 IREIRDQGSCGSCWGCRPYE----------------------IAPCEHH----VNGTRP- 132
IREIRDQGSCGSCW E + C H NG P
Sbjct: 104 IREIRDQGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDLVTCCHTCGFGCNGGFPG 163
Query: 133 ---------------SCDASKGHTPKCVRECQENYDVP---------------------- 155
S ++++G P V C+ + D P
Sbjct: 164 AAWSYWTTRGIVSGGSYNSTEGCRPYEVEPCEHHVDGPRPPCHSGSTPHCKHQCQPNYSV 223
Query: 156 -YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
Y+KD +FGA SYS++ N ++I +EI +GPVEGAFTV++DLILYK+G +
Sbjct: 224 DYEKDKHFGASSYSINRNPRNIQREIMTNGPVEGAFTVYEDLILYKTGVY---------- 273
Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
+ GK LGGHAIRI+GWG +SK YWLIAN
Sbjct: 274 ---------------------------QHVHGKQLGGHAIRIIGWGVWGESKVPYWLIAN 306
Query: 275 SWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
SWNTDWGDNG F+ILRGKD CGIES I+AG+PKL
Sbjct: 307 SWNTDWGDNGFFRILRGKDHCGIESQISAGLPKL 340
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 25/32 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGGFPG AW YW GIVSGG+Y S +
Sbjct: 153 CGFGCNGGFPGAAWSYWTTRGIVSGGSYNSTE 184
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 108/206 (52%), Positives = 131/206 (63%), Gaps = 39/206 (18%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G GS GCRPYEIAPCEHHVNGTRP C+ G TP+C +CQ +Y V YK D +F
Sbjct: 174 IVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHF 233
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G+++YS+S N I +EI HGPVEGAFTV++DLILYK G
Sbjct: 234 GSRAYSISKNVHDIQEEIMTHGPVEGAFTVYEDLILYKDG-------------------- 273
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
V++ + GK LGGHAIRI+GWG ++ YWL+ANSWNTDWG+
Sbjct: 274 -------------VYEHV----HGKELGGHAIRIIGWGVEKDI--PYWLVANSWNTDWGN 314
Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
NG FKILRGKD CGIESSI+AG+PK+
Sbjct: 315 NGFFKILRGKDHCGIESSISAGLPKI 340
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/32 (81%), Positives = 27/32 (84%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGGFPG AW YWV+ GIVSGG YGS Q
Sbjct: 153 CGFGCNGGFPGAAWSYWVRKGIVSGGPYGSSQ 184
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
Length = 334
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 107/205 (52%), Positives = 129/205 (62%), Gaps = 39/205 (19%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I GS GS GCRPYEIAPCEHHVNGTRP C TP C ++C++ Y+VPYKKD NF
Sbjct: 166 IVSGGSFGSNQGCRPYEIAPCEHHVNGTRPPCTGDDNKTPSCKQQCEKGYNVPYKKDKNF 225
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G ++YS+SS + I KEI +GPVEGAF V++DL+ YK G +
Sbjct: 226 GKEAYSISSEVQQIQKEIMTNGPVEGAFEVYEDLLSYKKGVY------------------ 267
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
+ G+ALGGHAIRILGWG ++ + YWLIANSWN+DWGD
Sbjct: 268 -------------------QHVKGEALGGHAIRILGWGTEKGT--PYWLIANSWNSDWGD 306
Query: 283 NGLFKILRGKDECGIESSITAGVPK 307
NG FKILRG+D CGIESSI AG+PK
Sbjct: 307 NGTFKILRGEDHCGIESSIVAGIPK 331
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/32 (75%), Positives = 26/32 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGGFPG AW YWV GIVSGG++GS Q
Sbjct: 145 CGMGCNGGFPGAAWHYWVNKGIVSGGSFGSNQ 176
>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 108/206 (52%), Positives = 132/206 (64%), Gaps = 39/206 (18%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G GS GCRPYEIAPCEHHVNGTRP C+ G TP+C +CQ +Y V YK D +F
Sbjct: 174 IVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHF 233
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G+++YS+S N + I EI +GPVEGAFTV++DLILYK G
Sbjct: 234 GSRAYSISKNVRDIQGEIMTNGPVEGAFTVYEDLILYKDG-------------------- 273
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
V++ + GK LGGHAIRI+GWG ++ + YWLIANSWNTDWG+
Sbjct: 274 -------------VYEHV----HGKELGGHAIRIIGWGVEKDT--PYWLIANSWNTDWGN 314
Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
NG FKILRGKD CGIESSI+AG+PK+
Sbjct: 315 NGFFKILRGKDHCGIESSISAGLPKI 340
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 27/33 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGCNGGFPG AW YWV+ GIVSGG YGS Q
Sbjct: 153 CGFGCNGGFPGAAWGYWVRKGIVSGGPYGSSQG 185
>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
Length = 340
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 106/202 (52%), Positives = 131/202 (64%), Gaps = 39/202 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GC+PY IAPCEHHVNG+RPSC+ G TPKCV++CQ +Y+VPY KD +G S
Sbjct: 178 GPFGSDQGCQPYAIAPCEHHVNGSRPSCEGEGGKTPKCVKKCQASYNVPYAKDKMYGKSS 237
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YS++++EK I KEI +GPVEGAFTV++DL+ YK G +
Sbjct: 238 YSIANHEKQIQKEIMTNGPVEGAFTVYEDLLNYKEGVYH--------------------- 276
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ GK LGGHAIRILGWG ++ + KYWLIANSWN+DWGDNG F
Sbjct: 277 ----------------HVHGKMLGGHAIRILGWGVEDGT--KYWLIANSWNSDWGDNGFF 318
Query: 287 KILRGKDECGIESSITAGVPKL 308
KILRG+D GIESSI AG+PK+
Sbjct: 319 KILRGEDHLGIESSIAAGLPKV 340
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/32 (75%), Positives = 27/32 (84%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGGFPG AW YWV+ G+VSGG +GS Q
Sbjct: 153 CGFGCNGGFPGAAWSYWVRKGLVSGGPFGSDQ 184
>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
Length = 340
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/210 (52%), Positives = 129/210 (61%), Gaps = 38/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I G GS GCRPYEI+PCEHHVNGTRP C A G TPKC CQ Y V Y K
Sbjct: 169 TRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPC-AHGGRTPKCSHVCQSGYTVDYAK 227
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +FG+KSYSV N + I +EI +GPVEGAFTV++DLILYK G +
Sbjct: 228 DKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 273
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
++ GK LGGHAIRILGWG + K YWLI NSWNT
Sbjct: 274 -----------------------QHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNT 310
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGD+G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 311 DWGDHGFFRILRGQDHCGIESSISAGLPKL 340
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/33 (75%), Positives = 26/33 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 152 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQG 184
>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
Length = 330
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 111/210 (52%), Positives = 129/210 (61%), Gaps = 38/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I G GS GCRPYEI+PCEHHVNGTRP C A G TPKC CQ Y V Y K
Sbjct: 159 TRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPC-AHGGRTPKCSHVCQSGYTVDYAK 217
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +FG+KSYSV N + I +EI +GPVEGAFTV++DLILYK G +
Sbjct: 218 DKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 263
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
++ GK LGGHAIRILGWG + K YWLI NSWNT
Sbjct: 264 -----------------------QHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNT 300
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGD+G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 301 DWGDHGFFRILRGQDHCGIESSISAGLPKL 330
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/33 (75%), Positives = 26/33 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 142 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQG 174
>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
Length = 340
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 111/210 (52%), Positives = 128/210 (60%), Gaps = 38/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I G GS GCRPYEIAPCEHHVNGTRP C G TPKC C+ Y V Y K
Sbjct: 169 TRKGIVSGGPYGSNQGCRPYEIAPCEHHVNGTRPPCGHGGG-TPKCSHVCESGYTVDYAK 227
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +FG+KSYSV N + I +EI +GPVEGAFTV++DLILYK G +
Sbjct: 228 DKHFGSKSYSVKRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 273
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
++ GK LGGHAIRILGWG + K YWLI NSWNT
Sbjct: 274 -----------------------QHQHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNT 310
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGDNG F+ILRG+D CGIESSI+AG+PKL
Sbjct: 311 DWGDNGFFRILRGQDHCGIESSISAGLPKL 340
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/33 (75%), Positives = 26/33 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 152 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQG 184
>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
Length = 340
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 111/210 (52%), Positives = 128/210 (60%), Gaps = 38/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I G GS GCRPYEI+PCEHHVNGTRP C A G TPKC CQ +Y V Y K
Sbjct: 169 TRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPC-AHGGATPKCSHVCQSSYTVDYAK 227
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +FG+KSYSV N + I +EI +GPVEGAFTV++DLILYK G +
Sbjct: 228 DKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 273
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
++ GK LGGHAIRILGWG K YWLI NSWNT
Sbjct: 274 -----------------------QHEHGKELGGHAIRILGWGVWGDEKIPYWLIGNSWNT 310
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGD G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 311 DWGDQGFFRILRGQDHCGIESSISAGLPKL 340
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/32 (78%), Positives = 26/32 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 152 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQ 183
>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
Length = 340
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 111/210 (52%), Positives = 129/210 (61%), Gaps = 38/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I G GS GCRPYEI+PCEHHVNGTRP C A G TPKC CQ +Y V Y K
Sbjct: 169 TRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPC-AHGGGTPKCSHVCQSSYTVDYAK 227
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +FG+KSYSV N + I +EI +GPVEGAFTV++DLILYK G +
Sbjct: 228 DKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 273
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
++ GK LGGHAIRILGWG K YWLI NSWNT
Sbjct: 274 -----------------------QHEHGKELGGHAIRILGWGVWGDEKIPYWLIGNSWNT 310
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGD+G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 311 DWGDHGFFRILRGQDHCGIESSISAGLPKL 340
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/33 (75%), Positives = 26/33 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 152 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQG 184
>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
Length = 338
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 107/210 (50%), Positives = 131/210 (62%), Gaps = 39/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I GS GS GCRPYE+ PCEHHVNGTRP C + G TP+C+ +C+ Y V Y K
Sbjct: 168 THKGIVSGGSYGSKEGCRPYEVEPCEHHVNGTRPPCHS--GSTPRCMHKCESGYSVDYAK 225
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +FGAK+YSV+ N I +EI +GPVEGAFTV++DLILYK+G +
Sbjct: 226 DKHFGAKAYSVNRNPLDIQREIMTNGPVEGAFTVYEDLILYKTGVY-------------- 271
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ G+ LGGHAIRILGWG +K YWLI NSWNT
Sbjct: 272 -----------------------QHVHGRQLGGHAIRILGWGVWGDNKVPYWLIGNSWNT 308
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGDNG F+ILRG+D CGIES+I+AG+PKL
Sbjct: 309 DWGDNGFFRILRGEDHCGIESAISAGLPKL 338
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/75 (50%), Positives = 48/75 (64%), Gaps = 5/75 (6%)
Query: 43 KNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIGYSEVDE-DLPANFDSRTKWPNCP 98
+N +++ H++ MGVHPD + LP + L E D DLP FD+RT WP+CP
Sbjct: 42 RNFDASVSEHHIRGLMGVHPDAHKFTLP-EKSQVLGNLMEADGGDLPEEFDARTAWPDCP 100
Query: 99 TIREIRDQGSCGSCW 113
TI EIRDQGSCGSCW
Sbjct: 101 TIGEIRDQGSCGSCW 115
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/32 (78%), Positives = 27/32 (84%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGGFPG AW YW GIVSGG+YGSK+
Sbjct: 151 CGFGCNGGFPGAAWSYWTHKGIVSGGSYGSKE 182
>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
Length = 168
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 103/201 (51%), Positives = 127/201 (63%), Gaps = 39/201 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GC PY+IAPCEHHVNGTRP+C+ +G TPKC++ CQ +Y V Y++D ++GAKS
Sbjct: 7 GPFGSNQGCHPYKIAPCEHHVNGTRPACNGEEGKTPKCIKHCQASYTVAYEQDKSYGAKS 66
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSV + I KEI +GPVEGAFTV++DL+ YK G +
Sbjct: 67 YSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYKDGVY---------------------- 104
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ +GK LGGHAIRILGWG + + YWLIANSWNTDWG+NG F
Sbjct: 105 ---------------QHVTGKMLGGHAIRILGWGVE--NDVPYWLIANSWNTDWGNNGFF 147
Query: 287 KILRGKDECGIESSITAGVPK 307
KILRG D CGIES I+AG+PK
Sbjct: 148 KILRGSDHCGIESQISAGIPK 168
>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
Length = 340
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 110/210 (52%), Positives = 128/210 (60%), Gaps = 38/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I G GS GCRPYEI+PCEHHVNGTRP C G TPKC CQ +Y V Y K
Sbjct: 169 TRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCANGSG-TPKCSHVCQSSYTVDYAK 227
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +FG+KSYSV N + I +EI +GPVEGAFTV++DLILYK G +
Sbjct: 228 DKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 273
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
++ GK LGGHAIRILGWG K YWLI NSWNT
Sbjct: 274 -----------------------QHEHGKELGGHAIRILGWGVWGNEKIPYWLIGNSWNT 310
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGD+G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 311 DWGDHGFFRILRGQDHCGIESSISAGLPKL 340
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/33 (75%), Positives = 26/33 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 152 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQG 184
>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
Length = 330
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 102/199 (51%), Positives = 125/199 (62%), Gaps = 40/199 (20%)
Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV 169
GS GCRPY IAPCEHHVNGTRP C +G TPKCV EC Y YKKD FG ++YSV
Sbjct: 172 GSNIGCRPYSIAPCEHHVNGTRPPC-TGEGDTPKCVSECNAGYTPSYKKDKRFGKQTYSV 230
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
E+ IM E+Y++GPVE AF+V++D +LYK+G +
Sbjct: 231 PPKEQQIMTELYKNGPVEAAFSVYEDFLLYKTGVY------------------------- 265
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
+ +G+ LGGHAI+ILGWG++ + YWL+ANSWNTDWGDNG FKIL
Sbjct: 266 ------------QHVTGQMLGGHAIKILGWGKENNT--PYWLVANSWNTDWGDNGFFKIL 311
Query: 290 RGKDECGIESSITAGVPKL 308
RGKDECGIES I AG+P+L
Sbjct: 312 RGKDECGIESEIVAGIPRL 330
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GC GGFP AW YW +SG+V+GG YGS
Sbjct: 144 CGMGCMGGFPSAAWDYWAESGLVTGGLYGS 173
>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 328
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 100/201 (49%), Positives = 126/201 (62%), Gaps = 39/201 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G G+ GCRPYEI PCEHH NG+RP+CDAS+G+TPKC + C+ NY + Y DL+FG+K+
Sbjct: 167 GQYGTKQGCRPYEIPPCEHHTNGSRPACDASEGNTPKCAKSCESNYKINYSNDLHFGSKA 226
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YS+SS+ K I EI ++GPVEGAF+V+ D + YK+G +
Sbjct: 227 YSISSDVKQIQAEILQNGPVEGAFSVYADFVNYKTGVY---------------------- 264
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ G+ LGGHAIRI GWG + + YWLIANSWNTDWGD+G F
Sbjct: 265 ---------------QHIKGQFLGGHAIRIFGWGVENNT--PYWLIANSWNTDWGDSGTF 307
Query: 287 KILRGKDECGIESSITAGVPK 307
KILRG D CGIES I AG+PK
Sbjct: 308 KILRGSDHCGIESGIVAGLPK 328
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 33/74 (44%), Positives = 48/74 (64%), Gaps = 3/74 (4%)
Query: 41 AEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTI 100
A +N + ++ MGV PD+ N +P ++ + ++PA+FD+R +WP+CPTI
Sbjct: 37 AGRNFAQDKSMDYIIKLMGVLPDHK---NYMPPVLTHKLEALEIPADFDARQQWPHCPTI 93
Query: 101 REIRDQGSCGSCWG 114
REIRDQGSCGSCW
Sbjct: 94 REIRDQGSCGSCWA 107
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 27/32 (84%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGG+PG AW YWV+ G+VSGG YG+KQ
Sbjct: 142 CGMGCNGGYPGAAWHYWVRKGLVSGGQYGTKQ 173
>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
Length = 342
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 109/210 (51%), Positives = 129/210 (61%), Gaps = 38/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I G GS GCRPYEIAPCEHHVNGTR C+ TPKC +C+ Y+V Y K
Sbjct: 170 TRKGIVSGGRYGSKTGCRPYEIAPCEHHVNGTRAPCNHDS-KTPKCQHQCEAGYNVEYSK 228
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +FG+KSYSV N + I +EI +GPVEGAFTV++DLILYKSG +
Sbjct: 229 DKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKSGVY-------------- 274
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
++ GK LGGHAIRILGWG K + YWLIANSWN
Sbjct: 275 -----------------------QHEHGKELGGHAIRILGWGVWGKEEVPYWLIANSWND 311
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGD G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 312 DWGDKGFFRILRGEDHCGIESSISAGLPKL 341
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/77 (50%), Positives = 50/77 (64%), Gaps = 2/77 (2%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPD-YNLPANRLPELIGY-SEVDEDLPANFDSRTKWPNC 97
QA +N + +++ MGVHPD Y E++GY S+ +D+P FD+R KWPNC
Sbjct: 42 QAGRNFDEGVSEEYIRGLMGVHPDAYKFALPDKQEVLGYLSQKVDDIPKEFDAREKWPNC 101
Query: 98 PTIREIRDQGSCGSCWG 114
PTI EIRDQGSCGSCW
Sbjct: 102 PTINEIRDQGSCGSCWA 118
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/31 (80%), Positives = 26/31 (83%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CGFGCNGGFPG AW YW + GIVSGG YGSK
Sbjct: 153 CGFGCNGGFPGAAWSYWTRKGIVSGGRYGSK 183
>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
Length = 342
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 107/210 (50%), Positives = 127/210 (60%), Gaps = 39/210 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I GS S GCRPYEI PCEHHVNGTRP C G TP C +C+ +Y V Y K
Sbjct: 172 THKGIVSGGSYNSNEGCRPYEIEPCEHHVNGTRPPC--KNGRTPSCKHQCESSYSVDYAK 229
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +FG+KSYS+ N + I +EI +GPVEGAFTV++DLILYKSG +
Sbjct: 230 DKHFGSKSYSIRRNPREIQREIMTNGPVEGAFTVYEDLILYKSGVY-------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ GK LGGHAIRILGWG SK YWLI NSWNT
Sbjct: 276 -----------------------KHVHGKELGGHAIRILGWGVWGDSKVPYWLIGNSWNT 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGDNG F+I+RG+D CGIES+I+AG+P L
Sbjct: 313 DWGDNGFFRIVRGEDHCGIESAISAGLPAL 342
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/76 (51%), Positives = 52/76 (68%), Gaps = 7/76 (9%)
Query: 43 KNSLSNIPRAHLKSWMGVHPD---YNLP--ANRLPELIGYSEVDEDLPANFDSRTKWPNC 97
+N +++ H++ MGVHPD + LP + L L+G + +DLP +FD+RT WPNC
Sbjct: 46 RNFDASVSEGHIRGLMGVHPDAHKFTLPEKSQVLGNLVG--DDGDDLPESFDARTAWPNC 103
Query: 98 PTIREIRDQGSCGSCW 113
PTI EIRDQGSCGSCW
Sbjct: 104 PTIGEIRDQGSCGSCW 119
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/33 (69%), Positives = 25/33 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGCNGGFPG AW YW GIVSGG+Y S +
Sbjct: 155 CGFGCNGGFPGAAWSYWTHKGIVSGGSYNSNEG 187
>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
Length = 340
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 101/196 (51%), Positives = 122/196 (62%), Gaps = 39/196 (19%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNGTRP C G TPKC + C+ Y YK+D +FG SYSVSSNE
Sbjct: 178 GCRPYSIPPCEHHVNGTRPQCTGEGGDTPKCSKTCEPGYSPSYKEDKHFGYDSYSVSSNE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D ++YK+G
Sbjct: 238 KEIMAEIYKNGPVEGAFTVFSDFLMYKTG------------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
V+ L +G+ LGGHAIRILGWG++ + YWL+ NSWN DWGD+G FKI+RG+D
Sbjct: 267 --VYKHL----AGEMLGGHAIRILGWGKE--NGVPYWLVGNSWNVDWGDSGFFKIVRGED 318
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P+ D
Sbjct: 319 HCGIESEIVAGIPRTD 334
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW+YW K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSAAWKYWTKKGLVSGGLYDS 175
>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
Length = 333
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 100/194 (51%), Positives = 120/194 (61%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP+C +G TPKCV++C+E Y Y D +FG SY V ++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPACKGEEGDTPKCVKQCEEGYSPAYGTDKHFGTTSYGVPTSE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF V+ D LYKSG +
Sbjct: 238 KEIMAEIYKNGPVEGAFLVYADFPLYKSGVY----------------------------- 268
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G+ LGGHAI+ILGWG + + YWL ANSWNTDWGDNG FKILRGKD
Sbjct: 269 --------QHETGEELGGHAIKILGWGVENGT--PYWLCANSWNTDWGDNGFFKILRGKD 318
Query: 294 ECGIESSITAGVPK 307
CGIES I AGVPK
Sbjct: 319 HCGIESEIVAGVPK 332
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 24/30 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW++W ++G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWQFWTETGLVSGGLYDS 175
>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
Length = 330
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 98/194 (50%), Positives = 121/194 (62%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY IAPCEHHVNG+RP C G TP+CVR+C+ Y Y +D ++G SYSV S+E
Sbjct: 176 GCRPYTIAPCEHHVNGSRPPCTGEGGDTPECVRQCESGYTPSYIQDKHYGKTSYSVPSDE 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EIY++GPVEGAFTV++D +LYK+G +
Sbjct: 236 QQIQTEIYKNGPVEGAFTVYEDFLLYKTGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG A+GGHAI++LGWGE+ + YWL ANSWNTDWGDNG FKILRG D
Sbjct: 267 --------QHVSGSAVGGHAIKVLGWGEENGT--PYWLCANSWNTDWGDNGYFKILRGSD 316
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+PK
Sbjct: 317 HCGIESEIVAGIPK 330
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWASEGLVSGGLYES 173
>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
Length = 334
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 103/202 (50%), Positives = 130/202 (64%), Gaps = 40/202 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GC+PY I+PCEHHVNGTR C+ +G TPKCV++CQ +Y+VPY KD FG S
Sbjct: 173 GPYGSDQGCQPYAISPCEHHVNGTRGPCNG-EGKTPKCVKKCQASYNVPYAKDKFFGKSS 231
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YS++S+E+ I KE++ +GPVEGAFTV++DL+ YK G +
Sbjct: 232 YSIASHEQQIQKELFTNGPVEGAFTVYEDLLNYKEGVY---------------------- 269
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ +GK LGGHAIRILGWG + + K+WLIANSWN+DWGDNG F
Sbjct: 270 ---------------QHTAGKMLGGHAIRILGWGVENDT--KFWLIANSWNSDWGDNGYF 312
Query: 287 KILRGKDECGIESSITAGVPKL 308
KILRG D GIESSI AG+PK+
Sbjct: 313 KILRGSDHLGIESSIAAGLPKV 334
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/33 (75%), Positives = 27/33 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGCNGGFPG AW YWV+ G+VSGG YGS Q
Sbjct: 148 CGFGCNGGFPGAAWSYWVRKGLVSGGPYGSDQG 180
>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 97/193 (50%), Positives = 122/193 (63%), Gaps = 39/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNGTRP C +G TP+C +C+ Y YK+D +FG +SYSV S+E
Sbjct: 176 GCRPYSIPPCEHHVNGTRPPCKGEEGDTPQCTNQCEPGYTPGYKQDKHFGKRSYSVPSDE 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IMKE+Y++GPVEGAFTV++D +LYKSG +
Sbjct: 236 KEIMKELYKNGPVEGAFTVYEDFLLYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG A+GGHAI++LGWGE+ YWL ANSWNTDWG+NG FKI+RG+D
Sbjct: 267 --------RHVSGSAVGGHAIKVLGWGEE--GGIPYWLAANSWNTDWGENGFFKIVRGED 316
Query: 294 ECGIESSITAGVP 306
CGIES + AG+P
Sbjct: 317 HCGIESEMVAGIP 329
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P A +W K G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAACDFWTKEGLVSGGLYDS 173
>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
Length = 333
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 99/194 (51%), Positives = 121/194 (62%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP+C +G TPKCV++C++ Y Y D +FGA SY V S+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPACKGEEGDTPKCVKQCEDGYAPVYGSDKHFGATSYGVPSSE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF V+ D +YKSG +
Sbjct: 238 KEIMAEIYKNGPVEGAFLVYADFPMYKSGVY----------------------------- 268
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G+ LGGHAI+ILGWG + + YWL ANSWNTDWGDNG FKILRGKD
Sbjct: 269 --------QHETGEELGGHAIKILGWGVENGT--PYWLCANSWNTDWGDNGFFKILRGKD 318
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+PK
Sbjct: 319 HCGIESEIVAGIPK 332
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 24/30 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW++W ++G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWKFWTETGLVSGGLYDS 175
>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
Length = 333
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 99/194 (51%), Positives = 120/194 (61%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RPSC +G TPKC++ C+E Y Y D +FGA SY V S+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPSCKGEEGDTPKCMKTCEEGYTPAYGSDKHFGATSYGVPSSE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM +IY++GPVEGAF V+ D LYKSG +
Sbjct: 238 KEIMADIYKNGPVEGAFVVYADFPLYKSGVY----------------------------- 268
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G+ LGGHAI+ILGWG + + YWL ANSWNTDWGDNG FKILRGKD
Sbjct: 269 --------QHETGEELGGHAIKILGWGVENGT--PYWLCANSWNTDWGDNGFFKILRGKD 318
Query: 294 ECGIESSITAGVPK 307
CGIES + AG+PK
Sbjct: 319 HCGIESEVVAGIPK 332
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 24/30 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AWR+W ++G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWRFWTETGLVSGGLYDS 175
>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
Length = 330
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 100/194 (51%), Positives = 119/194 (61%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TP C C+ Y YK+D +FG SYSV SN+
Sbjct: 176 GCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMSCEPGYSPSYKQDKHFGKTSYSVPSNQ 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IMKE+Y++GPVEGAFTV++D + YKSG +
Sbjct: 236 KDIMKELYKNGPVEGAFTVYEDFLSYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG ALGGHAI+ILGWGE+ + YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 267 --------QHVSGPALGGHAIKILGWGEE--NGVPYWLAANSWNTDWGDNGYFKILRGED 316
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+P+
Sbjct: 317 HCGIESEIVAGIPQ 330
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W G+V+GG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWSSDGLVTGGLYNS 173
>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
Length = 326
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 100/195 (51%), Positives = 123/195 (63%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY IAPCEHHVNGTRP C + TPKC C Y VPYK+D +FG+K Y+V S++
Sbjct: 172 GCRPYSIAPCEHHVNGTRPPCSGEQ-DTPKCTGVCIPKYSVPYKQDKHFGSKVYNVPSDQ 230
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM E+Y +GPVE AFTV++D LYKSG
Sbjct: 231 QQIMTELYTNGPVEAAFTVYEDFPLYKSG------------------------------- 259
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
V+ L +G ALGGHA++ILGWGE+ + +WL+ANSWN+DWGDNG FKILRG D
Sbjct: 260 --VYQHL----TGSALGGHAVKILGWGEENGT--PFWLVANSWNSDWGDNGYFKILRGHD 311
Query: 294 ECGIESSITAGVPKL 308
ECGIES + AG+PKL
Sbjct: 312 ECGIESEMVAGLPKL 326
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 24/30 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGC+GGFP AW YW +SG+V+GG Y S
Sbjct: 140 CGFGCSGGFPAEAWDYWRRSGLVTGGLYNS 169
>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
Complex
Length = 253
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 99/193 (51%), Positives = 121/193 (62%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D +FG SYSV++NE
Sbjct: 99 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 157
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 158 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 188
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 189 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 238
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 239 HCGIESEIVAGMP 251
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/30 (80%), Positives = 28/30 (93%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
LP +FD+R +WPNCPTI+EIRDQGSCGSCW
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCW 30
>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
Length = 339
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 99/196 (50%), Positives = 121/196 (61%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS+NE
Sbjct: 178 GCLPYTIPPCEHHVNGSRPQC-TGEGDTPKCTKSCEAGYSPSYKEDKHYGYTSYSVSNNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL+ANSWN DWGDNGLFKILRG+D
Sbjct: 268 --------KHEAGDIMGGHAIRILGWGVE--NSVPYWLVANSWNVDWGDNGLFKILRGED 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W+K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWIKKGLVSGGLYNS 175
>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
Length = 351
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 190 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 248
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 249 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 279
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 280 --------QHITGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 329
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 330 HCGIESEVVAGIPRTD 345
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 157 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYDS 187
>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
3.2 Angstrom Resolution
gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
Resolution
gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
Angstrom Resolution
Length = 317
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 162 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 220
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 221 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 251
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 252 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 301
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 302 HCGIESEVVAGIPRTD 317
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 129 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 159
>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
Length = 276
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 115 GCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 173
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 174 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 204
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 205 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 254
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 255 HCGIESEVVAGIPRTD 270
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 22/33 (66%)
Query: 6 IRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
I C F CNGG+P AW +W + G+VSGG Y S
Sbjct: 80 ITGCLFSCNGGYPAEAWNFWTRKGLVSGGLYES 112
>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
AltName: Full=Cathepsin B1; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
Length = 339
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
Length = 335
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 98/193 (50%), Positives = 120/193 (62%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D +FG SYSV++NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 318 HCGIESEIVAGMP 330
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 43/83 (51%), Gaps = 12/83 (14%)
Query: 44 NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYS------------EVDEDLPANFDSR 91
+ L N +W H YN+ + + +L G D LP +FD+R
Sbjct: 28 DELVNFVNKQNTTWKAGHNFYNVDLSYVKKLCGTILGGPKLPQRDAFAADVVLPESFDAR 87
Query: 92 TKWPNCPTIREIRDQGSCGSCWG 114
+WPNCPTI+EIRDQGSCGSCW
Sbjct: 88 KQWPNCPTIKEIRDQGSCGSCWA 110
>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
Length = 335
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 98/193 (50%), Positives = 120/193 (62%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D +FG SYSV++NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 318 HCGIESEIVAGMP 330
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 43/83 (51%), Gaps = 12/83 (14%)
Query: 44 NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYS------------EVDEDLPANFDSR 91
+ L N +W H YN+ + + +L G D LP +FD+R
Sbjct: 28 DELVNFVNKQNTTWKAGHNFYNVDLSYVKKLCGAILGGPKLPQRDAFAADVVLPESFDAR 87
Query: 92 TKWPNCPTIREIRDQGSCGSCWG 114
+WPNCPTI+EIRDQGSCGSCW
Sbjct: 88 EQWPNCPTIKEIRDQGSCGSCWA 110
>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
Length = 245
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 84 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 142
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 143 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 173
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 174 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 223
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 224 HCGIESEVVAGIPRTD 239
Score = 45.8 bits (107), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 51 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 81
>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
Length = 339
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
Length = 339
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
Length = 339
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 146 CGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
Length = 339
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
Length = 254
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 99 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 157
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 158 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 188
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 189 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 238
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 239 HCGIESEVVAGIPRTD 254
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 66 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 96
>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
Cathepsin B
Length = 205
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 50 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 108
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 109 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 139
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 140 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 189
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 190 HCGIESEVVAGIPRTD 205
Score = 45.8 bits (107), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 17 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 47
>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
Length = 340
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
Length = 256
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 101 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 159
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 160 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 190
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 191 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 240
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 241 HCGIESEVVAGIPRTD 256
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 68 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 98
>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
Length = 261
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 100 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 158
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 159 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 189
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 190 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 239
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 240 HCGIESEVVAGIPRTD 255
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 67 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 97
>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
Length = 339
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175
>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
Length = 339
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175
>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
Length = 195
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 34 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 92
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 93 KGIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 123
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 124 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 173
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 174 HCGIESEVVAGIPRTD 189
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 1 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 31
>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
Length = 335
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 98/193 (50%), Positives = 120/193 (62%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D +FG SYSV++NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 318 HCGIESEIVAGMP 330
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPSGAWNFWTKKGLVSGGLYNS 175
>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
Length = 339
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 22/31 (70%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW + + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFLTRKGLVSGGLYDS 175
>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
Length = 330
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/196 (48%), Positives = 122/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS+NE
Sbjct: 169 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKSCEPGYSPTYKQDKHYGYDSYSVSNNE 227
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 228 RDIMAEIYKNGPVEGAFSVYADFLLYKSGVY----------------------------- 258
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 259 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 308
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 309 HCGIESEVVAGIPRTD 324
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 136 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYDS 166
>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
Length = 339
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175
>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
Length = 332
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 119/194 (61%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY IAPCEHHVNG+RP C G TP+C ++C+ Y Y +D ++G SYSV +E
Sbjct: 176 GCRPYSIAPCEHHVNGSRPPCTGEGGDTPQCTKKCEAGYTPGYTQDKHYGKLSYSVDDSE 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I EIY++GPVEGAFTV++D +LYK+G +
Sbjct: 236 KEIQLEIYKNGPVEGAFTVYEDFLLYKTGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G A+GGHAI++LGWGE+ + YWL ANSWNTDWGDNG FKILRG D
Sbjct: 267 --------QHVTGSAVGGHAIKVLGWGEENGT--PYWLCANSWNTDWGDNGFFKILRGSD 316
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+PK
Sbjct: 317 HCGIESEIVAGIPK 330
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAAWEFWTTDGLVSGGLYDS 173
>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 246
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/194 (50%), Positives = 119/194 (61%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RPSC G TP+CV C+ Y YK+D ++G SYSVSS+E
Sbjct: 92 GCRPYTIPPCEHHVNGSRPSCSGEGGETPQCVYRCEAGYTPSYKQDKHYGKTSYSVSSDE 151
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EIY++GPVEGAFTV++D +LYK+G +
Sbjct: 152 DDIKHEIYKNGPVEGAFTVYEDFVLYKTGVY----------------------------- 182
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G ALGGHAI+ILGWGE+ + YWL ANSWNTDWG+NG FKILRG +
Sbjct: 183 --------QHVTGSALGGHAIKILGWGEE--NGIPYWLCANSWNTDWGNNGFFKILRGSN 232
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+P
Sbjct: 233 HCGIESEIVAGIPN 246
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 60 CGMGCNGGYPSAAWDFWTKDGLVSGGLYDS 89
>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/193 (49%), Positives = 119/193 (61%), Gaps = 39/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNGTRP C +G TP+C +C+ Y YK+D +FG SYS+ S E
Sbjct: 176 GCRPYSIPPCEHHVNGTRPPCTGEEGDTPQCSNQCETGYTPGYKQDKHFGKNSYSLPSEE 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM E+ ++GPVEGAFTV++D +LYKSG +
Sbjct: 236 QQIMAELLKNGPVEGAFTVYEDFLLYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG A+GGHAI++LGWGE+ + YWL ANSWNTDWG+NG FKILRGKD
Sbjct: 267 --------QHVSGSAVGGHAIKVLGWGEEGGT--PYWLAANSWNTDWGENGFFKILRGKD 316
Query: 294 ECGIESSITAGVP 306
CGIES + AGVP
Sbjct: 317 HCGIESEMVAGVP 329
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W G+V+GG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWTTEGLVTGGLYDS 173
>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
Length = 209
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 48 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 106
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 107 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 137
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 138 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 187
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 188 HCGIESEVVAGIPRTD 203
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 15 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 45
>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
Length = 339
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/196 (48%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 RDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
E64c Complex
gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca073 Complex
gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca042 Complex
gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca059 Complex
gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca074me Complex
gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca075 Complex
gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca076 Complex
gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca077 Complex
gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca078 Complex
Length = 256
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 98/193 (50%), Positives = 120/193 (62%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D +FG SYSV++NE
Sbjct: 99 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 157
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 158 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 188
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 189 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 238
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 239 HCGIESEIVAGMP 251
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/30 (80%), Positives = 28/30 (93%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
LP +FD+R +WPNCPTI+EIRDQGSCGSCW
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCW 30
>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
Length = 339
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 120/196 (61%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSV E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKSCEPGYSSSYKEDKHYGYSSYSVPGIE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGTENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333
>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
Extends Along The Whole Active Site Cleft
Length = 205
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 98/193 (50%), Positives = 120/193 (62%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D +FG SYSV++NE
Sbjct: 51 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNE 109
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 110 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 140
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 141 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 190
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 191 HCGIESEIVAGMP 203
>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
Length = 335
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 99/193 (51%), Positives = 118/193 (61%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK D +FG SYSVSSNE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL+ NSWNTDWGD G FKILRG+D
Sbjct: 268 --------QHVSGEMMGGHAIRILGWGVENDT--PYWLVGNSWNTDWGDKGFFKILRGQD 317
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 318 HCGIESEIVAGMP 330
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175
>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
Length = 330
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 97/193 (50%), Positives = 118/193 (61%), Gaps = 39/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TP C +C+ Y YK+D +FG SYSV SN+
Sbjct: 176 GCRPYTIEPCEHHVNGSRPPCSGEGGDTPNCDMKCEPGYSPSYKQDKHFGKTSYSVPSNQ 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SIM E++++GPVEGAFTV++D +LYKSG +
Sbjct: 236 NSIMAELFKNGPVEGAFTVYEDFLLYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG +GGHAI+ILGWGE+ + YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 267 --------QHMSGSPVGGHAIKILGWGEE--NGVPYWLAANSWNTDWGDNGYFKILRGED 316
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 317 HCGIESEIVAGIP 329
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W G+V+GG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWATEGLVTGGLYNS 173
>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
Length = 335
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 99/193 (51%), Positives = 118/193 (61%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK D +FG SYSVSSNE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL+ NSWNTDWGD G FKILRG+D
Sbjct: 268 --------QHVSGEMMGGHAIRILGWGVENDT--PYWLVGNSWNTDWGDKGFFKILRGQD 317
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 318 HCGIESEIVAGMP 330
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175
>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
Length = 340
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/196 (48%), Positives = 118/196 (60%), Gaps = 39/196 (19%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TP+C R C+ Y YK+D ++G SY V +E
Sbjct: 178 GCRPYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF V++D ++YKSG +
Sbjct: 238 KEIMAEIYKNGPVEGAFIVYEDFLMYKSGVY----------------------------- 268
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 269 --------QHVSGEQVGGHAIRILGWGVENGT--PYWLAANSWNTDWGDNGFFKILRGED 318
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AGVP+ +
Sbjct: 319 HCGIESEIVAGVPRTE 334
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AWRYW + G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWRYWTERGLVSGGLYDS 175
>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
Length = 340
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/211 (47%), Positives = 122/211 (57%), Gaps = 39/211 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + + G GS GCRPY I PCEHHVNGTRP C G TPKC + C+ Y YK+
Sbjct: 163 TRKGLVSGGLYGSHVGCRPYSIPPCEHHVNGTRPKCTGEGGDTPKCSKTCEPGYSPSYKE 222
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +G SYSV S EK IM EIY++GPVE AF+VF D + YKSG +
Sbjct: 223 DKYYGYSSYSVPSTEKEIMAEIYKNGPVEAAFSVFSDFLTYKSGVY-------------- 268
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ +G+ LGGHAIRILGWG++ + YWL+ NSWN
Sbjct: 269 -----------------------KHVAGEVLGGHAIRILGWGKE--NGVPYWLVGNSWNV 303
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKLD 309
DWGDNG FKILRG+D CGIES + AG+P+ D
Sbjct: 304 DWGDNGFFKILRGEDHCGIESEVVAGIPRTD 334
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 21/31 (67%), Positives = 25/31 (80%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
LCG GCNGG+P AW+YW + G+VSGG YGS
Sbjct: 145 LCGEGCNGGYPTEAWKYWTRKGLVSGGLYGS 175
>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
Length = 330
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 119/194 (61%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY IAPCEHHVNG+RPSC G TP+C+ +C+ Y YK+D +FG SY+V S+E
Sbjct: 176 GCRPYTIAPCEHHVNGSRPSCTGEGGDTPQCITKCEAGYTPSYKEDKHFGKTSYTVLSDE 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EI+++GPVEGAF V++D +LYKSG +
Sbjct: 236 EQIQSEIFKNGPVEGAFIVYEDFVLYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG A+GGHAI+ILGWG ++ YWL ANSWNTDWGDNG FK LRG D
Sbjct: 267 --------QHVSGSAVGGHAIKILGWGVEDGV--PYWLCANSWNTDWGDNGFFKFLRGSD 316
Query: 294 ECGIESSITAGVPK 307
CGIES + AG+PK
Sbjct: 317 HCGIESEVVAGIPK 330
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWTKEGLVSGGLYDS 173
>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/196 (48%), Positives = 122/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GP EGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPAEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
Length = 339
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 120/196 (61%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSDNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVE AFTV+ D +LYKSG +
Sbjct: 237 KEIMAEIYKNGPVEAAFTVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHA+RILGWG ++ + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAVRILGWGVEDGT--PYWLVGNSWNTDWGDNGFFKILRGRD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P D
Sbjct: 318 HCGIESEIVAGIPCTD 333
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAEAWNFWTKQGLVSGGLYDS 175
>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
Length = 340
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 95/196 (48%), Positives = 119/196 (60%), Gaps = 39/196 (19%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TPKC + C+ Y YK+D ++G SY V S+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCKGEGGETPKCSKTCEPGYSPSYKEDKHYGYSSYGVPSSE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM EIY++GPVEGAF+V+ D ++YKSG +
Sbjct: 238 QEIMAEIYKNGPVEGAFSVYTDFLVYKSGVY----------------------------- 268
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 269 --------QHVTGEEVGGHAIRILGWGVENGT--PYWLAANSWNTDWGDNGFFKILRGQD 318
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P+ D
Sbjct: 319 HCGIESEIVAGIPRTD 334
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAGAWNFWTKKGLVSGGLYDS 175
>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
Length = 351
Score = 190 bits (483), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 95/196 (48%), Positives = 122/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 190 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKSCEPGYTPTYKQDKHYGYNSYSVSNSE 248
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 249 RDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 279
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 280 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 329
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 330 HCGIESEVVAGIPRTD 345
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 157 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYDS 187
>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
Length = 330
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 118/194 (60%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TP+C+ +C+ Y Y++D ++G SYSV S+E
Sbjct: 176 GCRPYTIPPCEHHVNGSRPPCTGEGGDTPQCLSQCEAGYTPSYREDKHYGKTSYSVLSDE 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EIY++GPVEGAFTV++D +LYKSG +
Sbjct: 236 AEIQYEIYKNGPVEGAFTVYEDFVLYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG A+GGHAI++LGWGE+ + YWL ANSWNTDWGDNG FK LRG D
Sbjct: 267 --------QHVSGSAVGGHAIKVLGWGEE--NGVPYWLCANSWNTDWGDNGFFKFLRGSD 316
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+PK
Sbjct: 317 HCGIESEIVAGIPK 330
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWTKEGLVSGGLYDS 173
>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
Length = 330
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 93/194 (47%), Positives = 118/194 (60%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHH NGTRP C G TP+CV++C++ Y YK+D ++G SY + +E
Sbjct: 168 GCRPYSIPPCEHHTNGTRPPCSGEGGETPECVKKCEDGYTPAYKQDKHYGVTSYGIPRSE 227
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF V+ D ++YKSG +
Sbjct: 228 KEIMAEIYKNGPVEGAFVVYSDFLMYKSGVY----------------------------- 258
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG D + YWL ANSWNTDWG++G F+ILRG+D
Sbjct: 259 --------QHVSGEEVGGHAIRILGWGVDNGT--PYWLAANSWNTDWGEDGFFRILRGQD 308
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+PK
Sbjct: 309 HCGIESEIVAGIPK 322
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW+YW + G+VSGG Y S
Sbjct: 136 CGMGCNGGYPSGAWKYWTEKGLVSGGLYDS 165
>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
Length = 330
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 97/193 (50%), Positives = 117/193 (60%), Gaps = 39/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TP C +C+ Y YK+D +FG SYSV SN+
Sbjct: 176 GCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMKCEPGYSPLYKEDKHFGKTSYSVPSNQ 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
IM E++++GPVE AFTV++D +LYKSG +
Sbjct: 236 NGIMAELFKNGPVEAAFTVYEDFLLYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG ALGGHAI+ILGWGE+ + YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 267 --------QHMSGSALGGHAIKILGWGEE--NGVPYWLAANSWNTDWGDNGYFKILRGED 316
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 317 HCGIESEIVAGIP 329
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W G+V+GG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWTTDGLVTGGLYNS 173
>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
Length = 340
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 94/196 (47%), Positives = 117/196 (59%), Gaps = 39/196 (19%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TP+C R C+ Y YK+D ++G SY V +E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF V++D ++YKSG +
Sbjct: 238 KEIMAEIYKNGPVEGAFIVYEDFLMYKSGVY----------------------------- 268
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIR+LGWG D + YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 269 --------QHVTGEQVGGHAIRLLGWGVDNGT--PYWLAANSWNTDWGDNGFFKILRGED 318
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P +
Sbjct: 319 HCGIESEIVAGIPSTE 334
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AWRYW + G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWRYWTEKGLVSGGLYDS 175
>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
Length = 339
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 122/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP+C +G TPKC + C+ Y YK+D +FG SYS+ +NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPAC-TGEGDTPKCSKTCEPGYSPTYKEDKHFGYTSYSLPTNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
IM EIY++GPVEGAF+V+ D +LYKSG
Sbjct: 237 WEIMAEIYKNGPVEGAFSVYSDFLLYKSG------------------------------- 265
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
V+ L +G +GGHAIRILGWGE+ + YWL+ANSWNTDWGD G F+ILRG+D
Sbjct: 266 --VYQHL----TGDMMGGHAIRILGWGEE--NGVPYWLVANSWNTDWGDGGFFRILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 47/71 (66%), Gaps = 5/71 (7%)
Query: 44 NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
++ N+ ++LK G L +LP+ + +++ D +LP +FD+R +W +CPTI+EI
Sbjct: 45 HNFRNVDMSYLKRLCGSF----LGGPKLPQRVKFAK-DMNLPKSFDAREQWSHCPTIKEI 99
Query: 104 RDQGSCGSCWG 114
RDQGSCGSCW
Sbjct: 100 RDQGSCGSCWA 110
>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
Length = 340
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 94/193 (48%), Positives = 117/193 (60%), Gaps = 39/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TPKC + C+ Y YK+D +FG +YSV S+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCSGEGGDTPKCSKICEPGYSPSYKEDKHFGCDTYSVPSDE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVE AF+V+ D +LYKSG +
Sbjct: 238 KEIMVEIYKNGPVEAAFSVYSDFLLYKSGVY----------------------------- 268
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHA+RILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 269 --------QHVTGEMVGGHAVRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGRD 318
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 319 HCGIESEIVAGIP 331
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGEGCNGGFPSGAWNFWKKQGLVSGGLYDS 175
>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
Length = 335
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 97/193 (50%), Positives = 117/193 (60%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D +FG SYS+S NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTV+ D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVYSDFLQYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGDLMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 318 HCGIESEIVAGIP 330
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175
>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
Length = 359
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 94/193 (48%), Positives = 117/193 (60%), Gaps = 39/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TPKC R C+ Y YK+D +FG SYSV S+E
Sbjct: 201 GCRPYSIPPCEHHVNGSRPPCTGEGGSTPKCSRICEAGYTPSYKEDKHFGCSSYSVPSSE 260
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
IM EIY++GPVE AF+V+ D +LYKSG +
Sbjct: 261 TEIMAEIYKNGPVEAAFSVYSDFLLYKSGVY----------------------------- 291
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHA+RILGWG ++ + YWL+ NSWNTDWGD+G FKILRG+D
Sbjct: 292 --------QHVTGEMMGGHAVRILGWGVEDGT--PYWLVGNSWNTDWGDSGFFKILRGQD 341
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 342 HCGIESEIVAGLP 354
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 169 CGEGCNGGFPSGAWNFWTKKGLVSGGLYDS 198
>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
Length = 335
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 97/193 (50%), Positives = 117/193 (60%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D +FG SYS+S NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTV+ D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVYSDFLQYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGDLMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 318 HCGIESEIVAGIP 330
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175
>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 115/194 (59%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNGTRP C G TP+C+ +C+ Y YKKD ++G SYSV +NE
Sbjct: 176 GCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCINQCESGYTPSYKKDKHYGKTSYSVEANE 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EIY++GPVEGAF V++D +YKSG +
Sbjct: 236 NQIQTEIYKNGPVEGAFMVYEDFPMYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG +GGHAI+ILGWG ++ YWL ANSWNTDWGDNG FKILRG D
Sbjct: 267 --------QHVSGSLIGGHAIKILGWGVEDGV--PYWLCANSWNTDWGDNGYFKILRGSD 316
Query: 294 ECGIESSITAGVPK 307
CGIES + AG+PK
Sbjct: 317 HCGIESEVVAGIPK 330
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+V+GG Y S
Sbjct: 144 CGMGCNGGYPTAAWDFWTKEGLVTGGLYDS 173
>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
Length = 374
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 100/201 (49%), Positives = 122/201 (60%), Gaps = 40/201 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G G+ GCRPY IAPCEHHVNGTR C + +G TPKC R C++ Y V Y+ D NFG +
Sbjct: 212 GQYGTHQGCRPYSIAPCEHHVNGTRLPC-SGEGPTPKCERTCEKGYKVKYEDDKNFGYTA 270
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSV ++EK IM EI +GPVEGAFTV+ D YKSG +
Sbjct: 271 YSVDNDEKQIMTEIMTNGPVEGAFTVYADFPTYKSGVY---------------------- 308
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ SG LGGHAIR+LGWG ++ + YWL+ANSWN+DWGDNG F
Sbjct: 309 ---------------QHVSGGELGGHAIRVLGWGVEDGT--PYWLVANSWNSDWGDNGFF 351
Query: 287 KILRGKDECGIESSITAGVPK 307
KILRG++ECGIE I AG+PK
Sbjct: 352 KILRGQNECGIEGEIVAGLPK 372
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGGFP AW Y+ +G+VSGG YG+ Q
Sbjct: 187 CGMGCNGGFPPAAWEYFRDTGLVSGGQYGTHQ 218
>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
Length = 266
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 96/196 (48%), Positives = 121/196 (61%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCE HVNG RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 105 GCRPYSIPPCEAHVNGARPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 163
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 164 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 194
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 195 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 244
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 245 HCGIESEVVAGIPRTD 260
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 72 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 102
>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
Length = 182
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 101/217 (46%), Positives = 127/217 (58%), Gaps = 39/217 (17%)
Query: 90 SRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQ 149
S+ + N + I G GS GC PYEIAPCEHHVNGTR C G TP CV++C+
Sbjct: 4 SQEQHGNYCKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPTCVKKCE 62
Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
E Y VPY +DL+ G +YS+ ++ I +EIY +GPVEGAFTV++D I Y++G +
Sbjct: 63 EGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY----- 117
Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKY 269
+ +GKALGGHAIRILGWG + + Y
Sbjct: 118 --------------------------------KHVAGKALGGHAIRILGWGV-QNGEIPY 144
Query: 270 WLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
WL+ANSWNTDWG +G FKILRG DECGIE I AG+P
Sbjct: 145 WLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 181
>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
Length = 342
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 94/196 (47%), Positives = 118/196 (60%), Gaps = 39/196 (19%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP+C G TPKC ++C+ Y YK D ++G +Y+V S+E
Sbjct: 180 GCRPYSIPPCEHHVNGSRPACTGEGGDTPKCNKKCEAGYSPDYKDDKHYGTTAYNVPSSE 239
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF V+ D + YKSG +
Sbjct: 240 KEIMAEIYKNGPVEGAFIVYADFLQYKSGVY----------------------------- 270
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G LGGHAIR+LGWG ++ YWL ANSWNTDWGDNG FKILRGKD
Sbjct: 271 --------QHVTGDMLGGHAIRVLGWGVEDGV--PYWLAANSWNTDWGDNGFFKILRGKD 320
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ +
Sbjct: 321 HCGIESEMVAGIPRTE 336
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 19/75 (25%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY-------------------GSKQAEKNSLSNI 49
CG GCNGGFP AW+YW+K G+VSGG Y GS+ A +
Sbjct: 148 CGEGCNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPACTGEGGDT 207
Query: 50 PRAHLKSWMGVHPDY 64
P+ + K G PDY
Sbjct: 208 PKCNKKCEAGYSPDY 222
>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
Length = 330
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 93/194 (47%), Positives = 115/194 (59%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TPKCV C+ Y Y KD ++G SYSV ++
Sbjct: 176 GCRPYTIPPCEHHVNGSRPHCSGEGGDTPKCVHSCEAGYSPTYTKDKHYGKSSYSVEASV 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EI ++GPVEGAF V++D ++YKSG +
Sbjct: 236 EQIQAEISQNGPVEGAFIVYEDFVMYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G ALGGHAI++LGWGE++ YWL ANSWNTDWG+NG FKILRG D
Sbjct: 267 --------QHTTGSALGGHAIKVLGWGEEDGV--PYWLCANSWNTDWGENGFFKILRGSD 316
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+PK
Sbjct: 317 HCGIESEIVAGIPK 330
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWTKEGLVSGGLYNS 173
>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 99/206 (48%), Positives = 123/206 (59%), Gaps = 39/206 (18%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
+ I G GS GC PYEIAPCEHHVNGTR C G TP CV++C+E Y VPY +DL
Sbjct: 175 KGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPTCVKKCEEGYKVPYAQDL 233
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+ G +YS+ ++ I +EIY +GPVEGAFTV++D I Y++G +
Sbjct: 234 HHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 277
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
+ +GKALGGHAIRILGWG + + YWL+ANSWNTDW
Sbjct: 278 ---------------------KHVAGKALGGHAIRILGWGV-QNGEIPYWLVANSWNTDW 315
Query: 281 GDNGLFKILRGKDECGIESSITAGVP 306
G +G FKILRG DECGIE I AG+P
Sbjct: 316 GSDGFFKILRGSDECGIEGQINAGLP 341
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/32 (75%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGGFPG AW YW GIVSGG YGS
Sbjct: 156 CGFGCNGGFPGAAWNYWKTKGIVSGGPYGSNM 187
>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
Length = 340
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 124/337 (36%), Positives = 154/337 (45%), Gaps = 110/337 (32%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEV--DEDLPANFDSRTKWPNC 97
+A +N N P L MGVHPD NL +P L S++ ++ +P FD+R +WP+C
Sbjct: 46 KAGRNFGKNFPMGALTQMMGVHPDSNL---YMPPLKNVSQMYSNQAIPEAFDAREQWPDC 102
Query: 98 PTIREIRDQGSCGSCWGCRPYEIA--------------------------PCEHHVNGTR 131
PTI+EIRDQGSCGSCW E C NG
Sbjct: 103 PTIQEIRDQGSCGSCWAFGAVEAMSDRICIHSKGEVNAHLSAENLVSCCYTCGFGCNGGF 162
Query: 132 PSC----------------DASKGHTPKCVRECQEN------------------------ 151
P ++S+G P + C+ +
Sbjct: 163 PGAAWSHWVKKGIVTGGNFNSSQGCQPYIIPACEHHTTGDRPPCSEGGGTPKCLKTCEDG 222
Query: 152 YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
Y V Y +DL++GA SYSV + I EI +GPVEGA TV++D YKSG +
Sbjct: 223 YTVDYTQDLHYGASSYSVHKRMEDIQLEIMNNGPVEGALTVYEDFPTYKSGVY------- 275
Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
+ GKALGGHAIRILGWG +E YWL
Sbjct: 276 ------------------------------QHVHGKALGGHAIRILGWGVEEGV--PYWL 303
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
IANSWNTDWGDNG K+LRGKD CGIES ITAG+PKL
Sbjct: 304 IANSWNTDWGDNGYIKLLRGKDHCGIESQITAGLPKL 340
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 26/32 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGGFPG AW +WVK GIV+GG + S Q
Sbjct: 154 CGFGCNGGFPGAAWSHWVKKGIVTGGNFNSSQ 185
>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
Length = 329
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 98/199 (49%), Positives = 122/199 (61%), Gaps = 40/199 (20%)
Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV 169
GS GCRPY I PCEHHVNGTRP C +G TPKC +C + Y Y+KD FG K+YSV
Sbjct: 171 GSNKGCRPYSIPPCEHHVNGTRPPCQG-EGDTPKCQTKCIDGYTPAYEKDKYFGKKTYSV 229
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
S ++ IM E+Y++GPVE AF+V++D +LYKSG
Sbjct: 230 PSKQEQIMTELYKNGPVEAAFSVYEDFLLYKSG--------------------------- 262
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
V+ L +G LGGHAI+ILGWG++ + YWL ANSWNTDWG+ G FKIL
Sbjct: 263 ------VYQHL----TGDMLGGHAIKILGWGKENNT--PYWLAANSWNTDWGNQGFFKIL 310
Query: 290 RGKDECGIESSITAGVPKL 308
RG DECGIES + AG+P+L
Sbjct: 311 RGGDECGIESEVVAGIPQL 329
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG+P AW YW KSG+V+GG YGS +
Sbjct: 143 CGMGCFGGYPSAAWEYWAKSGLVTGGLYGSNK 174
>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
Length = 329
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 107/210 (50%), Positives = 121/210 (57%), Gaps = 48/210 (22%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + I G GS GCRPYEIAPCEHHVNGTRP C S G TP C +CQ +Y V Y K
Sbjct: 168 TRKGIVSGGPYGSTQGCRPYEIAPCEHHVNGTRPPC--SHGSTPSCQHKCQASYSVEYAK 225
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D NFG+KSYSV N I +EI +GPVEGAFTV++DLILYKSG +
Sbjct: 226 DKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYEDLILYKSGVY-------------- 271
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
++ GK LGGHAIRILGWG +SK YWLI NSWNT
Sbjct: 272 -----------------------QHEHGKELGGHAIRILGWGVWGESKVPYWLIGNSWNT 308
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGDN D CGIESSI+AG+ L
Sbjct: 309 DWGDN---------DHCGIESSISAGLSHL 329
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 3/77 (3%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
Q +N ++ +++ MGVHPD + LP R+ Y++ D+P FD+R WPN
Sbjct: 39 QVGRNFKESVSEEYIRGLMGVHPDAHKFALPEKRIVLGDLYADDGIDIPEEFDARKAWPN 98
Query: 97 CPTIREIRDQGSCGSCW 113
CPTI EIRDQGSCGSCW
Sbjct: 99 CPTIGEIRDQGSCGSCW 115
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 27/34 (79%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
+CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 149 HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQ 182
>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 328
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 123/208 (59%), Gaps = 40/208 (19%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + + G CGS GCRPY IAPCEHHVNGTRP C ++ TPKC ++C + Y Y K
Sbjct: 159 TKKGLVTGGLCGSEVGCRPYSIAPCEHHVNGTRPPCQGTQ-ETPKCEKKCIDGYLTSYLK 217
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +FG +SYS+ S ++ IM E+Y++GPVE AFTV+ D +LYK+G +
Sbjct: 218 DKHFGKRSYSLPSQQEQIMTELYKNGPVEAAFTVYADFLLYKTGVY-------------- 263
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ +G+ LGGHAI+ILGWGE+ S YWL ANSWN
Sbjct: 264 -----------------------QHVTGEVLGGHAIKILGWGEE--SGTPYWLAANSWNG 298
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
DWGD G FKI RG DECGIES + AG P
Sbjct: 299 DWGDKGFFKIKRGNDECGIESEMVAGTP 326
Score = 45.4 bits (106), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC+GG+P AW +W K G+V+GG GS+
Sbjct: 142 CGMGCSGGYPSSAWEFWTKKGLVTGGLCGSE 172
>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
Length = 331
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 100/207 (48%), Positives = 123/207 (59%), Gaps = 40/207 (19%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G+ S GC+PYEIAPCEHHV+G RP C A G TPKC + C+ NY V Y+ DL+
Sbjct: 165 IVSGGAFNSTQGCQPYEIAPCEHHVSGPRPKC-AEGGSTPKCHKNCESNYVVDYESDLHH 223
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G+K YSV +E I +I +GPVEGAFTV+ D + YKSG +
Sbjct: 224 GSKHYSVDKDETQIKYDIMTNGPVEGAFTVYVDFLHYKSGVY------------------ 265
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
+ G LGGHAIR+LGWGE++ + YWL ANSWNTDWGD
Sbjct: 266 -------------------QHTHGLPLGGHAIRVLGWGEEDGT--PYWLCANSWNTDWGD 304
Query: 283 NGLFKILRGKDECGIESSITAGVPKLD 309
NG FKILRG D CGIES I+AG+PK++
Sbjct: 305 NGYFKILRGSDHCGIESEISAGLPKVE 331
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 26/34 (76%), Positives = 29/34 (85%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
LCGFGCNGGFPG A++YWV SGIVSGGA+ S Q
Sbjct: 142 HLCGFGCNGGFPGAAFQYWVHSGIVSGGAFNSTQ 175
>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
EGFP fusion protein [synthetic construct]
Length = 578
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 118/194 (60%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS +E
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL+ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNVDWGDNGFFKILRGEN 317
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+P+
Sbjct: 318 HCGIESEIVAGIPR 331
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/75 (49%), Positives = 49/75 (65%), Gaps = 6/75 (8%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
QA +N N+ ++LK G L +LPE +G+SE D +LP +FD+R +W NCPT
Sbjct: 42 QAGRN-FYNVDISYLKKLCGT----VLGGPKLPERVGFSE-DINLPESFDAREQWSNCPT 95
Query: 100 IREIRDQGSCGSCWG 114
I +IRDQGSCGSCW
Sbjct: 96 IAQIRDQGSCGSCWA 110
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 175
>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
Length = 330
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 92/194 (47%), Positives = 116/194 (59%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I+PCEHHVNG+RP C G TP+C+ C+ Y YK+D ++G SYSV +
Sbjct: 176 GCRPYTISPCEHHVNGSRPPCTGEGGDTPECISRCEAGYSPSYKQDKHYGKSSYSVEGSV 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EI ++GPVEGAFTV++D ++YKSG +
Sbjct: 236 EQIQAEISKNGPVEGAFTVYEDFVMYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG LGGHAI++LGWGE++ YWL ANSWNTDWGDNG FKILRG +
Sbjct: 267 --------QHVSGSVLGGHAIKVLGWGEEDGI--PYWLCANSWNTDWGDNGFFKILRGSN 316
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+PK
Sbjct: 317 HCGIESEIVAGIPK 330
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSSAWDFWTKEGLVSGGLYNS 173
>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
Length = 334
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/203 (49%), Positives = 120/203 (59%), Gaps = 40/203 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
GS S GCRPYEI PCEHHV G R C TPKCV+EC+ Y VPYK+D ++G
Sbjct: 172 GSYNSSQGCRPYEIPPCEHHVPGNRLPCSGDT-KTPKCVKECESGYKVPYKQDKHYGKHV 230
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSV E I E+Y++GPVEGAFTV+ DL+ YKSG +
Sbjct: 231 YSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGVY---------------------- 268
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ +G ALGGHAI+I+GWG + + KYWLIANSWN+DWGDNG F
Sbjct: 269 ---------------KHVTGDALGGHAIKIMGWGVE--NGNKYWLIANSWNSDWGDNGFF 311
Query: 287 KILRGKDECGIESSITAGVPKLD 309
KILRG+D CGIESSI AG P +
Sbjct: 312 KILRGEDHCGIESSIVAGEPLFN 334
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 36/65 (55%), Gaps = 12/65 (18%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLP 67
+CG GCNGG P +AW YW G+VSGG+Y S Q + IP ++++P
Sbjct: 146 ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRP--YEIPPC----------EHHVP 193
Query: 68 ANRLP 72
NRLP
Sbjct: 194 GNRLP 198
>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
Length = 340
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 93/196 (47%), Positives = 116/196 (59%), Gaps = 39/196 (19%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y I PCEHHVNG+RP C G TP+C R C+ Y YK+D ++G SY V +E
Sbjct: 178 GCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF V++D ++YKSG +
Sbjct: 238 KEIMAEIYKNGPVEGAFIVYEDFLMYKSGVY----------------------------- 268
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL ANSWNTDWG G FKILRG+D
Sbjct: 269 --------QHVSGEQVGGHAIRILGWGVENGT--PYWLAANSWNTDWGITGFFKILRGED 318
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AGVP+++
Sbjct: 319 HCGIESEIVAGVPRME 334
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AWRYW + G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWRYWTERGLVSGGLYDS 175
>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
Full=RSG-2; Contains: RecName: Full=Cathepsin B light
chain; Contains: RecName: Full=Cathepsin B heavy chain;
Flags: Precursor
gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
Length = 339
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 118/194 (60%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS +E
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL+ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNVDWGDNGFFKILRGEN 317
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+P+
Sbjct: 318 HCGIESEIVAGIPR 331
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 175
>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
Length = 339
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 118/194 (60%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS +E
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL+ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNVDWGDNGFFKILRGEN 317
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+P+
Sbjct: 318 HCGIESEIVAGIPR 331
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 175
>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
Length = 271
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 118/194 (60%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS +E
Sbjct: 110 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 168
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 169 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 199
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL+ANSWN DWGDNG FKILRG++
Sbjct: 200 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNVDWGDNGFFKILRGEN 249
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+P+
Sbjct: 250 HCGIESEIVAGIPR 263
Score = 45.4 bits (106), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 78 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 107
>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
Length = 322
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 117/194 (60%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG RP C +G TPKC + C+ Y YK+D ++G SYSVS +E
Sbjct: 161 GCLPYTIPPCEHHVNGARPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 219
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 220 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 250
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL+ANSWN DWGDNG FKILRG++
Sbjct: 251 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNADWGDNGFFKILRGEN 300
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+P+
Sbjct: 301 HCGIESEIVAGIPR 314
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 129 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 158
>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
Length = 260
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 117/194 (60%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG RP C +G TPKC + C+ Y YK+D ++G SYSVS +E
Sbjct: 105 GCLPYTIPPCEHHVNGARPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 163
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 164 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 194
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL+ANSWN DWGDNG FKILRG++
Sbjct: 195 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNADWGDNGFFKILRGEN 244
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+P+
Sbjct: 245 HCGIESEIVAGIPR 258
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 73 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 102
>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
Length = 254
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 117/194 (60%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG RP C +G TPKC + C+ Y YK+D ++G SYSVS +E
Sbjct: 99 GCLPYTIPPCEHHVNGARPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 157
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 158 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 188
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL+ANSWN DWGDNG FKILRG++
Sbjct: 189 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNADWGDNGFFKILRGEN 238
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+P+
Sbjct: 239 HCGIESEIVAGIPR 252
Score = 45.4 bits (106), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 67 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 96
>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
Length = 339
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 92/193 (47%), Positives = 118/193 (61%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYTPSYKEDKHYGCNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVE AF+VF D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEAAFSVFSDFLQYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHA+RILGWG + + YWL+ NSWNTDWGD+G FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAVRILGWGVENDT--PYWLVGNSWNTDWGDHGFFKILRGRD 317
Query: 294 ECGIESSITAGVP 306
CGIES + AG+P
Sbjct: 318 HCGIESEVVAGIP 330
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAEAWNFWTKQGLVSGGLYDS 175
>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
Length = 330
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 92/194 (47%), Positives = 115/194 (59%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TP+CV +C+ Y Y+KD ++G SY V S E
Sbjct: 176 GCRPYTIEPCEHHVNGSRPPCTGEGGDTPECVTQCEAGYTPSYQKDKHYGKTSYGVPSEE 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EIY++GPVEGAF V++D YKSG +
Sbjct: 236 EQIQSEIYKNGPVEGAFIVYEDFPSYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G ALGGHAI+++GWGE+ + YWL ANSWNTDWGDNG FKILRG +
Sbjct: 267 --------QHVTGSALGGHAIKMIGWGEE--NGVPYWLCANSWNTDWGDNGFFKILRGSN 316
Query: 294 ECGIESSITAGVPK 307
CGIES + AG+PK
Sbjct: 317 HCGIESEVVAGIPK 330
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W + G+V+GG Y S
Sbjct: 144 CGMGCNGGYPANAWEFWTEQGLVTGGLYNS 173
>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/206 (47%), Positives = 123/206 (59%), Gaps = 39/206 (18%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
+ I G GS GC PYEIAPCEHHVNGTR C G TP CV++C++ Y VPY +DL
Sbjct: 173 KGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPACVKKCEDGYKVPYAQDL 231
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+ G +YS+ ++ I +EIY +GPVEGAFTV++D I Y++G +
Sbjct: 232 HRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 275
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
+ +GKALGGHAIRILGWG + + YWL+ANSWN+DW
Sbjct: 276 ---------------------KHVAGKALGGHAIRILGWGV-QNGEIPYWLVANSWNSDW 313
Query: 281 GDNGLFKILRGKDECGIESSITAGVP 306
G +G FKILRG DECGIE I AG+P
Sbjct: 314 GSDGFFKILRGSDECGIEGQINAGLP 339
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/34 (76%), Positives = 26/34 (76%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
R CGFGCNGGFPG AW YW GIVSGG YGSK
Sbjct: 152 RTCGFGCNGGFPGAAWHYWKTKGIVSGGPYGSKM 185
>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
Length = 343
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 123/206 (59%), Gaps = 41/206 (19%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I GS S GC+PY I PCEHHVNGTR C +G TP+CV+ C+E YDVPY KD +F
Sbjct: 179 IVSGGSYNSHQGCQPYAIEPCEHHVNGTRKPC--GEGDTPRCVKRCEEGYDVPYGKDRHF 236
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G +Y+V + K+I KE+ +GP E A TV+DD + Y++G +
Sbjct: 237 GKSAYAVPGSVKAIQKELLLNGPAEAALTVYDDFLHYRTGVY------------------ 278
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
+ SG ALGGHA+R+LGWG ++ + YWL+ANSWN DWGD
Sbjct: 279 -------------------QHVSGGALGGHAVRLLGWGVEDGT--PYWLLANSWNYDWGD 317
Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
NG F+ILRG+DECGIES I G+PK+
Sbjct: 318 NGYFRILRGQDECGIESDINGGLPKV 343
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/33 (72%), Positives = 26/33 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGCNGG PG AW YWV +GIVSGG+Y S Q
Sbjct: 158 CGFGCNGGEPGAAWDYWVSTGIVSGGSYNSHQG 190
>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
Length = 337
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/203 (46%), Positives = 118/203 (58%), Gaps = 53/203 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP+C +G TP C ++C+E Y YK D N+G+ SYSV S+E
Sbjct: 179 GCRPYSIPPCEHHVNGSRPACTGEEGDTPTCRKKCEEGYSTQYKDDKNYGSTSYSVPSSE 238
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM EIY++GPV EGA
Sbjct: 239 QEIMAEIYKNGPV--------------------------------------------EGA 254
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F+V++D + YKSG + LGGHAIRILGWG + + +YWL ANSWN DWGDNG F
Sbjct: 255 FSVYEDFLHYKSGVYQHVAGEMLGGHAIRILGWGVE--NGIRYWLAANSWNIDWGDNGFF 312
Query: 287 KILRGKDECGIESSITAGVPKLD 309
K LRGK+ CGIES I AG+P+ D
Sbjct: 313 KFLRGKNHCGIESEIIAGIPRTD 335
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 147 CGDGCNGGFPAGAWNFWTKKGLVSGGLYDS 176
>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
Length = 339
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/196 (48%), Positives = 118/196 (60%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G TP+C + C+ Y YK+D +FG SYSVS++
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDMMGGHAIRILGWGVE--NGVPYWLAANSWNLDWGDNGFFKILRGEN 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWSFWTKKGLVSGGVYNS 175
>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
Length = 332
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 121/341 (35%), Positives = 159/341 (46%), Gaps = 117/341 (34%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPD--YNLP---ANRLPELIGYSEVDEDLPANFDSRTKW 94
+A +N ++ + + MGVHPD Y++P A+++PE + D+P FDSR W
Sbjct: 38 EAGRNFNRHLSIRYFRRLMGVHPDSKYHMPGYEAHKIPE-------NFDMPKEFDSRAAW 90
Query: 95 PNCPTIREIRDQGSCGSCWGCRPYEIAP--------------------------CEHHVN 128
P CPTI EIRDQGSCGSCW E+ C N
Sbjct: 91 PMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSSENLVSCCHLCGFGCN 150
Query: 129 GTRP----------------SCDASKGHTPKCVRECQEN--------------------- 151
G P S ++++G P + C+ +
Sbjct: 151 GGFPGAAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVPGPRPKCSEGGGTPKCVKRC 210
Query: 152 ---YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPG 208
Y V Y+ DL+ G K+YS+ +E I EI ++GPVEGAFTV+ D + YKSG +
Sbjct: 211 ENGYTVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVY---- 266
Query: 209 NETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEK 268
++ G LGGHAIRILGWGE+ +
Sbjct: 267 ---------------------------------QHRHGLPLGGHAIRILGWGEENGT--P 291
Query: 269 YWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
YWL ANSWNTDWGDNGLFKILRG D CGIES I+AG+PKL+
Sbjct: 292 YWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLPKLN 332
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/35 (71%), Positives = 29/35 (82%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
LCGFGCNGGFPG A++YWV SGIVSGG++ S Q
Sbjct: 143 HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQG 177
>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 89/193 (46%), Positives = 115/193 (59%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY +APCEHHVNG+RP C TPKCV +C Y + Y KD +FG +SYS+ S +
Sbjct: 176 GCRPYTLAPCEHHVNGSRPPCQGEV-ETPKCVTQCNNGYSLSYPKDKHFGQRSYSIPSQQ 234
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM E+Y++GPVE AF+V+ D +LYK+G +
Sbjct: 235 EQIMTELYKNGPVEAAFSVYADFLLYKNGVY----------------------------- 265
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G LGGHA++ILGWGE+ + YWL+ANSWN+DWGD G FKI RG D
Sbjct: 266 --------QHVTGDMLGGHAVKILGWGEENGT--PYWLVANSWNSDWGDKGFFKIKRGND 315
Query: 294 ECGIESSITAGVP 306
ECGIES + AG P
Sbjct: 316 ECGIESEMVAGAP 328
Score = 44.7 bits (104), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GGFP AW +W G+V+GG + SK
Sbjct: 144 CGMGCFGGFPSAAWEFWTNKGLVTGGLFDSK 174
>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
Length = 339
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/196 (47%), Positives = 117/196 (59%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G TP+C + C+ Y YK+D +FG SYSVS++
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRIL WG + + YWL ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDMMGGHAIRILVWGVE--NGVPYWLAANSWNLDWGDNGFFKILRGEN 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWTKKGLVSGGVYDS 175
>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
Length = 339
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/196 (47%), Positives = 117/196 (59%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G T +C + C+ Y YK+D +FG SYSVS++
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTHRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDMMGGHAIRILGWGVE--NGVPYWLAANSWNLDWGDNGFFKILRGEN 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWSFWTKKGLVSGGVYNS 175
>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
Length = 339
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/196 (47%), Positives = 116/196 (59%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G TP+C + C+ Y YK+D +FG SYSVS++
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++ PVEGAFTVF D + YKSG +
Sbjct: 237 KEIMAEIYKNDPVEGAFTVFSDFLTYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + YWL ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDMMGGHAIRILGWG--VGNGVPYWLAANSWNLDWGDNGFFKILRGEN 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWSFWTKKGLVSGGVYNS 175
>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
Length = 332
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 99/210 (47%), Positives = 121/210 (57%), Gaps = 41/210 (19%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQ-ENYDVPYK 157
T + + G GS GC+PY+I PCEHHVNGTR C A G TPKC R C+ ENY VPY
Sbjct: 163 TSKGLVSGGLYGSHSGCQPYDIEPCEHHVNGTRQPC-AEGGRTPKCHRTCENENYSVPYD 221
Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
KDL+FG SYS+ S+ K I EI ++GPVE AF+V+ D + KSG +
Sbjct: 222 KDLSFGRSSYSIRSDPKQIQLEIMDNGPVEAAFSVYSDFMNDKSGVY------------- 268
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
+ G LGGHAIRILGWG ++ + YWL+ANSWN
Sbjct: 269 ------------------------RHVKGSLLGGHAIRILGWGVEKGT--PYWLVANSWN 302
Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPK 307
TDWGD G FKILRG D CGIE S+ G+P+
Sbjct: 303 TDWGDKGTFKILRGSDHCGIEGSVVTGLPR 332
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/30 (76%), Positives = 25/30 (83%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGCNGGFPG AW+YW G+VSGG YGS
Sbjct: 146 CGFGCNGGFPGAAWKYWTSKGLVSGGLYGS 175
>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
Length = 333
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 97/199 (48%), Positives = 116/199 (58%), Gaps = 41/199 (20%)
Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQ-ENYDVPYKKDLNFGAKSYS 168
GS GC+PY IAPCEHH NGTRP C G TPKC C+ E+Y +PY+KD +FG SYS
Sbjct: 175 GSHKGCQPYAIAPCEHHANGTRPPCSGG-GRTPKCHTFCENEDYSLPYEKDKSFGRSSYS 233
Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
V S+ K I EI +GPVE AF+V+ D + YKSG +
Sbjct: 234 VKSDPKQIQLEIMNNGPVEAAFSVYSDFLNYKSGVY------------------------ 269
Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
+ G LGGHAIRILGWG + + YWL+ANSWNTDWGDNG FKI
Sbjct: 270 -------------RHVKGSLLGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGTFKI 314
Query: 289 LRGKDECGIESSITAGVPK 307
L+G D CGIE SI AG+P+
Sbjct: 315 LKGSDHCGIEGSIVAGLPQ 333
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 26/32 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGGFPG AW +W K G+VSGG YGS +
Sbjct: 147 CGFGCNGGFPGAAWSFWKKKGLVSGGLYGSHK 178
>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
Length = 339
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 120/196 (61%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVSS+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKFCEPGYTPSYKEDKHYGCSSYSVSSSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVE AFTV+ D +LYKSG +
Sbjct: 237 KEIMAEIYKNGPVEAAFTVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHA+RILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAVRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGRD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P D
Sbjct: 318 HCGIESEIVAGIPCTD 333
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAEAWNFWTKQGLVSGGLYES 175
>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
Length = 344
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 93/195 (47%), Positives = 115/195 (58%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PYE PCEHHV G RPSC+ TPKC CQ Y++PY KD +G Y V SN+
Sbjct: 189 GCQPYEFPPCEHHVVGPRPSCEGDV-ETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQ 247
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++IMKE+ EHGPVE F V+ D YKSG +
Sbjct: 248 EAIMKEVKEHGPVEVDFEVYADFPNYKSGVY----------------------------- 278
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG LGGHA+R+LGWGE+ + YWLIANSWN+DWGDNG FKI+RG++
Sbjct: 279 --------QHVSGGLLGGHAVRLLGWGEE--NGVPYWLIANSWNSDWGDNGYFKIIRGRN 328
Query: 294 ECGIESSITAGVPKL 308
ECGIES + AG+PKL
Sbjct: 329 ECGIESDVNAGIPKL 343
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/62 (48%), Positives = 39/62 (62%), Gaps = 3/62 (4%)
Query: 54 LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
++ +G PD N LP L GY+ ++LP FD+R WP+CP+I EIRDQ SCGSC
Sbjct: 63 IRRMLGALPDPN--GGHLPTLCTGYTPSLDELPKEFDARKYWPHCPSISEIRDQSSCGSC 120
Query: 113 WG 114
W
Sbjct: 121 WA 122
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 19/28 (67%), Positives = 21/28 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG GCNGGFP AW YW +SGIV+G Y
Sbjct: 157 CGMGCNGGFPHSAWSYWKRSGIVTGDLY 184
>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 351
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 93/193 (48%), Positives = 110/193 (56%), Gaps = 40/193 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CR YEI PCEHHVNGTRP C+ TPKC CQE Y VPYKKD ++ K YSV SNE
Sbjct: 198 CRAYEIPPCEHHVNGTRPPCEGD-APTPKCKNVCQEEYKVPYKKDKHYAVKVYSVHSNED 256
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I E+ HGPVE F V+ D YKSG +
Sbjct: 257 AIKHELITHGPVEADFEVYADFPTYKSGVY------------------------------ 286
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ SG LGGHAI+++GWGE++ YWL ANSWNTDWG+ G FKILRGK+
Sbjct: 287 -------QHVSGALLGGHAIKLMGWGEEDGV--PYWLCANSWNTDWGEGGFFKILRGKNH 337
Query: 295 CGIESSITAGVPK 307
CGIES I AG+P+
Sbjct: 338 CGIESDIVAGIPQ 350
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 24/33 (72%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
R CG GCNGGFP AW +W G+VSGG YG+K
Sbjct: 163 RDCGMGCNGGFPSQAWNFWKHEGLVSGGLYGTK 195
>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
Length = 340
Score = 176 bits (447), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 95/196 (48%), Positives = 117/196 (59%), Gaps = 39/196 (19%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C G TPKC + C+ Y YK+D ++G SYSVSS+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSSSE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EI+++GPVE AFTV+ D + YKSG +
Sbjct: 238 KEIMAEIFKNGPVEAAFTVYSDFLQYKSGVY----------------------------- 268
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G +GGHA+RILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 269 --------QHVAGDMMGGHAVRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 318
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P D
Sbjct: 319 HCGIESEIVAGIPCTD 334
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 47/70 (67%), Gaps = 5/70 (7%)
Query: 44 NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
++ N+ +++K G L +LP+ + ++E D LP NFD+R +WPNCPTI+EI
Sbjct: 45 HNFHNVDLSYVKRLCGTF----LGGPKLPQRVWFAE-DVVLPENFDAREQWPNCPTIKEI 99
Query: 104 RDQGSCGSCW 113
RDQGSCGSCW
Sbjct: 100 RDQGSCGSCW 109
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAEAWNFWTKQGLVSGGLYDS 175
>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
Length = 247
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 94/194 (48%), Positives = 116/194 (59%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PYEI CEHH G RP C + TPKCV C++ Y+ Y+ D +FG KSYS+ S E
Sbjct: 93 GCQPYEIPACEHHTTGDRPPC-SDIVDTPKCVHLCEKGYNTSYRDDKHFGKKSYSIESLE 151
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EI+++GPVEGAF+V+ D I YKSG +
Sbjct: 152 QQIQTEIFKNGPVEGAFSVYSDFINYKSGVY----------------------------- 182
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG++LGGHAIR+LGWG + + YWL ANSWNTDWGD G FKILRG D
Sbjct: 183 --------QHHSGESLGGHAIRVLGWGYE--NDVPYWLCANSWNTDWGDKGYFKILRGSD 232
Query: 294 ECGIESSITAGVPK 307
ECGIESSI AG+PK
Sbjct: 233 ECGIESSIVAGIPK 246
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GC+GGFP AW +WV GI +GG + S
Sbjct: 61 CGMGCDGGFPPSAWEFWVDKGIATGGLWNS 90
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
Length = 337
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 92/194 (47%), Positives = 114/194 (58%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY +APCEHH G+ P+C + TPKCV C++ Y Y+ D +FG K YS+SSNE
Sbjct: 182 GCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKGYGKDYQHDKHFGKKVYSISSNE 240
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I EI+++GPVE FTV+ D + YKSG +
Sbjct: 241 KQIQTEIFKNGPVEADFTVYADFLSYKSGVY----------------------------- 271
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG LGGHAIRILGWG + + YWL+ANSWN DWGD+G FKILRGKD
Sbjct: 272 --------QHHSGDVLGGHAIRILGWGTENGT--PYWLVANSWNEDWGDHGYFKILRGKD 321
Query: 294 ECGIESSITAGVPK 307
ECGIE I AG+PK
Sbjct: 322 ECGIEDDINAGIPK 335
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GC+GG+P AW YW +SG+VS G YG+
Sbjct: 150 CGAGCDGGYPAAAWEYWKESGLVSDGLYGT 179
>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
Length = 338
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 95/196 (48%), Positives = 120/196 (61%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVSS+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYTPSYKEDKHYGCSSYSVSSSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVE AF+V+ D ++YKSG +
Sbjct: 237 KEIMAEIYKNGPVEAAFSVYSDFLMYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHA+RILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAVRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P D
Sbjct: 318 HCGIESEIVAGIPCTD 333
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 47/71 (66%), Gaps = 5/71 (7%)
Query: 44 NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
++ N+ +++LK G L + P+ + ++E + LP +FDSR +WPNCPTI+EI
Sbjct: 45 HNFHNVDQSYLKKLCGTF----LGGPKPPQRLWFAE-NMILPESFDSREQWPNCPTIKEI 99
Query: 104 RDQGSCGSCWG 114
RDQGSCGSCW
Sbjct: 100 RDQGSCGSCWA 110
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAEAWNFWTXXGLVSGGLYDS 175
>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
Length = 331
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 95/199 (47%), Positives = 117/199 (58%), Gaps = 41/199 (20%)
Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYS 168
GS GC+PY I PCEHHVNGTR C A G TPKC + C +NY + Y+KDL+FG SYS
Sbjct: 173 GSHKGCQPYLIEPCEHHVNGTRKPC-AEGGRTPKCHKTCDNKNYPISYEKDLSFGRSSYS 231
Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
+ S+ K I +I +GPVE AF+V+ D + YKSG +
Sbjct: 232 IRSDPKQIQMDIMTNGPVEAAFSVYSDFMSYKSGVY------------------------ 267
Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
+ G LGGHAIRILGWG ++ + YWL+ANSWNTDWGDNG FKI
Sbjct: 268 -------------RHVKGSLLGGHAIRILGWGMEKGT--PYWLVANSWNTDWGDNGTFKI 312
Query: 289 LRGKDECGIESSITAGVPK 307
LRG D CGIE S+ AG+P+
Sbjct: 313 LRGSDHCGIEDSVVAGLPR 331
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 26/32 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGGFPG AWR+W G+VSGG YGS +
Sbjct: 145 CGFGCNGGFPGAAWRFWENKGLVSGGLYGSHK 176
>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
Length = 344
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 92/195 (47%), Positives = 114/195 (58%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PYE PCEHHV G RPSC TPKC CQ Y++PY KD +G Y V SN+
Sbjct: 189 GCQPYEFPPCEHHVVGPRPSCGGDV-ETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQ 247
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++IMKE+ +HGPVE F V+ D YKSG +
Sbjct: 248 EAIMKEVMDHGPVEVDFEVYADFPNYKSGVY----------------------------- 278
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG LGGHA+R+LGWGE+ + YWLIANSWN+DWGDNG FKI+RG++
Sbjct: 279 --------QHVSGGLLGGHAVRLLGWGEE--NGVPYWLIANSWNSDWGDNGYFKIIRGRN 328
Query: 294 ECGIESSITAGVPKL 308
ECGIES + AG+PKL
Sbjct: 329 ECGIESDVNAGIPKL 343
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/62 (48%), Positives = 39/62 (62%), Gaps = 3/62 (4%)
Query: 54 LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
++ +G PD N LP L GY+ ++LP FD+R WP+CP+I EIRDQ SCGSC
Sbjct: 63 IRRMLGALPDPN--GGYLPTLCTGYTPSLDELPKEFDARKHWPHCPSISEIRDQSSCGSC 120
Query: 113 WG 114
W
Sbjct: 121 WA 122
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW YW +SGIV+G Y +
Sbjct: 157 CGMGCNGGFPHSAWSYWKRSGIVTGDLYNT 186
>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
Length = 334
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 99/210 (47%), Positives = 119/210 (56%), Gaps = 54/210 (25%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
GS S GCRPYEI PCEHHV G R C TPKC+++C++NY+V YK+D ++G
Sbjct: 172 GSYNSTQGCRPYEIPPCEHHVPGNRLPCSGDT-KTPKCIKKCEDNYNVAYKQDKHYGKHI 230
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSV E I E+Y++GPVE
Sbjct: 231 YSVRGGEDHIKAELYKNGPVE--------------------------------------- 251
Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
GAFTV+ DL+ YKSG ALGGHAI+I+GWG + +K YWLIANSWN+D
Sbjct: 252 -----GAFTVYADLLSYKSGVYKHVAGDALGGHAIKIMGWGVENGNK--YWLIANSWNSD 304
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKLD 309
WGDNG FKILRG+D CGIESSI AG P LD
Sbjct: 305 WGDNGFFKILRGEDHCGIESSIVAGEPLLD 334
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 92/198 (46%), Positives = 117/198 (59%), Gaps = 40/198 (20%)
Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV 169
G+ GC+PY +APCEHH G+ P+C + TPKCV C++ Y Y+ D +FG K YS+
Sbjct: 178 GTSDGCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKGYGKDYQDDKHFGRKVYSI 236
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
SS+EK I EI+++GPVE FTV+ D + YKSG +
Sbjct: 237 SSDEKQIQTEIFKNGPVEADFTVYADFLSYKSGVY------------------------- 271
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
++SG LGGHAIRILGWG + + YWL+ANSWN DWGD+G FKIL
Sbjct: 272 ------------QHQSGDVLGGHAIRILGWGTENGT--PYWLVANSWNEDWGDHGYFKIL 317
Query: 290 RGKDECGIESSITAGVPK 307
RGKDECGIE I AG+PK
Sbjct: 318 RGKDECGIEDDINAGIPK 335
>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
Length = 331
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 90/194 (46%), Positives = 115/194 (59%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY IA C+HHV G + C + + HTP+C + C+ YDV ++KD +FGA +YSV S+
Sbjct: 173 GCQPYLIASCDHHVVGKKQPCASKEEHTPRCSKTCEAGYDVSFEKDKHFGASAYSVRSSV 232
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++I EI +GPVEGAFTV+ D YKSG +
Sbjct: 233 EAIQTEIMTNGPVEGAFTVYADFPTYKSGVY----------------------------- 263
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG LGGHAIRILGWG + + YWL+ANSWN DWG G FKI+RGKD
Sbjct: 264 --------QHTSGAMLGGHAIRILGWGTENGT--PYWLVANSWNEDWGAMGYFKIIRGKD 313
Query: 294 ECGIESSITAGVPK 307
+CGIES ITAG+PK
Sbjct: 314 DCGIESQITAGMPK 327
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/32 (59%), Positives = 25/32 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGG+ G AWRY+ +G+V+GG Y SK+
Sbjct: 141 CGMGCNGGYLGAAWRYFEHTGLVTGGQYNSKE 172
>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
Length = 331
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 99/214 (46%), Positives = 119/214 (55%), Gaps = 54/214 (25%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I GS S GC+PYEIAPCEHHV+G RP C G TPKC + C++ Y V Y+ DL+
Sbjct: 165 IVSGGSFNSTQGCQPYEIAPCEHHVSGPRPKCSEGGG-TPKCAKTCEKGYIVDYESDLHH 223
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G K+YS+ +E I EI +GPV
Sbjct: 224 GGKAYSIMKDEDQIKYEIMNNGPV------------------------------------ 247
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIANS 275
EGAFTV+ D + YKSG LGGHAIR+LGWGE+ + YWL ANS
Sbjct: 248 --------EGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWGEENGTP--YWLCANS 297
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
WNTDWGDNGLFKILRG D CGIES I+AG+PK++
Sbjct: 298 WNTDWGDNGLFKILRGSDHCGIESEISAGLPKVN 331
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 29/34 (85%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
LCGFGCNGGFPG A++YWV SGIVSGG++ S Q
Sbjct: 142 HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQ 175
>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
Length = 331
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 118/341 (34%), Positives = 157/341 (46%), Gaps = 117/341 (34%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPD--YNLP---ANRLPELIGYSEVDEDLPANFDSRTKW 94
+A +N ++ + + MGVHPD Y++P +++PE + +LP FDSR W
Sbjct: 37 EAGRNFNKHLSIRYFRRLMGVHPDSKYHMPKYEVHQIPE-------NFELPKEFDSRAAW 89
Query: 95 PNCPTIREIRDQGSCGSCWGCRPYEIAP--------------------------CEHHVN 128
P CPTI EIRDQGSCGSCW E+ C N
Sbjct: 90 PMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLVSCCHLCGFGCN 149
Query: 129 GTRP----------------SCDASKGHTPKCVRECQENYDVP----------------- 155
G P S ++++G P + C+ + P
Sbjct: 150 GGFPGAAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVPGPRPKCSEGGGTPKCAKTC 209
Query: 156 -------YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPG 208
Y+ DL+ G K+YS+ +E I EI ++GPVEGAFTV+ D + YKSG +
Sbjct: 210 EKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVY---- 265
Query: 209 NETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEK 268
++ G LGGHAIR+LGWGE+ +
Sbjct: 266 ---------------------------------QHRHGLPLGGHAIRVLGWGEENGT--P 290
Query: 269 YWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
YWL ANSWNTDWGDNGLFKILRG D CGIES I+AG+PKL+
Sbjct: 291 YWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLPKLN 331
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/35 (71%), Positives = 29/35 (82%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
LCGFGCNGGFPG A++YWV SGIVSGG++ S Q
Sbjct: 142 HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQG 176
>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
Length = 332
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 90/203 (44%), Positives = 119/203 (58%), Gaps = 40/203 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GC+PYEIAPCEHH+NG+RP+C + TP+C + C+ Y+V + KD ++ +
Sbjct: 170 GPYGSMQGCQPYEIAPCEHHINGSRPACGKIEP-TPRCKKTCESGYNVTFNKDKHYAKSA 228
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSVSS + I EI +GPVE AFTV+ D YKSG +
Sbjct: 229 YSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVY---------------------- 266
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
++SG LGGHA++++GWG + + YWLIANSWN+DWGD G F
Sbjct: 267 ---------------QHESGAELGGHAVKMIGWGMEGST--PYWLIANSWNSDWGDMGFF 309
Query: 287 KILRGKDECGIESSITAGVPKLD 309
KILRG+DECGIE I AG P++D
Sbjct: 310 KILRGQDECGIERDIVAGEPRMD 332
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC+GGFP AW YW + G+V+GG YGS Q
Sbjct: 145 CGMGCHGGFPEAAWEYWKQDGLVTGGPYGSMQ 176
>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
Length = 364
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 93/199 (46%), Positives = 113/199 (56%), Gaps = 40/199 (20%)
Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV 169
GS GC PY+I PCEHHV G RP C G TP CV +C+ N + Y +D ++G SY+V
Sbjct: 206 GSKTGCLPYQIKPCEHHVPGDRPKCSEGGG-TPSCVSKCKGNTTIHYNQDKHYGLSSYAV 264
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
S+ I EI HGPVEGAFTV+ D YKSG +
Sbjct: 265 GSDPTQIQTEIMTHGPVEGAFTVYADFPTYKSGVY------------------------- 299
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
+ +G LGGHAIRILGWG + + YWL+ANSWNTDWGD G FKIL
Sbjct: 300 ------------KHVTGGVLGGHAIRILGWGSE--NGVAYWLVANSWNTDWGDKGYFKIL 345
Query: 290 RGKDECGIESSITAGVPKL 308
RG DECGIESS+ AG+P++
Sbjct: 346 RGSDECGIESSVVAGIPQI 364
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/31 (70%), Positives = 25/31 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GCNGGFPG AW+YW G+V+GG YGSK
Sbjct: 178 CGDGCNGGFPGSAWKYWNSDGLVTGGLYGSK 208
>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
Length = 347
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 91/198 (45%), Positives = 116/198 (58%), Gaps = 40/198 (20%)
Query: 111 SCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
S GC+PY I C+HHV G C + TPKC ++C+ NY+V YK D ++G SYSV
Sbjct: 189 SSQGCQPYMIPACDHHVVGHLQPCPKEEAKTPKCSKKCEANYNVTYKDDKHYGKNSYSVD 248
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
S EK IM EI +GPVE AFTV++D + YKSG +
Sbjct: 249 SVEK-IMTEIMTNGPVEAAFTVYEDFLSYKSGVY-------------------------- 281
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
+++G+ LGGHA++ILGWGED + YW++ANSWN DWG+ G F ILR
Sbjct: 282 -----------QHRTGQELGGHAVKILGWGEDNGT--PYWIVANSWNPDWGNQGFFNILR 328
Query: 291 GKDECGIESSITAGVPKL 308
GKDECGIES I AG+PKL
Sbjct: 329 GKDECGIESQIVAGLPKL 346
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/32 (59%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GGFP AWRY+ + G+V+GG Y S Q
Sbjct: 160 CGEGCQGGFPAEAWRYYEREGLVTGGLYNSSQ 191
>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
Length = 339
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 95/196 (48%), Positives = 121/196 (61%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY I PCEHHVNG+RP+C +G TP+C + C+ Y YK+D ++G SYSVSS+E
Sbjct: 178 GCKPYSIPPCEHHVNGSRPAC-TGEGDTPRCSKTCEPGYSPSYKEDKHYGYSSYSVSSDE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EIY++GPVEGAFTV+ D ++YKSG +
Sbjct: 237 NEIKAEIYKNGPVEGAFTVYSDFLMYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G +GGHAIRILGWGE+ + YWL+ANSWNTDWGD G FKILRG+D
Sbjct: 268 --------QHTTGDIMGGHAIRILGWGEE--NGVPYWLVANSWNTDWGDKGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 46/71 (64%), Gaps = 5/71 (7%)
Query: 44 NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
++ N+ ++LK G L +LP + +++ D LP +FD+R +WPNCPTI+EI
Sbjct: 45 HNFFNVEVSYLKKLCGTF----LGGPKLPRRVEFAD-DIKLPESFDAREQWPNCPTIKEI 99
Query: 104 RDQGSCGSCWG 114
RDQGSCGSCW
Sbjct: 100 RDQGSCGSCWA 110
>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
Length = 287
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 94/194 (48%), Positives = 115/194 (59%), Gaps = 40/194 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G+ S GCRPYEI PCEHHV G R C+ TPKC + C+ +Y VP+KKD +G
Sbjct: 134 GNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT-KTPKCEKTCESSYTVPFKKDKRYGKHV 192
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSVS +E +I E++++GPVEGAFTV+ DL+ YKSG +
Sbjct: 193 YSVSGHEDNIKAELFKNGPVEGAFTVYSDLLSYKSGVY---------------------- 230
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ G ALGGHAI+ILGWG + S KYWLIANSWN+DWGDNG
Sbjct: 231 ---------------QHTHGNALGGHAIKILGWGVENGS--KYWLIANSWNSDWGDNGFL 273
Query: 287 KILRGKDECGIESS 300
KILRG+D CGIESS
Sbjct: 274 KILRGEDHCGIESS 287
>gi|227293|prf||1701299A cathepsin B
Length = 339
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 92/196 (46%), Positives = 115/196 (58%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G T +C + C+ Y YK+D +FG SYSVS++
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTRRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 237 KKIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRIL WG + + YW ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDMMGGHAIRILVWGVE--NGVPYWAAANSWNLDWGDNGFFKILRGEN 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333
Score = 45.4 bits (106), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWTKKGLVSGGYYDS 175
>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
Length = 332
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 120/206 (58%), Gaps = 40/206 (19%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G+ GS GC+PY IAPCEHH+ G+RP C +GHT C ++C++ Y +PY KDL++
Sbjct: 167 IVSGGNYGSKEGCQPYSIAPCEHHIPGSRPPCRG-EGHTADCRKQCEKGYSIPYDKDLHY 225
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
YS + K I EI ++GPVE AF V++DL+ YK G +
Sbjct: 226 AEFVYSTERDVKEIQTEILKNGPVEAAFFVYEDLLTYKEGVY------------------ 267
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
+ +G +GGHAI+ILGWG + + YWLIANSWNTDWG+
Sbjct: 268 -------------------KHVAGAPVGGHAIKILGWGVENGT--PYWLIANSWNTDWGN 306
Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
NG FKILRG DECGIE ++AG+P++
Sbjct: 307 NGFFKILRGSDECGIEIDVSAGLPRI 332
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/32 (68%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG PG AW YW GIVSGG YGSK+
Sbjct: 146 CGAGCFGGDPGSAWEYWRDVGIVSGGNYGSKE 177
>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
Length = 338
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 98/213 (46%), Positives = 118/213 (55%), Gaps = 54/213 (25%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I GS S GC PYE+ PCEHHV G R C+ TPKC + C+ Y+VP+KKD ++
Sbjct: 170 IVSGGSYNSTQGCIPYEVPPCEHHVPGNRLPCNGDT-KTPKCQKTCEAGYNVPFKKDKHY 228
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G YSVS NE +I E++++GPVE
Sbjct: 229 GKHVYSVSGNEDNIKAELFKNGPVE----------------------------------- 253
Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
GAFTV+ DL+ YKSG ALGGHA++ILGWG + SK YWLIANS
Sbjct: 254 ---------GAFTVYSDLLSYKSGVYQHTDGSALGGHAVKILGWGVENGSK--YWLIANS 302
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
WN+DWGDNG FKILRG+D CGIESSI G P L
Sbjct: 303 WNSDWGDNGFFKILRGEDHCGIESSIVTGEPLL 335
>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
Length = 337
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 97/210 (46%), Positives = 116/210 (55%), Gaps = 54/210 (25%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
GS S GCRPYEI PCEHHV G R C TPKC ++C+ YDV YK+D +G
Sbjct: 173 GSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKCTKKCESGYDVNYKQDKQYGKHV 231
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y+VS +E I E++++GPVE
Sbjct: 232 YTVSGDEDHIRAELFKNGPVE--------------------------------------- 252
Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
GAFTV+ DL+ YKSG ALGGHA++ILGWG + +K YWLIANSWN+D
Sbjct: 253 -----GAFTVYSDLLSYKSGVYKHTQGDALGGHAVKILGWGVENDNK--YWLIANSWNSD 305
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKLD 309
WGDNG FKILRG+D CGIESSI G P LD
Sbjct: 306 WGDNGFFKILRGEDHCGIESSIVTGEPFLD 335
>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
Length = 342
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/193 (45%), Positives = 109/193 (56%), Gaps = 39/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P C TPKC ++CQ+ Y PYKKD +G SY+V +NE
Sbjct: 187 GCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI HGPVE AFTV D + YKSG
Sbjct: 247 NAIKKEIMMHGPVEAAFTVHSDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +G +GGHA+RI+GWG ++K+ YWLIANSWN DWG+ G F+ILRGKD
Sbjct: 276 ------IYKYMTGAEIGGHAVRIIGWGVEKKT--PYWLIANSWNEDWGEKGYFRILRGKD 327
Query: 294 ECGIESSITAGVP 306
ECGIES +T G+P
Sbjct: 328 ECGIESEVTGGLP 340
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 18/27 (66%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG AW YWV+ GIV+G +
Sbjct: 155 CGLGCQGGFPGAAWDYWVEDGIVTGSS 181
>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
Length = 342
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/193 (45%), Positives = 109/193 (56%), Gaps = 39/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P C TPKC ++CQ+ Y PYKKD +G SY+V +NE
Sbjct: 187 GCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI HGPVE AFTV D + YKSG
Sbjct: 247 NAIKKEIMMHGPVEAAFTVHSDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +G +GGHA+RI+GWG ++K+ YWLIANSWN DWG+ G F+ILRGKD
Sbjct: 276 ------IYKYMTGAEIGGHAVRIIGWGVEKKT--PYWLIANSWNEDWGEKGYFRILRGKD 327
Query: 294 ECGIESSITAGVP 306
ECGIES +T G+P
Sbjct: 328 ECGIESEVTGGLP 340
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 18/27 (66%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG AW YWV+ GIV+G +
Sbjct: 155 CGLGCQGGFPGAAWDYWVEDGIVTGSS 181
>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
Length = 341
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 99/209 (47%), Positives = 116/209 (55%), Gaps = 54/209 (25%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
GS S GCRPYEI PCEHHV G R C+ TPKC + C+ +Y+V Y KD +G
Sbjct: 177 GSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDS-KTPKCHKTCESSYNVDYHKDKRYGKHV 235
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSVSS E I E+Y++GPVE
Sbjct: 236 YSVSSKEDHIKAELYKNGPVE--------------------------------------- 256
Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
GAFTV+ DL+ YK+G ALGGHAI+ILGWG + +K YWLIANSWN+D
Sbjct: 257 -----GAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNK--YWLIANSWNSD 309
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
WGDNG FKILRG+D CGIESSI AG P L
Sbjct: 310 WGDNGFFKILRGEDHCGIESSIVAGEPLL 338
>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 90/195 (46%), Positives = 111/195 (56%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PYE PCEHH G P CD TP C R CQ Y+V Y+ D +G Y V SN+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDGDV-ETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++IMKE+ +HGPVE F V+ D YKSG +
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVY----------------------------- 281
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG LGGHA+R+LGWGE+ + YWLIANSWNTDWGDNG FKI+RGK+
Sbjct: 282 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIANSWNTDWGDNGYFKIIRGKN 331
Query: 294 ECGIESSITAGVPKL 308
ECGIES + AG+PK+
Sbjct: 332 ECGIESDVNAGIPKI 346
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 37/62 (59%), Gaps = 3/62 (4%)
Query: 54 LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
++ +G PD N +L L GY +LP +FD+R +W +CP+I EIRDQ SCGS
Sbjct: 66 IRRMLGALPDPN--GEQLETLCTGYELTVNELPKSFDARKEWTHCPSISEIRDQSSCGSY 123
Query: 113 WG 114
W
Sbjct: 124 WA 125
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 20/30 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW YW GIV+G Y +
Sbjct: 160 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNT 189
>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 90/195 (46%), Positives = 111/195 (56%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PYE PCEHH G P CD TP C R CQ Y+V Y+ D +G Y V SN+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDGDV-ETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++IMKE+ +HGPVE F V+ D YKSG +
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVY----------------------------- 281
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG LGGHA+R+LGWGE+ + YWLIANSWNTDWGDNG FKI+RGK+
Sbjct: 282 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIANSWNTDWGDNGYFKIIRGKN 331
Query: 294 ECGIESSITAGVPKL 308
ECGIES + AG+PK+
Sbjct: 332 ECGIESDVNAGIPKI 346
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 38/62 (61%), Gaps = 3/62 (4%)
Query: 54 LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
++ +G PD N +L L GY +LP +FD+R +W +CP+I EIRDQ SCGSC
Sbjct: 66 IRRMLGALPDPN--GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSC 123
Query: 113 WG 114
W
Sbjct: 124 WA 125
Score = 45.1 bits (105), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 20/30 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW YW GIV+G Y +
Sbjct: 160 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNT 189
>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 90/195 (46%), Positives = 111/195 (56%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PYE PCEHH G P CD TP C R CQ Y+V Y+ D +G Y V SN+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDGDV-ETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++IMKE+ +HGPVE F V+ D YKSG +
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVY----------------------------- 281
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG LGGHA+R+LGWGE+ + YWLIANSWNTDWGDNG FKI+RGK+
Sbjct: 282 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIANSWNTDWGDNGYFKIIRGKN 331
Query: 294 ECGIESSITAGVPKL 308
ECGIES + AG+PK+
Sbjct: 332 ECGIESDVNAGIPKI 346
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 38/62 (61%), Gaps = 3/62 (4%)
Query: 54 LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
++ +G PD N +L L GY +LP +FD+R +W +CP+I EIRDQ SCGSC
Sbjct: 66 IRRMLGALPDPN--GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSC 123
Query: 113 WG 114
W
Sbjct: 124 WA 125
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 20/30 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW YW GIV+G Y +
Sbjct: 160 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNT 189
>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
Length = 338
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 96/209 (45%), Positives = 116/209 (55%), Gaps = 54/209 (25%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
GS S GCRPYEI PCEHHV G R C+ TPKC + C+ NY+V Y+KD +G
Sbjct: 174 GSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDS-KTPKCEKTCESNYNVDYRKDKRYGKHV 232
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
+SVSS E I E++++GPVE
Sbjct: 233 FSVSSKEDHIRAELFKNGPVE--------------------------------------- 253
Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
GAFTV+ DL+ YK+G ALGGHA++ILGWG + +K YWLIANSWN+D
Sbjct: 254 -----GAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNK--YWLIANSWNSD 306
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
WGDNG FKILRG+D CGIESSI AG P
Sbjct: 307 WGDNGFFKILRGEDHCGIESSIVAGEPMF 335
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 38/75 (50%), Positives = 50/75 (66%), Gaps = 2/75 (2%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+A +N + P AH+K GV PDY+L ++L ++ E+ LP NFD R KWPNCPT
Sbjct: 42 KAGRNFPEHTPFAHIKKLAGVLPDYHL--SKLSKVEHEDELIASLPENFDPRDKWPNCPT 99
Query: 100 IREIRDQGSCGSCWG 114
+ E+RDQGSCGSCW
Sbjct: 100 LNEVRDQGSCGSCWA 114
>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
Length = 338
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/209 (45%), Positives = 116/209 (55%), Gaps = 54/209 (25%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
GS S GCRPYEI PCEHHV G R C+ TPKC + C+ NY+V Y+KD +G
Sbjct: 174 GSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDS-KTPKCEKTCESNYNVDYRKDKRYGKHV 232
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
+SVSS E I E++++GPVE
Sbjct: 233 FSVSSKEDHIRAELFKNGPVE--------------------------------------- 253
Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
GAFTV+ DL+ YK+G ALGGHA++ILGWG + +K YWLIANSWN+D
Sbjct: 254 -----GAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNK--YWLIANSWNSD 306
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
WGDNG FKILRG+D CGIESSI AG P
Sbjct: 307 WGDNGFFKILRGEDHCGIESSIVAGEPMF 335
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 38/75 (50%), Positives = 50/75 (66%), Gaps = 2/75 (2%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+A +N + P AH+K GV PDY+L ++L ++ E+ LP NFD R KWPNCPT
Sbjct: 42 KAGRNFPEHTPFAHIKRLAGVLPDYHL--SKLSKVEHEDELIASLPENFDPRDKWPNCPT 99
Query: 100 IREIRDQGSCGSCWG 114
+ E+RDQGSCGSCW
Sbjct: 100 LNEVRDQGSCGSCWA 114
>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 337
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 98/213 (46%), Positives = 115/213 (53%), Gaps = 54/213 (25%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G+ S GCRPYEI PCEHHV G R C TPKC + C+ Y+V YKKD +
Sbjct: 169 IVSGGNYNSTQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKCQKNCENGYNVMYKKDKRY 227
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G YSVS+ E I E+Y++GPVE
Sbjct: 228 GKHVYSVSAGEDHIRAELYKNGPVE----------------------------------- 252
Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
GAFTV+ DL+ YKSG ALGGHAI+ILGWG + +K YWL+ANS
Sbjct: 253 ---------GAFTVYADLLAYKSGVYKHIQGDALGGHAIKILGWGVENDNK--YWLVANS 301
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
WNTDWGDNG FKILRG++ CGIE SI AG P L
Sbjct: 302 WNTDWGDNGFFKILRGENHCGIEGSIIAGEPLL 334
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 35/65 (53%), Gaps = 12/65 (18%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLP 67
+CG GCNGG P +AW YW GIVSGG Y S Q + IP ++++P
Sbjct: 147 ICGLGCNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRP--YEIPPC----------EHHVP 194
Query: 68 ANRLP 72
NR+P
Sbjct: 195 GNRMP 199
>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
Length = 342
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 86/194 (44%), Positives = 111/194 (57%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P C TPKC ++CQ+ Y PYKKD +G SY+V +NE
Sbjct: 187 GCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI HGPVE AFTV D + YKSG
Sbjct: 247 NAIKKEIMMHGPVEVAFTVHSDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +G +G HA+RI+GWG ++K+ YWLIANSWN DWG+ G F++LRGKD
Sbjct: 276 ------IYKYMTGAEIGEHAVRIIGWGVEKKT--PYWLIANSWNEDWGEKGYFRMLRGKD 327
Query: 294 ECGIESSITAGVPK 307
ECGIES++T+G+P+
Sbjct: 328 ECGIESAVTSGLPR 341
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 18/27 (66%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG AW YWV+ GIV+G +
Sbjct: 155 CGLGCQGGFPGAAWDYWVEDGIVTGSS 181
>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 89/195 (45%), Positives = 111/195 (56%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PYE PCEH+ G P CD TP C R CQ Y+V Y+ D +G Y V SN+
Sbjct: 192 GCQPYEFPPCEHNTLGPLPVCDGDV-ETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++IMKE+ +HGPVE F V+ D YKSG +
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVY----------------------------- 281
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG LGGHA+R+LGWGE+ + YWLIANSWNTDWGDNG FKI+RGK+
Sbjct: 282 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIANSWNTDWGDNGYFKIIRGKN 331
Query: 294 ECGIESSITAGVPKL 308
ECGIES + AG+PK+
Sbjct: 332 ECGIESDVNAGIPKI 346
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 38/62 (61%), Gaps = 3/62 (4%)
Query: 54 LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
++ +G PD N +L L GY +LP +FD+R +W +CP+I EIRDQ SCGSC
Sbjct: 66 IRRMLGALPDPN--GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSC 123
Query: 113 WG 114
W
Sbjct: 124 WA 125
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 26/51 (50%), Gaps = 9/51 (17%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA---------EKNSLSNIP 50
CG GCNGGFP AW YW GIV+G Y + E N+L +P
Sbjct: 160 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHNTLGPLP 210
>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
Length = 341
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 91/195 (46%), Positives = 112/195 (57%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I C+HHV G C S G TPKC C+ Y+V Y+KD ++G+ +YSV E
Sbjct: 186 GCLPYTIKACDHHVVGKLQPCSKSIGPTPKCKHTCEAGYNVTYEKDKHYGSSAYSVHGVE 245
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EI +GPVEGAFTV+ D YKSG +
Sbjct: 246 K-IMTEIMTNGPVEGAFTVYADFPQYKSGVY----------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ LGGHAI+ILGWG + + + YWL+ANSWN DWGD G FKILRG+D
Sbjct: 276 --------KHTTGQPLGGHAIKILGWGTE--NGDDYWLVANSWNPDWGDQGFFKILRGQD 325
Query: 294 ECGIESSITAGVPKL 308
ECGIES I+AG PKL
Sbjct: 326 ECGIESQISAGEPKL 340
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 24/38 (63%)
Query: 3 TQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
T R CG GC GGFP AW Y+ K G+V+GG Y S Q
Sbjct: 148 TSCCRTCGNGCEGGFPSAAWSYYKKDGLVTGGQYNSHQ 185
>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
Length = 347
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 88/195 (45%), Positives = 111/195 (56%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PYE PCEHHV G PSCD TP C CQ Y++PY+KD +G K Y + SN
Sbjct: 191 GCQPYEFPPCEHHVIGPLPSCDGDV-ETPSCKTNCQPGYNIPYEKDKWYGEKVYRIHSNP 249
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++IM E+ +GPVE F V+ D YKSG +
Sbjct: 250 EAIMLELMRNGPVEVDFEVYADFPNYKSGVY----------------------------- 280
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG LGGHA+R+LGWGE+ + YWLIANSWN+DWGD G FKI+RGK+
Sbjct: 281 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIANSWNSDWGDKGYFKIVRGKN 330
Query: 294 ECGIESSITAGVPKL 308
ECGIES + AG+PK+
Sbjct: 331 ECGIESDVNAGIPKI 345
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 38/61 (62%), Gaps = 3/61 (4%)
Query: 54 LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
++ +G PD P E + + ++LP +FD+R +WP+CP+I EIRDQ SCGSCW
Sbjct: 67 IRRMLGALPD---PNGEQLETLCTGYISDELPKSFDARVEWPHCPSISEIRDQSSCGSCW 123
Query: 114 G 114
Sbjct: 124 A 124
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 20/30 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW YW GIV+G Y +
Sbjct: 159 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNT 188
>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
Length = 283
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 88/191 (46%), Positives = 112/191 (58%), Gaps = 40/191 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G+ S GCRPYEI PCEHHV G R C+ TPKC + C+ +Y+VP+KKD +G
Sbjct: 133 GNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT-KTPKCQKNCESSYNVPFKKDKRYGKHV 191
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSVS +E I E++++GPVE AFTV+ DL+ YK+G +
Sbjct: 192 YSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVY---------------------- 229
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ G ALGGHAI+I+GWG + + KYWLIANSWN+DWGDNG F
Sbjct: 230 ---------------KHTEGNALGGHAIKIIGWGVE--NNNKYWLIANSWNSDWGDNGFF 272
Query: 287 KILRGKDECGI 297
KILRG+D CGI
Sbjct: 273 KILRGEDHCGI 283
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 25/36 (69%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
E+ LP FD R KWP C T+ EIRDQGSCGSCW
Sbjct: 38 ELIATLPEIFDPRDKWPECLTLNEIRDQGSCGSCWA 73
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 110/194 (56%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY +APCEHH G+ P+C + TPKCV C++ Y Y+ D +FG K YS+SS+E
Sbjct: 182 GCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKGYGKDYQDDKHFGKKVYSISSDE 240
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I EI+++GPVE F V D + YKSG +
Sbjct: 241 KQIQTEIFKNGPVEADFIVLADFLSYKSGVY----------------------------- 271
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ S +GGHAIRILGWG + + YWL ANSWN DWGD+G FKILRGKD
Sbjct: 272 --------QHHSDDVIGGHAIRILGWGTENGT--PYWLAANSWNEDWGDHGYFKILRGKD 321
Query: 294 ECGIESSITAGVPK 307
ECGIE I AG+PK
Sbjct: 322 ECGIEEDINAGIPK 335
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 24/35 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG GCNGG P AW YW +SG+V+GG YG+ K
Sbjct: 150 CGAGCNGGTPAAAWEYWKESGLVTGGLYGTNDGCK 184
>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
Length = 341
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/195 (46%), Positives = 111/195 (56%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY I C+HHV G C G TPKC C+ Y+V Y+KD ++G +YSV E
Sbjct: 186 GCQPYTIKACDHHVVGKLQPCSKDIGPTPKCKHTCEAGYNVTYEKDKHYGMSAYSVHGVE 245
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EI +GPVEGAFTV+ D YKSG +
Sbjct: 246 K-IMTEIMTNGPVEGAFTVYADFPQYKSGVY----------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ LGGHAI+ILGWG + + + YWL+ANSWN DWGD G FKILRG+D
Sbjct: 276 --------KHTTGQPLGGHAIKILGWGTE--NGDDYWLVANSWNPDWGDQGFFKILRGQD 325
Query: 294 ECGIESSITAGVPKL 308
ECGIES I+AG PKL
Sbjct: 326 ECGIESQISAGEPKL 340
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 24/38 (63%)
Query: 3 TQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
T R CG GC GGFP AW Y+ + G+V+GG Y S Q
Sbjct: 148 TSCCRTCGNGCEGGFPSAAWSYYKRDGLVTGGQYNSHQ 185
>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/195 (46%), Positives = 109/195 (55%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY+IA C+HHV G C TPKC R+C+ Y+V Y D +FG +YSV S+
Sbjct: 101 GCQPYKIAACDHHVVGKLKPCKGDS-PTPKCERKCEAGYNVSYSDDKHFGQSAYSVRSDP 159
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I KEI +GPVEGAFTV+ D YKSG +
Sbjct: 160 AEIQKEIMTNGPVEGAFTVYADFPTYKSGVY----------------------------- 190
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG ALGGHAI+ILGWGE+ + YWL+ANSWN+DWGD G FKI RG D
Sbjct: 191 --------QHTSGSALGGHAIKILGWGEENGT--PYWLVANSWNSDWGDEGFFKIKRGND 240
Query: 294 ECGIESSITAGVPKL 308
ECGIES I G+PK
Sbjct: 241 ECGIESGIVGGLPKF 255
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GCNGG+P AW +W G+V+GG Y S +
Sbjct: 67 ETCGMGCNGGYPESAWDHWKSKGLVTGGQYDSHKG 101
>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 338
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/210 (42%), Positives = 115/210 (54%), Gaps = 40/210 (19%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
I I G GS GCRPYEI PCEHH +G RP C + TPKC R+C E++D Y+
Sbjct: 167 AIDGIVSGGLYGSHVGCRPYEIPPCEHHTSGNRPDCKGNS-KTPKCQRQCVESFDGKYQA 225
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +F + Y+V ++E+ IM EI +GPVE F V+ D + YKSG +
Sbjct: 226 DKHFASNVYNVRASEEDIMNEILVYGPVEADFIVYADFLTYKSGVY-------------- 271
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ G LGGHA++ILGWGE+ + YWL ANSWNT
Sbjct: 272 -----------------------QHVKGGFLGGHAVKILGWGEE--NGVPYWLCANSWNT 306
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWGD G FKILRG + C IE+ I AG+PK+
Sbjct: 307 DWGDGGFFKILRGYNHCKIEADINAGIPKI 336
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 22/30 (73%), Positives = 26/30 (86%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
LP+ FD+R WP+CPTI EIRDQG+CGSCW
Sbjct: 84 LPSEFDARKAWPDCPTIGEIRDQGTCGSCW 113
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/31 (74%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGCNGG P AWRYW GIVSGG YGS
Sbjct: 149 FCGFGCNGGLPENAWRYWAIDGIVSGGLYGS 179
>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
Length = 341
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/194 (46%), Positives = 110/194 (56%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I C+HHVNGT CD S TP+CVR C++ Y+V + D ++G KSYSV SN
Sbjct: 186 GCMPYPIKACDHHVNGTLGPCDKSIPPTPRCVRMCRKGYNVDFADDKHYGKKSYSVPSNV 245
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EI +GPVE FTV+ D LYKSG + + +T Q
Sbjct: 246 TQIQVEIMTNGPVEADFTVYADFPLYKSGVY-----------------QRHTDQ------ 282
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
ALGGHAIR+LGWG ++ YWL ANSWNT+WGD G FKILRG D
Sbjct: 283 --------------ALGGHAIRLLGWGVEKGV--PYWLAANSWNTEWGDKGFFKILRGSD 326
Query: 294 ECGIESSITAGVPK 307
ECGIE + AG+P+
Sbjct: 327 ECGIEDDVVAGIPR 340
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/32 (68%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGGFPG AW YWV GIV+GG Y S +
Sbjct: 154 CGSGCNGGFPGAAWSYWVHKGIVTGGNYDSDE 185
>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
Length = 216
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 113/194 (58%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P+C TP+C ++CQ+ Y PYK+D ++G +SY+V SNE
Sbjct: 61 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYKQDKHYGDESYNVISNE 120
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I KEI +GPVE AF V++D + YKSG
Sbjct: 121 KAIQKEIMMNGPVEAAFDVYEDFLNYKSG------------------------------- 149
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ + +G +GGHAIRI+GWG K + YWLIANSWN DWG+ GLF+I+RG+D
Sbjct: 150 ------IYRHVTGSIVGGHAIRIIGWG--VKKRTPYWLIANSWNEDWGEKGLFRIVRGRD 201
Query: 294 ECGIESSITAGVPK 307
EC IES++ AG+ K
Sbjct: 202 ECSIESNVVAGLIK 215
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/27 (70%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG+AW YWV GIV+GG+
Sbjct: 29 CGQGCQGGFPGVAWDYWVTQGIVTGGS 55
>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
[Rhipicephalus pulchellus]
Length = 346
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/194 (45%), Positives = 106/194 (54%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I C+HHVNGT CD TP+CV C++ YDV Y D ++G SYSV S E
Sbjct: 191 GCMPYPIKACDHHVNGTLGPCDKKIPPTPRCVHMCRKGYDVDYHDDKHYGKSSYSVPSEE 250
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I EI +GPVE FTV+ D + YKSG + +E
Sbjct: 251 KQIQAEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDE----------------------- 287
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
ALGGHAIR+LGWG + + YWL ANSWNT+WGD G FKILRG D
Sbjct: 288 --------------ALGGHAIRLLGWGVE--NGVPYWLAANSWNTEWGDKGFFKILRGSD 331
Query: 294 ECGIESSITAGVPK 307
ECGIE + AG+PK
Sbjct: 332 ECGIEDDVVAGLPK 345
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 26/32 (81%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
R CG GCNGGFPG AW +WVK+GIV+GG Y S
Sbjct: 157 RTCGNGCNGGFPGSAWSFWVKTGIVTGGNYDS 188
>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
Length = 334
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 114/322 (35%), Positives = 146/322 (45%), Gaps = 107/322 (33%)
Query: 54 LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+K MG D L ++L + + +LP NFD R KWPNCPT+ EIRDQGSCGSCW
Sbjct: 54 IKKLMGALEDKYL--HKLYTVEHDDDTINNLPENFDPRDKWPNCPTLNEIRDQGSCGSCW 111
Query: 114 GCRPYEIAPCEH--HVNGTR-------------PSC------------------------ 134
E + + NGT+ P C
Sbjct: 112 AFGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCPVCGLGCNGGIPSFAWEYWKHFGIVSG 171
Query: 135 ---DASKGHTPKCVRECQEN------------------------YDVPYKKDLNFGAKSY 167
++S+G P + C+ + Y YK D +G Y
Sbjct: 172 GNYNSSQGCLPYEIPPCEHHVPGNRIPCNGETSTPKCHRSCRKEYTNSYKSDKKYGKHVY 231
Query: 168 SVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQ 227
SV E+ I EI+++GPVEGAFTV+ DL+ YKSG +
Sbjct: 232 SVGGGEEHIKAEIFKNGPVEGAFTVYADLLTYKSGVY----------------------- 268
Query: 228 LGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
+ G+ALGGHAI+I+GWG + + KYWLIANSWN+DWGDNG FK
Sbjct: 269 --------------KHTEGEALGGHAIKIMGWGVE--NGNKYWLIANSWNSDWGDNGFFK 312
Query: 288 ILRGKDECGIESSITAGVPKLD 309
ILRG+D CGIESSI AG P D
Sbjct: 313 ILRGEDHCGIESSIVAGEPSYD 334
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 22/34 (64%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
+CG GCNGG P AW YW GIVSGG Y S Q
Sbjct: 146 VCGLGCNGGIPSFAWEYWKHFGIVSGGNYNSSQG 179
>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
Length = 260
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 92/236 (38%), Positives = 124/236 (52%), Gaps = 46/236 (19%)
Query: 73 ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP 132
E I + + +DLP FD+R +W C +I+EIRDQ CGSCWGC Y + C P
Sbjct: 70 ETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWGCMSYPLPRC-------NP 122
Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN-EKSIMKEIYEHGPVEGAFT 191
SC + P C +EC + + Y++D ++ ++Y + S E+ I EI ++GPV +FT
Sbjct: 123 SC-KTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASFT 181
Query: 192 VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGG 251
V+ D I Y SG + G K LGG
Sbjct: 182 VYADFIHYLSGVYKFDGE------------------------------------SKLLGG 205
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
HA+RI+GWG E YWL++NSWN WGD GLFKI RGK+ECGIE ITAG+P+
Sbjct: 206 HAVRIIGWG-IENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 260
>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
Length = 373
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 87/194 (44%), Positives = 108/194 (55%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I C+HHVNGT CD + TP+CVR C++ YDV + D ++G +YSV +
Sbjct: 218 GCMPYPIKACDHHVNGTLGPCDKTIPPTPRCVRMCRKGYDVDFMDDKHYGRHAYSVPAKA 277
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I EI +GPVE FTV++D + YKSG +
Sbjct: 278 KQIQAEIMMNGPVEADFTVYEDFLHYKSGVY----------------------------- 308
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ ALGGHAIR+LGWG + + YWL ANSWNT+WGD G FKILRG D
Sbjct: 309 --------QRHTDSALGGHAIRLLGWGVE--NGVPYWLAANSWNTEWGDKGFFKILRGSD 358
Query: 294 ECGIESSITAGVPK 307
ECGIES I AG+PK
Sbjct: 359 ECGIESDIVAGLPK 372
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/32 (68%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGGFPG AW YWV GIV+GG Y S +
Sbjct: 186 CGAGCNGGFPGSAWSYWVHKGIVTGGNYDSDE 217
>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
Length = 332
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 91/213 (42%), Positives = 117/213 (54%), Gaps = 54/213 (25%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G+ GS GC+PY IAPCEHHV G RP+C + +G TP C +C + + Y KDL +
Sbjct: 167 IVSGGNYGSKQGCQPYSIAPCEHHVPGPRPAC-SGEGSTPDCRNQCDKRSGISYDKDLYY 225
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G +YS+ K I EI ++GPVE
Sbjct: 226 GESAYSLEDEAKQIQAEILKNGPVEA---------------------------------- 251
Query: 223 DNTSQLGAEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
AFTV++DL+ YK +G LGGHAI+ILGWG + + YWL+ANS
Sbjct: 252 ----------AFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGWGVENDTP--YWLVANS 299
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
WNTDWG+NG FKILRGKDECGIE ++AG+P+L
Sbjct: 300 WNTDWGNNGFFKILRGKDECGIEIDVSAGLPRL 332
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 25/32 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGC+GG+P AW YW GIVSGG YGSKQ
Sbjct: 146 CGFGCDGGYPASAWDYWQNVGIVSGGNYGSKQ 177
>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
Length = 334
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 112/304 (36%), Positives = 155/304 (50%), Gaps = 47/304 (15%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
+A +N + +++ +GVH D Y LP+ R V DLP +FDSR +WPN
Sbjct: 43 KAGRNFHEGVTMKYIRGLLGVHKDNHKYRLPSIR-------HAVPGDLPESFDSREQWPN 95
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEH--HVNGTRPSCDASKGHTPKCVREC------ 148
CPTI EIRDQGSCGSCW E H H NG + + + S C C
Sbjct: 96 CPTISEIRDQGSCGSCWAFGAAEAMSDRHCIHSNG-KVNVEISAEDLLTCCDSCGMGCNG 154
Query: 149 ---QENYDVPYKKDL--------NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL- 196
++ K L + G + Y+++S E ++ G +
Sbjct: 155 GFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIASCEHHTKGKLPPCGDIVDTPQCVHMCE 214
Query: 197 ----ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG-AEGAFTVFDDLILYKS------ 245
+ Y++ ++F G ++ ++ + I+ S G E AFTV+ D + YKS
Sbjct: 215 KGYNVSYRADKYF--GKKSYSIDEQEDQIKTEISTNGPVEAAFTVYADFVTYKSGVYRHV 272
Query: 246 -GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
G+ +GGHA+RILGWG + S YWL+ANSWNTDWGD G FKILRG DECGIESSI AG
Sbjct: 273 TGEEMGGHAVRILGWGTE--SGTPYWLVANSWNTDWGDKGYFKILRGSDECGIESSIVAG 330
Query: 305 VPKL 308
+PK+
Sbjct: 331 LPKV 334
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/30 (70%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFPG AW YWV G+V+GG Y S
Sbjct: 148 CGMGCNGGFPGSAWEYWVDKGLVTGGLYNS 177
>gi|741376|prf||2007265A cathepsin B
Length = 153
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 89/194 (45%), Positives = 112/194 (57%), Gaps = 54/194 (27%)
Query: 123 CEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYE 182
CEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++EK IM EIY+
Sbjct: 1 CEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYK 59
Query: 183 HGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242
+GPVE GAF+V+ D +L
Sbjct: 60 NGPVE--------------------------------------------GAFSVYSDFLL 75
Query: 243 YKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
YKSG + +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D C
Sbjct: 76 YKSGVYQHVTGEMMGGHAIRILGWGVENGTP--YWLVANSWNTDWGDNGFFKILRGQDHC 133
Query: 296 GIESSITAGVPKLD 309
GIES + AG+P+ D
Sbjct: 134 GIESEVVAGIPRTD 147
>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
Length = 341
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 97/209 (46%), Positives = 114/209 (54%), Gaps = 54/209 (25%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
GS S GCRPYEI PCEHHV G R C+ TPKC + C+ +Y V Y KD +G
Sbjct: 177 GSYNSGQGCRPYEIPPCEHHVPGNRVPCNGDS-KTPKCHKTCEASYSVDYHKDKRYGKHV 235
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSVSS E I E++++GPVE
Sbjct: 236 YSVSSKEDHIKAELFKNGPVE--------------------------------------- 256
Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
GAFTV+ DL+ YK+G ALGGHAI+ILGWG + +K Y LIANSWN+D
Sbjct: 257 -----GAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNK--YRLIANSWNSD 309
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
WGDNG FKILRG+D CGIESSI AG P L
Sbjct: 310 WGDNGFFKILRGEDHCGIESSIVAGEPLL 338
>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
Length = 333
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 113/202 (55%), Gaps = 41/202 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GCRPY I PC H NG + C S TPKC+++C Y+VPY KD +FG +
Sbjct: 173 GPFGSDQGCRPYTIEPCVHVENGAQSPCKDSI--TPKCIKKCLPGYNVPYAKDKSFGKST 230
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YS++++E+ I KEI+ +GPVE FTVFDD YK G
Sbjct: 231 YSIANDERQIRKEIFTNGPVEATFTVFDDFASYKHG------------------------ 266
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ + SG G HA+RILGWG + + KYWL ANSWN+DWGDNG F
Sbjct: 267 -------------IYQHTSGNLAGEHAVRILGWGVENGT--KYWLAANSWNSDWGDNGYF 311
Query: 287 KILRGKDECGIESSITAGVPKL 308
KILRG + IES+I AG+PK+
Sbjct: 312 KILRGSNHVDIESAIVAGLPKV 333
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 25/32 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC+GG PG W++W++ G+VSGG +GS Q
Sbjct: 148 CGHGCDGGAPGAGWKHWIEKGLVSGGPFGSDQ 179
>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
Length = 356
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 86/200 (43%), Positives = 110/200 (55%), Gaps = 53/200 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY I PCEHH G RP C +G TPKC +C + Y + +D ++G+ +Y + +NE
Sbjct: 192 GCQPYAIEPCEHHTEGDRPPCTGEEGTTPKCSHKCVDGYTGNFAQDKHYGSVAYRIPANE 251
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+IM EIY++GPV EGA
Sbjct: 252 KAIMNEIYKNGPV--------------------------------------------EGA 267
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D YKSG ALGGHAIR+LGWGE+ + EKYWL NSWNTDWG+NG F
Sbjct: 268 FIVYEDFPTYKSGVYSHHTGSALGGHAIRVLGWGEE--NGEKYWLCGNSWNTDWGNNGFF 325
Query: 287 KILRGKDECGIESSITAGVP 306
KI RG +ECGIES + G+P
Sbjct: 326 KIKRGVNECGIESEMVGGIP 345
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 46/77 (59%), Gaps = 6/77 (7%)
Query: 38 SKQAEKNSLSNIPRAHLKSWMG-VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
S +A N SN H+ G + D LP N L ++ D +LPANFDSR WP+
Sbjct: 53 SWKAGANFNSNYAPKHVAGLCGTIMGDDRLPVNHL-----LNDADLELPANFDSREAWPD 107
Query: 97 CPTIREIRDQGSCGSCW 113
CP+I E+RDQGSCGSCW
Sbjct: 108 CPSISEVRDQGSCGSCW 124
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/29 (68%), Positives = 24/29 (82%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
+CG GCNGGFP AW YWV++G+VSGG Y
Sbjct: 160 VCGNGCNGGFPQAAWEYWVQNGLVSGGLY 188
>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
Length = 216
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 82/194 (42%), Positives = 113/194 (58%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P+C TP+C + CQ+ Y PY++D ++G +SY+V SNE
Sbjct: 61 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISNE 120
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I KEI +GPVE AF V++D + YKSG
Sbjct: 121 KAIQKEIMMNGPVEAAFDVYEDFLNYKSG------------------------------- 149
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ + +G +GGHAIRI+GWG ++++ YWLIANSWN DWG+ GLF+I+RG+D
Sbjct: 150 ------IYRHVTGSIVGGHAIRIIGWGVEKRT--PYWLIANSWNEDWGEKGLFRIVRGRD 201
Query: 294 ECGIESSITAGVPK 307
EC IES + AG+ K
Sbjct: 202 ECSIESHVVAGLIK 215
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/27 (70%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG AW YWV GIV+GG+
Sbjct: 29 CGDGCQGGFPGQAWDYWVTQGIVTGGS 55
>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 84/195 (43%), Positives = 113/195 (57%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY I C+HHV ++ C+ S TPKC + C++ Y++ YK D ++G SYS+++++
Sbjct: 176 GCQPYAIPACDHHVPHSKNPCNGSLP-TPKCEKVCEKGYNITYKNDKHYGVTSYSINNDQ 234
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
IM+EI +GPVE AFTVF D YKSG +
Sbjct: 235 NEIMREIMTNGPVEAAFTVFADFPNYKSGVY----------------------------- 265
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ LGGHAI+ILGWG + + YWL+ANSWN WGDNG FKILRG D
Sbjct: 266 --------QHVSGEELGGHAIKILGWGVENNT--PYWLVANSWNPSWGDNGFFKILRGSD 315
Query: 294 ECGIESSITAGVPKL 308
ECGIE + AG+PK+
Sbjct: 316 ECGIEDEVVAGLPKV 330
>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
Length = 366
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 86/195 (44%), Positives = 109/195 (55%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY + C+HHV G C + HTP C EC+ Y+V Y KD ++GA +YSV +
Sbjct: 211 GCQPYTVKACDHHVVGKLQPCSKKEEHTPVCKHECESGYNVSYTKDKHYGATAYSVRGVQ 270
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM EI +GPVEGAFTV+ D YKSG +
Sbjct: 271 Q-IMTEIMTNGPVEGAFTVYADFPQYKSGVY----------------------------- 300
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G LGGHAI+I+GWG + + YWL+ANSWN DWG+ G FKILRG+D
Sbjct: 301 --------KHTTGSPLGGHAIKIMGWGTE--GGDDYWLVANSWNPDWGNQGTFKILRGRD 350
Query: 294 ECGIESSITAGVPKL 308
ECGIES I AG PKL
Sbjct: 351 ECGIESQIAAGEPKL 365
Score = 44.7 bits (104), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 24/39 (61%)
Query: 3 TQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
T R CG GCNGGF AW Y+ + G+V+GG Y S Q
Sbjct: 173 TSCCRSCGNGCNGGFLSGAWEYYKRDGLVTGGQYNSHQG 211
>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 86/195 (44%), Positives = 113/195 (57%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PYEI CEHH +G++ C+ S+ TPKC R C+E Y+V Y D + + YS++++E
Sbjct: 176 GCQPYEIPSCEHHTSGSKKPCEGSE-PTPKCKRSCREGYNVSYSDDKHKVSSHYSIANDE 234
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EIY +GPVE AFTV+ D YKSG +
Sbjct: 235 EQIKNEIYLNGPVEAAFTVYSDFPNYKSGVY----------------------------- 265
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
Y +G ALGGHAI+ILGWG + + YWL+ANSWN DWGD G FKILRG +
Sbjct: 266 --------KYTTGNALGGHAIKILGWGVE--NNVPYWLVANSWNPDWGDKGFFKILRGSN 315
Query: 294 ECGIESSITAGVPKL 308
ECGIE+S+ AG+ L
Sbjct: 316 ECGIEASVVAGMVLL 330
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 80 VDEDLPANFDSRTKW-PNCPTIREIRDQGSCGSCW 113
V LP ++D+R KW CP+ EIRDQGSCGSCW
Sbjct: 73 VIATLPDSYDTREKWGSTCPSTTEIRDQGSCGSCW 107
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGG G AW ++ +G V+GG Y S +
Sbjct: 144 CGFGCNGGRLGPAWNFFKYAGAVTGGQYNSSE 175
>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
Length = 335
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 85/194 (43%), Positives = 109/194 (56%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY PCEHH G P+C K TP+CVR+C++ Y+ Y +D ++ K Y++S++E
Sbjct: 181 GCQPYYFPPCEHHTVGPLPNCTGIKP-TPQCVRDCRKGYEKSYSEDKHYAKKVYTLSADE 239
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EI+++GPVE FTV+ D + YKSG +
Sbjct: 240 TQIKTEIFKNGPVEADFTVYADFVSYKSGVY----------------------------- 270
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
S ALGGHAIRILGWG + + YWL+ANSWN DWGD G FKILRG D
Sbjct: 271 --------QRHSDDALGGHAIRILGWGTE--NGVPYWLVANSWNEDWGDKGYFKILRGND 320
Query: 294 ECGIESSITAGVPK 307
ECGIE I AG+PK
Sbjct: 321 ECGIEDDINAGIPK 334
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW ++ GIV+GG YG+
Sbjct: 149 CGAGCNGGYPAAAWEFYKTDGIVTGGLYGT 178
>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
Length = 311
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 83/174 (47%), Positives = 105/174 (60%), Gaps = 40/174 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FK
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFK 311
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175
>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
Length = 340
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 81/190 (42%), Positives = 109/190 (57%), Gaps = 39/190 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY CEHH G P C + TP+C + CQ+ Y PY +D + G SY+V ++E
Sbjct: 186 GCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDE 245
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I KEI ++GPVE +FTV++D + YKSG
Sbjct: 246 KAIQKEIMKYGPVEASFTVYEDFLNYKSG------------------------------- 274
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ + +G+ALGGHAIRI+GWG + K+ YWLIANSWN DWG+NG F+I+RG+D
Sbjct: 275 ------IYKHITGEALGGHAIRIIGWGVENKT--PYWLIANSWNEDWGENGYFRIVRGRD 326
Query: 294 ECGIESSITA 303
EC IES + A
Sbjct: 327 ECFIESEVIA 336
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 19/27 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GG G AW +WVK GIV+G +
Sbjct: 154 CGLGCEGGILGPAWDFWVKEGIVTGSS 180
>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
Length = 342
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/200 (43%), Positives = 107/200 (53%), Gaps = 53/200 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P+C TPKC ++CQ+ Y PYKKD +G SY+V S E
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGEKIYKTPKCQQKCQKGYKTPYKKDKYYGKLSYNVLSKE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI HGPVE A
Sbjct: 247 DAIKKEIMMHGPVEAA-------------------------------------------- 262
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV+ D + YKSG +GGHA+RI+GWG ++K+ YWLIANSWN DWG+ G F
Sbjct: 263 FTVYSDFLNYKSGIYKHMKGTVIGGHAVRIIGWGVEKKTP--YWLIANSWNEDWGEKGYF 320
Query: 287 KILRGKDECGIESSITAGVP 306
+ILRGKD CGIES++TAG+P
Sbjct: 321 RILRGKDVCGIESAVTAGLP 340
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 1/52 (1%)
Query: 63 DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D L R P + + V ++P++FDSR KW C +I IRDQ CG CW
Sbjct: 70 DEELRKKRRP-TVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCGPCWA 120
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 18/27 (66%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG AW YWV+ GIV+G +
Sbjct: 155 CGLGCQGGFPGAAWDYWVEEGIVTGSS 181
>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
Length = 351
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 82/196 (41%), Positives = 108/196 (55%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY PCEHHVNGT C ++ T KC R CQ Y + Y +DL+FG +Y+VS
Sbjct: 195 GCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYTQDLHFGQSAYAVSKK 254
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
I KEI HGPVE AF+V++D Y G +
Sbjct: 255 VTEIQKEIMTHGPVEVAFSVYEDFEHYSGGVY---------------------------- 286
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
++ +G +LGGHA+++LGWG D + YWL ANSWN DWG+NG F+I+RG
Sbjct: 287 ---------VHTAGASLGGHAVKMLGWGVDNGTP--YWLCANSWNEDWGENGYFRIIRGV 335
Query: 293 DECGIESSITAGVPKL 308
+ECGIES + G+PKL
Sbjct: 336 NECGIESGVVGGIPKL 351
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 39/74 (52%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
S+ P K MG R+ E+ +D +P +FDSR +WPNCP+I +IRD
Sbjct: 59 FSSYPDTIKKQLMGAKMIEIPDEYRVFEMTHPEVLDAAIPDSFDSRAQWPNCPSISKIRD 118
Query: 106 QGSCGSCWGCRPYE 119
Q SCGSCW E
Sbjct: 119 QSSCGSCWAVSAAE 132
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 26/36 (72%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
+CG GCNGG+P AWR++VK G V+GG+Y K K
Sbjct: 162 VCGNGCNGGYPIEAWRHYVKKGYVTGGSYQEKTGCK 197
>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 83/192 (43%), Positives = 110/192 (57%), Gaps = 39/192 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEH G P+C TP+C + CQ+ Y PYK+D ++G +SY+V SNE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYKQDKHYGDESYNVISNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I KEI +GPVE AF V++D + YKSG
Sbjct: 247 KAIQKEIMMYGPVEAAFDVYEDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ + +G +GGHAIRI+GWG EK K YWLIANSWN DWG+ GLF+++RG+D
Sbjct: 276 ------IYRHVTGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGEKGLFRMVRGRD 327
Query: 294 ECGIESSITAGV 305
EC IES + AG+
Sbjct: 328 ECSIESHVVAGL 339
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 63 DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D + R P + + +++ ++P+ FDSR KWP+C +I +IRDQ CGSCW
Sbjct: 70 DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120
>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
Length = 332
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 113/210 (53%), Gaps = 54/210 (25%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GC+PYEI PCEHH+NG+RP+C + TP+C + C+ Y+V + KD ++ +
Sbjct: 170 GPYGSHQGCQPYEIKPCEHHINGSRPACGKLEP-TPRCKKSCESGYNVTFAKDKHYAKTA 228
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSVSS + I EI +GPVE A
Sbjct: 229 YSVSSKVQQIQMEIMTNGPVEAA------------------------------------- 251
Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
FTV+ D YKSG LGGHA++++GWG + + YWLIANSWNTD
Sbjct: 252 -------FTVYADFPHYKSGVYQHESGAELGGHAVKMIGWGTEGSTP--YWLIANSWNTD 302
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKLD 309
WG+ G FKILRG+DECGIE I AG PKLD
Sbjct: 303 WGNMGFFKILRGQDECGIERDIVAGEPKLD 332
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 25/34 (73%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
+ CG GCNGGFP AW YW + G+V+GG YGS Q
Sbjct: 143 KSCGNGCNGGFPEAAWEYWKRDGLVTGGPYGSHQ 176
>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
Length = 351
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 82/196 (41%), Positives = 107/196 (54%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY PCEHHVNGT C + T KC R CQ Y + YK+DL+FG +Y+VS
Sbjct: 195 GCKPYPYPPCEHHVNGTHYKPCPSDMYPTDKCERSCQAGYSLTYKQDLHFGQSAYAVSKK 254
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
I KEI +GPVE AFTV+ D +Y G +
Sbjct: 255 ATEIQKEIMTNGPVEVAFTVYADFEVYSGGVY---------------------------- 286
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
++ +G +LGGHA+++LGWG D + YWL ANSWN DWG+NG F+I+RG
Sbjct: 287 ---------VHTAGASLGGHAVKMLGWGVDNGT--PYWLCANSWNEDWGENGYFRIIRGV 335
Query: 293 DECGIESSITAGVPKL 308
+ECGIE + G+PKL
Sbjct: 336 NECGIEHGVVGGIPKL 351
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 25/31 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GCNGG+P AWR++VK+G V+GG+Y K
Sbjct: 163 CGNGCNGGYPIEAWRHYVKNGYVTGGSYQEK 193
>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
Length = 337
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 113/205 (55%), Gaps = 42/205 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVREC--QENYDVPYKKDLNFG 163
GS S +GC+PY IAPC VNG T P C A + TP+C C + +Y V Y+KD ++G
Sbjct: 168 GSYESQYGCKPYSIAPCGQTVNGVTWPKCPAQEEATPECASHCTSKSSYSVAYEKDKHYG 227
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
+Y V E I EI +HGPVE F V+ D YKSG
Sbjct: 228 LSAYPVGRKEAQIQTEILQHGPVEAGFLVYSDFYRYKSG--------------------- 266
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
+ + SG+ LGGHA++ILGWG + +K YWL+ANSWN +WG+
Sbjct: 267 ----------------IYTHVSGQELGGHAVKILGWGVENGTK--YWLVANSWNINWGEK 308
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG++ECGIES++ AG+P L
Sbjct: 309 GYFRILRGRNECGIESAVVAGIPDL 333
>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
Length = 351
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 82/196 (41%), Positives = 106/196 (54%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY PCEHHVNGT C ++ T KC CQ Y + Y +DL+FG +Y+VS
Sbjct: 195 GCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCEHSCQAGYPLTYTQDLHFGQSAYAVSKK 254
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
I KEI HGPVE AFTV++D Y G +
Sbjct: 255 PAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVY---------------------------- 286
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
++ +G +LGGHA+++LGWG D + YWL ANSWN DWG+NG F+I+RG
Sbjct: 287 ---------VHTAGASLGGHAVKMLGWGVDNGTP--YWLCANSWNEDWGENGYFRIIRGV 335
Query: 293 DECGIESSITAGVPKL 308
+ECGIES + G PKL
Sbjct: 336 NECGIESGVVGGTPKL 351
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 26/36 (72%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
+CG GCNGG+P AWR++VK G V+GG+Y K K
Sbjct: 162 VCGNGCNGGYPIEAWRHYVKKGYVTGGSYQEKSGCK 197
>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
Length = 209
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 85/195 (43%), Positives = 111/195 (56%), Gaps = 41/195 (21%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY IA C+HHV G C G TP+C ++C+ Y+V +K D ++G +SYSVSS
Sbjct: 56 GCQPYLIAACDHHVVGKLKPCKGD-GKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSS-V 113
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
IM+E+ GPVE AFTV+ D + Y SG +
Sbjct: 114 NDIMEELVTRGPVEAAFTVYSDFLQYHSGVY----------------------------- 144
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G ALGGHA++ILG+G + + +KYWL+ANSWN DWGD G FKILRG D
Sbjct: 145 --------RHTTGSALGGHAVKILGYGVE--NGDKYWLVANSWNPDWGDQGFFKILRGVD 194
Query: 294 ECGIESSITAGVPKL 308
ECGIE I AG PK+
Sbjct: 195 ECGIEGQIVAGEPKV 209
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGG+P AW + G+V+GG Y SKQ
Sbjct: 24 CGDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQ 55
>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 192
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 86/194 (44%), Positives = 105/194 (54%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY PCEHH G P+C K TP+C + C+E Y Y +D +FG K YS+SS+E
Sbjct: 35 GCQPYYFPPCEHHTVGPLPNCTGIK-PTPECAKTCREGYQKSYTRDKHFGKKVYSISSDE 93
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EIY++GPVE F+V+ D YKSG +
Sbjct: 94 TQIKTEIYKNGPVEADFSVYADFPSYKSGVY----------------------------- 124
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
S + LGGHAIRILGWG ++ YWL+ANSWN DWGD G FKI RG D
Sbjct: 125 --------QRHSEEMLGGHAIRILGWGTEDGV--PYWLVANSWNEDWGDKGYFKIRRGND 174
Query: 294 ECGIESSITAGVPK 307
ECGIE I AG+PK
Sbjct: 175 ECGIEDDINAGIPK 188
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GCNGG+P AW+++ IV+GG YG++
Sbjct: 3 CGSGCNGGYPSAAWQFYKDEDIVTGGLYGTEDG 35
>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
Length = 333
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 81/206 (39%), Positives = 117/206 (56%), Gaps = 40/206 (19%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G+ GS GC+PY IAPCEH ++G+ P+C TPKC ++C++ Y +PY K +
Sbjct: 168 IVSGGNYGSKQGCQPYSIAPCEHSIHGSSPACGGVT-DTPKCKKQCEKGYSIPYDKAFYY 226
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G Y++ ++ + I EI ++GP+ +F V++DL YK
Sbjct: 227 GQPGYAIPNDAQKIQAEILKNGPIVASFLVYEDLFSYK---------------------- 264
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
EG + + +G+ LGGH I+I GWG + + YWL+ANSWNTDWG+
Sbjct: 265 --------EGVYQ-------HVAGEFLGGHVIKIFGWGIENGTP--YWLVANSWNTDWGN 307
Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
NG FKI RGKDECGIE ++AG+P+L
Sbjct: 308 NGFFKIPRGKDECGIEIDVSAGLPRL 333
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/33 (69%), Positives = 23/33 (69%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GC GG P AW YW K GIVSGG YGSKQ
Sbjct: 147 CGDGCLGGSPESAWEYWHKFGIVSGGNYGSKQG 179
>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 329
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 93/303 (30%), Positives = 136/303 (44%), Gaps = 94/303 (31%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSC--------- 109
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ C
Sbjct: 66 GRKEDPNLRQKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 110 ---------------------------GSCW------------------GCRPYEIAPCE 124
G W GCRPY C+
Sbjct: 125 GAISDRICIQSGGKQSYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCD 184
Query: 125 HHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHG 184
H V G +C TP+C + CQ+ Y+ Y++D ++G SY+V S E I K+I HG
Sbjct: 185 HFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHG 244
Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
PVE +++D + YKSG + Y
Sbjct: 245 PVEAYLEIYEDFLNYKSG-------------------------------------IYRYT 267
Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
+G+ + GHA+R++GWG + + YWL AN+WN DWG+ G F+I+RG++EC IES I AG
Sbjct: 268 TGQFISGHAVRLIGWGVENGT--AYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 325
Query: 305 VPK 307
+ K
Sbjct: 326 LIK 328
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GGF G +W YWV GIV+GG+
Sbjct: 142 CGSGCDGGFLGPSWDYWVLRGIVTGGS 168
>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
Length = 217
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 92/209 (44%), Positives = 111/209 (53%), Gaps = 54/209 (25%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G+ S GC PY I PCEHHV G R C+ TPKC + C+ Y+V YKKD +G
Sbjct: 53 GNYNSSQGCSPYVIPPCEHHVPGNRLPCNGDT-KTPKCSKTCENGYNVLYKKDKRYGKHV 111
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y+V E I E++++GPVE A
Sbjct: 112 YAVRGGEDHIKAELFKNGPVEAA------------------------------------- 134
Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
FTV+ DL+ YKSG ALGGHAI+I+GWG + +K YWLIANSWNTD
Sbjct: 135 -------FTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGVENGNK--YWLIANSWNTD 185
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
WG+NG FKILRG+D CGIESSI AG P L
Sbjct: 186 WGNNGFFKILRGEDHCGIESSIVAGEPLL 214
>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
Length = 339
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 90/201 (44%), Positives = 109/201 (54%), Gaps = 54/201 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + C+HHVNGT C TPKCVR C++ Y++ +K D ++G SYSVSSNE
Sbjct: 185 GCMPYPVPSCDHHVNGTLGPC-GQDPPTPKCVRLCRKGYNIDFKDDKHYGKSSYSVSSNE 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EI ++GPV EGA
Sbjct: 244 TQIQMEIMKNGPV--------------------------------------------EGA 259
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV+ D LYKSG ALGGHAIRILGWG + + +WL+ANSWNT+WGD G F
Sbjct: 260 FTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVE--NGVPFWLVANSWNTEWGDKGYF 317
Query: 287 KILRGKDECGIESSITAGVPK 307
KILRG +ECGIE I AG+PK
Sbjct: 318 KILRGSNECGIEEDIVAGIPK 338
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 25/33 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GCNGGFPG AW YWV+ GIV+GG Y + +
Sbjct: 153 CGSGCNGGFPGAAWSYWVEKGIVTGGNYDTDEG 185
>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
Length = 332
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 115/206 (55%), Gaps = 40/206 (19%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G+ GS GC+PY IAPCEH + G+RP+C+ + TPKC ++C++ Y +PY DL +
Sbjct: 167 IVSGGNYGSKQGCQPYSIAPCEHSIPGSRPACEGVR-DTPKCKKQCEKGYGIPYGDDLCY 225
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G Y++ ++ + I EI ++GP+ + V+
Sbjct: 226 GQPGYTIENDAQKIQAEILKNGPIVASILVY----------------------------- 256
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
E F+ + + +G+ LGGH I+ILGWG + + YWL+ANSWNTDWG+
Sbjct: 257 --------EDLFSYKAGVYQHVAGEVLGGHVIKILGWGVENDTP--YWLVANSWNTDWGN 306
Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
NG FKILRG DECGIE I AG+P++
Sbjct: 307 NGFFKILRGSDECGIEDQIVAGIPRV 332
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/32 (68%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG+GC GG AW YW K GIVSGG YGSKQ
Sbjct: 146 CGYGCLGGSAENAWEYWHKFGIVSGGNYGSKQ 177
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
Length = 346
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 83/195 (42%), Positives = 103/195 (52%), Gaps = 38/195 (19%)
Query: 114 GCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY C HH SC+ TP+C + CQ +Y + Y+ D +G SY V+S+
Sbjct: 189 GCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSYYVTSD 248
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
E SIMKEI +GPVE F VFDD + YK+G +
Sbjct: 249 EVSIMKEILLNGPVEATFYVFDDFLNYKTGVY---------------------------- 280
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
Y +G LGGHAIRI+GWG + YWL ANSWN WGD G FKILRG
Sbjct: 281 ---------KYVTGSLLGGHAIRIIGWGVSTLNHTPYWLCANSWNKQWGDKGYFKILRGS 331
Query: 293 DECGIESSITAGVPK 307
+ECGIES +TAG+PK
Sbjct: 332 NECGIESMVTAGLPK 346
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 20/27 (74%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGCNGG PGMAW YW GIV+GG+
Sbjct: 157 CGFGCNGGIPGMAWDYWKDEGIVTGGS 183
>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
Length = 330
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 95/288 (32%), Positives = 130/288 (45%), Gaps = 98/288 (34%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP----------------- 122
V +PA+FDSRT+W C +I+ IR+Q +CGSCW EI
Sbjct: 82 VLASIPASFDSRTQWSECKSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQPIISP 141
Query: 123 --------------CE---------------------HHVNGTRP-------SCDASKGH 140
CE +H G +P S + +
Sbjct: 142 DDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGNCPESK 201
Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
TP C CQ Y Y KD +FGA +Y+V+ + +I EI +GPVE AFTV++D YK
Sbjct: 202 TPACSLSCQSGYSTAYAKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAFTVYEDFYKYK 261
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
SG + + +GKALGGHAI+I+GWG
Sbjct: 262 SGVY-------------------------------------KHTAGKALGGHAIKIIGWG 284
Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+ S YWL+ANSW T+WG++G FKILRG D+CGIE ++ AG ++
Sbjct: 285 TE--SGSPYWLVANSWGTNWGESGFFKILRGDDQCGIEGAVVAGKARV 330
>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
Length = 329
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 97/288 (33%), Positives = 129/288 (44%), Gaps = 98/288 (34%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW-----------------GCRPYEIAP 122
V +PA FDSRT+W C +I+ IRDQ +CGSCW G + I+P
Sbjct: 81 VLASVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISP 140
Query: 123 --------------CE---------------------HHVNGTRP-------SCDASKGH 140
CE +H G +P S + +
Sbjct: 141 DDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGNCPESK 200
Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
TP C CQ Y Y KD +FG +Y+V N SI EIY +GPVE AF+V++D YK
Sbjct: 201 TPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYK 260
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
SG + + +GK LGGHAI+I+GWG
Sbjct: 261 SGVY-------------------------------------KHTAGKYLGGHAIKIIGWG 283
Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+ S YWL+ANSW +WG++G FKI RG D+CGIES++ AG K+
Sbjct: 284 TE--SGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAVVAGKAKV 329
>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
Length = 342
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 79/194 (40%), Positives = 107/194 (55%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P C PKC ++CQ+ Y PY+KD +G SY++ NE
Sbjct: 187 GCQPYPFPKCEHHTKGRYPECGEIIYMKPKCHQKCQKGYKTPYEKDKYYGKVSYNLLKNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI KEI HGPVE +F V D + YKSG
Sbjct: 247 DSIKKEIMMHGPVEASFRVHSDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ + +G +G H +RI+GWG ++++ YWLIANSWN DWG+ G F++LRGKD
Sbjct: 276 ------IYKHMTGIDIGSHVVRIIGWGVEKET--PYWLIANSWNEDWGEKGYFRMLRGKD 327
Query: 294 ECGIESSITAGVPK 307
ECGIES++T+G+P+
Sbjct: 328 ECGIESAVTSGLPR 341
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/27 (66%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GFPG+AW YWV+ GIV+GG+
Sbjct: 155 CGLGCQMGFPGIAWDYWVQEGIVTGGS 181
>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 271
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 85/196 (43%), Positives = 110/196 (56%), Gaps = 39/196 (19%)
Query: 114 GCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY C HH + + P C++ TP+C CQ++Y PYKKD +G SY+V+S
Sbjct: 112 GCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQDDYGKPYKKDKFYGKSSYNVASE 171
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
E SIMKEI +GPVEG F V++D + YKSG +
Sbjct: 172 EISIMKEILLNGPVEGGFYVYEDFLNYKSGVY---------------------------- 203
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ +G LGGHAIRI+GWG +++ YWL ANSWN WGD G FKILRG
Sbjct: 204 ---------KHITGSYLGGHAIRIIGWG-IQQNHIPYWLCANSWNNQWGDQGYFKILRGT 253
Query: 293 DECGIESSITAGVPKL 308
+ECGIES +TAG+P L
Sbjct: 254 NECGIESMVTAGLPNL 269
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/27 (70%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGC GG PGMAW YW GIV+GG+
Sbjct: 80 CGFGCRGGIPGMAWDYWKYEGIVTGGS 106
>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
Length = 342
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/312 (32%), Positives = 144/312 (46%), Gaps = 107/312 (34%)
Query: 63 DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE--- 119
D + R P + + +++ ++P+ FDSR KWP+C +I +IRDQ CGSCW E
Sbjct: 70 DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMT 128
Query: 120 --------------------IAPCEHHVNGTR---------------------------- 131
I+ CE +G +
Sbjct: 129 DRICIQSGGQQSAELSALDLISCCEDCGDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGC 188
Query: 132 -----PSCD-ASKGHTPKC----------VRECQENYDVPYKKDLNFGAKSYSVSSNEKS 175
P C+ +KG P C + CQ+ Y PY++D ++G + Y+V SNEK+
Sbjct: 189 QPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKA 248
Query: 176 IMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
I +EI +GPVE AF V++D + YKSG
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSG--------------------------------- 275
Query: 236 VFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
+ + +G +GGHAIRI+GWG EK K YWLIANSWN DWG+ GLF+++RG+DEC
Sbjct: 276 ----IYRHVTGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGEKGLFRMVRGRDEC 329
Query: 296 GIESSITAGVPK 307
IES + AG+ K
Sbjct: 330 SIESHVVAGLIK 341
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/27 (74%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG AW YWVK GIV+GG+
Sbjct: 155 CGDGCKGGFPGQAWDYWVKRGIVTGGS 181
>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
Length = 331
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/213 (41%), Positives = 115/213 (53%), Gaps = 55/213 (25%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G+ GS GC+PY IAPCEHHV G+RP+C + G TP C +C E + Y +D +
Sbjct: 167 IVSGGNYGSKQGCQPYSIAPCEHHVPGSRPAC-SGGGDTPDCRNQCDEGSGISYDQDHYY 225
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G Y++ K I EI ++GPVE A
Sbjct: 226 GETVYTLDE-AKQIQAEILKNGPVEAA--------------------------------- 251
Query: 223 DNTSQLGAEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
FTV++DL+ YK +G+ALGGHAI+ILGWG + + YWL+ANS
Sbjct: 252 -----------FTVYEDLLNYKEGVYQHVAGEALGGHAIKILGWGVENDTP--YWLVANS 298
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
WNTDWG+NG FKILRG DECGIE I AG+P++
Sbjct: 299 WNTDWGNNGFFKILRGSDECGIEDQIVAGLPRV 331
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 25/32 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG+GC+GGFP AW YW GIVSGG YGSKQ
Sbjct: 146 CGYGCDGGFPASAWDYWQNEGIVSGGNYGSKQ 177
>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
Length = 342
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 85/200 (42%), Positives = 104/200 (52%), Gaps = 53/200 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P C TPKC ++CQ+ Y PY KD +G SY+V +NE
Sbjct: 187 GCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYGKDKYYGRMSYNVLNNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI HGPVE A
Sbjct: 247 NAIKKEIMMHGPVEAA-------------------------------------------- 262
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV D + YKSG +GGHA+RI+GWG ++K+ YWLIANSWN DWG+ G F
Sbjct: 263 FTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTP--YWLIANSWNEDWGEKGYF 320
Query: 287 KILRGKDECGIESSITAGVP 306
+ILRGKDECGIES +T G+P
Sbjct: 321 RILRGKDECGIESEVTGGLP 340
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 18/27 (66%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG AW YWV+ GIV+G +
Sbjct: 155 CGLGCQGGFPGAAWDYWVEDGIVTGSS 181
>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 217
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 85/194 (43%), Positives = 105/194 (54%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY PCEHH G P+C K TP+C + C+E Y+ Y +D +FG K YS+SS+E
Sbjct: 60 GCQPYYFPPCEHHTVGPLPNCTGIK-PTPECAKTCREGYEKSYTRDKHFGKKVYSISSDE 118
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EI ++GPVE F V+ D YKSG +
Sbjct: 119 TQIKTEICKNGPVEADFNVYADFPSYKSGVY----------------------------- 149
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
S + LGGHAIRILGWG ++ YWL+ANSWN DWGD G FKI RG D
Sbjct: 150 --------QRHSKEMLGGHAIRILGWGTEDGV--PYWLVANSWNEDWGDKGYFKIRRGND 199
Query: 294 ECGIESSITAGVPK 307
ECGIE+ I AG+PK
Sbjct: 200 ECGIENDINAGIPK 213
Score = 44.7 bits (104), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GCNGG+P AW+++ GIV+GG YG++
Sbjct: 28 CGSGCNGGYPSAAWQFYKDEGIVTGGLYGTEDG 60
>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
Length = 205
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 111/206 (53%), Gaps = 42/206 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFG 163
GS S +GC+PY IAPC VNG T P C TPKCV C N P Y +D +FG
Sbjct: 35 GSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVEACTSNNTYPTGYLQDKHFG 94
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
A +Y+V + I EI HGP+E AFTV++D Y +G +
Sbjct: 95 ATAYAVGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTTGVY------------------- 135
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
++ +GK+LGGHA++ILGWG D + YWL+ANSWN +WG+
Sbjct: 136 ------------------VHTAGKSLGGHAVKILGWGVDNGTP--YWLVANSWNVNWGEK 175
Query: 284 GLFKILRGKDECGIESSITAGVPKLD 309
G F+I+RG +ECGIE S AG+P LD
Sbjct: 176 GYFRIIRGLNECGIEHSAVAGLPDLD 201
>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
Length = 330
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 130/288 (45%), Gaps = 102/288 (35%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW-----------------GCRPYEIAP-- 122
+ +PA+FDSRT W C +I+ IR+Q +CGSCW G + I+P
Sbjct: 84 DTIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDD 143
Query: 123 ------------CE---------------------HHVNGTRP---------SCDASKGH 140
CE +H G +P SC SK
Sbjct: 144 LLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGSCPESK-- 201
Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
TP C CQ Y Y KD +FG +Y+V+ SI EI +GPVE AFTV++D YK
Sbjct: 202 TPACSLSCQSGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYK 261
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
SG + + +GKALGGHAI+I+GWG
Sbjct: 262 SGVY-------------------------------------KHTAGKALGGHAIKIIGWG 284
Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+ S YWL+ANSW T WG++G FKI RG D+CGIES++ AG ++
Sbjct: 285 TE--SGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESAVVAGKARV 330
>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
Length = 366
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 96/296 (32%), Positives = 131/296 (44%), Gaps = 101/296 (34%)
Query: 75 IGYSEVD---EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------- 114
I +EVD + +PA+FDSRT W C +I+ IRDQ +CGSCW
Sbjct: 110 IRATEVDTVLDTIPASFDSRTHWSECKSIKLIRDQATCGSCWAFGAAEVISDRTCIETKG 169
Query: 115 -----CRPYEIAPC------------------------------EHHVNGTRP------- 132
P ++ C ++H G +P
Sbjct: 170 AQQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCT 229
Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
S + + TP C CQ Y Y KD +FG +Y+V+ SI EI +GPVE AFTV
Sbjct: 230 SGNCPESKTPSCSLSCQSGYTTAYAKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAFTV 289
Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
++D YKSG + + +GKALGGH
Sbjct: 290 YEDFYKYKSGVY-------------------------------------KHTAGKALGGH 312
Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
AI+I+GWG + S YWL+ANSW WG++G F+I RG D+CGIES++ AG K+
Sbjct: 313 AIKIIGWGTE--SGSPYWLVANSWGNSWGESGFFRIFRGDDQCGIESAVVAGKAKV 366
Score = 37.4 bits (85), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG GC GG+P A R+W G+V+GG Y
Sbjct: 188 CGNGCEGGYPIQALRWWDSKGVVTGGDY 215
>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
Length = 341
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 87/213 (40%), Positives = 112/213 (52%), Gaps = 54/213 (25%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G+ S GC+PY + C+HHV+G P+C + +G TP C + C+ Y+ Y D +F
Sbjct: 176 IVTGGNYNSSQGCQPYSLPNCDHHVSGQYPAC-SGEGPTPACKKSCEAGYNNTYSNDKHF 234
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
GA +YSV+ I EI +GPVE
Sbjct: 235 GATAYSVAGEADKIATEIMTNGPVE----------------------------------- 259
Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
GAFTV++DL+ YKSG + LGGHAI+I+GWG + S YW +ANS
Sbjct: 260 ---------GAFTVYEDLLTYKSGVYQHTTGQVLGGHAIKIIGWGVE--SGVDYWWVANS 308
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
WN DWGDNG FKI +G DECGIES I AG+PKL
Sbjct: 309 WNNDWGDNGFFKIKKGVDECGIESQIVAGMPKL 341
Score = 42.0 bits (97), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 26/40 (65%)
Query: 1 MYTQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
+ T + CG GC+GG+P AW ++ +GIV+GG Y S Q
Sbjct: 147 LMTCCLFTCGSGCSGGYPSAAWSWFKTTGIVTGGNYNSSQ 186
>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
Length = 345
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 111/205 (54%), Gaps = 42/205 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFG 163
GS S +GC+PY IAPC VNG T P C TPKCV C N Y PY +D +FG
Sbjct: 176 GSYESQFGCKPYSIAPCGQTVNGVTWPKCPDDTEPTPKCVEACTSNNTYPTPYLQDKHFG 235
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
A +Y+V + I EI ++GPVE AFTV++D Y +G +
Sbjct: 236 ATAYAVGKKVEQIQTEILKNGPVEVAFTVYEDFYQYTTGVY------------------- 276
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
++ SG +LGGHA++ILGWG D + YWL+ANSWN +WG+
Sbjct: 277 ------------------VHTSGASLGGHAVKILGWGVDNGTP--YWLVANSWNVNWGEK 316
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+I+RG +ECGIE S AG+P L
Sbjct: 317 GYFRIIRGLNECGIEHSAVAGIPDL 341
>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
Length = 330
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 130/288 (45%), Gaps = 102/288 (35%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW-----------------GCRPYEIAP-- 122
+ +PA+FDSRT W C +I+ IR+Q +CGSCW G + I+P
Sbjct: 84 DTIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDD 143
Query: 123 ------------CE---------------------HHVNGTRP---------SCDASKGH 140
CE +H G +P SC SK
Sbjct: 144 LLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGSCPESK-- 201
Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
TP C CQ Y Y KD +FG +Y+V+ SI EI +GPVE AFTV++D YK
Sbjct: 202 TPACSLSCQPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYK 261
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
SG + + +GKALGGHAI+I+GWG
Sbjct: 262 SGVY-------------------------------------KHTAGKALGGHAIKIIGWG 284
Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+ S YWL+ANSW T WG++G FKI RG D+CGIES++ AG ++
Sbjct: 285 TE--SGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESAVVAGKARV 330
>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
Length = 335
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 98/301 (32%), Positives = 131/301 (43%), Gaps = 112/301 (37%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG+GC GG+P AW+Y VKSG +GG+Y S+ G P P
Sbjct: 146 CGYGCEGGYPINAWKYLVKSGFCTGGSYVSQ------------------FGCKPYSLAPC 187
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
E +G T WP+CP
Sbjct: 188 G---ETVG--------------NTTWPDCP------------------------------ 200
Query: 129 GTRPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
+TP CV +C NY++ YK D +FG+ +Y+V I EI HGPVE
Sbjct: 201 -------QDGYNTPSCVNKCTNNNYNIAYKDDKHFGSTAYAVGKKVAQIQAEILAHGPVE 253
Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
AFTV++D YKSG + ++ +G+
Sbjct: 254 AAFTVYEDFYQYKSGVY-------------------------------------VHTTGQ 276
Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
LGGHAIRILGWG D + YWL+ANSWN +WG+NG F+I+RG +ECGIE ++ GVPK
Sbjct: 277 ELGGHAIRILGWGTDNGT--PYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334
Query: 308 L 308
+
Sbjct: 335 V 335
>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
Length = 335
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 99/301 (32%), Positives = 130/301 (43%), Gaps = 112/301 (37%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG+GC GG+P AW+Y VKSG +GG+Y ++ G P P
Sbjct: 146 CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQ------------------FGCKPYSLAPC 187
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
E +G T WP CPT
Sbjct: 188 G---ETVG--------------NTTWPACPT----------------------------- 201
Query: 129 GTRPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
TP CV +C NY+V YK D +FG+ +Y+V I EI HGPVE
Sbjct: 202 --------DGYDTPACVNKCTNSNYNVAYKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVE 253
Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
AFTV++D YKSG + ++ +G+
Sbjct: 254 AAFTVYEDFYQYKSGVY-------------------------------------VHTTGE 276
Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
LGGHAIRILGWG D + YWL+ANSWN +WG+NG F+I+RG +ECGIE ++ GVPK
Sbjct: 277 ELGGHAIRILGWGTDNGT--PYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334
Query: 308 L 308
+
Sbjct: 335 V 335
>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 106/195 (54%), Gaps = 42/195 (21%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY++ C+HHV G C + TP C CQ N + D +FGA SYSV +++
Sbjct: 314 GCYPYQLQACDHHVTGKYQPCGDIQ-PTPACANSCQNN--ATWSSDKHFGASSYSVGTDQ 370
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+SIM EIY +GPVE ++ V+ D + YKSG +
Sbjct: 371 QSIMTEIYTNGPVEASYDVYADFVSYKSGVY----------------------------- 401
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G LGGHA++I+GWG D + YW++ANSWN DWG+NG F ILRG D
Sbjct: 402 --------QHVTGDYLGGHAVKIIGWGVDGST--PYWIVANSWNNDWGNNGFFNILRGSD 451
Query: 294 ECGIESSITAGVPKL 308
ECGIE I AG+PK+
Sbjct: 452 ECGIEDGIVAGIPKV 466
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG+P AW Y+ +G+V+GG + S Q
Sbjct: 282 CGMGCEGGYPSAAWDYFQSTGLVTGGDWNSNQ 313
>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
Length = 344
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 112/206 (54%), Gaps = 42/206 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFG 163
GS S +GC+PY IAPC VNG T P C TPKCV C N+ P Y +D +FG
Sbjct: 175 GSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFG 234
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
A +Y+V + I EI ++GP+E AFTV++D Y +G +
Sbjct: 235 ATAYAVGKKVEQIQTEILKNGPIEVAFTVYEDFYQYTTGVY------------------- 275
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
++ +G +LGGHA++ILGWG D + YWL+ANSWN +WG+
Sbjct: 276 ------------------VHTAGASLGGHAVKILGWGVDNGTP--YWLVANSWNINWGEK 315
Query: 284 GLFKILRGKDECGIESSITAGVPKLD 309
G F+I+RG +ECGIE S AG+P LD
Sbjct: 316 GYFRIIRGLNECGIEHSAVAGIPDLD 341
>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
Length = 344
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 112/206 (54%), Gaps = 42/206 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFG 163
GS S +GC+PY IAPC VNG T P C TPKCV C N+ P Y +D +FG
Sbjct: 175 GSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFG 234
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
A +Y+V + I EI ++GP+E AFTV++D Y +G +
Sbjct: 235 ATAYAVGKKVEQIQTEILKNGPIEVAFTVYEDFYQYTTGVY------------------- 275
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
++ +G +LGGHA++ILGWG D + YWL+ANSWN +WG+
Sbjct: 276 ------------------VHTAGASLGGHAVKILGWGVDNGTP--YWLVANSWNINWGEK 315
Query: 284 GLFKILRGKDECGIESSITAGVPKLD 309
G F+I+RG +ECGIE S AG+P LD
Sbjct: 316 GYFRIIRGLNECGIEHSAVAGIPDLD 341
>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 111/209 (53%), Gaps = 40/209 (19%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
R I G G+ GC+PY +APCE+H P+C HTP+CV C++ YD Y++D
Sbjct: 169 RGIVSGGLYGTPDGCKPYSLAPCEYHTKCRIPNC-IPIVHTPECVHHCRKGYDKDYQEDK 227
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+FG K YS+S +EK I EI+ +GPVE F V+ D + YKSG + N+ M
Sbjct: 228 HFGQKVYSISRDEKQIQTEIFTNGPVEADFHVYGDFLCYKSGVYQRHSNDGRGM------ 281
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
HAIRILGWG + + YWL ANSWN +W
Sbjct: 282 -------------------------------HAIRILGWGTENGT--PYWLAANSWNENW 308
Query: 281 GDNGLFKILRGKDECGIESSITAGVPKLD 309
GD G FKILR +ECGIE I AG+PK++
Sbjct: 309 GDKGYFKILRRTNECGIEEHIYAGIPKIE 337
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 45/75 (60%), Gaps = 3/75 (4%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+A N I ++++ +GVHP RL E + + E+ +DLP +FD+R KW +C +
Sbjct: 44 KAGSNFDKCISMSYIRGLLGVHPKSE--EYRLAEFV-HEEIPDDLPESFDARAKWSHCDS 100
Query: 100 IREIRDQGSCGSCWG 114
I IRDQ +CGSCW
Sbjct: 101 IHLIRDQSTCGSCWA 115
>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sm31; Flags: Precursor
gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
Length = 340
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 81/198 (40%), Positives = 106/198 (53%), Gaps = 53/198 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY CEHH G P C + +TP+C + CQ Y PY +D + G SY+V ++E
Sbjct: 186 GCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDE 245
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I KEI ++GPVE +
Sbjct: 246 KAIQKEIMKYGPVEAS-------------------------------------------- 261
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D + YKSG +ALGGHAIRI+GWG + K+ YWLIANSWN DWG+NG F
Sbjct: 262 FTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTP--YWLIANSWNEDWGENGYF 319
Query: 287 KILRGKDECGIESSITAG 304
+I+RG+DEC IES + AG
Sbjct: 320 RIVRGRDECSIESEVIAG 337
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 1/49 (2%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+L R P + +++ + ++P+NFDSR KWP C +I IRDQ CGSCW
Sbjct: 71 DLRRKRRP-TVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCW 118
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GG G AW YWVK GIV+ +
Sbjct: 154 CGLGCEGGILGPAWDYWVKEGIVTASS 180
>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
Length = 337
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 82/205 (40%), Positives = 113/205 (55%), Gaps = 42/205 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVREC--QENYDVPYKKDLNFG 163
GS S +GC+PY IAPC VNG T P C A + TP+CV++C + +Y VPY +D ++G
Sbjct: 168 GSYESQYGCKPYSIAPCGQTVNGVTWPKCAADEVATPECVKQCTSKSDYAVPYDQDKHYG 227
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
+ +Y++ N I EI +GPVE F V+ D YKSG +
Sbjct: 228 SSAYAIRQNVAQIQTEIMRNGPVEVGFLVYSDFYQYKSGIY------------------- 268
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
+ +G+ LGGHA++ILGWG + + YWL ANSWN +WG+
Sbjct: 269 ------------------KHVAGRELGGHAVKILGWGVENGT--PYWLAANSWNVNWGEK 308
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+I RG +ECGIESS+ AG+P L
Sbjct: 309 GYFRIRRGTNECGIESSVVAGIPDL 333
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 25/31 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GG+P AWRYWV +G+V+GG+Y S+
Sbjct: 143 CGDGCEGGYPIQAWRYWVHNGLVTGGSYESQ 173
>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 517
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 92/275 (33%), Positives = 121/275 (44%), Gaps = 87/275 (31%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN------------- 128
+ LP +FDSR KWP C IR IRDQ +CGSCW + H +
Sbjct: 278 KKLPKHFDSREKWPECEWIRFIRDQSNCGSCWAVSAASVMTDRHCIASKGQETPYISDEQ 337
Query: 129 -------------------------GTRPSCD----------ASKGHTPKCVRECQENYD 153
G + C + TP C +CQ +YD
Sbjct: 338 ILACGMIPSPFNYWKKMGIATGGPYGDKSCCQPYSIAPCSKCSYTASTPSCKYDCQADYD 397
Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
+P D + ++ Y VSSN+ IM EIY HGPV F V++D Y SG + +TT
Sbjct: 398 IPISDDKFYASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYEDFTYYISGIY----QQTTY 453
Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
+ A+GGHAIRI+GWGE+ + YWLIA
Sbjct: 454 V---------------------------------AMGGHAIRIIGWGEE--NGIPYWLIA 478
Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
NSWNT +G+ G F+I RG +EC IES + G+PKL
Sbjct: 479 NSWNTTFGEKGFFRIRRGTNECRIESEVYTGIPKL 513
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 50/91 (54%), Gaps = 9/91 (9%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
C PY I+PC RP A PKC R CQ +Y++ K+D +G Y V+ +E
Sbjct: 99 CLPYSISPCTM----CRPYMLA-----PKCQRTCQASYNLSLKRDKYYGKSHYYVNQDEF 149
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFF 205
IM+EIY+ GPV F V+ D + Y SG+F
Sbjct: 150 DIMQEIYQRGPVVAGFKVYHDFLYYISGQFI 180
>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 333
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 87/201 (43%), Positives = 108/201 (53%), Gaps = 54/201 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PYEI CEHHV G +C + TPKC ++CQ Y+ + +D +FG KSYS+++N
Sbjct: 179 GCQPYEIPKCEHHVKGPFKAC-GKELPTPKCSQKCQPGYNKTFNQDKHFGKKSYSITNNI 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I KEI +GPVE A
Sbjct: 238 QQIQKEIMMNGPVEA--------------------------------------------A 253
Query: 234 FTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV+ D YKSG LGGHA++ILGWG + + YWLIANSWN WGD G F
Sbjct: 254 FTVYADFPSYKSGVYQHTTGGPLGGHAVKILGWGTENNTP--YWLIANSWNPTWGDKGYF 311
Query: 287 KILRGKDECGIESSITAGVPK 307
KI+RGKDECGIESSI AG+PK
Sbjct: 312 KIIRGKDECGIESSIVAGMPK 332
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGGF AW YWV +GIV+GG Y S +
Sbjct: 147 CGMGCNGGFLPQAWHYWVNNGIVTGGQYHSHK 178
>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
Length = 309
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 83/205 (40%), Positives = 111/205 (54%), Gaps = 39/205 (19%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I GS GS GC+PY + PCEHH G R +C G TP C R CQ +Y + Y+ DL+F
Sbjct: 137 IVSGGSYGSKEGCQPYHLPPCEHHRAGPRRNC-TKYGPTPSCARVCQPDYKISYEDDLHF 195
Query: 163 GAKSYSVS-SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
G + Y+++ NEK I EI+ +GPVE ++D Y+SG +
Sbjct: 196 GKQWYALAPHNEKIIRTEIFHNGPVEATMAAYEDFYTYESGIYH---------------- 239
Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
EG F HA++I+GWG D+K+ YWL+ANS+NTDWG
Sbjct: 240 -------HIEGTFVC--------------DHAVKIIGWGTDKKTNTPYWLVANSFNTDWG 278
Query: 282 DNGLFKILRGKDECGIESSITAGVP 306
+ G FKI RG +ECGIE+ ITAG+P
Sbjct: 279 EYGFFKIKRGVNECGIENKITAGIP 303
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/48 (60%), Positives = 34/48 (70%), Gaps = 3/48 (6%)
Query: 66 LPANRLPEL---IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCG 110
L N P+L S++ E+LP FDSR +WPNCPTIREIRDQGSCG
Sbjct: 30 LKPNVTPDLEPPFVVSKISENLPDEFDSRVRWPNCPTIREIRDQGSCG 77
Score = 41.2 bits (95), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 19/32 (59%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
C GC G +AW +WVK GIVSGG+YGSK+
Sbjct: 116 CEKGCLGCDHHLAWDHWVKHGIVSGGSYGSKE 147
>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
Length = 342
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 108/199 (54%), Gaps = 53/199 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P+C TP+C ++CQ+ Y PY++D N+G + Y+V SNE
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKNYGDQRYNVISNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +EI +GPVE A
Sbjct: 247 KAIQREIMMYGPVEAA-------------------------------------------- 262
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D + YKSG +GGHAIRI+GWG EK K YWLIANSWN DWG+NGLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGENGLF 320
Query: 287 KILRGKDECGIESSITAGV 305
+++RG+DEC IES + AG+
Sbjct: 321 RMVRGRDECSIESHVVAGL 339
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/27 (74%), Positives = 23/27 (85%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG+AW YWVK GIV+GG+
Sbjct: 155 CGDGCQGGFPGVAWDYWVKRGIVTGGS 181
>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 340
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 81/198 (40%), Positives = 105/198 (53%), Gaps = 53/198 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY CEHH G P C + TP+C + CQ+ Y PY +D + G SY+V ++E
Sbjct: 186 GCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDE 245
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I KEI ++GPVE
Sbjct: 246 KAIQKEIMKYGPVEAG-------------------------------------------- 261
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D + YKSG + LGGHAIRI+GWG + K+ YWLIANSWN DWG+NG F
Sbjct: 262 FTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKTP--YWLIANSWNEDWGENGYF 319
Query: 287 KILRGKDECGIESSITAG 304
+I+RG+DEC IES +TAG
Sbjct: 320 RIVRGRDECSIESEVTAG 337
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 19/27 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GG G AW YWVK GIV+G +
Sbjct: 154 CGLGCEGGILGPAWDYWVKEGIVTGSS 180
>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 354
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 101/297 (34%), Positives = 137/297 (46%), Gaps = 66/297 (22%)
Query: 18 PGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGY 77
PG AW Y+ +GIV+GG + S Q + H+ G P
Sbjct: 117 PGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGP------CQGEGPTPECK 170
Query: 78 SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDAS 137
+ + P + K I G S GC+PY+I C+HHVNGT+ C
Sbjct: 171 HKCNGGFPGSAWEYYK------DTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQG- 223
Query: 138 KGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
+G TP+C +C+ +Y PY++D ++ S+S+N ++ EI +GPVE
Sbjct: 224 EGPTPECKHKCEASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEAD-------- 275
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALG 250
FTV++D YKSG LG
Sbjct: 276 ------------------------------------FTVYEDFPTYKSGVYQHTTGGVLG 299
Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
GHAI+ILGWG +E +K YWL+ANSWN +WGDNG FKILRG +ECGIES I G+PK
Sbjct: 300 GHAIKILGWGVEEGTK--YWLVANSWNNEWGDNGFFKILRGSNECGIESDINFGIPK 354
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 22/33 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
C CNGGFPG AW Y+ +GIV+GG + S Q
Sbjct: 169 CKHKCNGGFPGSAWEYYKDTGIVTGGQWNSSQG 201
>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
Length = 342
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 78/194 (40%), Positives = 104/194 (53%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEH+ G P+C TPKC ++CQ+ Y PYKKD ++G +Y+V +NE
Sbjct: 187 GCQPYPFPKCEHNTTGKYPACGQKIYETPKCQKKCQKGYKTPYKKDKHYGKVAYNVPNNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI KEI HGPV FTV+ D + YKSG
Sbjct: 247 DSIKKEIMMHGPVGSFFTVYSDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ + G +G H +RI+GWG ++ + YWLIANSWN WG+ G F+ILRGKD
Sbjct: 276 ------IYKHMKGTEIGVHTVRIVGWGVEKGT--PYWLIANSWNEGWGEKGYFRILRGKD 327
Query: 294 ECGIESSITAGVPK 307
EC IES + G+P+
Sbjct: 328 ECDIESLVIGGLPR 341
>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With Ca074 Inhibitor
gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11017 Inhibitor
gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
Length = 254
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 81/198 (40%), Positives = 105/198 (53%), Gaps = 53/198 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY CEHH G P C + TP+C + CQ+ Y PY +D + G SY+V ++E
Sbjct: 100 GCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDE 159
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I KEI ++GPVE
Sbjct: 160 KAIQKEIMKYGPVEAG-------------------------------------------- 175
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D + YKSG + LGGHAIRI+GWG + K+ YWLIANSWN DWG+NG F
Sbjct: 176 FTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKAP--YWLIANSWNEDWGENGYF 233
Query: 287 KILRGKDECGIESSITAG 304
+I+RG+DEC IES +TAG
Sbjct: 234 RIVRGRDECSIESEVTAG 251
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 19/27 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GG G AW YWVK GIV+G +
Sbjct: 68 CGLGCEGGILGPAWDYWVKEGIVTGSS 94
>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 303
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 126/289 (43%), Gaps = 108/289 (37%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC------------ 112
+L R P + +++ + ++P++FDSR KWP C +I IRDQ CGSC
Sbjct: 71 DLRRTRRP-TVDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGSCCAFGAVEAMSER 129
Query: 113 ------------------------------WGCRPYEIAPCEHHVNGTRPSCDASKGHTP 142
GC PY CEH G P C + TP
Sbjct: 130 SCIQSGGKQNVELSAVDLEGIVTGSSKENNTGCEPYPFPKCEHFTKGQYPPCGSKIYKTP 189
Query: 143 KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
+C CQ+ Y Y +D ++I KEI ++GPVE +
Sbjct: 190 RCKTTCQKRYKTSYAQD------------KHRAIQKEIMKYGPVEAS------------- 224
Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIR 255
FTV++D + YKSG + LGGHAIR
Sbjct: 225 -------------------------------FTVYEDFLNYKSGIYKHITGETLGGHAIR 253
Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
I+GWG + K+ YWLIANSWN DWG+NG F+I+RG+DEC IES +TAG
Sbjct: 254 IIGWGVENKTP--YWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 300
>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
Length = 337
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 85/196 (43%), Positives = 107/196 (54%), Gaps = 41/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PYEI C+HHV G C G TP+C +EC+ Y+ Y KD + ++V E
Sbjct: 183 GCLPYEIKACDHHVVGKLQPCKGD-GPTPRCKKECESGYNNTYSKDEHHAKTVHAVEGVE 241
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM EI +GPVE AFTV+ D YKSG +
Sbjct: 242 Q-IMTEIMTNGPVEAAFTVYSDFPTYKSGVY----------------------------- 271
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+KSG LGGHAI+ LGWG ++ + YWL+ANSWN DWGDNG FKILRG+D
Sbjct: 272 --------EHKSGGPLGGHAIKTLGWGNED--GKDYWLVANSWNPDWGDNGFFKILRGRD 321
Query: 294 ECGIESSITAGVPKLD 309
ECGIES+I AG+ L+
Sbjct: 322 ECGIESNIVAGMMVLE 337
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 49/79 (62%), Gaps = 8/79 (10%)
Query: 40 QAEKNSLSNIPRA----HLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWP 95
+A + N+P ++KS G +P P + P + EV +DLP FD+RT+WP
Sbjct: 42 KATTENFKNVPYKGRMDYVKSLCGANP--APPEMKFP--VKEIEVPKDLPDTFDARTQWP 97
Query: 96 NCPTIREIRDQGSCGSCWG 114
+CP+++E+RDQG+CGSCW
Sbjct: 98 DCPSLKEVRDQGACGSCWA 116
Score = 44.3 bits (103), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 23/38 (60%)
Query: 3 TQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
T R CG GCNGGF AW Y + GIV+GG Y S Q
Sbjct: 145 TSCCRTCGNGCNGGFLEGAWNYLKRDGIVTGGPYNSHQ 182
>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 345
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 81/198 (40%), Positives = 106/198 (53%), Gaps = 53/198 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY CEHH G P C + TP+C + CQ+ Y PY +D + G SY+V ++E
Sbjct: 191 GCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDE 250
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I KEI ++GPVE +
Sbjct: 251 KAIQKEIMKYGPVEAS-------------------------------------------- 266
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D + YKSG +ALGGHAIRI+GWG + K+ YWLIANSWN DWG+NG F
Sbjct: 267 FTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTP--YWLIANSWNEDWGENGYF 324
Query: 287 KILRGKDECGIESSITAG 304
+I+RG+DEC IES + AG
Sbjct: 325 RIVRGRDECFIESEVIAG 342
Score = 40.8 bits (94), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 19/27 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GG G AW +WVK GIV+G +
Sbjct: 159 CGLGCEGGILGPAWDFWVKEGIVTGSS 185
>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
Full=Cysteine protease-related 5; Flags: Precursor
gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
Length = 344
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 111/205 (54%), Gaps = 42/205 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVREC--QENYDVPYKKDLNFG 163
GS + +GC+PY IAPC VNG + P+C TPKCV C + NY PY +D +FG
Sbjct: 175 GSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFG 234
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
+ +Y+V + I EI +GP+E AFTV++D Y +G +
Sbjct: 235 STAYAVGKKVEQIQTEILTNGPIEVAFTVYEDFYQYTTGVY------------------- 275
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
++ +G +LGGHA++ILGWG D + YWL+ANSWN WG+
Sbjct: 276 ------------------VHTAGASLGGHAVKILGWGVDNGTP--YWLVANSWNVAWGEK 315
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+I+RG +ECGIE S AG+P L
Sbjct: 316 GYFRIIRGLNECGIEHSAVAGIPDL 340
>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
Length = 335
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 131/301 (43%), Gaps = 112/301 (37%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG+GC+GG+P AW+Y VKSG +GG+Y ++ G P P
Sbjct: 146 CGYGCDGGYPINAWKYLVKSGFCTGGSYEAQ------------------FGCKPYSLAPC 187
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
E +G WP+CP D G
Sbjct: 188 G---ETVG--------------NVTWPDCP------DDGY-------------------- 204
Query: 129 GTRPSCDASKGHTPKCVRECQEN-YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
+TP CV +C Y+ YK D +FG+ +Y+V I EI HGPVE
Sbjct: 205 -----------NTPACVNKCTNTKYNTAYKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVE 253
Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
AFTV++D YKSG + ++ +G+
Sbjct: 254 AAFTVYEDFYQYKSGVY-------------------------------------VHTTGQ 276
Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
LGGHAIRILGWG D + YWL+ANSWN +WG+NG F+I+RG +ECGIE ++ GVPK
Sbjct: 277 ELGGHAIRILGWGTDNGT--PYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334
Query: 308 L 308
+
Sbjct: 335 V 335
>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 338
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 79/193 (40%), Positives = 110/193 (56%), Gaps = 40/193 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSSNE 173
C+PY PC+HHV G P C K TPKCV++C Y + Y++DL+ +K Y + +N
Sbjct: 185 CKPYVFPPCDHHVVGQYPPCGPIKP-TPKCVKQCNSQYTEKTYQQDLHHPSKVYQLPNNA 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++I +EI HGPV+ +F V D + YKSG + IRD
Sbjct: 244 EAIQREIMAHGPVQASFRVASDFLTYKSGVY----------------IRD---------- 277
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
K GGH+++I+GWG ++ + YWLIANSWN DWG+NGLFK+LRGK+
Sbjct: 278 ----------PKLKYEGGHSVKIIGWGVEQGT--PYWLIANSWNEDWGENGLFKMLRGKN 325
Query: 294 ECGIESSITAGVP 306
ECGIE+ + AG+P
Sbjct: 326 ECGIEAEVVAGLP 338
Score = 40.8 bits (94), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 20/29 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
CG GC GG+P AW+Y +G+ +GG YG
Sbjct: 152 CGNGCQGGYPSAAWKYMKATGVSTGGLYG 180
>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
Length = 252
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 82/180 (45%), Positives = 106/180 (58%), Gaps = 39/180 (21%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
+ I G GS GC PYEIAPCEHHVNGTR C G TPKCV++C++ Y VPY++DL
Sbjct: 112 KGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPKCVKKCEDGYKVPYEQDL 170
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+ G +YS+S++ I +EIY +GPVEGAFTV++D I Y++G +
Sbjct: 171 HRGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 214
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
+ +GKALGGHAIRILGWG + + YWL+ANSWNTDW
Sbjct: 215 ---------------------KHVAGKALGGHAIRILGWGV-QNGEIPYWLVANSWNTDW 252
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/30 (80%), Positives = 24/30 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGCNGGFPG AW YW GIVSGG YGS
Sbjct: 93 CGFGCNGGFPGAAWHYWKTKGIVSGGPYGS 122
>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
Length = 279
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/201 (38%), Positives = 109/201 (54%), Gaps = 53/201 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P+C TP+C + CQ+ Y PY++D ++G +SY+V +NE
Sbjct: 124 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGEESYNVQNNE 183
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I ++I +GPVE A
Sbjct: 184 KVIQRDIMMYGPVEAA-------------------------------------------- 199
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D + YKSG +GGHAIRI+GWG ++++ YWLIANSWN DWG+ GLF
Sbjct: 200 FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTP--YWLIANSWNEDWGEKGLF 257
Query: 287 KILRGKDECGIESSITAGVPK 307
+I+RG+DEC IES++ AG+ K
Sbjct: 258 RIVRGRDECSIESNVVAGLIK 278
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/27 (74%), Positives = 23/27 (85%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG+AW YWVK GIV+GG+
Sbjct: 92 CGQGCQGGFPGVAWDYWVKRGIVTGGS 118
>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sj31; Flags: Precursor
gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
Length = 342
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 77/201 (38%), Positives = 108/201 (53%), Gaps = 53/201 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P+C TP+C + CQ+ Y PY++D ++G +SY+V +NE
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I ++I +GPVE A
Sbjct: 247 KVIQRDIMMYGPVEAA-------------------------------------------- 262
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D + YKSG +GGHAIRI+GWG ++++ YWLIANSWN DWG+ GLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTP--YWLIANSWNEDWGEKGLF 320
Query: 287 KILRGKDECGIESSITAGVPK 307
+++RG+DEC IES + AG+ K
Sbjct: 321 RMVRGRDECSIESDVVAGLIK 341
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/27 (74%), Positives = 23/27 (85%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG+AW YWVK GIV+GG+
Sbjct: 155 CGDGCQGGFPGVAWDYWVKRGIVTGGS 181
>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
Length = 181
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 83/214 (38%), Positives = 113/214 (52%), Gaps = 53/214 (24%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
R I GS + GC+PY CEH G P+C TP+C ++CQ+ Y PY++D
Sbjct: 13 RGIVTGGSKENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDK 72
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
N+G + Y+V SN K+I KEI +GPVE A
Sbjct: 73 NYGDQRYNVISNAKAIQKEIMMNGPVEAA------------------------------- 101
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIA 273
F V++D + YKSG +GGHAIRI+GWG ++++ YWLIA
Sbjct: 102 -------------FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTP--YWLIA 146
Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
NSWN DWG+ GLF+I+RG+DEC IES++ AG+ K
Sbjct: 147 NSWNEDWGEKGLFRIVRGRDECSIESNVVAGLIK 180
>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
Length = 351
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 82/196 (41%), Positives = 108/196 (55%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY PCEHHVNGT C ++ T KC R CQ Y + Y++DL+FG +Y+VS
Sbjct: 195 GCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYQQDLHFGQSAYAVSKK 254
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
I KEI HGPVE AFTV++D Y G +
Sbjct: 255 AAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVY---------------------------- 286
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
++ +G +LGGHA+++LGWG D + YWL ANSWN DWG+NG F+I+RG
Sbjct: 287 ---------VHTAGASLGGHAVKMLGWGVDNGT--PYWLCANSWNEDWGENGYFRIIRGV 335
Query: 293 DECGIESSITAGVPKL 308
+ECGIE + G+PKL
Sbjct: 336 NECGIEGGVVGGIPKL 351
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 47/88 (53%), Gaps = 12/88 (13%)
Query: 44 NSLSNIPRAHLKSWMGVHPDY---NLPANRLPEL--------IGYSEV-DEDLPANFDSR 91
N + +A L S+ +PD L ++ E+ + + EV D +P +FDSR
Sbjct: 45 NKVQTSFKAELGSYFSSYPDTIKKQLMGAKMVEIPEEYRVFEMTHPEVEDAAVPDSFDSR 104
Query: 92 TKWPNCPTIREIRDQGSCGSCWGCRPYE 119
T WPNCP+I +IRDQ SCGSCW E
Sbjct: 105 TAWPNCPSISKIRDQSSCGSCWAVSAAE 132
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 19/32 (59%), Positives = 25/32 (78%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
+CG GCNGG+P AWR++VK G V+GG+Y K
Sbjct: 162 VCGNGCNGGYPIEAWRHYVKKGYVTGGSYQDK 193
>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
Length = 339
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 79/194 (40%), Positives = 106/194 (54%), Gaps = 40/194 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPY PCEHHV G R C TP+CV++CQ Y Y+ D +G K+YS+ S+++
Sbjct: 186 CRPYSFPPCEHHVVGPRKPCTGDPT-TPQCVKKCQPEYPKTYENDKWYGLKAYSIHSDQE 244
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+IM+++ +GP+E F V+ D Y SG +
Sbjct: 245 AIMRDLMTYGPLEVDFEVYADFPSYSSGVY------------------------------ 274
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ +G LGGHA+R++GWG ++ + YWLIANSWNTDWGD G FKI RG +E
Sbjct: 275 -------RHVAGGLLGGHAVRLVGWGVEDGAD--YWLIANSWNTDWGDGGYFKIRRGVNE 325
Query: 295 CGIESSITAGVPKL 308
CGIES AG PKL
Sbjct: 326 CGIESDANAGHPKL 339
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/34 (58%), Positives = 27/34 (79%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
+++LP +FD+R KWP C +I EIRDQ +CGSCW
Sbjct: 85 EQELPESFDAREKWPYCSSIAEIRDQSNCGSCWA 118
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GC GG+P AW YWV++G+V+G Y +
Sbjct: 153 CGMGCQGGYPAQAWEYWVRNGLVTGDLYNT 182
>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 102/324 (31%), Positives = 145/324 (44%), Gaps = 96/324 (29%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G K A LSN + K +GV P +P L + + E LP FD+R WP
Sbjct: 55 GWKAAFNPQLSNFTVSQFKRLLGVKPAREGDLEGIPVLT-HPRLKE-LPKEFDARKAWPQ 112
Query: 97 CPTIREIRDQGSCGSCWG-----------CRPYEIA------------------------ 121
C TI +I DQG CGSCW C Y ++
Sbjct: 113 CSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGY 172
Query: 122 ----------------PCEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
C+ + + T P C+ TPKC R+C + +V ++K ++
Sbjct: 173 PIAAWRYFKRSGVVTEECDPYFDTTGCSHPGCEPLYP-TPKCHRKCVKG-NVLWRKSKHY 230
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G +Y VS + +SIM E+Y++GPVE +FTV++D YKSG +
Sbjct: 231 GVNAYRVSHDPQSIMAEVYKNGPVEVSFTVYEDFAHYKSGVY------------------ 272
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
+ +G +GGHA++++GWG E+ E YWLI NSWN WG+
Sbjct: 273 -------------------KHVTGGNMGGHAVKLIGWGTSEQG-EDYWLIVNSWNRGWGE 312
Query: 283 NGLFKILRGKDECGIESSITAGVP 306
+G FKI RG +ECGIE S+ AG+P
Sbjct: 313 DGYFKIRRGTNECGIEHSVVAGLP 336
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GC+GG+P AWRY+ +SG+V+
Sbjct: 163 LCGSGCDGGYPIAAWRYFKRSGVVT 187
>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
Length = 344
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 78/190 (41%), Positives = 98/190 (51%), Gaps = 39/190 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPYEI PC HH N T TP CV CQ Y + Y D FG SY++ S+
Sbjct: 192 CRPYEIPPCGHHRNETFYGNCTQIADTPDCVTTCQAGYPISYDDDKTFGKDSYTIESSVT 251
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I KEI +GPV AF V++D Y G
Sbjct: 252 AIQKEIMTYGPVTAAFIVYEDFFHYHRG-------------------------------- 279
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ + SG GGHA+RILGWGE++ + YWL+ANSWNTDWG+NG F+ILRG +E
Sbjct: 280 -----IYKHVSGGEEGGHAVRILGWGEEKGTA--YWLVANSWNTDWGENGYFRILRGSNE 332
Query: 295 CGIESSITAG 304
CGIE ++ AG
Sbjct: 333 CGIEENVVAG 342
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 27/33 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GC+GG+P AW Y+V++G+V+GG YG+K +
Sbjct: 159 CGDGCDGGYPISAWEYFVETGVVTGGLYGTKDS 191
>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 147/345 (42%), Gaps = 122/345 (35%)
Query: 32 SGGAYGSKQAEKNSLSNIPRAHLKSWMGV--HPDYNLPANRLP--ELIGYSEVDEDLPAN 87
+G G E+ +L ++ +SW+G + DY+ P + P +L+G D+PA
Sbjct: 50 AGWTAGENFHEQTTLEDV-----RSWLGAWSNKDYDWP-QKYPHDDLVG------DIPAT 97
Query: 88 FDSRTKWPNCPTIREIRDQGSCGSCW---------------------------------- 113
FDSR+ W +C I +IRDQG CGSCW
Sbjct: 98 FDSRSNWSDCSVIGKIRDQGGCGSCWAFGAAEAISDRICIASKGATDVMYAAEDVLSCCL 157
Query: 114 ----GCR-PYEIAPCEHHVN---------GTRPSCD------------------ASKGHT 141
GC Y +A E+ V GT+ +C G T
Sbjct: 158 TCGNGCNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPYTLEACEHHVPGDRPPCTEGGGT 217
Query: 142 PKCVRECQENYDV-PYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
PKC +C +Y YK D G K+YSV ++ I +EI +GPVE AFTV+ D YK
Sbjct: 218 PKCSHQCIPDYTTKAYKDDKVHGHKAYSVPNDVGKIQQEIMHYGPVEAAFTVYSDFPSYK 277
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
SG + + SG LGGHAI+I+GWG
Sbjct: 278 SGVY-------------------------------------RHTSGSELGGHAIKIIGWG 300
Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
+ + YWLI NSWN+DWGD G FKILRG +ECGIE + A
Sbjct: 301 TE--GGDDYWLINNSWNSDWGDKGTFKILRGSNECGIEGEVVAAT 343
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GCNGG+P A Y+V G+V+GG YG+K
Sbjct: 159 CGNGCNGGYPLAAMEYFVTRGLVTGGLYGTK 189
>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
Length = 342
Score = 146 bits (369), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 102/319 (31%), Positives = 141/319 (44%), Gaps = 102/319 (31%)
Query: 50 PRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSC 109
P A+ K GV D L RL I + D LP +FD+R KW CP++ IR+QG C
Sbjct: 57 PAAYFK---GVLYD-RLGETRLAPAILVNPQDIQLPESFDARQKWSQCPSLNVIRNQGCC 112
Query: 110 GSCW--------------------------------------GCRPYEIAPC-----EHH 126
GSCW GC+ + P E
Sbjct: 113 GSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACCHACGDGCKGGYLGPAWQFWVEQG 172
Query: 127 VNGTRP-------------SCDAS--KGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVS 170
V+ P CDAS + TPKC + CQ Y+V +D +G +YS+
Sbjct: 173 VSSGGPYNSRQGCHPYPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQDRRYGRVAYSIP 232
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
++E+ IM+EIY +GPV+ AF + DL YKSG +
Sbjct: 233 NDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVY-------------------------- 266
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
+ G GGHA++++GWG + + KYWL+ANSW DWGDNG FKI+R
Sbjct: 267 -----------RHVWGHMAGGHAVKLMGWGVE--NGLKYWLVANSWGDDWGDNGFFKIVR 313
Query: 291 GKDECGIESSITAGVPKLD 309
G++ CGIE + AG+P +
Sbjct: 314 GENHCGIEKDVHAGLPSFN 332
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG+ G AW++WV+ G+ SGG Y S+Q
Sbjct: 152 CGDGCKGGYLGPAWQFWVEQGVSSGGPYNSRQ 183
>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 356
Score = 146 bits (369), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 136/315 (43%), Gaps = 96/315 (30%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
SN K +GV P P + + LP NFD+RT W C TI I D
Sbjct: 64 FSNYTVEQFKRLLGVKPTPKKELRSTPAISHPKSLK--LPKNFDARTAWSQCSTIGRILD 121
Query: 106 QGSCGSCWG-----------CRPYEI----------APC--------------------E 124
QG CGSCW C +++ A C
Sbjct: 122 QGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWQYLA 181
Query: 125 HH-------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
HH + + P C+ + TPKCV++C V +KK ++ +Y VSS
Sbjct: 182 HHGVVTEECDPYFDQIGCSHPGCEPAY-RTPKCVKKCVSGNQV-WKKSKHYSVNAYRVSS 239
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ IM E+Y++GPVE AFTV++D YKSG +
Sbjct: 240 DPHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 272
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ +G LGGHA++++GWG E E YWL+AN WN +WGD+G FKI RG
Sbjct: 273 ----------KHITGYELGGHAVKLIGWGTTEDG-EDYWLLANQWNREWGDDGYFKIRRG 321
Query: 292 KDECGIESSITAGVP 306
+ECGIE +TAG+P
Sbjct: 322 TNECGIEEDVTAGLP 336
>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/194 (39%), Positives = 110/194 (56%), Gaps = 40/194 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
C YE C+HHV G P C ++ TP+CV +CQE Y V YKKD +F ++Y V SN +
Sbjct: 167 CNAYEFPKCDHHVEGKYPPCGETQ-PTPECVEKCQEGYPVEYKKDKHFFGEAYHVPSNVE 225
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I E+ +GP+E F+V++D + YKSG
Sbjct: 226 AIKTELMTNGPIEVDFSVYEDFMTYKSG-------------------------------- 253
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ + +GK LGGHA++++GWG ++ +YW IANSWN DWG+NG F+I+ GK+E
Sbjct: 254 -----IYQHVAGKYLGGHAVKLVGWGVEDGV--EYWKIANSWNEDWGENGYFRIIAGKNE 306
Query: 295 CGIESSITAGVPKL 308
CGIES AG+P+L
Sbjct: 307 CGIESDGVAGIPEL 320
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 25/31 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CGFGCNGG+P MAW ++ +G+ +GG YGSK
Sbjct: 134 CGFGCNGGWPSMAWSWFHSTGVTTGGEYGSK 164
>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
Length = 342
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 102/319 (31%), Positives = 141/319 (44%), Gaps = 102/319 (31%)
Query: 50 PRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSC 109
P A+ K GV D L RL I + D LP +FD+R KW CP++ IR+QG C
Sbjct: 57 PAAYFK---GVLYD-RLGETRLAPAILVNPQDIQLPESFDARQKWSQCPSLNVIRNQGCC 112
Query: 110 GSCW--------------------------------------GCRPYEIAPC-----EHH 126
GSCW GC+ + P E
Sbjct: 113 GSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACCHACGDGCKGGYLGPAWQFWVEQG 172
Query: 127 VNGTRP-------------SCDAS--KGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVS 170
V+ P CDAS + TPKC + CQ Y+V +D +G +YS+
Sbjct: 173 VSSGGPYNSRQGCHPYPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQDRRYGRVAYSIP 232
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
++E+ IM+EIY +GPV+ AF + DL YKSG +
Sbjct: 233 NDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVY-------------------------- 266
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
+ G GGHA++++GWG + + KYWL+ANSW DWGDNG FKI+R
Sbjct: 267 -----------RHVWGHMAGGHAVKLMGWGVE--NGLKYWLVANSWGDDWGDNGFFKIVR 313
Query: 291 GKDECGIESSITAGVPKLD 309
G++ CGIE + AG+P +
Sbjct: 314 GENHCGIEKDVHAGLPSFN 332
Score = 44.3 bits (103), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG+ G AW++WV+ G+ SGG Y S+Q
Sbjct: 152 CGDGCKGGYLGPAWQFWVEQGVSSGGPYNSRQ 183
>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
Length = 343
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 88/213 (41%), Positives = 110/213 (51%), Gaps = 56/213 (26%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFG 163
GS S +GC+PY IAPC VNG T P C S TPKCV C N Y +PY+KD ++G
Sbjct: 174 GSYESQFGCKPYSIAPCGQTVNGVTWPKCPNSDADTPKCVDHCTSNSSYPIPYEKDKHYG 233
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
A +Y+VS I EI ++GPVE F
Sbjct: 234 ATAYAVSRKVDQIQSEILKNGPVEVGF--------------------------------- 260
Query: 224 NTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSW 276
TV+ D YKSG LGGHA+++LGWG D + YWL ANSW
Sbjct: 261 -----------TVYADFYQYKSGVYVHVAGPELGGHAVKLLGWGVDNGTP--YWLAANSW 307
Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
NT+WG+NG F+ILRG +ECGIES + AG+P L+
Sbjct: 308 NTNWGENGYFRILRGVNECGIESQVVAGMPDLE 340
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 26/31 (83%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GG+P AW+YWVK+G+V+GG+Y S+
Sbjct: 149 CGDGCEGGYPIQAWKYWVKNGLVTGGSYESQ 179
>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
Length = 335
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 98/301 (32%), Positives = 129/301 (42%), Gaps = 109/301 (36%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG GC GG+P AWRYWVK+G+V+GG++ S+ G P P
Sbjct: 141 CGDGCEGGYPIQAWRYWVKNGLVTGGSFESQ------------------YGCKPYSIAPC 182
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
D T WP CP +I D CEHH
Sbjct: 183 GE----------------TIDGVT-WPECPM--KISDT--------------PKCEHHCT 209
Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
G +Y +PY +D +FGA +Y++ + K I EI HGPVE
Sbjct: 210 G-------------------NNSYPIPYDQDKHFGASAYAIGRSAKQIQTEILAHGPVEV 250
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
F V++D LYK+G + + +G
Sbjct: 251 GFIVYEDFYLYKTGIY-------------------------------------THVAGGE 273
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
LGGHA+++LGWG D + YWL ANSWNT WG+ G F+ILRG DECGIES+ AG+P L
Sbjct: 274 LGGHAVKMLGWGVDNGT--PYWLAANSWNTVWGEKGYFRILRGVDECGIESAAVAGMPDL 331
Query: 309 D 309
+
Sbjct: 332 N 332
>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
Length = 346
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 80/209 (38%), Positives = 109/209 (52%), Gaps = 40/209 (19%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKD 159
+ I GS S GC+PY PCEHH NGT C T C +CQ Y Y D
Sbjct: 177 KGIVSGGSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTCEHKCQSGYATAYTND 236
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
+GAK+Y+V++ K+I KEI HGPVE A+ V++D Y G
Sbjct: 237 KRYGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKG----------------- 279
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+ + +G LGGHA++++GWG + + YW+ +NSWN+D
Sbjct: 280 --------------------IYKHTAGSYLGGHAVKMIGWGTE--NGIPYWICSNSWNSD 317
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
WG+NG F+ILRG DECGIES + AG+PK+
Sbjct: 318 WGENGFFRILRGTDECGIESGVVAGLPKI 346
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 27/35 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CGFGC+GGFP AW YWV+ GIVSGG+Y SK K
Sbjct: 158 CGFGCDGGFPYAAWNYWVEKGIVSGGSYTSKSGCK 192
>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 398
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/198 (39%), Positives = 107/198 (54%), Gaps = 41/198 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
GC+PY PCEHH N T C TPKC ++C + Y + Y +D FG +Y V
Sbjct: 218 GCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVED 277
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ SI KEI HGPVE AF V++D ++Y G
Sbjct: 278 DVTSIQKEILTHGPVEVAFEVYEDFLMYDGG----------------------------- 308
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ ++ GK GGHA+++LGWG ++ YWL+ANSWNTDWG++G F+I+RG
Sbjct: 309 --------IYVHTGGKIGGGHAVKMLGWGVEQGVP--YWLVANSWNTDWGEDGFFRIIRG 358
Query: 292 KDECGIESSITAGVPKLD 309
DECGIESS+ G+PKL+
Sbjct: 359 IDECGIESSVVGGLPKLN 376
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 25/35 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CGFGC+GG P AW+YWVK GIV+G + KQ K
Sbjct: 186 CGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCK 220
>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
Length = 357
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 134/315 (42%), Gaps = 96/315 (30%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
SN A K +GV P P + + LP +FD+RT W C TI I D
Sbjct: 65 FSNYTVAQFKRLLGVKPSPKKELRSTPVVSHPRSLK--LPKSFDARTAWSQCSTIGRILD 122
Query: 106 QGSCGSCWGCRPYE---------------------IAPC--------------------E 124
QG CGSCW E +A C
Sbjct: 123 QGHCGSCWAFGAVESLSDRFCIHLDVNVSLSVNDLLACCGFLCGSGCDGGYPLYAWRYLA 182
Query: 125 HH-------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
HH + + P C+ + TPKCVR+C + + +KK F +YSV S
Sbjct: 183 HHGVVTEECDPYFDQIGCSHPGCEPAY-QTPKCVRKCVKGNQI-WKKSKYFSVNAYSVKS 240
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ IM E+Y++GPVE AFTV++D YKSG +
Sbjct: 241 DPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 273
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ +G LGGHA++++GWG ++ E YWLIAN WN WGD+G F I RG
Sbjct: 274 ----------KHITGSQLGGHAVKLIGWGTTDEG-EDYWLIANQWNRSWGDDGYFMIRRG 322
Query: 292 KDECGIESSITAGVP 306
+ECGIE +TAG+P
Sbjct: 323 TNECGIEEDVTAGLP 337
>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 316
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 99/191 (51%), Gaps = 39/191 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
CRPYEI PC H N T S + TP C CQ Y + Y D +G +YSVS++
Sbjct: 163 ACRPYEIPPCGIHKNETFYSNCTQEIDTPDCKTTCQAGYPISYDDDKTYGKTAYSVSNSV 222
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI +GPV AFTV+DD YK+G
Sbjct: 223 HAIQKEIMTYGPVVAAFTVYDDFFHYKTG------------------------------- 251
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ + SG GGHA+RILGWG ++ YWL+ANSWNTDWG+NG F+ILRG D
Sbjct: 252 ------IYKHVSGAEAGGHAVRILGWG--QQGGVPYWLVANSWNTDWGENGYFRILRGSD 303
Query: 294 ECGIESSITAG 304
ECGIE + AG
Sbjct: 304 ECGIEDGVVAG 314
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 35/64 (54%), Gaps = 9/64 (14%)
Query: 78 SEVD-EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDA 136
+E+D +P +FD+R WP+CP+I IRDQ CGSCW E+ + C A
Sbjct: 59 TEIDGSKIPDSFDARVTWPHCPSISYIRDQSQCGSCWAFSSAEVM--------SDRVCIA 110
Query: 137 SKGH 140
S GH
Sbjct: 111 SHGH 114
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 28/32 (87%)
Query: 10 GFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
G+GC+GG+P AW+Y+V++G+V+GG YG+K A
Sbjct: 132 GYGCDGGWPVSAWQYFVETGVVTGGLYGTKDA 163
>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 357
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 100/316 (31%), Positives = 137/316 (43%), Gaps = 98/316 (31%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDE-DLPANFDSRTKWPNCPTIREIR 104
SN K +GV P +P L S LP NFD+RT W C TI I
Sbjct: 65 FSNYTVEQFKRLLGVKP---MPKKELRSTPAISHPKTLKLPKNFDARTAWSQCSTIGRIL 121
Query: 105 DQGSCGSCWG-----------CRPYEI----------APC-------------------- 123
DQG CGSCW C +++ A C
Sbjct: 122 DQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYL 181
Query: 124 EHH-------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
HH + + P C+ + TPKCV++C V +KK ++ +Y V+
Sbjct: 182 AHHGVVTEECDPYFDQIGCSHPGCEPAY-RTPKCVKKCVSGNQV-WKKSKHYSVSAYRVN 239
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
S+ IM E+Y++GPVE AFTV++D YKSG +
Sbjct: 240 SDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVY-------------------------- 273
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
+ +G LGGHA++++GWG + E YWL+AN WN +WGD+G FKI R
Sbjct: 274 -----------KHITGYELGGHAVKLIGWGTTDDG-EDYWLLANQWNREWGDDGYFKIRR 321
Query: 291 GKDECGIESSITAGVP 306
G +ECGIE +TAG+P
Sbjct: 322 GTNECGIEEDVTAGLP 337
Score = 37.7 bits (86), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 18/25 (72%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GC+GG+P AWRY G+V+
Sbjct: 164 LCGSGCDGGYPLYAWRYLAHHGVVT 188
>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/195 (39%), Positives = 105/195 (53%), Gaps = 43/195 (22%)
Query: 115 CRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFGAKSYSVSS 171
C+ Y +APC HHV P C TP CV+ C N Y +PY KDL+ G+K+YS+
Sbjct: 190 CQAYSLAPCAHHVTSDVYPPCTGELP-TPPCVKSCDSNSTYTIPYPKDLHKGSKAYSIDQ 248
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE++IM EI +GP+E AFTV++D + YKSG +
Sbjct: 249 NEQAIMTEIQTNGPIEVAFTVYEDFLTYKSGVY--------------------------- 281
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ +G LGGHA++++GWG + + YW+I NSWN WGD G FKILRG
Sbjct: 282 ----------QHVTGSELGGHAVKMVGWGVENGT--PYWIIVNSWNESWGDKGTFKILRG 329
Query: 292 KDECGIESSITAGVP 306
++ECGIES +P
Sbjct: 330 QNECGIESECVTALP 344
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGC+GG+P A Y+V +G+V+G YG+
Sbjct: 157 CGFGCDGGWPEAAMDYYVNNGLVTGDLYGN 186
>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 340
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/193 (39%), Positives = 109/193 (56%), Gaps = 40/193 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDV-PYKKDLNFGAKSYSVSSNE 173
C+PY PC+HHV G C + TP+CV+EC Y Y+KDL+F +++YS+ N
Sbjct: 187 CKPYIFPPCDHHVTGQYQPCGPIQP-TPQCVKECNSEYTQNTYEKDLHFASQTYSIKQNV 245
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++I +EI HGPV+ +F V D + YKSG + IR+
Sbjct: 246 QAIQREIMAHGPVQASFKVAADFLTYKSGVY----------------IRN---------- 279
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
K GGH+++I+GWG++ + YWLIANSWN DWG+ GLF++LRG++
Sbjct: 280 ----------PKLKYEGGHSVKIIGWGKEGNT--PYWLIANSWNEDWGEKGLFRMLRGRN 327
Query: 294 ECGIESSITAGVP 306
ECGIE+ I AG+P
Sbjct: 328 ECGIEAQIVAGLP 340
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 25/33 (75%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
+ +P FD+R +WPNC +I+ IRDQ +CGSCW
Sbjct: 86 DPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWA 118
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
CG GC GG+P AW Y + G+ +GG YG
Sbjct: 154 CGMGCKGGYPSAAWGYMKRQGVSTGGLYG 182
>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 106/199 (53%), Gaps = 53/199 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEH G P+C TP+C + CQ+ Y PY++D ++G + Y+V SNE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +EI +GPVE A
Sbjct: 247 KAIQREIMMYGPVEAA-------------------------------------------- 262
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D + YKSG +GGHAIRI+GWG EK K YWLIANSWN DWG+NGLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGENGLF 320
Query: 287 KILRGKDECGIESSITAGV 305
+++RG+DEC IES + AG+
Sbjct: 321 RMVRGRDECSIESHVVAGL 339
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 63 DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D + R P + + +++ ++P+ FDSR KWP+C +I +IRDQ CGSCW
Sbjct: 70 DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120
>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 106/199 (53%), Gaps = 53/199 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEH G P+C TP+C + CQ+ Y PY++D ++G + Y+V SNE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +EI +GPVE A
Sbjct: 247 KAIQREIMMYGPVEAA-------------------------------------------- 262
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D + YKSG +GGHAIRI+GWG EK K YWLIANSWN DWG+NGLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGENGLF 320
Query: 287 KILRGKDECGIESSITAGV 305
+++RG+DEC IES + AG+
Sbjct: 321 RMVRGRDECSIESHVVAGL 339
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 63 DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D + R P + + +++ ++P+ FDSR KWP+C +I +IRDQ CGSCW
Sbjct: 70 DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120
>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
Full=Cysteine protease-related 4; Flags: Precursor
gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
Length = 335
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 131/300 (43%), Gaps = 110/300 (36%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG+GC GG+P AW+Y VKSG +GG+Y ++ G P P
Sbjct: 146 CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQ------------------FGCKPYSLAPC 187
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
E +G WP+CP D G Y+ C +
Sbjct: 188 G---ETVG--------------NVTWPSCP------DDG----------YDTPACVN--- 211
Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
KC +NY+V Y D +FG+ +Y+V I EI HGPVE
Sbjct: 212 --------------KCT---NKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEA 254
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
AFTV++D YK+G + ++ +G+
Sbjct: 255 AFTVYEDFYQYKTGVY-------------------------------------VHTTGQE 277
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
LGGHAIRILGWG D + YWL+ANSWN +WG+NG F+I+RG +ECGIE ++ GVPK+
Sbjct: 278 LGGHAIRILGWGTDNGT--PYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPKV 335
>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 304
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 77/199 (38%), Positives = 106/199 (53%), Gaps = 53/199 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEH G P+C TP+C + CQ+ Y PY++D ++G + Y+V SNE
Sbjct: 149 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 208
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +EI +GPVE A
Sbjct: 209 KAIQREIMMYGPVEAA-------------------------------------------- 224
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D + YKSG +GGHAIRI+GWG ++++ YWLIANSWN DWG+ GLF
Sbjct: 225 FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTP--YWLIANSWNEDWGEKGLF 282
Query: 287 KILRGKDECGIESSITAGV 305
+I+RG+DEC IES + AG+
Sbjct: 283 RIVRGRDECSIESHVVAGL 301
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/27 (74%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG AW YWVK GIV+GG+
Sbjct: 117 CGDGCKGGFPGQAWDYWVKRGIVTGGS 143
>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
Length = 321
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 138/308 (44%), Gaps = 96/308 (31%)
Query: 54 LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
L ++G+HPD N P ++ ++ D+P +FD+RTKWPNC ++ IRDQG+CGSCW
Sbjct: 57 LNGFIGLHPD----PNYKPPVLVHTFNARDVPESFDARTKWPNCDSLNRIRDQGACGSCW 112
Query: 114 GCRP--------------------------------------YEIAPCEHHVN------- 128
Y ++ + ++N
Sbjct: 113 AFASIESMSDRICIHSSGSAQFMFSPEDLLSCCTSCGDCGGGYMMSALDFYINEGIVSGG 172
Query: 129 ------GTRP-SCDA-SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEI 180
G RP + DA +G TP C + C+ Y Y D ++G+ Y VSS I E+
Sbjct: 173 DVNSNEGCRPYTADAHDQGQTPACTKSCRNGYSTSYSADKHYGSNDYVVSSVIDQIQYEV 232
Query: 181 YEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
+GP+ F VF D Y SG +
Sbjct: 233 MTNGPIIVNFEVFQDFYNYVSGVY------------------------------------ 256
Query: 241 ILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
+ SG+++G H ++I+GWG + + YWLIANSW + WGD+G FK+LRG++ECGIE+
Sbjct: 257 -RHVSGESVGFHVVKIVGWGVE--NGVPYWLIANSWGSSWGDHGFFKMLRGQNECGIENY 313
Query: 301 ITAGVPKL 308
A +P+L
Sbjct: 314 PYAVMPRL 321
>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
Length = 341
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 104/195 (53%), Gaps = 41/195 (21%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY +A CEHH G C TP C R C++ Y+V Y D +FGA SY V +
Sbjct: 188 GCQPYSLAKCEHHTTGPYKPC-GDIVPTPACKRSCRQGYNVTYPNDKHFGASSYGVRGVD 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EI +GPVE AFTV+ D + YKSG +
Sbjct: 247 Q-IATEIMTNGPVEAAFTVYSDFLSYKSGVY----------------------------- 276
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ LGGHAI+I+GWG + + YW++ANSWN WG++G F I +G D
Sbjct: 277 --------QHTSGQPLGGHAIKIIGWGVQDGT--DYWIVANSWNDSWGNDGFFWIKKGTD 326
Query: 294 ECGIESSITAGVPKL 308
ECGIES + AG+PK+
Sbjct: 327 ECGIESQVVAGLPKV 341
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGG+P AW YW GIV+GG Y S Q
Sbjct: 156 CGDGCNGGYPAAAWEYWKNQGIVTGGQYDSNQ 187
>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
Length = 340
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 105/201 (52%), Gaps = 54/201 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + C+HHVNGT C TPKCVR C++ Y+V +K D ++G SYSV
Sbjct: 186 GCMPYPVPSCDHHVNGTLGPC-GQDPPTPKCVRLCRKGYNVDFKDDKHYGKSSYSV---- 240
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
P NET I+ I N EGA
Sbjct: 241 ---------------------------------PSNETQ----IQMEIMKNGP---VEGA 260
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV+ D LYKSG ALGGHAIRILGWG + + YWL+ANSWNT+WGD G F
Sbjct: 261 FTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVE--NDVPYWLVANSWNTEWGDKGYF 318
Query: 287 KILRGKDECGIESSITAGVPK 307
KILRG +ECGIE I AG+PK
Sbjct: 319 KILRGSNECGIEEDIVAGIPK 339
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGGFP AW YWV GIV+GG Y + +
Sbjct: 154 CGSGCNGGFPAAAWSYWVDKGIVTGGNYDTDE 185
>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 110/209 (52%), Gaps = 57/209 (27%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVN-GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAK 165
G+ S GC+ Y + PCEHHV G+RP C + TP+CVR C E+ + Y + L FG +
Sbjct: 177 GAYNSSQGCKDYSLEPCEHHVEVGSRPQCSSLNFDTPECVRSCYES-SLDYTESLTFG-Q 234
Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
S +NEK + EI ++GP+E A
Sbjct: 235 QVSTFTNEKQMQLEILKNGPIEAA------------------------------------ 258
Query: 226 SQLGAEGAFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
FTV++D + YKSG +++GGHAI++LGWG +E +K YWLIANSWN
Sbjct: 259 --------FTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGVEEGTK--YWLIANSWN 308
Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVP 306
TDWGDNG FK LRG D CGIES A +P
Sbjct: 309 TDWGDNGYFKFLRGVDHCGIESETAASLP 337
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 23/36 (63%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKN 44
CG GC+GG+ W YW GIV+GGAY S Q K+
Sbjct: 152 CGLGCDGGYVAEPWDYWRTDGIVTGGAYNSSQGCKD 187
>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
Length = 342
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 106/201 (52%), Gaps = 53/201 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEH G P+C TP+C + CQ+ Y PY++D ++G + Y+V SNE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +EI +GPVE A
Sbjct: 247 KAIQREIMMYGPVEAA-------------------------------------------- 262
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D + YKSG +GGHAIRI+GWG EK K YWLIANSWN DWG+ GLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGEKGLF 320
Query: 287 KILRGKDECGIESSITAGVPK 307
+++RG+DEC IES + AG+ K
Sbjct: 321 RMVRGRDECSIESHVVAGLIK 341
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 63 DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D + R P + + +++ ++P+ FDSR KWP+C +I +IRDQ CGSCW
Sbjct: 70 DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120
>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 223
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 80/203 (39%), Positives = 105/203 (51%), Gaps = 54/203 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY +APCEH G+ P C + TPKC R+C+E Y+ Y D F YS++ +E
Sbjct: 68 GCKPYSLAPCEHSSQGSLPECVGTL-PTPKCKRQCREGYERSYDDDKYFAKNVYSINGSE 126
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I EI+++GPVE
Sbjct: 127 KQIRTEIFQNGPVEAE-------------------------------------------- 142
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FT + D + YKSG +G HAIRILGWG ++ + YWL+ANSWN DWGD+G F
Sbjct: 143 FTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWGSEDNNP--YWLLANSWNEDWGDHGYF 200
Query: 287 KILRGKDECGIESSITAGVPKLD 309
K+LRG +EC IES + AG+PKLD
Sbjct: 201 KMLRGVNECDIESFVNAGIPKLD 223
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG GC+GG AW+YW +G+VSGG Y + K
Sbjct: 36 CGSGCSGGVSAAAWQYWKDAGLVSGGLYNTTDGCK 70
>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
Length = 407
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 81/198 (40%), Positives = 104/198 (52%), Gaps = 43/198 (21%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GCRPY PCEHH N T C TPKC R+C +NY PYK D +G ++Y+V ++
Sbjct: 233 GCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCDRQCDKNYKKPYKADKYYGEQAYNVEND 292
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
+ I KEI GPVE +F V+ D + Y G
Sbjct: 293 VELIQKEIMTLGPVEASFEVYTDFLHYIGG------------------------------ 322
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN---GLFKIL 289
+ + +G GGHA++ILGWG D+ YWL ANSWNTDWG++ G F+IL
Sbjct: 323 -------IYKHVAGSVGGGHAVKILGWGIDQGV--SYWLAANSWNTDWGEDVFSGYFRIL 373
Query: 290 RGKDECGIESSITAGVPK 307
RG DECGIES I AG+P+
Sbjct: 374 RGVDECGIESGIVAGIPR 391
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
+ CGFGC GG P AW+YWV SGIV+G Y
Sbjct: 199 KTCGFGCFGGEPMAAWKYWVLSGIVTGSDY 228
>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 319
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 75/190 (39%), Positives = 101/190 (53%), Gaps = 39/190 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
C+PY CEHH G P+C TP C CQ++Y PY +D + G Y+V ++EK
Sbjct: 165 CQPYPFPKCEHHTKGKYPACFEEIYKTPNCENTCQKSYKTPYAQDKHRGKSRYNVKNDEK 224
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I KEI ++GPVE F V++D + YKSG
Sbjct: 225 AIQKEIMKYGPVEANFIVYEDFLNYKSG-------------------------------- 252
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ + +GK + HAIRI+GWG + + YWLI NSWN DWG+NG F+ILRG+ E
Sbjct: 253 -----IYKHITGKLVSWHAIRIIGWGVENNT--PYWLIPNSWNEDWGENGNFRILRGRHE 305
Query: 295 CGIESSITAG 304
C IES +TAG
Sbjct: 306 CSIESEVTAG 315
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 20/27 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG G GGFP +AW YWVK GIV+G +
Sbjct: 132 CGDGFEGGFPALAWDYWVKEGIVTGSS 158
>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
Length = 342
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 105/199 (52%), Gaps = 53/199 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEH G P+C TP+C + CQ+ Y PY++D ++G + Y+V SNE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +EI +GPVE A
Sbjct: 247 KAIQREIMMYGPVEAA-------------------------------------------- 262
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D + YKSG +GGHAIRI+GWG EK K YWLIANSWN DWG+ GLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGEKGLF 320
Query: 287 KILRGKDECGIESSITAGV 305
+++RG+DEC IES + AG+
Sbjct: 321 RMVRGRDECSIESHVVAGL 339
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 35/52 (67%), Gaps = 1/52 (1%)
Query: 63 DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D + NR P + + +++ ++P+ FDSR KWP+C +I +IRDQ CGSCW
Sbjct: 70 DAEMKRNRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120
>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 134/315 (42%), Gaps = 96/315 (30%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIR 104
+N K +GV P P L + I DLP FD+RT+W +C TI I
Sbjct: 65 FANYTIEQFKHILGVKP---TPPGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNIL 121
Query: 105 DQGSCGSCWGCRPYEIAP-------------------------CEHHVNGTRP------- 132
DQG CG+CW E C NG P
Sbjct: 122 DQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRYF 181
Query: 133 --------SCDASKGHT-------------PKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
CD T PKC R+C+ V +KK+ +F +Y V S
Sbjct: 182 RRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCKVENQV-WKKNKHFSVNAYRVHS 240
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
N IM E+Y++GPVE AFTV++D YKSG +
Sbjct: 241 NPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 273
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ +G +GGHA++++GWG + + E YWL+AN WN WGD+G FKI+RG
Sbjct: 274 ----------KHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIRG 322
Query: 292 KDECGIESSITAGVP 306
K+ECGIE +TAG+P
Sbjct: 323 KNECGIEEDVTAGMP 337
Score = 40.8 bits (94), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GCNGG+P AWRY+ +SG+V+
Sbjct: 164 LCGSGCNGGYPISAWRYFRRSGVVT 188
>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
Length = 356
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/201 (38%), Positives = 101/201 (50%), Gaps = 53/201 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY APC HH NGT C TP C + CQ Y + Y KD +G K+YS+ +
Sbjct: 199 GCRPYPFAPCNHHSNGTYGPCSHDLEPTPVCKKACQSTYKIQYNKDKYYGLKAYSLHNKA 258
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ KE+ +GP+E A
Sbjct: 259 SDLQKELMMNGPMEVA-------------------------------------------- 274
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D +LYK+G LGGHA+R+LGWGE+ + YWL+ANSWNT+WGD G F
Sbjct: 275 FEVYEDFLLYKTGVYQHHTGSVLGGHAVRLLGWGEE--NGVPYWLLANSWNTEWGDKGFF 332
Query: 287 KILRGKDECGIESSITAGVPK 307
KI RG++ECGIES AG+ K
Sbjct: 333 KIYRGRNECGIESEAVAGLYK 353
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 42/77 (54%), Gaps = 1/77 (1%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLP-ELIGYSEVDEDLPANFDSRTKWPNC 97
K +P ++ MGV L N +P +I Y +D ++P FDSR +WP C
Sbjct: 56 KAGRNPYFETVPSHVIQGMMGVRRSSKLETNSIPLPVISYEHIDMEIPVEFDSRKQWPYC 115
Query: 98 PTIREIRDQGSCGSCWG 114
PTI EIRDQ +CGSCW
Sbjct: 116 PTIGEIRDQSNCGSCWA 132
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 24/32 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
++CGFGC GG P AW +WVK G+V+GG Y +
Sbjct: 165 KICGFGCQGGDPHQAWSFWVKYGLVTGGNYTT 196
>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
Length = 325
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/192 (39%), Positives = 101/192 (52%), Gaps = 53/192 (27%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
C+PY + CEHH+NG++P+C + TP+CV C Y Y++DL++G +YSV
Sbjct: 180 CQPYPLPSCEHHINGSKPACPSKIAKTPECVHTCHAGYPTSYEQDLHYGESAYSVRRRVA 239
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
I EI +GPVE A F
Sbjct: 240 EIQTEIMTNGPVEAA--------------------------------------------F 255
Query: 235 TVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
TV+ D YKSG + LGGHA++++GWGE++ YWLIANSWN+DWGD+G FK
Sbjct: 256 TVYADFPAYKSGVYKRHSLRQLGGHAVKMIGWGEEDGIP--YWLIANSWNSDWGDHGYFK 313
Query: 288 ILRGKDECGIES 299
I+RG+DECGIES
Sbjct: 314 IVRGQDECGIES 325
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 33/56 (58%), Gaps = 7/56 (12%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
GV LP + LP L ED+P FDSRT+WP+C TI I DQ +CGSCW
Sbjct: 62 GVKGSIPLPLSDLPVL-------EDIPDMFDSRTQWPDCKTIGLIEDQSNCGSCWA 110
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIP 50
R CG GC GGF G AW YW + G+V+GG Y E ++ P
Sbjct: 141 RNCGNGCEGGFLGAAWNYWKQEGLVTGGLYNPSATESDTCQPYP 184
>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
Length = 319
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/308 (30%), Positives = 139/308 (45%), Gaps = 96/308 (31%)
Query: 54 LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGS-- 111
L ++G+HPD N +PE I ++ +D+P FD+R KWP C ++ IRDQGSCGS
Sbjct: 55 LNGFLGLHPD----PNYMPEKIKHNFNPQDIPKTFDARKKWPKCDSLNRIRDQGSCGSCW 110
Query: 112 --------------------------------CWGCRP----YEIAPCEHHVN------- 128
C C Y +A + ++
Sbjct: 111 AFAAVETMSDRICIHSSGAKKFFFSAEDLLSCCTACGSCSGGYMMAAFDFYIKQGVVSGG 170
Query: 129 ------GTRP-SCDA-SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEI 180
G RP + DA KG TP C + C++ Y Y D ++G+K Y V + +I EI
Sbjct: 171 DLNSNEGCRPYTADAHDKGVTPSCTKSCRKGYPTSYSSDKHYGSKDYIVDAGVSNIQYEI 230
Query: 181 YEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
+GP+ +F V+ D Y SG +
Sbjct: 231 MTNGPIIVSFKVYQDFYNYGSGVYH----------------------------------- 255
Query: 241 ILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
+ SG G H ++I+GWG +++ + YWLIANSW + WG++G FKILRGK+ECGIE++
Sbjct: 256 --HVSGNYTGNHIVKIVGWGTEKE--QDYWLIANSWGSSWGEHGFFKILRGKNECGIENN 311
Query: 301 ITAGVPKL 308
A +PKL
Sbjct: 312 PYAVLPKL 319
>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
cantonensis]
Length = 394
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/196 (39%), Positives = 104/196 (53%), Gaps = 41/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
GC+PY PCEHH N TR C TPKC ++C +Y + Y D +G +Y V +
Sbjct: 218 GCKPYPFPPCEHHSNKTRFDPCRHDLYPTPKCSKKCVPSYKEKNYDDDRFYGRTAYGVKN 277
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ +I KEI HGPVE AF V++D + Y G
Sbjct: 278 DVAAIQKEILTHGPVEVAFEVYEDFLHYAGG----------------------------- 308
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ ++ GK GGHA++++GWG D+ + YWLIANSWNTDWG+ G F+ILRG
Sbjct: 309 --------IYVHTGGKLGGGHAVKLIGWGIDQGTP--YWLIANSWNTDWGEEGFFRILRG 358
Query: 292 KDECGIESSITAGVPK 307
DECGIES + G+PK
Sbjct: 359 VDECGIESGVVGGIPK 374
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 1/57 (1%)
Query: 58 MGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
MGV+ + +L L ++D D+P FD+R W NC +I+ IRDQ SCGSCW
Sbjct: 96 MGVN-NVHLSVKAKQHLSSTKDLDIDIPETFDARQHWSNCQSIKNIRDQSSCGSCWA 151
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 24/37 (64%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
R CGFGC GG P AW+YWV GIV+G + + Q K
Sbjct: 184 RTCGFGCEGGDPMFAWQYWVDHGIVTGSNFTANQGCK 220
>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
Length = 248
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/178 (44%), Positives = 101/178 (56%), Gaps = 39/178 (21%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
+ I G GS GC PYEIAPCEHHVNGTR C G TP CV++C+E Y VPY +DL
Sbjct: 110 KGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPTCVKKCEEGYKVPYAQDL 168
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+ G +YS+ ++ I +EIY +GPVEGAFTV++D I Y++G +
Sbjct: 169 HHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 212
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ +GKALGGHAIRILGWG + + YWL+ANSWNT
Sbjct: 213 ---------------------KHVAGKALGGHAIRILGWGV-QNGEIPYWLVANSWNT 248
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/30 (80%), Positives = 24/30 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGCNGGFPG AW YW GIVSGG YGS
Sbjct: 91 CGFGCNGGFPGAAWNYWKTKGIVSGGPYGS 120
>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 325
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 87/287 (30%), Positives = 128/287 (44%), Gaps = 95/287 (33%)
Query: 78 SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHHVNGTRP-- 132
+ + DLP D+R +WP C I +RDQ +CGSCW + C + +P
Sbjct: 78 ANLSVDLPFEMDARKRWPQCKYIGFVRDQANCGSCWAVSSASVMTDRICIESIAAKQPLL 137
Query: 133 --------------SCD-----------ASKG--------------------------HT 141
CD A++G T
Sbjct: 138 SEEELVSCCKICGYGCDGGYPDKAFIYWATRGIPTGGPYGSTKGCKPYSIGSNSEDEAET 197
Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
P C R+C Y +D +FG K Y V+SNE+ IM+E+Y++GPV AF V++D + Y
Sbjct: 198 PLCTRQCINEYPYNLSQDRHFGEKPYWVNSNEEQIMQELYKNGPVVVAFNVYEDFMYYIK 257
Query: 202 GRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE 261
G + ++ GK LGGHA++++GWG
Sbjct: 258 GVY-------------------------------------EHRFGKFLGGHAVKLIGWGI 280
Query: 262 DEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+ + +KYWLI+NSWNT WG+NG FKI+RGK+ C IES + AG+ ++
Sbjct: 281 E--NSKKYWLISNSWNTTWGENGFFKIIRGKNCCAIESYVVAGMARI 325
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
++CG+GC+GG+P A+ YW GI +GG YGS + K
Sbjct: 147 KICGYGCDGGYPDKAFIYWATRGIPTGGPYGSTKGCK 183
>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 337
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 103/195 (52%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y C HH + P C TP CV++C + D Y D +Y+V + +
Sbjct: 177 GCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKC-DTPDTDYATDKTRANITYNVKAKQ 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+IMKEI +GPVE AF V++D + YKSG +F
Sbjct: 236 NAIMKEIMINGPVEAAFQVYEDFLGYKSGVYF---------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ G LGGHAIRILGWGE+ + YWLIANSWN WG++G FK+LRGK+
Sbjct: 268 ---------HSDGTLLGGHAIRILGWGEE--NGVAYWLIANSWNDGWGEDGCFKMLRGKN 316
Query: 294 ECGIESSITAGVPKL 308
ECGIE +TAG+P+L
Sbjct: 317 ECGIEDEVTAGLPEL 331
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGC GGFP +AW +W GIV+GG+
Sbjct: 145 CGFGCQGGFPPIAWDFWQTEGIVTGGS 171
>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
Length = 337
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 103/195 (52%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y C HH + P C TP CV++C + D Y D +Y+V + +
Sbjct: 177 GCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKC-DTPDTDYATDKTRANITYNVKAKQ 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+IMKEI +GPVE AF V++D + YKSG +F
Sbjct: 236 NAIMKEIMINGPVEAAFQVYEDFLGYKSGVYF---------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ G LGGHAIRILGWGE+ + YWLIANSWN WG++G FK+LRGK+
Sbjct: 268 ---------HSDGTLLGGHAIRILGWGEE--NGVAYWLIANSWNDGWGEDGYFKMLRGKN 316
Query: 294 ECGIESSITAGVPKL 308
ECGIE +TAG+P+L
Sbjct: 317 ECGIEDEVTAGLPEL 331
Score = 44.3 bits (103), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 20/27 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGC GGFP AW +W GIV+GG+
Sbjct: 145 CGFGCQGGFPPTAWDFWQTEGIVTGGS 171
>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
Length = 228
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 103/195 (52%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y C HH + P C TP CV++C + D Y D +Y+V + +
Sbjct: 68 GCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKC-DTPDTDYATDKTRANITYNVKAKQ 126
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+IMKEI +GPVE AF V++D + YKSG +F
Sbjct: 127 NAIMKEIMINGPVEAAFQVYEDFLGYKSGVYF---------------------------- 158
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ G LGGHAIRILGWGE+ + YWLIANSWN WG++G FK+LRGK+
Sbjct: 159 ---------HSDGTLLGGHAIRILGWGEE--NGVAYWLIANSWNDGWGEDGYFKMLRGKN 207
Query: 294 ECGIESSITAGVPKL 308
ECGIE +TAG+P+L
Sbjct: 208 ECGIEDEVTAGLPEL 222
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 20/27 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGC GGFP AW +W GIV+GG+
Sbjct: 36 CGFGCQGGFPPTAWDFWQTEGIVTGGS 62
>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
Length = 350
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 103/195 (52%), Gaps = 42/195 (21%)
Query: 115 CRPYEIAPCEHHVNGTRPSC-DASKGHTPKCVRECQENYDV-PYKKDLNFGAKSYSVSSN 172
C+PY PC HHV G +C D + +TPKC EC Y Y++DL+ G SYSV +
Sbjct: 192 CQPYSFPPCSHHVQGEYQACTDLPQFNTPKCYTECNSQYTQNSYEQDLHKGVSSYSVPKS 251
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
E+ I EIY++G +F V+ D + Y SG + NTS
Sbjct: 252 EEQIKAEIYQYGSTTASFNVYSDFLTYSSG------------------VYQNTS------ 287
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
G +GGHAI++LGWG + + YWL ANSWN+ WG+NG FKILRG
Sbjct: 288 -------------GSYMGGHAIKMLGWGVENGTP--YWLCANSWNSSWGENGFFKILRGS 332
Query: 293 DECGIESSITAG-VP 306
+ECGIES + AG VP
Sbjct: 333 NECGIESGMVAGFVP 347
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 17/28 (60%), Positives = 21/28 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG GCNGG+ AW Y+VK+G+VSG Y
Sbjct: 154 CGMGCNGGYTAGAWNYYVKTGLVSGNLY 181
>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
Length = 328
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 100/334 (29%), Positives = 145/334 (43%), Gaps = 113/334 (33%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+A +N +I + LKS V + ++P L + E+ P FD+R +WP+CP
Sbjct: 36 KAGRNFAKDISKDFLKSLNCVRKNPDIPKLPLKNVTPTKEI----PVEFDAREQWPHCPC 91
Query: 100 IREIRDQGSCGSCWG-----------CRPYE-----------IAPC-------------- 123
I EIRDQG+CGSCW C E +A C
Sbjct: 92 IDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSENVAACCTECGNACYGGDED 151
Query: 124 --------EHHVNGTRPSCDASKGHTPKCVRECQENYDVP-------------------- 155
+ V+G R ++++G P V EC+ + + P
Sbjct: 152 TAFTHWVTKGFVSGGRH--NSNEGCQPYSVEECEHHIEGPRPPCEGDMPELVCSETCHEE 209
Query: 156 ----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
Y++DL +G ++Y + + I +EI +GPV AF V+DD + YKSG +
Sbjct: 210 YGKTYEEDLEYGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLSYKSGVY------- 262
Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
+++G G HA+R++GWGE+E + YWL
Sbjct: 263 ------------------------------QHETGLLDGYHAVRVIGWGEEEGT--PYWL 290
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
+ANSWNTDWGDNGLFKILRG DEC E + A
Sbjct: 291 VANSWNTDWGDNGLFKILRGSDECEFEGDMAAAT 324
>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
Length = 350
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 105/195 (53%), Gaps = 39/195 (20%)
Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY PC HH N C TP+C + CQ Y Y+ D +G +Y++ +NE
Sbjct: 194 CKPYAFHPCGHHRNEIYYGECPKEIFPTPQCTQSCQAGYASDYEDDKIYGKSAYALPNNE 253
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +EI +GPV+ AF V++D Y+SG
Sbjct: 254 KAIQREIMTNGPVQAAFMVYEDFSRYRSG------------------------------- 282
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ ++ +G+ GGHA++++GWG D+ KYWL ANSWN+DWG+NG F+I+RG D
Sbjct: 283 ------IYVHTAGRREGGHAVKLIGWGVDDDGN-KYWLAANSWNSDWGENGYFRIVRGVD 335
Query: 294 ECGIESSITAGVPKL 308
CGIES++ AG+P +
Sbjct: 336 HCGIESAVVAGMPDV 350
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG GC GG+P AWRY++ G+ +GG Y K K
Sbjct: 161 CGRGCRGGYPIEAWRYFMLHGVCTGGHYAEKDVCK 195
>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 74/195 (37%), Positives = 104/195 (53%), Gaps = 43/195 (22%)
Query: 115 CRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFGAKSYSVSS 171
C+ Y APC HHV P C TP C+ C N + +PY KD++ G+K+Y ++
Sbjct: 190 CQAYTFAPCAHHVTSDIYPPCTGELP-TPPCINSCDSNSTHTIPYSKDIHRGSKAYGIAK 248
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+EK+IM EIY++GP+E A TV++D + YK+G +
Sbjct: 249 DEKAIMAEIYKNGPIEVALTVYEDFLTYKTGVY--------------------------- 281
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ +G LGGHA++++GWG + + YW I NSWN WGD G FKILRG
Sbjct: 282 ----------QHVTGDELGGHAVKMVGWGVENGT--PYWTIVNSWNESWGDKGTFKILRG 329
Query: 292 KDECGIESSITAGVP 306
K+ECGIESS +P
Sbjct: 330 KNECGIESSCVTALP 344
>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
Length = 323
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 91/307 (29%), Positives = 128/307 (41%), Gaps = 96/307 (31%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----- 113
G++ Y P + + V +P +FDSRT+W NC +I IRDQ CGSCW
Sbjct: 56 GMNVKYAAPHSDEIRSTEVNNVLPFIPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTA 115
Query: 114 ------------GCRPYEIAPCEHHVNGTRPSCDASKGHTP------------------- 142
G + I+P + D KG P
Sbjct: 116 EVISDRICIATKGTQQPTISPTDMLACCGNSCGDGCKGRYPIQAFRWWNSRGVVTGGDFR 175
Query: 143 ---------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
C CQ Y Y KD FG +Y+V+ N +I EI
Sbjct: 176 GSGCRPYPFAPCISCPEEKTPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIM 235
Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
+GPV GAFT+++D+ YKSG +
Sbjct: 236 TNGPVVGAFTMYEDMYKYKSGVY------------------------------------- 258
Query: 242 LYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
+ +G+ LGGHAI+I+GWG ++ YWLIANSW +WG+NG K+ RG +ECGIE ++
Sbjct: 259 RHTAGRLLGGHAIKIIGWG--TQNGIPYWLIANSWGANWGENGFLKMRRGVNECGIERAV 316
Query: 302 TAGVPKL 308
AG+P++
Sbjct: 317 VAGMPRV 323
>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
Length = 337
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/195 (39%), Positives = 105/195 (53%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y C HH + P C TPKCV +C + ++ Y+ D +Y+V ++
Sbjct: 177 GCRSYPFPKCSHHGSKKYPPCPHRIYDTPKCVPKC-DTPNIDYETDKTRANITYNVQRSQ 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+IMKEI +GPVE AF V++D YK G +F
Sbjct: 236 MAIMKEIMINGPVEAAFEVYEDFFGYKQGVYF---------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWGE+ + YWLIANSWN WG++G FK+LRGK+
Sbjct: 268 ---------HSTGEFIGGHAIRILGWGEENGT--PYWLIANSWNEGWGEDGYFKMLRGKN 316
Query: 294 ECGIESSITAGVPKL 308
ECGIE +TAG+P+L
Sbjct: 317 ECGIEDEVTAGLPEL 331
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 20/27 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGC GG+P AW +W GIV+GG+
Sbjct: 145 CGFGCQGGYPPAAWDFWQAYGIVTGGS 171
>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 352
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 107/199 (53%), Gaps = 43/199 (21%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
GC+PY PCEHH N T C TPKC ++C + Y + Y +D FG +Y V
Sbjct: 177 GCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVED 236
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ SI KEI HGPVE AF V++D
Sbjct: 237 DVTSIQKEILTHGPVEVAFEVYED------------------------------------ 260
Query: 232 GAFTVFDD-LILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
F ++D + ++ GK GGHA+++LGWG ++ YWL+ANSWNTDWG++G F+I+R
Sbjct: 261 --FLMYDGGIYVHTGGKIGGGHAVKMLGWGVEQGVP--YWLVANSWNTDWGEDGFFRIIR 316
Query: 291 GKDECGIESSITAGVPKLD 309
G DECGIESS+ G+PKL+
Sbjct: 317 GIDECGIESSVVGGLPKLN 335
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 26/37 (70%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
+ CGFGC+GG P AW+YWVK GIV+G + KQ K
Sbjct: 143 KSCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCK 179
>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
Length = 342
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 132/297 (44%), Gaps = 113/297 (38%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CGFGC+GGFP AW Y+V +G+V+GG YG+K N R + S G HP+
Sbjct: 158 CGFGCDGGFPDAAWEYFVSTGVVTGGLYGTK--------NACRPYEISPCGNHPN----- 204
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
T + NC +
Sbjct: 205 ----------------------ETFYRNCTGV---------------------------- 214
Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
+ PSC S CQ+ Y V YK D G KSY+++++ +I K+I +HGP+
Sbjct: 215 -STPSCKTS----------CQKGYPVSYKDDKTRGRKSYNLANSVSAIQKDILKHGPLVA 263
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
F+V++D + YK G + Y G
Sbjct: 264 TFSVYEDFMYYKKG-------------------------------------IYRYTHGGY 286
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
GGHA+RILGWG + + KYW+IANSWNTDWG++G F+++RG ++CGIE S++AG+
Sbjct: 287 EGGHAVRILGWGVE--NNVKYWIIANSWNTDWGEDGFFRMVRGINDCGIEESVSAGL 341
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 29/43 (67%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
P+L E +P +FD+RT+WP+CP+I IRDQ CGSCW
Sbjct: 81 PQLQENEEDTAGIPESFDARTQWPHCPSISLIRDQADCGSCWA 123
>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
Length = 317
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 130/305 (42%), Gaps = 101/305 (33%)
Query: 63 DYNLPANRLPELIG--YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------- 113
D A PEL + V +P FD+RT+WPNC +I+ IR+Q +CGSCW
Sbjct: 52 DVKYAAPHSPELRASQVNTVLPSIPTYFDARTRWPNCRSIKMIRNQATCGSCWAFGAAEV 111
Query: 114 ----------GCRPYEIAP----------CEHHVNGTRP--------------------- 132
G + I+P C + G P
Sbjct: 112 MSDRICIASMGTKQPIISPTDLLSCCGNFCGYGCKGASPLQAFRWWNKKGVVTGGDYRGS 171
Query: 133 --------SCDA---SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
C A +K TP+C CQ Y Y KD FG +Y V + +I EI
Sbjct: 172 GCKPYPFAPCTALPCTKSETPRCSLNCQPAYSKAYSKDKYFGTPAYIVGMDVAAIQTEI- 230
Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
+GPVE AF V+DD Y+SG +
Sbjct: 231 TNGPVEAAFIVYDDFNHYRSGVY------------------------------------- 253
Query: 242 LYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
+ +GK +GGHA++I+GWG ++ YWL+ANSW WG+NG FK+LRG DECGIES+I
Sbjct: 254 RHVAGKLVGGHAVKIIGWG--IQNGAPYWLMANSWGPYWGENGFFKMLRGVDECGIESTI 311
Query: 302 TAGVP 306
AG P
Sbjct: 312 VAGKP 316
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 20/29 (68%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG+GC G P A+R+W K G+V+GG Y
Sbjct: 140 FCGYGCKGASPLQAFRWWNKKGVVTGGDY 168
>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
Length = 369
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 76/198 (38%), Positives = 105/198 (53%), Gaps = 41/198 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
GC+PY PCEHH T C TPKC ++C +Y D Y +D FGA +Y V
Sbjct: 192 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKD 251
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ ++I KE+ HGP+E AF V++D + Y G +
Sbjct: 252 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 284
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
++ GK GGHA++++GWG D+ YW +ANSWNTDWG++G F+ILRG
Sbjct: 285 ----------VHTGGKLGGGHAVKLIGWGIDDGIP--YWTVANSWNTDWGEDGFFRILRG 332
Query: 292 KDECGIESSITAGVPKLD 309
DECGIES + G+PKL+
Sbjct: 333 VDECGIESGVVGGIPKLN 350
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 27/36 (75%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++D D+P +FDSR WP C +I+ IRDQ SCGSCW
Sbjct: 90 DLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWA 125
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 25/37 (67%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
+ CGFGCNGG P AWRYWVK GIV+G Y + K
Sbjct: 158 KSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCK 194
>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
Length = 378
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 76/198 (38%), Positives = 105/198 (53%), Gaps = 41/198 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
GC+PY PCEHH T C TPKC ++C +Y D Y +D FGA +Y V
Sbjct: 201 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKD 260
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ ++I KE+ HGP+E AF V++D + Y G +
Sbjct: 261 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 293
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
++ GK GGHA++++GWG D+ YW +ANSWNTDWG++G F+ILRG
Sbjct: 294 ----------VHTGGKLGGGHAVKLIGWGIDDGIP--YWTVANSWNTDWGEDGFFRILRG 341
Query: 292 KDECGIESSITAGVPKLD 309
DECGIES + G+PKL+
Sbjct: 342 VDECGIESGVVGGIPKLN 359
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 27/36 (75%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++D D+P +FDSR WP C +I+ IRDQ SCGSCW
Sbjct: 99 DLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWA 134
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 25/37 (67%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
+ CGFGCNGG P AWRYWVK GIV+G Y + K
Sbjct: 167 KSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCK 203
>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
Full=Cysteine protease-related 6; Flags: Precursor
gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
Length = 379
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 76/198 (38%), Positives = 105/198 (53%), Gaps = 41/198 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
GC+PY PCEHH T C TPKC ++C +Y D Y +D FGA +Y V
Sbjct: 202 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKD 261
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ ++I KE+ HGP+E AF V++D + Y G +
Sbjct: 262 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 294
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
++ GK GGHA++++GWG D+ YW +ANSWNTDWG++G F+ILRG
Sbjct: 295 ----------VHTGGKLGGGHAVKLIGWGIDDGIP--YWTVANSWNTDWGEDGFFRILRG 342
Query: 292 KDECGIESSITAGVPKLD 309
DECGIES + G+PKL+
Sbjct: 343 VDECGIESGVVGGIPKLN 360
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 27/36 (75%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++D D+P +FDSR WP C +I+ IRDQ SCGSCW
Sbjct: 100 DLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWA 135
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 25/37 (67%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
+ CGFGCNGG P AWRYWVK GIV+G Y + K
Sbjct: 168 KSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCK 204
>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
(Schistosoma japonicum)
Length = 316
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 99/191 (51%), Gaps = 39/191 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G PSC TP+C R+CQ+ Y PY+ D ++G S +V NE
Sbjct: 161 GCQPYPFPKCEHHSKGKYPSCGDKMYKTPQCKRKCQKGYKTPYEHDKHYGGISINVIKNE 220
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI +GPVE +F+D + YKSG
Sbjct: 221 SAIQKEIMMYGPVEAYLLIFEDFLNYKSG------------------------------- 249
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +G +G H +RI+GWG + + YWL AN+WN DWG+ G F+I+RG++
Sbjct: 250 ------IYRYTTGSFVGEHYVRIIGWGIENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 301
Query: 294 ECGIESSITAG 304
EC +ES + AG
Sbjct: 302 ECSVESVVVAG 312
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + ++ ++P++FDSR KWP C +I +IRDQ C S W
Sbjct: 40 GRREDPNLRQKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWA 94
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/27 (70%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GGFPG AW YWV GIV+GG+
Sbjct: 129 CGSGCDGGFPGPAWDYWVSHGIVTGGS 155
>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 99/195 (50%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY C+HH G P C TPKCV+ C + + Y+KD SY+V +E
Sbjct: 183 GCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHC-DTPKIDYQKDKTRANTSYNVHQSE 241
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+IMKEI +GPVE F V +D YKSG
Sbjct: 242 VAIMKEILLNGPVEATFEVHEDFPEYKSG------------------------------- 270
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ + G ++GGHAIRILGWGE+ + YWLIANSWN DWG+ G + LRG +
Sbjct: 271 ------IYFHAWGGSVGGHAIRILGWGEE--NGVPYWLIANSWNEDWGEKGYLRFLRGHN 322
Query: 294 ECGIESSITAGVPKL 308
ECGIE TAG+P L
Sbjct: 323 ECGIEEEATAGLPDL 337
>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
Length = 246
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 78/178 (43%), Positives = 101/178 (56%), Gaps = 39/178 (21%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
+ I G GS GC PYEIAPCEHHVNGTR C G TP CV++C++ Y VPY +DL
Sbjct: 108 KGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPACVKKCEDGYKVPYAQDL 166
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+ G +YS+ ++ I +EIY +GPVEGAFTV++D I Y++G +
Sbjct: 167 HRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 210
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ +GKALGGHAIRILGWG + + YWL+ANSWNT
Sbjct: 211 ---------------------KHVAGKALGGHAIRILGWGV-QNGEIPYWLVANSWNT 246
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 29/44 (65%), Positives = 36/44 (81%)
Query: 70 RLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+L +L+ Y++ DLP NFD+R WPNCPTIRE+RDQGSCGSCW
Sbjct: 10 KLEQLVSYTDTPTDLPENFDAREHWPNCPTIREVRDQGSCGSCW 53
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 25/32 (78%), Positives = 25/32 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGGFPG AW YW GIVSGG YGSK
Sbjct: 89 CGFGCNGGFPGAAWHYWKTKGIVSGGPYGSKM 120
>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
Length = 383
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 101/202 (50%), Gaps = 54/202 (26%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GCRPY PCEHH N T C TPKCV++C +NY YK D +G + Y+V SN
Sbjct: 220 GCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKADKYYGEQVYNVESN 279
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
+SI KEI GPVE +
Sbjct: 280 VESIQKEIMTLGPVEAS------------------------------------------- 296
Query: 233 AFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F V+ D + Y +G GGHA+++LGWG D+ YWL ANSWNTDWG++G
Sbjct: 297 -FEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVP--YWLAANSWNTDWGEDGY 353
Query: 286 FKILRGKDECGIESSITAGVPK 307
F+ILRG +ECGIES I AG+PK
Sbjct: 354 FRILRGVNECGIESGIIAGIPK 375
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE 119
+P +FD+R WP C ++R +RDQ SCGSCW E
Sbjct: 123 IPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVE 158
Score = 44.7 bits (104), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 21/30 (70%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
+ CGFGC GG P AW+YWV GIV+G Y
Sbjct: 186 KTCGFGCFGGEPMAAWKYWVLRGIVTGSEY 215
>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
Length = 195
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/164 (45%), Positives = 96/164 (58%), Gaps = 40/164 (24%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 72 GCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYDSYSVSNSE 130
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 131 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 161
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
+ +G+ +GGHAIRILGWG + + YWL+ANSWN
Sbjct: 162 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWN 195
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 39 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 69
>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
Length = 358
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/323 (30%), Positives = 136/323 (42%), Gaps = 94/323 (29%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G K A SN +GV P +P + + LP +FD+RT WP
Sbjct: 56 GWKAAMNPRFSNYSVGQFMHLLGVKPTLQKDLEGVPVITHPKTLK--LPKHFDARTAWPQ 113
Query: 97 CPTIREIRDQGSCGSCWGCRPYE---------------------IAPCE----------- 124
C TI +I DQG CGSCW E +A C
Sbjct: 114 CSTIGKILDQGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGSGCDGGY 173
Query: 125 ---------HHVNGTR---PSCDASKGHTP---------KCVRECQENYDVPYKKDLNFG 163
HH T P DA+ P KCVR+C + + ++K +G
Sbjct: 174 PLYAWRYFIHHGVVTEECDPYFDATGCSHPGCEPGYPTPKCVRKCTDENQL-WRKAKRYG 232
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
+Y +SS+ IM E+Y++GPVE AFTV++D Y+SG +
Sbjct: 233 QSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAHYESGVY------------------- 273
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
Y +G +GGHA++++GWG + E YW++AN WN +WGD+
Sbjct: 274 ------------------RYTTGDVMGGHAVKLIGWGTTDDG-EDYWILANQWNRNWGDD 314
Query: 284 GLFKILRGKDECGIESSITAGVP 306
G F I RG +ECGIE + AG+P
Sbjct: 315 GYFMIRRGVNECGIEEGVVAGLP 337
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 20/25 (80%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GC+GG+P AWRY++ G+V+
Sbjct: 164 LCGSGCDGGYPLYAWRYFIHHGVVT 188
>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/191 (37%), Positives = 99/191 (51%), Gaps = 39/191 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G PSC TP+C R+CQ+ Y PY+ D ++G S +V NE
Sbjct: 187 GCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI +GPVE +F+D + YKSG
Sbjct: 247 SAIQKEIMMYGPVEAYLLIFEDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +G +G H +RI+GWG + + YWL AN+WN DWG+ G F+I+RG++
Sbjct: 276 ------IYRYTTGSFVGEHYVRIIGWGIENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 327
Query: 294 ECGIESSITAG 304
EC IES + AG
Sbjct: 328 ECSIESVVVAG 338
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/27 (70%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GGFPG AW YWV GIV+GG+
Sbjct: 155 CGSGCDGGFPGPAWDYWVSHGIVTGGS 181
>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
Length = 356
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 135/327 (41%), Gaps = 102/327 (31%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYN-----LPANRLPELIGYSEVDEDLPANFDSR 91
G K A SN + K +GV P +P P+L+ +LP FD+R
Sbjct: 55 GWKAALNPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLL-------ELPQEFDAR 107
Query: 92 TKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPSC-----DA 136
WPNC TI I DQG CGSCW E C H+ N C D
Sbjct: 108 VAWPNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLLACCGFLCGDG 167
Query: 137 SKGHTP-----KCVRE--------------------CQENYDVP------------YKKD 159
G P VR+ C+ Y P + K
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKS 227
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
+FG +Y +SS+ SIM E+Y++GPVE +FTV++D YKSG +
Sbjct: 228 KHFGVNAYMISSDPHSIMTELYKNGPVEVSFTVYEDFAHYKSGVY--------------- 272
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+ +G +GGHA++++GWG E E YWL+AN WN
Sbjct: 273 ----------------------KHVTGDVMGGHAVKLIGWGTSEDG-EDYWLLANQWNRG 309
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WGD+G FKI RG DEC IE + AG+P
Sbjct: 310 WGDDGYFKIRRGTDECEIEDEVVAGLP 336
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GC+GG+P AW+Y+V+ G+V+
Sbjct: 163 LCGDGCDGGYPLQAWKYFVRKGVVT 187
>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
Length = 342
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLRQKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
Length = 359
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 128/284 (45%), Gaps = 108/284 (38%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRPYEI------------ 120
LP FD+RT W C TI +I DQG CGSCW C +++
Sbjct: 103 LPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLAC 162
Query: 121 ------APCE------------HH-------------VNGTRPSCDASKGHTPKCVRECQ 149
A C+ HH + + P C+ + TPKCVR+C
Sbjct: 163 CGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY-QTPKCVRKCV 221
Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
+ + +K+ ++ K+Y V S+ + IM E+Y++GPVE A
Sbjct: 222 KGNQI-WKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVA-------------------- 260
Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
FTVF+D YKSG ALGGHA++++GWG
Sbjct: 261 ------------------------FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTS 296
Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
++ E YWL+AN WNT+WGD+G FKI RG +ECGIE +TAG+P
Sbjct: 297 DEG-EDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLP 339
>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
Length = 342
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + ++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRKEDPNLRQKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
Length = 350
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 139/328 (42%), Gaps = 105/328 (32%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPD-----YNLPANRLPELIGYSEVDEDLPANFDSR 91
G K + SN K +GV P N+P P+ I +LP FD+R
Sbjct: 51 GWKAGMNSRFSNHTVGQFKRLLGVLPTPRNFLENVPVITYPKGI-------NLPKQFDAR 103
Query: 92 TKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-VNGTRP-----SC-------- 134
WP C +++ I DQG CGSCW E C HH VN T +C
Sbjct: 104 EAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDG 163
Query: 135 ----------------------------DASKGH--------TPKCVRECQENYDVPYKK 158
DA H TP+CV++C++ + +
Sbjct: 164 CDGGYPISAWQYFISTGVVTAECDPYFDDAGCQHPGCEPLYPTPQCVKQCKDE-NQKWGN 222
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
F A +Y +SS IM E+Y +GPVE +F+V++D YKSG +
Sbjct: 223 SKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGVY-------------- 268
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
Y G +GGHA++++GWG ++ + YWL+ANSWNT
Sbjct: 269 -----------------------KYTKGDYMGGHAVKLVGWGTEDGT--DYWLVANSWNT 303
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
WG++G FKI RG +ECGIE + AG+P
Sbjct: 304 AWGEDGYFKIARGSNECGIEGDVVAGMP 331
>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
Length = 342
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 107/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSGESVFQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
Length = 342
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
Length = 357
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 128/284 (45%), Gaps = 108/284 (38%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRPYEI------------ 120
LP FD+RT W C TI +I DQG CGSCW C +++
Sbjct: 101 LPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLAC 160
Query: 121 ------APCE------------HH-------------VNGTRPSCDASKGHTPKCVRECQ 149
A C+ HH + + P C+ + TPKCVR+C
Sbjct: 161 CGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY-QTPKCVRKCV 219
Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
+ + +K+ ++ K+Y V S+ + IM E+Y++GPVE A
Sbjct: 220 KGNQI-WKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVA-------------------- 258
Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
FTVF+D YKSG ALGGHA++++GWG
Sbjct: 259 ------------------------FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTS 294
Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
++ E YWL+AN WNT+WGD+G FKI RG +ECGIE +TAG+P
Sbjct: 295 DEG-EDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLP 337
>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
Length = 342
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + ++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + ++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
Length = 342
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P I + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TIDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRKEDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
Length = 342
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 347
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/195 (38%), Positives = 101/195 (51%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY PCEHH+ C T C +CQ+ Y + Y D ++GA Y+V+ +
Sbjct: 192 GCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSISYNSDKHYGASVYAVAQD 251
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
SI KEI +GPVE AF V++D Y SG
Sbjct: 252 VASIQKEIMTNGPVEVAFDVYEDFEHYSSG------------------------------ 281
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ + +G LGGHA+++LGWG + + YW+ ANSWN+DWG+NG F+ILRG
Sbjct: 282 -------IYKHTTGDYLGGHAVKMLGWGTENGTD--YWICANSWNSDWGENGFFRILRGV 332
Query: 293 DECGIESSITAGVPK 307
DEC IESS+ AG PK
Sbjct: 333 DECQIESSVVAGEPK 347
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 29/52 (55%), Gaps = 5/52 (9%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK-----NSLSNIPRAHLK 55
CGFGC+GG P AW YWV +GIV+G Y SK K +IP H K
Sbjct: 160 CGFGCDGGDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYK 211
>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC I+S I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIDSEIAAGLIK 341
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRKEDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 99/191 (51%), Gaps = 39/191 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G PSC TP+C R+CQ+ Y PY+ D ++G + +V NE
Sbjct: 187 GCQPYPFPKCEHHSIGKYPSCGDKMYKTPQCKRKCQKGYTTPYEHDKHYGGIAINVIKNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI +GPVE +F+D + YKSG
Sbjct: 247 LAIQKEIMMYGPVEAYLLIFEDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +G +G H +RI+GWG + + YWL AN+WN DWG+ G F+I+RG++
Sbjct: 276 ------IYKYTTGSFVGEHYVRIIGWGIENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 327
Query: 294 ECGIESSITAG 304
EC IES + AG
Sbjct: 328 ECSIESVVVAG 338
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + ++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCGSSWA 120
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/27 (70%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GGFPG AW YWV GIV+GG+
Sbjct: 155 CGSGCDGGFPGPAWDYWVSHGIVTGGS 181
>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
Length = 249
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 100/202 (49%), Gaps = 54/202 (26%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GCRPY PCEHH N T C TPKCV++C +NY YK D +G Y+V SN
Sbjct: 86 GCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKADKYYGQSVYNVESN 145
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
+SI KEI GPVE +
Sbjct: 146 VESIQKEIMTLGPVEAS------------------------------------------- 162
Query: 233 AFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F V+ D + Y +G GGHA+++LGWG D+ YWL ANSWNTDWG++G
Sbjct: 163 -FEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVP--YWLAANSWNTDWGEDGY 219
Query: 286 FKILRGKDECGIESSITAGVPK 307
F+ILRG +ECGIES I AG+PK
Sbjct: 220 FRILRGVNECGIESGIIAGIPK 241
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 21/30 (70%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
+ CGFGC GG P AW+YWV GIV+G Y
Sbjct: 52 KTCGFGCFGGEPMAAWKYWVLRGIVTGSEY 81
>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
Length = 342
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 102/194 (52%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY C+H V G +C TP+C + CQ+ Y+ Y++D ++G SY+V S E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I K+I HGPVE +++D + YKSG
Sbjct: 247 SVIQKDIMMHGPVEAYLEIYEDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +GK + GHA+R++GWG + + YWL AN+WN DWG+ G F+I+RG++
Sbjct: 276 ------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 327
Query: 294 ECGIESSITAGVPK 307
EC IES I AG+ K
Sbjct: 328 ECLIESEIAAGLIK 341
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GG+ +W YWV GIV+GG+
Sbjct: 155 CGSGCDGGYFLPSWDYWVSHGIVTGGS 181
>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
Length = 386
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 95/304 (31%), Positives = 137/304 (45%), Gaps = 100/304 (32%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------- 113
+L +LP I D DLP FD+R KWP CP++REIRDQG CGSCW
Sbjct: 106 DLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDR 165
Query: 114 ---------------------------GCRPYEIAPC-----EHHVNGTRPSCDASKGHT 141
GCR + P E ++ P ++ +G
Sbjct: 166 WCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGP-LNSRQGCH 224
Query: 142 PKCVRECQ---ENYDVP--------------YKKDLNFGAKSYSVSSNEKSIMKEIYEHG 184
P + EC+ E+ D P +D ++G +YS+ ++E+ IM+EI+ +G
Sbjct: 225 PYPIGECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFING 284
Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
PV+ AF + DL YKSG + +
Sbjct: 285 PVQAAFHTYLDLHAYKSG-------------------------------------IYRHV 307
Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
G GGHA+++LGWG + + KYWL+ANSW +WG+NG FKI+RG++ CGIE +I AG
Sbjct: 308 WGPLSGGHAVKLLGWGVE--NGVKYWLVANSWGREWGENGFFKIVRGENHCGIEENIHAG 365
Query: 305 VPKL 308
+P
Sbjct: 366 LPNF 369
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG G AW++WV+ G+ SGG S+Q
Sbjct: 190 CGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQ 221
>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + ++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
Length = 350
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 100/325 (30%), Positives = 140/325 (43%), Gaps = 99/325 (30%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPEL--IGYSEVDEDLPANFDSRTKW 94
G K + SN K +GV P P N L + I Y + +LP FD+R W
Sbjct: 51 GWKAGMNSRFSNHTVGQFKRLLGVLP---TPRNFLENVPVITYPK-GMNLPKQFDAREAW 106
Query: 95 PNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-VNGTRP-----SC----------- 134
P C +++ I DQG CGSCW E C HH VN T +C
Sbjct: 107 PQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDGCDG 166
Query: 135 -------------------------DASKGH--------TPKCVRECQENYDVPYKKDLN 161
DA H TP+CV++C++ + +
Sbjct: 167 GYPISAWQYFISTGVVTAECDPYFDDAGCQHPGCEPLYPTPQCVKQCKDE-NQKWGNSKR 225
Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
F A +Y +SS IM E+Y +GPVE +F+V++D YKSG +
Sbjct: 226 FSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGVY----------------- 268
Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
Y G +GGHA++++GWG ++ + YWL+ANSWNT WG
Sbjct: 269 --------------------KYTKGDYMGGHAVKLVGWGTEDGT--DYWLVANSWNTAWG 306
Query: 282 DNGLFKILRGKDECGIESSITAGVP 306
++G FKI RG +ECGIE + AG+P
Sbjct: 307 EDGYFKIARGSNECGIEGDVVAGMP 331
>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/194 (37%), Positives = 103/194 (53%), Gaps = 40/194 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
C Y CEHH G P C S+ TP+CV++CQE Y V Y+KD +F ++Y V
Sbjct: 167 CNAYSFPKCEHHAEGKYPPCGESQ-ETPECVKQCQEGYPVEYEKDKHFFGEAYYVQGGID 225
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I E+ +GP+E +F V++D + YKSG
Sbjct: 226 AIKTELMTNGPLEVSFFVYEDFLTYKSG-------------------------------- 253
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ + +GK LGGHA++++GWG ++ +YW IANSWN DWG+NG F+I+ GK E
Sbjct: 254 -----IYQHVAGKYLGGHAVKLVGWGVEDGI--EYWKIANSWNEDWGENGYFRIVAGKGE 306
Query: 295 CGIESSITAGVPKL 308
CGIE G+PKL
Sbjct: 307 CGIEVGPIGGIPKL 320
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 43/92 (46%), Gaps = 16/92 (17%)
Query: 49 IPRAHLKSWMGV-HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQG 107
IP ++GV D LP+ + DLP +FD KWP CP+++EIRDQ
Sbjct: 40 IPTRDYTQYLGVLFGDRQLPSKTIV-------ARGDLPESFDPVEKWPECPSLKEIRDQS 92
Query: 108 SCGSCWGCRPYEIAPCEHHVNGTRPSCDASKG 139
CGSCW E A T C ASKG
Sbjct: 93 VCGSCWAFGAAEAA--------TDRLCIASKG 116
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 25/31 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CGFGC+GG+ MAWR++ +G+ +GG YGSK
Sbjct: 134 CGFGCDGGWLDMAWRWFQSTGVTTGGEYGSK 164
>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 98/191 (51%), Gaps = 39/191 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G PSC TP+C R+CQ+ Y PY+ D ++G S +V NE
Sbjct: 187 GCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I EI +GPVE +F+D + YKSG
Sbjct: 247 SAIQNEIMMYGPVEAYLLIFEDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +G +G H +RI+GWG + + YWL AN+WN DWG+ G F+I+RG++
Sbjct: 276 ------IYRYTTGSFVGEHYVRIIGWGIENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 327
Query: 294 ECGIESSITAG 304
EC IES + AG
Sbjct: 328 ECSIESVVVAG 338
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/27 (70%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GGFPG AW YWV GIV+GG+
Sbjct: 155 CGSGCDGGFPGPAWDYWVSHGIVTGGS 181
>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
Length = 342
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + ++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRKEDPNLRQKRRPT-VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
Length = 359
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 91/284 (32%), Positives = 127/284 (44%), Gaps = 108/284 (38%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRPYEI------------ 120
LP FD+R W C TI +I DQG CGSCW C +++
Sbjct: 103 LPKEFDARAAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCSHFDMNISLSVNDLLAC 162
Query: 121 ------APCE------------HH-------------VNGTRPSCDASKGHTPKCVRECQ 149
A C+ HH + + P C+ + TPKCVR+C
Sbjct: 163 CGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY-QTPKCVRKCV 221
Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
+ + +K+ ++ K+Y V S+ + IM E+Y++GPVE A
Sbjct: 222 KGNQI-WKRSKHYSVKAYRVKSDPQDIMTEVYKNGPVEVA-------------------- 260
Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
FTVF+D YKSG ALGGHA++++GWG
Sbjct: 261 ------------------------FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTS 296
Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
++ E YWL+AN WNT+WGD+G FKI RG +ECGIE +TAG+P
Sbjct: 297 DEG-EDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLP 339
>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
Length = 342
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 101/194 (52%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY C+H V G +C TP+C + CQ+ Y+ Y++D ++G SYSV E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI +GPVE +++D + YKSG
Sbjct: 247 SAIQKEIMMYGPVEAYLQIYEDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +GK + GHA+R++GWG + + YWL AN+WN DWG+ G F+I+RG+D
Sbjct: 276 ------IYRYTTGKYISGHAVRLIGWGVENGT--SYWLAANTWNEDWGEKGYFRIVRGRD 327
Query: 294 ECGIESSITAGVPK 307
EC IES I AG K
Sbjct: 328 ECLIESFIVAGQIK 341
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GG G +W YWVK GIV+GG+
Sbjct: 155 CGSGCDGGVTGYSWDYWVKHGIVTGGS 181
>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRKEDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLRQKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRKEDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
Length = 342
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 101/194 (52%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY C+H V G +C TP+C + CQ+ Y+ Y++D ++G SYSV E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI +GPVE +++D + YKSG
Sbjct: 247 SAIQKEIMMYGPVEAYLEIYEDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +GK + GHA+R++GWG + + YWL AN+WN DWG+ G F+I+RG+D
Sbjct: 276 ------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNEDWGEKGYFRIVRGRD 327
Query: 294 ECGIESSITAGVPK 307
EC IES I AG K
Sbjct: 328 ECLIESFIVAGQIK 341
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GG G +W YWVK GIV+GG+
Sbjct: 155 CGSGCDGGVTGYSWDYWVKHGIVTGGS 181
>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
Length = 342
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRKEDPNLRQRRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GGF G +W YWV GIV+GG+
Sbjct: 155 CGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 132/315 (41%), Gaps = 96/315 (30%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIR 104
+N K +GV P P L + I DLP FD+RT+W +C TI I
Sbjct: 65 FANYTIEQFKHILGVKP---TPPGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNIL 121
Query: 105 DQGSCGSCWGCRPYEIAP-------------------------CEHHVNGTRP------- 132
DQG CG+CW E C NG P
Sbjct: 122 DQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRYF 181
Query: 133 --------SCDASKGHT-------------PKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
CD T PKC R+C+ V +KK+ + +Y V S
Sbjct: 182 RRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCKVENQV-WKKNKHSSVNAYRVHS 240
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
N IM E+Y++GPVE AFTV++D YKSG +
Sbjct: 241 NPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 273
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ +G +GGHA++++GWG + + E YWL+AN WN WG +G FKI+RG
Sbjct: 274 ----------KHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGGDGYFKIIRG 322
Query: 292 KDECGIESSITAGVP 306
K+ECGIE +TAG+P
Sbjct: 323 KNECGIEEDVTAGMP 337
Score = 40.8 bits (94), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GCNGG+P AWRY+ +SG+V+
Sbjct: 164 LCGSGCNGGYPISAWRYFRRSGVVT 188
>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
Length = 386
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 137/304 (45%), Gaps = 100/304 (32%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------- 113
+L +LP I D DLP FD+R KWP CP++REIRDQG CGSCW
Sbjct: 106 DLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDR 165
Query: 114 ---------------------------GCRPYEIAPC-----EHHVNGTRPSCDASKGHT 141
GCR + P E ++ P ++ +G
Sbjct: 166 WCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGP-LNSRQGCH 224
Query: 142 PKCVRECQ---ENYDVP--------------YKKDLNFGAKSYSVSSNEKSIMKEIYEHG 184
P + EC+ E+ D P +D ++G +YS+ ++E+ IM+EI+ +G
Sbjct: 225 PYPIGECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFING 284
Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
PV+ AF + DL YKSG + +
Sbjct: 285 PVQAAFHTYLDLHAYKSG-------------------------------------IYRHV 307
Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
G GGHA+++LGWG + + KYWL+ANSW +WG+NG FK++RG++ CGIE +I AG
Sbjct: 308 WGPLSGGHAVKLLGWGVE--NGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365
Query: 305 VPKL 308
+P
Sbjct: 366 LPNF 369
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG G AW++WV+ G+ SGG S+Q
Sbjct: 190 CGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQ 221
>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
Length = 386
Score = 137 bits (344), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 137/304 (45%), Gaps = 100/304 (32%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------- 113
+L +LP I D DLP FD+R KWP CP++REIRDQG CGSCW
Sbjct: 106 DLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDR 165
Query: 114 ---------------------------GCRPYEIAPC-----EHHVNGTRPSCDASKGHT 141
GCR + P E ++ P ++ +G
Sbjct: 166 WCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGP-LNSRQGCH 224
Query: 142 PKCVRECQ---ENYDVP--------------YKKDLNFGAKSYSVSSNEKSIMKEIYEHG 184
P + EC+ E+ D P +D ++G +YS+ ++E+ IM+EI+ +G
Sbjct: 225 PYPIGECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFING 284
Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
PV+ AF + DL YKSG + +
Sbjct: 285 PVQAAFHTYLDLHAYKSG-------------------------------------IYRHV 307
Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
G GGHA+++LGWG + + KYWL+ANSW +WG+NG FK++RG++ CGIE +I AG
Sbjct: 308 WGPLSGGHAVKLLGWGVE--NGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365
Query: 305 VPKL 308
+P
Sbjct: 366 LPNF 369
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG G AW++WV+ G+ SGG S+Q
Sbjct: 190 CGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQ 221
>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
Length = 309
Score = 136 bits (343), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 139 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 198
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 199 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 242
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 243 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 279
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 280 DWGEKGYFRIVRGRNECLIESEIAAGLIK 308
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 33 GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 87
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 120 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 148
>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 136 bits (343), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 82/217 (37%), Positives = 107/217 (49%), Gaps = 55/217 (25%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T+ I G+ GC+ Y APCEHHV+G P C +K TP C +EC + Y+
Sbjct: 167 TVNGIVTGGNYEDTNGCKAYSFAPCEHHVDGDLPPCGPTKP-TPDCKKECDSGSSLTYQN 225
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
DL G+ +Y + K I EI +GPVE +
Sbjct: 226 DLTHGS-NYGIDPYPKQIQTEIMTNGPVEAS----------------------------- 255
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
F+V++D + YKSG + GGHAI+ILGWG + + YWL
Sbjct: 256 ---------------FSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVENDTP--YWL 298
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+ANSWN DWGD G FKILRG +ECGIE SI AG+P+L
Sbjct: 299 VANSWNEDWGDKGYFKILRGSNECGIEGSIVAGIPEL 335
>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 107/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGRIK 341
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/209 (34%), Positives = 107/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + CRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTSCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
Length = 333
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/193 (40%), Positives = 100/193 (51%), Gaps = 43/193 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNGTRP C G TP+C+ +C+ Y YK D ++G SYSV S+E
Sbjct: 176 GCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHYGKSSYSVPSDE 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EIY++GPVEGAFTV++D +LYK+G +
Sbjct: 236 EQIQSEIYKNGPVEGAFTVYEDFLLYKTGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G A+GGHAI+ W +E L +TDWGD G D
Sbjct: 267 --------QHMTGSAVGGHAIK--SWLGEEVCS---LLALCHSDTDWGDMVSLSS-AGSD 312
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 313 HCGIESEIVAGIP 325
>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
Length = 341
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/209 (36%), Positives = 104/209 (49%), Gaps = 53/209 (25%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GC+PY I PC+H NG+RP C G +C C+ +Y V +++D NF +K
Sbjct: 179 GDYGSQQGCQPYTIEPCDHSGNGSRPVCTVGGG--VRCQHLCEPSYKVDFQRDKNFASKV 236
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YS+S++ I KEI +GPV+ T
Sbjct: 237 YSISNDVLEIQKEIMTNGPVQAILT----------------------------------- 261
Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
V++D + YK+G + +G HA+RILGWG K YWL+ANSW +D
Sbjct: 262 ---------VYEDFLSYKTGVYYHLEGEKVGPHAVRILGWGVWGTKKVPYWLVANSWGSD 312
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
WGDNG F I RG++ C IE I AG+PKL
Sbjct: 313 WGDNGFFHIFRGENHCDIEGYIMAGLPKL 341
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 48/76 (63%), Gaps = 4/76 (5%)
Query: 43 KNSLSNIPRAHLKSWMGVHPD-YNLPANRLPELIGYSEVD---EDLPANFDSRTKWPNCP 98
+N +I +L+ MGVH + Y P E++G S+ + DLP +FD+R +W +CP
Sbjct: 44 RNFHESISEKYLRGLMGVHEESYKYPLPDKQEVLGESDDEISLADLPVDFDARLRWTSCP 103
Query: 99 TIREIRDQGSCGSCWG 114
TI EIR+QGSCGSCW
Sbjct: 104 TISEIREQGSCGSCWA 119
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 26/33 (78%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
+CGF C GG+PG AW YW + G+VSGG YGS+Q
Sbjct: 153 ICGFACQGGYPGAAWAYWARKGLVSGGDYGSQQ 185
>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
Length = 342
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 107/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
Length = 342
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 107/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 94/315 (29%), Positives = 139/315 (44%), Gaps = 97/315 (30%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
S+ + K +GV R P + E++ LP FD+RT WP C +I +I D
Sbjct: 60 FSDFTVSQFKRLLGVKKAPKSLLKRTPVVTHSKEIE--LPKTFDARTAWPQCLSIADILD 117
Query: 106 QGSCGSCW-------------------------------------GCR-PYEIAP----- 122
QG CGSCW GC Y IA
Sbjct: 118 QGHCGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFK 177
Query: 123 --------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
C+ + + T P C+ + TP C ++C + ++ + + +F +Y V+S
Sbjct: 178 RTGVVTSECDPYFDQTGCSHPGCEPAYP-TPACEKKCVKK-NLLWSESKHFSVNAYRVNS 235
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
++ SIM E+Y +GP E +FTV++D YKSG +
Sbjct: 236 DQHSIMTEVYTNGPAEVSFTVYEDFAHYKSGVY--------------------------- 268
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ +G +GGHA++++GWG E E YWL+AN WN WGD+G FKI+RG
Sbjct: 269 ----------KHVTGSEMGGHAVKLIGWGTSEDG-EDYWLLANQWNRSWGDDGYFKIIRG 317
Query: 292 KDECGIESSITAGVP 306
+ECGIE +TAG+P
Sbjct: 318 TNECGIE-DVTAGMP 331
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 13/25 (52%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GC+GG+P AW+Y+ ++G+V+
Sbjct: 159 LCGEGCDGGYPIAAWQYFKRTGVVT 183
>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 135/331 (40%), Gaps = 110/331 (33%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G K + SN A K +GV P +P + + LP FD+RT WP
Sbjct: 56 GWKATMNHHFSNYTVAQFKYLLGVKPTPKEELRGIPVISHPKSLR--LPEEFDARTAWPQ 113
Query: 97 CPTIREIRDQGSCGSCWGCRPYE---------------------IAPC------------ 123
C TI +I DQG CGSCW E +A C
Sbjct: 114 CSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYGMNISLSVNDLLACCGFLCGSGCNGGY 173
Query: 124 --------EHH-------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
HH + + P C+ TPKC R+C N + +KK ++
Sbjct: 174 PISAWRYFVHHGVVTEECDPYFDDIGCSHPGCEPGYP-TPKCARKCV-NKNQLWKKSKHY 231
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G K Y + S+ +SIM EIY++GPVE A
Sbjct: 232 GVKPYRIDSDPESIMAEIYKNGPVEVA--------------------------------- 258
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIANS 275
FTV++D YKSG +GGHA++++GWG E E YWL+AN
Sbjct: 259 -----------FTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDG-EAYWLLANQ 306
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
WN WGD+G FKI RG +ECGIE + AG+P
Sbjct: 307 WNRGWGDDGYFKIRRGTNECGIEGDVVAGLP 337
Score = 40.8 bits (94), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 20/25 (80%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GCNGG+P AWRY+V G+V+
Sbjct: 164 LCGSGCNGGYPISAWRYFVHHGVVT 188
>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
Length = 398
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/198 (36%), Positives = 103/198 (52%), Gaps = 41/198 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
GC+PY PCEHH T C TPKC + C Y D Y +D +G+ +Y V
Sbjct: 217 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKRCNAEYTDKTYSEDKFYGSSAYGVKD 276
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ ++I KE+ HGP+E AF V++D + Y G +
Sbjct: 277 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 309
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
++ GK GGHA++++GWG ++ YW +ANSWNTDWG++G F+ILRG
Sbjct: 310 ----------VHTGGKLGGGHAVKLIGWGIEDGIP--YWTVANSWNTDWGEDGFFRILRG 357
Query: 292 KDECGIESSITAGVPKLD 309
DECGIES + G+PKL+
Sbjct: 358 VDECGIESGVVGGIPKLN 375
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 27/36 (75%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++D D+P +FDSR WP C +I+ IRDQ SCGSCW
Sbjct: 115 DLDMDIPESFDSRENWPKCESIKAIRDQSSCGSCWA 150
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/30 (70%), Positives = 23/30 (76%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
R CGFGCNGG P AWRYWVK GIV+G +
Sbjct: 183 RSCGFGCNGGDPLAAWRYWVKDGIVTGSNF 212
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
Length = 347
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/195 (38%), Positives = 103/195 (52%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGH-TPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY IAPCEHH+ G++P+C AS TP C C + Y+KD G +Y V
Sbjct: 189 GCQPYPIAPCEHHMEGSKPNCSASPTEPTPACETTCTHGSSLAYQKDRQKGKSAYLVPVG 248
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
EK EI+++GP+ AF V++D +YKSG + + E
Sbjct: 249 EKQTQLEIFKNGPIVAAFKVYEDFFMYKSGVY----------------------KRHPES 286
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
F G HA++++GWG E++ YWL+ NSW+ DWGD GLFKI RG
Sbjct: 287 PFR--------------GRHAVKVIGWG--EQNGLPYWLVQNSWDYDWGDKGLFKIARG- 329
Query: 293 DECGIESSITAGVPK 307
+EC E S+TAG+PK
Sbjct: 330 NECDFEKSMTAGLPK 344
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 48/91 (52%), Gaps = 22/91 (24%)
Query: 44 NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDL------------------- 84
++++N P++ K+ HPD P + L L+G SE++ +L
Sbjct: 34 DAINNNPKSTWKAGHNFHPD--TPMSYLQGLLGVSELESNLADLDKYEEMEENEENKKIK 91
Query: 85 -PANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
P FD+R KW C ++REIRDQG+CGSCW
Sbjct: 92 VPKYFDARKKWKKCKSLREIRDQGNCGSCWA 122
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGC GGFP AW + + G+V+GG Y S
Sbjct: 157 CGFGCEGGFPDAAWVFIKRHGLVTGGDYHS 186
>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
Length = 386
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 136/304 (44%), Gaps = 100/304 (32%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------- 113
+L +LP I D DLP FD+R KWP CP++REIRDQG CGSCW
Sbjct: 106 DLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDR 165
Query: 114 ---------------------------GCRPYEIAPC-----EHHVNGTRPSCDASKGHT 141
GCR + P E ++ P ++ +G
Sbjct: 166 WCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGP-LNSRQGCH 224
Query: 142 PKCVRECQ---ENYDVP--------------YKKDLNFGAKSYSVSSNEKSIMKEIYEHG 184
P + EC+ E+ D P +D + G +YS+ ++E+ IM+EI+ +G
Sbjct: 225 PYPIGECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHIGRVAYSLPNDERKIMEEIFING 284
Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
PV+ AF + DL YKSG + +
Sbjct: 285 PVQAAFHTYLDLHAYKSG-------------------------------------IYRHV 307
Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
G GGHA+++LGWG + + KYWL+ANSW +WG+NG FK++RG++ CGIE +I AG
Sbjct: 308 WGPLSGGHAVKLLGWGVE--NGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365
Query: 305 VPKL 308
+P
Sbjct: 366 LPNF 369
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG G AW++WV+ G+ SGG S+Q
Sbjct: 190 CGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQ 221
>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/209 (34%), Positives = 107/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGP E +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +G+ + GHA+R++GWG + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + +++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRKEDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 105/299 (35%), Positives = 131/299 (43%), Gaps = 42/299 (14%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGV--HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI A + G +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFAEARRLTGAFRRKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ +CGSCW H G S H C ++C + D Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCKDCGDGCDGGY 162
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY--EHGPVEGAFTVFDDLILYKSGRFFVPGNETT-- 212
A Y VS S + Y H G Y F P TT
Sbjct: 163 PDS----AWEYYVSHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYD---FHTPKCNTTCT 215
Query: 213 --AMSLIKWTIRDNTSQLGAEG--------------AFTVFDDLILYK-------SGKAL 249
A+ LIK+ D+ L E AF V+ D + YK SG L
Sbjct: 216 DKAIPLIKYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFL 275
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
GGHA+RI+GWG+ + YW IANSW+TDWG NG F ILRG +ECGIES+ AG+P +
Sbjct: 276 GGHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLPAI 332
>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 232
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/195 (37%), Positives = 100/195 (51%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY PCEHH+ C T C +CQ+ Y + Y D ++GA Y+V+ +
Sbjct: 77 GCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSISYNSDKHYGASVYAVAQD 136
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
SI KEI +GPVE AF V++D Y SG
Sbjct: 137 VASIQKEIMTNGPVEVAFDVYEDFEHYSSG------------------------------ 166
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ + +G LGGHA+++LGWG + + YW+ ANSWN+DWG+NG F+ILRG
Sbjct: 167 -------IYKHTTGDYLGGHAVKMLGWGTENGTD--YWICANSWNSDWGENGFFRILRGV 217
Query: 293 DECGIESSITAGVPK 307
DEC IES + AG PK
Sbjct: 218 DECEIESGVVAGEPK 232
Score = 44.7 bits (104), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 28/52 (53%), Gaps = 5/52 (9%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK-----NSLSNIPRAHLK 55
CGFGC+G P AW YWV +GIV+G Y SK K +IP H K
Sbjct: 45 CGFGCDGRDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYK 96
>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 333
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 94/258 (36%), Positives = 126/258 (48%), Gaps = 43/258 (16%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPK 143
+P FD+R KW +CP+I +IRDQGSCGSCW E A + + + + S +
Sbjct: 82 IPDTFDARQKWSDCPSISDIRDQGSCGSCWALGAVE-AMSDRYCVSFQENVHISAENLMT 140
Query: 144 CVREC---------QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEH--GPVE----- 187
C + C Q+ ++ K L G + S + ++ + H GP E
Sbjct: 141 CCKFCGNGCAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLIPKCNHHEPGPYENCTGE 200
Query: 188 -----------GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
+T + L+ + + E A I+ I N EGAFTV
Sbjct: 201 GKTPQCERTCRSGYTTSYEADLHYGEKAYAVHREVEA---IQTEIMTNGP---VEGAFTV 254
Query: 237 FDDLILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
+ D YKS G ALGGHAIRILGWG + + YWLIANSWN WGD G FK++
Sbjct: 255 YSDFPTYKSGVYQHVVGHALGGHAIRILGWGTE--NGVPYWLIANSWNPSWGDKGYFKMI 312
Query: 290 RGKDECGIESSITAGVPK 307
RGKD+CGIES+I AG PK
Sbjct: 313 RGKDDCGIESNIVAGTPK 330
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAH 53
+ CG GC GGF AW YWVK G+V+GG YGS + + L IP+ +
Sbjct: 143 KFCGNGCAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYL--IPKCN 187
>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
Length = 387
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/198 (37%), Positives = 104/198 (52%), Gaps = 41/198 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
GC+PY PCEHH T C TPKC ++C +Y D Y +D +GA +Y V
Sbjct: 202 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGASAYGVKD 261
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ ++I KE+ HGP+E AF V++D + Y G +
Sbjct: 262 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 294
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
++ GK GGHA++++GWG + + YW ANSWNTDWG++G F+ILRG
Sbjct: 295 ----------VHTGGKLGGGHAVKLVGWGIE--NGIPYWTCANSWNTDWGEDGFFRILRG 342
Query: 292 KDECGIESSITAGVPKLD 309
DECGIES + GVPKL+
Sbjct: 343 VDECGIESGVVGGVPKLN 360
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 27/36 (75%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++D D+P NFDSR WP C +IR IRDQ SCGSCW
Sbjct: 100 DLDMDIPENFDSRENWPKCQSIRNIRDQSSCGSCWA 135
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/37 (62%), Positives = 25/37 (67%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
R CGFGCNGG P AWRYWVK GIV+G Y + K
Sbjct: 168 RSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANSGCK 204
>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
Length = 347
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 134/315 (42%), Gaps = 96/315 (30%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIR 104
SN A K +GV P P N L + + +LP FD+R+ W C TI I
Sbjct: 57 FSNYTIAQFKHILGVKP---APQNALSNVPVKTYSRSLELPKEFDARSAWSRCSTIGNIL 113
Query: 105 DQGSCGSCWGCRPYEIAP---CEH-------HVNGTRPSC-----DASKGHTP------- 142
DQG CGSCW E C H VN C D G P
Sbjct: 114 DQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYF 173
Query: 143 -------------------------------KCVRECQENYDVPYKKDLNFGAKSYSVSS 171
KC ++C+E V +++ +F +Y ++S
Sbjct: 174 VQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQV-WQEKKHFSIDAYRINS 232
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ IM E+Y++GPVE AFTV++D YKSG +
Sbjct: 233 DPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 265
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ +G +GGHA++++GWG + + E YWL+AN WN WGD+G FKI+RG
Sbjct: 266 ----------KHITGGIMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIRG 314
Query: 292 KDECGIESSITAGVP 306
K+ECGIE + AG+P
Sbjct: 315 KNECGIEEGVVAGMP 329
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 22/25 (88%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG+P AWRY+V++G+V+
Sbjct: 156 MCGDGCDGGYPIEAWRYFVQNGVVT 180
>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
Length = 332
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 121/283 (42%), Gaps = 96/283 (33%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGS-----------------CGSCWGCR-----PYEIA 121
+P FD+RTKWP C +I+ IR+Q + C + G R P ++
Sbjct: 87 IPETFDARTKWPKCKSIKLIRNQANCGSCWAFGAAEVISDRICIATKGARQPVISPMDMV 146
Query: 122 PC------------------------------EHHVNGTRP-----SCDASKGHTPKCVR 146
C ++ +G +P S TP+C
Sbjct: 147 DCCGEYCGYGCDGGYSIQALRWWVFDGVVTGGDYQGDGCKPYQFCNSAGCPDAVTPECAL 206
Query: 147 ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFV 206
CQ Y+ Y KD NFG +Y V +I +I +GPVE +F V++D YKSG
Sbjct: 207 SCQSKYNTEYAKDKNFGTSAYYVGMTVNAIQTDIMTNGPVEASFKVYEDFYKYKSG---- 262
Query: 207 PGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK 266
+ Y +GK LGGHAI+I+GWG + +
Sbjct: 263 ---------------------------------VYKYIAGKMLGGHAIKIIGWGTENGTA 289
Query: 267 EKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
YWLIANSW T WG+NG FKI RG +ECGIE+++ AG +D
Sbjct: 290 --YWLIANSWGTKWGENGFFKIRRGVNECGIENNVVAGKADVD 330
Score = 38.9 bits (89), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG+GC+GG+ A R+WV G+V+GG Y
Sbjct: 153 CGYGCDGGYSIQALRWWVFDGVVTGGDY 180
>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
Length = 339
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 90/300 (30%), Positives = 133/300 (44%), Gaps = 111/300 (37%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG GC GG+P AW YW++ GIV+GG + + R + WM D+
Sbjct: 151 CGQGCRGGYPPKAWDYWMREGIVTGGTWEN------------RTGCQPWMFTKCDH---- 194
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
+G DSR K+ CP H+
Sbjct: 195 ------VG------------DSR-KYSRCP--------------------------HYTY 209
Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
T P C R CQ Y+ Y++D +G SY+V +E IM+EI ++GPVE
Sbjct: 210 PTPP-----------CARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEV 258
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
F +F D +Y+SG + + +GK
Sbjct: 259 TFAIFQDFGVYRSG-------------------------------------IYHHVAGKF 281
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+G HA+R++GWG + + YWL+ANSWN +WG+NG F+++RG++ECGIES + AG+P+L
Sbjct: 282 IGRHAVRMIGWGVE--NGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAGMPRL 339
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 40/76 (52%), Gaps = 2/76 (2%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
K A SN+ H K +G + N L I + DLP +FD+R++WP C
Sbjct: 43 KAARSTRFSNVD--HFKLHLGALSETPEERNALRPTIKHDISKNDLPESFDARSQWPQCW 100
Query: 99 TIREIRDQGSCGSCWG 114
TI EIRDQ SCGSCW
Sbjct: 101 TISEIRDQASCGSCWA 116
>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
Length = 319
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 73/185 (39%), Positives = 96/185 (51%), Gaps = 40/185 (21%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GCRPY PCEHH N T C TPKC ++C +NY YK D +G ++Y+V ++
Sbjct: 174 GCRPYPFPPCEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVEND 233
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
+SI KEI GPVE +F V+ D + Y SG
Sbjct: 234 VESIQKEIMTLGPVEASFEVYTDFLHYTSG------------------------------ 263
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ + +G GGHA++ILGWG D+ YWL ANSWN DWG++G F+ILRG
Sbjct: 264 -------IYKHVAGSVGGGHAVKILGWGIDQGV--SYWLAANSWNNDWGEDGYFRILRGA 314
Query: 293 DECGI 297
DECG+
Sbjct: 315 DECGM 319
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE 119
+P +FD+R WP C ++R IRDQ SCGSCW E
Sbjct: 77 IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVE 112
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
+ CGFGC GG P AW+YWV SGIV+G Y
Sbjct: 140 KTCGFGCFGGEPMAAWKYWVLSGIVTGSDY 169
>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 105/298 (35%), Positives = 127/298 (42%), Gaps = 41/298 (13%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI A + G + +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFAEARRLTGARIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ +CGSCW H G S H C ++C D Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCKDCGYGCDGGY 162
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY--EHGPVEGAFTVFDDLILYKSGRFFVPGNETT-- 212
A Y VS S + Y H G Y F P TT
Sbjct: 163 PD----AAWRYYVSHGLASSYCQPYPFPHCDHHGGKGKKPPCSKYD---FHTPKCNTTCT 215
Query: 213 --AMSLIKWTIRDNTSQLGAEG-------------AFTVFDDLILYK-------SGKALG 250
A+ LIK+ + G E AF V+ D YK SG LG
Sbjct: 216 DKAIPLIKYRGNHSYEVHGEEDYKRELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLG 275
Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
GHA+RI+GWG+ + YW IANSW+TDWG NG F ILRGKDECGIE AG P +
Sbjct: 276 GHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHFLILRGKDECGIEHQGYAGSPAI 331
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 14/24 (58%), Positives = 19/24 (79%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVS 32
CG+GC+GG+P AWRY+V G+ S
Sbjct: 154 CGYGCDGGYPDAAWRYYVSHGLAS 177
>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
Length = 340
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 100/195 (51%), Gaps = 40/195 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPS-CDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY PC +H N C TPKC + CQ Y+ Y +D F +SY + SNE
Sbjct: 185 CKPYAFYPCGNHTNERYYGPCPRGLWPTPKCRKACQRKYNKSYNEDKYFATRSYYLPSNE 244
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+SI +EIY++GPV AF V+ D Y+ G
Sbjct: 245 RSIREEIYKNGPVVAAFKVYQDFSYYRGG------------------------------- 273
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ ++K G G HA++++GWG + + YWLIANSWNTDWG+NG F+I RG +
Sbjct: 274 ------IYVHKWGGQTGAHAVKVVGWGRENGTD--YWLIANSWNTDWGENGYFRIARGSN 325
Query: 294 ECGIESSITAGVPKL 308
ECGIE + +GV ++
Sbjct: 326 ECGIEGQMVSGVMRV 340
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 5/44 (11%)
Query: 71 LPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
L E+ G +D P +FD+R WP C +I IRDQ +CGSCW
Sbjct: 79 LTEVFG-----DDPPDSFDARAHWPECRSIGTIRDQSACGSCWA 117
>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
Length = 351
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 96/322 (29%), Positives = 137/322 (42%), Gaps = 110/322 (34%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
SN K +GV + P + + LP +FD+RT W C TI I D
Sbjct: 59 FSNFTVGQFKRLLGVKQTPRSELSSAPVVTHPKSLK--LPKDFDARTAWSQCSTIGRILD 116
Query: 106 QGSCGSCW-------------------------------------GC---RPY------- 118
QG CGSCW GC P+
Sbjct: 117 QGHCGSCWAFGAVESLSDRFCIHFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWIYLA 176
Query: 119 -------EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
E P + + P C+ + TPKCV++C N + ++ ++ K+Y+V+S
Sbjct: 177 HHGVVTEECDPYFDQIGCSHPGCEPTY-RTPKCVKKCV-NGNQLWETSKHYSVKAYTVNS 234
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ + IM E+Y++GPVE A
Sbjct: 235 DPQDIMAEVYKNGPVEVA------------------------------------------ 252
Query: 232 GAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
FTV++D YKSG ALGGHA++++GWG + E YWL+AN WNT+WGD+G
Sbjct: 253 --FTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEG-EDYWLLANQWNTNWGDDG 309
Query: 285 LFKILRGKDECGIESSITAGVP 306
FKI RG +ECGIE+++TAG+P
Sbjct: 310 YFKIKRGTNECGIENAVTAGLP 331
>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 133/301 (44%), Gaps = 47/301 (15%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI A + G + +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFAEARRLTGARIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ +CGSCW + G S H C ++C D Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRYCTVGGVQQLRISAAHLLSCCKDCGYGCDGGY 162
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY--EHGPVEGAFTVFDDLILYKSGRFFVPGNETT-- 212
A Y VS S + Y H G Y F P TT
Sbjct: 163 PGT----AWEYYVSHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYD---FHTPKCNTTCT 215
Query: 213 --AMSLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYK-------SGK 247
A+ LIK+ R N S G +G AF V+ D + YK SG
Sbjct: 216 DKAIPLIKY--RGNHS-YGLDGEDDYKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGD 272
Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
LGGHA+RI+GWG+ + YW IANSW+TDWG NG F ILRGKDECGIES AG+P
Sbjct: 273 VLGGHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHFLILRGKDECGIESEGYAGLPA 330
Query: 308 L 308
+
Sbjct: 331 I 331
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 14/24 (58%), Positives = 19/24 (79%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVS 32
CG+GC+GG+PG AW Y+V G+ S
Sbjct: 154 CGYGCDGGYPGTAWEYYVSHGLAS 177
>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
Length = 333
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 78/200 (39%), Positives = 102/200 (51%), Gaps = 41/200 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G G+ GC PY + C+HH G C A TPKC ++C Y Y D G KS
Sbjct: 175 GQYGTNEGCMPYSLPHCDHHTTGKYQPCPAVV-PTPKCEKKCLTGYPKSYSNDKTRGKKS 233
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y V + SIM+E+ ++GPV AF V+ D + YK+G
Sbjct: 234 YGVRGVQ-SIMQELVDNGPVTAAFDVYSDFLSYKTG------------------------ 268
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ + +G GGHA++I+G+G + S + YWL+ANSWN DWGD G F
Sbjct: 269 -------------VYRHTTGSYEGGHAVKIIGYGTE--SGQDYWLVANSWNEDWGDKGFF 313
Query: 287 KILRGKDECGIESSITAGVP 306
KI +GKDECGIESSI AG P
Sbjct: 314 KIAKGKDECGIESSIVAGDP 333
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 26/34 (76%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
+ CG GCNGG+P AW ++V +G+VSGG YG+ +
Sbjct: 148 KSCGMGCNGGYPAAAWEWYVDTGVVSGGQYGTNE 181
>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
Length = 356
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 90/284 (31%), Positives = 127/284 (44%), Gaps = 108/284 (38%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW------------------------------ 113
LP +FD+RT W C TI I DQG CGSCW
Sbjct: 100 LPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDMNVSLSVNDILAC 159
Query: 114 -------GC---RPY--------------EIAPCEHHVNGTRPSCDASKGHTPKCVRECQ 149
GC P+ E P + + P C+ + TPKCV++C
Sbjct: 160 CGLLCGAGCAGGTPFSAWIYLAHHGVVTEECDPYFDQIGCSHPGCEPTY-RTPKCVKKCV 218
Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
N + ++ ++ K+Y+V+S+ + IM E+Y++GPVE A
Sbjct: 219 -NGNQLWETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVA-------------------- 257
Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGED 262
FTV++D YKSG ALGGHA++++GWG
Sbjct: 258 ------------------------FTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTS 293
Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+ E YWL+AN WNT+WGD+G FKI RG +ECGIE+++TAG+P
Sbjct: 294 HEG-EDYWLLANQWNTNWGDDGYFKIKRGTNECGIENAVTAGLP 336
>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
Length = 384
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 80/207 (38%), Positives = 101/207 (48%), Gaps = 57/207 (27%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GCRPY PCEHH N T C TPKC ++C +NY YK D +G ++Y+V ++
Sbjct: 218 GCRPYPFPPCEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVEND 277
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
+SI KEI GPVE +
Sbjct: 278 VESIQKEIMTLGPVEAS------------------------------------------- 294
Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN-- 283
F V+ D + Y SG GGHA++ILGWG D+ YWL ANSWN DWG++
Sbjct: 295 -FEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGIDQGVS--YWLAANSWNNDWGEDVF 351
Query: 284 -GLFKILRGKDECGIESSITAGVPKLD 309
G F+ILRG DECGIES I AG+P+ D
Sbjct: 352 SGYFRILRGADECGIESGIVAGIPRKD 378
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE 119
+P +FD+R WP C ++R IRDQ SCGSCW E
Sbjct: 121 IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVE 156
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
+ CGFGC GG P AW+YWV SGIV+G Y
Sbjct: 184 KTCGFGCFGGEPMAAWKYWVLSGIVTGSDY 213
>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 277
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 75/194 (38%), Positives = 97/194 (50%), Gaps = 42/194 (21%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY PCEHH G P+C +K TPKC++ C++ Y+ Y +D F YS+ S+E
Sbjct: 122 GCQPYYFPPCEHHTKGPLPNCTDTKP-TPKCLQVCRKGYEKSYSEDKYFAKTVYSLHSDE 180
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EIY++GPVE F+V+ D + YKSG + S W R
Sbjct: 181 TQIKTEIYKNGPVEADFSVYTDFLAYKSGVY-------QRHSYELWEARHQN-------- 225
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
LGW +S WL+ANSWN DWGD G FKI RG +
Sbjct: 226 -----------------------LGWALKRRS---VWLVANSWNQDWGDKGYFKIRRGNN 259
Query: 294 ECGIESSITAGVPK 307
ECGIE+ I AG+PK
Sbjct: 260 ECGIENDINAGIPK 273
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 30/43 (69%), Gaps = 1/43 (2%)
Query: 70 RLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
RLP + + E+ EDLP +FD+R W +C +I IRDQ +CGSC
Sbjct: 12 RLPIRL-HEEIPEDLPESFDAREAWSHCDSIHLIRDQSTCGSC 53
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GC GG+P AW Y+ GIV+GG YG+
Sbjct: 90 CGMGCFGGYPSAAWDYYKDEGIVTGGLYGT 119
>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
Length = 343
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 102/195 (52%), Gaps = 40/195 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPS-CDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY PC HH N C TPKC + CQ Y+ Y++D +F ++Y + +NE
Sbjct: 188 CKPYAFYPCGHHQNDPYYGPCPGGLWPTPKCRKTCQRKYNKSYQEDKHFATRAYYLPNNE 247
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++I +EIY++GPV AF V+ D YK G
Sbjct: 248 RNIRQEIYKNGPVVAAFRVYQDFSYYKKG------------------------------- 276
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ ++K G G HA++++GWG + + YWLIANSWNTDWG++G F+I+RG +
Sbjct: 277 ------IYVHKWGGQTGAHAVKVVGWGRENATD--YWLIANSWNTDWGESGYFRIVRGTN 328
Query: 294 ECGIESSITAGVPKL 308
ECGIE+ + G ++
Sbjct: 329 ECGIEAQMVGGAMRV 343
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 24/32 (75%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D PA+FD+RT WP C +I IRDQ SCGSCW
Sbjct: 88 DPPASFDARTHWPECRSIGTIRDQSSCGSCWA 119
Score = 37.4 bits (85), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 24/35 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG+GC GG+P A+++ + G+V+GG Y K+ K
Sbjct: 155 CGYGCQGGWPIEAYKWMQRDGVVTGGKYRQKKVCK 189
>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 341
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 97/297 (32%), Positives = 128/297 (43%), Gaps = 107/297 (36%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIA----------------- 121
EV +P FD+R KWP+CPTI +RDQG+CGSCW E
Sbjct: 81 EVPAVIPDTFDARQKWPDCPTIGTVRDQGACGSCWAFGAVEAMSDRYCISFKEQVNISAE 140
Query: 122 -------PCEHHVNGTRPSC--------------------DASKGHTPKCVRECQENYDV 154
C +G P+ D++ G P + +C +
Sbjct: 141 NLLSCCETCGSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKCDHHEPG 200
Query: 155 PY------------------------KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAF 190
PY + D ++G SYS+SS+ SI EI +GPVEGAF
Sbjct: 201 PYENCSGSQSTPSCKRSCISSYDKSYRSDKHYGKNSYSISSDVSSIQTEIMTNGPVEGAF 260
Query: 191 TVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALG 250
+V+ D Y SG + + +G LG
Sbjct: 261 SVYADFPTYTSGVY-------------------------------------QHTTGSFLG 283
Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
GHAI+ILGWG + + YWL+ANSWN WGD+G FKI+RGKDECGIESSI AG+P+
Sbjct: 284 GHAIKILGWGTE--NGVPYWLVANSWNPSWGDSGFFKIIRGKDECGIESSIVAGMPE 338
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 23/34 (67%), Gaps = 4/34 (11%)
Query: 9 CGFGCNGGFPGMAWRYW----VKSGIVSGGAYGS 38
CG GC+GG+P AWR+W + GIV+GG Y S
Sbjct: 149 CGSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDS 182
>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 107/209 (51%), Gaps = 39/209 (18%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
+R I GS + GCRPY C+H V G +C TP+C + CQ+ Y+ Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D ++G SY+V S E I K+I HGPVE +++D + YKSG
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ Y +GK + GHA+R++G G + + YWL AN+WN
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGCGVENGT--AYWLAANTWNE 312
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
G D NL R P + + ++ ++P++FDSR KWP C +I +IRDQ CGS W
Sbjct: 66 GRREDPNLREKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 22/29 (75%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
+ CG GC+GGF G +W YWV GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181
>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
Length = 337
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 100/199 (50%), Gaps = 44/199 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPS---CDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
GC PY C H G+R C TP C CQ YD Y+KD +G SY+V
Sbjct: 173 GCLPYPFPQCRH--PGSRSQLNPCPRYTYPTPSCYPYCQAGYDKTYEKDKVYGKTSYNVD 230
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+E +IM+EI ++GPVE F V+ D +YKSG +
Sbjct: 231 RHEYTIMEEIMKNGPVEAGFIVYTDFAVYKSGIYH------------------------- 265
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
+ SG+ G HAIRI+GWG + + KYWL ANSWN WG+NG F+ILR
Sbjct: 266 ------------HVSGRYAGKHAIRIIGWGVE--NGVKYWLTANSWNVGWGENGYFRILR 311
Query: 291 GKDECGIESSITAGVPKLD 309
G DEC IES + AG+P+L
Sbjct: 312 GTDECRIESIVVAGMPRLQ 330
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 40/76 (52%), Gaps = 2/76 (2%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
K A + NI H K +G+ + + Y+ D DLP +FD+R KWP C
Sbjct: 33 KAAPSSRFINI--EHFKQHLGLLEETPEERQTRRPTVRYNVSDNDLPESFDAREKWPLCR 90
Query: 99 TIREIRDQGSCGSCWG 114
+IR+I DQ SCGSCW
Sbjct: 91 SIRQIPDQSSCGSCWA 106
>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
Length = 225
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 69/147 (46%), Positives = 85/147 (57%), Gaps = 37/147 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RPSC +G TPKCV +C+ Y Y KD +FG+ SY+VSSNE
Sbjct: 111 GCRPYTIPPCEHHVNGSRPSCTGEEGDTPKCVMQCEAGYTPSYFKDKHFGSTSYAVSSNE 170
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EIY++GPVEGAFTV++D + YKSG +
Sbjct: 171 ADIQIEIYKNGPVEGAFTVYEDFLQYKSGVY----------------------------- 201
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWG 260
+ +G A+GGHAIRILGWG
Sbjct: 202 --------KHVTGDAVGGHAIRILGWG 220
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 30/43 (69%), Positives = 34/43 (79%), Gaps = 1/43 (2%)
Query: 71 LPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
LP +G + D LP NFDSRT+WP CPTI+EIRDQGSCGSCW
Sbjct: 1 LPMKLGMA-TDVKLPENFDSRTQWPKCPTIQEIRDQGSCGSCW 42
>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
Length = 348
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 134/313 (42%), Gaps = 92/313 (29%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
L+N K +GV P +P YS+ E+LP FD+R+KW C TI I D
Sbjct: 58 LANYTIEQFKHILGVKPTPPGLLAGVPTKT-YSK-SEELPKQFDARSKWSGCSTIGTILD 115
Query: 106 QGSCGSCWGCRPYEIAP---CEHH-------VNGTRPSC-----DASKGHTP-------- 142
QG CGSCW E C H N C D G P
Sbjct: 116 QGHCGSCWAFGAVECLQDRFCIHQNINISLSANDLVACCGFMCGDGCDGGYPIKAWQYFV 175
Query: 143 --KCVRE---------------CQENYDVP------------YKKDLNFGAKSYSVSSNE 173
V E C+ YD P +++ +F +Y V+S+
Sbjct: 176 QSGVVTEECDPYFDQVGCKHPGCEPAYDTPKCEKKCKVQNQVWEEKKHFSINAYRVNSDP 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
IM E+Y++GPVE AFTV++D YKSG +
Sbjct: 236 HDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G +GGHA++++GWG + + E YWL+AN WN WGD+G FKI+RGK+
Sbjct: 267 --------KHVTGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIRGKN 317
Query: 294 ECGIESSITAGVP 306
ECGIE + AG+P
Sbjct: 318 ECGIEEEVVAGMP 330
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 22/25 (88%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG+P AW+Y+V+SG+V+
Sbjct: 157 MCGDGCDGGYPIKAWQYFVQSGVVT 181
>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
Length = 247
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/300 (30%), Positives = 133/300 (44%), Gaps = 111/300 (37%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG GC GG+P AW YW++ GIV+GG + + R + WM D+
Sbjct: 59 CGQGCRGGYPPKAWDYWMREGIVTGGTWEN------------RTGCQPWMFTKCDH---- 102
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
+G DSR K+ CP H+
Sbjct: 103 ------VG------------DSR-KYSRCP--------------------------HYTY 117
Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
T P C R CQ Y+ Y++D +G SY+V +E IM+EI ++GPVE
Sbjct: 118 PTPP-----------CARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEV 166
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
F +F D +Y+SG + + +GK
Sbjct: 167 TFAIFQDFGVYRSG-------------------------------------IYHHVAGKF 189
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+G HA+R++GWG + + YWL+ANSWN +WG+NG F+++RG++ECGIES + AG+P+L
Sbjct: 190 IGRHAVRMIGWGVE--NGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAGMPRL 247
>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
Length = 342
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 70/194 (36%), Positives = 100/194 (51%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY C+H V G +C TP+C + CQ+ Y+ Y++D ++G SY+V E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGEFSYNVIGVE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I KEI +GPVE +++D + YKSG
Sbjct: 247 SVIQKEIMMYGPVEAYLHIYEDFLNYKSG------------------------------- 275
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +G+ + GHA+R++GWG + + YWL AN+WN DWG+ G F+I+RG+D
Sbjct: 276 ------IYRYTTGQFISGHAVRLIGWGVENGT--SYWLAANTWNEDWGEKGYFRIVRGRD 327
Query: 294 ECGIESSITAGVPK 307
EC IES I AG K
Sbjct: 328 ECLIESFIVAGQIK 341
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GG G +W YWVK GIV+GG+
Sbjct: 155 CGSGCDGGVTGYSWDYWVKHGIVTGGS 181
>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
Length = 309
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 69/194 (35%), Positives = 101/194 (52%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY C+H V G +C TP+C + CQ+ Y+ Y++D ++G SY+V S E
Sbjct: 154 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVE 213
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I K+I HG VE +++D + YKSG
Sbjct: 214 SVIQKDIMMHGTVEAYLEIYEDFLNYKSG------------------------------- 242
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +G+ + GHA+R++GWG + + YWL AN+WN DWG+ G F+I+RG++
Sbjct: 243 ------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 294
Query: 294 ECGIESSITAGVPK 307
EC IES I AG+ K
Sbjct: 295 ECLIESEIAAGLIK 308
Score = 41.2 bits (95), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 20/27 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GG G +W YWV GIV+GG+
Sbjct: 122 CGSGCDGGVTGYSWDYWVSHGIVTGGS 148
>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 93/315 (29%), Positives = 137/315 (43%), Gaps = 97/315 (30%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
S+ + K +GV R P + E++ LP FD+RT WP C +I +I D
Sbjct: 60 FSDFTVSQFKRLLGVKKAPKSLLKRTPVVTHSKEIE--LPKTFDARTAWPQCLSIADILD 117
Query: 106 QGSCGSCW-------------------------------------GCR-PYEIAP----- 122
QG CGSCW GC Y IA
Sbjct: 118 QGHCGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFK 177
Query: 123 --------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
C+ + + T P C+ + TP C ++C + ++ + + +F +Y V+S
Sbjct: 178 RTGVVTSECDPYFDQTGCSHPGCEPAYP-TPACEKKCVKK-NLLWSESKHFSVNAYRVNS 235
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
++ SIM E+Y +GP E +FTV++D YKSG +
Sbjct: 236 DQHSIMTEVYTNGPAEVSFTVYEDFAHYKSGVY--------------------------- 268
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ +G +GGHA++++GWG E E YWL+AN WN WG +G FKI+RG
Sbjct: 269 ----------KHVTGSEMGGHAVKLIGWGTSEDG-EDYWLLANQWNRSWGGDGYFKIIRG 317
Query: 292 KDECGIESSITAGVP 306
+ECGIE +TAG P
Sbjct: 318 TNECGIE-DVTAGTP 331
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 13/25 (52%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GC+GG+P AW+Y+ ++G+V+
Sbjct: 159 LCGEGCDGGYPIAAWQYFKRTGVVT 183
>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
Length = 376
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/198 (36%), Positives = 103/198 (52%), Gaps = 41/198 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
GC+PY PCEHH T C TPKC ++C +Y D Y +D +G +Y V
Sbjct: 203 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGHSAYGVKD 262
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ ++I KE+ HGP+E AF V++D + Y G +
Sbjct: 263 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 295
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
++ GK GGHA++++GWG ++ YW ANSWNTDWG++G F+ILRG
Sbjct: 296 ----------VHTGGKLGGGHAVKLIGWGIEDGIP--YWTCANSWNTDWGEDGFFRILRG 343
Query: 292 KDECGIESSITAGVPKLD 309
DECGIES + G+PKL+
Sbjct: 344 VDECGIESGVVGGIPKLN 361
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/36 (61%), Positives = 27/36 (75%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++D D+P +FDSR WP C +IR IRDQ SCGSCW
Sbjct: 101 DLDLDIPESFDSRENWPKCQSIRNIRDQSSCGSCWA 136
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/30 (73%), Positives = 23/30 (76%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
R CGFGCNGG P AWRYWVK GIV+G Y
Sbjct: 169 RSCGFGCNGGDPLAAWRYWVKDGIVTGSNY 198
>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
Length = 346
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 70/195 (35%), Positives = 103/195 (52%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY PCEH+++ R C T C +CQ+NY + Y +D ++GA Y + +
Sbjct: 191 GCKPYPYPPCEHYIDAGRYKKCPKDLYPTNTCEYKCQDNYTISYDEDKHYGAYPYVLVGD 250
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
I +EI HGPVE F V++D Y SG
Sbjct: 251 ASFIQQEIMNHGPVEVTFDVYEDFEHYSSG------------------------------ 280
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ + +G+ +G HA+++LGWG + + YW+ ANSWN+DWG+NG F+ILRG+
Sbjct: 281 -------IYKHMAGEYVGVHAVKMLGWGTE--NGVDYWICANSWNSDWGENGFFRILRGE 331
Query: 293 DECGIESSITAGVPK 307
+ECGIES++ AG PK
Sbjct: 332 NECGIESNVVAGKPK 346
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED-LPANFDSRTKWPNCPTIREIR 104
+N+PR MG LPA ++++D +P +FD+RT WP C ++R +R
Sbjct: 56 FANLPRDIKHRLMG-SKYVALPAKYRMNEKTHNDIDNSTIPKSFDARTNWPKCASLRTVR 114
Query: 105 DQGSCGSCWG 114
DQ +CGS W
Sbjct: 115 DQSACGSGWA 124
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 20/35 (57%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG+GC GG AW YW GIV+G Y +K K
Sbjct: 159 CGYGCEGGDTYKAWNYWTTDGIVTGSNYTTKSGCK 193
>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
Length = 330
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 125/298 (41%), Gaps = 100/298 (33%)
Query: 73 ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIA----------- 121
E I + + +DLP FD+R +W C +I+EIRDQ CGSCW +
Sbjct: 70 ETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQK 129
Query: 122 ---------------PCEHHVNGT------------RPSCDASKGH-----------TPK 143
C V+G + S S G P+
Sbjct: 130 NQLRISAADMIECCESCTFSVDGCHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPR 189
Query: 144 CVRECQENYDVP-------------YKKDLNFGAKSYSVSSN-EKSIMKEIYEHGPVEGA 189
C C+ YD P Y++D ++ ++Y + S E+ I EI ++GPV +
Sbjct: 190 CNPSCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVAS 249
Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
FTV+ D I Y SG + G K L
Sbjct: 250 FTVYADFIHYLSGVYKFDGE------------------------------------SKLL 273
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
GGHA+RI+GWG E YWL++NSWN WGD GLFKI RGK+ECGIE ITAG+P+
Sbjct: 274 GGHAVRIIGWG-IENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 330
>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
Length = 347
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/315 (30%), Positives = 134/315 (42%), Gaps = 96/315 (30%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIR 104
SN A K +GV P P N L + + +LP FD+R+ W C TI I
Sbjct: 57 FSNYTIAQFKHILGVKP---APQNALSNVPVKTYSRSLELPKEFDARSAWSRCSTIGNIL 113
Query: 105 DQGSCGSCWGCRPYEIAP---CEH-------HVNGTRPSC-----DASKGHTP------- 142
+QG CGSCW E C H VN C D G P
Sbjct: 114 EQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYF 173
Query: 143 -------------------------------KCVRECQENYDVPYKKDLNFGAKSYSVSS 171
KC ++C+E V +++ +F +Y ++S
Sbjct: 174 VQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQV-WQEKKHFSIDAYRINS 232
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ IM E+Y++GPVE AFTV++D YKSG +
Sbjct: 233 DPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 265
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ +G +GGHA++++GWG + + E YWL+AN WN WGD+G FKI+RG
Sbjct: 266 ----------KHITGGIMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIRG 314
Query: 292 KDECGIESSITAGVP 306
K+ECGIE + AG+P
Sbjct: 315 KNECGIEEGVVAGMP 329
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 22/25 (88%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG+P AWRY+V++G+V+
Sbjct: 156 MCGDGCDGGYPIEAWRYFVQNGVVT 180
>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 99/327 (30%), Positives = 134/327 (40%), Gaps = 102/327 (31%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYN-----LPANRLPELIGYSEVDEDLPANFDSR 91
G K A SN + K +GV P +P P+L+ +LP FD+R
Sbjct: 55 GWKAALNPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLL-------ELPQEFDAR 107
Query: 92 TKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPSC-----DA 136
W NC TI I DQG CGSCW E C H+ N C D
Sbjct: 108 VAWSNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLYACCGFLCGDG 167
Query: 137 SKGHTP-----KCVRE--------------------CQENYDVP------------YKKD 159
G P VR+ C+ Y P + +
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSRS 227
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
+FG +Y +SS+ SIM E+Y++GPVE +FTV++D YKSG +
Sbjct: 228 KHFGVNAYMISSDPHSIMTEVYKNGPVEVSFTVYEDFAHYKSGVY--------------- 272
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+ +G +GGHA++++GWG E E YWL+AN WN
Sbjct: 273 ----------------------KHVTGDIMGGHAVKLIGWGTSEDG-EDYWLLANQWNRG 309
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WGD+G FKI RG +EC IE + AG+P
Sbjct: 310 WGDDGYFKIRRGTNECEIEDEVVAGLP 336
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GC+GG+P AW+Y+V+ G+V+
Sbjct: 163 LCGDGCDGGYPLQAWKYFVRKGVVT 187
>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
[Tribolium castaneum]
gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 78/202 (38%), Positives = 99/202 (49%), Gaps = 55/202 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+ Y + PCEHH G P+C TP+C +EC D+ YK DL G+ +Y SS+E
Sbjct: 182 GCKAYTVPPCEHHTEGDLPAC-GDIVPTPQCKKECDAGVDIEYKSDLRKGS-AYQTSSDE 239
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EI +GPVE
Sbjct: 240 SQIQTEIMTNGPVEAD-------------------------------------------- 255
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D + YKSG GGHAI+ILGWG ++ + YWL ANSWN DWGD G F
Sbjct: 256 FDVYEDFLNYKSGVYQQTTGNYAGGHAIKILGWGVEDGTP--YWLAANSWNEDWGDKGYF 313
Query: 287 KILRGKDECGIESSITAGVPKL 308
KILRG++ECGIES I G+P +
Sbjct: 314 KILRGQNECGIESDIIGGIPVV 335
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 35/60 (58%), Gaps = 8/60 (13%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG GCNGG+P AW YW ++GIV+GG Y +K K + + P H H + +LPA
Sbjct: 150 CGDGCNGGWPAEAWAYWAETGIVTGGKYETKDGCK-AYTVPPCEH-------HTEGDLPA 201
>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
Length = 225
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 92/162 (56%), Gaps = 37/162 (22%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
T + + G GS GCRPY I PCEHHVNG+RPSC G TPKCV++C Y Y+K
Sbjct: 96 TEKGLVSGGLYGSGIGCRPYTIPPCEHHVNGSRPSCSGEGGDTPKCVQKCDSGYTPAYEK 155
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D +G +YSV S+ +SIM+EIY+ GPVEGAFTV++D +LYKSG +
Sbjct: 156 DKIYGQSAYSVPSSPESIMEEIYKDGPVEGAFTVYEDFLLYKSGVY-------------- 201
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
+ +G+A+GGHAI+ILGWG
Sbjct: 202 -----------------------QHHTGEAVGGHAIKILGWG 220
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 24/30 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW+YW + G+VSGG YGS
Sbjct: 79 CGMGCNGGYPSGAWQYWTEKGLVSGGLYGS 108
>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 350
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 98/322 (30%), Positives = 136/322 (42%), Gaps = 92/322 (28%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G + + +N A K +GV P +P YS DLP FD+R+KW
Sbjct: 53 GWTAGQNSYFANYTIAQFKHILGVKPTPPGLLRGVPTKT-YSR-STDLPKEFDARSKWSG 110
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAP---CEH-------HVNGTRPSC-----DASKGHT 141
C TI I DQG CGSCW E C H VN C D G
Sbjct: 111 CSTIGTILDQGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLVACCGFMCGDGCDGGY 170
Query: 142 PKCVRE-------------------------CQENYDVP------------YKKDLNFGA 164
P + C+ Y P +++ +F
Sbjct: 171 PISAWQYLVENGVVTDECDPYFDQVGCKHPGCEPAYPTPACEKKCKVQNQVWQEKKHFSI 230
Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
+Y V+S+ IM E+Y++GPVE AFTV++D YKSG
Sbjct: 231 NAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSG---------------------- 268
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
V++ + +G+ +GGHA++++GWG K+ YWL+AN WN WGD+G
Sbjct: 269 -----------VYEHI----TGEMMGGHAVKLIGWGTSADGKD-YWLLANQWNRGWGDDG 312
Query: 285 LFKILRGKDECGIESSITAGVP 306
FKI+RGK+ECGIE + AG+P
Sbjct: 313 YFKIIRGKNECGIEEDVVAGMP 334
>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
Length = 342
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 100/199 (50%), Gaps = 53/199 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEH G P+C TP+C + CQ+ Y P+++D FG S +V +NE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPFEQDKPFGEGSSNVQNNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K ++I +GPVE A
Sbjct: 247 KVFQRDIMMYGPVEAA-------------------------------------------- 262
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D + KSG +GGH IRI+GWG ++ + YWLIANSWN DWG+NGLF
Sbjct: 263 FDVYEDFLNSKSGISRHVTGSIVGGHPIRIIGWGVEKGNP--YWLIANSWNEDWGENGLF 320
Query: 287 KILRGKDECGIESSITAGV 305
+++RG+DEC IES + AG+
Sbjct: 321 RMVRGRDECSIESHVVAGL 339
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 1/52 (1%)
Query: 63 DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D + R P + + ++ ++P+ FDSR KWP+C +I +IRDQ CGSCW
Sbjct: 70 DAEMKRKRRP-TVDHHNLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120
>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
Length = 353
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 131/316 (41%), Gaps = 97/316 (30%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
+N K +GV P P L + + DLP FD+RT+W +C TI I D
Sbjct: 62 FANYTIEQFKHILGVKP---TPPGLLAGVPIKIHPEMDLPKEFDARTQWSSCSTIGNILD 118
Query: 106 QGSCGSCWGCRPYEIAP-------------------------CEHHVNGTRP-------- 132
QG CG+CW E C NG P
Sbjct: 119 QGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFR 178
Query: 133 -------SCDASKGHT-------------PKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
CD T PKC R+C+ + +K++ +F +Y V SN
Sbjct: 179 RSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCKVE-NQAWKENKHFSVNAYRVHSN 237
Query: 173 EKSIMKEIYEHGPVEGAFTVFD--DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
IM E+Y++GPVE AFT D YKSG +
Sbjct: 238 PHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVY-------------------------- 271
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
+ +G +GGHA++++GWG + + E YWL+AN WN WGD+G FKI+R
Sbjct: 272 -----------KHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIR 319
Query: 291 GKDECGIESSITAGVP 306
G++ECGIE +TAG+P
Sbjct: 320 GENECGIEGDVTAGMP 335
Score = 40.8 bits (94), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GCNGG+P AWRY+ +SG+V+
Sbjct: 160 LCGSGCNGGYPISAWRYFRRSGVVT 184
>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
Length = 356
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 104/205 (50%), Gaps = 61/205 (29%)
Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGH--TPKCVRECQENYDV--PYKKDLNFGAKSYSV 169
C+ Y PC HHV T+ P C KG TP+C ++C ++ V PY +DL G KSYSV
Sbjct: 194 CQAYSFPPCAHHVASTKYPPC---KGEVPTPECKKKCDDDSKVKRPYNEDLYKGQKSYSV 250
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
SS+ K+IM EI +GPVE A
Sbjct: 251 SSDPKAIMTEIMNNGPVEVA---------------------------------------- 270
Query: 230 AEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
FTV++D + YKSG + LGGHA++++GWG + + YWLI NSWN WGD
Sbjct: 271 ----FTVYEDFVTYKSGVYQHVTGEQLGGHAVKMIGWGVENDTP--YWLIVNSWNETWGD 324
Query: 283 NGLFKILRGKDECGIESSITAGVPK 307
G FKILRG +ECGIE + +P+
Sbjct: 325 QGTFKILRGSNECGIEDEVVTALPQ 349
Score = 41.6 bits (96), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 23/29 (79%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
CG GCNGG+P A +Y+VK+G+V+G +G
Sbjct: 161 CGDGCNGGYPEAAMQYFVKTGLVTGDLFG 189
>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
Length = 346
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 71/198 (35%), Positives = 100/198 (50%), Gaps = 39/198 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GCRPY PC HH N T + TP+CV++CQ+ Y Y++D +G
Sbjct: 184 GDYGSKTGCRPYPFHPCGHHGNETYYGECPKEESTPECVKQCQKGYKNSYRRDKTWGEDY 243
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y V ++ K+I +EI GPV +FTV+DD Y G
Sbjct: 244 YEVENSVKAIQREIMRSGPVVSSFTVYDDFSYYVKG------------------------ 279
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ + +GKA G HAI+I+GWG ++ YW+IANSW+ DWG+ G F
Sbjct: 280 -------------IYKHTAGKARGSHAIKIIGWGTEKNV--PYWIIANSWHNDWGEKGFF 324
Query: 287 KILRGKDECGIESSITAG 304
+++RG + CGIE + AG
Sbjct: 325 RMVRGTNHCGIEEDVVAG 342
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 34/46 (73%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
NR P + S+ +D+P +FD+RTKWPNC +I+ IRDQ +CGSCW
Sbjct: 79 NRKPVVEDASDKGDDIPESFDARTKWPNCTSIKHIRDQANCGSCWA 124
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CGFGC GG+P A+ Y+ G+V+GG YGSK
Sbjct: 159 CGFGCEGGWPIDAFEYYSYQGVVTGGDYGSK 189
>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 103/299 (34%), Positives = 128/299 (42%), Gaps = 42/299 (14%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGV--HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI A + G +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFAEARRLTGAFRRKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ +CGSCW H G S H C ++C + D Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCKDCGDGCDGGY 162
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYE--HGPVEGAFTVFDDLILYKSGRFFVPGNETT-- 212
A Y VS S + Y H G Y F P TT
Sbjct: 163 PD----AAWRYYVSHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYD---FHTPKCNTTCT 215
Query: 213 --AMSLIKWTIRDNTSQLGAEG--------------AFTVFDDLILYK-------SGKAL 249
A+ LI++ D+ L E AF VF D + YK SG L
Sbjct: 216 DKAIPLIEYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFL 275
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
GGHA+RI+GWG+ + YW IANSW+TDWG NG F LRG +ECGIE AG+P +
Sbjct: 276 GGHAVRIVGWGKLNGTP--YWKIANSWDTDWGMNGHFLFLRGNNECGIEFEGYAGLPAI 332
>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
Length = 312
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 115/278 (41%), Gaps = 93/278 (33%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---------------CEHHV 127
+LP FDSRT WPNC I +I DQG CGSCW +E+ H+
Sbjct: 75 NLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQHL 134
Query: 128 NGTRPSCDASKG------------------------------------HTPKCVR-ECQE 150
P C G TPKC + +C
Sbjct: 135 TSCTPGCSGCNGGWMSTAFGFMQSNGILGEDCIPYQMGKCKHPGCSTWPTPKCNKTKCYP 194
Query: 151 NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNE 210
N +L A SYSV SNE I KEIYE+GPV +F V++DL +Y+SG +
Sbjct: 195 NDT--KSTELWHAASSYSVRSNEADIQKEIYENGPVTASFAVYEDLSVYQSGVY------ 246
Query: 211 TTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYW 270
+ +G G HAI+++GWG + KYW
Sbjct: 247 -------------------------------QHVTGGFEGLHAIKVVGWGILDGV--KYW 273
Query: 271 LIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
I NSW DWG +GL I RG DECGIES + AG PKL
Sbjct: 274 TIVNSWAEDWGFDGLLLIRRGVDECGIESDVVAGQPKL 311
>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
Length = 350
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 93/324 (28%), Positives = 134/324 (41%), Gaps = 97/324 (29%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G K A SN +GV P +P + + LP+ FD+R WP+
Sbjct: 50 GWKAAMSTRFSNYTVREFAHLLGVLPTPQKLLETVPVRVYPKGLK--LPSKFDARKAWPH 107
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAP---CEH-HVNGT---------------------- 130
C + R I DQG CGSCW E C H VN T
Sbjct: 108 CTSTRSILDQGHCGSCWAFAAVEALSDRFCIHFQVNATLSENDLVACCGFRCGSGCNGGF 167
Query: 131 ----------------------------RPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
P C+ S TP+CV+ C++N + K ++
Sbjct: 168 PLSAWRYFSRRGVVTDECDPYFDNDGCNHPGCEPSYP-TPRCVKNCKDNQRWSHSK--HY 224
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
A +Y + S+ +IM E++ +GPVE +F+V++D Y++G
Sbjct: 225 SANAYRIKSDPYNIMAEVFNNGPVEVSFSVYEDFAHYETG-------------------- 264
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
+ + G+ LGGHA++++GWG + + YWLIANSWNT WG+
Sbjct: 265 -----------------VYKHVQGRYLGGHAVKLIGWGTTDDGID-YWLIANSWNTAWGE 306
Query: 283 NGLFKILRGKDECGIESSITAGVP 306
G FKI RG +ECGIE AG+P
Sbjct: 307 GGYFKIARGVNECGIERDPVAGMP 330
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 19/24 (79%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVS 32
CG GCNGGFP AWRY+ + G+V+
Sbjct: 159 CGSGCNGGFPLSAWRYFSRRGVVT 182
>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 99/195 (50%), Gaps = 37/195 (18%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPY I PC HH N T + TP C ++CQ Y ++ D G +Y V E+
Sbjct: 183 CRPYPIHPCGHHGNDTYYGECPREAATPPCKKKCQPGYKKIFRMDKRQGKVAYGVEPKEE 242
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I +EI HGPV +F V++D SL K + +T+
Sbjct: 243 AIQREILRHGPVVASFAVYEDF------------------SLYKTGVYKHTA-------- 276
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
G G HA++++GWG D K+K KYWLIANSW+ DWG+NG F+ +RG ++
Sbjct: 277 -----------GALRGYHAVKMMGWGVDSKTKAKYWLIANSWHNDWGENGYFRFIRGIND 325
Query: 295 CGIESSITAGVPKLD 309
C IE ++ AG+ +D
Sbjct: 326 CEIEDTVAAGIVDVD 340
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CGFGC GG+ AW Y+V G+VSGG Y +K
Sbjct: 150 CGFGCGGGWSIRAWEYFVYEGVVSGGEYLTK 180
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
E +ED+P +D R K+ C T IRDQ +CGSCW
Sbjct: 81 EPNEDIPEEYDPREKF-KCSTFY-IRDQANCGSCWA 114
>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
Length = 721
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 129/300 (43%), Gaps = 104/300 (34%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG------------ 114
P N LP + + P +FD+R WPNC +I+ IRDQ CGSCW
Sbjct: 69 PNNSLPGSLSRA------PTSFDARDYWPNCKSIKMIRDQAYCGSCWAFGAAEVISDRIC 122
Query: 115 ----------CRPYEIAPCEHHVNGTR--------------------------------P 132
P +I C + +G +
Sbjct: 123 IQSNGTDQPIISPEDILTCCTNSHGCQGGFVLEAMKFWKSKGVVTGGDFQGDGCIPYSYG 182
Query: 133 SC-DASKGHT-PKCVRECQENYDV-PYKKDLNFGAKSYSVSSNE--KSIMKEIYEHGPVE 187
SC D T PKC ECQ Y YK+D +G+ +Y +S++ ++I EI +GPVE
Sbjct: 183 SCSDCHTAQTTPKCKNECQVKYTKNEYKEDKYYGSSAYRLSTSNAVRTIQSEILRNGPVE 242
Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
+ V++D YKSG + Y SG+
Sbjct: 243 ATYQVYEDFYYYKSGVY-------------------------------------EYISGR 265
Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
+GGHA++I+GWG +E YWLIANSW T +G+NG FK+ RG +ECGIE+ + AG+ K
Sbjct: 266 HMGGHAVKIIGWGVEENV--NYWLIANSWGTGFGENGFFKMRRGNNECGIENYVVAGMAK 323
>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
Length = 288
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 89/295 (30%), Positives = 124/295 (42%), Gaps = 96/295 (32%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEH--H 126
N LP L V LPA+FD+R KWP CP++ +IR QGSCGSC+ + + H
Sbjct: 33 NNLPRLQNQRSV-RALPASFDARQKWPYCPSLNQIRSQGSCGSCYAVSTAAVITDRYCIH 91
Query: 127 VNGTRP----------------SCDASKGHTP---------------------------- 142
G R CD H
Sbjct: 92 SGGERQFYFGSTGYLSCCTDCYKCDGGYVHKTFDYWVKYGLTSGGPYHSGQGCKPYPFGG 151
Query: 143 ---------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK-EIYEHGPVEGAFTV 192
KC R+CQ Y + Y +DL GA SY + +++ MK EIY++GP+ +F V
Sbjct: 152 ATQDVNIVLKCDRQCQAGYPLTYSQDLKHGASSYILPWGDENAMKAEIYQNGPIVTSFDV 211
Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
+ D Y+SG + + +G G H
Sbjct: 212 YGDFFQYRSGVY-------------------------------------RHVTGAYKGSH 234
Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
A+R++GWG + + KYWL ANSWN WG+NG FKI+RG++ G+E AG+PK
Sbjct: 235 AVRVIGWGVE--NGVKYWLCANSWNERWGENGFFKIVRGENHVGVEDISYAGLPK 287
>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 347
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 94/194 (48%), Gaps = 40/194 (20%)
Query: 115 CRPYEIAPCEHH-VNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
C PY PC HH G+ P C TP+CV ECQ+ Y Y+ D + SY++ +
Sbjct: 185 CLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYEDDKIRASTSYNLYRS 244
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
+I KEI+ GPVE V+ D Y G +
Sbjct: 245 VTTIQKEIWMRGPVEATMNVYTDFANYAGGVY---------------------------- 276
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ +G+ LGGHAIR+LGWG +E YWL ANSWN WG+ G F+ILRG
Sbjct: 277 ---------KHTTGELLGGHAIRLLGWGVEEDGT-PYWLAANSWNPSWGEKGFFRILRGS 326
Query: 293 DECGIESSITAGVP 306
D CGIES ++AG+P
Sbjct: 327 DHCGIESDVSAGLP 340
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 38/62 (61%), Gaps = 2/62 (3%)
Query: 54 LKSWMG-VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
++S +G + D N+ R P I + ++ +LP+ FD+R WP C TI +IRDQ CGSC
Sbjct: 56 IRSVLGTMREDQNVKEFRRP-TISHEDITLELPSEFDAREHWPECRTIPQIRDQSGCGSC 114
Query: 113 WG 114
W
Sbjct: 115 WA 116
>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
Length = 347
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 94/194 (48%), Gaps = 40/194 (20%)
Query: 115 CRPYEIAPCEHH-VNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
C PY PC HH G+ P C TP+CV ECQ+ Y Y+ D + SY++ +
Sbjct: 185 CLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYEDDKIRASTSYNLYRS 244
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
+I KEI+ GPVE V+ D Y G +
Sbjct: 245 VTAIQKEIWMRGPVEATMNVYTDFANYAGGVY---------------------------- 276
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ +G+ LGGHAIR+LGWG +E YWL ANSWN WG+ G F+ILRG
Sbjct: 277 ---------KHTTGELLGGHAIRLLGWGVEEDGT-PYWLAANSWNPSWGEKGFFRILRGS 326
Query: 293 DECGIESSITAGVP 306
D CGIES ++AG+P
Sbjct: 327 DHCGIESDVSAGLP 340
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 38/62 (61%), Gaps = 2/62 (3%)
Query: 54 LKSWMG-VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
++S +G + D N+ R P I + ++ +LP+ FD+R WP C TI +IRDQ CGSC
Sbjct: 56 IRSVLGTMREDQNVKEFRRP-TISHEDITLELPSEFDAREHWPECRTIPQIRDQSGCGSC 114
Query: 113 WG 114
W
Sbjct: 115 WA 116
>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
Length = 374
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/196 (37%), Positives = 102/196 (52%), Gaps = 48/196 (24%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVSSN 172
GC PY APC+ + SC ++G TP C CQ +Y Y KD +FG +Y ++++
Sbjct: 194 GCMPYSFAPCK------KDSC--AQGTTPSCKTTCQSSYKTAEYTKDKHFGTTAYKITNS 245
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
+I EIY +GPVE +F V++D YKSG +
Sbjct: 246 VAAIQTEIYHNGPVEASFKVYEDFYKYKSGVY---------------------------- 277
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
Y SGK +GGHA++I+GWG + + YWLIANSW T +GD+G FK+ RG
Sbjct: 278 ---------QYTSGKLVGGHAVKIIGWGTE--NGVDYWLIANSWGTTFGDSGFFKMRRGT 326
Query: 293 DECGIESSITAGVPKL 308
+E GIE ++ AG KL
Sbjct: 327 NEVGIEGNVVAGTAKL 342
>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 337
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 75/215 (34%), Positives = 104/215 (48%), Gaps = 50/215 (23%)
Query: 101 REIRDQGSC-----GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRE--CQENYD 153
+ I+ G C GS GC+PY I PC + N SC TP+C ++ NY+
Sbjct: 166 KYIKKNGLCTGGEYGSNEGCQPYSIVPCPRNAN----SCSKENEDTPQCYKDQCTNNNYE 221
Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
P DL + K YSV + IM E++++GPV A V+DD + YK G +
Sbjct: 222 TPLVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAMKVYDDFLCYKGGIY--------- 272
Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
Y +G G HA++I+GWGED+ YWL A
Sbjct: 273 ----------------------------QYTTGGLKGDHAVKIMGWGEDDGID--YWLCA 302
Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
N+W WG G+FKI RG++ECGIE+ IT G+PK+
Sbjct: 303 NTWGNSWGMGGMFKIRRGRNECGIENRITGGLPKV 337
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 14/83 (16%)
Query: 46 LSNIPRAHLKSWMGVHPDY---------NLPANRLPE---LIGYS-EVD-EDLPANFDSR 91
++NIP+ K+ + HP +P N+L E L+ Y +D E LP ++D
Sbjct: 34 VNNIPKHTWKAGINFHPSLLTNVSHLMGVVPWNKLSEKDILLTYDVSIDLESLPESYDIT 93
Query: 92 TKWPNCPTIREIRDQGSCGSCWG 114
W C ++ IRDQ +CGSCW
Sbjct: 94 QTWSECKSVVSIRDQSNCGSCWA 116
>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
Length = 305
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/322 (29%), Positives = 129/322 (40%), Gaps = 110/322 (34%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
L+N K +GV P P R E LP FD+R+KW C TI +I D
Sbjct: 21 LANYTIEQFKHMLGVKP--TPPGLRAAVRTKTHSRSEQLPKVFDARSKWSGCSTIGKILD 78
Query: 106 QGSCGSCWGCRPYEIAP---CEHH------------------------------------ 126
QG CGSCW E C HH
Sbjct: 79 QGHCGSCWAFGAVECLQDRFCIHHNMNITLSANDLVACCGFMCGDGCDGGYPISAWQYFV 138
Query: 127 ---------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
V P C+ + TP C ++C+ V +++ +F +Y V+S
Sbjct: 139 QNGVVTDECDPYFDQVGCKHPGCEPAYP-TPVCEKKCKVQNQV-WEEKKHFSINAYQVNS 196
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ IM E+Y +GPVE A
Sbjct: 197 DPHDIMAEVYNNGPVEVA------------------------------------------ 214
Query: 232 GAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
FTV++D YKSG +GGHA++++GWG + + E YWL+AN WN WGD+G
Sbjct: 215 --FTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDG 271
Query: 285 LFKILRGKDECGIESSITAGVP 306
FKI+RGK+ECGIE +TAG+P
Sbjct: 272 YFKIIRGKNECGIEEDVTAGMP 293
Score = 37.4 bits (85), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 13/25 (52%), Positives = 22/25 (88%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG+P AW+Y+V++G+V+
Sbjct: 120 MCGDGCDGGYPISAWQYFVQNGVVT 144
>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
Length = 121
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/157 (44%), Positives = 90/157 (57%), Gaps = 39/157 (24%)
Query: 152 YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
Y+V Y+ D +G Y V SN+++IMKE+ +HGPVE F V+ D YKSG +
Sbjct: 2 YNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVY------- 54
Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
+ SG LGGHA+R+LGWGE+ + YWL
Sbjct: 55 ------------------------------QHVSGALLGGHAVRLLGWGEE--NNVPYWL 82
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
IANSWNTDWGDNG FKI+RGK+ECGIES + AG+PK+
Sbjct: 83 IANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPKI 119
>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
Length = 130
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/167 (43%), Positives = 89/167 (53%), Gaps = 53/167 (31%)
Query: 148 CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVP 207
C+ Y YK+D ++G SYSVS +EK IM EIY++GPVE
Sbjct: 2 CEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVE-------------------- 41
Query: 208 GNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWG 260
GAFTVF D + YKSG +GGHAIRILGWG
Sbjct: 42 ------------------------GAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWG 77
Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
+ + YWL+ANSWN DWGDNG FKILRG++ CGIES I AG+P+
Sbjct: 78 IE--NGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPR 122
>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
Length = 343
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 80/202 (39%), Positives = 98/202 (48%), Gaps = 54/202 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y CEHHV G P C TP+CV++C + DV Y +D SY++ ++E
Sbjct: 183 GCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQC-DTPDVGYLEDKTRANMSYNIYASE 241
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SIMKEI GPVE
Sbjct: 242 ISIMKEIMLRGPVEAI-------------------------------------------- 257
Query: 234 FTVFDDLILYKSG---KALG----GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FT+++D + Y SG ALG GHA+RILGWGE YWLIANSWN DWG+ G
Sbjct: 258 FTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGE--LGNVPYWLIANSWNEDWGEEGYM 315
Query: 287 KILRGKDECGIESSITAGVPKL 308
K LRG +ECGIE +TAG+P L
Sbjct: 316 KFLRGYNECGIEDDVTAGLPYL 337
Score = 45.4 bits (106), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGC GG+P +AW YW GIV+GG+
Sbjct: 151 CGFGCRGGYPAVAWDYWKTHGIVTGGS 177
>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
Length = 122
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 89/162 (54%), Gaps = 53/162 (32%)
Query: 152 YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
Y YK+D +FG SYSV++NEK IM EIY++GPVE
Sbjct: 2 YSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVE------------------------ 37
Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEK 264
GAF+V+ D +LYKSG + +GGHAIRILGWG +
Sbjct: 38 --------------------GAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENG 77
Query: 265 SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+ YWL+ NSWNTDWGDNG FKILRG+D CGIES I AG+P
Sbjct: 78 TP--YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 117
>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 95/325 (29%), Positives = 133/325 (40%), Gaps = 112/325 (34%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEV-DEDLPANFDSRTKWPNCPTIREIR 104
L+N K +GV P P L + + E LP FD+R+KW C TI +I
Sbjct: 60 LANYTIEQFKHMLGVKP---TPPGLLAGVRTKTHPRSEQLPKEFDARSKWSGCSTIGKIL 116
Query: 105 DQGSCGSCWGCRPYEIAP---CEHH----------------------------------- 126
DQG CGSCW E C HH
Sbjct: 117 DQGHCGSCWAFGAVECLQDRFCIHHNMNISLSANDLVACCGFMCGDGCDGGYPISAWQYF 176
Query: 127 ----------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
V P C+ + TP C ++C+ V +++ +F +Y V+
Sbjct: 177 VQNGVVTEECDPYFDQVGCKHPGCEPAYP-TPVCEKKCKVQNQV-WQEKKHFSIDAYQVN 234
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
S+ IM E+Y++GPVE A
Sbjct: 235 SDPHDIMAEVYKNGPVEVA----------------------------------------- 253
Query: 231 EGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
FTV++D YKSG +GGHA++++GWG + + E YWL+AN WN WGD+
Sbjct: 254 ---FTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDD 309
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G FKI+RGK+ECGIE +TAG+P +
Sbjct: 310 GYFKIIRGKNECGIEEDVTAGMPSM 334
>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
Length = 237
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 91/160 (56%), Gaps = 38/160 (23%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
+ I G GS GC PYE+APCEHHVNGTR C G TPKCV++C++ Y VPY +DL
Sbjct: 112 KGIVSGGPYGSNMGCIPYEVAPCEHHVNGTRGPCKEG-GKTPKCVKKCEDGYKVPYAQDL 170
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+ G +YS+S++ I +EIY +GPVEGAFTV++D I Y++G +
Sbjct: 171 HHGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 214
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
+ +GKALGGHAIRILGWG
Sbjct: 215 ---------------------KHVAGKALGGHAIRILGWG 233
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/30 (80%), Positives = 24/30 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGCNGGFPG AW YW GIVSGG YGS
Sbjct: 93 CGFGCNGGFPGAAWNYWKTKGIVSGGPYGS 122
>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
Length = 344
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 99/191 (51%), Gaps = 40/191 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS-SNE 173
CRPY PC HH N T G TP+CVR+CQE Y+ Y +D G +Y + +
Sbjct: 189 CRPYPFHPCGHHGNETYYGECPEDGSTPECVRKCQEGYETEYHEDRVRGEDAYRLPIGSV 248
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I KEI +GPV AF VFDD Y+ G
Sbjct: 249 KAIQKEIMRNGPVVAAFIVFDDFSFYRKG------------------------------- 277
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ + +G GGHA++I+GWG + YW+IANSW++DWG++G F+++RG +
Sbjct: 278 ------IYAHVAGSPRGGHAVKIIGWGTEHGV--PYWIIANSWHSDWGEDGYFRMVRGIN 329
Query: 294 ECGIESSITAG 304
+CGIE+++ AG
Sbjct: 330 DCGIETNVVAG 340
>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 96/331 (29%), Positives = 135/331 (40%), Gaps = 110/331 (33%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G K A + +N A K +GV P +P I ++ LP FD+RT W
Sbjct: 59 GWKAAFNDRFANATVAEFKRLLGVKPTPKTEFLGVP--IVSHDISLKLPKEFDARTAWSQ 116
Query: 97 CPTIREIRDQGSCGSCW-------------------------------------GCRP-Y 118
C ++ I DQG CGSCW GC Y
Sbjct: 117 CTSVGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFLCGQGCNGGY 176
Query: 119 EIAP-------------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
IA C+ + + T P C+ + TPKC R+C + +++ ++
Sbjct: 177 PIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYP-TPKCARKCVSGNQL-WRESKHY 234
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G +Y V S+ IM E+Y++GPVE A
Sbjct: 235 GVSAYKVRSHPDDIMAEVYKNGPVEVA--------------------------------- 261
Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
FTV++D YKSG +GGHA++++GWG + E YWL+AN
Sbjct: 262 -----------FTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDG-EDYWLLANQ 309
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
WN WGD+G FKI RG +ECGIE + AG+P
Sbjct: 310 WNRSWGDDGYFKIRRGTNECGIEHGVVAGLP 340
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 19/25 (76%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GCNGG+P AWRY+ G+V+
Sbjct: 167 LCGQGCNGGYPIAAWRYFKHHGVVT 191
>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 551
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 120/283 (42%), Gaps = 97/283 (34%)
Query: 83 DLPANFDSRTKWPNC-PTIREIRDQGSCGSCWGCRPYEI--------------------- 120
+ P FDSR WP C I I+DQ +CGSCW +
Sbjct: 288 NYPVEFDSRKHWPQCEKVISFIKDQANCGSCWAVSSASVMSDRTCIATDGQFTTLLSDAE 347
Query: 121 -----APCEHHVNGTRP----------------------SC---------DASKGHTPKC 144
C + NG P +C + S+ TPKC
Sbjct: 348 LLSCCTSCGYGCNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPYPIPPCSNCSETRTPKC 407
Query: 145 VRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
+ C Y + +D ++G+ Y EKS+MK+I +GP+ +V++D + YK
Sbjct: 408 SKSCISTYPLSLNEDRHYGSTYYQFWLGEKSMMKDISLYGPIVAGMSVYEDFLHYK---- 463
Query: 205 FVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK 264
EG +T +SG LGGHA+RI+GWGE +
Sbjct: 464 --------------------------EGVYT-------QESGIFLGGHAVRIIGWGEQDN 490
Query: 265 SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
YWL+ANSWNT +G++GLFKI RG DECGIES ++AG K
Sbjct: 491 I--PYWLVANSWNTTFGEDGLFKIRRGFDECGIESYVSAGRAK 531
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 25/35 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG+GCNGG+P ++YWV SG+ +GG YGS K
Sbjct: 355 CGYGCNGGYPQRTFKYWVYSGMPTGGPYGSNDTCK 389
>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
Length = 358
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 96/322 (29%), Positives = 131/322 (40%), Gaps = 92/322 (28%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G A +N A K +GV P + N +P + LP FD+R+ W
Sbjct: 57 GWTAARNPYFANYTTAQFKHILGVKPTPHSVLNDVP--VKTYPRSLMLPKEFDARSAWSQ 114
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPSC-----DASKGHT 141
C TI I DQG CGSCW E C H VN C D G
Sbjct: 115 CNTIGTILDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGY 174
Query: 142 PKC-----VRE--------------------CQENYDVP------------YKKDLNFGA 164
P VR C+ Y P + + +F
Sbjct: 175 PIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSV 234
Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
+Y V+S+ IM E+Y++GPVE AFTV++D YKSG +
Sbjct: 235 NAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVY-------------------- 274
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
+ +G +GGHA++++GWG + + E YWL+AN WN WGD+G
Sbjct: 275 -----------------KHITGGMMGGHAVKLIGWGTTD-AGEDYWLLANQWNRGWGDDG 316
Query: 285 LFKILRGKDECGIESSITAGVP 306
FKI+RG +ECGIE + AG+P
Sbjct: 317 YFKIIRGTNECGIEEDVVAGMP 338
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 23/25 (92%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG+P MAWRY+V++G+V+
Sbjct: 165 MCGDGCDGGYPIMAWRYFVRNGVVT 189
>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
Length = 350
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 129/327 (39%), Gaps = 103/327 (31%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPD-----YNLPANRLPELIGYSEVDEDLPANFDSR 91
G K + SN K +GV P N+P P+ + +LP FD+R
Sbjct: 51 GWKAGMNSRFSNHTVGQFKRLLGVLPTPRNLLENVPVRTYPKGL-------NLPKQFDAR 103
Query: 92 TKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-VNGTRPSCDASKGHTPKC--- 144
WP C ++R I DQG CGSCW E C H+ VN T D +C
Sbjct: 104 KAWPQCTSVRTILDQGHCGSCWAFGAVEALSDRFCIHYKVNVTLSENDLVACCGFRCGDG 163
Query: 145 -------------------VRECQENYD------------------VPYKKDLN------ 161
EC +D V KD N
Sbjct: 164 CDGGYPLSAWQYFISTGVVTAECDPYFDEAGCQHPGCEPLYPTPQCVKQCKDENQNWGNS 223
Query: 162 --FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
F A +Y ++S IM E+Y GPVE F V++D YKSG +
Sbjct: 224 KRFSATAYRITSKPYDIMAEVYTKGPVEVDFLVYEDFAHYKSGVY--------------- 268
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
Y +G LGGHA++++GWG + + YWL+ANSWNT
Sbjct: 269 ----------------------KYITGDFLGGHAVKLIGWGTENGT--DYWLVANSWNTA 304
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WG++G FKI RG +EC IE + AG+P
Sbjct: 305 WGEDGYFKIARGSNECSIEEDVVAGMP 331
>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
Length = 255
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 89/264 (33%), Positives = 121/264 (45%), Gaps = 40/264 (15%)
Query: 62 PDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIA 121
PD++ P ++P+ NFD+RT WP CP+I IRDQ +CGSCW E
Sbjct: 6 PDFDYPNVKIPD-------------NFDARTNWPQCPSIAHIRDQSTCGSCWAFGAVEAM 52
Query: 122 PCEHHV--NGTRPSCDASKGHTPKCVRECQE--NYDVPYKKDLNFGAKSYSVSSNEKSIM 177
+ NGT +++ C+ +C N P F + S +
Sbjct: 53 SDRLCIASNGTVKDELSAEDMLSCCLVQCGMGCNGGFPTGAWRFFKMHGLTTESKYPYVF 112
Query: 178 KEIYEH---------GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
H GP + K R+ + + + I+ I N
Sbjct: 113 PPCEHHINKTHYKPCGPSQPTPKCVR--ASEKKPRYHGKSVYSVSPAKIQAEIMTNGP-- 168
Query: 229 GAEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
E AFTV+ D + Y+ SG LGGHAI+I+GWG + + KYWL+ANSWN DWG
Sbjct: 169 -VEAAFTVYQDFLAYQSGVYRHVSGPELGGHAIKIMGWGVE--AGNKYWLVANSWNEDWG 225
Query: 282 DNGLFKILRGKDECGIESSITAGV 305
D G FKI RG DECGIESS+ AG+
Sbjct: 226 DKGTFKIARGDDECGIESSVVAGM 249
>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 331
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 79/213 (37%), Positives = 100/213 (46%), Gaps = 54/213 (25%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G GS GC+PY + PCEHH G + C TP C +C ++ + YK +L F
Sbjct: 166 ITTGGLYGSKQGCQPYSLQPCEHHTEGNKVQCSTLDYDTPSCKHKCDDS-ALNYKSELTF 224
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G+ S + +I KEI +GPVE A
Sbjct: 225 GSGSVRNFYSVANIQKEILTNGPVEAA--------------------------------- 251
Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
F V+ D + YKSG + LGGHA+RILGWGE+ S YWL+ANS
Sbjct: 252 -----------FDVYSDFVNYKSGVYQHVAGEYLGGHAVRILGWGEE--SGVPYWLVANS 298
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
WN DWGD GLFKI RG +E G E SI A ++
Sbjct: 299 WNEDWGDKGLFKIRRGNNESGFEDSIVAAPAQV 331
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 26/32 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG+GC GG+P MAW YW+ +GI +GG YGSKQ
Sbjct: 145 CGYGCEGGYPTMAWSYWIDTGITTGGLYGSKQ 176
>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
Length = 280
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 75/202 (37%), Positives = 97/202 (48%), Gaps = 61/202 (30%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY APC + C K TP C CQ Y Y KD FG +Y+V+ N
Sbjct: 133 GCRPYPFAPCNSY------KCPEEK--TPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNV 184
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I EI +GPV GA
Sbjct: 185 AAIQTEIMTNGPVVGA-------------------------------------------- 200
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FT+++D+ YKSG + LGGHAI+I+GWG ++ YWLIANSW DWG+NG
Sbjct: 201 FTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT--QNGIPYWLIANSWGADWGENGFL 258
Query: 287 KILRGKDECGIESSITAGVPKL 308
K+ RG +ECGIES++ AG+PK+
Sbjct: 259 KMRRGVNECGIESAVVAGMPKV 280
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 37/64 (57%), Gaps = 9/64 (14%)
Query: 230 AEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
E +FTV++D +YK +G+ +G HAI+I+GWG + + YWLIANSW G
Sbjct: 6 VEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGTEHGT--DYWLIANSWGAQCGS 63
Query: 283 NGLF 286
F
Sbjct: 64 CWAF 67
>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 362
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 95/331 (28%), Positives = 134/331 (40%), Gaps = 110/331 (33%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G K + + +N A K +GV P +P I ++ LP FD+RT W
Sbjct: 61 GWKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVP--IVSHDISLKLPKEFDARTAWSQ 118
Query: 97 CPTIREIRDQGSCGSCWG-----------CRPYE----------IAPC------------ 123
C +I I DQG CGSCW C Y +A C
Sbjct: 119 CTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGY 178
Query: 124 --------EHH-------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
+HH + P C+ + TPKC R+C + +++ ++
Sbjct: 179 PIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYP-TPKCARKCVSGNQL-WRESKHY 236
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G +Y V S+ IM E+Y++GPVE A
Sbjct: 237 GVSAYKVRSHPDDIMAEVYKNGPVEVA--------------------------------- 263
Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
FTV++D YKSG +GGHA++++GWG + E YWL+AN
Sbjct: 264 -----------FTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDG-EDYWLLANQ 311
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
WN WGD+G FKI RG +ECGIE + AG+P
Sbjct: 312 WNRSWGDDGYFKIRRGTNECGIEHGVVAGLP 342
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 19/25 (76%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GCNGG+P AWRY+ G+V+
Sbjct: 169 LCGQGCNGGYPIAAWRYFKHHGVVT 193
>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
Length = 340
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 149/363 (41%), Gaps = 133/363 (36%)
Query: 36 YGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------------- 76
Y ++QA EK+ ++ I A+ K+W GV+ D L + +L+G
Sbjct: 16 YRTEQAYFLEKDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPDMF 74
Query: 77 ------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG---------------- 114
Y+ +P++FD+R KW C TI E+RDQG CGSCW
Sbjct: 75 KTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATD 134
Query: 115 ------CRPYEIAPCEH--------------------HVNGTRPSCDASKGHTP------ 142
P E+A C H H T + D+ +G P
Sbjct: 135 GEFNELLSPEELAFCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPC 194
Query: 143 -------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEH 183
+C R C N D+ +K+D ++ +Y ++ +I +I +
Sbjct: 195 PLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG--TIQNDILAY 252
Query: 184 GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILY 243
GP+E +F V+DD YKSG + N T
Sbjct: 253 GPIEASFEVYDDFPSYKSGVYTKMENATY------------------------------- 281
Query: 244 KSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
LGGHA++++GWGE+ YWL+ NSWN WGD GLFKI RG +ECGI++S T
Sbjct: 282 -----LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTG 334
Query: 304 GVP 306
GVP
Sbjct: 335 GVP 337
Score = 40.8 bits (94), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGC+GG+P AW + K G+V+GG Y S
Sbjct: 153 CGFGCSGGYPIRAWERFKKHGLVTGGNYDS 182
>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/299 (34%), Positives = 128/299 (42%), Gaps = 42/299 (14%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGV--HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI A + G +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFAEARRLTGAFRRKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ +CGSCW + G S H C +C +
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRYCTVGGVQQLRISAAHLMSCCEDCGDG----C 158
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT---- 212
K A Y VS S + Y P G F P TT
Sbjct: 159 KGGAPDSAWEYYVSHGLASSYCQPYPF-PHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDK 217
Query: 213 AMSLIKWTIRDNTSQLGAEGA----------------FTVFDDLILYK-------SGKAL 249
A+ LIK+ R N S + G F V+ D + YK SG L
Sbjct: 218 AIPLIKY--RGNNSYMLLNGEDDYKRELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVL 275
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
GGHA+RI+GWG+ + YW IANSW+TDWG NG F ILRG +ECGIES+ AG+P +
Sbjct: 276 GGHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLPAI 332
>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 341
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 99/200 (49%), Gaps = 57/200 (28%)
Query: 115 CRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+ Y APC HHV+ P+C TPKC + C Y ++ G+K+YSV +
Sbjct: 189 CQAYSFAPCAHHVDTPLYPACTGEL-PTPKCAKTCDSGSGQTYT--VHKGSKAYSVGKTQ 245
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++IM EI +GPVE A
Sbjct: 246 EAIMTEIQTNGPVEAA-------------------------------------------- 261
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D + YKSG KALGGHAI+I+GWG + + YW++ NSWN WGDNG F
Sbjct: 262 FTVYEDFLNYKSGVYKHVTGKALGGHAIKIVGWGVENNTP--YWIVVNSWNQTWGDNGTF 319
Query: 287 KILRGKDECGIESSITAGVP 306
KILRGK+ECGIE+ + +P
Sbjct: 320 KILRGKNECGIEAQVVTALP 339
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P A Y+VK+G+V+G Y +
Sbjct: 156 CGQGCNGGYPASAMSYYVKTGLVTGDLYNT 185
>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 120/303 (39%), Gaps = 104/303 (34%)
Query: 67 PANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGS----------------- 108
P + LP + E+ LP FD+ KWPNCPTI EI DQ S
Sbjct: 72 PVSVLPRVNFTEEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAATSMTDRY 131
Query: 109 -------------------CGSC------------WG-----------CRPYEIAPCEHH 126
CG C W C+PY C H+
Sbjct: 132 CTIHGVRGLRISAADLLACCGDCGYGCLGGDPDMAWAYFSSEGIASGRCQPYPFPRCSHY 191
Query: 127 VNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGP 185
N T P C A TP C C D K G KSYS+S E+ +E+Y GP
Sbjct: 192 TNSTTYPQCSALHLWTPTCNPACT---DSTISKKKYRGLKSYSLS-GEEDFRRELYFRGP 247
Query: 186 VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS 245
+ F V+ DL YK G + G GAF
Sbjct: 248 FQAVFDVWSDLFAYKHGVYKHVG-----------------------GAF----------- 273
Query: 246 GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
+G HA+RI+GWG +S YW IANSWN +WGD G F +LRG +ECGIE S +AGV
Sbjct: 274 ---IGAHAVRIVGWGN--QSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGV 328
Query: 306 PKL 308
P +
Sbjct: 329 PAI 331
>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
Length = 320
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 93/324 (28%), Positives = 135/324 (41%), Gaps = 98/324 (30%)
Query: 41 AEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTI 100
A N N P +HL+S G D PA + E +P NFD+R WP C +I
Sbjct: 39 AGPNFPPNTPHSHLRSLNGARDD---PAFFTDTETKNVTIPEQIPQNFDARIVWPQCESI 95
Query: 101 REIRDQGSCGSCW-----------------GCRPYEIAP---------CEHHVNGTRPS- 133
R+IR+QGSCGSCW + +E + C H G S
Sbjct: 96 RKIRNQGSCGSCWAFGAVETMSDRLCIASNATKKFEFSAQDLLACCKECGHGCGGGYSSR 155
Query: 134 ---------------CDASKGHTPKCVRECQEN-------------YDVPYKKDLNFGAK 165
+ S+G P V+ +++ Y Y +D +GA+
Sbjct: 156 AWQYWVTDGIVSGGDFNTSQGCHPYSVQAFRDSTTPNCSSFCTNPKYQKNYSEDKRYGAR 215
Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
SY ++ N + I EI GPV+ ++ V+DD Y++G
Sbjct: 216 SYRIAKNIEQIQAEIMTSGPVQASYVVYDDFYSYQNG----------------------- 252
Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD-NG 284
V+ ++ G G H+++ILGWG + + YWL+ANSW DWG G
Sbjct: 253 ----------VYQHVL----GNVSGRHSVKILGWGRENGT--DYWLVANSWGRDWGRLGG 296
Query: 285 LFKILRGKDECGIESSITAGVPKL 308
FK LRG++ C IES+I G PK+
Sbjct: 297 FFKFLRGENHCDIESNILGGDPKI 320
Score = 45.1 bits (105), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG+ AW+YWV GIVSGG + + Q
Sbjct: 144 CGHGCGGGYSSRAWQYWVTDGIVSGGDFNTSQ 175
>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
Length = 326
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 93/335 (27%), Positives = 140/335 (41%), Gaps = 108/335 (32%)
Query: 35 AYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSE---VDEDLPANFDSR 91
A + Q E ++++ + H +S +H +N P P+ +E V + P NFD+R
Sbjct: 38 AASTFQTENYAVTH-EKMHTRS---MHEKFNAP---FPDEFRATEREFVLDATPLNFDAR 90
Query: 92 TKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRP----------------- 132
T+WP C +++ IR+Q +CGSCW E+ + NGT+
Sbjct: 91 TRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCG 150
Query: 133 -SCDA----------------------SKGHTPKCVREC-----------------QENY 152
CD G P +R C Q Y
Sbjct: 151 EGCDGGFPYRAFQWWARRGVVTGGDYLGTGCKPYPIRPCNSDNCVNLQTPPCRLSCQPGY 210
Query: 153 DVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT 212
Y D N+G +Y V +I +IY +GPV AF V++D YKSG
Sbjct: 211 RTTYTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFIVYEDFEKYKSG---------- 260
Query: 213 AMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLI 272
+ + +G++ GGHA++++GWG + + YWL
Sbjct: 261 ---------------------------IYRHIAGRSKGGHAVKLIGWGTERGT--PYWLA 291
Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
NSW + WG++G F+ILRG DECGIES I AG+P+
Sbjct: 292 VNSWGSQWGESGTFRILRGVDECGIESRIVAGLPR 326
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 22/28 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG GC+GGFP A+++W + G+V+GG Y
Sbjct: 149 CGEGCDGGFPYRAFQWWARRGVVTGGDY 176
>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
Length = 347
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 70/195 (35%), Positives = 100/195 (51%), Gaps = 41/195 (21%)
Query: 115 CRPYEIAPCEHHVN-GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY PC +H + C TP C + CQ +Y VPY D FG+K+ ++ E
Sbjct: 193 CKPYPFYPCGYHAHLPYYGPCPDGMWPTPTCEKACQSDYTVPYNDDRIFGSKTIVLTGEE 252
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I +EI+ +GP+ +TV++D YK+G
Sbjct: 253 K-IKREIFNNGPLVATYTVYEDFAYYKNG------------------------------- 280
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ + G+A G HA++I+GWGE+ + KYWLIANSWNTDWG+NG F++LRG +
Sbjct: 281 ------IYMTGLGRATGAHAVKIIGWGEE--NGVKYWLIANSWNTDWGENGFFRMLRGTN 332
Query: 294 ECGIESSITAGVPKL 308
C IE S T G K+
Sbjct: 333 LCDIELSATGGTFKV 347
Score = 37.7 bits (86), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 20/31 (64%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC G P A+ Y ++ G+ SGG YG+K
Sbjct: 160 CGSGCTSGVPRQAFNYAIRKGVCSGGPYGTK 190
>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
Length = 341
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 78/198 (39%), Positives = 104/198 (52%), Gaps = 53/198 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY PC HH T ++ TPKCVR+CQ++Y YKKD + G +Y V ++E
Sbjct: 188 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSE 247
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +EI ++GPV GA
Sbjct: 248 KAIQREIMKNGPVVGA-------------------------------------------- 263
Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D YK +GKA GGHAI+I+GWG++ YWLIANSW+ DWG+NG F
Sbjct: 264 FTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE--GGVPYWLIANSWHNDWGENGYF 321
Query: 287 KILRGKDECGIESSITAG 304
+ILRG + CGIE ++ AG
Sbjct: 322 RILRGSNHCGIEENVVAG 339
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 32/46 (69%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
NR P ++ ED+P +FD+RTKWP C +++ IRDQ +CGSCW
Sbjct: 75 NRKPVFDDKNDKGEDIPESFDARTKWPKCSSLKHIRDQANCGSCWA 120
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 21/28 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG+GCNGG+P A+ Y+ K G V+GG Y
Sbjct: 156 CGYGCNGGWPIQAFNYFSKQGAVTGGDY 183
>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 403
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 131/315 (41%), Gaps = 94/315 (29%)
Query: 46 LSNIP--RAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
L+N P A K +GV P + N +P + LP FD+R+ W C TI I
Sbjct: 109 LNNPPVQTAQFKHILGVKPTPHSVLNDVP--VKTYPRSLMLPKEFDARSAWSQCNTIGTI 166
Query: 104 RDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPSC-----DASKGHTPKC---- 144
DQG CGSCW E C H VN C D G P
Sbjct: 167 LDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRY 226
Query: 145 -VRE--------------------CQENYDVP------------YKKDLNFGAKSYSVSS 171
VR C+ Y P + + +F +Y V+S
Sbjct: 227 FVRNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNS 286
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ IM E+Y++GPVE AFTV++D YKSG +
Sbjct: 287 DPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVY--------------------------- 319
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ +G +GGHA++++GWG + + E YWL+AN WN WGD+G FKI+RG
Sbjct: 320 ----------KHITGGMMGGHAVKLIGWGTTD-AGEDYWLLANQWNRGWGDDGYFKIIRG 368
Query: 292 KDECGIESSITAGVP 306
+ECGIE + AG+P
Sbjct: 369 TNECGIEEDVVAGMP 383
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 23/25 (92%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG+P MAWRY+V++G+V+
Sbjct: 210 MCGDGCDGGYPIMAWRYFVRNGVVT 234
>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
Length = 350
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 99/297 (33%), Positives = 127/297 (42%), Gaps = 97/297 (32%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG GC GG AW Y+ K GIVSGG Y KS G P P
Sbjct: 149 CGNGCEGGVLTRAWIYYKKIGIVSGGGY------------------KSKQGCQPYTIPPC 190
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
N L + G E +++P P C I I +Q C+ I
Sbjct: 191 NHL--VWGEIEQCKNIPMT-------PKCKNIPVIPEQ--------CKYIPI-------- 225
Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
TP+C ++C +NY V Y KD + G Y V +E I KEIYE+GPV
Sbjct: 226 ------------TPECEKKCNKNYKVCYSKDKHRGKSVYRVKKSE--IFKEIYEYGPVTS 271
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
FTV++D + YK G + Y SG+
Sbjct: 272 YFTVYEDFLNYKEG-------------------------------------IYNYTSGQK 294
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR-GKDECGIESSITAG 304
LG H+++I+GWGE+ K YWL ANS+NTDWGD G FKI+R G CGI ++ AG
Sbjct: 295 LGLHSVKIIGWGEERGIK--YWLAANSFNTDWGDKGFFKIIREGVGSCGISDNVVAG 349
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 38/74 (51%), Gaps = 2/74 (2%)
Query: 41 AEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTI 100
A +N P ++ + MG D + + LP+ P FD+R W NCPT+
Sbjct: 43 AGRNFPKKTPLKYIYNLMGTLSDSRM--DNLPQRNYTFSRKTKYPNQFDAREHWKNCPTL 100
Query: 101 REIRDQGSCGSCWG 114
++IRDQG CGSCW
Sbjct: 101 KDIRDQGGCGSCWA 114
>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 133/331 (40%), Gaps = 110/331 (33%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G K + + +N A K +GV P +P I ++ LP FD+RT W
Sbjct: 58 GWKASLNDRFANATVAEFKRLLGVKPTPKTAYLGVP--IVRHDLSLKLPKEFDARTAWSQ 115
Query: 97 CPTIREIRDQGSCGSCWG-----------CRPYEI------------------------- 120
C +I I DQG CGSCW C Y +
Sbjct: 116 CTSIPRILDQGHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVVACCGLLCGLGCNGGF 175
Query: 121 ---------------APCEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
C+ + + T P C+ TPKCVR+C + + + ++
Sbjct: 176 PMGAWLYFKYHGVVTEECDPYFDNTGCSHPGCEPGYP-TPKCVRKCVSENQL-WGESKHY 233
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G +Y ++ + + IM E+Y++GPVE A
Sbjct: 234 GVSAYRINHDPQDIMAEVYKNGPVEVA--------------------------------- 260
Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
FTV++D YKSG +GGHA++++GWG + E YWL+AN
Sbjct: 261 -----------FTVYEDFAHYKSGVYKHITGTKIGGHAVKLIGWGTSDDG-EDYWLLANQ 308
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
WN WGD+G FKI RG +ECGIE + AG+P
Sbjct: 309 WNRSWGDDGYFKIRRGTNECGIEHGVVAGLP 339
>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 78/198 (39%), Positives = 105/198 (53%), Gaps = 53/198 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY PC HH T ++ TPKCVR+CQ++Y YKKD + G +Y V ++E
Sbjct: 100 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSE 159
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +EI ++GPV GA
Sbjct: 160 KAIQREIMKNGPVVGA-------------------------------------------- 175
Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D YK +GKA GGHAI+I+GWG++ + YWLIANSW+ DWG+NG F
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE--NGVPYWLIANSWHNDWGENGYF 233
Query: 287 KILRGKDECGIESSITAG 304
+ILRG + CGIE ++ AG
Sbjct: 234 RILRGSNHCGIEENVVAG 251
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 25/31 (80%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
D+P +FD+RTKWP C +++ I DQ +CGSCW
Sbjct: 1 DIPESFDARTKWPKCSSLKHIHDQANCGSCW 31
Score = 38.1 bits (87), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 21/28 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG+GCNGG+P A+ Y+ K G V+GG Y
Sbjct: 68 CGYGCNGGWPIQAFNYFSKQGAVTGGDY 95
>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
Length = 340
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 149/364 (40%), Gaps = 133/364 (36%)
Query: 35 AYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG-------------- 76
Y ++QA E++ ++ I A+ K+W GV+ D L + +L+G
Sbjct: 15 VYRTEQAYFLEEDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPDM 73
Query: 77 -------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG--------------- 114
Y+ +P++FD+R KW C TI E+RDQG CGSCW
Sbjct: 74 FKTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIAT 133
Query: 115 -------CRPYEIAPCEH--------------------HVNGTRPSCDASKGHTP----- 142
P E+A C H H T + D+ +G P
Sbjct: 134 DGEFNELLSPEELAFCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPP 193
Query: 143 --------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYE 182
+C R C N D+ +K+D ++ +Y ++ +I +I
Sbjct: 194 CPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG--TIQNDILA 251
Query: 183 HGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242
+GP+E +F V+DD YKSG + N T
Sbjct: 252 YGPIEASFEVYDDFPSYKSGVYTKMENATY------------------------------ 281
Query: 243 YKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
LGGHA++++GWGE+ YWL+ NSWN WGD GLFKI RG +ECGI++S T
Sbjct: 282 ------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTT 333
Query: 303 AGVP 306
GVP
Sbjct: 334 GGVP 337
Score = 40.8 bits (94), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGC+GG+P AW + K G+V+GG Y S +
Sbjct: 153 CGFGCSGGYPIRAWERFKKHGLVTGGNYDSGE 184
>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 119/303 (39%), Gaps = 104/303 (34%)
Query: 67 PANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGS----------------- 108
P + LP + E+ LP FD+ KWPNCPTI EI DQ S
Sbjct: 72 PVSVLPRVNFTEEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAATSMTDRY 131
Query: 109 -------------------CGSC------------WG-----------CRPYEIAPCEHH 126
CG C W C+PY C H+
Sbjct: 132 CTIHGVRGLRISAADLLACCGDCGYGCLGGDPDMAWAYFSSEGIASGRCQPYPFPRCSHY 191
Query: 127 VNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGP 185
N T P C A TP C C D K G KSYS S E+ +E+Y GP
Sbjct: 192 TNSTTYPQCSALHLWTPTCNPACT---DSTISKKKYRGLKSYSFS-GEEDFRRELYFRGP 247
Query: 186 VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS 245
+ F V+ DL YK G + G GAF
Sbjct: 248 FQAVFDVWSDLFAYKHGVYKHVG-----------------------GAF----------- 273
Query: 246 GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
+G HA+RI+GWG +S YW IANSWN +WGD G F +LRG +ECGIE S +AGV
Sbjct: 274 ---IGAHAVRIVGWGN--QSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGV 328
Query: 306 PKL 308
P +
Sbjct: 329 PAI 331
>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
Length = 360
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 127/305 (41%), Gaps = 106/305 (34%)
Query: 73 ELIGYSEVD--EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------------- 113
E++ ++D E++P +FD+R KWP C +I IRDQ CGSCW
Sbjct: 77 EMLKEEDMDFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSN 136
Query: 114 ---------------------GC------RPYE----IAPCEHHVNGTRPSC-------- 134
GC R +E C + GT+ SC
Sbjct: 137 GTIKVLLSDTDILACCPNCGAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPC 196
Query: 135 -DASKGH-------TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPV 186
D S G TPKC + CQ Y Y D + +Y + NE I EI +GPV
Sbjct: 197 KDESYGKCPKDSFPTPKCRKICQYKYSKKYADDKYYANSAYRIPQNETWIKLEIMRNGPV 256
Query: 187 EGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG 246
+F ++ D Y+ G + G G
Sbjct: 257 TASFRIYPDFGFYEKGVYVTSG-------------------------------------G 279
Query: 247 KALGGHAIRILGWGEDEK--SKEKYWLIANSWNTDWGD-NGLFKILRGKDECGIESSITA 303
+ LGGHAI+I+GWG ++ + YWLIANSW TDWG+ NG F+ILRG++ C IE + A
Sbjct: 280 RELGGHAIKIIGWGTEKVNGTDLPYWLIANSWGTDWGENNGYFRILRGQNHCQIEQKVIA 339
Query: 304 GVPKL 308
G+ K+
Sbjct: 340 GMIKV 344
Score = 37.7 bits (86), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG GC GG AW Y+ +G+ +GG YG+K + K
Sbjct: 155 CGAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCK 189
>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
Length = 334
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 149/364 (40%), Gaps = 133/364 (36%)
Query: 35 AYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG-------------- 76
Y ++QA E++ ++ I A+ K+W GV+ D L + +L+G
Sbjct: 12 VYRTEQAYFLEEDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPDM 70
Query: 77 -------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG--------------- 114
Y+ +P++FD+R KW C TI E+RDQG CGSCW
Sbjct: 71 FKTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIAT 130
Query: 115 -------CRPYEIAPCEH--------------------HVNGTRPSCDASKGHTP----- 142
P E+A C H H T + D+ +G P
Sbjct: 131 DGEFNELLSPEELAFCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPP 190
Query: 143 --------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYE 182
+C R C N D+ +K+D ++ +Y ++ +I +I
Sbjct: 191 CPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG--TIQNDILA 248
Query: 183 HGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242
+GP+E +F V+DD YKSG + N T
Sbjct: 249 YGPIEASFEVYDDFPSYKSGVYTKMENATY------------------------------ 278
Query: 243 YKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
LGGHA++++GWGE+ YWL+ NSWN WGD GLFKI RG +ECGI++S T
Sbjct: 279 ------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTT 330
Query: 303 AGVP 306
GVP
Sbjct: 331 GGVP 334
Score = 40.8 bits (94), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGC+GG+P AW + K G+V+GG Y S +
Sbjct: 150 CGFGCSGGYPIRAWERFKKHGLVTGGNYDSGE 181
>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
Length = 209
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 40/188 (21%)
Query: 119 EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
E P + + P C+ + TPKCVR+C + + +KK +F +YSV S+ IM
Sbjct: 42 ECDPYFDQIGCSHPGCEPAY-QTPKCVRKCVKGNQI-WKKSKHFSVNAYSVKSDPYDIMA 99
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+Y++GPVE AFTV++D YKSG +
Sbjct: 100 EVYKNGPVEVAFTVYEDFAHYKSGVY---------------------------------- 125
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
+ +G LGGHA++++GWG ++ E YWLIAN WN WGD+G F I RG +ECGIE
Sbjct: 126 ---KHITGSQLGGHAVKLIGWGTTDEG-EDYWLIANQWNRSWGDDGYFMIRRGTNECGIE 181
Query: 299 SSITAGVP 306
+TAG+P
Sbjct: 182 EDVTAGLP 189
>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
Length = 324
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 90/320 (28%), Positives = 134/320 (41%), Gaps = 98/320 (30%)
Query: 41 AEKNSLSNIPRA--HLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
A KN P L +G++ D N+ LP + + E +P +FD+R +WP C
Sbjct: 44 ARKNFEGRTPEQLKALADVIGINRDPNV---TLP--VVFHEAISGIPDSFDAREQWPFCE 98
Query: 99 TIREIRDQGSCGSCW--------------------------------------GCRP-YE 119
+IR IRD+G+CGSCW GCR +
Sbjct: 99 SIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAEEVVSCCTACGGGCRGGFL 158
Query: 120 IAPCEHHVN-------------GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
P ++ V G +P A G TP+C + C Y+ ++KDL +
Sbjct: 159 NEPYKYWVTNGIPSGGDYGSKLGCKPYTAAVSGETPQCQKACVSGYEKSWEKDLRHATSA 218
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y V+ I +EI ++GPV V++D Y +G
Sbjct: 219 YQVNGGVLQIQREILDNGPVTAYMEVYEDFYSYGTG------------------------ 254
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ + SG +GGHA++I+GWG + + YW+ ANSW T +G++G F
Sbjct: 255 -------------IYQHTSGSFVGGHAVKIIGWGSE--NDVPYWIAANSWGTGFGEDGFF 299
Query: 287 KILRGKDECGIESSITAGVP 306
+ILRG + GIES I AG P
Sbjct: 300 RILRGSNCAGIESYIVAGYP 319
Score = 41.6 bits (96), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 22/31 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GGF ++YWV +GI SGG YGSK
Sbjct: 149 CGGGCRGGFLNEPYKYWVTNGIPSGGDYGSK 179
>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 78/198 (39%), Positives = 104/198 (52%), Gaps = 53/198 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY PC HH T ++ TPKCVR+CQ++Y YKKD + G +Y V ++E
Sbjct: 100 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSE 159
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +EI ++GPV GA
Sbjct: 160 KAIQREIMKNGPVVGA-------------------------------------------- 175
Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D YK +GKA GGHAI+I+GWG++ YWLIANSW+ DWG+NG F
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE--GGVPYWLIANSWHNDWGENGYF 233
Query: 287 KILRGKDECGIESSITAG 304
+ILRG + CGIE ++ AG
Sbjct: 234 RILRGSNHCGIEENVVAG 251
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 26/31 (83%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
D+P +FD+RTKWP C +++ IRDQ +CGSCW
Sbjct: 1 DIPESFDARTKWPKCSSLKHIRDQANCGSCW 31
Score = 38.1 bits (87), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 21/28 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG+GCNGG+P A+ Y+ K G V+GG Y
Sbjct: 68 CGYGCNGGWPIQAFNYFSKQGAVTGGDY 95
>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/205 (36%), Positives = 99/205 (48%), Gaps = 53/205 (25%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GCRPY PC HH N T TPKC R CQ +Y Y D ++G +
Sbjct: 184 GDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDA 243
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y V + K+I +EI ++GPV GA
Sbjct: 244 YEVPHSVKAIQREIMKNGPVVGA------------------------------------- 266
Query: 227 QLGAEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
FTV++D YK +G+A GGHAI+I+GWG + + YWLIANSW+ D
Sbjct: 267 -------FTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVE--NDVPYWLIANSWHND 317
Query: 280 WGDNGLFKILRGKDECGIESSITAG 304
WG+ G F+++RG +ECGIE + AG
Sbjct: 318 WGEEGYFRMIRGINECGIEQEVVAG 342
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 32/46 (69%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
NR P + + +D+P +FD+RT WPNC +IR IRDQ +CGSCW
Sbjct: 79 NRKPAVENEDDEGDDIPESFDARTHWPNCTSIRHIRDQANCGSCWA 124
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 23/31 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
C +GC+GG+P +A+ ++ G V+GG YGSK
Sbjct: 159 CSYGCDGGWPILAFDFYTYEGAVTGGDYGSK 189
>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
Length = 293
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 94/322 (29%), Positives = 131/322 (40%), Gaps = 110/322 (34%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
+N A K +GV P +P I ++ LP FD+RT W C +I I D
Sbjct: 1 FANATVAEFKRLLGVKPTPKTEFLGVP--IVSHDISLKLPKEFDARTAWSQCTSIGRILD 58
Query: 106 QGSCGSCW-------------------------------------GCRP-YEIAP----- 122
QG CGSCW GC Y IA
Sbjct: 59 QGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFK 118
Query: 123 --------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
C+ + + T P C+ + TPKC R+C + +++ ++G +Y V S
Sbjct: 119 HHGVVTEECDPYFDNTGCSHPGCEPAYP-TPKCARKCVSGNQL-WRESKHYGVSAYKVRS 176
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ IM E+Y++GPVE A
Sbjct: 177 HPDDIMAEVYKNGPVEVA------------------------------------------ 194
Query: 232 GAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
FTV++D YKSG +GGHA++++GWG + E YWL+AN WN WGD+G
Sbjct: 195 --FTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDG-EDYWLLANQWNRSWGDDG 251
Query: 285 LFKILRGKDECGIESSITAGVP 306
FKI RG +ECGIE + AG+P
Sbjct: 252 YFKIRRGTNECGIEHGVVAGLP 273
Score = 38.1 bits (87), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 19/25 (76%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GCNGG+P AWRY+ G+V+
Sbjct: 100 LCGQGCNGGYPIAAWRYFKHHGVVT 124
>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 75/205 (36%), Positives = 99/205 (48%), Gaps = 53/205 (25%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GCRPY PC HH N T TPKC R CQ +Y Y D ++G +
Sbjct: 184 GDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDA 243
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y V + K+I +EI ++GPV GA
Sbjct: 244 YEVPHSVKAIQREIMKNGPVVGA------------------------------------- 266
Query: 227 QLGAEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
FTV++D YK +G+A GGHAI+I+GWG + + YWLIANSW+ D
Sbjct: 267 -------FTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVE--NDVPYWLIANSWHND 317
Query: 280 WGDNGLFKILRGKDECGIESSITAG 304
WG+ G F+++RG +ECGIE + AG
Sbjct: 318 WGEEGYFRMIRGINECGIEQEVVAG 342
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 32/46 (69%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
NR P + + +D+P +FD+RT WPNC +IR IRDQ +CGSCW
Sbjct: 79 NRKPAVENEDDEGDDIPESFDARTHWPNCTSIRHIRDQANCGSCWA 124
Score = 40.8 bits (94), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 24/31 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG+GC+GG+P +A+ ++ G V+GG YGSK
Sbjct: 159 CGYGCDGGWPILAFDFYTYEGAVTGGDYGSK 189
>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
Length = 342
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 98/195 (50%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y CEH G P C TP+C++ C + ++ Y+KD SY+V E
Sbjct: 183 GCRSYPFPSCEHRGKGQYPPCPHQLYPTPECIKRC-DTKEIDYEKDKTRANISYNVYPAE 241
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+++MKEI GPV V++DL+ YKSG +
Sbjct: 242 QAVMKEIMLRGPVGAILHVYEDLLDYKSGVY----------------------------- 272
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
F V+ G LG H IRILGWGE++ YWL+ANSWN DWG+ G ++LR ++
Sbjct: 273 FHVW--------GGHLGEHGIRILGWGEEDGVP--YWLVANSWNEDWGEKGYMRVLRWRN 322
Query: 294 ECGIESSITAGVPKL 308
ECGI +TAG+P L
Sbjct: 323 ECGIVDQVTAGLPDL 337
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 27/34 (79%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++ LP +FD+R WP+CP+I EIRDQ SCGSCW
Sbjct: 83 NQHLPESFDARANWPHCPSISEIRDQSSCGSCWA 116
>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
Length = 376
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 94/341 (27%), Positives = 140/341 (41%), Gaps = 113/341 (33%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G + A LSN K +G P +P +I + + + LP FD+RT WP+
Sbjct: 56 GWEAAMNPQLSNFTVGQFKYLLGAKPTPKKELMGVP-MISHPKTLK-LPKEFDARTAWPH 113
Query: 97 CPTIREIRDQ-----------------GSCGSCWGCRPYE-------------------- 119
C TI +I Q G CGSCW E
Sbjct: 114 CSTIGKILGQLLSFYNIFSIFFFLFLEGHCGSCWAFGAVESLSDRFCIHFGMNISLSVND 173
Query: 120 -IAPC--------------------EHH-------------VNGTRPSCDASKGHTPKCV 145
+A C HH + + P C+ TPKCV
Sbjct: 174 LLACCGFLCGDGCDGGYPMYAWRYFVHHGVVTEECDPYFDNIGCSHPGCEPGFP-TPKCV 232
Query: 146 RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFF 205
R+C + + +++ ++ +Y +SS+ +M E+Y++GPVE +FTV++D YKSG +
Sbjct: 233 RKCIDKNQL-WRQSKHYSVNAYRISSDPHDVMAEVYKNGPVEVSFTVYEDFAHYKSGVY- 290
Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS 265
+ +G+ +GGHA++++GWG +
Sbjct: 291 ------------------------------------KHITGEVMGGHAVKLIGWGTSDNG 314
Query: 266 KEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
E YWL+AN WN WGD+G FKI RG +ECGIE AG+P
Sbjct: 315 -EDYWLLANQWNRGWGDDGYFKIRRGTNECGIEDDAVAGLP 354
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GC+GG+P AWRY+V G+V+
Sbjct: 181 LCGDGCDGGYPMYAWRYFVHHGVVT 205
>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
Length = 334
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 149/363 (41%), Gaps = 133/363 (36%)
Query: 36 YGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------------- 76
Y ++QA E++ ++ I A+ K+W GV+ D L + +L+G
Sbjct: 13 YQTEQAYFLEEDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPDMF 71
Query: 77 ------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG---------------- 114
Y+ +P+NFD+R KW C TI E+RDQG CGSCW
Sbjct: 72 KTHDEAYNNWSNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATD 131
Query: 115 ------CRPYEIAPCEH--------------------HVNGTRPSCDASKGHTP------ 142
P E+A C H H T + D+ +G P
Sbjct: 132 GEFNELLSPEELAFCCHKCGFGCSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPC 191
Query: 143 -------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEH 183
+C R C N ++ +K+D ++ +Y ++ +I ++ +
Sbjct: 192 PLDEYGNNTCSGKPAEKNHRCTRMCYGNQNLDFKEDHHYTRDAYYLTYG--TIQYDVLAY 249
Query: 184 GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILY 243
GP+E +F V+DD YKSG + N T
Sbjct: 250 GPIEASFEVYDDFPSYKSGVYTKMENATY------------------------------- 278
Query: 244 KSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
LGGHA++++GWGE+ YWL+ NSWN WGD GLFKI RG +ECGI++S T
Sbjct: 279 -----LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTG 331
Query: 304 GVP 306
GVP
Sbjct: 332 GVP 334
Score = 38.5 bits (88), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGC+GG P AW + K G+V+GG Y S
Sbjct: 150 CGFGCSGGNPIKAWERFQKHGLVTGGNYDS 179
>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 339
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 93/193 (48%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + +GT +C R C N D+ Y D F Y ++
Sbjct: 184 GCEPYRVPPCPRNEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLTYG- 242
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +F V+DD YKSG + N T
Sbjct: 243 -SIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNAT---------------------- 279
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
LGGHA++++GWG +E YWL+ NSW+ WGDNGLFKI RG D
Sbjct: 280 --------------KLGGHAVKLIGWGVEEGI--PYWLMVNSWSAQWGDNGLFKIRRGTD 323
Query: 294 ECGIESSITAGVP 306
ECGI+S+ TAGVP
Sbjct: 324 ECGIDSATTAGVP 336
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
+P FD+R +W +C TI E+RDQG CGSCW
Sbjct: 87 IPRTFDARRRWRHCKTIGEVRDQGYCGSCWA 117
>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
pisum]
Length = 169
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 93/193 (48%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + +GT +C R C N D+ Y D F Y ++
Sbjct: 14 GCEPYRVPPCPRNEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLTYG- 72
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +F V+DD YKSG + N T
Sbjct: 73 -SIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNAT---------------------- 109
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
LGGHA++++GWG +E YWL+ NSW+ WGDNGLFKI RG D
Sbjct: 110 --------------KLGGHAVKLIGWGVEEGI--PYWLMVNSWSAQWGDNGLFKIRRGTD 153
Query: 294 ECGIESSITAGVP 306
ECGI+S+ TAGVP
Sbjct: 154 ECGIDSATTAGVP 166
>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 101/298 (33%), Positives = 128/298 (42%), Gaps = 41/298 (13%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI A + G + +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFAEARRLTGARIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ +CGSCW H G S H C +C + D Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLMSCCEDCGDGCDGGY 162
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT---- 212
+ Y VS S + Y P G F P TT
Sbjct: 163 PGT----SWEYYVSHGLASSYCQPYPF-PHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDK 217
Query: 213 AMSLIKWTIRDNTS-----------QLGAEGAFT----VFDDLILYK-------SGKALG 250
A+ LIK+ R N S +L G F V+ D + YK SG LG
Sbjct: 218 AIPLIKY--RGNHSYEVHGEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLG 275
Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
GHA+RI+GWG+ + YW IANSW+TDWG NG LRG +ECGIE++ AG P +
Sbjct: 276 GHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSPAI 331
>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
Length = 557
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/210 (35%), Positives = 100/210 (47%), Gaps = 53/210 (25%)
Query: 105 DQGSCGSCWGCRPYEIAPCEHHVN---GTRPSCDASKGHTPKCVRECQENYDVPYKKDLN 161
D G+ C+PYE PC HHV+ P+C + TP+C+ EC E N
Sbjct: 389 DYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEYPTPECLSECSET---------N 439
Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
F SY +K + +E Y +E + D++ Y S
Sbjct: 440 FSGGSYG---EDKKMAREAYSLAGIE---NIQRDMMKYGS-------------------- 473
Query: 222 RDNTSQLGAEGAFTVFDDLILY-------KSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
AF+VF D + Y +SG +GGHA++++GWG DE S E YWLIAN
Sbjct: 474 --------VTAAFSVFSDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYWLIAN 525
Query: 275 SWNTDWGDNGLFKILRGKDECGIESSITAG 304
SWN WG+ GLF+ILRG +ECGIE I AG
Sbjct: 526 SWNPSWGEGGLFRILRGVNECGIEGQIVAG 555
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 66 LPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT-IREIRDQGSCGSCW 113
+P R + S DED+PANFD+R +P C + I +RDQ CGSCW
Sbjct: 262 VPGRRRLTPVAQSSSDEDIPANFDAREAFPECASIIGRVRDQSDCGSCW 310
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 23/30 (76%), Gaps = 2/30 (6%)
Query: 9 CGF--GCNGGFPGMAWRYWVKSGIVSGGAY 36
CG GCNGG PG AW+++ K+G+V+GG Y
Sbjct: 361 CGLSMGCNGGQPGSAWKWFTKTGVVTGGDY 390
>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
sinensis]
gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 92/200 (46%), Gaps = 54/200 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y CEHHV G P C TP+CV+ C + + Y KD SY++ S+E
Sbjct: 183 GCRSYPFPKCEHHVQGHYPPCPHQYYPTPECVQHC-DTPGIDYVKDKTRANMSYNIYSSE 241
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
IMKEI GPVE
Sbjct: 242 ILIMKEIMLRGPVEAV-------------------------------------------- 257
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D + YK G L HAIRILGWGE+ YWLIANSWN DWG+ G
Sbjct: 258 FTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWGEE--GDVPYWLIANSWNEDWGEKGYM 315
Query: 287 KILRGKDECGIESSITAGVP 306
K LRG +ECGIE +TAG+P
Sbjct: 316 KFLRGLNECGIEDDVTAGLP 335
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/31 (74%), Positives = 26/31 (83%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
LP NFD+RTKWP+CP+I EIRDQ CGSCW
Sbjct: 86 LPKNFDARTKWPHCPSISEIRDQSGCGSCWA 116
Score = 43.9 bits (102), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG+GC+GG+P +AW YW GIV+GG+
Sbjct: 151 CGYGCSGGYPAVAWDYWGAHGIVTGGS 177
>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
pisum]
gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
Length = 339
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 93/193 (48%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + +G K +C R C N D+ Y D F Y ++
Sbjct: 184 GCEPYRVPPCPRNEDGKSSCAGKPKEKNHRCTRMCYGNQDLDYDDDHRFTRDFYYLTYG- 242
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +F V+DD YKSG + N T
Sbjct: 243 -SIQKDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNAT---------------------- 279
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
LGGHA++++GWG +E + YWL+ NSWN WGDNGLFKI RG D
Sbjct: 280 --------------KLGGHAVKLIGWGVEEGT--PYWLMVNSWNAQWGDNGLFKIRRGTD 323
Query: 294 ECGIESSITAGVP 306
EC I+S+ TAGVP
Sbjct: 324 ECRIDSATTAGVP 336
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS-KQAEKNSLSNIPR 51
CG GCNGG+P AW+Y+ G+V+GG Y S K E + PR
Sbjct: 152 CGHGCNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPR 195
>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
pulchellus]
Length = 338
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 94/194 (48%), Gaps = 39/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY I + G P P C REC+++Y Y +D ++G K Y++S +E
Sbjct: 180 GCQPYSIHTTRYTTTGLLPPPINDLSPMPPCKRECRKSYGKKYSEDKHYGEKVYTLSGDE 239
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I EI+++GPVE F V+ D YKSG + A S ++
Sbjct: 240 AQIKTEIFKNGPVEADFAVYADFYSYKSGVY-------QAHSRVR--------------- 277
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
G HAIRILGWG + + YWL ANSW WGD G FKI RG +
Sbjct: 278 ---------------CGSHAIRILGWGTE--NGVPYWLAANSWTEHWGDKGYFKIRRGNN 320
Query: 294 ECGIESSITAGVPK 307
ECGIE I AG+PK
Sbjct: 321 ECGIEEDINAGIPK 334
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 46/75 (61%), Gaps = 4/75 (5%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+A +N N+P +++K MGV + RLP L+ +S + ++LP +FD+R W C +
Sbjct: 43 KAGRNFDKNVPFSYIKGLMGVARN---KTRRLPTLM-HSSIPDNLPESFDARQHWRKCNS 98
Query: 100 IREIRDQGSCGSCWG 114
I IRDQ SCG+CW
Sbjct: 99 IHVIRDQSSCGACWA 113
Score = 38.9 bits (89), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
C GC GG P AW ++ + GIV+GG YG++
Sbjct: 148 CRTGCKGGVPSYAWMFYKEKGIVTGGLYGTE 178
>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 98/335 (29%), Positives = 134/335 (40%), Gaps = 118/335 (35%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED----LPANFDSRT 92
G K A + SN A K +GV P +G V D LP FD+RT
Sbjct: 58 GWKAAINDRFSNATVAEFKRLLGVKP------TPKKHFLGVPIVSHDPSLKLPKAFDART 111
Query: 93 KWPNCPTIREIRDQGSCGSCW-------------------------------------GC 115
WP C +I I DQG CGSCW GC
Sbjct: 112 AWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGC 171
Query: 116 RP-YEIAP-------------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKK 158
Y IA C+ + + T P C+ + TPKC R+C + + + +
Sbjct: 172 DGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAY-PTPKCSRKCVSDNKL-WSE 229
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
++ +Y+V SN + IM E+Y++GPV
Sbjct: 230 SKHYSVSTYTVKSNPQDIMAEVYKNGPV-------------------------------- 257
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
E +FTV++D YKSG +GGHA++++GWG + E YWL
Sbjct: 258 ------------EVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEG-EDYWL 304
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+AN WN WGD+G F I RG +ECGIE AG+P
Sbjct: 305 MANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLP 339
>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
Length = 313
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 93/312 (29%), Positives = 132/312 (42%), Gaps = 110/312 (35%)
Query: 54 LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+ S +G N P+ +P L ++ + PA+FDSRT W NC TI I +Q CGSCW
Sbjct: 52 IGSLLGFKKSLNRPS--IPVL--NADPNIKAPASFDSRTAWSNCTTIGYIENQARCGSCW 107
Query: 114 GCRPYE--------------------IAPCEHHVNG------------------------ 129
E + C+ +G
Sbjct: 108 AFGAVESAQDRICIHKGLDVQLSFLDLVTCDQSDDGCEGGDDVSAWNFLKKQGVVTQECK 167
Query: 130 --TRPSCDASKG------HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
T P+C ++ +TP CV++C+ N + Y +D + AK YS++S E +IM+EI
Sbjct: 168 PYTIPTCPPAQQPCLNFVNTPNCVKQCESNSTLIYSQDKHKMAKIYSINSVE-AIMQEIS 226
Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
+GPVE F+V++D +
Sbjct: 227 TNGPVEAC--------------------------------------------FSVYEDFL 242
Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
YKSG K LGGH ++I G+G + YW +ANSW T WGDNG+F I RG DE
Sbjct: 243 GYKSGVYQHTTGKFLGGHCVKIFGYGT--LNGVNYWSVANSWTTSWGDNGIFLIKRGSDE 300
Query: 295 CGIESSITAGVP 306
CGIE + AG+P
Sbjct: 301 CGIEDEVVAGIP 312
>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
Full=Cysteine protease-related 3; Flags: Precursor
gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
Length = 370
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 124/290 (42%), Gaps = 100/290 (34%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTR------ 131
V E LP FD+R KWP+C TI+ IR+Q +CGSCW E+ + NGT+
Sbjct: 88 VPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISV 147
Query: 132 ----PSCDASKGHTPK----------------------------------CVREC----- 148
C + G+ K C + C
Sbjct: 148 EDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPYSFAPCTKNCPESTT 207
Query: 149 -------QENYDVPYKK-DLNFGAKSYSVSSNEK--SIMKEIYEHGPVEGAFTVFDDLIL 198
Q +Y K D ++GA +Y V++ + I EIY +GPVE ++ V++D
Sbjct: 208 PSCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYH 267
Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILG 258
YKSG + Y SGK +GGHA++I+G
Sbjct: 268 YKSGVYH-------------------------------------YTSGKLVGGHAVKIIG 290
Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
WG + + YWLIANSW T +G+ G FKI RG +EC IE ++ AG+ KL
Sbjct: 291 WGVE--NGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAKL 338
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
CG+GC GG+ A R+W SG V+GG YG
Sbjct: 158 CGYGCKGGYSIEALRFWASSGAVTGGDYG 186
>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 91/193 (47%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC G +C R C N D+ Y D F Y ++
Sbjct: 185 GCEPYRVPPCPQDEEGKSSCAGKPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLTYG- 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +F V+DD YKSG + N T
Sbjct: 244 -SIQKDVMNYGPIEASFDVYDDFPSYKSGVYQRTPNAT---------------------- 280
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
LGGHA++++GWG +E + YWL+ NSWN WGDNGLFKI RG D
Sbjct: 281 --------------KLGGHAVKLIGWGVEEGT--PYWLMVNSWNAQWGDNGLFKIRRGTD 324
Query: 294 ECGIESSITAGVP 306
ECGI+S+ TAGVP
Sbjct: 325 ECGIDSAATAGVP 337
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
+P FD+R +W +C TI E+RDQG CGSCW
Sbjct: 88 IPRTFDARRRWRHCKTIGEVRDQGHCGSCWA 118
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 19/32 (59%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGG+P AW+Y+ GIV+GG Y S +
Sbjct: 153 CGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGE 184
>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 508
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/197 (39%), Positives = 94/197 (47%), Gaps = 54/197 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y CEHHV G P C TP+CV++C + DV Y +D SY++ ++E
Sbjct: 183 GCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQC-DTPDVGYLEDKTRANMSYNIYASE 241
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SIMKEI GPVE
Sbjct: 242 ISIMKEIMLRGPVEAI-------------------------------------------- 257
Query: 234 FTVFDDLILYKSG---KALG----GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FT+++D + Y SG ALG GHA+RILGWGE YWLIANSWN DWG+ G
Sbjct: 258 FTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGE--LGNVPYWLIANSWNEDWGEEGYM 315
Query: 287 KILRGKDECGIESSITA 303
K LRG +ECGIE +TA
Sbjct: 316 KFLRGYNECGIEDDVTA 332
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 21/31 (67%), Positives = 24/31 (77%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
LP NFD+R WP+C +I EIRDQ SCGSCW
Sbjct: 86 LPKNFDARKTWPHCSSISEIRDQSSCGSCWA 116
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGC GG+P +AW YW GIV+GG+
Sbjct: 151 CGFGCRGGYPAVAWDYWKTHGIVTGGS 177
>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 101/298 (33%), Positives = 127/298 (42%), Gaps = 41/298 (13%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI A + G + +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFAEARRLTGARIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ +CGSCW H G S H C +C D Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLMSCCEDCGYGCDGGY 162
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT---- 212
+ Y VS S + Y P G F P TT
Sbjct: 163 PGT----SWEYYVSHGLASSYCQPYPF-PHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDK 217
Query: 213 AMSLIKWTIRDNTS-----------QLGAEGAFT----VFDDLILYK-------SGKALG 250
A+ LIK+ R N S +L G F V+ D + YK SG LG
Sbjct: 218 AIPLIKY--RGNHSYEVHGEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLG 275
Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
GHA+RI+GWG+ + YW IANSW+TDWG NG LRG +ECGIE++ AG P +
Sbjct: 276 GHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSPAI 331
Score = 38.9 bits (89), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 13/24 (54%), Positives = 19/24 (79%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVS 32
CG+GC+GG+PG +W Y+V G+ S
Sbjct: 154 CGYGCDGGYPGTSWEYYVSHGLAS 177
>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
Length = 331
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 123/284 (43%), Gaps = 108/284 (38%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------------------------- 113
DLP +FD+R WP C +I+ I DQG CGSCW
Sbjct: 87 DLPKHFDAREAWPQCASIKTILDQGHCGSCWAFGAVEALTDRFCILNNENVSLSENDLVA 146
Query: 114 -------GCR---PYE-----------IAPCEHHVNGT---RPSCDASKGHTPKCVRECQ 149
GC PY + C+ + +G P C+ TP CV++C
Sbjct: 147 CCSSCGFGCEGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKHPGCEPEY-DTPVCVKQCV 205
Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
+N ++ +F ++Y+V+S+ I EIY++GPVE ++
Sbjct: 206 DNEQ--WRDSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSY------------------- 244
Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
TV++D YKSG + LGGHA++ +GWG
Sbjct: 245 -------------------------TVYEDFAHYKSGVYKHVFGQVLGGHAVKFIGWGTT 279
Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+ K+ YW++ANSWN WG++G F+I RG +ECGIES AG+P
Sbjct: 280 DDGKD-YWIVANSWNRSWGEDGFFQISRGSNECGIESEPVAGIP 322
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 13/24 (54%), Positives = 19/24 (79%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVS 32
CGFGC GG+P AW Y+ ++G+V+
Sbjct: 151 CGFGCEGGYPYAAWEYFAQTGVVT 174
>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 340
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/297 (32%), Positives = 122/297 (41%), Gaps = 107/297 (36%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
P E+ +DLP +FD+ KWP C TI R+ + GS
Sbjct: 86 PRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGSCWAIAAVEAMSDRYCTMSGI 145
Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
CG C+G C+PY PC HH N ++
Sbjct: 146 PDRRISTTNLLSCCFICGFGCYGGIPAMAWLWWVWVGVTTELCQPYPFGPCSHHGNSSKY 205
Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
P C + +TPKC C E V YK G SYS+ E+ +M E+ +GP+E A
Sbjct: 206 PPCPNTIYNTPKCNTTCDNVEMELVKYK-----GVSSYSIK-GERELMVELMNNGPLEVA 259
Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
V+ D + YKSG + + SG L
Sbjct: 260 MQVYADFVAYKSG-------------------------------------VYKHVSGDHL 282
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
GGHA++++GWG K YW IANSWNTDWGD G F I RG DECGIESS AG P
Sbjct: 283 GGHAVKLVGWGV--KDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 18/25 (72%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CGFGC GG P MAW +WV G+ +
Sbjct: 161 ICGFGCYGGIPAMAWLWWVWVGVTT 185
>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/335 (28%), Positives = 137/335 (40%), Gaps = 118/335 (35%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED----LPANFDSRT 92
G K A + SN A K +GV P +G V D LP FD+RT
Sbjct: 58 GWKAAINDRFSNATVAEFKRLLGVKP------TPKKHFLGVPVVSHDPSLKLPKAFDART 111
Query: 93 KWPNCPTIREIRDQGSCGSCW-------------------------------------GC 115
WP C +I +I DQG CGSCW GC
Sbjct: 112 AWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGC 171
Query: 116 RP-YEIAP-------------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKK 158
Y IA C+ + + T P C+ + TP+C+R+C + + + +
Sbjct: 172 DGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAY-PTPRCLRKCVSDNKL-WSE 229
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
++ +Y+V+S+ + IM E+Y++GPVE +
Sbjct: 230 SKHYSVSTYTVNSSPQDIMAEVYKNGPVEVS----------------------------- 260
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
FTV++D YKSG +GGHA++++GWG + E YWL
Sbjct: 261 ---------------FTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSNEG-EDYWL 304
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+AN WN WGD+G F I RG +ECGIE AG+P
Sbjct: 305 MANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLP 339
>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 250
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 91/191 (47%), Gaps = 39/191 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY C+H + + P C P C + C+ Y +PYK D ++G YS+ NE
Sbjct: 93 GCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIPYKADKHYGRVIYSLRPNE 152
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I KEI +GPVE V D + YKSG + R T QL
Sbjct: 153 SDIRKEIMMNGPVEAGIFVHSDFLNYKSGVY-----------------RHITGQL----- 190
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ H++RI+GWG + + YWL ANSWN DWG NG FKILRG +
Sbjct: 191 ---------------VTIHSVRIIGWGIE--NDIPYWLCANSWNEDWGLNGYFKILRGSN 233
Query: 294 ECGIESSITAG 304
EC IES + AG
Sbjct: 234 ECEIESFVNAG 244
>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 87/294 (29%), Positives = 122/294 (41%), Gaps = 92/294 (31%)
Query: 67 PANRLP---ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW---------- 113
PAN+L E I + LP FD+R +W +CPTI +I QG CGSCW
Sbjct: 83 PANKLEPSIETISHKHKKLYLPKEFDARKQWSHCPTIGDILGQGHCGSCWAFGAVESLTD 142
Query: 114 ------------------GCRPYEIA-PCE-----------HHVNGTRPSCDASKGHTPK 143
C +E CE H CD
Sbjct: 143 RFCIHLNESVSLSENDLLACCGFECGYGCEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGC 202
Query: 144 CVRECQENYDVP-----------YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
C Y+ P + + + G +Y +S + +M E+Y +GPVE AF V
Sbjct: 203 AHPGCYPTYETPKCEKQCVDDEFWVQSKHLGVNAYEMSMEPEDLMAELYTNGPVEVAFEV 262
Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
++D YK+G V+ L G +GGH
Sbjct: 263 YEDFAHYKTG---------------------------------VYKHLF----GGFMGGH 285
Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
A++++GWG + + YW I NSWNT+WG++GLF+I+RG DECGIES+ AG+P
Sbjct: 286 AVKLIGWGTTDDGVD-YWTIVNSWNTNWGEDGLFRIVRGNDECGIESNAVAGLP 338
>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
Length = 194
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/161 (42%), Positives = 88/161 (54%), Gaps = 40/161 (24%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP +G TP+C + C+ Y YK+D +FG SYSVS++
Sbjct: 74 GCLPYTIPPCEHHVNGSRPPMHG-EGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 132
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 133 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 163
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
+++G +GGHAIRILGWG + + YWL AN
Sbjct: 164 --------KHEAGDMMGGHAIRILGWGVE--NGVPYWLAAN 194
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 42 CGDGCNGGYPSGAWNFWTKKGLVSGGVYDS 71
>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
Length = 323
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 96/202 (47%), Gaps = 63/202 (31%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY APC SC K TP C CQ Y Y KD FG +Y+V+ N
Sbjct: 178 GCRPYPFAPC--------ISCPEEK--TPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNV 227
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I EI +GPV GA
Sbjct: 228 AAIQTEIMTNGPVVGA-------------------------------------------- 243
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FT+++D+ YKSG + LGGHAI+I+GWG ++ YWLIANSW +WG+NG
Sbjct: 244 FTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT--QNGIPYWLIANSWGANWGENGFL 301
Query: 287 KILRGKDECGIESSITAGVPKL 308
K+ RG +ECGIE ++ AG+P++
Sbjct: 302 KMRRGVNECGIERAVVAGMPRV 323
>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 349
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 115/275 (41%), Gaps = 90/275 (32%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPS 133
LP +FD+R WP C +I I DQG CGSCW E C H VN
Sbjct: 102 LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLAC 161
Query: 134 C-----DASKGHTPKC-----VRE--------------------CQENYDVP-------- 155
C D G P VR C+ Y P
Sbjct: 162 CGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVD 221
Query: 156 ----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
++K ++G +Y V + IM E+Y++GPVE +FTV++D YKSG +
Sbjct: 222 KNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVY------- 274
Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
+ +G +GGHA++++GWG + E YWL
Sbjct: 275 ------------------------------KHITGDVMGGHAVKLIGWGTTDDG-EDYWL 303
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+AN WN WGD+G FKI RG +ECGIE + AG+P
Sbjct: 304 LANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLP 338
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG+P AWRY+V+ G+V+
Sbjct: 165 MCGDGCDGGYPISAWRYFVRHGVVT 189
>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 333
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/191 (35%), Positives = 91/191 (47%), Gaps = 39/191 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY C+H + + P C P C + C+ Y +PYK D ++G YS+ NE
Sbjct: 176 GCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIPYKADKHYGRVIYSLRPNE 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I KEI +GPVE V D + YKSG +
Sbjct: 236 SDIRKEIMMNGPVEAGIFVHSDFLNYKSGVY----------------------------- 266
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ + H++RI+GWG + + YWL ANSWN DWG NG FKILRG +
Sbjct: 267 --------RHITGQLVTIHSVRIIGWGIE--NDIPYWLCANSWNEDWGLNGYFKILRGSN 316
Query: 294 ECGIESSITAG 304
EC IES + AG
Sbjct: 317 ECEIESFVNAG 327
>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 348
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 115/275 (41%), Gaps = 90/275 (32%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPS 133
LP +FD+R WP C +I I DQG CGSCW E C H VN
Sbjct: 101 LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLAC 160
Query: 134 C-----DASKGHTPKC-----VRE--------------------CQENYDVP-------- 155
C D G P VR C+ Y P
Sbjct: 161 CGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVD 220
Query: 156 ----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
++K ++G +Y V + IM E+Y++GPVE +FTV++D YKSG +
Sbjct: 221 KNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVY------- 273
Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
+ +G +GGHA++++GWG + E YWL
Sbjct: 274 ------------------------------KHITGDVMGGHAVKLIGWGTTDDG-EDYWL 302
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+AN WN WGD+G FKI RG +ECGIE + AG+P
Sbjct: 303 LANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLP 337
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG+P AWRY+V+ G+V+
Sbjct: 164 MCGDGCDGGYPISAWRYFVRHGVVT 188
>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
Length = 339
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 96/195 (49%), Gaps = 44/195 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY PC + G C K PKC+ C YD Y+KD FGA +Y + ++
Sbjct: 189 GCKPYPFEPCSYPFVG----CHHEK-KNPKCLHHCINGYDRKYRKDKFFGATAYKIPNDA 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EI +GPV F VF+D Y SG
Sbjct: 244 RMIQLEIMTNGPVATGFEVFEDFYFYHSG------------------------------- 272
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
V+ ++ GK +G HAIRI+GWG + + YWLIANS+ WGD G FK+LRG +
Sbjct: 273 --VYKHVV----GKKVGMHAIRIVGWGTENGTP--YWLIANSYGDTWGDKGFFKMLRGSN 324
Query: 294 ECGIESSITAGVPKL 308
GIES++ AG+P+L
Sbjct: 325 HLGIESTVIAGLPQL 339
Score = 41.6 bits (96), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 9 CGFGCNGGF-PGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG GCNGGF G A++YWV +G+VSG Y S + K
Sbjct: 156 CGNGCNGGFLDGTAFQYWVDAGLVSGAPYNSSEGCK 191
>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 332
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 73/203 (35%), Positives = 92/203 (45%), Gaps = 57/203 (28%)
Query: 115 CRPYEIAPCEHH-VNGTRPSCDASKGHTPKCVRECQENYDV-PYKKDLNFGAKSYSVSSN 172
C+PY+ C HH + P C ++ TPKC + C Y Y DL++G SYSV
Sbjct: 177 CKPYDFPACAHHEASPDYPDCPSTDYSTPKCTKSCVAGYTANTYTADLHYGQSSYSVGRT 236
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
+ +I EI HGPVE A
Sbjct: 237 DAAIQTEILNHGPVEAA------------------------------------------- 253
Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
FTV+ D Y+SG LGGHAI I+GWG + S YWL+ NSWN WGD G
Sbjct: 254 -FTVYSDFPTYRSGVYKHTSGSVLGGHAISIVGWGTESGSP--YWLVKNSWNPSWGDGGF 310
Query: 286 FKILRGKDECGIESSITAGVPKL 308
FKILRG +CGI + + G+PKL
Sbjct: 311 FKILRG--DCGINNDVVGGLPKL 331
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 32/46 (69%), Gaps = 2/46 (4%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
RLP + + + E +P FDSRT WP CPTI+E+RDQ +CGSCW
Sbjct: 66 QRLP--LKVAPIAEAIPDTFDSRTNWPACPTIKEVRDQSACGSCWA 109
>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
Length = 342
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 123/284 (43%), Gaps = 108/284 (38%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------------------------- 113
DLP +FD+R WP C +I+ I DQG CGSCW
Sbjct: 98 DLPKHFDAREAWPQCSSIKNILDQGHCGSCWAFGAVEALTDRFCILNNENVSLSENDLVA 157
Query: 114 -------GCR---PYE-----------IAPCEHHVNGT---RPSCDASKGHTPKCVRECQ 149
GC PY + C+ + +G P C+ TP CV++C
Sbjct: 158 CCSSCGFGCDGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKHPGCEPEY-DTPVCVKQCV 216
Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
+N ++ +F ++Y+V+S+ I EIY++GPVE ++
Sbjct: 217 DNEQ--WRDSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSY------------------- 255
Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
TV++D YKSG + LGGHA++ +GWG
Sbjct: 256 -------------------------TVYEDFAHYKSGVYKHVFGEVLGGHAVKFIGWGTT 290
Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+ K+ YW++ANSWN WG++G F+I RG +ECGIES AG+P
Sbjct: 291 DDGKD-YWIVANSWNRSWGEDGFFQISRGSNECGIESEPVAGIP 333
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 13/24 (54%), Positives = 20/24 (83%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVS 32
CGFGC+GG+P AW Y+ ++G+V+
Sbjct: 162 CGFGCDGGYPYAAWEYFAQTGVVT 185
>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
Length = 334
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 89/331 (26%), Positives = 136/331 (41%), Gaps = 114/331 (34%)
Query: 50 PRAHLKSWMGVHPDYNLPANRLPELIGYSEVDE-------DLPANFDSRTKWPNCPTIRE 102
P+ + S++ + + A + L+ + DE +P++FD+R KW C TI E
Sbjct: 44 PKLSIDSFVKLLGSKGVQAAKQASLVMFKTHDEAYNSWSNRIPSSFDARKKWRKCSTIGE 103
Query: 103 IRDQGSCGSCWG----------------------CRPYEIAPCEH--------------- 125
+RDQG+CGSCW P E+A C H
Sbjct: 104 VRDQGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCCHKCGFGCSGGYPIRAW 163
Query: 126 -----HVNGTRPSCDASKGHTP-------------------------KCVRECQENYDVP 155
H T + D+ +G P +C + C N ++
Sbjct: 164 ERFKKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGKPAEKNHRCTQMCYGNQNLD 223
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
+K+D ++ +Y ++ +I ++ +GP+E +F V+DD YKSG + N T
Sbjct: 224 FKEDHHYTRDAYYLTYG--TIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATY--- 278
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
LGGHA++++GWGE+ YWL+ NS
Sbjct: 279 ---------------------------------LGGHAVKLIGWGEEYGV--PYWLLVNS 303
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
WN WGD GLFKI RG +ECG ++S T GVP
Sbjct: 304 WNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334
Score = 40.4 bits (93), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGC+GG+P AW + K G+V+GG Y S +
Sbjct: 150 CGFGCSGGYPIRAWERFKKHGLVTGGNYDSGE 181
>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
Length = 319
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 91/191 (47%), Gaps = 39/191 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
C+ Y PC H + G P C PKC CQE Y + Y+KD + Y + +N
Sbjct: 167 CKSYPFPPCSHGIEGQYPQCSTKPPVVPKCETTCQEGYPIEYEKDRYKFSNVYQLENNVD 226
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
I EI E+GPV+ +F V++D + YKSG +
Sbjct: 227 QIKNEIMENGPVDASFQVYEDFMTYKSGIYH----------------------------- 257
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ GK + H ++I+GWGE+ + E YW NSWN++WG+NGLF+I G +E
Sbjct: 258 --------HVEGKFMNLHTVKIIGWGEE--NGEAYWKAVNSWNSEWGENGLFRIRLGTNE 307
Query: 295 CGIESSITAGV 305
C IES + G+
Sbjct: 308 CTIESQVEGGL 318
Score = 43.9 bits (102), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGF C GG+ MAW Y ++G+V+GG Y S +
Sbjct: 134 CGFQCQGGYSAMAWEYLRRTGVVTGGQYNSTE 165
>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
Length = 174
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 69/185 (37%), Positives = 101/185 (54%), Gaps = 40/185 (21%)
Query: 119 EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
E P + + P C+ TPKCVR+C + V +KK ++ K Y V+S+ ++IM+
Sbjct: 22 ECDPYFDQIGCSHPGCEPGY-QTPKCVRKCVKGNQV-WKKSKHYSVKPYKVNSDPQNIME 79
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+Y++GPVE AF+V++D YKSG +
Sbjct: 80 EVYKNGPVEVAFSVYEDFAHYKSGVY---------------------------------- 105
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
+ +G ALGGHA+++ GWG ++ E YWL+AN WNT+WGD+G FKI RG +ECGIE
Sbjct: 106 ---KHITGSALGGHAVKLNGWGTSDEG-EDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 161
Query: 299 SSITA 303
+TA
Sbjct: 162 EDVTA 166
>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 345
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 95/308 (30%), Positives = 131/308 (42%), Gaps = 95/308 (30%)
Query: 53 HLKSWMGVHPDYNLPANRLP---ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSC 109
HLK G PAN + E + + + DLP FD+R W +C TI +I DQG C
Sbjct: 70 HLKKMCGAK---MTPANEVEPSIERVTHKHKNLDLPTEFDARKHWSHCSTIGDILDQGHC 126
Query: 110 GSCWGCRPYEIAP---CEH-------HVNGTRPSC-----DASKGHTP------------ 142
GSCW E C H N C D +G P
Sbjct: 127 GSCWAFGAVESLTDRFCIHLNESVSLSENDLLACCGFECGDGCEGGYPIRAWQYFKRTGV 186
Query: 143 ---KC----------VRECQENYDVP--YKKDLN---------FGAKSYSVSSNEKSIMK 178
KC C YD P +K+ ++ G +Y VS + +M
Sbjct: 187 VTSKCDPYFDQKGCGHPGCYPTYDTPKCFKRCVDDELWVSSKHLGVSAYEVSMEPEELMA 246
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E++ +GP+E AF VF+D YK+G V+
Sbjct: 247 ELFTNGPIEVAFDVFEDFAHYKTG---------------------------------VYK 273
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L G +GGHA++++GWG + + YW + NSWNT+WG++G F+ILRGKDECGIE
Sbjct: 274 HLY----GGYIGGHAVKLVGWGTTDDGVD-YWSMVNSWNTNWGEDGTFRILRGKDECGIE 328
Query: 299 SSITAGVP 306
S+ AG+P
Sbjct: 329 SNAVAGLP 336
>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
Length = 335
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 138/316 (43%), Gaps = 67/316 (21%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+A++N N P+ + +G + + + E + ++P FDSR +W C T
Sbjct: 40 KAKQNFPENTPKEQIVRLLGSKRLLGVSKSPIKENDELYMDNSEVPEFFDSRLEWDYCET 99
Query: 100 IREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPS--CDASKGH-------------TPKC 144
I +R+QG+CGSCW H G C A+ G +C
Sbjct: 100 IGHVRNQGNCGSCWA----------HGTTGAFADRLCVATNGEFNELISAEELTFCCHRC 149
Query: 145 VRECQENYDVP----YKK---------DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFT 191
V C Y + +K+ D G + Y V +K+ H G T
Sbjct: 150 VFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPP----CVKDDEGHNSCSGQPT 205
Query: 192 ----------VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG-AEGAFTVFDDL 240
DD I YK + A L T++ +T G E +F V+DD
Sbjct: 206 ERNHKCSKKCYGDDTIDYKKNHY----KTKDAYYLKNTTMQKDTMVYGPIEASFDVYDDF 261
Query: 241 ILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ Y+SG LGGHA++++GWG +E + YWL+ NSW WGD G+FKILRG
Sbjct: 262 MNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTP--YWLMVNSWGEQWGDKGMFKILRGT 319
Query: 293 DECGIESSITAGVPKL 308
DECGIESS TAGVP +
Sbjct: 320 DECGIESSCTAGVPSV 335
Score = 42.7 bits (99), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
C FGCNGG+P AW+Y+ + G+V+GG Y +
Sbjct: 149 CVFGCNGGYPLKAWQYFKRHGVVTGGDYDT 178
>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
Length = 171
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 52/91 (57%), Positives = 67/91 (73%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNGTRP C G TP+C+ +C+ Y YK D ++G SYSV S+E
Sbjct: 72 GCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHYGKSSYSVPSDE 131
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
+ I EIY++GPVEGAFTV++D +LYK+G +
Sbjct: 132 EQIQSEIYKNGPVEGAFTVYEDFLLYKTGVY 162
>gi|115605092|gb|ABJ15785.1| cathepsin B [Bos taurus]
Length = 118
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 55/91 (60%), Positives = 69/91 (75%), Gaps = 1/91 (1%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D +FG SYSV++NE
Sbjct: 29 GCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 87
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 88 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY 118
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 17/26 (65%), Positives = 19/26 (73%)
Query: 13 CNGGFPGMAWRYWVKSGIVSGGAYGS 38
CNGGFP AW +W K G+VSGG Y S
Sbjct: 1 CNGGFPSGAWNFWTKKGLVSGGLYNS 26
>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
Length = 396
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 91/283 (32%), Positives = 124/283 (43%), Gaps = 98/283 (34%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRP--------- 132
LP FDSR +WPNC +I+ IRDQ CGSCW EI + NGT+
Sbjct: 85 LPTAFDSRVQWPNCNSIKLIRDQTYCGSCWAFAAAEIISDRICIQSNGTQQPIISPEDIL 144
Query: 133 SCDASK-------GHTPKCVR---------------------------ECQENYDVPYKK 158
SC S G+T + ++ C+E D P K
Sbjct: 145 SCCGSSCNNGCQGGYTIEAMKYWMNSGVVTGGDYQGAGCIPYSFRPCSTCKEPKDAPSCK 204
Query: 159 ---DLNFGAKS-----YSVSSNE------KSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
++ AKS + SSN + I EIY +GPVE A+ V+DD YKSG +
Sbjct: 205 TTCQASYKAKSAYRLPTTTSSNAIVANAVQMIQTEIYNNGPVEVAYQVYDDFYHYKSGVY 264
Query: 205 FVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK 264
+ + G GHA++I+GWG ++K
Sbjct: 265 Y-------------------------------------HVYGDKPSGHAVKIIGWGTEKK 287
Query: 265 SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
YWL+ANSW+T +G+NG FKI RG +ECGIE ++ AG+PK
Sbjct: 288 V--DYWLVANSWSTTFGENGFFKIRRGTNECGIEENVVAGLPK 328
>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 952
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 93/190 (48%), Gaps = 40/190 (21%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR + C H G P C TP+C+++C E +V Y+KD SY+V ++
Sbjct: 148 GCRSFPFPKCGHRRKGRYPPCPRHIYPTPECIKQCDEP-EVNYEKDKTRANISYNVYPSD 206
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SIMKEI +GPVE +F ++ D + Y G +F
Sbjct: 207 ISIMKEIMLNGPVEASFGIYADFLEYNGGVYF---------------------------- 238
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ G + HAIRILGWGED+ YWLIANSWN DWG+ G + LRG +
Sbjct: 239 ---------HCWGGPISRHAIRILGWGEDDGVP--YWLIANSWNEDWGEKGYVRFLRGHN 287
Query: 294 ECGIESSITA 303
ECGIE +TA
Sbjct: 288 ECGIEEEVTA 297
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 122/305 (40%), Gaps = 66/305 (21%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG GC GG+ +AW +W GIV+G GSK+ S + G +P
Sbjct: 704 CGCGCRGGYSPIAWDFWKTHGIVTG---GSKEKPTGCRSYPFPSCEHRGKGQYPPCPHQL 760
Query: 69 NRLPELIGYSEVDE-----DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPC 123
PE I + E D FDS + R + G R + C
Sbjct: 761 YPTPECIKRCDTKEIDYEKDKTRGFDSASS--EQLADRHCFHTSNFGEASAQRTLHLT-C 817
Query: 124 EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEH 183
+N S D K V N SY+V E+++MKEI
Sbjct: 818 ---LNFMHHSIDLLSSRLEKAVLRSTANI-------------SYNVYPAEQAVMKEIMLR 861
Query: 184 GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILY 243
GPV V++DL+ YKSG +F +
Sbjct: 862 GPVGAILHVYEDLLDYKSGVYF-------------------------------------H 884
Query: 244 KSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
G LG H IRILGWGE++ YWL+ANSWN DWG+ G ++LR ++ECGI +TA
Sbjct: 885 VWGGHLGEHGIRILGWGEEDGV--PYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTA 942
Query: 304 GVPKL 308
G+P L
Sbjct: 943 GLPDL 947
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 27/34 (79%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++ LP +FD+R WP+CP+I EIRDQ SCGSCW
Sbjct: 636 NQHLPESFDARANWPHCPSISEIRDQSSCGSCWA 669
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 31/45 (68%)
Query: 70 RLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
R P + +++LP +FD+RTKWP+CP+I EIRDQ SC S W
Sbjct: 37 RRPTVKHEVSDEKELPKSFDARTKWPHCPSISEIRDQSSCESFWA 81
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 18/27 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GF MAW +W GIV+GG+
Sbjct: 116 CGLGCGAGFHPMAWDFWKTHGIVTGGS 142
>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
Length = 334
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 149/363 (41%), Gaps = 133/363 (36%)
Query: 36 YGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------------- 76
Y ++QA E++ ++ I A+ K+W GV+ D L + +L+G
Sbjct: 13 YRTEQAYFLEEDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPVMF 71
Query: 77 ------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG---------------- 114
Y+ +P++FD+R KW C TI E+RDQG+CGSCW
Sbjct: 72 KTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATD 131
Query: 115 ------CRPYEIAPCEH--------------------HVNGTRPSCDASKGHTP------ 142
P E+A C H H T + D+ +G P
Sbjct: 132 GEFNELLSPEELAFCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPC 191
Query: 143 -------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEH 183
+C + C N ++ +K+D ++ +Y ++ +I ++ +
Sbjct: 192 PLDEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYG--TIQNDVLAY 249
Query: 184 GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILY 243
GP+E +F V+DD YKSG + N T
Sbjct: 250 GPIEASFEVYDDFPSYKSGVYTKMENATY------------------------------- 278
Query: 244 KSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
LGGHA++++GWGE+ YWL+ NSWN WGD GLFKI RG +ECG ++S T
Sbjct: 279 -----LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTG 331
Query: 304 GVP 306
GVP
Sbjct: 332 GVP 334
Score = 40.4 bits (93), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGC+GG+P AW + K G+V+GG Y S +
Sbjct: 150 CGFGCSGGYPIRAWERFKKHGLVTGGNYDSGE 181
>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
Length = 362
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/335 (28%), Positives = 134/335 (40%), Gaps = 118/335 (35%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED----LPANFDSRT 92
G K A + SN A K +GV P +G V D LP FD+RT
Sbjct: 61 GWKAAINDRFSNATVAEFKRLLGVKP------TPKKHFLGVPIVSHDRSLKLPKEFDART 114
Query: 93 KWPNCPTIREIRDQGSCGS-------------------------------CWGCR----- 116
WP C +I I DQG CGS C G R
Sbjct: 115 AWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIEFGMNISLSVNDLLACCGFRCGDGC 174
Query: 117 --PYEIAP-------------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKK 158
Y IA C+ + + T P C+ + TPKC+R+C + + +
Sbjct: 175 DGGYPIAAWQYFSYSGVVTEECDPYFDDTGCSHPGCEPAY-PTPKCMRKCVSGNQL-WSQ 232
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
++ +Y+V SN + IM E+Y++GPVE +
Sbjct: 233 SKHYSVSTYTVKSNPQDIMAEVYKNGPVEVS----------------------------- 263
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
FTV++D YKSG +GGHA++++GWG ++ E YWL
Sbjct: 264 ---------------FTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEG-EDYWL 307
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+AN WN WGD+G F I RG +ECGIE AG+P
Sbjct: 308 LANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLP 342
>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
Length = 272
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 128/299 (42%), Gaps = 60/299 (20%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPD----YNLPANRLPELIGYSEVDEDLPANFDSRTKWP 95
QA N + LK G D NLP + + D ++P +FD+R +W
Sbjct: 1 QAGWNDFGEASMSDLKVLCGTILDDPDLLNLPVKQ------HDLTDMEIPKSFDARMEWS 54
Query: 96 NCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT------------PK 143
C +I DQG CGSCW E+ + C ++G T K
Sbjct: 55 TCVRSHKIHDQGHCGSCWAFASTEVL--------SDRLCIQTRGSTNIILSSEDLLSCDK 106
Query: 144 CVRECQENYDVP-----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG-AFTVFDDLI 197
R C + + +K + +S + E EG A+ F L
Sbjct: 107 AGRGCSDGGRLSEAWRYMQKKGVVANRCKPYTSGATGFIPECMSKCTGEGHAYQKFYGLY 166
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALG 250
LY + + IK I N E AFTV+ D++ YKSG LG
Sbjct: 167 LYT----------VSGENQIKVEIMTNGP---VEAAFTVYSDIVHYKSGVYHHTSGGKLG 213
Query: 251 GHAIRILGWG-EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
GHA+++LGWG EDE E+YWL+ANSW DWGD G FKI RG DECGIES + G +L
Sbjct: 214 GHAVKVLGWGVEDE---EEYWLVANSWGPDWGDQGFFKIKRGSDECGIESRVLTGTARL 269
>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
Length = 339
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/258 (33%), Positives = 121/258 (46%), Gaps = 41/258 (15%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEH--HVNGTRPSCDASK 138
D D+P +FDSR +WPNC ++REIR+QG+CGSCW + H NGTR A++
Sbjct: 89 DIDIPESFDSRDRWPNCDSLREIRNQGTCGSCWAVAAASVMSDRVCIHTNGTRNVAIAAE 148
Query: 139 ---GHTPKCVRECQENY----DVPYKKDLNF----------GAKSYSVSSNEKSIMKEIY 181
G C C+ + Y D G K Y
Sbjct: 149 DLMGCCADCGNGCEGGFLDGTSFQYWVDAGLVSGGAYNSTEGCKPYPFKPCLYPFTDCHR 208
Query: 182 EHGP------VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
E P G + ++ S + VP +E +I++ I N EG F
Sbjct: 209 EESPKCKHHCQHGVDKRYARDKVFGSVAYSVPRDE----RVIRYEIMTNGP---VEGGFD 261
Query: 236 VFDDLILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
V++D+ LYKS G+ +G HA+RI+GWG + YWLI+NS+ DWGD+G FKI
Sbjct: 262 VYEDVFLYKSGVYRHVYGEHVGKHAVRIIGWGRE--GGIPYWLISNSYGEDWGDHGYFKI 319
Query: 289 LRGKDECGIESSITAGVP 306
+RG + GIES + G+P
Sbjct: 320 VRGINHLGIESKVITGLP 337
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 26/36 (72%), Gaps = 1/36 (2%)
Query: 9 CGFGCNGGF-PGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG GC GGF G +++YWV +G+VSGGAY S + K
Sbjct: 157 CGNGCEGGFLDGTSFQYWVDAGLVSGGAYNSTEGCK 192
>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
gi|1586011|prf||2202319A cathepsin B-like Cys protease
Length = 340
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 95/297 (31%), Positives = 121/297 (40%), Gaps = 107/297 (36%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
P E+ +DLP +FD+ KWP C TI R+ + GS
Sbjct: 86 PRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGSCWAIAAVEAMSDRYCTMSGI 145
Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
CG C+G C+PY PC HH N ++
Sbjct: 146 PDRRISTTNLLSCCFICGFGCYGGIPAMAWLWWVWVGVTTELCQPYPFGPCSHHGNSSKY 205
Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
P C + +TPKC C E V YK G SYS+ E+ + E+ +GP+E A
Sbjct: 206 PPCPNTIYNTPKCNTTCDNVEMELVKYK-----GVSSYSIK-GERELDHELMNNGPLEVA 259
Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
V+ D + YKSG + + SG L
Sbjct: 260 MQVYADFVAYKSG-------------------------------------VYKHVSGDHL 282
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
GGHA++++GWG K YW IANSWNTDWGD G F I RG DECGIESS AG P
Sbjct: 283 GGHAVKLVGWGV--KDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 18/25 (72%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CGFGC GG P MAW +WV G+ +
Sbjct: 161 ICGFGCYGGIPAMAWLWWVWVGVTT 185
>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
Length = 346
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 104/211 (49%), Gaps = 59/211 (27%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKC-VREC-QENYDVPYKKDLNFGA 164
G GS GC+PY I PC R +C TP C ++ C NY Y+ DL++
Sbjct: 181 GDYGSEDGCQPYSIYPCGKG----RNTCIEDDPDTPDCSIKTCTNSNYSKNYRADLHYVD 236
Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
YS+S +E+ IMK++Y++GPV+ A
Sbjct: 237 TVYSLSRSEEDIMKDLYKNGPVQAA----------------------------------- 261
Query: 225 TSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
F V+ D + YKSG + GGHAI+ILGWG D+ +K YWL ANSW+
Sbjct: 262 ---------FYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGVDDGTK--YWLCANSWS 310
Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPKL 308
WG+NGLF+ILRG +EC IE + AG+P +
Sbjct: 311 RSWGENGLFRILRGNNECHIEDRVIAGMPHV 341
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 35/65 (53%), Gaps = 4/65 (6%)
Query: 53 HLKSWMGVHPDYNLPANRLPEL--IGYSEVDEDLPANFDSRTKWPNCPTIR-EIRDQGSC 109
+ MGV P N + R + E +E LP NFD+R +WP C ++ I+DQ +C
Sbjct: 58 NFNQLMGVLPR-NFNSFRFAPIKKSAEDESNEALPENFDARERWPECSSLLGSIKDQSNC 116
Query: 110 GSCWG 114
GSCW
Sbjct: 117 GSCWA 121
Score = 41.6 bits (96), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 24/31 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC+GG P AW ++++ GIV+GG YGS+
Sbjct: 156 CGNGCDGGSPESAWYFFMRHGIVTGGDYGSE 186
>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 340
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 142/315 (45%), Gaps = 33/315 (10%)
Query: 18 PGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRL-PELIG 76
P ++ R+ + + + G + + + +S L+ MGV N+ L P +
Sbjct: 34 PLLSNRFVAEINLKAKGQWTASADNGHLVSGKSDEELRKLMGV---LNMSTAALSPRIFS 90
Query: 77 YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDA 136
E+ ++LP +FDS KWP C TI EIRDQ +CGSCW E +
Sbjct: 91 AEELAQELPTSFDSSDKWPKCRTISEIRDQSNCGSCWAIAAVEAMSDRYCTVAGITDLRV 150
Query: 137 SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPV-----EGAFT 191
S GH C C + + + A + V S + + Y P G +
Sbjct: 151 STGHLLSCCFVC----GMGCQGGIPTMAWLWWVWVGLTSEVCQPYPFPPCGHHTDGGKYP 206
Query: 192 VFDDLILYKSGRFFVPGNETTAMSLIK----WTIR---DNTSQLGAEGAFTV----FDDL 240
I + TA++ K +++R + +L G F V + D
Sbjct: 207 ACPSTIYDTPTCNSTCADSHTALTKHKGEKSYSLRGEREYMIELMTYGPFEVAFDVYADF 266
Query: 241 ILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ YKS G+ LGGHA++++GWG ++ YW IANSWN+DWGDNG F I RG D
Sbjct: 267 VSYKSGVYSHTTGERLGGHAVKLVGWG--VQNGTPYWKIANSWNSDWGDNGYFLIRRGTD 324
Query: 294 ECGIESSITAGVPKL 308
ECGIES+ AG+P L
Sbjct: 325 ECGIESTGVAGLPSL 339
Score = 37.4 bits (85), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC GG P MAW +WV G+ S
Sbjct: 161 VCGMGCQGGIPTMAWLWWVWVGLTS 185
>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
Length = 341
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 97/190 (51%), Gaps = 40/190 (21%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPYEI PC HH N T TP+C R C Y Y D + K+Y + ++ K
Sbjct: 189 CRPYEIHPCGHHGNETYYGECVGMADTPRCKRRCLLGYPKSYPSD-RYYKKAYQLKNSVK 247
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I K+I ++GPV +TV++D Y+SG
Sbjct: 248 AIQKDIMKNGPVVATYTVYEDFAHYRSG-------------------------------- 275
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ +K+G+ G HA++++GWGE++ + YW++ANSW+ DWG+NG F++ RG ++
Sbjct: 276 -----IYKHKAGRKTGLHAVKVIGWGEEKGTP--YWIVANSWHDDWGENGFFRMHRGSND 328
Query: 295 CGIESSITAG 304
CG E + AG
Sbjct: 329 CGFEERMAAG 338
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
+P ++D R +W NC ++ I DQ +CGSCW
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWA 121
>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
Length = 332
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 114/283 (40%), Gaps = 97/283 (34%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE-----------------------IA 121
P +F R W +C +IR IRDQ +CGSCW E +A
Sbjct: 88 PESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLA 147
Query: 122 PCEHHVNGTRPSCDASKG----------------------HTPKCVREC----------- 148
C +G C S P CV C
Sbjct: 148 CCHTCGHGCDGRCHCSSVAILQGRRLVPEPVRTEDGCQPYSLPPCVPNCTHPEPTPKCQH 207
Query: 149 --QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFV 206
++ Y+ Y++D +F Y + +I +IY++GPVE AF V+ D YKSG +
Sbjct: 208 VCRKGYEKSYEEDKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVYADFPSYKSGVY-- 265
Query: 207 PGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK 266
+IK+ +G HAI+ILGWG ++
Sbjct: 266 ------QQHMIKF-----------------------------MGVHAIKILGWGTEDGV- 289
Query: 267 EKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
YWL+ANSWN WGD G FKILRGKDECGIE I AG+P D
Sbjct: 290 -PYWLVANSWNVGWGDKGYFKILRGKDECGIEEVIDAGIPMED 331
>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
Length = 342
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 102/207 (49%), Gaps = 40/207 (19%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLN 161
I GS S +GC+PY IAPC + N T P C + TP C ++C+ Y V KD +
Sbjct: 174 IPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTNTTLPTPTCEKKCKPGYPVDLDKDRH 233
Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
+G + + + I ++ +GPVE ++DD + Y +G +
Sbjct: 234 YGVSVDQLPNRQIEIQSDVMLNGPVEATMEIYDDFLQYTTGIY----------------- 276
Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
++ +G G ++RILGWG E YWL+ANSW +WG
Sbjct: 277 --------------------VHLAGNKQGHLSVRILGWGMFEGVP--YWLLANSWGKEWG 314
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F++LRG +ECG+E++ +G+PKL
Sbjct: 315 ENGTFRVLRGVNECGLEANCISGMPKL 341
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GG P AW+YW K GI +GG+Y S+
Sbjct: 153 CGEGCAGGNPLKAWQYWQKHGIPTGGSYESQ 183
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
Length = 337
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 71/201 (35%), Positives = 96/201 (47%), Gaps = 54/201 (26%)
Query: 114 GCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC PY C H V P C TPKC ++C Y+ Y++D G SY+V
Sbjct: 183 GCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGGQ 242
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
E IM EI ++GPV+G
Sbjct: 243 ETDIMMEIMKNGPVDGI------------------------------------------- 259
Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F +F+D ++YKSG + +GGHAIR++GWG + + KYWLIANSWN WG+ G
Sbjct: 260 -FYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVE--NGVKYWLIANSWNEGWGEKGY 316
Query: 286 FKILRGKDECGIESSITAGVP 306
F++ RG +ECGIE+ I AG+P
Sbjct: 317 FRMRRGNNECGIEARINAGLP 337
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
K A +NI + +K +GV + N + + YS + DLP +FD+R KW NCP
Sbjct: 43 KAAPSTRFNNIDQ--VKQNLGVLEETPEDRNTQRQTVRYSVSENDLPESFDARQKWANCP 100
Query: 99 TIREIRDQGSCGSCWG 114
+I EIRDQ SC SCW
Sbjct: 101 SISEIRDQSSCSSCWA 116
>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 348
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 92/196 (46%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
C+PY PC +H + C TP C R CQ Y +P++KD F ++Y + N
Sbjct: 192 ACQPYAFYPCGNHAHEPYYGPCPDELWPTPTCRRTCQLGYPIPFEKDKIFNDQTYYIFGN 251
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
E I EI GPV + V+ D YK G +
Sbjct: 252 ETEIKYEIMTRGPVVATYKVYRDFDYYKKGVY---------------------------- 283
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+++ G+ G HA++I+GWG+ + YWL+ANSWNTDWGDNG F+I+RG
Sbjct: 284 ---------IHREGEVTGLHAVKIIGWGKG--NDVPYWLVANSWNTDWGDNGYFRIVRGT 332
Query: 293 DECGIESSITAGVPKL 308
D C IE + G+ ++
Sbjct: 333 DNCEIERQMVGGIMRV 348
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 30/41 (73%)
Query: 74 LIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++ +E+ D+P FD+R +WPNC +++ IRDQ SCGSCW
Sbjct: 84 VLANTEMKVDIPDTFDARDRWPNCTSMKHIRDQSSCGSCWA 124
Score = 40.4 bits (93), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGC GG+P A+ Y + G+ +GG YG K A
Sbjct: 160 CGFGCKGGYPARAFGYAWRYGLSTGGPYGEKDA 192
>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 87/315 (27%), Positives = 130/315 (41%), Gaps = 94/315 (29%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
L+N K +GV P +P +LP FD+R+KW C TI +I D
Sbjct: 59 LANYTIEQFKHILGVKPTPPGLLAGVPTKTYSRSEKAELPKEFDARSKWSGCSTIGKILD 118
Query: 106 QGSCGSCWGCRPYEIAP---CEHH------------------------------------ 126
QG CG+CW E C HH
Sbjct: 119 QGHCGACWAFGAVECLQDRFCIHHSVNVSLSVNDLVACCGFLCGDGCDGGYPIFAWQYFV 178
Query: 127 ---------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
V P C+ + TP C ++C+ V +++ +F +Y V+S
Sbjct: 179 ENGVVTDECDPFFDQVGCQHPGCEPAY-PTPVCEKKCKVQNQV-WEEKKHFSIDAYQVNS 236
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ IM E+Y++GPVE +F I+Y+ + G
Sbjct: 237 DPHDIMAEVYKNGPVEVSF------IIYEDFAHYKSG----------------------- 267
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
V+ + +G+ +GGHA +++GWG + + E YWL+AN WN WGD+G FKI+RG
Sbjct: 268 ----VYKQI----TGRMVGGHAAKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIRG 318
Query: 292 KDECGIESSITAGVP 306
+ECGIE + AG+P
Sbjct: 319 TNECGIEGDVNAGMP 333
Score = 38.5 bits (88), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 22/25 (88%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GC+GG+P AW+Y+V++G+V+
Sbjct: 160 LCGDGCDGGYPIFAWQYFVENGVVT 184
>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 67/193 (34%), Positives = 99/193 (51%), Gaps = 47/193 (24%)
Query: 115 CRPY-EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C PY + C H P C+ TPKCVR+C + + ++K +G +Y +SS+
Sbjct: 225 CDPYFDATGCSH------PGCEPGYP-TPKCVRKCTDENQL-WRKAKRYGQSAYRISSDP 276
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
IM E+Y++GPVE AFTV++D Y+SG
Sbjct: 277 YQIMAEVYKNGPVEVAFTVYEDFAHYESG------------------------------- 305
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y +G +GGHA++++GWG + E YW++AN WN +WGD+G F I RG +
Sbjct: 306 ------VYRYTTGDVMGGHAVKLIGWGTTDDG-EDYWILANQWNRNWGDDGYFMIRRGVN 358
Query: 294 ECGIESSITAGVP 306
ECGIE + AG+P
Sbjct: 359 ECGIEEGVVAGLP 371
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 20/25 (80%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GC+GG+P AWRY++ G+V+
Sbjct: 198 LCGSGCDGGYPLYAWRYFIHHGVVT 222
>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
Length = 332
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 99/195 (50%), Gaps = 44/195 (22%)
Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
GC+PY ++PC + + N T A K H +C R C N D+ +KKD +F +Y ++
Sbjct: 180 GCQPYRVSPCPLDEYGNNTCRGKPAEKNH--RCTRMCYGNQDLDFKKDHHFTRDAYYLTF 237
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
I +++ +GP+E ++ V+DD YKSG + N T
Sbjct: 238 G--IIQRDVMAYGPIEASYDVYDDFPSYKSGVYVRTENATY------------------- 276
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
LGGHA++++GWGE+ YWL+ NSWN WGD GLFKI RG
Sbjct: 277 -----------------LGGHAVKLIGWGEEYGV--PYWLMVNSWNDQWGDKGLFKIRRG 317
Query: 292 KDECGIESSITAGVP 306
+ECGI++S T GVP
Sbjct: 318 TNECGIDNSTTGGVP 332
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 24/34 (70%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++ +P FD+R KW C TI E+RDQG CGSCW
Sbjct: 80 NQKIPKFFDARKKWRKCFTIGEVRDQGKCGSCWA 113
Score = 41.2 bits (95), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGC+GG+P AW + K G+V+GG Y S
Sbjct: 148 CGFGCHGGYPIKAWERFQKHGLVTGGDYDS 177
>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 92/264 (34%), Positives = 118/264 (44%), Gaps = 38/264 (14%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR 131
P E+ E L FD+ WP CPTI EIRDQ SCGSCW + G
Sbjct: 80 PRQFSEEELREPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLGGV 139
Query: 132 PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG-AF 190
S G C C + Y + + Y+V I+ E + P A
Sbjct: 140 RDLRISAGDLMSCCDVCGYGCNGGYPE---VAWEYYAV----HGIVSEYCQPYPFPSCAH 192
Query: 191 TVFDDLILYKSGRFFVPGNETTA----MSLIKWTIRDNTSQLGA---------------E 231
V + SG + P +T + LIK+ R NTS L + E
Sbjct: 193 HVNSSDLSPCSGEYDTPTCNSTCTDKKVPLIKY--RGNTSYLLSGEESFKRELLLNGPFE 250
Query: 232 GAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
+F+V+ D + Y +G LGGHA+RI+GWG E + E YW IANSWN +WG NG
Sbjct: 251 VSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWG--ELNGEPYWKIANSWNREWGMNG 308
Query: 285 LFKILRGKDECGIESSITAGVPKL 308
F I RG DECGIE S AG P++
Sbjct: 309 YFLIARGVDECGIEGSGVAGTPRI 332
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG+GCNGG+P +AW Y+ GIVS
Sbjct: 155 VCGYGCNGGYPEVAWEYYAVHGIVS 179
>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
E=1.3e-79, N=1) [Arabidopsis thaliana]
gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 96/335 (28%), Positives = 132/335 (39%), Gaps = 118/335 (35%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED----LPANFDSRT 92
G K A + SN A K +GV P +G V D LP FD+RT
Sbjct: 58 GWKAAINDRFSNATVAEFKRLLGVKP------TPKKHFLGVPIVSHDPSLKLPKAFDART 111
Query: 93 KWPNCPTIREIRDQGSCGSCW-------------------------------------GC 115
WP C +I I G CGSCW GC
Sbjct: 112 AWPQCTSIGNILGLGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGC 171
Query: 116 RP-YEIAP-------------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKK 158
Y IA C+ + + T P C+ + TPKC R+C + + + +
Sbjct: 172 DGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAY-PTPKCSRKCVSDNKL-WSE 229
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
++ +Y+V SN + IM E+Y++GPV
Sbjct: 230 SKHYSVSTYTVKSNPQDIMAEVYKNGPV-------------------------------- 257
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
E +FTV++D YKSG +GGHA++++GWG + E YWL
Sbjct: 258 ------------EVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEG-EDYWL 304
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+AN WN WGD+G F I RG +ECGIE AG+P
Sbjct: 305 MANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLP 339
>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
Length = 206
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 80/149 (53%), Gaps = 38/149 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I+PCEHHVNG+RP C TP+C R C+ Y Y +D ++G SYS+ S+
Sbjct: 93 GCRPYSISPCEHHVNGSRPKCSGEI-ETPRCSRRCEAGYSPKYSEDKHYGLTSYSIGSDV 151
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
IM EIY++GPVE A VF D +LYKSG +
Sbjct: 152 TEIMTEIYKNGPVEAALEVFKDFLLYKSGVY----------------------------- 182
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGED 262
+K+G ++GGHAI+ILGWGE+
Sbjct: 183 --------QHKTGGSIGGHAIKILGWGEE 203
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 17/28 (60%), Positives = 20/28 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG GCNGG+P AW +W G+VSGG Y
Sbjct: 61 CGNGCNGGYPSGAWEFWTNDGLVSGGLY 88
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
Length = 302
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 72/201 (35%), Positives = 97/201 (48%), Gaps = 43/201 (21%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAK 165
G GS GC+PY I PC+H +C TP+C +C +Y Y KD N
Sbjct: 143 GEYGSNEGCQPYTIEPCQHTETAVENACSNKTLFTPECKVQCYNPDYGTRYVKD-NHQGT 201
Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
Y V + + MKEIYE+GP+ +F ++ D + Y+SG +
Sbjct: 202 HYRVPA--YTAMKEIYENGPITASFYMYQDFVNYQSGVY--------------------- 238
Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
Y SGK + A++ILGWGE+ + YWL ANS+NT WGDNG
Sbjct: 239 ----------------AYNSGKYVTTQAVKILGWGEENGTP--YWLAANSFNTYWGDNGF 280
Query: 286 FKILRGKDECGIESSITAGVP 306
KILRG +EC IE + AG+P
Sbjct: 281 VKILRGANECYIEEFMYAGLP 301
>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
Length = 334
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 134/305 (43%), Gaps = 51/305 (16%)
Query: 40 QAEKNSLSNIPRAHLKSWMGV---HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
+ + N SN+ +S G+ + +P R + D D+P +FD+R WPN
Sbjct: 45 KPDTNFQSNVHFHAFRSLKGIGESRTGFKVPIRRYEYVY-----DVDIPESFDARNHWPN 99
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEH--HVNGTRPSCDASKGHTPKCVRECQENYDV 154
C ++R IR+QG+CGSCW + H NGT A++ CV +C +
Sbjct: 100 CESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLMGCCV-DCGNGCNG 158
Query: 155 PYKKDLNF------------------GAKSYSVSSNEKSIMKEIYEHGP------VEGAF 190
+ +F G K Y E E P +G
Sbjct: 159 GFLDGTSFQYWVDAGLVSGGAYNSTDGCKPYPFKPCEYPFNDCHVEISPKCTHHCRDGVD 218
Query: 191 TVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS----- 245
+ L+ + VP +E I++ I N E F V++D++LYKS
Sbjct: 219 RHYSKDKLFGKVAYSVPRDERA----IRYEIMTNGP---VEAGFDVYEDVLLYKSGVYRH 271
Query: 246 --GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
G+ +G HA+RI+GWG D YWLIANS+ DWGD+G FK +RG + GIES I
Sbjct: 272 VYGEQIGKHAVRIIGWGRD--GGIPYWLIANSYGDDWGDHGYFKFVRGSNHLGIESKIIT 329
Query: 304 GVPKL 308
G+P +
Sbjct: 330 GLPLI 334
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 26/36 (72%), Gaps = 1/36 (2%)
Query: 9 CGFGCNGGF-PGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG GCNGGF G +++YWV +G+VSGGAY S K
Sbjct: 152 CGNGCNGGFLDGTSFQYWVDAGLVSGGAYNSTDGCK 187
>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 337
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 92/191 (48%), Gaps = 39/191 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY C+H + + P C P C C+ Y +PY D +FG +Y V NE
Sbjct: 181 GCLPYPFPKCDHGSSDSYPMCGYVVYTPPVCNGTCRPGYPIPYNDDKHFGKSAYQVKQNE 240
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I +EI +GPVE + ++DD + YKSG
Sbjct: 241 SDIRREIMLYGPVEASIFIYDDFVDYKSG------------------------------- 269
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
V+ L +G+ + ++RI+GWG + + YWL ANSWN +WG NG FKILRG +
Sbjct: 270 --VYKHL----TGRLITIQSVRIIGWGIE--NGIPYWLCANSWNEEWGLNGFFKILRGSN 321
Query: 294 ECGIESSITAG 304
EC IE+ + AG
Sbjct: 322 ECEIEAFVNAG 332
>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 115/290 (39%), Gaps = 100/290 (34%)
Query: 79 EVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWGCRPYEIAPCE--HHVNGTRP--- 132
E+ E++P +FDSRT WP C I IRDQ CGSCW E H N T+
Sbjct: 76 EITEEIPESFDSRTAWPECTQIIGMIRDQSRCGSCWAFAAVEAMSDRICIHSNATKKLLV 135
Query: 133 ------SCDASKG----------------------------------------HTPKC-- 144
+C + G H KC
Sbjct: 136 SSQDLLTCGTAGGCNGGWPAVAWSDWTNGIVTGGLYGALEQGCKSYFLEGCDDHPNKCRN 195
Query: 145 ---VRECQENYDVP---YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLIL 198
C E D P YK +G Y + E+ I EI +GPVE V+ D
Sbjct: 196 YVSTPACVEQCDEPSLYYKAQETYGQTPYEIQGEEQ-IQYEIMTNGPVEATMDVYVDFAQ 254
Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILG 258
Y+SG + + +E GGHA++ILG
Sbjct: 255 YQSGIYQLTTDEYE-------------------------------------GGHAVKILG 277
Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
WG ++ KYWL+ANSWN WG+NGLF+I+RG+DE GIES+I A +P
Sbjct: 278 WGVEDGV--KYWLVANSWNERWGENGLFRIIRGRDEVGIESTIDAALPDF 325
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 26/38 (68%), Gaps = 3/38 (7%)
Query: 3 TQQIRLCGF--GCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+Q + CG GCNGG+P +AW W +GIV+GG YG+
Sbjct: 137 SQDLLTCGTAGGCNGGWPAVAWSDWT-NGIVTGGLYGA 173
>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
Length = 334
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 100/195 (51%), Gaps = 44/195 (22%)
Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
GC+PY + PC + + N T A K H +C R C N ++ +K+D ++ +Y ++
Sbjct: 182 GCQPYRVPPCPLDEYGNNTCRGKPAEKNH--RCTRMCYGNQELDFKEDHHWTRDAYYLTY 239
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+I K++ +GP+E +F V+DD YKSG + N +
Sbjct: 240 T--TIQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENASY------------------- 278
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
LGGHA++++GWGE+ YWL+ NSWN WGD GLFKILRG
Sbjct: 279 -----------------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKILRG 319
Query: 292 KDECGIESSITAGVP 306
+ECGI++S T GVP
Sbjct: 320 TNECGIDNSTTGGVP 334
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 27/39 (69%)
Query: 76 GYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y+ + +P+NFD+R KW C TI E+RDQG CGSCW
Sbjct: 77 AYNSLPNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWA 115
Score = 42.0 bits (97), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGC+GG+P AW ++ K G+V+GG Y S +
Sbjct: 150 CGFGCHGGYPIKAWEWFKKHGLVTGGDYDSGE 181
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
Length = 337
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 95/201 (47%), Gaps = 54/201 (26%)
Query: 114 GCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC PY C H V P C TPKC ++C Y+ Y++D G SY+V
Sbjct: 183 GCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQ 242
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
E M EI ++GPV+G
Sbjct: 243 ETDFMMEIMKNGPVDGI------------------------------------------- 259
Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F +F+D ++YKSG + +GGHAIR++GWG + + KYWLIANSWN WG+ G
Sbjct: 260 -FYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVE--NGVKYWLIANSWNEGWGEKGY 316
Query: 286 FKILRGKDECGIESSITAGVP 306
F++ RG +ECGIE+ I AG+P
Sbjct: 317 FRMRRGNNECGIEARINAGLP 337
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
K A +NI + +K +GV + N + + YS + DLP +FD+R KW NCP
Sbjct: 43 KAAPSTRFNNIDQ--VKQNLGVLEETPEDRNTQRQTVRYSVSENDLPESFDARQKWANCP 100
Query: 99 TIREIRDQGSCGSCWG 114
+I EIRDQ SC SCW
Sbjct: 101 SISEIRDQSSCSSCWA 116
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG+GCNGG P M+W YW + G+V+GG
Sbjct: 151 CGYGCNGGIPAMSWDYWTREGVVTGGT 177
>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
Length = 373
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 69/199 (34%), Positives = 97/199 (48%), Gaps = 51/199 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVSSN 172
GC PY APC+ + C S TP C CQ +Y Y D ++G +Y +++
Sbjct: 190 GCMPYSFAPCQ------KSPCVEST--TPTCKTTCQSSYTTANYTTDKHYGTSAYRLATT 241
Query: 173 E---KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
+I EIY +GPVE ++ V++D YKSG +
Sbjct: 242 NNVVSTIQYEIYHNGPVEASYKVYEDFYQYKSGVYH------------------------ 277
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
Y SGK +GGHA++I+GWG + + YWL+ANSW +G+ G FKI
Sbjct: 278 -------------YVSGKLVGGHAVKIIGWGTE--NDVDYWLVANSWGIKFGEGGFFKIR 322
Query: 290 RGKDECGIESSITAGVPKL 308
RG +EC IES++ AGV KL
Sbjct: 323 RGTNECQIESNVVAGVAKL 341
>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 75/198 (37%), Positives = 100/198 (50%), Gaps = 53/198 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY PC HH T ++ TPKCVR+CQ++Y YKKD + G +Y + E
Sbjct: 100 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEEPNAE 159
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+ +EI ++GPV GA
Sbjct: 160 KATQREIMKNGPVVGA-------------------------------------------- 175
Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D YK +GKA GGHAI+I+GWG++ YWLIANSW+ DWG+NG F
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE--GGVPYWLIANSWHNDWGENGYF 233
Query: 287 KILRGKDECGIESSITAG 304
+IL G + CGIE ++ AG
Sbjct: 234 RILCGSNHCGIEENVVAG 251
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 24/31 (77%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
D+P + SRTKWP C +++ IRDQ +CGSCW
Sbjct: 1 DIPESPYSRTKWPKCSSLKPIRDQANCGSCW 31
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 21/28 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG+GCNGG+P A+ Y+ K G V+GG Y
Sbjct: 68 CGYGCNGGWPIQAFNYFSKQGAVTGGDY 95
>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
Length = 287
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 103/205 (50%), Gaps = 42/205 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVREC--QENYDVPYKKDLNFG 163
GS S +GC+PY IAPC V N T P+C + TP C ++C + Y V KD ++G
Sbjct: 121 GSYESQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYG 180
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
A + + + I ++ +GP+E F V+DD + Y +G +
Sbjct: 181 ASVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIY------------------- 221
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
++ +G G ++RILGWG E YWL+ANSW +WG+N
Sbjct: 222 ------------------VHLTGNKQGHLSVRILGWGMYEGVP--YWLLANSWGKEWGEN 261
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ LRG +ECG+E++ +G+PKL
Sbjct: 262 GTFRALRGTNECGLEANCVSGMPKL 286
>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
Length = 334
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 98/195 (50%), Gaps = 44/195 (22%)
Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
GC+PY + PC + + N T A K H +C R C N D+ +K+D ++ +Y ++
Sbjct: 182 GCQPYRVPPCPLDEYGNNTCRGKPAEKNH--RCTRMCYGNQDLDFKEDHHYTRDAYYLTY 239
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+I +I +GP+E +F V+DD YKSG + N T
Sbjct: 240 G--TIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATY------------------- 278
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
LGGHA++++GWGE+ YWL+ NSWN WGD GLFKI RG
Sbjct: 279 -----------------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRG 319
Query: 292 KDECGIESSITAGVP 306
+ECGI++S T GVP
Sbjct: 320 TNECGIDNSTTGGVP 334
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 54/110 (49%), Gaps = 26/110 (23%)
Query: 30 IVSGGAYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------- 76
+V Y ++QA E++ ++ I A+ K+W GV+ D L + +L+G
Sbjct: 7 VVLFSVYRTEQAYFLEEDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQ 65
Query: 77 ------------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y+ +P++FD+R KW C TI E+RDQG CGSCW
Sbjct: 66 ASPDMFKTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWA 115
Score = 40.8 bits (94), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGC+GG+P AW + K G+V+GG Y S +
Sbjct: 150 CGFGCSGGYPIRAWERFKKHGLVTGGNYDSGE 181
>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
Length = 357
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/331 (27%), Positives = 133/331 (40%), Gaps = 112/331 (33%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G K A + +N A K +GV +P I ++ LP FD+RT W +
Sbjct: 58 GWKAAFNDRFANATVAEFKRLLGVIQTPKTAYLGVP--IVRHDLSLKLPKEFDARTAWSH 115
Query: 97 CPTIREIRDQGSCGSCWG-----------CRPYEI------------------------- 120
C +IR I G CGSCW C Y +
Sbjct: 116 CTSIRRIL--GHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGF 173
Query: 121 ---------------APCEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
C+ + + T P C+ + TPKC R+C + + + ++
Sbjct: 174 PMGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTY-PTPKCERKCVSRNQL-WGESKHY 231
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G +Y ++ + + IM E+Y++GPVE A
Sbjct: 232 GVGAYRINPDPQDIMAEVYKNGPVEVA--------------------------------- 258
Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
FTV++D YKSG +GGHA++++GWG + E YWL+AN
Sbjct: 259 -----------FTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDG-EDYWLLANQ 306
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
WN WGD+G FKI RG +ECGIE S+ AG+P
Sbjct: 307 WNRSWGDDGYFKIRRGTNECGIEQSVVAGLP 337
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 19/25 (76%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCGFGCNGGFP AW Y+ G+V+
Sbjct: 164 LCGFGCNGGFPMGAWLYFKYHGVVT 188
>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
Length = 353
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 90/311 (28%), Positives = 128/311 (41%), Gaps = 101/311 (32%)
Query: 57 WMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCW-- 113
++G+HPD N E++ + E +PA FD+R WP C I IR+QG CGSCW
Sbjct: 50 FLGIHPDPNFQL----EVLEWEEPRTVIPATFDAREYWPQCKDVIGNIRNQGKCGSCWAF 105
Query: 114 ---------------GCRPYEIAP------CE---------------------------- 124
G +E +P CE
Sbjct: 106 AAAEVMSDRLCVATNGSVKFEFSPEDLINCCETCGKKCKGGYSYYAWKYYTSTGLVSGGD 165
Query: 125 -HHVNGTRP--SCDASKGHTPKCVRECQEN-YDVPYKKDLNFGAKSYSVSSNEKSIMKEI 180
+ G +P + + G +P+C + CQ Y Y D +FG +Y + N +I +EI
Sbjct: 166 YNTSRGCQPYSKSNFNDGVSPECSKTCQNTKYPTSYLNDRHFGDGTYYILKNVTTIQQEI 225
Query: 181 YEHG-PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDD 239
G PV F V++D LY+ G +
Sbjct: 226 LLRGGPVMAGFDVYEDFKLYREGVY----------------------------------- 250
Query: 240 LILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD-NGLFKILRGKDECGIE 298
++ SG LG HA++I+GWG + + YWL+ANSW DWG G+FKI RG +EC IE
Sbjct: 251 --VHTSGALLGSHAVKIIGWGTE--NGWAYWLVANSWGKDWGALGGVFKIRRGTNECKIE 306
Query: 299 SSITAGVPKLD 309
SI G + D
Sbjct: 307 QSIITGHVRKD 317
>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
Length = 374
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/203 (31%), Positives = 100/203 (49%), Gaps = 40/203 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAK 165
GS S +GC+PY I+PC+ + N T P C S TP C ++C+ Y V KD ++G
Sbjct: 210 GSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSCEKKCKSGYPVELDKDRHYGVS 269
Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
+ + + I ++ +GP+ V+DD + Y +G
Sbjct: 270 VDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTG----------------------- 306
Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
+ ++ +G G ++RILGWG E YWL+ANSW WG+NG
Sbjct: 307 --------------IYVHLTGNKQGHLSVRILGWGMYEGVP--YWLLANSWGKQWGENGT 350
Query: 286 FKILRGKDECGIESSITAGVPKL 308
F++LRG +ECG+E++ +G+P+L
Sbjct: 351 FRVLRGVNECGLEANCVSGMPRL 373
>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 382
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/304 (27%), Positives = 134/304 (44%), Gaps = 71/304 (23%)
Query: 77 YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHHVNGTRP- 132
+ ++++++P +FD+RT WPNCPTI I DQG CGSCW +E+ C H +P
Sbjct: 63 FVKIEDEIPESFDARTNWPNCPTIGHIYDQGHCGSCWAMCSFEVLQDRFCIHSNGSEKPW 122
Query: 133 -------SCDA----------------------------------------SKGHTPKCV 145
SCD+ S TP C
Sbjct: 123 LSGQDITSCDSRSHGCNGGWTETAFEYAKKAGVPTEECVPYLMGKCHHPGCSSWQTPTCK 182
Query: 146 RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF- 204
+EC + Y + + +KSYS+ N ++I E+ +GPV FT +DDL +Y G +
Sbjct: 183 KECSSLSNYNYSSNRYYASKSYSIQRNVEAIQLELMRNGPVTAVFTTYDDLAVYWRGVYN 242
Query: 205 FVPGNET--TAMSLIKWTI-RDNTSQLGAEGAFTVFDDLIL-------------YKSGKA 248
V G+E A+ ++ W + R++ L E + + K
Sbjct: 243 HVMGSEQGLHAIKIVGWGVWRESEHMLTEEEKKAEEEKRKRIEEEIKKEKREDKWHDFKQ 302
Query: 249 LGGHAIRILGWGEDEKSKEK---YWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
+ + E + +KE+ YW+I NSW D+G +G+ I RG +ECGIES + G+
Sbjct: 303 NALEKSKKVKRDETKNNKEEGIPYWIIVNSWGEDFGMDGILLIKRGVNECGIESDVYTGI 362
Query: 306 PKLD 309
PK++
Sbjct: 363 PKIE 366
>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
Length = 353
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 89/167 (53%), Gaps = 40/167 (23%)
Query: 141 TPKCVRECQENYDVP-YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILY 199
PKC R+CQ +Y V KD FG +YSV ++E IM+EI+ +GPV+ AF V+ D Y
Sbjct: 213 APKCSRKCQSSYSVQDVSKDRRFGRVAYSVVADEHRIMEEIFVNGPVQAAFQVYLDFKTY 272
Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
KSG + + +G GGHAI+ILGW
Sbjct: 273 KSGVY-------------------------------------RHVTGPLEGGHAIKILGW 295
Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
G + + KYWL +NSW DWGD+G FKI+RG++ GIE+ + AG+P
Sbjct: 296 GVENGT--KYWLCSNSWGEDWGDHGFFKIVRGENHLGIETDVHAGLP 340
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/71 (47%), Positives = 44/71 (61%), Gaps = 2/71 (2%)
Query: 50 PRAHLKSW-MGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGS 108
PR L S+ +GV+ + L + RL I + D DLP FD+R KWP CP++REIR+QG
Sbjct: 64 PRQPLSSYRVGVNME-ELESKRLKPGILILKEDIDLPEQFDARDKWPQCPSLREIRNQGC 122
Query: 109 CGSCWGCRPYE 119
CGSCW E
Sbjct: 123 CGSCWAISAAE 133
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG G AW YWV+ G+ SGG Y SKQ
Sbjct: 163 CGDGCQGGVLGPAWDYWVQKGVSSGGPYNSKQ 194
>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
Length = 333
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/264 (34%), Positives = 118/264 (44%), Gaps = 38/264 (14%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR 131
P E+ L FD+ WP CPTI EIRDQ SCGSCW + G
Sbjct: 80 PRQFSEEELRVPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLGGV 139
Query: 132 PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG-AF 190
S G C C + Y + + Y+V I+ E + P A
Sbjct: 140 RDLRISAGDLMSCCDVCGYGCNGGYPE---VAWEYYAV----HGIVSEYCQPYPFPSCAH 192
Query: 191 TVFDDLILYKSGRFFVPGNETTA----MSLIKWTIRDNTSQLGA---------------E 231
V + SG + P +T + LIK+ R NTS + + E
Sbjct: 193 HVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIKY--RGNTSYILSGEESFKRELLLNGPFE 250
Query: 232 GAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
+F+V+ D + Y +G LGGHA+RI+GWG E + E YW IANSWN +WG NG
Sbjct: 251 VSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWG--ELNGEPYWKIANSWNHEWGMNG 308
Query: 285 LFKILRGKDECGIESSITAGVPKL 308
F I RG DECGIE S AG+P++
Sbjct: 309 YFLIARGVDECGIEGSGVAGIPRI 332
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG+GCNGG+P +AW Y+ GIVS
Sbjct: 155 VCGYGCNGGYPEVAWEYYAVHGIVS 179
>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
Length = 334
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/330 (29%), Positives = 133/330 (40%), Gaps = 123/330 (37%)
Query: 46 LSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
++ + R +K MG + LP E E+ LP +FD+ T WP+CPTI+ I
Sbjct: 55 MARLTRQGVKRLMGAKLRDAPVLPRRHFTE----EELRAPLPESFDAATAWPDCPTIKRI 110
Query: 104 ----------------------------RDQG-----------SCGS-CWG--------- 114
RD G SCG C G
Sbjct: 111 ADQSSCGSCWAVAAATAMSDRFCVTGGVRDLGISAGDLLSCCTSCGDGCDGGYPDEAWLY 170
Query: 115 ----------CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFG 163
C+PY PC+H ++ PSC HTPKC C + +P + F
Sbjct: 171 FTESGLVSDYCQPYPFPPCKHSGGRSKNPSCHDMHFHTPKCNATCTDK-RIPVVR--YFA 227
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
++SYS+ E+ +E+Y GP E A
Sbjct: 228 SESYSLQ-GEEDYKRELYLRGPFEVA---------------------------------- 252
Query: 224 NTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSW 276
FTV++D + Y+SG +GGHA+R++GWGE ++ YW IANSW
Sbjct: 253 ----------FTVYEDFLAYESGVYKHVSGGPVGGHAVRVVGWGE--RNGVPYWKIANSW 300
Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGVP 306
NTDWG+NG RGKDECGIES +AG P
Sbjct: 301 NTDWGENGYLYFYRGKDECGIESQGSAGTP 330
>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
Length = 341
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 100/207 (48%), Gaps = 49/207 (23%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKD 159
R + G S GC PY + C S D TPKC R+CQ Y+V D
Sbjct: 173 RGVSSGGPYNSRQGCHPYPVDVCH--------SAD-EDADTPKCTRKCQSMYNVTNVSDD 223
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
FG +YSVS +E+ I +EI+ +GPV+ +F V+ D YK+G
Sbjct: 224 RRFGRVAYSVSQDEERIKEEIFRNGPVQASFDVYLDFKAYKTG----------------- 266
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+ + G GGHA++++GWG + +K YWL +NSW D
Sbjct: 267 --------------------VYRHVFGPMEGGHAVKMIGWGVENGTK--YWLCSNSWGED 304
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WG+ G FKI+RG++ CGIES + AG+P
Sbjct: 305 WGERGFFKIVRGENHCGIESDVHAGLP 331
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG G AW++WV+ G+ SGG Y S+Q
Sbjct: 154 CGDGCQGGNLGPAWQFWVQRGVSSGGPYNSRQ 185
>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/197 (33%), Positives = 92/197 (46%), Gaps = 43/197 (21%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKSYSVSS 171
GC PY+ PC HH+N T+ P C TP CV +C Y K D ++ +S
Sbjct: 162 GCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYSTSLKNDRHYMLESSPYQY 221
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ + I GPV ++ V++D + YKSG +
Sbjct: 222 SVNNAKNAIRTDGPVSASYLVYEDFLAYKSGVY--------------------------- 254
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ SG LGGHA++I+GWGE+ + E YWL+ NSWN DWGD+GLFKI G
Sbjct: 255 ----------KHTSGSYLGGHAVKIIGWGEE--NGEAYWLVVNSWNEDWGDHGLFKIALG 302
Query: 292 KDECGIESSITAGVPKL 308
C I+ + G PK+
Sbjct: 303 N--CQIDDDLLGGTPKV 317
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/34 (58%), Positives = 25/34 (73%), Gaps = 1/34 (2%)
Query: 82 EDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
+DLP +FD+RT +PNC I IRDQ +CGSCW
Sbjct: 58 QDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWA 91
>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
Length = 340
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/193 (34%), Positives = 92/193 (47%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC G +C R C + D+ Y D F Y ++
Sbjct: 185 GCEPYRVPPCPRDDKGNNTCAGKPIEKNHRCTRMCYGDQDLDYNDDHRFTRDFYYLTYG- 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +F V+DD YKSG + E T
Sbjct: 244 -SIQKDVMTYGPIEASFDVYDDFPSYKSGVY-----EKT--------------------- 276
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
++ LGGHA++++GWG +E + YWL+ NSWN WGD GLFKI RG +
Sbjct: 277 ----------ENASYLGGHAVKLIGWGVEEGT--PYWLMVNSWNAQWGDKGLFKIRRGTN 324
Query: 294 ECGIESSITAGVP 306
ECGI++S TAGVP
Sbjct: 325 ECGIDNSTTAGVP 337
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 51/116 (43%), Gaps = 24/116 (20%)
Query: 23 RYWVKSGIVSGGAYGSKQA---EKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSE 79
R + +V Y ++QA EK+ + I GV+ D ++P + +++G
Sbjct: 3 RLVILLSVVLFSVYQTEQAYFLEKSYIDMINEVATTWTAGVNFDPSIPEDHFIKMLGSKG 62
Query: 80 VDE---------------------DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
V+ +P FD+R KW +C TI E+RDQG CGSCW
Sbjct: 63 VESAKQASAHEFKTNDVAYDNHFGHIPRTFDARKKWRHCRTIGEVRDQGHCGSCWA 118
>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 351
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 94/322 (29%), Positives = 127/322 (39%), Gaps = 110/322 (34%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
+N K +GV P +P YS LP FD+R++W C TI I D
Sbjct: 61 FANYTITQFKHILGVKPTPPALLAGVPTK-SYSR-SMKLPTEFDARSQWSGCSTIGTILD 118
Query: 106 QGS---------------------------------------CGS-CWGCRPY------- 118
QG CGS C G P
Sbjct: 119 QGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFR 178
Query: 119 -------EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
E P V P C+ + TPKC ++C+ +V +K+ +F +Y V S
Sbjct: 179 RKGVVTDECDPYFDQVGCKHPGCEPAY-RTPKCEKKCKVQNEV-WKEQKHFSVDAYRVHS 236
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
N IM E+Y +GPVE A
Sbjct: 237 NPHDIMAEVYTNGPVEVA------------------------------------------ 254
Query: 232 GAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
FTV++D YKSG +GGHA++++GWG + + E YWL+AN WN WGD+G
Sbjct: 255 --FTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDG 311
Query: 285 LFKILRGKDECGIESSITAGVP 306
FKI+RGK+ECGIE + AG+P
Sbjct: 312 YFKIIRGKNECGIEEDVVAGMP 333
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GCNGG+P AWRY+ + G+V+
Sbjct: 160 LCGSGCNGGYPISAWRYFRRKGVVT 184
>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
Length = 527
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 97/201 (48%), Gaps = 51/201 (25%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKS----Y 167
GC PY+ PC HH+N T+ P C TP CV +C Y K D ++ +S Y
Sbjct: 372 GCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKNDRHYMLESSPYQY 431
Query: 168 SVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQ 227
SV++ + +I + GP+ ++ V++D + YKSG +
Sbjct: 432 SVNNAKNAIRTD----GPISASYLVYEDFLAYKSGVY----------------------- 464
Query: 228 LGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
+ SG LGGHA++I+GWGE+ + E YWL+ NSWN DWGD GLFK
Sbjct: 465 --------------KHTSGSYLGGHAVKIIGWGEE--NGEAYWLVVNSWNEDWGDQGLFK 508
Query: 288 ILRGKDECGIESSITAGVPKL 308
I G C I+ + G PK+
Sbjct: 509 IALGN--CEIDDDLLGGTPKV 527
>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
Length = 335
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 91/195 (46%), Gaps = 44/195 (22%)
Query: 114 GCRPYEIAPCEH--HVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
GC PY C H G P C TPKC ++CQ Y ++D G SY+V
Sbjct: 183 GCLPYPFPKCSHLEETPGLAP-CPRELYATPKCEKQCQAGYSKTSEEDKIKGKSSYNVGD 241
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
E IM EI +GPV + +F+D +YKSG
Sbjct: 242 RETDIMMEIITNGPVSTIYYIFEDFTVYKSG----------------------------- 272
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ Y SG +GGH I +GWG + + KYWL ANSWN WG+NG F+I RG
Sbjct: 273 --------IYQYTSGSLMGGHGI--IGWGVE--NGVKYWLAANSWNEGWGENGYFRIRRG 320
Query: 292 KDECGIESSITAGVP 306
+ECGIES I AG+P
Sbjct: 321 TNECGIESRINAGLP 335
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 38/76 (50%), Gaps = 2/76 (2%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
K A +NI K +G + N + YS + DLP +FD+R KWPNC
Sbjct: 43 KAARSTRFNNI--EQFKKHLGALEETPEERNTRRPTVRYSVSENDLPESFDAREKWPNCS 100
Query: 99 TIREIRDQGSCGSCWG 114
+I EI DQ SC SCW
Sbjct: 101 SISEIPDQSSCSSCWA 116
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 18/27 (66%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG+GC GG+P MAW YW + GIVSGG
Sbjct: 151 CGYGCEGGYPSMAWDYWWRHGIVSGGT 177
>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
Length = 340
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 117/297 (39%), Gaps = 107/297 (36%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
P E+ +DLP FD+ WP C TI R+ + GS
Sbjct: 86 PRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGV 145
Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
CG C+G C+PY PC HH N +
Sbjct: 146 PDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWVGITTEVCQPYPFGPCSHHGNSDKY 205
Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
P C + TPKC C+ E V YK G SYSV EK +M E+ +GP+E
Sbjct: 206 PPCPNTIYDTPKCNTTCEKSEMDLVKYK-----GGTSYSVK-GEKELMIELMTNGPLEVT 259
Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
V+ D + YKSG + + SG L
Sbjct: 260 MQVYSDFVGYKSGVY-------------------------------------KHVSGDLL 282
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
GGHA++++GWG + YW IANSWNTDWGD G F I RG +ECGIES AG P
Sbjct: 283 GGHAVKLVGWGT--QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CGFGC GG P MAW +WV GI +
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITT 185
>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
Length = 340
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 117/297 (39%), Gaps = 107/297 (36%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
P E+ +DLP FD+ WP C TI R+ + GS
Sbjct: 86 PRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGV 145
Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
CG C+G C+PY PC HH N +
Sbjct: 146 PDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWVGITTEVCQPYPFGPCSHHGNSDKY 205
Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
P C + TPKC C+ E V YK G SYSV EK +M E+ +GP+E
Sbjct: 206 PPCPNTIYDTPKCNTTCEKSEMDLVKYK-----GGTSYSVK-GEKELMIELMTNGPLEVT 259
Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
V+ D + YKSG + + SG L
Sbjct: 260 MQVYSDFVGYKSGGY-------------------------------------KHVSGDLL 282
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
GGHA++++GWG + YW IANSWNTDWGD G F I RG +ECGIES AG P
Sbjct: 283 GGHAVKLVGWGT--QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337
Score = 38.9 bits (89), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CGFGC GG P MAW +WV GI +
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITT 185
>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
Length = 340
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 117/297 (39%), Gaps = 107/297 (36%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
P E+ +DLP FD+ WP C TI R+ + GS
Sbjct: 86 PRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGV 145
Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
CG C+G C+PY PC HH N +
Sbjct: 146 PDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWVGITTEVCQPYPFGPCSHHGNSDKY 205
Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
P C + TPKC C+ E V YK G SYSV EK +M E+ +GP+E
Sbjct: 206 PPCPNTIYDTPKCNTTCEKSEMDLVKYK-----GGTSYSVK-GEKELMIELMTNGPLEVT 259
Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
V+ D + YKSG + + SG L
Sbjct: 260 MQVYSDFVGYKSGVY-------------------------------------KHVSGDLL 282
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
GGHA++++GWG + YW IANSWNTDWGD G F I RG +ECGIES AG P
Sbjct: 283 GGHAVKLVGWGT--QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337
Score = 38.9 bits (89), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CGFGC GG P MAW +WV GI +
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITT 185
>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
Length = 375
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 122/302 (40%), Gaps = 111/302 (36%)
Query: 70 RLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRP------------ 117
+LP + ++ LP +FD+R KW CP++ +R+QG C S +
Sbjct: 117 QLPLGFVLKKDEQPLPMSFDARQKWSYCPSMNMVRNQGCCDSSYAVAAVSTMTDRWCVHS 176
Query: 118 ----------YEIAPCEHHV----NGTRPS------------------------------ 133
Y++ C H +G PS
Sbjct: 177 EGKAQFNFGAYDVLSCCHRCGFGCDGGVPSAVWHYWVENGITSGGAFGSHEGCQSYPFDV 236
Query: 134 CDAS--KGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFT 191
C S TP+C+R CQ Y+V Y +D ++G +Y+V +E+ IM E++ GP
Sbjct: 237 CKKSGDSNDTPRCLRFCQPGYNVTYPEDKHYGRVAYTVPKDEERIMYEVFNFGP------ 290
Query: 192 VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK---- 247
A+ FT++ D + YKSG
Sbjct: 291 --------------------------------------AQATFTMYTDFVQYKSGVYRHT 312
Query: 248 ---ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
+G H+++++GWG + + KYWL ANSW WGD G FKI+RG+D E+++ AG
Sbjct: 313 FGVRVGTHSVKVMGWGVE--NDVKYWLCANSWGAQWGDGGFFKIVRGEDHLSFETNVVAG 370
Query: 305 VP 306
+P
Sbjct: 371 LP 372
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 25/32 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGC+GG P W YWV++GI SGGA+GS +
Sbjct: 196 CGFGCDGGVPSAVWHYWVENGITSGGAFGSHE 227
>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
Length = 339
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 68/199 (34%), Positives = 97/199 (48%), Gaps = 54/199 (27%)
Query: 115 CRPYEIAPCEHHVNGTRPS-CDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY PC +H N C TP+C + CQ Y PYKKD + KSY + ++E
Sbjct: 184 CKPYAFHPCGNHENQVYYGVCPKGSWPTPRCEKFCQRGYIKPYKKDKFYAKKSYWLPNDE 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I +I ++GPV+ A
Sbjct: 244 KEIRLDIMKNGPVQAA-------------------------------------------- 259
Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D LYK G GGHA++I+GWG+D + YWLIANSW+ DWG++G F
Sbjct: 260 FDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWGKDNGTD--YWLIANSWSKDWGESGFF 317
Query: 287 KILRGKDECGIESSITAGV 305
+++RG+++C IE ITAG+
Sbjct: 318 RMVRGENDCEIEDMITAGI 336
>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
Length = 345
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 117/297 (39%), Gaps = 107/297 (36%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
P E+ +DLP FD+ WP C TI R+ + GS
Sbjct: 91 PRNFSVVEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGV 150
Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
CG C+G C+PY PC HH N +
Sbjct: 151 PDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWVGITTEVCQPYPFGPCSHHGNSDKY 210
Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
P C + TPKC C+ E V YK G SYSV EK +M E+ +GP+E
Sbjct: 211 PPCPNTIYDTPKCNTTCEKSEMDLVKYK-----GGTSYSVK-GEKELMIELMTNGPLEVT 264
Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
V+ D + YKSG + + SG L
Sbjct: 265 MQVYSDFVGYKSGVY-------------------------------------KHVSGDLL 287
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
GGHA++++GWG + YW IANSWNTDWGD G F I RG +ECGIES AG P
Sbjct: 288 GGHAVKLVGWGT--QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 342
Score = 38.9 bits (89), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CGFGC GG P MAW +WV GI +
Sbjct: 166 ICGFGCYGGIPTMAWLWWVWVGITT 190
>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 83/267 (31%), Positives = 110/267 (41%), Gaps = 92/267 (34%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCR--------------------------- 116
LP +FDSR KWP C I IR+Q CGSCW C+
Sbjct: 2 LPESFDSREKWPTC--IHPIRNQEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLS 59
Query: 117 PYEIAPCE------------------HHVNGTRPSC---DASKGHTPKCVRECQENYDVP 155
P ++ C H C + G P C + C N
Sbjct: 60 PQDLVSCNWYNAGCDGGILWAAWIYLKHTGIVTDQCLPYSSGNGVAPSCPKYC--NGTST 117
Query: 156 YKKDLNFGAKS-YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
+ + AK Y V S + IM EI +GPV+ F+V+ D + YKSG +
Sbjct: 118 PIDSVKYKAKDWYEVGSIAEKIMNEIATNGPVQSGFSVYQDFMSYKSGVY---------- 167
Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
+++G LGGHAI+I+GWG + + KYWL+AN
Sbjct: 168 ---------------------------THQTGSFLGGHAIKIVGWGVE--NNVKYWLVAN 198
Query: 275 SWNTDWGDNGLFKILRGKDECGIESSI 301
SW DWG NGLFKI RG +ECGIE+ +
Sbjct: 199 SWGPDWGLNGLFKIKRGDNECGIEADV 225
>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 341
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 97/198 (48%), Gaps = 53/198 (26%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPY I PC HH N T + TP C ++CQ Y Y+ D +G ++ + + +
Sbjct: 184 CRPYPIHPCGHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYRMDKRYGTDAFQLPKSVE 243
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I KE+ ++GPV +F
Sbjct: 244 AIQKELLKNGPVTASFA------------------------------------------- 260
Query: 235 TVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
V++D LYKSG + G HA++++GWG + ++ YWLIANSW+ DWG+NG F+
Sbjct: 261 -VYEDFSLYKSGIYRHTAGELRGYHAVKMIGWGTENRTD--YWLIANSWHDDWGENGYFR 317
Query: 288 ILRGKDECGIESSITAGV 305
I+RG ++CGIE ++ AG+
Sbjct: 318 IIRGINDCGIEENVAAGL 335
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGC+GG+ AW Y+ +G+VSGG Y SK+
Sbjct: 151 CGFGCDGGWSIKAWEYFTYAGLVSGGEYRSKR 182
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 3/48 (6%)
Query: 69 NRLPELIGYS--EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
N+ P LI E ++D+P +D R W NC + IRDQ +CGSCW
Sbjct: 69 NQNPNLIVKDDPEPEDDIPEEYDPRKIWSNCTSFY-IRDQANCGSCWA 115
>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
Length = 332
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 95/202 (47%), Gaps = 59/202 (29%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY PC + G P TP C C E YD Y++D +G+ +Y + ++E
Sbjct: 183 GCKPYPFKPCLYPFVGCHPE------KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EI +GPVE
Sbjct: 237 RMIQLEIMTNGPVESG-------------------------------------------- 252
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F+V+ DL LYK+G + +G HA+R++GWG++ YWLIANS+ DWG++G F
Sbjct: 253 FSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKERGVP--YWLIANSYGEDWGEHGYF 310
Query: 287 KILRGKDECGIESSITAGVPKL 308
K LRG + GIES + AG+PK+
Sbjct: 311 KFLRGSNHLGIESVVIAGLPKV 332
Score = 40.8 bits (94), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 32/56 (57%), Gaps = 4/56 (7%)
Query: 9 CGFGCNGGF-PGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPD 63
CG GCNGGF G +++YWV G+VSG AY S K + L ++G HP+
Sbjct: 150 CGNGCNGGFLDGTSFQYWVDVGLVSGAAYNSTDGCKPYPF---KPCLYPFVGCHPE 202
>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 70/197 (35%), Positives = 87/197 (44%), Gaps = 53/197 (26%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPY PC+HHV+ + TP CV+ C Y D SYSVSS +
Sbjct: 172 CRPYTFPPCDHHVDDGKYGPCGDSQPTPACVKSCTAQSGRNYDSDKIRSIDSYSVSSKVE 231
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
I EI GPVE + F
Sbjct: 232 QIQNEIMTFGPVEAS--------------------------------------------F 247
Query: 235 TVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
TV++D + YKSG LGGHA++I+GWG ++ YWL+ NSWN WG+NGLFK
Sbjct: 248 TVYEDFLTYKSGVYQNVAGANLGGHAVKIIGWGVEKNVP--YWLVVNSWNEGWGENGLFK 305
Query: 288 ILRGKDECGIESSITAG 304
ILRG + GIE I AG
Sbjct: 306 ILRGSNHVGIEGGIYAG 322
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 42/67 (62%), Gaps = 8/67 (11%)
Query: 51 RAHLKSWMGV---HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQG 107
A LK+ MG PD+ +LPE E + ++P +FD+R +WPNC +I+E+RDQ
Sbjct: 44 EATLKTQMGTFLDEPDFM----KLPESTVQFE-NLEIPESFDARQQWPNCESIKEVRDQS 98
Query: 108 SCGSCWG 114
+CGSCW
Sbjct: 99 TCGSCWA 105
Score = 41.2 bits (95), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
CG GCNGGFP AW Y+ G+V+G +G
Sbjct: 139 CGMGCNGGFPSGAWNYFKNKGLVTGDLFG 167
>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
Length = 332
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 95/202 (47%), Gaps = 59/202 (29%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY PC + G P TP C C E YD Y++D +G+ +Y + ++E
Sbjct: 183 GCKPYPFKPCLYPFVGCHPE------KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I EI +GPVE
Sbjct: 237 RMIQLEIMTNGPVESG-------------------------------------------- 252
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F+V+ DL LYK+G + +G HA+R++GWG++ YWLIANS+ DWG++G F
Sbjct: 253 FSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKERGVP--YWLIANSYGEDWGEHGYF 310
Query: 287 KILRGKDECGIESSITAGVPKL 308
K LRG + GIES + AG+PK+
Sbjct: 311 KFLRGSNHLGIESVVIAGLPKV 332
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 4/56 (7%)
Query: 9 CGFGCNGGF-PGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPD 63
CG GCNGGF G +++YWV G+VSG AY + K + L ++G HP+
Sbjct: 150 CGNGCNGGFLDGTSFQYWVDVGLVSGAAYNNTDGCKPYPF---KPCLYPFVGCHPE 202
>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
Length = 334
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 97/195 (49%), Gaps = 44/195 (22%)
Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
GC+PY + PC + + N T K H +C R C N D+ +K+D ++ +Y ++
Sbjct: 182 GCQPYRVPPCPLDEYGNNTCSGKPTEKNH--RCTRMCYGNQDLDFKEDHHYTRDAYYLTY 239
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+I ++ +GP+E +F V+DD YKSG + N T
Sbjct: 240 G--TIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATY------------------- 278
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
LGGHA++++GWGE+ YWL+ NSWN WGD GLFKI RG
Sbjct: 279 -----------------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRG 319
Query: 292 KDECGIESSITAGVP 306
+ECGI++S T GVP
Sbjct: 320 TNECGIDNSTTGGVP 334
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 52/104 (50%), Gaps = 26/104 (25%)
Query: 36 YGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------------- 76
Y ++QA E++ +++I A+ K+W GV+ D L + +L+G
Sbjct: 13 YQTEQAYFLEEDYINHI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPDMF 71
Query: 77 ------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y+ +P+ FD+R KW C TI E+RDQG CGSCW
Sbjct: 72 KTHDEAYNNWSNRIPSYFDARKKWRKCLTIGEVRDQGHCGSCWA 115
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 23/33 (69%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGC+GG+P AW + K G+V+GG Y S +
Sbjct: 150 CGFGCSGGYPIKAWERFKKHGLVTGGNYESGEG 182
>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
putative [Trypanosoma brucei gambiense DAL972]
Length = 340
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 112/298 (37%), Gaps = 100/298 (33%)
Query: 66 LPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEH 125
LP R E E LP++FDS WPNCPTI +I DQ +CGSCW
Sbjct: 80 LPKRRFTE----EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRF 135
Query: 126 HVNGTRPSCDASKGHTPKCVREC------------------------------------- 148
G S G C +C
Sbjct: 136 CTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHH 195
Query: 149 -----------QENYDVP---YKKD------LNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
Q N+D P Y D +N+ + + E M+E++ GP E
Sbjct: 196 SKSKNGYPPCSQFNFDTPKCNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEV 255
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
AF V++D I Y SG + + SG+
Sbjct: 256 AFDVYEDFIAYNSGVYH-------------------------------------HVSGQY 278
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
LGGHA+R++GWG + YW IANSWNT+WG +G F I RG ECGIE +AG+P
Sbjct: 279 LGGHAVRLVGWG--TSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 334
>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
Length = 325
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 92/322 (28%), Positives = 119/322 (36%), Gaps = 104/322 (32%)
Query: 46 LSNIPRAHLKSWMGVHPDYN----LPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIR 101
+ NI K GV N LP R E E LP++FDS WPNCPTI
Sbjct: 34 MQNITLREAKRLNGVIKKNNNASILPKRRFTE----EEARAPLPSSFDSAEAWPNCPTIP 89
Query: 102 EIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVREC------------- 148
+I DQ +CGSCW G S G C +C
Sbjct: 90 QIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAW 149
Query: 149 -----------------------------------QENYDVP---YKKD------LNFGA 164
Q N+D P Y D +N+ +
Sbjct: 150 AYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPVVNYRS 209
Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
+ E M+E++ GP E AF V++D I Y SG +
Sbjct: 210 WTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYH------------------- 250
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
+ SG+ LGGHA+R++GWG + YW IANSWNT+WG +G
Sbjct: 251 ------------------HVSGQYLGGHAVRLVGWG--TSNGVPYWKIANSWNTEWGMDG 290
Query: 285 LFKILRGKDECGIESSITAGVP 306
F I RG ECGIE +AG+P
Sbjct: 291 YFLIRRGSSECGIEDGGSAGIP 312
>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
Free-electron Laser Pulse Data By Serial Femtosecond
X-ray Crystallography
gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 340
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 112/298 (37%), Gaps = 100/298 (33%)
Query: 66 LPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEH 125
LP R E E LP++FDS WPNCPTI +I DQ +CGSCW
Sbjct: 80 LPKRRFTE----EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRF 135
Query: 126 HVNGTRPSCDASKGHTPKCVREC------------------------------------- 148
G S G C +C
Sbjct: 136 CTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHH 195
Query: 149 -----------QENYDVP---YKKD------LNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
Q N+D P Y D +N+ + + E M+E++ GP E
Sbjct: 196 SKSKNGYPPCSQFNFDTPKCNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEV 255
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
AF V++D I Y SG + + SG+
Sbjct: 256 AFDVYEDFIAYNSGVYH-------------------------------------HVSGQY 278
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
LGGHA+R++GWG + YW IANSWNT+WG +G F I RG ECGIE +AG+P
Sbjct: 279 LGGHAVRLVGWG--TSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 334
>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
Length = 317
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 92/322 (28%), Positives = 119/322 (36%), Gaps = 104/322 (32%)
Query: 46 LSNIPRAHLKSWMGVHPDYN----LPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIR 101
+ NI K GV N LP R E E LP++FDS WPNCPTI
Sbjct: 33 MQNITLREAKRLNGVIKKNNNASILPKRRFTE----EEARAPLPSSFDSAEAWPNCPTIP 88
Query: 102 EIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVREC------------- 148
+I DQ +CGSCW G S G C +C
Sbjct: 89 QIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAW 148
Query: 149 -----------------------------------QENYDVP---YKKD------LNFGA 164
Q N+D P Y D +N+ +
Sbjct: 149 AYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDPTIPVVNYRS 208
Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
+ E M+E++ GP E AF V++D I Y SG +
Sbjct: 209 WTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYH------------------- 249
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
+ SG+ LGGHA+R++GWG + YW IANSWNT+WG +G
Sbjct: 250 ------------------HVSGQYLGGHAVRLVGWG--TSNGVPYWKIANSWNTEWGMDG 289
Query: 285 LFKILRGKDECGIESSITAGVP 306
F I RG ECGIE +AG+P
Sbjct: 290 YFLIRRGSSECGIEDGGSAGIP 311
>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
Length = 396
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 95/203 (46%), Gaps = 62/203 (30%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVREC-QENYDVPYKKDLNF-GAKSYSVS 170
GC PY+I PC H+ N T P C +K P C C + YD P +KD +F +S S
Sbjct: 241 GCWPYDIPPCAHYTNSTLYPKCPKTKYDFPTCQESCPNKKYDTPMEKDRHFVEEESLSAL 300
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+ +I KEI +GP
Sbjct: 301 RSIDAIKKEIMTNGP--------------------------------------------V 316
Query: 231 EGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
++ V+DD + YKSG ALGGHA++I+GWGED YWL+ NSWN +WGDN
Sbjct: 317 SASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGWGED------YWLVVNSWNKNWGDN 370
Query: 284 GLFKILRGKDECGIESSITAGVP 306
G+FKI G +CGIE ++ AG P
Sbjct: 371 GMFKI--GCGQCGIEDNVLAGTP 391
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPT-IREIRDQGSCGSCWGCRPYE 119
P N +L E+ +DLP +F++ ++ C + I IRDQ +CGSCW P E
Sbjct: 123 PENIREKLYTADEL-KDLPVSFNATEEFKECSSVIGHIRDQSACGSCWAFAPTE 175
>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 89/295 (30%), Positives = 123/295 (41%), Gaps = 92/295 (31%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPY 118
G P+ +LP PE+ V E++P NFD+R +WP +I IR+QG CGSCW
Sbjct: 64 GAWPEGSLP----PEI--EVRVAENIPENFDARKQWPG--SIHPIRNQGQCGSCWAFGAS 115
Query: 119 EIAPCEHHVNGTRP-----------SCDASKG-------------------HTPKC---- 144
E+ + CD T +C
Sbjct: 116 EVLSDRFAIASKNQIYVTLSAQQLVDCDLDNSGCSGGWPINAWNYMVKTGLLTEQCYGPY 175
Query: 145 ------VRECQENYDVPYKKDLN---FGAKS-YSV-SSNEKSIMKEIYEHGPVEGAFTVF 193
R D P++ + + AKS Y + + N ++I +I +GPVE FT+F
Sbjct: 176 YAKQYTCRLTANTTDCPWQPGVKARFYHAKSAYKLPAKNVEAIQTDIMNNGPVEADFTIF 235
Query: 194 DDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHA 253
D Y+SG + ++ +GK LGGHA
Sbjct: 236 QDFYAYRSGIY-------------------------------------VHATGKQLGGHA 258
Query: 254 IRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
I+ILGWG ++ YWL ANSW +WG G FKI RG DECGIE + AG+P L
Sbjct: 259 IKILGWGTEDNV--DYWLCANSWGANWGIQGYFKIRRGTDECGIEDGLAAGLPLL 311
>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
Length = 320
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 61/166 (36%), Positives = 82/166 (49%), Gaps = 39/166 (23%)
Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
P C R CQ Y + Y +DL +G +Y V NE +IM EIY++GPV F VF D YKS
Sbjct: 193 PTCSRTCQAGYPLTYSQDLKYGGSAYRVMWNENAIMTEIYQNGPVVVQFEVFADFYQYKS 252
Query: 202 GRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE 261
G + + +G G HA+R++GWG
Sbjct: 253 GVY-------------------------------------RHVTGATEGWHAVRVIGWGV 275
Query: 262 DEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
+ + KYWL+ANSW WGD G FK +RG++ GIE + AG+PK
Sbjct: 276 E--NGVKYWLVANSWGVRWGDKGFFKFVRGENHLGIEDFVYAGLPK 319
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 20/30 (66%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
F C+GG+ G W+YWV SG+ S G Y S Q
Sbjct: 147 FKCDGGYVGKTWQYWVDSGLTSEGPYKSGQ 176
>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 92/201 (45%), Gaps = 56/201 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + G + +C R C N D+ + +D + SY ++
Sbjct: 185 GCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG- 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +
Sbjct: 244 -SIQKDVMTYGPIEAS-------------------------------------------- 258
Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F V+DD YKSG LGGHA++++GWGE+ YWL+ NSWN DWGDNGL
Sbjct: 259 FDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWGEEYGVP--YWLMVNSWNADWGDNGL 316
Query: 286 FKILRGKDECGIESSITAGVP 306
FKI RG +ECGI++S TAGVP
Sbjct: 317 FKIRRGTNECGIDNSTTAGVP 337
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 26/39 (66%)
Query: 76 GYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y ++ +P +FD+R KW C TI +RDQG+CGSCW
Sbjct: 80 AYDKLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWA 118
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGG+P AW + K G+V+GG Y S +
Sbjct: 153 CGFGCNGGYPIKAWERFKKRGLVTGGDYQSGE 184
>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
Length = 340
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 92/201 (45%), Gaps = 56/201 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + G + +C R C N D+ + +D + SY ++
Sbjct: 185 GCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG- 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +
Sbjct: 244 -SIQKDVMTYGPIEAS-------------------------------------------- 258
Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F V+DD YKSG LGGHA++++GWGE+ YWL+ NSWN DWGDNGL
Sbjct: 259 FDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWGEEYGVP--YWLMVNSWNADWGDNGL 316
Query: 286 FKILRGKDECGIESSITAGVP 306
FKI RG +ECGI++S TAGVP
Sbjct: 317 FKIRRGTNECGIDNSTTAGVP 337
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 76 GYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y + +P +FD+R KW C TI +RDQG+CGSCW
Sbjct: 80 AYDNLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWA 118
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGG+P AW + K G+V+GG Y S +
Sbjct: 153 CGFGCNGGYPIKAWERFKKRGLVTGGDYQSGE 184
>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
Length = 313
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/208 (30%), Positives = 102/208 (49%), Gaps = 48/208 (23%)
Query: 103 IRDQGSCGSCWGCRPYEIAP-CEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYK-KDL 160
+ G GS GC PY + P C G P P C C Y+V +D
Sbjct: 149 VSSGGPYGSNQGCHPYPMPPSCPKPSEGDYPD-------EPNCSTRCNAGYNVTEDLRDR 201
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
FG +YS+ ++E+ IM++I+ +GPV+ F ++D++ Y G +
Sbjct: 202 RFGRVAYSIPADERKIMEDIFVNGPVQAVFQWYEDIVNYSGGVY---------------- 245
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
++SG+ GGHA++++GWG ++ +K YWL+ANSW W
Sbjct: 246 ---------------------RHQSGRLKGGHAVKLIGWGVEDGTK--YWLVANSWGRVW 282
Query: 281 GDNGLFKILRGKDECGIESSITAGVPKL 308
GD+G FK++RG++ CGIE ++ AG+P
Sbjct: 283 GDDGFFKMVRGENHCGIEENVHAGLPSF 310
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 37/75 (49%), Gaps = 1/75 (1%)
Query: 38 SKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNC 97
SK N + P A + GV P L RL I D LP +FD+R +WP C
Sbjct: 17 SKILSSNLTTTSPFAWILDLPGV-PLEKLKETRLHPAINVFAEDLVLPKSFDARQQWPQC 75
Query: 98 PTIREIRDQGSCGSC 112
++ EIR QG CGSC
Sbjct: 76 SSLNEIRTQGCCGSC 90
>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/262 (33%), Positives = 115/262 (43%), Gaps = 34/262 (12%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR 131
P E+ L FD+ WP CPT+ EIRDQ SCGSCW + G
Sbjct: 80 PRQFSEEELRVPLQDRFDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTLGGV 139
Query: 132 PSCDASKGHTPKCVRECQ------------ENYDV-----PYKKDLNFGAKSYSVSSNEK 174
S G C C E Y V Y + F + ++ V+S++
Sbjct: 140 RDLRISAGDLMSCCDVCGFGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPSCAHHVNSSDL 199
Query: 175 SIMKEIYEHGPVEGAFTVFD-DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
S Y+ T LI Y+ GN + +S + R+ E +
Sbjct: 200 SPCSGEYDTPTCNSTCTDKKIPLIKYR-------GNTSYVLSGEEPFKRELILNGPFEVS 252
Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F+V+ D + Y +G LGGHA+RI+GWG E + E YW IANSWN +WG NG F
Sbjct: 253 FSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWG--ELNGEPYWKIANSWNREWGMNGYF 310
Query: 287 KILRGKDECGIESSITAGVPKL 308
I RG DECGIE S AG P++
Sbjct: 311 LIARGVDECGIEGSGVAGTPRI 332
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 20/25 (80%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CGFGCNGG+P +AW Y+ GIVS
Sbjct: 155 VCGFGCNGGYPEVAWEYYAVHGIVS 179
>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 1308
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/265 (29%), Positives = 111/265 (41%), Gaps = 92/265 (34%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE--------------------IAP 122
+LP NFD+ +WP CPTI I++Q CGSCW E +
Sbjct: 69 NLPTNFDAAQQWPQCPTIGAIQNQAECGSCWAFGAIESISDRFCIHKNESVQLSFQDLIT 128
Query: 123 CEHHVNG--------------------------TRPSCDASKG------HTPKCVRECQE 150
C++ NG T P+C ++ +TP C +C
Sbjct: 129 CDNQDNGCEGGDPYTAYKYVQKNGVVTSNCQPYTIPTCPPAQQPCMNFVNTPPCSAKC-A 187
Query: 151 NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNE 210
N V +++DL+ Y+V N +I EI +GPVE F V++D + YKSG +
Sbjct: 188 NSSVNFQQDLHHLKTVYAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSGVY------ 241
Query: 211 TTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYW 270
+KSGK LGGH I+I+G+G + YW
Sbjct: 242 -------------------------------THKSGKDLGGHCIKIVGFGVSNGTP--YW 268
Query: 271 LIANSWNTDWGDNGLFKILRGKDEC 295
+ NSW T WG+NG+F I GK+EC
Sbjct: 269 ICNNSWTTSWGNNGIFWIEAGKNEC 293
>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
Length = 311
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 110/277 (39%), Gaps = 87/277 (31%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------- 114
PE + S+V +P +FDSRT WP C + + +QG CGSCW
Sbjct: 71 PEEVSVSKVA--VPNSFDSRTNWPGC--VHAVLNQGQCGSCWAFAASESLSDRLCIASQG 126
Query: 115 -----CRPYEIAPCE----HHVNGTRP---------------SC---DASKGHTPKCVRE 147
P + C+ NG P SC + G P C +E
Sbjct: 127 AINVTLSPQALVSCDIEFNQGCNGGIPQMAWEYLELHGIPTDSCFPYTSGNGTAPDCQKE 186
Query: 148 CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVP 207
C + K F K+ S+ +I ++ +GP+EG V+ D + Y SG +
Sbjct: 187 CSDGSKYQLYKGKTFTLKT---CSSVAAIQANVFAYGPIEGTMDVYQDFMSYTSGVY--- 240
Query: 208 GNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE 267
++ K LGGHAI+I+GWG D S
Sbjct: 241 ---------------------------------VMTPGSKLLGGHAIKIVGWGTDSTSGL 267
Query: 268 KYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
YW++ NSW +DWG NG F I RG + CGI+ +AG
Sbjct: 268 DYWIVQNSWGSDWGMNGFFWIQRGTNMCGIDRDASAG 304
>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
Length = 332
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 68/206 (33%), Positives = 102/206 (49%), Gaps = 43/206 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVREC--QENYDVPYKKDLNFG 163
GS + +GC+PY IAPC V N T P+C + TP C ++C + Y V KD ++G
Sbjct: 165 GSYETQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYG 224
Query: 164 AKSYSVSSNEK-SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
A S N + I ++ +GP+E F V+DD + Y +G +
Sbjct: 225 ASSVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIY------------------ 266
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
++ +G G ++RILGWG E YWL+ANSW +WG+
Sbjct: 267 -------------------VHLTGNKQGHLSVRILGWGMYEGVP--YWLLANSWGKEWGE 305
Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
NG F+ LRG +ECG+E++ + +PKL
Sbjct: 306 NGTFRALRGTNECGLEANCVSAMPKL 331
>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
Length = 334
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 97/195 (49%), Gaps = 44/195 (22%)
Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
GC+PY + PC + + N T A K H +C R C N ++ +K+D + +Y +
Sbjct: 182 GCQPYRVPPCPFDEYGNNTCRGKPAEKNH--RCTRMCYGNQNLDFKEDHRYTRDAYYL-- 237
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
N + I ++ +GP+E ++ V+DD YKSG + N +
Sbjct: 238 NYQIIQNDLMTYGPIEASYDVYDDFPNYKSGVYMKTENASY------------------- 278
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
LGGHA++++GWGE+ YWL+ NSWN WGD GLFKI RG
Sbjct: 279 -----------------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRG 319
Query: 292 KDECGIESSITAGVP 306
+ECGI++S T GVP
Sbjct: 320 TNECGIDNSTTGGVP 334
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 28/39 (71%)
Query: 76 GYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y+ + +P+NFD+R KW C T+ ++RDQG+CG+CW
Sbjct: 77 AYNSLPNRIPSNFDARKKWRKCSTVGKVRDQGNCGTCWA 115
Score = 37.7 bits (86), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GC+GG+P AW + K G+V+GG Y S
Sbjct: 150 CGSGCHGGYPIKAWERFRKHGLVTGGDYNS 179
>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
Length = 369
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/305 (28%), Positives = 119/305 (39%), Gaps = 91/305 (29%)
Query: 57 WMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG- 114
++G+HPD N PE+ +P FD+R WP C I IR+QG C S W
Sbjct: 49 FLGIHPDPNFK----PEIKEPQATQNVIPETFDAREYWPECADIIGNIRNQGKCSSSWAF 104
Query: 115 ---------------------CRPYEIAPCEHHV-------------------------- 127
P ++ C H+
Sbjct: 105 AAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHYCGNQCKGGYTYYAWNYFMLTGLVSGGD 164
Query: 128 ----NGTRPSCDASKGH-TPKCVRECQ-ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
G +P + + TP C CQ + Y +PY D +FG Y + NE +I EI
Sbjct: 165 YNTSTGCQPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEIL 224
Query: 182 EHG-PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
G PV AF V+ D +Y+ G E T+ + +
Sbjct: 225 SGGGPVVAAFDVYGDFKIYRDG----------------------------EQHDTILEGV 256
Query: 241 ILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD-NGLFKILRGKDECGIES 299
+Y SG G A++I+GWG + + YWL ANSW DWG G FKI RG +ECG E
Sbjct: 257 YIYTSGALFGRTAVKIIGWGTE--NGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFEE 314
Query: 300 SITAG 304
SI AG
Sbjct: 315 SIIAG 319
>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
Length = 340
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 95/195 (48%), Gaps = 48/195 (24%)
Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYD--VPYKKDLNFGAKSYSVSS 171
C+PY PC HH N + P C ++ TPKC C+ N V YK G+ SYSV
Sbjct: 188 CQPYPFDPCSHHGNSEKYPPCPSTIYDTPKCNTTCERNEMDLVKYK-----GSTSYSVK- 241
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
EK +M E+ +GP+E V+ D + YKSG
Sbjct: 242 GEKELMIELMTNGPLELTMQVYSDFVGYKSG----------------------------- 272
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
V+ ++ G LGGHA++++GWG + YW +ANSWNTDWGD G F I RG
Sbjct: 273 ----VYKHVL----GDFLGGHAVKLVGWGTQDGVP--YWKVANSWNTDWGDKGYFLIQRG 322
Query: 292 KDECGIESSITAGVP 306
+EC IES AG+P
Sbjct: 323 NNECKIESGGVAGIP 337
>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
Length = 283
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/265 (32%), Positives = 115/265 (43%), Gaps = 88/265 (33%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GCRPYEIAP-----C 123
+P FD+R KWP+ I +RDQG CGSCW GC +IAP C
Sbjct: 63 VPDTFDAREKWPD--AILPVRDQGECGSCWAFSIAETIGDRLGVLGCSRGDIAPEDLVSC 120
Query: 124 EHHVNGTRPSCDASKGHTPKCVRECQEN-----YDVPYK----------KDLNFGAKSYS 168
+ +G CD G CQEN +PYK + G+ Y
Sbjct: 121 DIFDDG----CDG--GFIDMAWDWCQENGLTTEECIPYKAGEGVPSPCPETCEDGSAIYR 174
Query: 169 VSS------NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
+ I EIYE+GPV F V+ D + YKSG +
Sbjct: 175 TPIESYRYIDADDIQGEIYEYGPVSMGFIVYSDFMSYKSGVY------------------ 216
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
++++G GGHA+ I+GWG +++ YWL+ NSW TDWG+
Sbjct: 217 -------------------VHQAGYIEGGHAVLIVGWGVEDEVP--YWLVQNSWGTDWGE 255
Query: 283 NGLFKILRGKDECGIESSITAGVPK 307
NG FKILRG D C ES++TAG P+
Sbjct: 256 NGFFKILRGSDHCECESNVTAGYPE 280
>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
Length = 512
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 97/202 (48%), Gaps = 49/202 (24%)
Query: 111 SCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQE----NYDVPYKKDLNFGAKS 166
SCW PYEI C HH G P C+ PKC ++C+E + P+K DL+F +
Sbjct: 341 SCW---PYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSA 397
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSV ++ I +E+ E+G + GAF V++D +LYK G +
Sbjct: 398 YSVEGRDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYH--------------------- 435
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ +G +GGHA++++G+G ++ YWL NSWN WGD G F
Sbjct: 436 ----------------HVTGMPMGGHAVKVIGFGNED--GRDYWLAVNSWNEYWGDKGTF 477
Query: 287 KILRGKDECGIESSITAGVPKL 308
KI G E GI+ G PK+
Sbjct: 478 KIEMG--EAGIDKEFCGGEPKV 497
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 37/69 (53%), Gaps = 5/69 (7%)
Query: 51 RAHLKSWMGVHPDYNLPANRLPELIG---YSEVDEDLPAN-FDSRTKWPNCP-TIREIRD 105
+ H+ +++ + D + P L E + ++E + L + FD+R +P C I +RD
Sbjct: 200 KRHMGTYLSFYSDPDKPEVPLGEPLPVKVFAETQQVLETDKFDAREAFPQCAEVIGHVRD 259
Query: 106 QGSCGSCWG 114
QG CGSCW
Sbjct: 260 QGDCGSCWA 268
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 20/26 (76%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
FGC+GG P MAWR++ G+V+GG Y
Sbjct: 308 FGCSGGQPRMAWRWFSNDGVVTGGDY 333
>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
Length = 512
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 97/202 (48%), Gaps = 49/202 (24%)
Query: 111 SCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQE----NYDVPYKKDLNFGAKS 166
SCW PYEI C HH G P C+ PKC ++C+E + P+K DL+F +
Sbjct: 341 SCW---PYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSA 397
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSV ++ I +E+ E+G + GAF V++D +LYK G +
Sbjct: 398 YSVEGRDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYH--------------------- 435
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ +G +GGHA++++G+G ++ YWL NSWN WGD G F
Sbjct: 436 ----------------HVTGMPMGGHAVKVIGFGNED--GRDYWLAVNSWNEYWGDKGTF 477
Query: 287 KILRGKDECGIESSITAGVPKL 308
KI G E GI+ G PK+
Sbjct: 478 KIEMG--EAGIDKEFCGGEPKV 497
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 37/69 (53%), Gaps = 5/69 (7%)
Query: 51 RAHLKSWMGVHPDYNLPANRLPELIG---YSEVDEDLPAN-FDSRTKWPNCP-TIREIRD 105
+ H+ +++ + D + P L E + ++E + L + FD+R +P C I +RD
Sbjct: 200 KRHMGTYLSFYSDPDKPEVPLGEPLPVKVFAETQQVLETDKFDAREAFPQCAEVIGHVRD 259
Query: 106 QGSCGSCWG 114
QG CGSCW
Sbjct: 260 QGDCGSCWA 268
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 20/26 (76%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
FGC+GG P MAWR++ G+V+GG Y
Sbjct: 308 FGCSGGQPRMAWRWFSNDGVVTGGDY 333
>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
kowalevskii]
Length = 93
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 75/132 (56%), Gaps = 39/132 (29%)
Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
M EI ++GPVEGAFTV+ D YKSG +
Sbjct: 1 MAEIQKYGPVEGAFTVYADFPSYKSGVY-------------------------------- 28
Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
+++G+ALGGHAI+ILGWG ++ YWL+ANSWN DWGD G FKILRG DECG
Sbjct: 29 -----QHETGEALGGHAIKILGWGNED--GHDYWLVANSWNEDWGDQGFFKILRGVDECG 81
Query: 297 IESSITAGVPKL 308
IES ITAG PKL
Sbjct: 82 IESQITAGSPKL 93
>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 335
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/195 (32%), Positives = 94/195 (48%), Gaps = 40/195 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY + PC G KC ++C + + YKK+ +Y +S+
Sbjct: 181 GCQPYRVPPCVRDDEGHNSCSGQPTERNHKCSKKCYGDETINYKKNHYKTKDAYYLSNT- 239
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++ K+ +GP+E +F V+DD Y+SG + N +
Sbjct: 240 -TMQKDTMVYGPIEASFDVYDDFTSYESGVYQKTENAS---------------------- 276
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
LGGHA++++GWG +E + YWL+ NSW WGD G+FKILRG D
Sbjct: 277 --------------YLGGHAVKMIGWGVEEGT--PYWLMVNSWGEQWGDKGMFKILRGTD 320
Query: 294 ECGIESSITAGVPKL 308
ECG+ESS TAGVP +
Sbjct: 321 ECGVESSCTAGVPSV 335
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 41/75 (54%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+A++N N PR + +G L + + E + ++P FDSR +W NC T
Sbjct: 40 KAKQNFPENTPREDIVRLLGSKRLLGLNKSPIKENDILYVDNGEVPEFFDSRLEWKNCKT 99
Query: 100 IREIRDQGSCGSCWG 114
I E+R+QG+CGSCW
Sbjct: 100 IGEVRNQGNCGSCWA 114
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGCNGG P AW+Y+ + G+V+GG Y +
Sbjct: 149 CGFGCNGGNPLKAWKYFKRHGVVTGGNYNT 178
>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
Length = 340
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 115/292 (39%), Gaps = 97/292 (33%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR 131
P E+ +DLP FD+ WP C TI EIRDQ +CGSCW E + G
Sbjct: 86 PRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYCTFGGV 145
Query: 132 PSCDASKGHTPKC--------------------------VRECQEN-------------- 151
P S + C +CQ
Sbjct: 146 PDRRMSTSNLLSCCFICGLGCHGGIPTVAWLWWVWVGIATEDCQPYPFDPCSHHGNSEKY 205
Query: 152 -------YDVPY------KKDLNF----GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD 194
YD P + +++ G+ SYSV EK +M E+ +GP+E V+
Sbjct: 206 PPCPSTIYDTPKCNTTCERSEMDLVKYKGSTSYSV-KGEKELMIELMTNGPLELTMQVYS 264
Query: 195 DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAI 254
D + YKSG V+ ++ G+ LGGHA+
Sbjct: 265 DFVGYKSG---------------------------------VYKHVL----GEFLGGHAV 287
Query: 255 RILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+++GWG + YW +ANSWNTDWGD G F I RG +EC IES AG+P
Sbjct: 288 KLVGWGTQDGV--PYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIP 337
>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 276
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 91/193 (47%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + G +C R C + D+ + +D + Y ++
Sbjct: 122 GCEPYRVPPCPNDDQGNNTCSGQPMEKNHRCTRMCYGDQDLDFDEDHRYTRDHYYLTY-- 179
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I K++ +GP+E +F V+DD YKSG + N +
Sbjct: 180 RGIQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENAS---------------------- 217
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
LGGH+++++GWGE+ YWL+ NSWN DWGD GLFKI RG +
Sbjct: 218 --------------YLGGHSVKLIGWGEEYGV--LYWLMVNSWNADWGDKGLFKIRRGTN 261
Query: 294 ECGIESSITAGVP 306
ECG+++S T GVP
Sbjct: 262 ECGVDNSTTGGVP 274
Score = 45.4 bits (106), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 23/32 (71%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+++P FD+R KW C TI E+RDQG CGS W
Sbjct: 23 QEIPIKFDARKKWLRCKTIGEVRDQGHCGSDW 54
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GC+GG+P AW+ + K G+V+GG Y S +
Sbjct: 90 CGDGCSGGYPIRAWKRYKKHGLVTGGNYKSGEG 122
>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
Length = 335
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 101/206 (49%), Gaps = 42/206 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFG 163
GS S +GC+PY I PC V N T P+C + TP C ++C Y + KD ++G
Sbjct: 169 GSYESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYG 228
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
+ +++ I ++ +GP++ F V+DD + Y +G +
Sbjct: 229 VSVDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTGIY------------------- 269
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
++ +G G ++RI+GWG + YWL ANSW WG+N
Sbjct: 270 ------------------VHLTGNKQGHLSVRIIGWGVWQGVP--YWLCANSWGRQWGEN 309
Query: 284 GLFKILRGKDECGIESSITAGVPKLD 309
G F++LRG +ECG+ES+ +G+PKL+
Sbjct: 310 GTFRVLRGTNECGLESNCVSGMPKLN 335
>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
Length = 334
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 79/253 (31%), Positives = 114/253 (45%), Gaps = 28/253 (11%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------------GCRPYEIAPC 123
V+ D P FDSRT W +C I IRDQG+CGSCW G + ++
Sbjct: 81 VENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSP 140
Query: 124 EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF--GAKSYSVSSNEKSIMKEIY 181
E + G P E V D N G Y V + I
Sbjct: 141 EELTFCCKDCGQGCGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRNKQGENIC 200
Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
+ P+E + + ++ IK +D + E +F +DDL
Sbjct: 201 DEQPMERNHQCPKTCYGKTTVQNRYKTKSEYYINSIKTIEQDIKTYGPVEASFDCYDDLS 260
Query: 242 LYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+YKSG K GGH+I+I+GWG+++ + YWL NSW+ WGD+G FKI++G++
Sbjct: 261 VYKSGIYRKSPNAKYKGGHSIKIIGWGQEDGTP--YWLAVNSWSKFWGDHGTFKIIKGRN 318
Query: 294 ECGIESSITAGVP 306
ECGIE ++TAG+P
Sbjct: 319 ECGIERAVTAGIP 331
>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 93/201 (46%), Gaps = 56/201 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC HH G D +C R C + D+ + D + SY ++
Sbjct: 185 GCEPYRVPPCRHHAEGNNSCSDKPMEKNHRCTRMCYGDQDLDFDDDHRYTRDSYYLTYG- 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +
Sbjct: 244 -SIQKDVMNYGPIEAS-------------------------------------------- 258
Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F V+DD YKSG LGGHA++++GWGE+ S YWL+ NSWNTDWGD GL
Sbjct: 259 FDVYDDFPSYKSGVYIRSDNASYLGGHAVKLIGWGEE--SGVPYWLMVNSWNTDWGDKGL 316
Query: 286 FKILRGKDECGIESSITAGVP 306
FKI RG +ECG+++S TAGVP
Sbjct: 317 FKIQRGTNECGVDNSTTAGVP 337
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 24/38 (63%)
Query: 77 YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y + +P FD+R KW C TI +RDQG+CGSCW
Sbjct: 81 YDNLFGRIPKKFDARKKWRKCKTIGAVRDQGNCGSCWA 118
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG+GCNGG+P AW + G+V+GG Y S +
Sbjct: 153 CGYGCNGGYPIKAWERFKSHGLVTGGDYKSGE 184
>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 340
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 120/299 (40%), Gaps = 103/299 (34%)
Query: 75 IGYSEVDEDLPANFDSRTKWPNCPTIREI------------------------------- 103
I ++ ++ ++P +FD+R W NC TIR+I
Sbjct: 80 ISHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRIS 139
Query: 104 -----RDQGSCG---SCW--------------------------GCRPYEIAPCEHHVNG 129
RD SCG C+ GC+PY + C +H
Sbjct: 140 VQLSARDAISCGFSPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPES 199
Query: 130 TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
C+ + P+C ECQ+ Y+ Y D +G + Y+V ++ I KEI +GPV +
Sbjct: 200 RFLDCNNNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIAS 259
Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
+V D ++YKSG ++P + + L
Sbjct: 260 ISVNTDFLVYKSG-VYLPTPRS-----------------------------------RNL 283
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
G +RI+GWG + K YWL ANSWN +WGDNG KI RG IES + A +PK+
Sbjct: 284 GWITLRIIGWGYE--GKIPYWLCANSWNEEWGDNGYVKIQRGVQAGYIESYVRAPIPKM 340
>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
Precursor
gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
Length = 311
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 82/293 (27%), Positives = 119/293 (40%), Gaps = 108/293 (36%)
Query: 73 ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE------------- 119
++ Y + +P +F+++T WPNC TI +I++Q CGSCW E
Sbjct: 68 QIKSYDPLGVQIPTSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNNEN 127
Query: 120 -------IAPCEHHVNG--------------------------TRPSCDASKG------H 140
+ C+ NG T P+C ++ +
Sbjct: 128 VQLSFMDMVTCDETDNGCEGGDAFSAWNWLRKQGAVSEECLPYTIPTCPPAQQPCLNFVN 187
Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
TP C +ECQ N + Y +D + AK YS S+E +IM+EI +GPVE
Sbjct: 188 TPSCTKECQSNSSLIYSQDKHKMAKIYSFDSDE-AIMQEIVTNGPVEAC----------- 235
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHA 253
FTVF+D + YKSG K LGGH
Sbjct: 236 ---------------------------------FTVFEDFLAYKSGVYVHTTGKDLGGHC 262
Query: 254 IRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
++++G+G + Y+ N W T WGDNG F I RG +CGI + AG+P
Sbjct: 263 VKLVGFGT--LNGVDYYAANNQWTTSWGDNGTFLIKRG--DCGISDDVVAGLP 311
>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 382
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 121/295 (41%), Gaps = 107/295 (36%)
Query: 76 GYS-EVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCW-------------------- 113
GY+ E +DLP +FD+RT +PNC I IRDQ +CGSCW
Sbjct: 133 GYAIEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGAF 192
Query: 114 ----------------GC---RPYEIAPCEHHV-----NGTRPSCDASKGHTP------- 142
GC PY H G+RP + P
Sbjct: 193 TELLSAGEMNACTLFFGCGGGDPYSAWSWVHDKGIATGEGSRPKRVSESEAIPVIAYQDI 252
Query: 143 ----KCVRECQE-NYDVPYKKDLNFGAKS----YSVSSNEKSIMKEIYEHGPVEGAFTVF 193
CV +C+ Y + D +F +S YSV+ + +I + GPV +FTV+
Sbjct: 253 YPTPNCVEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRTD----GPVSASFTVY 308
Query: 194 DDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHA 253
+D + YKSG + + SG LGGHA
Sbjct: 309 EDFLAYKSGVY-------------------------------------KHTSGSYLGGHA 331
Query: 254 IRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
++I+GWG EKS + YWL NSWN DWGD GLFKI G CGI+ + G PK+
Sbjct: 332 VKIIGWG--EKSGQAYWLAVNSWNEDWGDKGLFKIALGN--CGIDDDLLGGTPKV 382
>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
Length = 372
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 101/228 (44%), Gaps = 82/228 (35%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY-DVPYKKDLNF---------- 162
GC+PY PC SC+ASK TP C ++CQ Y + YK D F
Sbjct: 173 GCQPYTFPPCS--------SCEASKS-TPSCQKKCQTGYLEATYKNDKRFENEEQDSSYM 223
Query: 163 -------------GAKSYSVSSNEKS----------IMKEIYEHGPVEGAFTVFDDLILY 199
G +Y +S+ S I EIY +GPVE ++ VF+D Y
Sbjct: 224 SENFYQVLIILKGGKSAYRLSTTTSSNKISTDAIITIQTEIYNNGPVEVSYRVFEDFYQY 283
Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
KSG + Y SGK G HA++I+GW
Sbjct: 284 KSGVYH-------------------------------------YVSGKLTGAHAVKIIGW 306
Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
G + +K YWL+ANSW TD+G+ G FKI RG +ECGIE ++ AG+ K
Sbjct: 307 GTE--NKVDYWLVANSWGTDFGEKGFFKIRRGTNECGIEENVVAGLAK 352
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 25/37 (67%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEI 120
+P +FD+R WPNC +I+ IR+Q CG+CW EI
Sbjct: 76 VPISFDARDHWPNCKSIKLIRNQAYCGACWAFGAAEI 112
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 14/28 (50%), Positives = 20/28 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG GC GG+P ++W+ SG+V+GG Y
Sbjct: 142 CGEGCKGGYPLEGLKFWMNSGVVTGGDY 169
>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
Length = 324
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 98/214 (45%), Gaps = 67/214 (31%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKDLNFG--------- 163
GC PY APC + + TP C CQ +Y YKKD ++G
Sbjct: 127 GCMPYSFAPCTK---------NCPESTTPSCKTTCQSSYKTEEYKKDKHYGELVWHSFNR 177
Query: 164 -------AKSYSVSSNEK--SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
A +Y V++ + I EIY +GPVE ++ V++D YKSG +
Sbjct: 178 FQRFLNRASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYH--------- 228
Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
Y SGK +GGHA++I+GWG + + YWLIAN
Sbjct: 229 ----------------------------YTSGKLVGGHAVKIIGWGVE--NGVDYWLIAN 258
Query: 275 SWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
SW T +G+ G FKI RG +EC IE ++ AG+ KL
Sbjct: 259 SWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAKL 292
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
CG+GC GG+ A R+W SG V+GG YG
Sbjct: 96 CGYGCKGGYSIEALRFWASSGAVTGGDYG 124
>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 340
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 90/201 (44%), Gaps = 56/201 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + G + +C R C N D+ Y D F SY ++ +
Sbjct: 185 GCEPYRVPPCPYDAEGHNTCAGKPREKNHRCTRTCYGNQDLDYNDDHRFTRDSYYLTYS- 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +
Sbjct: 244 -SIQKDVMRYGPIEAS-------------------------------------------- 258
Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F ++DD YKSG LGGHA++++GWGE+ YWL+ NSWN WGDNGL
Sbjct: 259 FDMYDDFPSYKSGVYVRSENASYLGGHAVKLIGWGEEHGVL--YWLMVNSWNEGWGDNGL 316
Query: 286 FKILRGKDECGIESSITAGVP 306
FKI RG +ECGI++S T GVP
Sbjct: 317 FKIRRGTNECGIDNSTTGGVP 337
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 26/110 (23%)
Query: 30 IVSGGAYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------- 76
++ Y ++QA +K+ + NI H +W GV+ D N P +++G
Sbjct: 10 VIFVSVYVTEQAYFLQKDFIDNI-NNHATTWKAGVNFDPNTPKEYFLKMLGSKGVQIPDK 68
Query: 77 ------------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y + +P +FD+R KW C TI ++RDQG+CGSCW
Sbjct: 69 HNIHMYKTHDAAYDNLFGRIPKHFDARKKWKRCHTIGKVRDQGNCGSCWA 118
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG+GCNGG+P AW + G+V+GG Y S +
Sbjct: 153 CGYGCNGGYPIKAWESFNNRGLVTGGDYQSGE 184
>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 337
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 92/193 (47%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + +G KC ++C + D+ + KD + Y ++
Sbjct: 183 GCEPYRVPPCPYDKDGKNTCSGQPMESNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY-- 240
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I K++ +GP+E +F V+DD YKSG + N +
Sbjct: 241 RGIQKDVINYGPIETSFDVYDDFPNYKSGIYVKSENAS---------------------- 278
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
LGGH+++++GWGE+ YWL+ NSWN DWGD GLFKI RG +
Sbjct: 279 --------------YLGGHSVKLIGWGEEYGV--LYWLMVNSWNADWGDKGLFKIRRGTN 322
Query: 294 ECGIESSITAGVP 306
EC +++S T GVP
Sbjct: 323 ECRVDNSTTGGVP 335
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 14/83 (16%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED--------LPANFDSR 91
+A NS N P+ H+ +G ++P+ + Y+ D +P FD+R
Sbjct: 40 KAGVNSAPNTPKEHILRLLGSR------GVQIPDKVNYNMYKNDDHADNYQEIPMKFDAR 93
Query: 92 TKWPNCPTIREIRDQGSCGSCWG 114
KW C TI E+RDQG+CGS W
Sbjct: 94 KKWIRCKTIGEVRDQGNCGSDWA 116
Score = 38.9 bits (89), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 21/30 (70%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW+ + G+V+GG Y S
Sbjct: 151 CGNGCNGGYPIRAWKRFKNHGLVTGGNYKS 180
>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 341
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 63/189 (33%), Positives = 94/189 (49%), Gaps = 41/189 (21%)
Query: 115 CRPYEIAPCEHHVN-GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY PC H + C TPKC + Q Y+ Y++D +F +SYS+ +NE
Sbjct: 187 CKPYSFYPCGQHKDVPYYGPCPGGLWPTPKCRKSSQRKYNKTYQEDKHFATRSYSLPNNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+SI +EIY++GPV AF V++D + T + + KW I+
Sbjct: 247 RSIRQEIYKNGPVVAAFKVYEDY------------SSTGGIYVHKWGIQ----------- 283
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
G HA +++GWG + + YWLIANSWNTDWG++G ++I+R D
Sbjct: 284 ---------------TGAHADKVIGWGRENGT--DYWLIANSWNTDWGEDGYYRIVRETD 326
Query: 294 ECGIESSIT 302
C IE +
Sbjct: 327 NCEIERQMV 335
Score = 37.7 bits (86), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 23/35 (65%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG+GC GG+P A+R+ + G+V+GG Y + K
Sbjct: 154 CGYGCQGGWPIEAYRWMQRDGVVTGGKYRQRDVCK 188
>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 393
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 89/266 (33%), Positives = 121/266 (45%), Gaps = 49/266 (18%)
Query: 83 DLPANFDSRTKWPNCPT-IREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRPSCDASKG 139
+LP FD+R + NC T I +RDQ +CGSCW E + +G S G
Sbjct: 126 NLPDRFDAREHFKNCATVIGHVRDQSTCGSCWAFATSEAFSDRLCIRSSGEFDLVPLSAG 185
Query: 140 HTPKCVRECQ-------------------ENYDVPYKKD-----LNFGAKSYSVSSNEKS 175
HT C E + + V + D NF S+ V E
Sbjct: 186 HTAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVSELDSGCWPYNFPECSHHV---ETK 242
Query: 176 IMKEIYEHGPVEGAFTVFDDLIL---YKSGRFFV--PGNETTAMSLIKWTIRDNTSQLGA 230
M+ + P T + ++S R F G + IK I DN
Sbjct: 243 GMEPCKGNSPSPVCSTTCRNHHFKPSFESDRHFTEDEGYSLDEVDEIKKEIIDNGP---V 299
Query: 231 EGAFTVFDDLILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
AFTV++D + YKS G LGGHA++I+GWG D+ E+YWL+ NSWN +WGD
Sbjct: 300 AAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGWGTDQ--NEQYWLVMNSWNVNWGDQ 357
Query: 284 GLFKILRGKDECGIESSITAGVPKLD 309
G+FKI G ECGI+S +TAG+PK +
Sbjct: 358 GIFKIAIG--ECGIDSEVTAGIPKYE 381
>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 114/284 (40%), Gaps = 108/284 (38%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGS----------------------------------- 108
LP FD+RT WP+C TI I DQG
Sbjct: 83 LPIEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGMNLSLSVNDLLAC 142
Query: 109 ----CGS-CWGCRPY--------------EIAPCEHHVNGTRPSCDASKGHTPKCVRECQ 149
CG+ C G P E P + + P C+ TPKC R+C
Sbjct: 143 CGWMCGAGCDGGSPIDAWRYFVQSGVVTEECDPYFDDIGCSHPGCEPGF-PTPKCERKCA 201
Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
+ + + + +F +Y + S+ SIM E+ +GPVE A
Sbjct: 202 DKNKL-WAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEVA-------------------- 240
Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
FTV++D YKSG A+GGHA++++GWG
Sbjct: 241 ------------------------FTVYEDFAHYKSGVYKHITGDAMGGHAVKLIGWGTS 276
Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
E E YWL+AN WN WGD+G FKI RG +ECGIE ++ AG+P
Sbjct: 277 EDG-EDYWLLANQWNRGWGDDGYFKIKRGTNECGIEGAVVAGLP 319
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG P AWRY+V+SG+V+
Sbjct: 146 MCGAGCDGGSPIDAWRYFVQSGVVT 170
>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 414
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 70/212 (33%), Positives = 98/212 (46%), Gaps = 58/212 (27%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKS----Y 167
GC PY+ PC HHVN ++ P C TP C +C Y + D +F +S Y
Sbjct: 244 GCWPYDFPPCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEY 303
Query: 168 SVSSNEKSIMKE-----IYEHGP------VEGAFTVFDDLILYKSGRFFVPGNETTAMSL 216
SV+ + +I + IY P V +F V++D + Y+SG +
Sbjct: 304 SVNDAKNAIRTDGPVGPIYFCDPSVNFDQVSASFIVYEDFLAYRSGVY------------ 351
Query: 217 IKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSW 276
+ SGK LGGHA++I+GWGE+ + + YWL+ NSW
Sbjct: 352 -------------------------KHTSGKELGGHAVKIIGWGEE--TGQAYWLVVNSW 384
Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
N DWGDNGLFKI G C I+ + G PK+
Sbjct: 385 NEDWGDNGLFKIALGN--CEIDDDLLGGTPKV 414
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 25/34 (73%), Gaps = 1/34 (2%)
Query: 82 EDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
+DLP +FD+RT +PNC IR IRDQ CGSCW
Sbjct: 140 QDLPTDFDARTAFPNCSKVIRHIRDQSDCGSCWA 173
>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 93/188 (49%), Gaps = 40/188 (21%)
Query: 119 EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
E P + + P C+ TPKC R+C + + + + +F +Y + S+ SIM
Sbjct: 158 ECDPYFDDIGCSHPGCEPGF-PTPKCERKCADKNKL-WAESKHFSVNAYRIDSDPHSIMA 215
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ +GPVE AFTV++D YKSG +
Sbjct: 216 EVSMNGPVEVAFTVYEDFAHYKSGVY---------------------------------- 241
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
+ +G +GGHA++++GWG + E YWL+AN WN WGD+G FKI RG +ECGIE
Sbjct: 242 ---KHITGDVMGGHAVKLIGWGTSDDG-EDYWLLANQWNRGWGDDGYFKIRRGTNECGIE 297
Query: 299 SSITAGVP 306
+ AG+P
Sbjct: 298 EDVVAGLP 305
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 22/25 (88%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG+P AWRY+V+SG+V+
Sbjct: 132 MCGDGCDGGYPIDAWRYFVQSGVVT 156
>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
Length = 333
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 96/195 (49%), Gaps = 44/195 (22%)
Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
GC+PY + PC + + N T K H +C R C + D+ + D ++ +Y ++
Sbjct: 181 GCQPYRVPPCPLDEYGNNTCHGKPMEKNH--RCTRMCYGDQDLDFNNDHHYTRDAYYLTY 238
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+I ++ +GP+E +F V+DD YKSG +
Sbjct: 239 G--TIQNDVLTYGPIEASFEVYDDFPSYKSGVY--------------------------- 269
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ ++ LGGHA++++GWGE+ YWL+ NSWN WGD GLFKI RG
Sbjct: 270 ---------VKTENASYLGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRG 318
Query: 292 KDECGIESSITAGVP 306
+ECGI++S T GVP
Sbjct: 319 TNECGIDNSTTGGVP 333
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 25/33 (75%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
+ +P+NFD+R KW C +I E+RDQG CGSCW
Sbjct: 82 QRIPSNFDARKKWKKCLSIGEVRDQGHCGSCWA 114
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 23/33 (69%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGCNGG+P AW + K G+V+GG Y S +
Sbjct: 149 CGFGCNGGYPIRAWERFRKHGLVTGGNYDSYEG 181
>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
Length = 348
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/203 (31%), Positives = 98/203 (48%), Gaps = 49/203 (24%)
Query: 113 WGCRPYE-IAPCEHHV---------NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
+GC+PY+ P H+ N T TP+C R C Y Y D +
Sbjct: 180 YGCKPYKPTGPIGRHLKRNDYAPCPNDTYYGECVGMADTPRCKRRCLLGYPKSYPSDRYY 239
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G +Y V + K+I +EI ++GPV +F V++D YKSG
Sbjct: 240 GKSAYIVKQSVKAIQREIMKNGPVVASFAVYEDFRHYKSG-------------------- 279
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
+ + +G+ G HA++I+GWG++ + +WLIANSW+ DWG+
Sbjct: 280 -----------------IYKHTAGELRGYHAVKIIGWGKENNTD--FWLIANSWHQDWGE 320
Query: 283 NGLFKILRGKDECGIESSITAGV 305
G F+I+RGK+ECGIE+ + AG+
Sbjct: 321 KGYFRIVRGKNECGIETDVVAGI 343
Score = 38.9 bits (89), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 20/26 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGG 34
CG+GCNGGFP AWR++ +G +GG
Sbjct: 149 CGYGCNGGFPIEAWRHFTVAGNCTGG 174
>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
Length = 325
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 79/160 (49%), Gaps = 40/160 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PYE PCEHH G P CD TP C R CQ Y+V Y+ D +G Y V SN+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDGDV-ETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++IMKE+ +HGPVE F V+ D YKSG +
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVY----------------------------- 281
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
+ SG LGGHA+R+LGWGE+ + YWLIA
Sbjct: 282 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIA 311
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 38/62 (61%), Gaps = 3/62 (4%)
Query: 54 LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
++ +G PD N +L L GY +LP +FD+R +W +CP+I EIRDQ SCGSC
Sbjct: 66 IRRMLGALPDPN--GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSC 123
Query: 113 WG 114
W
Sbjct: 124 WA 125
Score = 45.4 bits (106), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 20/30 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW YW GIV+G Y +
Sbjct: 160 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNT 189
>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 92/191 (48%), Gaps = 39/191 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPY I PC HH N T TP C +EC+ Y+ D +G +Y V + K
Sbjct: 185 CRPYPIHPCGHHGNDTYYGECRGTAPTPPCKKECRPGVRKVYRIDKRYGKDAYIVKQSVK 244
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I EI +GPV +F V Y+ R + G
Sbjct: 245 AIQSEILRNGPVVASFAV------YEDFRHYKSG-------------------------- 272
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ + +G+ G HA++++GWG + + +WLIANSW+ DWG+ G F+I+RG ++
Sbjct: 273 -----IYKHTAGELRGYHAVKMIGWGNENNTD--FWLIANSWHNDWGEKGYFRIIRGTND 325
Query: 295 CGIESSITAGV 305
CGIE +I AG+
Sbjct: 326 CGIEGTIAAGI 336
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GG+P AW+Y++ G+VSGG Y +K
Sbjct: 152 CGDGCEGGWPIEAWKYFIYDGVVSGGEYLTK 182
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D+P ++D R W NC T IRDQ +CGSCW
Sbjct: 86 DIPPSYDPRDVWKNCTTFY-IRDQANCGSCWA 116
>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
Length = 279
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 119/299 (39%), Gaps = 103/299 (34%)
Query: 75 IGYSEVDEDLPANFDSRTKWPNCPTIREI------------------------------- 103
I ++ ++ ++P +FD+R W NC TIR+I
Sbjct: 19 ISHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRIS 78
Query: 104 -----RDQGSCG---SCW--------------------------GCRPYEIAPCEHHVNG 129
RD SCG C+ GC+PY + C +H
Sbjct: 79 VQLSARDAISCGFSPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPES 138
Query: 130 TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
C+ + P+C ECQ+ Y+ Y D +G + Y+V ++ I KEI +GPV +
Sbjct: 139 RFLDCNNNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIAS 198
Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
+V D ++YKSG ++P + + L
Sbjct: 199 ISVNTDFLVYKSG-VYLPTPRS-----------------------------------RNL 222
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
G +RI+GWG + K YWL ANSWN +WG NG KI RG IES + A +PK+
Sbjct: 223 GWITLRIIGWGYE--GKIPYWLCANSWNEEWGANGYVKIQRGVQAGYIESYVRAPIPKM 279
>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
Length = 313
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/171 (37%), Positives = 84/171 (49%), Gaps = 40/171 (23%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y C+HHV G P C TP+CV++C + ++ Y +D SY++ ++E
Sbjct: 183 GCRSYPFPKCDHHVQGHYPPCPRQIYPTPECVQDC-DTPELGYLEDKTRANISYNIYASE 241
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SIMKEI GPVE FTV++D + YKS +F
Sbjct: 242 ISIMKEIMLRGPVEAVFTVYEDFLQYKSRVYF---------------------------- 273
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
+ G + GHAIRILGWGE+ YWLIANSWN DWG+ G
Sbjct: 274 ---------HAWGAPMSGHAIRILGWGEE--GDVPYWLIANSWNEDWGEKG 313
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 22/34 (64%), Positives = 27/34 (79%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D LP NFD+R+KWP+C ++ EIRDQ SCGSCW
Sbjct: 83 DTRLPKNFDARSKWPHCSSVSEIRDQSSCGSCWA 116
Score = 45.4 bits (106), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGC GG+P +AW YW GIV+GG+
Sbjct: 151 CGFGCRGGYPAVAWDYWRTHGIVTGGS 177
>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
Length = 342
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 92/191 (48%), Gaps = 39/191 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPY I PC HH N T TP C R+C+ Y+ D +G +Y V + K
Sbjct: 185 CRPYPIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVK 244
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I EI +GPV +F V Y+ R + G
Sbjct: 245 AIQSEILRNGPVVASFAV------YEDFRHYKSG-------------------------- 272
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ + +G+ G HA++++GWG + + +WLIANSW+ DWG+ G F+I+RG ++
Sbjct: 273 -----IYKHTAGELRGYHAVKMIGWGNENNTD--FWLIANSWHNDWGEKGYFRIIRGTND 325
Query: 295 CGIESSITAGV 305
CGIE +I AG+
Sbjct: 326 CGIEGTIAAGI 336
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GG+P AW+Y++ G+VSGG Y +K
Sbjct: 152 CGDGCEGGWPIEAWKYFIYDGVVSGGEYLTK 182
>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
Precursor
gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
Length = 342
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 93/191 (48%), Gaps = 39/191 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPY I PC HH N T TP C R+C+ Y+ D +G +Y V + K
Sbjct: 185 CRPYPIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVK 244
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I EI ++GPV +F V Y+ R + G
Sbjct: 245 AIQSEILKNGPVVASFAV------YEDFRHYKSG-------------------------- 272
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ + +G+ G HA++++GWG + + +WLIANSW+ DWG+ G F+I+RG ++
Sbjct: 273 -----IYKHTAGELRGYHAVKMIGWGNENNTD--FWLIANSWHNDWGEKGYFRIVRGSND 325
Query: 295 CGIESSITAGV 305
CGIE +I AG+
Sbjct: 326 CGIEGTIAAGI 336
Score = 41.6 bits (96), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GG+P AW+Y++ G+VSGG Y +K
Sbjct: 152 CGDGCEGGWPIEAWKYFIYDGVVSGGEYLTK 182
>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 174
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 98/202 (48%), Gaps = 61/202 (30%)
Query: 115 CRPYEIAPCEHHVNGTRP---SC-DASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
CRPYE PC H G P C D +K TPKC + CQ Y YK+D +FG +Y +
Sbjct: 22 CRPYEFPPCGRH--GKEPYYGECYDTAK--TPKCQKTCQRGYLKAYKEDKHFGKSAYRLP 77
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+N K+I ++I ++GPV F
Sbjct: 78 NNVKAIQRDIMKNGPVVAGFI--------------------------------------- 98
Query: 231 EGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
V++D YKSG + GGHA++I+GWG+++ + YWLIANSW+ DWG+
Sbjct: 99 -----VYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWGKEKGTP--YWLIANSWHDDWGEK 151
Query: 284 GLFKILRGKDECGIESSITAGV 305
G ++++RG + C IE + AG+
Sbjct: 152 GFYRMIRGINNCRIEEMVFAGI 173
>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
Length = 327
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/322 (28%), Positives = 123/322 (38%), Gaps = 112/322 (34%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDE-DLPANFDSRTKWP 95
G + A SN K +GV P +P L S LP NFD+RT W
Sbjct: 56 GWEAAINPRFSNYTVEQFKRLLGVKP---MPKKELRSTPAISHPKTLKLPKNFDARTAWS 112
Query: 96 NCPTIREIRDQGS---------------------------------------CGS-CWGC 115
C TI I DQG CGS C G
Sbjct: 113 QCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGG 172
Query: 116 RPY--------------EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLN 161
P E P + + P C+ + TPKCV++C V +KK +
Sbjct: 173 YPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY-RTPKCVKKCVSGNQV-WKKSKH 230
Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
+ +Y V+S+ IM E+Y++GPVE A
Sbjct: 231 YSVSAYRVNSDPHDIMAEVYKNGPVEVA-------------------------------- 258
Query: 222 RDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIAN 274
FTV++D YKSG LGGHA++++GWG + E YWL+AN
Sbjct: 259 ------------FTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDG-EDYWLLAN 305
Query: 275 SWNTDWGDNGLFKILRGKDECG 296
WN +WGD+G FKI RG +ECG
Sbjct: 306 QWNREWGDDGYFKIRRGTNECG 327
Score = 37.4 bits (85), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 18/25 (72%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GC+GG+P AWRY G+V+
Sbjct: 164 LCGSGCDGGYPLYAWRYLAHHGVVT 188
>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
Length = 134
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/174 (37%), Positives = 84/174 (48%), Gaps = 54/174 (31%)
Query: 144 CVRECQENYDVPYKKDLNFGAKSYSVSSN-EKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
C + C+ Y YK+D ++G SYSVS + + ++GPVE A
Sbjct: 1 CSKICEPGYSPSYKEDKHYGCSSYSVSRGARRRSWQRSSKNGPVEAA------------- 47
Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIR 255
FTV+ D + YKSG +GGHA+R
Sbjct: 48 -------------------------------FTVYSDFLQYKSGVYQHVAGDMMGGHAVR 76
Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
ILGWG + + YWL+ NSWNTDWGDNG FKILRG+D CGIES I AG+P D
Sbjct: 77 ILGWGVENGTP--YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPCTD 128
>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
Length = 335
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 93/193 (48%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY++ PC G KC R C + YKK +Y + N
Sbjct: 181 GCQPYKVPPCVKDEEGHNSCSGQPTEPNHKCSRSCYGDKTCDYKKGHYKTKNAYYL--NI 238
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++ K+ +GP+E +F V+DD + Y+SG + + T
Sbjct: 239 DTMQKDTIAYGPIEASFDVYDDFVNYESGVY-----QKT--------------------- 272
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ K LGGHA++++GWGE++ + YWL+ NSW WG NG+FKILRG +
Sbjct: 273 ----------EDAKYLGGHAVKMIGWGEEDGT--PYWLMVNSWGEQWGANGMFKILRGTN 320
Query: 294 ECGIESSITAGVP 306
ECGIE S TAGVP
Sbjct: 321 ECGIEGSPTAGVP 333
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+A++N + + + +G ++P + + E D ++P FD+R +W +C T
Sbjct: 40 KAKQNFPEYMTKEQIVRLLGSKNLTSVPKSLIKENDSEYINDSEIPNFFDARIQWSHCKT 99
Query: 100 IREIRDQGSCGSCWG 114
I E+R+QG+CGSCW
Sbjct: 100 IGEVRNQGNCGSCWA 114
>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 339
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 119/265 (44%), Gaps = 34/265 (12%)
Query: 67 PANRLP---ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPC 123
PAN L E + + LP FD+R W +C TI I DQG CGSCW E
Sbjct: 75 PANELEPSIERVTHKHKKLVLPKEFDARKHWGHCSTIGAILDQGHCGSCWAFGAAESLTD 134
Query: 124 EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYK-KDLNFGAKSYSVSSNEKSIMKEIYE 182
++ + C EC + D Y + + ++ V+S +I
Sbjct: 135 RFCIHMNESVSLSENDLLACCGFECGDGCDGGYPIRAWRYFKRTGVVTSKCDPYFDQIGC 194
Query: 183 HGPVEGAFTVF----------DDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
P G + + DD + KS V E + D ++L G
Sbjct: 195 GHP--GCYPTYRTPKCVKHCVDDELWVKSKHLSVNAYEVSKEP------EDLMAELYTNG 246
Query: 233 ----AFTVFDDLILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
+F VF+D YK+ G+ +GGHA++++GWG + + YW I NSWNT+WG
Sbjct: 247 PIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVD-YWTIVNSWNTNWG 305
Query: 282 DNGLFKILRGKDECGIESSITAGVP 306
++GLF+I RG +ECGIES AG+P
Sbjct: 306 EHGLFRIARGGNECGIESYAVAGLP 330
>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
Length = 343
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 101/205 (49%), Gaps = 49/205 (23%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G GS GC+P+ IAP P+ ++ TP C +C +Y KD +G
Sbjct: 184 GPYGSKSGCKPFSIAP---------PTSSSTAAQTPLCQLKCISDYKRKLDKDRYYGESY 234
Query: 167 YSVSSNE---KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
Y ++S+ K+I +EI +HGPV A +F+ + YKS
Sbjct: 235 YLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLYYKS---------------------- 272
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
G A DD +LG HA++++GWGE ++ YWL+ NSWNT +G+
Sbjct: 273 -----GVYSANKRNDD-------PSLGLHAVKLIGWGEQKRIP--YWLVVNSWNTTFGEQ 318
Query: 284 GLFKILRGKDECGIES-SITAGVPK 307
GLFKI RG +ECGIE+ +TAG+ +
Sbjct: 319 GLFKIRRGTNECGIENLHVTAGLAE 343
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 26/31 (83%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG+GCNGGFP +A++YW + G+ +GG YGSK
Sbjct: 159 CGYGCNGGFPLLAFKYWNEIGVPTGGPYGSK 189
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 21/34 (61%), Gaps = 2/34 (5%)
Query: 82 EDLPA--NFDSRTKWPNCPTIREIRDQGSCGSCW 113
E LP +FD+R KWP C I I+DQ +C CW
Sbjct: 56 ESLPLEEHFDAREKWPECKYIGFIKDQSTCSCCW 89
>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
Length = 333
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 92/194 (47%), Gaps = 44/194 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY PC + SC KC ++C N + Y+ D + +S V + +
Sbjct: 181 GCQPYMFPPCTGN-----NSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++ +I +GP+E +F V+DD I YKSG +F N T
Sbjct: 236 -NMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNAT---------------------- 272
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
LGGH+++ +GWG + YWL+ NSWN+ WGD G FKI RG +
Sbjct: 273 --------------YLGGHSVKCIGWGVERNV--SYWLMMNSWNSTWGDGGYFKIRRGTN 316
Query: 294 ECGIESSITAGVPK 307
EC +E S TAGVP+
Sbjct: 317 ECQVEDSSTAGVPE 330
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GC GG+P AWRY+ K G+V+GG + S
Sbjct: 149 CGLGCQGGYPIRAWRYYSKHGLVTGGNFNS 178
>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 379
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 91/351 (25%), Positives = 133/351 (37%), Gaps = 130/351 (37%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
G K A + +N A K +GV +P I ++ LP FD+RT W +
Sbjct: 58 GWKAAFNDRFANATVAEFKRLLGVIQTPKTAYLGVP--IVRHDLSLKLPKEFDARTAWSH 115
Query: 97 CPTIREIRD--------------------QGSCGSCWG-----------CRPYEI----- 120
C +IR I G CGSCW C Y +
Sbjct: 116 CTSIRRILVGYILNNVLLWSTITLWFWFLLGHCGSCWAFGAVESLSDRFCIKYNLNVSLS 175
Query: 121 -----------------------------------APCEHHVNGT---RPSCDASKGHTP 142
C+ + + T P C+ + TP
Sbjct: 176 ANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTY-PTP 234
Query: 143 KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
KC R+C + + + ++G +Y ++ + + IM E+Y++GPVE A
Sbjct: 235 KCERKCVSRNQL-WGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVA------------- 280
Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIR 255
FTV++D YKSG +GGHA++
Sbjct: 281 -------------------------------FTVYEDFAHYKSGVYKYITGTKIGGHAVK 309
Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
++GWG + E YWL+AN WN WGD+G FKI RG +ECGIE S+ AG+P
Sbjct: 310 LIGWGTSDDG-EDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLP 359
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 19/25 (76%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCGFGCNGGFP AW Y+ G+V+
Sbjct: 186 LCGFGCNGGFPMGAWLYFKYHGVVT 210
>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 96/198 (48%), Gaps = 42/198 (21%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFG--AKSYSVSSN 172
C PY + PC H N T TP C R+CQ + Y+ D +G ++Y++ +
Sbjct: 188 CSPYPLHPCGRHGNDTFYGNCVGMAPTPPCKRKCQPGFRGMYRVDKRYGEPGRTYTLPRS 247
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
E I ++I E G V F V++D Y+SG
Sbjct: 248 EVKIRRDIKERGSVVAVFAVYEDFSHYQSG------------------------------ 277
Query: 233 AFTVFDDLILYKSGKALGG-HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ + +G+ GG HA++++GWG+D + YWLIANSW+ DWG+NG F+++RG
Sbjct: 278 -------IYKHTAGRFTGGYHAVKMIGWGKDNGTD--YWLIANSWHDDWGENGFFRMIRG 328
Query: 292 KDECGIESSITAGVPKLD 309
+ CGIE + AG+ ++
Sbjct: 329 INNCGIEEQVDAGIVDVE 346
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 27/50 (54%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
N N P + ++ DLP N+D R W NC + IRDQ +CGSCW
Sbjct: 70 NANQNLNPVVNDDNDTGADLPENYDPRIVWKNCSSFHTIRDQANCGSCWA 119
Score = 38.1 bits (87), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 21/31 (67%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GG+P AW+++ G+VSGG Y K
Sbjct: 155 CGLGCRGGWPIEAWKFFEYDGVVSGGPYLGK 185
>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 109/262 (41%), Gaps = 88/262 (33%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GC-----RPY 118
++D LP NFDSR +WP I +RDQ SCGSCW GC P
Sbjct: 58 DLDNALPENFDSREQWPG--KILPVRDQASCGSCWAFSVAETMGDRLSIKGCDFGDMSPQ 115
Query: 119 EIAPCEHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYK 157
++ C+ G T C + G P C +C +
Sbjct: 116 DLVSCDTTDMGCNGGYMDHAWAWTKSHGITTEKCMPYQSGSGRVPACPAKCVNGSAIVRN 175
Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
K +++ N + +M+E+YE+GP+ AFTV+ D + YKSG +
Sbjct: 176 KSVSYKKL------NAQQMMEELYENGPISVAFTVYYDFMNYKSGVY------------- 216
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
++K+G GGHA+ +GWG ++ + YWL NSW
Sbjct: 217 ------------------------VHKTGGIAGGHAVLCVGWGVEDNTP--YWLCQNSWG 250
Query: 278 TDWGDNGLFKILRGKDECGIES 299
WG+ G FKILRG + CGIE+
Sbjct: 251 PAWGEKGHFKILRGSNHCGIEN 272
>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
Length = 340
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 90/201 (44%), Gaps = 56/201 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + +G +C R C + D+ + +D + SY ++
Sbjct: 185 GCEPYRVPPCPYDESGNNTCAGKPMEANHRCTRMCYGDQDLDFDEDHRYTRDSYYLTYG- 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GPVE +
Sbjct: 244 -SIQKDVLTYGPVEAS-------------------------------------------- 258
Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F V+DD YKSG LGGHA +++GWGE+ YWL+ NSWN DWGDNGL
Sbjct: 259 FDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWGEEYGVP--YWLMVNSWNADWGDNGL 316
Query: 286 FKILRGKDECGIESSITAGVP 306
FKI RG +ECGI++S T GVP
Sbjct: 317 FKIQRGTNECGIDNSTTGGVP 337
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 53/116 (45%), Gaps = 24/116 (20%)
Query: 23 RYWVKSGIVSGGAYGSKQA---EKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIG--- 76
R ++ ++ Y ++QA E++ ++ I GV+ D P + +L+G
Sbjct: 3 RVFILLSVILFSVYMTEQAYFLEEDYINKINEQATTWKAGVNFDPKTPKEHILKLLGSKG 62
Query: 77 -----------YSEVDED-------LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y DE+ +P FD+R KW NC TI IRDQG+CGSCW
Sbjct: 63 VQIPSKLNHKMYKSEDENYDNLFGRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWA 118
Score = 44.3 bits (103), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGG+P AW ++ K G+V+GG Y S +
Sbjct: 153 CGFGCNGGYPIKAWEHFKKHGLVTGGDYKSGE 184
>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
Length = 345
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 88/289 (30%), Positives = 125/289 (43%), Gaps = 79/289 (27%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
NR P + + D+D+P +FD+RT W NC ++R IRDQ + C C A
Sbjct: 79 NRKPVVENADDEDDDIPESFDARTHWANCTSLRHIRDQAN---CGSCWAVSTASAL---- 131
Query: 129 GTRPSCDASKGHTP---------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKE 179
+ C ASKG T C + C Y D + +++ S +
Sbjct: 132 -SDRICIASKGETQLHISSIDIVSCCKLC------GYGCDGGWPIEAFDYFSRQ------ 178
Query: 180 IYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSL-------------IKWTIRDNTS 226
G V G T D Y + GN+T + +K R++T
Sbjct: 179 ----GAVTGETTSKDGCRPYPFHPLWTYGNDTVGRRMSGRCKHSKTVGEGVKRVTRNHTR 234
Query: 227 QLG---------------AEG---------AFTVFDDLILYK-------SGKALGGHAIR 255
+ G +EG FTV++D YK +GKA G HAI+
Sbjct: 235 RTGLTARRLRITEFCQSHSEGDHGNGPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHAIK 294
Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
I+GWG + + YWLIANSW+ DWG+ GLF+I+RG +ECGIE + AG
Sbjct: 295 IIGWGVE--NGLPYWLIANSWHDDWGEQGLFRIVRGINECGIEQEVVAG 341
>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 109/262 (41%), Gaps = 88/262 (33%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GC-----RPY 118
++D LP NFDSR +WP I +RDQ SCGSCW GC P
Sbjct: 58 DLDNALPENFDSREQWPG--KILPVRDQASCGSCWAFSVAETMGDRLSIKGCDYGDMAPQ 115
Query: 119 EIAPCEHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYK 157
++ C+ G T C + G P C +C +
Sbjct: 116 DLVSCDTTDMGCNGGYMDHAWAWTKSHGVTTEKCMPYQSGSGRVPACPAKCVNGSAIVRN 175
Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
K +++ N + +M+E+YE+GP+ AFTV+ D + YKSG +
Sbjct: 176 KSVSYK------KLNAQQMMEELYENGPISVAFTVYYDFMNYKSGVY------------- 216
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
++K+G GGHA+ +GWG ++ + YWL NSW
Sbjct: 217 ------------------------VHKTGGIAGGHAVLCVGWGVEDNTP--YWLCQNSWG 250
Query: 278 TDWGDNGLFKILRGKDECGIES 299
WG+ G FKILRG + CGIE+
Sbjct: 251 PAWGEKGHFKILRGSNHCGIEN 272
>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
Length = 208
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 100/200 (50%), Gaps = 61/200 (30%)
Query: 115 CRPY-EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C PY + C+H P C+ + TPKC ++C+E V +++ +F +Y ++S+
Sbjct: 44 CDPYFDPVGCKH------PGCEPAY-PTPKCEKKCKEQNQV-WQEKKHFSIDAYRINSDP 95
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
IM E+Y++GPVE A
Sbjct: 96 HDIMAEVYKNGPVEVA-------------------------------------------- 111
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D YKSG +GGHA++++GWG + + E YWL+AN WN WGD+G F
Sbjct: 112 FTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYF 170
Query: 287 KILRGKDECGIESSITAGVP 306
KI+RGK+ECGIE + AG+P
Sbjct: 171 KIIRGKNECGIEEGVVAGMP 190
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 22/25 (88%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG+P AWRY+V++G+V+
Sbjct: 17 MCGDGCDGGYPIEAWRYFVQNGVVT 41
>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
Length = 236
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 74/237 (31%), Positives = 107/237 (45%), Gaps = 26/237 (10%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGH--- 140
+PA FDSRTKWP+C + IR+Q CGSCW E+ + C AS G
Sbjct: 14 VPA-FDSRTKWPHC--VHPIRNQEQCGSCWAFSASEVL--------SDRFCIASGGKVDV 62
Query: 141 --TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLIL 198
+P+ + C Y D + +++ + + + G
Sbjct: 63 VLSPQYMVSCDS---TDYGCDGGYLNNAWAFLAGTGIPSDKCAPYTSQNGDVAACPSKCQ 119
Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGG 251
S ++ I + D + AF+V+ D + YKSG LGG
Sbjct: 120 DGSSVKLYKAKNPQQLNDIPSIMEDMQQNGPVQAAFSVYRDFMSYKSGVYHHVSGSLLGG 179
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
HAI+++GWG D + + YW+IANSW WG NG F ILRG DECGIE ++ +G +L
Sbjct: 180 HAIKMVGWGVDSATNKPYWIIANSWGPSWGLNGFFWILRGSDECGIEDNVWSGQAQL 236
>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
Length = 360
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 115/305 (37%), Gaps = 100/305 (32%)
Query: 57 WMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG- 114
++G+HPD N PE+ +P FD+R WP C I IR+QG C S W
Sbjct: 49 FLGIHPDPNFK----PEIKEPQATQNVIPETFDAREYWPECADIIGNIRNQGKCSSSWAF 104
Query: 115 ---------------------CRPYEIAPCEHHV-------------------------- 127
P ++ C H+
Sbjct: 105 AAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHYCGNQCKGGYTYYAWNYFMLTGLVSGGD 164
Query: 128 ----NGTRPSCDASKGH-TPKCVRECQ-ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
G +P + + TP C CQ + Y +PY D +FG Y + NE +I EI
Sbjct: 165 YNTSTGCQPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEIL 224
Query: 182 EHG-PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
G PV AF V+ D +Y+ D +
Sbjct: 225 SGGGPVVAAFDVYGDFKIYR-------------------------------------DGV 247
Query: 241 ILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD-NGLFKILRGKDECGIES 299
+Y SG G A++I+GWG + + YWL ANSW DWG G FKI RG +ECG E
Sbjct: 248 YIYTSGALFGRTAVKIIGWGTE--NGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFEE 305
Query: 300 SITAG 304
SI AG
Sbjct: 306 SIIAG 310
>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 323
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 64/200 (32%), Positives = 99/200 (49%), Gaps = 45/200 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVS- 170
GC+PY+ PC+H+ + +C + + C ++C +NY V Y+ DL+ + Y S
Sbjct: 162 GCQPYKNRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSW 221
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+N K I +EI HGPV V+++ + YK G
Sbjct: 222 TNVKQIQQEIMTHGPVTAFMYVYENFMGYKEG---------------------------- 253
Query: 231 EGAFTVFDDLILYKS--GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
+YKS G+ +G H ++++GWG D E YWL NSWN++WG++GLFKI
Sbjct: 254 -----------IYKSTTGELIGYHHVKLIGWGVDGDGTE-YWLAMNSWNSNWGNDGLFKI 301
Query: 289 LRGKDECGIESSITAGVPKL 308
LRG + C IE + AG+ +
Sbjct: 302 LRGYNFCSIELLVMAGIVDV 321
>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
Length = 330
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 123/315 (39%), Gaps = 114/315 (36%)
Query: 56 SWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGC 115
S +G+ D + + +P + S +DLP +F+ WPN + IRDQ CGSCW
Sbjct: 68 SMLGLRLDRDY--SEVPVKVHSSTALKDLPESFNCYENWPN--YMHPIRDQARCGSCWAF 123
Query: 116 RPYEIAPCEHHV--NGT---------RPSCD----------------------------- 135
E+ + NGT SCD
Sbjct: 124 AASEVLSDRFAIASNGTVNKILSPEDLVSCDKGDMGCQGGYLDKAWDYLKTNGIVTESCF 183
Query: 136 ---ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
A KG P C C + PYKK + A Y + E+ IMKEIY +GPVE
Sbjct: 184 PYAAQKGVAPSCRISCVDGE--PYKK---YKASDYYQLTTEEDIMKEIYLNGPVEAG--- 235
Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG----KA 248
F V+ + YKSG +
Sbjct: 236 -----------------------------------------FRVYTSFMSYKSGVYHHRI 254
Query: 249 L----GGHAIRILGWGEDEKSK-----EKYWLIANSWNTDWGDNGLFKILRGKD-----E 294
L GGHAI+I+GWG + + KYW+ ANSW DWG NG FKI RGK+ E
Sbjct: 255 LDIMEGGHAIKIVGWGVEPPKRFWQKPTKYWICANSWTADWGMNGFFKIRRGKNRFGQSE 314
Query: 295 CGIESSITAGVPKLD 309
CGIE + AG PKLD
Sbjct: 315 CGIEDQVFAGHPKLD 329
>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
Length = 356
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 100/203 (49%), Gaps = 45/203 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVSSN 172
GC+PY APC + C SK TP C +CQ Y V YK D ++G V+
Sbjct: 167 GCKPYSFAPCSN--------CVESKT-TPSCQSKCQSTYTVTNYKGDKHYGKNEGKVT-- 215
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
E ++H A+ + + A+ +I+ I N E
Sbjct: 216 ------ERHKHLECTSAYRL---------------DTSSNAVPIIQNEIYQNGP---VEV 251
Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
A+TV+DD YKSG K GGHA++I+GWG ++ YWL+ NSW T +GD G
Sbjct: 252 AYTVYDDFYHYKSGVYHHVTGKDTGGHAVKIIGWGTEKGVD--YWLVTNSWGTSFGDKGF 309
Query: 286 FKILRGKDECGIESSITAGVPKL 308
FKI RG +ECGIES++ AG+ K+
Sbjct: 310 FKIRRGTNECGIESNVVAGMAKV 332
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 25/37 (67%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEI 120
+P FD+RT WP C +I+ +RDQ +CGSCW E+
Sbjct: 70 IPTTFDARTNWPKCNSIKMVRDQSNCGSCWAFGAAEV 106
>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 337
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 91/193 (47%), Gaps = 43/193 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + P + + A C R C N + + D + Y ++
Sbjct: 185 GCEPYRVPPSNDGNSSSSDQPLAIN---HICRRHCYGNQSIDFNDDHRYTRDYYYLTYG- 240
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +F V+DD YKSG + N +
Sbjct: 241 -SIQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNAS---------------------- 277
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
LGGHA++++GWGE++ + YWL+ NSWNT WGDNG FKI RG +
Sbjct: 278 --------------YLGGHAVKLIGWGEEDGT--PYWLMVNSWNTQWGDNGFFKIRRGTN 321
Query: 294 ECGIESSITAGVP 306
ECG+++S TAGVP
Sbjct: 322 ECGVDNSTTAGVP 334
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 55/116 (47%), Gaps = 24/116 (20%)
Query: 23 RYWVKSGIVSGGAYGSKQA---EKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIG--- 76
R ++ ++ Y ++QA +++ ++NI G++ D N P + + +L+G
Sbjct: 3 RVFMLLSVIFVSVYATEQAYFLQEDFINNINEQATTWKAGMNFDPNTPHDDIIKLLGSRG 62
Query: 77 -----------YSEVDE-------DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y DE +P +FD+R KW C TI +RDQG+CGSCW
Sbjct: 63 VQNPDKVNHKLYKTHDEAYDNLFGRIPEHFDARNKWVYCDTIGRVRDQGNCGSCWA 118
Score = 40.8 bits (94), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGC+GG+P AW+ + G+V+GG Y S +
Sbjct: 153 CGFGCHGGYPIKAWKRFSTHGLVTGGDYNSGE 184
>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
marinkellei]
Length = 333
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 115/296 (38%), Gaps = 102/296 (34%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTI------------------------------- 100
P +E+ L FD+ WPNCPTI
Sbjct: 80 PRQFSEAELRVRLEDKFDAAEAWPNCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLGGV 139
Query: 101 REIR----DQGSCGSCWG------------------------CRPYEIAPCEHHVNGTRP 132
R++R D SC G C+PY C HHVN +
Sbjct: 140 RDLRISAGDLMSCCDVCGYGCNGGFPEVAWVFYVVHGLVSEYCQPYPFPSCAHHVNSSDL 199
Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
+ + TPKC C E +P + G SY V S E+ +E+ +GP E AF V
Sbjct: 200 APCSGDYKTPKCNSTCTEK-KIPLIRYR--GNHSY-VLSGEEHFKRELLLNGPFEVAFEV 255
Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
+ D + Y G + + +G LGGH
Sbjct: 256 YADFMAYTGGVY-------------------------------------KHVAGDLLGGH 278
Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
A+R++GWGE + E YW IANSWN +WG NG F I RG +ECGIES+ AG P++
Sbjct: 279 AVRLVGWGE--LNGEPYWKIANSWNHEWGMNGYFLIARGVNECGIESNGVAGTPRI 332
Score = 40.8 bits (94), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG+GCNGGFP +AW ++V G+VS
Sbjct: 155 VCGYGCNGGFPEVAWVFYVVHGLVS 179
>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 278
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 89/204 (43%), Gaps = 57/204 (27%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKSYSVSS 171
GC PY+ PC HHVN ++ P C TP C +C Y + D +F +S
Sbjct: 123 GCWPYDFPPCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFMVESSPYQY 182
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ I GPV +
Sbjct: 183 SVNDAKNAIRTDGPVSAS------------------------------------------ 200
Query: 232 GAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
FTV++D + YKSG + LGGHA++I+GWGE+ S + YWL+ NSWN DWGD+G
Sbjct: 201 --FTVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEE--SGQAYWLVVNSWNEDWGDHG 256
Query: 285 LFKILRGKDECGIESSITAGVPKL 308
LFKI G CGI+ + G PK+
Sbjct: 257 LFKIALGN--CGIDDYLLGGTPKV 278
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/34 (58%), Positives = 25/34 (73%), Gaps = 1/34 (2%)
Query: 82 EDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
+DLP +FD+RT +PNC I IRDQ +CGSCW
Sbjct: 19 QDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWA 52
>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
Length = 253
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 103/217 (47%), Gaps = 50/217 (23%)
Query: 99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYK 157
+ I D G+ G GC Y++ PC HHVN ++ P+C + PKC R+C E+ D +
Sbjct: 70 ALSGIVDGGNYGDKSGCWSYQLEPCAHHVNSSKYPAC-PDEVRAPKCARKC-ESEDKDWT 127
Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
K G K YSV + G +EG + +Y++G
Sbjct: 128 KAKVKGEKGYSVC-----------QQGELEGTCAIKMAADIYQNGPI------------- 163
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKY 269
G F V D + YKSG LGGHAI+I+G+G ++ + Y
Sbjct: 164 -------------TGMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIMGFGTEDG--KDY 208
Query: 270 WLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
WL+ANSWN DWGD+G FKI+RGK+ C IE + G P
Sbjct: 209 WLVANSWNEDWGDDGYFKIIRGKNACQIEDPVINGGP 245
Score = 40.4 bits (93), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 20/33 (60%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
+L GCNGG P + YW SGIV GG YG K
Sbjct: 51 KLGDMGCNGGIPSSVYSYWALSGIVDGGNYGDK 83
>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 333
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 91/194 (46%), Gaps = 44/194 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY PC + SC KC ++C N + Y+ D + +S V + +
Sbjct: 181 GCQPYMFPPCTGN-----NSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD 235
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++ +I +GP+E +F V+DD I YKSG +F N T
Sbjct: 236 -NMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNAT---------------------- 272
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
LGGH+++ +GWG + YWL+ NSWN WGD G FKI RG +
Sbjct: 273 --------------YLGGHSVKCIGWGVERNV--SYWLMMNSWNNTWGDGGNFKIRRGTN 316
Query: 294 ECGIESSITAGVPK 307
EC +E S TAG+P+
Sbjct: 317 ECQVEDSSTAGMPE 330
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GC GG+P AWRY+ K G+V+GG + S
Sbjct: 149 CGLGCQGGYPIRAWRYYSKHGLVTGGNFNS 178
>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
Length = 379
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 120/267 (44%), Gaps = 52/267 (19%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHHVNGTRPSCDASKGH 140
+PA FD+R +WPNCPTI EI +QGSC SCW P ++ C H +G+R S G+
Sbjct: 113 IPAEFDARLRWPNCPTIGEIFEQGSCASCWAVAPTDVMSDRICIH--SGSRHIVRLSAGN 170
Query: 141 TPKCVRECQENYDVPY---------KKDLNFGAKSYSVSSNEKSIMKEIYE---HGPVEG 188
C + C + + K + G S +K Y+ G ++
Sbjct: 171 LLSCCKLCGKGCKGGFPGGAWMHWSKHGIVTGGSYSSDYGCQKYQFFPCYQPRTKGSIKN 230
Query: 189 AFTVFDDLIL-------------YKSGRFF------VPGNETTAMSLIKWTIRDNTSQLG 229
D+ +L YK ++ +P N+ A+ L I +N
Sbjct: 231 KCPKTDNTLLECRETCRTSYNKSYKQDLYYGESVYRIP-NDARAIQL---EIMENGP--- 283
Query: 230 AEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
+ +++D + YK G+ L HA++I GWG + + YWL AN W+ WG+
Sbjct: 284 VQANLRIYEDFLHYKFGVYRHVHGQGLEYHAVKIFGWGTEGGT--PYWLAANPWSKRWGN 341
Query: 283 NGLFKILRGKDECGIESSITAGVPKLD 309
G FKILRG + IE + AG+PKLD
Sbjct: 342 GGFFKILRGSNHAEIEDHVMAGIPKLD 368
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 25/32 (78%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+LCG GC GGFPG AW +W K GIV+GG+Y S
Sbjct: 176 KLCGKGCKGGFPGGAWMHWSKHGIVTGGSYSS 207
>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/305 (28%), Positives = 120/305 (39%), Gaps = 110/305 (36%)
Query: 49 IPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGS 108
I A L++ +G P+N +P D LP NFD+R +WP I +R+Q
Sbjct: 36 ITTAKLRARLGAIDLNEGPSNYVP--------DTSLPDNFDAREQWPG--KILPVRNQEQ 85
Query: 109 CGSCW---------------GC-----RPYEIAPCE---HHVNGTRPSCD---------- 135
CGSCW GC P ++ C+ H NG P
Sbjct: 86 CGSCWAFAVAETTGNRLNILGCGRGDMSPQDLVSCDKVDHGCNGGSPLFSWEWVKHSGIT 145
Query: 136 --------ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
+ G P C ++C + K AKS + +K + E+Y GP E
Sbjct: 146 TEECIPYVSGGGRVPSCPKKCTNGSAIVRTK-----AKSVGLVKGDK-MQNELYSRGPFE 199
Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG- 246
A F+V++D YKSG
Sbjct: 200 AA--------------------------------------------FSVYEDFKSYKSGV 215
Query: 247 ------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
K LGGHA+ ++GWG ++ + YWLI NSW T WG+ G FKILRGK+ECGIE++
Sbjct: 216 YHHITGKMLGGHAVMVVGWGVEDGTP--YWLIQNSWGTTWGEQGFFKILRGKNECGIETT 273
Query: 301 ITAGV 305
G
Sbjct: 274 CFQGT 278
>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
Length = 350
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 95/204 (46%), Gaps = 59/204 (28%)
Query: 114 GCRPY-EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GCR + + PC+HH++G G +PKC C+ YK D ++G SYS+S +
Sbjct: 192 GCRLFPSLLPCKHHIHGXP---YVXTGDSPKCSMTCEPGQT--YKXDKHYGCSSYSISDS 246
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
K IM IY++ VE A
Sbjct: 247 TKDIMTNIYKNDXVEEA------------------------------------------- 263
Query: 233 AFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F+V+ D ++YK +G+ GGHAI ILG + + YWL+AN WN DWGDNG
Sbjct: 264 -FSVYLDFLMYKFKEYQGVTGEMXGGHAICILGCKVENSTS--YWLVANXWNRDWGDNGF 320
Query: 286 FKILRGKDECGIESSITAGVPKLD 309
FKILRG+D GIES + A +P +
Sbjct: 321 FKILRGQDHYGIESEVVAEIPHTE 344
Score = 44.3 bits (103), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 25/46 (54%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAH 53
LCG GCNGG P W +W G+VSGG Y S + S +P H
Sbjct: 159 LCGDGCNGGXPNEGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKH 204
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++LP+ + ++ D +LP +FD +WP+ P REIRDQGS G CW
Sbjct: 79 SKLPQRVKFAX-DINLPESFDPXEQWPDXPX-REIRDQGSYGFCWA 122
>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
Length = 326
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 86/319 (26%), Positives = 134/319 (42%), Gaps = 106/319 (33%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLP----ANRLPELIGYSEVDEDLPANFDSRTKWP 95
+AE N L R ++G+HPD N +++ +I +P +FD+R KWP
Sbjct: 38 KAETNCLDIKSRL---GFLGLHPDPNYKIQTKQHKISRII-------SIPESFDAREKWP 87
Query: 96 NCP-TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGH-----TPKCVRE-- 147
C I +IR+QG+CGSCW E+ T C +SKG +P+ +
Sbjct: 88 ECKDVIGKIRNQGNCGSCWAFASTEVM--------TDRLCISSKGKIKFVFSPENLLTCC 139
Query: 148 -----------------------------------CQENYDVPYK-KDLNFGAKSYSVSS 171
CQ + ++ + + K Y++ +
Sbjct: 140 KDCGCGCKGGYIKNAWDYYINEGIASGGDYNSSEGCQPYSESSFQYAEASECVKFYTLET 199
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
N I EI +GPV + VF+D +KSG ++
Sbjct: 200 NVAQIQMEILTNGPVMAYYNVFEDFACHKSGVYY-------------------------- 233
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD-NGLFKILR 290
YKSGK +G H+++++GWG +E YWLIANSW ++WG+ G FK+ R
Sbjct: 234 -----------YKSGKFVGRHSVKVIGWGTEEGIP--YWLIANSWGSEWGELGGFFKMRR 280
Query: 291 GKDECGIESSITAGVPKLD 309
G +EC IE +TAG ++
Sbjct: 281 GTNECWIEQEMTAGKVHIE 299
>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 157
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 88/204 (43%), Gaps = 57/204 (27%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKSYSVSS 171
GC PY+ PC HH+N T+ P C TP CV +C Y + D +F +S
Sbjct: 2 GCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPYHY 61
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ I GPV +
Sbjct: 62 SVNDAKNAIRTDGPVSAS------------------------------------------ 79
Query: 232 GAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
FTV++D + Y+SG LGGHA++I+GWGE KS + YWL NSWN DWGD+G
Sbjct: 80 --FTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGE--KSGQAYWLAVNSWNEDWGDHG 135
Query: 285 LFKILRGKDECGIESSITAGVPKL 308
LFKI G CGI+ + G PK+
Sbjct: 136 LFKIALG--NCGIDDDLLGGTPKV 157
>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
Length = 352
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 111/282 (39%), Gaps = 106/282 (37%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE--------------------IA 121
+ +PANF+S +W NC I I++Q CGSCW E +
Sbjct: 68 QAVPANFNSAQQWSNCSYISAIQNQARCGSCWAFGAVESVSDRFCIHKGEDVLLSFQDLV 127
Query: 122 PCEHHVNG--------------------------TRPSCDASKG------HTPKCVRECQ 149
C+ NG T P+C ++ TP+CV +C
Sbjct: 128 TCDQSDNGCQGGDAYTAMKFIQKKGIVSNDCLPYTIPTCAPAQQPCLNFVDTPQCVEKC- 186
Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
N Y +DL+F YS++ +I +EI +GPVE
Sbjct: 187 SNASYTYAQDLHFIDGVYSMNPTVNAIQQEIMTNGPVEAC-------------------- 226
Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
F V++D + YKSG K LGGH ++++GWG
Sbjct: 227 ------------------------FEVYEDFLGYKSGVYQHTTGKDLGGHCVKMIGWGT- 261
Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
++ E YW+ NSW T WG+ G+F I G +ECGIES + A
Sbjct: 262 -QNNELYWICNNSWTTYWGNQGVFWIKAGVNECGIESDVVAA 302
>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
Length = 350
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 106/271 (39%), Gaps = 87/271 (32%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYE 119
D P FD+R +WP C IR I++Q +CGSCW P
Sbjct: 123 RDFPTQFDAREQWPQC--IRSIKNQKNCGSCWAFSASSVLADRFCIKSGGKVNVDLSPQF 180
Query: 120 IAPCEHHVNGTRPSC-DAS--------------------KGHTPKC-VRECQENYDVPYK 157
+ C NG DA+ G P C V+ C VP +
Sbjct: 181 MVSCSGQNNGCNGGFFDATWRFLVSVGTVSEACVPYVSFGGAVPACNVKSC----GVPGQ 236
Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
K + A S IM ++ +GP++ A V+ D YKSG +
Sbjct: 237 KSPFYRAGSARKLEGMLDIMADLKANGPIQVAMGVYRDFYSYKSGVYH------------ 284
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
+ SG+ +GGHA++I+GWG D SK YW+ ANSW
Sbjct: 285 -------------------------HVSGRYVGGHAVKIVGWGYDSASKLPYWICANSWG 319
Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWG G F ILRG+ ECGI + +G P L
Sbjct: 320 EDWGIKGYFWILRGRGECGIGKMVWSGKPAL 350
>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
Length = 342
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 87/202 (43%), Gaps = 54/202 (26%)
Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY PC H N C TPKC + CQ Y+V YK D +G +YS+
Sbjct: 187 CKPYAFYPCGRHQNQKYFGPCPKELWPTPKCRKMCQLKYNVAYKDDKIYGNDAYSL---- 242
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
P NET M I + G+
Sbjct: 243 ---------------------------------PNNETRIMQEI-------FTNGPVVGS 262
Query: 234 FTVFDDLILYKSGKAL-------GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F+VF D +YK G + G HA++I+GWG + K YWLIANSWN DWGD G
Sbjct: 263 FSVFADFAIYKKGVYVSNGIQQNGAHAVKIIGWGVQDGLK--YWLIANSWNNDWGDEGYV 320
Query: 287 KILRGKDECGIESSITAGVPKL 308
+ LRG + CGIES + G K+
Sbjct: 321 RFLRGDNHCGIESRVVTGTMKV 342
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 28/36 (77%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
+++ +LP FD+R KWPNC +IR IRDQ +CGSCW
Sbjct: 84 DLNINLPETFDAREKWPNCTSIRTIRDQSNCGSCWA 119
>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 335
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 93/307 (30%), Positives = 136/307 (44%), Gaps = 49/307 (15%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+A++N N P+ + +G + + + E + ++P FDSR +W C T
Sbjct: 40 KAKQNFPENTPKEQIVRLLGSKRLLGVSKSPIKENDELYMDNSEVPEFFDSRLEWDYCET 99
Query: 100 IREIRDQ---GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVR---ECQENYD 153
I +R+Q GSC + + C NG +++ T C R C Y
Sbjct: 100 IGHVRNQGNCGSCWAHGTTGAFADRLCVA-TNGEFNELISAEELTFCCHRCGFGCNGGYP 158
Query: 154 VP----YKK---------DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFT--------- 191
+ +K+ D G + Y V +K+ H G T
Sbjct: 159 LKAWQYFKRHGVVTGGDYDTTDGCQPYRVPP----CVKDDEGHNSCSGQPTERNHKCSKK 214
Query: 192 -VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG-AEGAFTVFDDLILYKSG--- 246
DD I YK + A L T++ +T G E +F V+DD + Y+SG
Sbjct: 215 CYGDDTIDYKKNHY----KTKDAYYLKNTTMQKDTMVYGPIEASFDVYDDFMNYESGVYQ 270
Query: 247 -----KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
LGGHA++++GWG +E + YWL+ NSW WGD G+FKILRG DECGIESS
Sbjct: 271 RTGNASYLGGHAVKMIGWGVEEGTP--YWLMVNSWGEQWGDKGMFKILRGTDECGIESSC 328
Query: 302 TAGVPKL 308
TAGVP +
Sbjct: 329 TAGVPSV 335
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 24/30 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CGFGCNGG+P AW+Y+ + G+V+GG Y +
Sbjct: 149 CGFGCNGGYPLKAWQYFKRHGVVTGGDYDT 178
>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 282
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 107/264 (40%), Gaps = 88/264 (33%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GC-----RPY 118
E D LP NFD+R +WP I +RDQ SCGSCW GC P
Sbjct: 58 ESDNALPENFDAREQWPE--QILPVRDQASCGSCWAFSVAETMGDRLSIIGCGRGHMSPQ 115
Query: 119 EIAPCEHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYK 157
++ C+ G T C + G P C +C +
Sbjct: 116 DLVSCDTTDMGCNGGYMDKAWAWTKSHGVTNEECMPYQSGGGRVPACPAKCVNGSTIVRT 175
Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
K +F + S + +E+YE+GP+ AFTV+ D + YKSG +
Sbjct: 176 KSQSFTHFTAS------QMQQELYENGPLSVAFTVYYDFMNYKSGVY------------- 216
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
++K+G GGHA+ +GWG ++ + YWL NSW
Sbjct: 217 ------------------------VHKTGGVAGGHAVLCIGWGVEDNTP--YWLCQNSWG 250
Query: 278 TDWGDNGLFKILRGKDECGIESSI 301
WG+ G FKILRG + CGIE+ +
Sbjct: 251 PAWGEKGHFKILRGSNHCGIENQV 274
>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
Length = 323
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 99/200 (49%), Gaps = 45/200 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVS- 170
GC+PY+ PC+H+ + +C + + C ++C +NY V Y+ DL+ + Y S
Sbjct: 162 GCQPYKNRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSW 221
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+N K I +EI +GPV V+++ + YK G
Sbjct: 222 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEG---------------------------- 253
Query: 231 EGAFTVFDDLILYKS--GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
+YKS G+ +G H ++++GWG D E YWL NSWN++WG++GLFKI
Sbjct: 254 -----------IYKSTTGELIGYHHVKLIGWGVDGDGTE-YWLAMNSWNSNWGNDGLFKI 301
Query: 289 LRGKDECGIESSITAGVPKL 308
LRG + C IE + AG+ +
Sbjct: 302 LRGYNFCSIELLVMAGIVDV 321
>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
Length = 358
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 87/199 (43%), Gaps = 41/199 (20%)
Query: 113 WGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFGAKSYSVS 170
+GC+PY I PC+ S HTP C C N + + YK+D +FG Y+V
Sbjct: 196 FGCKPYSIYPCDKKYPNGTTSVPCPGYHTPTCEEHCTSNITWPIAYKQDKHFGKAHYNVG 255
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
I EI +GPV +F ++DD YKSG
Sbjct: 256 KKMTDIQTEIMTNGPVIASFVIYDDFWDYKSG---------------------------- 287
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
+ ++ +G GG +I+GWG D S YWL + W TD+G+NG + LR
Sbjct: 288 ---------IYVHTAGDQEGGMDTKIIGWGVD--SGVPYWLCVHQWGTDFGENGFVRFLR 336
Query: 291 GKDECGIESSITAGVPKLD 309
G +E IE + A +P +D
Sbjct: 337 GVNEVNIEHQVLAALPDID 355
>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
Length = 338
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 89/201 (44%), Gaps = 56/201 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + G +C R C + D+ + +D + Y ++
Sbjct: 183 GCEPYRVPPCPNDDQGNNTCAGKPMESNHRCTRMCYGDQDLDFDEDHRYTRDYYYLTYG- 241
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +
Sbjct: 242 -SIQKDVMTYGPIEAS-------------------------------------------- 256
Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F V+DD YKSG LGGHA++++GWGE+ YWL+ NSWN DWGD+G
Sbjct: 257 FDVYDDFPSYKSGVYVKSENASYLGGHAVKLIGWGEEYGVP--YWLMVNSWNEDWGDHGF 314
Query: 286 FKILRGKDECGIESSITAGVP 306
FKI RG +ECG+++S TAGVP
Sbjct: 315 FKIQRGTNECGVDNSTTAGVP 335
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 51/108 (47%), Gaps = 24/108 (22%)
Query: 30 IVSGGAYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------- 76
++ Y ++QA EK+ + NI A +W GV+ D + +L+G
Sbjct: 10 VIFVSVYMTEQAYFLEKDFIDNI-NAQATTWKAGVNFDPKTSKEHIMKLLGSRGVQIPNK 68
Query: 77 -----YSEVDED-----LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y D + +P FD+R KW +C TI +RDQG+CGSCW
Sbjct: 69 NNMNLYKSEDAEYDNTYIPRFFDARRKWRHCSTIGRVRDQGNCGSCWA 116
Score = 44.3 bits (103), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGCNGG+P AW+ + K G+V+GG Y S +
Sbjct: 151 CGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGE 182
>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
Length = 332
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 68/210 (32%), Positives = 98/210 (46%), Gaps = 60/210 (28%)
Query: 107 GSCGSCWGCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGA 164
G+ S GC+PY ++PC + + N T A K H +C R C + D +K+D F
Sbjct: 173 GNYDSSEGCQPYRVSPCPLDEYGNNTCRGKPAEKNH--RCTRMCYGDQDRDFKEDHRFTR 230
Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
+Y ++ +I K++ +GP+E +
Sbjct: 231 DAYYLTYG--TIQKDVMTYGPIEAS----------------------------------- 253
Query: 225 TSQLGAEGAFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSW 276
+ V+DD YKSG LGGHA++++GWGE+ YWL+ NSW
Sbjct: 254 ---------YEVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEEYGVP--YWLMVNSW 302
Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGVP 306
N WGD GLFKI RG +ECGI++S T GVP
Sbjct: 303 NDQWGDRGLFKIRRGTNECGIDNSTTGGVP 332
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 24/34 (70%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++ +P FD+R KW C TI E+RDQG CGSCW
Sbjct: 80 NQRIPKFFDARKKWRKCSTIGEVRDQGKCGSCWA 113
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG+GC+GG+P AW + K G+V+GG Y S +
Sbjct: 148 CGYGCHGGYPIKAWERFKKHGLVTGGNYDSSE 179
>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
Length = 430
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 116/274 (42%), Gaps = 69/274 (25%)
Query: 78 SEVDEDLPANFDSRTKWPNC---PTIREIRDQGSCGSCW--------GCR---------- 116
S+ E L A+ + W N + +R+Q CGSC+ G R
Sbjct: 179 SQDAEKLRASLPTEFDWTNVNGRDFVVPVRNQEQCGSCYAFSSSDMFGSRVRIPSNLTQV 238
Query: 117 ----PYEIAPCEHHVNG------------------TRPSCDASKGH-TPKCVRECQENYD 153
P +I C + G T SCD +GH KC +C N
Sbjct: 239 PVYSPQDIVDCSAYSQGCDGGFPFLVGKYAMDYGLTVESCDPYQGHDLGKCSNQCPVNRQ 298
Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF-FVPGNETT 212
+ Y +S+E S+M EIY++GP+ F V+ DL YK G + V E
Sbjct: 299 QRLHSSNYYFVGGYYGNSHELSMMHEIYQNGPLAIGFEVYPDLRNYKHGVYKHVTAEELK 358
Query: 213 AMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLI 272
A L + D++I + + HA+ ++GWG + + YW I
Sbjct: 359 AQGLSE-------------------DEMIPHFE---VVNHAVLMVGWGVENGTP--YWKI 394
Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
NSW+T WGDNG FKILRG DECG+ES AG+P
Sbjct: 395 KNSWSTTWGDNGYFKILRGSDECGVESDAEAGIP 428
>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
Length = 358
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 86/199 (43%), Gaps = 41/199 (20%)
Query: 113 WGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFGAKSYSVS 170
+GC+PY I PC+ S HTP C C N P YK+D +FG Y+V
Sbjct: 196 FGCKPYTIYPCDKKYPNGTTSVPCPGYHTPVCEERCTSNITWPISYKQDKHFGKAHYNVG 255
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
I EI +GPV +F ++DD YKSG
Sbjct: 256 KKMTDIQTEIMRNGPVIASFIIYDDFWDYKSG---------------------------- 287
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
+ ++ +G GG +I+GWG D + YWL + W TD+G+NG +ILR
Sbjct: 288 ---------IYVHTAGDQEGGMDTKIIGWGVD--NGVPYWLCVHQWGTDFGENGFVRILR 336
Query: 291 GKDECGIESSITAGVPKLD 309
G +E IE + A P LD
Sbjct: 337 GVNEVNIEHQVLAAQPDLD 355
>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
Length = 343
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 91/335 (27%), Positives = 130/335 (38%), Gaps = 118/335 (35%)
Query: 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED----LPANFDSRT 92
G K A + SN A K +GV P +L L+G V D LP +FD+RT
Sbjct: 59 GWKAAINDRFSNATVAEFKRLLGVKPT----PKKL--LLGVPVVSHDQSLKLPKSFDART 112
Query: 93 KWPNCPT------------------IREIRDQ-----------------GSCG------- 110
WP C + + + D+ CG
Sbjct: 113 HWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFGMNITLSVNDLLACCGFRCGDGC 172
Query: 111 ------SCW------GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
S W G E P + P C+ + +TP+C+R+C + + +
Sbjct: 173 DGGYPISAWQYFSYSGVVTEECDPYFDQTGCSHPGCEPAY-NTPQCLRKCVGRNQL-WSE 230
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
++ +Y V SN + IM EIY++GPVE +
Sbjct: 231 SKHYSINTYVVESNPQDIMAEIYKNGPVEVS----------------------------- 261
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
FTV++D YKSG +GGHA++++GWG + E YWL
Sbjct: 262 ---------------FTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDDG-EDYWL 305
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+AN WN WGD+G F I RG +ECGIE AG+P
Sbjct: 306 LANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLP 340
>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
Length = 375
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 93/206 (45%), Gaps = 65/206 (31%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVSSN 172
GC PY PC+ P + S TP C CQE Y YK D +F +Y +S+
Sbjct: 192 GCMPYSFPPCKK-----SPCVEFS---TPSCKTTCQEKYTTADYKNDKHFATSAYKLSTT 243
Query: 173 EK---SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
+ +I EIY +GPVE +
Sbjct: 244 KNAVPTIQYEIYHNGPVEAS---------------------------------------- 263
Query: 230 AEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
+ VF+D YKSG +GGHA++I+GWG + + YWL+ANSW T +G+
Sbjct: 264 ----YRVFEDFYQYKSGVYHHVSGNLVGGHAVKIIGWGTE--NGVDYWLVANSWGTSFGE 317
Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
G FKI RG +EC IES+I AG+ KL
Sbjct: 318 KGFFKIRRGTNECQIESNIVAGLAKL 343
Score = 38.1 bits (87), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 20/28 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG GC GG+ A +YW+ SG+V+GG Y
Sbjct: 161 CGKGCQGGYTIEAMKYWMNSGVVTGGDY 188
>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
Length = 381
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 81/306 (26%), Positives = 121/306 (39%), Gaps = 117/306 (38%)
Query: 70 RLPELIGYSEVDEDLPANFDSRTKWPNCP---TIRE---------------------IRD 105
+LP+ I +E P +FD+R KW CP TIR I
Sbjct: 121 KLPQGIVLKLQEEPFPESFDARQKWSFCPSVGTIRNQGCCASSYAVAAVATITDRWCIHS 180
Query: 106 QGSCGSCWGCRPYEIAPCEHHV----NGTRPS---------------------------- 133
+G +G Y++ C H +G PS
Sbjct: 181 EGKSQFSFG--AYDVLSCCHRCGFGCDGGVPSAVWHYWVENGITSGGAYESHEGCQSYPF 238
Query: 134 --CDASKGHTPK----CVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
C + P C+R+CQ Y+ Y +D +FG +YSV +E I+ E++ GPV+
Sbjct: 239 GVCKPQEIFAPHVDLICLRQCQPGYNTTYLEDKHFGRVAYSVPRDEDRILYELFYFGPVQ 298
Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
+F TV+ D I YKSG
Sbjct: 299 ASF--------------------------------------------TVYTDFIQYKSGV 314
Query: 248 -------ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
+G H+++I+GWG + +K +WL ANSW +WG+NG FKI+RG+D +ES+
Sbjct: 315 YRHTYGVRVGDHSVKIVGWGVENGTK--FWLCANSWGAEWGENGFFKIIRGEDHLSVESN 372
Query: 301 ITAGVP 306
+ AG+P
Sbjct: 373 VVAGLP 378
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/32 (62%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CGFGC+GG P W YWV++GI SGGAY S +
Sbjct: 200 CGFGCDGGVPSAVWHYWVENGITSGGAYESHE 231
>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 323
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 100/198 (50%), Gaps = 41/198 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVS- 170
GC+PY+ PC+H+ + + +C + + C +C +NY V Y+ DL + Y S
Sbjct: 162 GCQPYKNRPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW 221
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+N K I +EI +GPV V+++ + YK G + ++TA
Sbjct: 222 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGVY-----KSTA----------------- 259
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
G+ +G H ++++GWG DE E YWL NSWN++WG++GLFKILR
Sbjct: 260 ---------------GELIGYHHVKLIGWGVDEAGIE-YWLAMNSWNSNWGNDGLFKILR 303
Query: 291 GKDECGIESSITAGVPKL 308
G + C IE + AG+ +
Sbjct: 304 GYNFCSIELLVMAGLVDV 321
>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
Length = 321
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 85/285 (29%), Positives = 111/285 (38%), Gaps = 114/285 (40%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
LP NFDSR +W C I IR+Q CGSCW P ++
Sbjct: 86 LPTNFDSRQQWGKC--IHPIRNQEQCGSCWAFSASESLSDRFCIASNGKVDVILSPQDMV 143
Query: 122 PCEHHVNGTRPSCDASK-------------------------GHTPKCVRECQENYDVPY 156
C+++ G CD G+ P C C ++P
Sbjct: 144 SCDYNDMG----CDGGNLDNAWWWMKNKGIVPDSCMPYVSGGGNVPACPSNCNGT-NIPI 198
Query: 157 KKDLNFGAKSYSVSS------NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNE 210
L + AKS+S S I +EIY +GPV+G
Sbjct: 199 SSQLYY-AKSFSHISPWMFWERVADIQQEIYTNGPVQGG--------------------- 236
Query: 211 TTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDE 263
F+V+ D + YKSG LGGHAI+I+GWG +
Sbjct: 237 -----------------------FSVYQDFMNYKSGVYSHKTGSFLGGHAIKIIGWGVE- 272
Query: 264 KSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
YWL+ANSW+TDWG +G FKILRG +ECGIE + AG L
Sbjct: 273 -GGVDYWLVANSWSTDWGIDGTFKILRGHNECGIEDDVYAGPADL 316
>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/271 (30%), Positives = 110/271 (40%), Gaps = 45/271 (16%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE 124
LP R E ++ DLP +FD+ WP+CPTIREI DQ +C + W
Sbjct: 76 TLPPARFTE----EQLRTDLPESFDAAEHWPHCPTIREIADQSACRASWAVATASAISDR 131
Query: 125 HHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY--- 181
+ G S C ++C + Y A Y VS S + Y
Sbjct: 132 YCTVGKGKQLRISAADLMACCKDCGGGCEGGYPD----AAWEYYVSHGIASSQCQPYPFP 187
Query: 182 --EHGPVEGAFTVFDDLILYKSGRFFVPGNETT----AMSLIKWTIRDNTSQLGAEG--- 232
EH +G T +F P T + LIK+ + G E
Sbjct: 188 RCEHRGAQGKKTPCSKY------KFVTPQCNATCTDKTIPLIKYRGNHSYEVRGEEDYKR 241
Query: 233 ----------AFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
F V D + YK +G LGG A+RI+GWG+ + YW +ANS
Sbjct: 242 ELYFNGPFVVRFQVHSDFLAYKNGVYQHVAGNFLGGKAVRIVGWGKLNGT--PYWKVANS 299
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
W+TDWG NG F ILRG +EC IE AG P
Sbjct: 300 WDTDWGMNGYFLILRGDNECNIEHLGFAGTP 330
>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
Length = 355
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/199 (30%), Positives = 89/199 (44%), Gaps = 41/199 (20%)
Query: 113 WGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFGAKSYSVS 170
+GC+PY I PC+ + S HTP C C N + + YK+D +FG Y+V
Sbjct: 193 FGCKPYSIYPCDKNYPNGTTSVPCPGYHTPPCEDHCTSNITWPIAYKQDKHFGKAHYNVG 252
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
I EI +GPV +F +++D YKSG +
Sbjct: 253 KKMTDIQTEIMTNGPVIASFIIYEDFWDYKSGIY-------------------------- 286
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
++ +G GG +I+GWG D + YWL + W TD+G+NG +ILR
Sbjct: 287 -----------VHTAGDQEGGMDTKIIGWGVD--NGVPYWLCVHQWGTDFGENGFVRILR 333
Query: 291 GKDECGIESSITAGVPKLD 309
G +E IE + A +P +D
Sbjct: 334 GVNEVNIEHQVLAALPDVD 352
>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
Length = 463
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 100/211 (47%), Gaps = 65/211 (30%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDAS--KGHTPKCVRECQE----NYDVPYKKDL 160
G +CW PYEI C HH P+CD TPKC ++C+E + +P+ KD+
Sbjct: 268 GKGTTCW---PYEIPFCAHHAKAPFPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFDKDV 324
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+ + SYS+ S + ++ +++ HG V
Sbjct: 325 HKASSSYSLRSRD-AVKRDMMAHGTVT--------------------------------- 350
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIA 273
GAF V++D + YKSG LGGHAI+I+GWG ++ E+YW
Sbjct: 351 -----------GAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGTEDG--EEYWHAV 397
Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAG 304
NSWNT WGD+G FKI G +CG+++ + AG
Sbjct: 398 NSWNTYWGDSGHFKIEMG--QCGVDNEMVAG 426
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 71 LPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
LP + +E +PANFD+RT +P C + +RDQG CGSCW
Sbjct: 155 LPAKTVFENANEPVPANFDARTAFPVCKDVVGHVRDQGDCGSCWA 199
Score = 44.3 bits (103), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 6 IRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
I FGCNGG PGMAWR++ + G+V+GG + +
Sbjct: 234 IHCASFGCNGGQPGMAWRWFERKGVVTGGDFDT 266
>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 306
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 80/300 (26%), Positives = 111/300 (37%), Gaps = 119/300 (39%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
+L E + + +P +FD+RT+WP +I IRDQ CGSCW
Sbjct: 66 TKLREFPVVDTIVDAIPTSFDARTQWP--ASIHPIRDQQQCGSCWAFGATEALSDRLAIA 123
Query: 115 --------CRPYEIAPCE---------------HHV----------------NGTRPSCD 135
P ++ C+ H++ NG +C
Sbjct: 124 SNNSINVVLSPQDLVSCDSTDYGCDGGYPINAWHYMQSLGVVTDTCYPYTSGNGDSGTCQ 183
Query: 136 ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD 195
+ TP C YK +Y V++N +I EI +GPVE A
Sbjct: 184 ITGKKTPACATA------TFYKAK-----TAYQVANNMAAIQSEILANGPVEAA------ 226
Query: 196 LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILY-------KSGKA 248
F+V+DD Y +SG
Sbjct: 227 --------------------------------------FSVYDDFFSYTSGVYSHQSGAL 248
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
GGHA++I+GWG D + YW++ANSW T WG G F I RG DECGIE I AG+ +
Sbjct: 249 DGGHAVKIVGWGVDGTTP--YWIVANSWGTSWGQAGFFWIKRGNDECGIEDGIVAGLAAV 306
>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
Length = 234
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 96/200 (48%), Gaps = 61/200 (30%)
Query: 115 CRPY-EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C PY + C+H P C+ + TP C ++C+ V +K +F +Y V+S+
Sbjct: 68 CDPYFDQVGCKH------PGCEPAY-PTPVCEKKCKVQNQVWLEKK-HFSVNAYRVNSDP 119
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
IM E+Y++GPVE A
Sbjct: 120 HDIMAEVYQNGPVEVA-------------------------------------------- 135
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D YKSG +GGHA++++GWG + + E YWL+AN WN WGD+G F
Sbjct: 136 FTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTD-AGEDYWLLANQWNRGWGDDGYF 194
Query: 287 KILRGKDECGIESSITAGVP 306
KI+RG +ECGIE + AG+P
Sbjct: 195 KIIRGTNECGIEEDVVAGMP 214
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 23/25 (92%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
+CG GC+GG+P MAWRY+V++G+V+
Sbjct: 41 MCGDGCDGGYPIMAWRYFVRNGVVT 65
>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
thaliana]
Length = 183
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 95/200 (47%), Gaps = 61/200 (30%)
Query: 115 CRPY-EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C PY + C H P C+ + TPKC R+C + + + ++G +Y ++ +
Sbjct: 17 CDPYFDNTGCSH------PGCEPTY-PTPKCERKCVSRNQL-WGESKHYGVGAYRINPDP 68
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM E+Y++GPVE A
Sbjct: 69 QDIMAEVYKNGPVEVA-------------------------------------------- 84
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D YKSG +GGHA++++GWG + E YWL+AN WN WGD+G F
Sbjct: 85 FTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDG-EDYWLLANQWNRSWGDDGYF 143
Query: 287 KILRGKDECGIESSITAGVP 306
KI RG +ECGIE S+ AG+P
Sbjct: 144 KIRRGTNECGIEQSVVAGLP 163
>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 145
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 87/186 (46%), Gaps = 43/186 (23%)
Query: 122 PCEHHVNGTRPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEI 180
PC+H + C TP+C +C +Y Y KD N Y + + MKEI
Sbjct: 1 PCQHTESAVENPCSNKTFFTPECKVQCYNPDYGTRYVKD-NHKGTQYRIPG--YTAMKEI 57
Query: 181 YEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
YE+GP+ +F ++ D + Y+SG +
Sbjct: 58 YENGPITASFYMYQDFVNYQSGVY------------------------------------ 81
Query: 241 ILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
+ SGK + A++ILGWGE+ + YWL ANS+NT WGDNG KILRG +EC IE
Sbjct: 82 -AFNSGKYVTTQAVKILGWGEENGTP--YWLAANSFNTYWGDNGFVKILRGANECYIEEF 138
Query: 301 ITAGVP 306
+ AG+P
Sbjct: 139 MYAGLP 144
>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/329 (26%), Positives = 118/329 (35%), Gaps = 104/329 (31%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI A K G + +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFAEAKRLTGAWIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT--------------- 141
CPTIREI DQ +C + W + + G S H
Sbjct: 103 CPTIREIADQSACRASWAVSTASVISDRYCTVGGVQQLRISAAHLLSCCKQCGGGCKGGF 162
Query: 142 -----------------------PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI-- 176
P C + P K NF + + +KSI
Sbjct: 163 PGFAWRYYVEYGIASSYCQPYPFPHCEHRGAQGNKTPCSK-YNFDTPKCNATCTDKSIPL 221
Query: 177 ------------------MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
+E+Y +GP F V+ DL YKSG
Sbjct: 222 VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSG---------------- 265
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
V+ ++ G LGG A+RI+GWG+ + YW +AN+W+T
Sbjct: 266 -----------------VYRNV----DGDILGGQAVRIVGWGKLNGT--PYWKVANTWDT 302
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG +G ILRG +EC IE AG P+
Sbjct: 303 DWGMDGYLLILRGNNECNIEHLGFAGTPE 331
>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 196
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 89/201 (44%), Gaps = 56/201 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + G +C R C + ++ + +D + Y ++
Sbjct: 41 GCEPYRVPPCPYDEQGNNTCAGKPMEKNHRCTRICYGDQELDFDEDHRYTRDYYYLTYG- 99
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +
Sbjct: 100 -SIQKDVMTYGPIEAS-------------------------------------------- 114
Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F V+ D YKSG LGGHA++++GWGE + YWL+ NSWN DWGDNGL
Sbjct: 115 FDVYSDFPSYKSGIYERTENATYLGGHAVKLIGWGE--QYGIPYWLMVNSWNEDWGDNGL 172
Query: 286 FKILRGKDECGIESSITAGVP 306
FKI RG +ECG+++S TAGVP
Sbjct: 173 FKIRRGTNECGVDNSTTAGVP 193
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CGFGC+GG+P AW+ + G+V+GG Y S +
Sbjct: 9 CGFGCHGGYPIRAWKRFKNHGLVTGGDYKSGEG 41
>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 332
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 87/178 (48%), Gaps = 41/178 (23%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY + PC +H G SC TP C + CQ Y Y+KD ++ Y + +E
Sbjct: 194 CKPYPLHPCGNH-GGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDE 252
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +E+ ++GPV+ AF ++D Y G
Sbjct: 253 KAIQREMMKNGPVQAAFITYEDFSFYTKG------------------------------- 281
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ ++ G+ G HA++++GWG + +K YW +ANSW+TDWG+NG F+ILRG
Sbjct: 282 ------IYVHTRGRQRGAHAVKVVGWGVENGTK--YWNVANSWSTDWGENGYFRILRG 331
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 26/39 (66%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE 119
++D+P +FDSR W +C +I IRDQ +CGSCW E
Sbjct: 92 NDDIPESFDSREVWKSCSSITYIRDQSNCGSCWAVSAAE 130
>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 87/178 (48%), Gaps = 41/178 (23%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY + PC +H G SC TP C + CQ Y Y+KD ++ Y + +E
Sbjct: 194 CKPYPLHPCGNH-GGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDE 252
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +E+ ++GPV+ AF ++D Y G
Sbjct: 253 KAIQREMMKNGPVQAAFITYEDFSFYTKG------------------------------- 281
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ ++ G+ G HA++++GWG + +K YW +ANSW+TDWG+NG F+ILRG
Sbjct: 282 ------IYVHTRGRQRGAHAVKVVGWGVENGTK--YWNVANSWSTDWGENGYFRILRG 331
Score = 38.1 bits (87), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 20/33 (60%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
R CG GCNGG AW Y + G+V+GG Y K
Sbjct: 159 RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEK 191
>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
Length = 569
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 95/211 (45%), Gaps = 65/211 (30%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDAS--KGHTPKCVRECQENYDV----PYKKDL 160
G +CW PYE+ C HH P CDA+ TPKC ++C+E P+ +D
Sbjct: 374 GKGTTCW---PYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDT 430
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+ +YS+ S + + +++ HGPV G
Sbjct: 431 HKATSAYSLRSRD-DVKRDMMTHGPVSG-------------------------------- 457
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIA 273
AF V++D + YKSG +GGHAI+I+GWG + + E+YW
Sbjct: 458 ------------AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTE--NGEEYWHAV 503
Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAG 304
NSWNT WGD G FKI G +CGI+ + AG
Sbjct: 504 NSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 532
Score = 45.4 bits (106), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 77 YSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
+ E +PA+FD+RT +P C + +RDQG CGSCW
Sbjct: 267 FENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWA 305
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%)
Query: 6 IRLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
I FGCNGG PGMAWR++ + G+V+GG +
Sbjct: 340 IHCASFGCNGGQPGMAWRWFERKGVVTGGDF 370
>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 388
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 101/209 (48%), Gaps = 66/209 (31%)
Query: 114 GCRPYEIAPCEHHVN--GTRPSCDASKGHTPK--CVRECQENYDVP-YKKDLNFGA-KSY 167
GC PY C HHV+ G P KG++P C C+ ++ P ++ D +F + Y
Sbjct: 221 GCWPYNFPECSHHVDTKGMEPC----KGNSPSPVCSTTCRNHHFKPSFESDRHFTEDEGY 276
Query: 168 SVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQ 227
S+ ++ I +EI ++GPV A
Sbjct: 277 SLDEVDE-IKREIIDNGPVAAA-------------------------------------- 297
Query: 228 LGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
FTV++D YKSG LGGHA++I+GWG D+ E+YWL+ NSWN +W
Sbjct: 298 ------FTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGWGIDQN--EQYWLVMNSWNVNW 349
Query: 281 GDNGLFKILRGKDECGIESSITAGVPKLD 309
GD G+FKI G ECGI+S +TAG+PK +
Sbjct: 350 GDQGIFKIAIG--ECGIDSEVTAGIPKYE 376
>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 124/312 (39%), Gaps = 95/312 (30%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+ +N +N A +K +G ++ ++ +++++ +P +FDSRT+W C
Sbjct: 40 EVSENKFANYTEAQIKGLLGTVLSHS------SDIPAFTQINAAVPDSFDSRTQWQGC-- 91
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ IRDQ CGSCW P ++ C+ + G
Sbjct: 92 VHPIRDQAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSCDTNNYGCDGGYLNL 151
Query: 130 ----------TRPSCD---ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
SC+ ++ G P C +C + K K + ++ KS+
Sbjct: 152 AWQYLEKKGVASDSCEPYKSASGTAPSCPSKCANGQAIKKYKCQAGSTKQANGAAATKSL 211
Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
I + GPVE FTV+ D YKSG +
Sbjct: 212 ---IQQSGPVETGFTVYADFFNYKSGIYH------------------------------- 237
Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
+ SG A GGHA++ILGWG ++ E YW++ANSW WG+ G F I +G + G
Sbjct: 238 ------HVSGGAEGGHAVKILGWG--KQGSENYWIVANSWGESWGEKGFFNIRQG--DSG 287
Query: 297 IESSITAGVPKL 308
I+ + +P L
Sbjct: 288 IDQATFGCIPDL 299
>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
Length = 572
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 95/211 (45%), Gaps = 65/211 (30%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDAS--KGHTPKCVRECQENYDV----PYKKDL 160
G +CW PYE+ C HH P CDA+ TPKC ++C+E P+ +D
Sbjct: 377 GKGTTCW---PYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDT 433
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+ +YS+ S + + +++ HGPV G
Sbjct: 434 HKATSAYSLRSRD-DVKRDMMTHGPVSG-------------------------------- 460
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIA 273
AF V++D + YKSG +GGHAI+I+GWG + + E+YW
Sbjct: 461 ------------AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTE--NGEEYWHAV 506
Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAG 304
NSWNT WGD G FKI G +CGI+ + AG
Sbjct: 507 NSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 535
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 77 YSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
+ E +PA+FD+RT +P C + +RDQG CGSCW
Sbjct: 270 FENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWA 308
Score = 44.7 bits (104), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%)
Query: 6 IRLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
I FGCNGG PGMAWR++ + G+V+GG +
Sbjct: 343 IHCASFGCNGGQPGMAWRWFERKGVVTGGDF 373
>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
Length = 339
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 92/197 (46%), Gaps = 57/197 (28%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV-SSNE 173
C+PY PC+ + P TPKC + CQ Y VPY++D FG S+ + NE
Sbjct: 187 CKPYPFYPCDGNYG---PCPKEGAFDTPKCRKICQFRYPVPYEEDKVFGKNSHILLQDNE 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
I +EI+ +GPV GA
Sbjct: 244 ARIRQEIFINGPV------------------------------------------GAN-- 259
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F VF+D I YK G K +G HAI+++GWG + + YWL+ANS+N DWG+NG F
Sbjct: 260 FYVFEDFIHYKEGIYKQTYGKWIGVHAIKLIGWGTENGTD--YWLVANSYNYDWGENGTF 317
Query: 287 KILRGKDECGIESSITA 303
+ILRG + C IES + A
Sbjct: 318 RILRGTNHCLIESQVIA 334
>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
Length = 569
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 95/211 (45%), Gaps = 65/211 (30%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDAS--KGHTPKCVRECQENYDV----PYKKDL 160
G +CW PYE+ C HH P CDA+ TPKC ++C+E P+ +D
Sbjct: 374 GKGTTCW---PYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDT 430
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+ +YS+ S + + +++ HGPV G
Sbjct: 431 HKATSAYSLRSRD-DVKRDMMTHGPVSG-------------------------------- 457
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIA 273
AF V++D + YKSG +GGHAI+I+GWG + + E+YW
Sbjct: 458 ------------AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTE--NGEEYWHAV 503
Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAG 304
NSWNT WGD G FKI G +CGI+ + AG
Sbjct: 504 NSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 532
Score = 45.4 bits (106), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 77 YSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
+ E +PA+FD+RT +P C + +RDQG CGSCW
Sbjct: 267 FENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWA 305
Score = 44.7 bits (104), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%)
Query: 6 IRLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
I FGCNGG PGMAWR++ + G+V+GG +
Sbjct: 340 IHCASFGCNGGQPGMAWRWFERKGVVTGGDF 370
>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 124/312 (39%), Gaps = 95/312 (30%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+ +N +N A +K +G ++ ++ +++++ +P +FDSRT+W C
Sbjct: 40 EVSENKFANYTEAQIKGLLGTVLSHS------SDIPAFTQINAAVPDSFDSRTQWQGC-- 91
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ IRDQ CGSCW P ++ C+ + G
Sbjct: 92 VHPIRDQAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSCDTNNYGCDGGYLNL 151
Query: 130 ----------TRPSCD---ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
SC+ ++ G P C +C + K K + ++ KS+
Sbjct: 152 AWQYLEKKGVASDSCEPYKSASGTAPSCPSKCSNGQAIKKYKCKAGSTKQANGAAATKSL 211
Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
I + GPVE FTV+ D YKSG +
Sbjct: 212 ---IQQSGPVETGFTVYADFFNYKSGIYH------------------------------- 237
Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
+ SG A GGHA++ILGWG ++ E YW++ANSW WG+ G F I +G + G
Sbjct: 238 ------HVSGGAEGGHAVKILGWG--KQGSENYWIVANSWGESWGEKGFFNIRQG--DSG 287
Query: 297 IESSITAGVPKL 308
I+ + +P L
Sbjct: 288 IDQATFGCIPDL 299
>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
Length = 324
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 130/331 (39%), Gaps = 110/331 (33%)
Query: 41 AEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSE------VDEDLPANFDSRTKW 94
A +N + P HLK G A P+L+G ++ + E +P FD RT W
Sbjct: 41 AGRNFPEDTPIEHLKRLNG--------ALITPDLVGKNQTHVINVIPEAIPETFDGRTHW 92
Query: 95 PNCPT---IREIRDQGS----------------------------------CGSCW-GC- 115
CP+ IR + GS C +C GC
Sbjct: 93 SQCPSLKNIRNQGNCGSCWAFGSVEVMTDRLCIASKGKTKFEFSADDLLACCTACGKGCD 152
Query: 116 -----RPYEIAPCEHHVNG----TRPSCDASKGH------TPKCVREC-QENYDVPYKKD 159
R +E + V+G + C +G TPKC +C Y PY KD
Sbjct: 153 GGAPYRAFEYWVAKGIVSGGDYNSNEGCQPYEGSAFLNSVTPKCSTKCLNSKYTTPYAKD 212
Query: 160 LNFGAK-SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
++G Y S N I EI +GPV V++D YKSG +
Sbjct: 213 KHYGTDFIYMTSKNVAEIQTEIMNNGPVVTHMDVYEDFYSYKSGVY-------------- 258
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ SG ++GGHA++I+GWG ++ YWLIANSW
Sbjct: 259 -----------------------QHVSGNSMGGHAVKIIGWGTEKGV--PYWLIANSWGA 293
Query: 279 DWGD-NGLFKILRGKDECGIESSITAGVPKL 308
W D +G +KILRGK+ C IE+ I G P++
Sbjct: 294 KWADLDGFYKILRGKNHCKIETYIYGGTPQV 324
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 19/32 (59%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC+GG P A+ YWV GIVSGG Y S +
Sbjct: 147 CGKGCDGGAPYRAFEYWVAKGIVSGGDYNSNE 178
>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
Length = 351
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 71/270 (26%), Positives = 111/270 (41%), Gaps = 84/270 (31%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRP 117
+ + +P FD RTKWP C +R+IRDQ +CG+CW P
Sbjct: 116 LKDSIPLEFDFRTKWPQC--LRKIRDQANCGACWAFTGSGMLADRICILTNGTINEELSP 173
Query: 118 YEIAPCEHHVNG------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
++ C H G T+ SC K T KC CQ + +K
Sbjct: 174 QDMVDCSHDNFGCEGGYLMNALDYLMNEGVTKESCTPYKDKTNKCQYTCQNKTEEFHKHY 233
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
G + V +NE+ I +++ ++GP+ TV++D I Y +G +
Sbjct: 234 CKPG--TLRVLTNEEQIKRDLMQNGPLMVGLTVYEDFINYATGDY--------------- 276
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+ +G+ +GGHA++++GW +K + WLI N WN D
Sbjct: 277 ----------------------KFVAGEIVGGHAVKLMGWRTTQKGQTS-WLIQNQWNDD 313
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKLD 309
WG+ G IL ++E GI+S P +D
Sbjct: 314 WGEQGFGYIL--ENEVGIDSIGVGCTPDID 341
>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
Length = 323
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 105/274 (38%), Gaps = 86/274 (31%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRP 117
VD +P+ FD+R +WP C + + +Q CGSCW P
Sbjct: 91 VDASIPSTFDAREQWPGC--VHAVLNQEQCGSCWAFSSSEALSDRLCIASKGQVNVTLSP 148
Query: 118 YEIAPCE----HHVNGTRPSC------------------DASKGHTPKCVRECQENYDVP 155
+ C+ NG P A G C R+C + +
Sbjct: 149 QALVACDDIGNQGCNGGVPQLAWEYMEWKGLPTFECYPYTAGNGTDGTCQRQCADGSAMT 208
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
Y + F S + ++ I EI +GPV G V+ D + Y SG +
Sbjct: 209 YYRAKPF---SMTTCNSVACIQNEIITYGPVVGTMMVYQDFMSYSSGVY----------- 254
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
+ D T++L LGGHAI I+GWG D SK YW++ NS
Sbjct: 255 -----VYDGTAEL--------------------LGGHAIEIVGWGTDATSKLDYWIVKNS 289
Query: 276 WNTDWGD-NGLFKILRGKDECGIESSITAGVPKL 308
W+ WG +G F I RG + CGI+ +A KL
Sbjct: 290 WSAAWGGLDGYFWIQRGTNMCGIDHDASASQAKL 323
>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
Length = 294
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 44/91 (48%), Positives = 62/91 (68%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P+C TP+C ++CQ+ Y PY++D ++G +SY+V SNE
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKHYGEESYNVISNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
K+I KEI +GPVE AF V++D + YKSG +
Sbjct: 247 KAIQKEIMMNGPVEAAFDVYEDFLNYKSGIY 277
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 63 DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D + R P + + +++ ++P+ FDSR KWP+C +I +IRDQ CGSCW
Sbjct: 70 DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/27 (74%), Positives = 23/27 (85%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG+AW YWVK GIV+GG+
Sbjct: 155 CGDGCQGGFPGVAWDYWVKRGIVTGGS 181
>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 830
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/313 (27%), Positives = 125/313 (39%), Gaps = 102/313 (32%)
Query: 12 GCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRL 71
GCNGGFP AW + GI +GG Y +K P Y+ P
Sbjct: 604 GCNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWP-------------YDFPP--- 647
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR 131
A+ + TK+P CP + SC G P A +
Sbjct: 648 -------------CAHHINDTKYPECPKV----------SCSGESPPATAETATVI---- 680
Query: 132 PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKS----YSVSSNEKSIMKE-----IY 181
+ TP C +C Y + D +F +S YSV+ + +I + IY
Sbjct: 681 --AYQNSYETPNCAEQCHNPKYTTTLRDDRHFMLESSPYQYSVNDAKNAIRTDGPVGPIY 738
Query: 182 EHGP------VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
P V +F+V++D + YKSG +
Sbjct: 739 FCDPNVNFDQVSASFSVYEDFLAYKSGVY------------------------------- 767
Query: 236 VFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
+ SG+ LGGHA++I+GWGE+ S + YW++ NSWN DWGD+GLFKI G C
Sbjct: 768 ------KHTSGEYLGGHAVKIIGWGEE--SGQAYWIVVNSWNEDWGDHGLFKIALGN--C 817
Query: 296 GIESSITAGVPKL 308
GI+ ++ G PK+
Sbjct: 818 GIDDNLLGGTPKV 830
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 29/41 (70%), Gaps = 2/41 (4%)
Query: 76 GYS-EVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
GY+ E +DLP +FD+RT +PNC I IRDQ +CGSCW
Sbjct: 528 GYAIEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWA 568
>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 340
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 91/201 (45%), Gaps = 56/201 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + +G +C R C + D+ + D SY ++
Sbjct: 185 GCEPYRVPPCPYDESGNNTCSGKPMEQNHRCTRMCYGDQDLDFDDDHRHTRDSYYLTIG- 243
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +
Sbjct: 244 -SIQKDVMTYGPIEAS-------------------------------------------- 258
Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F V+DD + YKSG LGGHA++++GWGE+ + YWL+ NSWN DWGD GL
Sbjct: 259 FDVYDDFLSYKSGVYVRSENASYLGGHAVKLIGWGEEYGTP--YWLMMNSWNADWGDEGL 316
Query: 286 FKILRGKDECGIESSITAGVP 306
FKI RG +ECG+++S TAGVP
Sbjct: 317 FKIRRGTNECGVDNSTTAGVP 337
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 25/38 (65%)
Query: 77 YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
Y + +P FD+R KW +C TI +RDQG+CGSCW
Sbjct: 81 YDNLFGRIPKKFDARKKWRHCTTIGAVRDQGNCGSCWA 118
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG+GCNGG+P AW + K G+V+GG Y S +
Sbjct: 153 CGYGCNGGYPIKAWERFKKHGLVTGGEYKSGE 184
>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
Length = 332
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/178 (30%), Positives = 87/178 (48%), Gaps = 41/178 (23%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY + PC +H G SC TP C + CQ Y Y+KD ++ Y + +E
Sbjct: 194 CKPYPLHPCGNH-GGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDE 252
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +E+ ++GPV+ AF ++D Y G
Sbjct: 253 KAIQREMMKNGPVQAAFITYEDFSFYTKG------------------------------- 281
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ ++ G+ G HA++++GWG + + KYW +ANSW+TDWG++G F+ILRG
Sbjct: 282 ------IYVHTRGRQRGAHAVKVVGWGVENGT--KYWNVANSWSTDWGEDGYFRILRG 331
Score = 38.1 bits (87), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 20/33 (60%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
R CG GCNGG AW Y + G+V+GG Y K
Sbjct: 159 RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEK 191
>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 275
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 72/268 (26%), Positives = 111/268 (41%), Gaps = 88/268 (32%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GC-----RPYEI 120
+E+ PA+FD R KWP +R+Q SCGSCW GC P ++
Sbjct: 54 NENAPASFDCRQKWPG--KAEPVRNQASCGSCWAHAASETMGFRMGIRGCYKGVMSPQDL 111
Query: 121 APCEHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYKKD 159
CE + G T C + G P C +C+ ++
Sbjct: 112 VSCESNNMGCEGGYADRVWNWIQKKGITTEQCLPYVSGSGRVPTCPSKCKNGSNIVRSFV 171
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
++G S N K++M E+ +GPV F VF+D + YKSG
Sbjct: 172 SSWG------SFNSKTVMDEVANNGPVYACFEVFEDFLNYKSG----------------- 208
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+ +K+GK+ G H + ++GWG + + YWL+ NSW +
Sbjct: 209 --------------------IYQHKTGKSKGWHHVMLMGWGTE--NGVPYWLLQNSWGSG 246
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPK 307
WG+ G F+I RG ++C I+ +G+PK
Sbjct: 247 WGEKGFFRIRRGTNDCHIDEIFYSGLPK 274
>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 114/298 (38%), Gaps = 100/298 (33%)
Query: 51 RAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCG 110
RA L + +G H Y P + SE P FD+R +WP I +RDQ SCG
Sbjct: 42 RAMLGAELGPHMPYVQP-------LSLSE-----PTEFDAREQWPG--KILPVRDQASCG 87
Query: 111 SCWGCRPYE-------IAPCEHHVNGTRP--SCDAS------------------------ 137
SCW E IA C + SCD +
Sbjct: 88 SCWAHSVAEAMGDAQNIAGCPRGAMSVQDLVSCDKTDSACNGGDMKKAQEYLVKTGITTE 147
Query: 138 --------KGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
G P C +C + + + +S+ S IM+ + E+GP+
Sbjct: 148 ACVKYVSGSGRVPACPSKCDNGSQI-----IRYKLQSWK-SVEPSEIMQALMEYGPLSCG 201
Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
F V+ D + Y+SG + +KSG
Sbjct: 202 FMVYSDFMNYRSGVY-------------------------------------QHKSGYFE 224
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
GGHA+ + GWG + + YWL+ NSW WG+ G FKILRG + C IES +T GVPK
Sbjct: 225 GGHAVLLCGWGVE--NGLPYWLVQNSWGPAWGEKGFFKILRGSNHCEIESYVTLGVPK 280
>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
Length = 348
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/273 (31%), Positives = 120/273 (43%), Gaps = 46/273 (16%)
Query: 57 WMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWGC 115
++G+HPD P ++ + ++ + +P +FD+R KWP C I +IRDQG+CGSCW
Sbjct: 54 FLGLHPD---PDYKIQT--KHHKIAKSIPESFDAREKWPECKDVIGKIRDQGTCGSCWAF 108
Query: 116 RPYEIAPCEHHVNGTRPSCDASKGHT-----PKCVRECQENYDVPYKKDLNFGAKSYSVS 170
E+ T C +KG T P+ + C E D + + AK++
Sbjct: 109 ASTEVM--------TDRLCIGTKGETKFVFSPENLLTCCE--DCRLECVGGYTAKAWDYY 158
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK----------WT 220
NE + Y EG Y V + + +T
Sbjct: 159 INEGIVSGGDYNSS--EGCQPYSKASFQYAVASKCVKACQNDKYDVKYDDDKHYGDSFYT 216
Query: 221 IRDNTSQLGAE--------GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLI 272
+ N +Q+ E F VF+D+I YKSG L + IL WG +E YWLI
Sbjct: 217 LETNVTQIQTEILTNGPVMATFNVFEDIIYYKSGIQLSN--VSILRWGTEEGVP--YWLI 272
Query: 273 ANSWNTDWGD-NGLFKILRGKDECGIESSITAG 304
ANSW T WGD G KI RG +EC IE + AG
Sbjct: 273 ANSWGTWWGDLGGFIKIKRGTNECAIEQEMAAG 305
>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 87/302 (28%), Positives = 118/302 (39%), Gaps = 107/302 (35%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRP------- 117
LP R E ++ +LP +FD+ KWP+CPTIREI DQ +C + W
Sbjct: 76 TLPPVRFTE----EQLRTELPESFDAAEKWPHCPTIREIPDQSACRASWAVATASAISDR 131
Query: 118 -------------------------------YEIAPCEHHV-NGTR---------PSCD- 135
Y A E++V NG P C+
Sbjct: 132 YCTVGNGKQLRISAADLMACCTGCGGGCEGGYPDAAWEYYVSNGITSSQCQPYPFPRCEH 191
Query: 136 -ASKGHTPKCVRECQENYDVP------YKKDLNF----GAKSYSVSSNEKSIMKEIYEHG 184
++G P C + N+D P K + G SY V E+ +E+Y +G
Sbjct: 192 RGAQGKKPPCSK---YNFDTPTCNATCTDKSVPLIKYRGNHSYEVRG-EEDYKRELYFNG 247
Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
P F V D + YKSG + +
Sbjct: 248 PFVVRFQVHSDFLAYKSG-------------------------------------VYQHV 270
Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
+G LGG A+RI+GWG + + YW +ANSW+TDWG NG F ILRG +EC IE AG
Sbjct: 271 AGNFLGGKAVRIVGWG--KMNGTPYWKVANSWDTDWGMNGYFLILRGNNECNIEHLGFAG 328
Query: 305 VP 306
P
Sbjct: 329 TP 330
>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 116/297 (39%), Gaps = 42/297 (14%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI A K G + LP R E ++ LP FD+ WP+
Sbjct: 47 KAVYNGKMQNITFAEAKRLTGAWIQKSSTLPPARFTE----EQLRTKLPETFDAAEHWPH 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ +C + W + G S C ++C + +
Sbjct: 103 CPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAADLLSCCKQCGDGCKGGF 162
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT---- 212
Y ++S+ + H GA YK F P T
Sbjct: 163 PGFAWLYYVEYGIASS--GCQPYPFPHCEHRGAQGNKTPCSKYK---FDTPKCNATCTDK 217
Query: 213 AMSLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS-------GKAL 249
++ L+K+ R N + L G F V+ DL YKS G L
Sbjct: 218 SIPLVKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFL 275
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
GG A+RI+GWG+ + YW +ANSW+TDWG NG ILRG +EC IE G P
Sbjct: 276 GGQAVRIVGWGKLNGT--PYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFP 330
Score = 41.2 bits (95), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 17/28 (60%), Positives = 20/28 (71%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGG 34
+ CG GC GGFPG AW Y+V+ GI S G
Sbjct: 152 KQCGDGCKGGFPGFAWLYYVEYGIASSG 179
>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/312 (25%), Positives = 120/312 (38%), Gaps = 95/312 (30%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+ +N +N A LK +G + + +++++ LP +FDSRT+W +C
Sbjct: 40 EVSQNKFANYTEAQLKGLLGTVLSHQ------SGISAFTQINAALPDSFDSRTQWKDC-- 91
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ IRDQ CGSCW P ++ C+ G
Sbjct: 92 VHPIRDQAQCGSCWAFAAAESLSDRFCIASQGKVNLVLSPQDMVSCDTSNFGCFGGYLDQ 151
Query: 130 ----------TRPSCDASK---GHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
+ SC+ K G P C +C + K A S + ++
Sbjct: 152 AWQYLEQQGVSSDSCEPYKSGNGDQPSCPTKCSNGQAI---KKYKCKAGSTKQAKGAEAT 208
Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
I E GPVE FTV+ D Y SG +
Sbjct: 209 KSLIQESGPVETGFTVYQDFYNYNSGVYH------------------------------- 237
Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
+ +G A GGHA++ILGWG+ + E YW++ANSW DWG+ G F I +G + G
Sbjct: 238 ------HVTGDAEGGHAVKILGWGK--QGLENYWIVANSWGEDWGEKGYFNIRQG--DSG 287
Query: 297 IESSITAGVPKL 308
I+ + +P +
Sbjct: 288 IDEATFGCIPDV 299
>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/271 (30%), Positives = 111/271 (40%), Gaps = 45/271 (16%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE 124
+LP R E ++ +LP +FD+ WP+CPTIREI DQ +C + W
Sbjct: 76 SLPPVRFTE----EQLRTELPESFDAAEHWPHCPTIREIADQSACRASWAVATASAISDR 131
Query: 125 HHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY--- 181
+ G S C ++C + Y A Y VS S + Y
Sbjct: 132 YCTVGKGKQLRISAADLMACCKDCGGGCEGGYPD----AAWEYYVSHGITSSQCQPYPFP 187
Query: 182 --EHGPVEGAFTVFDDLILYKSGRFFVPGNETT----AMSLIKWTIRDNTSQLGAEG--- 232
EH +G +F P T ++ LIK+ + G E
Sbjct: 188 RCEHRGAQGKKPPCSKY------KFVTPQCNATCTDKSVPLIKYRGNHSYEVRGEEDYKR 241
Query: 233 ----------AFTVFDDLILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANS 275
F V D + YKS G LGG A+RI+GWG+ + YW +ANS
Sbjct: 242 ELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKLNGT--PYWKVANS 299
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
W+TDWG NG F ILRG +EC IE AG P
Sbjct: 300 WDTDWGMNGYFLILRGDNECNIEHLGFAGTP 330
>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
Length = 348
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 87/192 (45%), Gaps = 40/192 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
C+PY C H +C + TP C CQ Y Y+ D Y + ++E+
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKARTWYWLPNDER 254
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I EI + GPV F +++D Y+ G +
Sbjct: 255 TIQLEIMQKGPVHATFNIYEDFEHYEGGVY------------------------------ 284
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG-DNGLFKILRGKD 293
++ +G GGH+I+I+GWG D+ K YWLIANSW+TDWG D G F+++RG +
Sbjct: 285 -------IHTAGAMEGGHSIKIIGWGVDKGVK--YWLIANSWSTDWGEDGGYFRVVRGIN 335
Query: 294 ECGIESSITAGV 305
C IE + AG
Sbjct: 336 NCDIEGGVLAGT 347
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
N LP I ++D+P +FDSR KW +CP++R I DQ +CGSCW
Sbjct: 83 NVLP--IANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWA 126
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 24/33 (72%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
+ CG+GC+GG+ AW++ +G+V+GGAY K
Sbjct: 160 KFCGYGCDGGYNARAWKWATIAGVVTGGAYKEK 192
>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 116/297 (39%), Gaps = 42/297 (14%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI + K G + LP R E ++ LP FD+ WP+
Sbjct: 47 KAVYNGKMQNITFSEAKRLTGARIQKSRTLPPARFTE----EQLRTKLPETFDAAEHWPH 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ C + W + G S C ++C + +
Sbjct: 103 CPTIREIADQSECRASWAVSTASAISDRYCTVGGGKQLRISAADLMACCKQCGDGCKGGF 162
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT---- 212
Y ++S++ + H GA YK F P T
Sbjct: 163 PGFAWLYYVEYGITSSQ--CQPYPFPHCEHRGAQGNKTPCSKYK---FDTPKCNATCTDK 217
Query: 213 AMSLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS-------GKAL 249
++ L+K+ R N + L G F V+ DL YKS G L
Sbjct: 218 SIPLVKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFL 275
Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
GG A+RI+GWG+ + YW +ANSW+TDWG NG ILRG +EC IE G P
Sbjct: 276 GGQAVRIVGWGKLNGT--PYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFP 330
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/26 (61%), Positives = 19/26 (73%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVS 32
+ CG GC GGFPG AW Y+V+ GI S
Sbjct: 152 KQCGDGCKGGFPGFAWLYYVEYGITS 177
>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 122
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/137 (39%), Positives = 72/137 (52%), Gaps = 38/137 (27%)
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
SS+ SIM E+Y++GPVE AFTV++D YKSG +
Sbjct: 4 SSDPYSIMTEVYKNGPVEVAFTVYEDFAHYKSGVY------------------------- 38
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
+ +G LGGHA++++GWG E E YWL+AN WN WGD+G FKI
Sbjct: 39 ------------KHVTGDELGGHAVKLIGWGTSEDG-EDYWLLANQWNRGWGDDGYFKIR 85
Query: 290 RGKDECGIESSITAGVP 306
RG +EC IE + AG+P
Sbjct: 86 RGTNECDIEDEVVAGMP 102
>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/329 (25%), Positives = 116/329 (35%), Gaps = 104/329 (31%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI A K G + +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFAEAKRLTGAWIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT--------------- 141
CPTIREI DQ +C + W + + G S H
Sbjct: 103 CPTIREIADQSACRASWAVSTASVISDRYCTVGGVQQLRISAAHLLSCCKQCGGGCKGGF 162
Query: 142 -----------------------PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI-- 176
P C + P K NF + + +KSI
Sbjct: 163 PGFAWRYYVEYGIASSYCQPYPFPHCEHRGAQGNKTPCSK-YNFDTPKCNATCTDKSIPL 221
Query: 177 ------------------MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
+E+Y +GP F V+ DL YKSG +
Sbjct: 222 VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVY-------------- 267
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ G LGG A++++GWG+ + YW +AN+W+T
Sbjct: 268 -----------------------RHVDGDFLGGTAVKVVGWGKLNGT--PYWKVANTWDT 302
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
DWG +G ILRG +EC IE AG P+
Sbjct: 303 DWGMDGYLLILRGNNECNIEHLGFAGTPE 331
>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 210
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 88/188 (46%), Gaps = 59/188 (31%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKC-VREC-QENYDVPYKKDLNFGAKSYSVSS 171
GC+PY I P R +C TP C +R C NY Y+ DL++ YS+S
Sbjct: 73 GCQPYSIYP----RGKGRNTCIDDDIDTPDCSIRTCTNSNYTKGYRADLHYVDTVYSLSR 128
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+E+ IM +IY++GPV+ A
Sbjct: 129 SEEDIMTDIYKNGPVQAA------------------------------------------ 146
Query: 232 GAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
F V+ D + YKSG + GGHAI+ILGWG D+ +K YWL ANSW+ WG+NG
Sbjct: 147 --FYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGVDDNTK--YWLCANSWSRSWGENG 202
Query: 285 LFKILRGK 292
LF+ILRG
Sbjct: 203 LFRILRGN 210
>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
Length = 279
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 43/89 (48%), Positives = 60/89 (67%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P+C TP+C + CQ+ Y PY++D ++G +SY+V SNE
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
K+I +EI +GPVE AF V++D + YKSG
Sbjct: 247 KAIQREIMMYGPVEAAFDVYEDFLNYKSG 275
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 63 DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
D + R P + + +++ ++P+ FDSR KWP+C +I +IRDQ CGSCW
Sbjct: 70 DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/27 (74%), Positives = 23/27 (85%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG+AW YWVK GIV+GG+
Sbjct: 155 CGDGCQGGFPGVAWDYWVKRGIVTGGS 181
>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
Length = 334
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 98/202 (48%), Gaps = 47/202 (23%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G G+ GC PY++ PC ++ G +C + C V + + KS
Sbjct: 175 GDYGTKEGCMPYKVPPC-YNKQGKNTCGGQPMERNHQCPKTCYGKTTVQNR----YKTKS 229
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
V ++ K+I +++ +GPVE +F V+DD +YKSG
Sbjct: 230 EYVMNSIKTIEQDLKTYGPVEASFDVYDDFSVYKSG------------------------ 265
Query: 227 QLGAEGAFTVFDDLILYKSGKA--LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
I K+ KA GGH+I+I+GWG+ + YWL NSW+ WG++G
Sbjct: 266 --------------IYRKTPKAKYQGGHSIKIIGWGQQNGT--PYWLAVNSWSKFWGEHG 309
Query: 285 LFKILRGKDECGIESSITAGVP 306
FKI++G++ECGIE ++TAG+P
Sbjct: 310 TFKIIKGRNECGIERAVTAGIP 331
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 24/34 (70%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
V+ D P FDSRT W +C I IRDQG+CGSCW
Sbjct: 81 VENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCW 114
>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
Length = 356
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 85/196 (43%), Gaps = 41/196 (20%)
Query: 113 WGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFGAKSYSVS 170
+GC+PY I PC+ S HTP C C N + + YK+D +FG Y+V
Sbjct: 194 FGCKPYSIYPCDKKYANGTTSVPCPGYHTPTCEEHCTSNITWPIAYKQDKHFGKAHYNVG 253
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
I EI +GPV +F ++DD YK+G +
Sbjct: 254 KKMTDIQIEIMTNGPVIASFIIYDDFWDYKTGIY-------------------------- 287
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
++ +G GG +I+GWG D + YWL + W TD+G+NG + LR
Sbjct: 288 -----------VHTAGDQEGGMDTKIIGWGVD--NGVPYWLCVHQWGTDFGENGFVRFLR 334
Query: 291 GKDECGIESSITAGVP 306
G +E IE + A +P
Sbjct: 335 GVNEVNIEHQVLAALP 350
>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Apis mellifera]
Length = 439
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 77/248 (31%), Positives = 110/248 (44%), Gaps = 36/248 (14%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRPSCDASKG 139
E LP FD+RT+W I + DQG CG+ W ++A V GT S S
Sbjct: 195 ESLPREFDARTRWRR--QISGVDDQGWCGASWAISTAQVASDRFAVMSKGT-DSVLLSAQ 251
Query: 140 HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE--------GAFT 191
H C ++ Q D Y + + + + K +YE ++ G
Sbjct: 252 HLLSCNKKGQRGCDGGYLDRAWLFMRKFGLVDEQCYPWKGVYEQCKLQKRTNLEAAGCRA 311
Query: 192 VFDDLI--LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
+ L LYK G + GNET M R+ + + V+ D Y+SG +
Sbjct: 312 PANPLRKELYKVGPAYRLGNETDIM-------REILTSGPVQATMKVYQDFFSYESGIYM 364
Query: 250 ----------GGHAIRILGWGEDEKSKE----KYWLIANSWNTDWGDNGLFKILRGKDEC 295
G H++RI+GWGED + KYWL+ NSW +WG+NGLF+I RG +EC
Sbjct: 365 HTPIAELYESGYHSVRIIGWGEDISTDSGLPIKYWLVVNSWGQEWGENGLFRIRRGINEC 424
Query: 296 GIESSITA 303
IES + A
Sbjct: 425 DIESFVVA 432
>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
Length = 356
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 90/188 (47%), Gaps = 50/188 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKSYSVS-S 171
GC+PY P + + S TP+C ++C+ Y YK+D +FG Y+V S
Sbjct: 206 GCKPYPFLP--------HTTVEYS---TPECSKKCENYQYKKAYKQDKHFGMSVYNVQFS 254
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ I EI +GPVE ++I+Y F+ G ++ W
Sbjct: 255 DPVDIQYEIMNNGPVEA------NMIVYYDFMFYKSG---VYQTVFPW------------ 293
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
LGGHA+RI+GWG D +K YWL+ANSWNTDWG++G F+I RG
Sbjct: 294 ----------------PLGGHAVRIVGWGVDGPTKVPYWLVANSWNTDWGEDGYFRIRRG 337
Query: 292 KDECGIES 299
DE IES
Sbjct: 338 TDESYIES 345
Score = 37.7 bits (86), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
Query: 84 LPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
LP +FDSR ++ C I I+DQ +CGSCW
Sbjct: 108 LPQHFDSRKQFTKCAKVIGTIQDQSNCGSCWA 139
>gi|66805843|ref|XP_636643.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
gi|60465035|gb|EAL63141.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
Length = 314
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/339 (24%), Positives = 123/339 (36%), Gaps = 104/339 (30%)
Query: 31 VSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGY------------- 77
V G++ K ++L N + KS H + N ++IG
Sbjct: 19 VCLGSFLDKPVLDDNLINSINNNKKSSWTAHRNKNFEGKTFGDIIGMMGTKKTAAPFKLT 78
Query: 78 ---SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHHVNGTR 131
E+ +P +FDSR +WP+C I I +Q CGSCW E+ C N T
Sbjct: 79 ENGEELKGSIPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTN 136
Query: 132 P---------SCD---------------------------------ASKGHTPKCVRECQ 149
P +CD A G C R C
Sbjct: 137 PGALSPQTLVACDVYGNDGCSGGIPQLAWEYMELKGLPTDSCVPYTAGNGTVYSCQRSCS 196
Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
++ D + F K+ S+ + I + I +GP+ G V++D + Y SG +
Sbjct: 197 DSEDYSLYRAKPFTLKT---CSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVY----- 248
Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKY 269
++ LGGHAI+I+GWG D+ S+ Y
Sbjct: 249 -------------------------------VMTPGSSLLGGHAIKIVGWGFDQTSQLNY 277
Query: 270 WLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
W++ANSW DWG G F I + C I S +A ++
Sbjct: 278 WIVANSWGADWGQQGFFFI--SMETCSISSDASAAEARV 314
>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 86/192 (44%), Gaps = 40/192 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
C+PY C H +C + TP C CQ Y Y+ D Y + ++E+
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKAKTWYWLPNDER 254
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I EI + GPV F +++D Y G +
Sbjct: 255 TIQLEIMKKGPVHATFNIYEDFEHYNGGVY------------------------------ 284
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG-DNGLFKILRGKD 293
++ +G GGH+I+I+GWG D+ K YWLIANSW+TDWG D G F+++RG +
Sbjct: 285 -------IHTAGAMEGGHSIKIIGWGVDKGVK--YWLIANSWSTDWGEDGGYFRVVRGIN 335
Query: 294 ECGIESSITAGV 305
C IE + AG
Sbjct: 336 NCDIEGGVLAGT 347
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 27/34 (79%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++D+P +FDSR KW +CP++R I DQ +CGSCW
Sbjct: 93 NDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWA 126
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 24/33 (72%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
+ CG+GC+GG+ AW++ +G+V+GGAY K
Sbjct: 160 KFCGYGCDGGYNARAWKWATIAGVVTGGAYKEK 192
>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
[Acyrthosiphon pisum]
Length = 129
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 79/172 (45%), Gaps = 56/172 (32%)
Query: 143 KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
+C R C N D+ Y D F Y ++ SI K++ +GP+E +
Sbjct: 3 RCTRMCYGNQDLDYDDDHRFTRDFYYLTYG--SIQKDVLNYGPIEAS------------- 47
Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG--------KALGGHAI 254
F V+DD YKSG LGGHA+
Sbjct: 48 -------------------------------FDVYDDFPSYKSGVYQRTPNATKLGGHAV 76
Query: 255 RILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+++GWG +E + YWL+ NSWN WGDNGLFKI RG DEC I+S+ TAGVP
Sbjct: 77 KLIGWGVEEGTP--YWLMVNSWNAQWGDNGLFKIRRGTDECRIDSATTAGVP 126
>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 89/303 (29%), Positives = 117/303 (38%), Gaps = 52/303 (17%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI + K G + LP R E ++ LP FD+ WP+
Sbjct: 48 KAVYNGKMQNITFSEAKRLTGARIQKSSALPPARFTE----EQLRTKLPETFDAAEHWPH 103
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ C + W + G S H C ++C +
Sbjct: 104 CPTIREIADQSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCKDCGDG----C 159
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
K A Y V S + Y EH +G T F P
Sbjct: 160 KGGFPGFAWRYYVEYGITSSSCQPYPFPRCEHQGAQGNKTPCSKY------NFDTPKCNA 213
Query: 212 T----AMSLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS------ 245
T A+ LIK+ R N + L G F V+ DL YKS
Sbjct: 214 TCTDKAIPLIKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHV 271
Query: 246 -GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
G LGG A++++GWG+ + YW +ANSW+TDWG G ILRG +EC IE AG
Sbjct: 272 DGDFLGGTAVKVVGWGKLNGT--PYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAG 329
Query: 305 VPK 307
P+
Sbjct: 330 TPE 332
Score = 42.4 bits (98), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 19/24 (79%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVS 32
CG GC GGFPG AWRY+V+ GI S
Sbjct: 155 CGDGCKGGFPGFAWRYYVEYGITS 178
>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 120/312 (38%), Gaps = 95/312 (30%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+ +N +N A LK +G + + +++++ LP +FDSRT+W +C
Sbjct: 40 EVSQNKFANYTEAQLKGLLGTVLSHQ------SGISAFTQINAALPDSFDSRTQWKDC-- 91
Query: 100 IREIRDQGSCGSCWGCRPYE------IAPCEHHVNGTRP-----SCDAS----------- 137
+ IRDQ CGSCW E + VN SCDAS
Sbjct: 92 VHPIRDQAKCGSCWAFAAVESLSDRFCIASQGKVNLVLSPQDMLSCDASNFCCFGGYLDT 151
Query: 138 ---------------------KGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
G P C +C + K A S + ++
Sbjct: 152 AWQYLEQQGVGSDSCEPYKSGNGDQPSCPSKCSNGQAI---KKYKCKAGSTKQAKGAEAT 208
Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
I + GPVE FT+++D + Y SG +
Sbjct: 209 KSLIQQSGPVETGFTIYEDFLNYNSGIYH------------------------------- 237
Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
+ +G +GGHA++ILGWG ++ E YW++ANSW DWG+ G F I +G + G
Sbjct: 238 ------HVTGGNMGGHAVKILGWG--KQGLENYWIVANSWGEDWGEKGYFNIRQG--DSG 287
Query: 297 IESSITAGVPKL 308
I+ + +P +
Sbjct: 288 IDEATFGCIPDV 299
>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 398
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 85/196 (43%), Gaps = 43/196 (21%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC PY+ PC H + P+C +CV + + V Y D F +S +
Sbjct: 245 GCWPYDFPPCAHFFKDPKYPACPKFARVNLRCVSKLRHMM-VVYFSDRYFMVESVPYHFS 303
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
I GPV F V++D + YKSG +
Sbjct: 304 ADDAKNAIRTDGPVSATFYVYEDFLAYKSGVY---------------------------- 335
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ SG LG HA++I+GWGED E YWL+ NSWN WGD+GLFKI G
Sbjct: 336 ---------KHTSGSLLGAHAVKIIGWGED--GGEAYWLVVNSWNEGWGDHGLFKIALG- 383
Query: 293 DECGIESSITAGVPKL 308
+CGI++ + G PK+
Sbjct: 384 -DCGIDNELLGGTPKV 398
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
+++ E + E +DLP +FD+RT +P C I +RDQ +CG CW
Sbjct: 125 DKVVEKVYAIEELKDLPTDFDARTAFPKCSKVIGHVRDQSACGDCWA 171
>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 75/264 (28%), Positives = 107/264 (40%), Gaps = 90/264 (34%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT---------RPS 133
++P NFD+R +WP I +RDQ SCGSCW E + G S
Sbjct: 62 NVPENFDAREQWPG--KIYPVRDQASCGSCWAHAASEAIGNRFSIKGCGKGMLSVQDLVS 119
Query: 134 CD--------------------------------ASKGHTPKCVRECQENYDV-PYKKDL 160
CD + G P C +C + YK +
Sbjct: 120 CDKGDSGCNGGSGPLSSKWLVSNGVTTEECLPYVSGNGRVPACAAKCSNGSQIIRYKYE- 178
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
A++Y+V ++I +E+ ++GPV FTV+ D + YKSG +
Sbjct: 179 --KAETYTV----QNIQEELMKNGPVYFRFTVYSDFMNYKSGVY---------------- 216
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
+KSG GGHA+ ++GWG ++ YWL+ NSW W
Sbjct: 217 ---------------------QHKSGYQEGGHAVLLIGWGVEDGVP--YWLLQNSWGPAW 253
Query: 281 GDNGLFKILRGKDECGIESSITAG 304
G+ G FKI+RGK+ECG E AG
Sbjct: 254 GEKGHFKIIRGKNECGCEQGFYAG 277
>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
Length = 220
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 75/147 (51%), Gaps = 39/147 (26%)
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
G +Y V +I EI +GPV G FT+++D+ YKSG +
Sbjct: 111 GTSAYYVGMTVSAIQTEIMTNGPVVGVFTMYEDMYKYKSGVY------------------ 152
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
+ +G+ LGGHAI+I+GWG ++ YWLIANSW T WG+
Sbjct: 153 -------------------RHTAGRLLGGHAIKIIGWG--TQNGIPYWLIANSWGTKWGE 191
Query: 283 NGLFKILRGKDECGIESSITAGVPKLD 309
NG FKI RG +ECGIE+++ AG +D
Sbjct: 192 NGFFKIRRGVNECGIENNVVAGKADVD 218
>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 87/178 (48%), Gaps = 41/178 (23%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY + PC +H G SC TP C + CQ Y Y+KD ++ Y + +E
Sbjct: 194 CKPYPLHPCGNH-GGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDE 252
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +E+ ++GPV+ A ++D Y+ G
Sbjct: 253 KAIQREMMKNGPVQAASITYEDFSFYRRG------------------------------- 281
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ ++ G+ G HA++++GWG + + KYW +ANSW+TDWG++G F+ILRG
Sbjct: 282 ------IYVHTRGRQRGAHAVKVVGWGVENGT--KYWNVANSWSTDWGEDGYFRILRG 331
Score = 38.1 bits (87), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 20/33 (60%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
R CG GCNGG AW Y + G+V+GG Y K
Sbjct: 159 RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEK 191
>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 88/296 (29%), Positives = 116/296 (39%), Gaps = 52/296 (17%)
Query: 46 LSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
+ NI + K G + LP R E ++ LP FD+ WP+CPTIREI
Sbjct: 55 MQNITFSEAKRLTGARIQKSSALPPARFTE----EQLRTKLPETFDAAEHWPHCPTIREI 110
Query: 104 RDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFG 163
DQ C + W + G S H C ++C + K
Sbjct: 111 ADQSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCKDCGDG----CKGGFPGF 166
Query: 164 AKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNETT----AM 214
A Y V S + Y EH +G T F P T A+
Sbjct: 167 AWRYYVEYGITSSSCQPYPFPRCEHQGAQGNKTPCSKY------NFDTPKCNATCTDKAI 220
Query: 215 SLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS-------GKALGG 251
LIK+ R N + L G F V+ DL YKS G LGG
Sbjct: 221 PLIKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGG 278
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
A++++GWG+ + YW +ANSW+TDWG G ILRG +EC IE AG P+
Sbjct: 279 TAVKVVGWGKLNGT--PYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPE 332
Score = 42.4 bits (98), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 19/24 (79%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVS 32
CG GC GGFPG AWRY+V+ GI S
Sbjct: 155 CGDGCKGGFPGFAWRYYVEYGITS 178
>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 273
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 111/268 (41%), Gaps = 88/268 (32%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------------GCR--------PYEI 120
+E+ PA+FD R KWP +R+QGSCGSCW G R P ++
Sbjct: 52 NENAPASFDCRQKWPG--KAEPVRNQGSCGSCWAHAASETMGFRMGIRRCSKGVMSPQDL 109
Query: 121 APCEHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYKKD 159
CE + G T C + G P C +C+ ++
Sbjct: 110 VSCESNNMGCNGGYADRVWNWIQKKGITTEQCIPYVSGSGRVPTCPSKCKNGSNIVRSFV 169
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
++G S N K++M E+ +GPV F VF+D Y+SG +
Sbjct: 170 SSWG------SFNSKTVMDEVANNGPVYACFEVFEDFYNYRSGVY--------------- 208
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+K+G++ G H + ++GWG + + YWL+ NSW +
Sbjct: 209 ----------------------QHKTGRSQGWHHVMLMGWGTE--NGVPYWLLQNSWGSG 244
Query: 280 WGDNGLFKILRGKDECGIESSITAGVPK 307
WG+ G F+I RG ++C I+ +G+PK
Sbjct: 245 WGEKGFFRIRRGTNDCHIDEIFYSGLPK 272
>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 70/150 (46%), Gaps = 38/150 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y C HH G P C + TP CV C + D+ Y D SY+V SNE
Sbjct: 66 GCRSYPFPRCSHHGKGKYPPCPKTIFDTPNCVDHCDKP-DIDYAADKTHAKSSYNVQSNE 124
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IMKEI +GPVE AF V++D I YKSG +F
Sbjct: 125 RVIMKEIMRNGPVEAAFMVYEDFIEYKSGIYF---------------------------- 156
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDE 263
+ GK LGGHAIR+LGWGE++
Sbjct: 157 ---------HSHGKLLGGHAIRMLGWGEEK 177
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 24/27 (88%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGC+GGFP AW +W+++G+V+GG+
Sbjct: 34 CGFGCHGGFPPRAWDFWMENGLVTGGS 60
>gi|321446975|gb|EFX60976.1| hypothetical protein DAPPUDRAFT_274869 [Daphnia pulex]
Length = 71
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 47/63 (74%), Positives = 52/63 (82%), Gaps = 2/63 (3%)
Query: 246 GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
GKA+GGHAIRILGWG +E YWLIAN+WNTDWGDNG K+LRGKD CGIES IT G+
Sbjct: 11 GKAVGGHAIRILGWGVEEGVP--YWLIANNWNTDWGDNGYIKLLRGKDHCGIESQITGGL 68
Query: 306 PKL 308
PKL
Sbjct: 69 PKL 71
>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 117/296 (39%), Gaps = 52/296 (17%)
Query: 46 LSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
+ NI + K G + LP R E ++ LP FD+ WP+CPTIREI
Sbjct: 55 MQNITFSEAKRLTGARIQKSSALPPARFTE----EQLRTKLPETFDAAEHWPHCPTIREI 110
Query: 104 RDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFG 163
DQ C + W + G S H C ++C + K
Sbjct: 111 ADQSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCKDCGDG----CKGGFPGF 166
Query: 164 AKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNETT----AM 214
A Y V S + Y EH +G T F P T ++
Sbjct: 167 AWRYYVEYGITSSSCQPYPFPRCEHQGAQGNKTPCSKY------NFDTPKCNATCTDKSV 220
Query: 215 SLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS-------GKALGG 251
LIK+ R N + L G F V+ DL YKS G LGG
Sbjct: 221 PLIKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRNVDGDFLGG 278
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
A++++GWG+ + YW +ANSW+TDWG +G ILRG +EC IE AG P+
Sbjct: 279 TAVKVVGWGKLNGT--PYWKVANSWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 332
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 19/24 (79%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVS 32
CG GC GGFPG AWRY+V+ GI S
Sbjct: 155 CGDGCKGGFPGFAWRYYVEYGITS 178
>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
Length = 354
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/252 (32%), Positives = 115/252 (45%), Gaps = 58/252 (23%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-CRPYEIAPCEHHVNGTRPSCDASKGHT 141
DLP NFD+RT+W C I +RDQ +CG+CW Y +A H + C A+ G T
Sbjct: 131 DLPMNFDARTQWRGC--IPAVRDQQTCGACWAFSATYVLA---HRL------CIATNGKT 179
Query: 142 PKCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
+ E Q D K G Y+ S E++ T D I Y
Sbjct: 180 NVVLSPEYQVQCDT-MNKACQGGYLKYAWSFLERT--------------GTTVDSCIPYA 224
Query: 201 SGR-FFVPGN-------ETTAMSLIKW----------TIRDNTSQLGA-EGAFTVFDDLI 241
SGR F G T +M++ K I+ G+ + FT++ D +
Sbjct: 225 SGRATFSSGTCPAKCKVSTQSMTMYKAKNSRYISGVNNIKAAIMSYGSVQSGFTIYRDFM 284
Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
Y+SG LGGHA+ ++GWG + S YWL NSW ++WG +G FKI +G E
Sbjct: 285 SYRSGVYKHVSTTTLGGHAVALIGWGVE--SGTNYWLAVNSWGSNWGMSGYFKIAQG--E 340
Query: 295 CGIESSITAGVP 306
CGIE+ + AG P
Sbjct: 341 CGIENQVYAGEP 352
>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 298
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 72/272 (26%), Positives = 107/272 (39%), Gaps = 94/272 (34%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGC----------------------RPY 118
D +P +FD R ++P+C I E+ DQGSCGSCW P
Sbjct: 71 DTKVPDSFDFREEYPHC--IPEVVDQGSCGSCWAFSSVASLGDRRCFAGLDKKAVTYSPQ 128
Query: 119 EIAPCEH----------------------HVNGTRPSCDASKGHTPKCVRECQ---ENYD 153
+ C+H N P + G C +C E
Sbjct: 129 YVVSCDHGDMACDGGWLQSVWRFLTKTGTTTNECVPYQSGTTGARGTCPTKCADGGELST 188
Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
V KK +++G IMK + GP++ AFTV+ D + Y+ G +
Sbjct: 189 VKAKKAVDYGLDC-------DLIMKALVTGGPLQTAFTVYSDFMYYEGGVY--------- 232
Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
+ SG+ GGHA+ ++G+G DE + YW+I
Sbjct: 233 ----------------------------QHMSGRVEGGHAVEMVGYGTDEYDVD-YWIIR 263
Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
NSW DWG++G F+I+R +ECGIE + G+
Sbjct: 264 NSWGPDWGEDGYFRIIRMTNECGIEEQVMGGI 295
>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
Length = 335
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 96/212 (45%), Gaps = 59/212 (27%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
I G GS GC PY++ PC + G H KC R C N V + +
Sbjct: 172 ITTGGDYGSNEGCAPYKVPPC-YDDQGEFLCQGKPTEHNHKCPRACYGNSTVENR----Y 226
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
KS V + K+I ++I ++GPVE +
Sbjct: 227 KVKSIYVLDSSKTIEQDIRKYGPVEAS--------------------------------- 253
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKA--------LGGHAIRILGWGEDEKSKEKYWLIAN 274
F V+DD I YKSG +GGH+++++GWGE++ YWL+ N
Sbjct: 254 -----------FDVYDDFITYKSGIYQKTPNAFYVGGHSVKLIGWGEEDGIP--YWLLVN 300
Query: 275 SWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
SW+ WG+ G F+I++G++ECGIE S TAGVP
Sbjct: 301 SWSKFWGEQGTFRIIKGRNECGIERSATAGVP 332
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 21/34 (61%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
+ D +FD+R W C I +RDQG+CGSCW
Sbjct: 83 NNDTIKHFDAREDWKICKQIGHVRDQGNCGSCWA 116
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GC GG P AW+Y+ + GI +GG YGS +
Sbjct: 151 CGLGCQGGNPIKAWKYFKRHGITTGGDYGSNEG 183
>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
Length = 349
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 94/195 (48%), Gaps = 47/195 (24%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY++ PC + G +C + C Y +D Y ++S E
Sbjct: 182 GCMPYKVPPC-YDEQGKNTCGGKPMERNHQCPKTC---YGKTTVQDRYKTKNEYVINSIE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I +++ +GPVE +F V+DD +YKSG
Sbjct: 238 -TIEQDLMTYGPVEASFDVYDDFSVYKSG------------------------------- 265
Query: 234 FTVFDDLILYKSGKAL--GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
I K+ KA GGH+I+I+GWGE+ + YWL NSW+ WGD+G FKI++G
Sbjct: 266 -------IYRKTPKAKYEGGHSIKIIGWGEENGT--PYWLAVNSWSKFWGDHGTFKIIKG 316
Query: 292 KDECGIESSITAGVP 306
++ECGIE ++TAG+P
Sbjct: 317 RNECGIERAVTAGIP 331
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 23/34 (67%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
V+ + P FDSR W +C I IRDQG+CGSCW
Sbjct: 81 VENNSPKQFDSRENWKSCKQIGHIRDQGNCGSCW 114
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GC GG+P AW+Y+ G+ +GG Y +K+
Sbjct: 150 CGKGCGGGYPIKAWKYFRTQGVTTGGDYDTKEG 182
>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
Length = 541
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 81/145 (55%), Gaps = 33/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE++IMKEI+E+GPV+ V D +YKSG + + T +++ ++DNT
Sbjct: 420 YRVSSNEENIMKEIFENGPVQAVMRVQPDFFVYKSGVY----SSTAIDNIVVEQVKDNTY 475
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE---KYWLIANSWNTDWGDN 283
H+++I+GWGE +KSK KYW++ NSW +WG+
Sbjct: 476 -------------------------HSVKIIGWGE-KKSKTNSGKYWIVQNSWGANWGEG 509
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+I +G +ECGIE I A P++
Sbjct: 510 GYFRIRKGVNECGIEEMILAAWPQI 534
>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
Length = 283
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 74/262 (28%), Positives = 109/262 (41%), Gaps = 88/262 (33%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GC-----RPYEIAPC 123
+P +FD+R +WPN I +RDQ CGSCW GC P ++ C
Sbjct: 64 VPESFDARDEWPN--AILPVRDQEKCGSCWAFSIAESLGDRFGILGCGKGHLSPQDLISC 121
Query: 124 EHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYKKDLNF 162
+ + G T SC + G P C C N V + +N
Sbjct: 122 DSNDLGCNGGYQENSWTWVLTTGITTESCWPYRSGSGRIPSCPHRCV-NGSVLQRNTIN- 179
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
+ S+E + E+Y +GP++ + V++D Y G +
Sbjct: 180 --NYRRLDSSE--LQDELYNNGPIQVTYVVYEDFFYYSKGIY------------------ 217
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
+ SG +GGHA+ ++GWG ++ K YWL+ NSW +WG+
Sbjct: 218 -------------------KHLSGNKVGGHAVVLMGWGIEDGVK--YWLVQNSWGYEWGE 256
Query: 283 NGLFKILRGKDECGIESSITAG 304
G F+ILRG +ECGIESS AG
Sbjct: 257 QGYFRILRGSNECGIESSAYAG 278
>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
Length = 334
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 114/262 (43%), Gaps = 46/262 (17%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQ---GSCGSCWGCRPYEIAPCEHHVNGTRPSCDA 136
V+ D P FDSR W +C I IRDQ GSC S + C G + +
Sbjct: 81 VENDSPQQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVS--TGGKFNELL 138
Query: 137 SKGHTPKCVRECQENYDVPY-----------------KKDLNFGAKSYSVSSNEKSIMKE 179
S C ++C + Y D G K Y V+ K
Sbjct: 139 SPEELAFCCKDCGNGCEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPCYNKQGKN 198
Query: 180 IYEHGPVE-------GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
P+E + D YK+ +V ++ IK +D + E
Sbjct: 199 TCGGKPMERNHQCPKTCYGKTTDQKRYKTKSEYV-------INSIKTIEQDIKTYGPVEA 251
Query: 233 AFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
+F V+DD +YKSG K GH+++I+GWG++ + YWL NSW+ WGD+G
Sbjct: 252 SFDVYDDFSVYKSGIYRKTPNAKYQNGHSVKIIGWGQENGTP--YWLAVNSWSKFWGDHG 309
Query: 285 LFKILRGKDECGIESSITAGVP 306
FKI++GK+ECGIE ++TAG+P
Sbjct: 310 TFKIIKGKNECGIERAVTAGIP 331
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 23/35 (65%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG GC GG+P AWRY+ G+ +GG Y +K+ K
Sbjct: 150 CGNGCEGGYPIKAWRYFRTQGVTTGGDYDTKEGCK 184
>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
Length = 342
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 87/201 (43%), Gaps = 56/201 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC +G +C R C + ++ Y D F Y ++
Sbjct: 187 GCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTY-- 244
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +
Sbjct: 245 ASIQKDVMTYGPIEASME------------------------------------------ 262
Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
V+DD YKSG LGGHA++++GWGE++ YWL+ NSW+ WGD GL
Sbjct: 263 --VYDDFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVP--YWLMVNSWSEMWGDKGL 318
Query: 286 FKILRGKDECGIESSITAGVP 306
FKI RG +EC +++S+TAGVP
Sbjct: 319 FKIRRGTNECSVDNSMTAGVP 339
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 27/121 (22%)
Query: 20 MAWRYWVKSGIVSG-GAYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPEL 74
M R W+ S ++ G ++QA E++ + +I K+W G++ D N P + +L
Sbjct: 1 MGARMWISSSVILLLGVCVTEQAYFLEEDFIDSI-NEKAKTWKAGINFDPNTPKEYIVKL 59
Query: 75 IG--------------YSEVDE-------DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+G Y DE +P FD+R +W C TI ++RDQG+CGSCW
Sbjct: 60 LGSKGVQVPHKLNLKMYKTDDEAYVNLFGRIPKKFDARKEWRRCITIGQVRDQGNCGSCW 119
Query: 114 G 114
Sbjct: 120 A 120
Score = 43.9 bits (102), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 23/32 (71%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
LCGF C+GG+P AW Y+ + GIV+GG Y S
Sbjct: 153 HLCGFACHGGYPIKAWSYFRRHGIVTGGDYQS 184
>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 87/201 (43%), Gaps = 56/201 (27%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC +G +C R C + ++ Y D F Y ++
Sbjct: 187 GCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTY-- 244
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SI K++ +GP+E +
Sbjct: 245 ASIQKDVMTYGPIEASME------------------------------------------ 262
Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
V+DD YKSG LGGHA++++GWGE++ YWL+ NSW+ WGD GL
Sbjct: 263 --VYDDFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVP--YWLMVNSWSEMWGDKGL 318
Query: 286 FKILRGKDECGIESSITAGVP 306
FKI RG +EC +++S+TAGVP
Sbjct: 319 FKIRRGTNECSVDNSMTAGVP 339
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 27/121 (22%)
Query: 20 MAWRYWVKSGIVSG-GAYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPEL 74
M R W+ S ++ G ++QA E++ + +I K+W G++ D N P + +L
Sbjct: 1 MGARMWISSSVILLLGVCVTEQAYFLEEDFIDSI-NEKAKTWKAGINFDPNTPKEYIVKL 59
Query: 75 IG--------------YSEVDE-------DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+G Y DE +P FD+R +W C TI ++RDQG+CGSCW
Sbjct: 60 LGSKGVQVPHKLNLKMYKTDDEAYVNLFGRIPKKFDARKEWRRCITIGQVRDQGNCGSCW 119
Query: 114 G 114
Sbjct: 120 A 120
Score = 44.3 bits (103), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 24/34 (70%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
LCGF C+GG+P AW Y+ + GIV+GG Y S +
Sbjct: 153 HLCGFACHGGYPIKAWSYFRRHGIVTGGGYQSGE 186
>gi|124487938|gb|ABN12052.1| cathepsin B endopeptidase-like protein [Maconellicoccus hirsutus]
Length = 66
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/61 (72%), Positives = 51/61 (83%)
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
LGGHAIRILGWG +K+ YWL+ANSWNTDWGD+G FKI RG +ECGIE SI AG+PKL
Sbjct: 1 LGGHAIRILGWGVCKKTNAPYWLVANSWNTDWGDHGYFKIKRGSNECGIEDSINAGIPKL 60
Query: 309 D 309
+
Sbjct: 61 N 61
>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 405
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 87/188 (46%), Gaps = 42/188 (22%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTP-KCVRECQENYDVPYKKDLNFGAKSYSVSS 171
GC+PY C HHVN T P CD+ + C ECQ++YD Y++DL +G + Y S
Sbjct: 168 GCQPYPFKHCAHHVNSTEYPPCDSVPEYKADTCSHECQKDYDRKYEEDLYYGKEQYGF-S 226
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+E I +EI +GPV +FTV++ + Y G + +T IK
Sbjct: 227 DEAPIQREIMTNGPVAVSFTVYESFLYYSGGIY-----RSTPGERIK------------- 268
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF-KILR 290
G HA+R++GWG + + KYW IANSWN WG L
Sbjct: 269 ------------------GYHAVRVVGWGVENGT--KYWKIANSWNEQWGRERLLPHTPA 308
Query: 291 GKDECGIE 298
G DE IE
Sbjct: 309 GVDESDIE 316
Score = 45.1 bits (105), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 79 EVDEDLPANFDSRTKWPNCPTI-REIRDQGSCGSCWG 114
++ E++P +FD+ KWP C + IRDQ +CGSCW
Sbjct: 67 DLSEEIPESFDAAEKWPECAEVFNNIRDQSNCGSCWA 103
>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/303 (29%), Positives = 116/303 (38%), Gaps = 52/303 (17%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI + K G + L R E ++ LP FD+ WP+
Sbjct: 48 KAVYNGKMQNITFSEAKRLTGARIQKSSGLQPARFTE----EQLRTKLPETFDAAEHWPH 103
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ C + W + G S H C ++C +
Sbjct: 104 CPTIREIADQSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCKDCGDG----C 159
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
K A Y V S + Y EH +G T F P
Sbjct: 160 KGGFPGFAWRYYVEYGITSSSCQPYPFPRCEHQGAQGNKTPCSKY------NFDTPKCNA 213
Query: 212 T----AMSLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS------ 245
T A+ LIK+ R N + L G F V+ DL YKS
Sbjct: 214 TCTDKAIPLIKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHV 271
Query: 246 -GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
G LGG A++++GWG+ + YW +ANSW+TDWG G ILRG +EC IE AG
Sbjct: 272 DGDFLGGTAVKVVGWGKLNGT--PYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAG 329
Query: 305 VPK 307
P+
Sbjct: 330 TPE 332
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 19/24 (79%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVS 32
CG GC GGFPG AWRY+V+ GI S
Sbjct: 155 CGDGCKGGFPGFAWRYYVEYGITS 178
>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 87/300 (29%), Positives = 122/300 (40%), Gaps = 48/300 (16%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI + K G + + +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFSEAKRLTGAWIQKNSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ +C + W + G S H C ++C +
Sbjct: 103 CPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAAHLLSCCKQCGGGCKGGF 162
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
A Y V S + Y EH +G T + +F P T
Sbjct: 163 PG----FAWRYYVEYGIASSYCQPYPFPQCEHQGAQGNKTPCSNY------KFVTPQCNT 212
Query: 212 T----AMSLIKWTIRDNTSQLGAEGAFT--------------VFDDLILYKS-------G 246
T + LIK+ +D L E F V+ DL YKS G
Sbjct: 213 TCTDKTIPLIKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDG 272
Query: 247 KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+G A++++GWG+ + YW +AN+W+TDWG +G ILRG +EC IE AG P
Sbjct: 273 SYMGVTAVKVVGWGKLNGT--PYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTP 330
>gi|395528577|ref|XP_003766405.1| PREDICTED: dipeptidyl peptidase 1-like [Sarcophilus harrisii]
Length = 568
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 98/249 (39%), Gaps = 78/249 (31%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q +CGSC+ P EI C + G
Sbjct: 352 VSPVRNQANCGSCYAFASLGMLESRIRIKTNNSQVPVLSPQEIVSCSEYSQGCEGGFPYL 411
Query: 130 -----------TRPSCDASKGHTPKCV-RECQENYDVPYKKDLNFGAKSYSVSSNEKSIM 177
C + + C ++C Y Y F NE +
Sbjct: 412 IGGKYAQDFGLVEEECFPYQAYDSPCTPKKCSRYYTSEYHYVGGFYG-----GCNEALMK 466
Query: 178 KEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVF 237
E+ ++GP+ AF V+DD I Y++G + G +RDN F F
Sbjct: 467 HELIQNGPLTVAFEVYDDFIHYRTGIYHHTG------------LRDN---------FNPF 505
Query: 238 DDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGI 297
+ L HA+ ++G+G DEK+ E YW++ NSW T WG+NG F+ILRG DEC I
Sbjct: 506 E----------LTNHAVLLVGYGTDEKTGEDYWIVKNSWGTSWGENGYFRILRGTDECAI 555
Query: 298 ESSITAGVP 306
ES A P
Sbjct: 556 ESIAVAATP 564
>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
Length = 310
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 79/291 (27%), Positives = 109/291 (37%), Gaps = 97/291 (33%)
Query: 46 LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
+N K +GV P P L + + DLP FD+RT+W +C TI I D
Sbjct: 62 FANYTIEQFKHILGVKPT---PPGLLAGVPIKIHPEMDLPKEFDARTQWSSCSTIGNILD 118
Query: 106 QGSCGSCWGCRPYEIAP-------------------------CEHHVNGTRP-------- 132
QG CG+CW E C NG P
Sbjct: 119 QGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFR 178
Query: 133 -------SCDASKGHT-------------PKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
CD T PKC R+C+ + +K++ +F +Y V SN
Sbjct: 179 RSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCKVE-NQAWKENKHFSVNAYRVHSN 237
Query: 173 EKSIMKEIYEHGPVEGAFTVFD--DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
IM E+Y++GPVE AFT D YKSG +
Sbjct: 238 PHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVY-------------------------- 271
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
+ +G +GGHA++++GWG + + E YWL+AN WN WG
Sbjct: 272 -----------KHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWG 310
Score = 40.4 bits (93), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 21/25 (84%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVS 32
LCG GCNGG+P AWRY+ +SG+V+
Sbjct: 160 LCGSGCNGGYPISAWRYFRRSGVVT 184
>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
Length = 335
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 96/214 (44%), Gaps = 59/214 (27%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
R I G GS GC PY++ PC + G H KC R C N V +
Sbjct: 170 RGITTGGDYGSNEGCAPYKVPPC-YDDQGEFLCQGKPTEHNHKCPRACYGNSTVENR--- 225
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+ +S V + K+I ++I +GPVE +
Sbjct: 226 -YKVESIYVLDSFKTIEQDIRTYGPVEAS------------------------------- 253
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLI 272
F V+DD I YKSG +GGH+++++GWGE++ YWL+
Sbjct: 254 -------------FDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWGEEDGIP--YWLL 298
Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
NSW+ WG+ G F+I++G++ECGIE S TAG+P
Sbjct: 299 VNSWSKFWGEQGTFRIIKGRNECGIERSATAGIP 332
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GC GG P AW+Y+ + GI +GG YGS +
Sbjct: 151 CGLGCQGGNPIKAWKYFKRRGITTGGDYGSNEG 183
Score = 40.4 bits (93), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 87 NFDSRTKWPNCPTIREIRDQGSCGSCW 113
+FD+R W C I +RDQG+CGSCW
Sbjct: 89 HFDARENWKICKQIGHVRDQGNCGSCW 115
>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
Length = 194
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 58/166 (34%), Positives = 83/166 (50%), Gaps = 47/166 (28%)
Query: 115 CRPYEIAPCEHHVNGTRP---SC-DASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
CRPYE PC H G P C D++K TPKC + CQ Y PYK+D +FG +Y +
Sbjct: 72 CRPYEFPPCGRH--GKEPYYGECYDSAK--TPKCQKTCQRGYLKPYKEDKHFGKSAYRLP 127
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+N K+I ++I ++GPV F V++D YKSG
Sbjct: 128 NNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSG---------------------------- 159
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSW 276
+ + +G+ GGHA++I+GWG++ + YWLIANSW
Sbjct: 160 ---------IYKHTAGRMTGGHAVKIIGWGKEXGT--PYWLIANSW 194
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG+GC GG+P AW+Y+ G+V+GG Y
Sbjct: 39 CGYGCEGGWPMKAWQYFXLEGVVTGGNY 66
>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
Length = 142
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 82/168 (48%), Gaps = 53/168 (31%)
Query: 146 RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFF 205
++C+ V +K +F +Y V+S+ IM E+Y++GPVE A
Sbjct: 1 KKCKVQNQVWLEKK-HFSVNAYRVNSDPHDIMAEVYQNGPVEVA---------------- 43
Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILG 258
FTV++D YKSG +GGHA++++G
Sbjct: 44 ----------------------------FTVYEDFAHYKSGVYKHITGGMMGGHAVKLIG 75
Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
WG + + E YWL+AN WN WGD+G FKI+RG +ECGIE + AG+P
Sbjct: 76 WGTTD-AGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMP 122
>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
Length = 462
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 76/275 (27%), Positives = 108/275 (39%), Gaps = 85/275 (30%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGH 140
+ LP +FD+ WP I ++RDQG CGS W +A + + R + +
Sbjct: 183 DHLPTHFDATNYWPG--FIGKVRDQGWCGSSWAVSTASVASDRFAILSKGRETVQLAPQQ 240
Query: 141 TPKCVRECQ--------------------------------------------ENYDVPY 156
CVR Q N ++P
Sbjct: 241 IVSCVRRSQGCSGGHLDTAWSYLRKVGTVNEECYPYISAHNVCKIRPSDTLITANCELPM 300
Query: 157 KKDLNFGAK---SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
K D K ++S++ NE IM EI +HGPV+ V D YKSG + T+A
Sbjct: 301 KVDRTNMYKMGPAFSLN-NETDIMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHSAASTSA 359
Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWL 271
+ G H++R++GWGE+ E KYW+
Sbjct: 360 --------------------------------DQRAGYHSVRLIGWGEERHGYEVTKYWI 387
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
NSW T WG+NG F+ILRG +EC IES + A +P
Sbjct: 388 AVNSWGTWWGENGRFRILRGSNECEIESYVLASLP 422
>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 344
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 77/301 (25%), Positives = 118/301 (39%), Gaps = 91/301 (30%)
Query: 12 GCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRL 71
GC GG AW + GIV+GG + +P+ + + G P Y+ P
Sbjct: 132 GCQGGIARAAWSFLKMHGIVTGGDF------------VPKGSMSAADGCWP-YSFPKC-- 176
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPC-EHHVNGT 130
A+ +K+ CP +R + P E H G
Sbjct: 177 --------------AHDQEDSKYEPCPEVR------------------VPPLGERHQRGA 204
Query: 131 RPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYS-VSSNEKSIMKEIYEHGPVEG 188
S TP C+ C E Y P KD +F A++ + +I KEI +GP
Sbjct: 205 GASIHQKLYDTPSCLDRCPNEKYGTPRDKDRHFTARALPYLFEGTDNIKKEIMTNGPTSA 264
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
+F+ ++D YKSG + + SG
Sbjct: 265 SFSTYEDFSSYKSGVY-------------------------------------KHTSGGY 287
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
LG H++ I+GWG ++ YWL+ NSWN WGD+G FKI +G +CGI+ ++ +P +
Sbjct: 288 LGDHSVEIIGWGTEKGVD--YWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLPAM 343
Query: 309 D 309
+
Sbjct: 344 N 344
Score = 37.7 bits (86), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 83 DLPANFDSRTKWPNCP-TIREIRDQGSCGSCWGCRPYE 119
D+P++FD+R + C I + DQ +CGSCW P E
Sbjct: 58 DIPSSFDARDAFKECKDVIGHVWDQSACGSCWAIAPVE 95
>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 87/300 (29%), Positives = 121/300 (40%), Gaps = 48/300 (16%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI + K G + +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFSEAKRLTGAWIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
CPTIREI DQ +C + W + G S H C ++C +
Sbjct: 103 CPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAAHLLSCCKQCGGGCKGGF 162
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
A Y V S + Y EH +G T + +F P T
Sbjct: 163 PG----FAWRYYVEYGIASSYCQPYPFPQCEHHGAQGNKTPCSNY------KFVTPQCNT 212
Query: 212 T----AMSLIKWTIRDNTSQLGAEGAFT--------------VFDDLILYKS-------G 246
T + LIK+ +D L E F V+ DL YKS G
Sbjct: 213 TCTDKTIPLIKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDG 272
Query: 247 KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+G A++++GWG+ + YW +AN+W+TDWG +G ILRG +EC IE AG P
Sbjct: 273 SYMGVTAVKVVGWGKLNGT--PYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTP 330
>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
Length = 463
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 120/291 (41%), Gaps = 87/291 (29%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------------- 113
P ++ + + E LP +FD+ T WP I E++DQG CGS W
Sbjct: 169 PKFKVKSMSRLTNGQEHLPTHFDATTYWPG--FIGEVKDQGWCGSSWALSTASVASDRFA 226
Query: 114 ----GCRPYEIAPCEHHVNGTRPSCDASKGHTPKC---VR-------EC----------- 148
G ++AP + ++ R S S GH VR EC
Sbjct: 227 ILSKGREIVQLAP-QQIISCVRRSQGCSGGHLDTAWNYVRKVGTVNDECYPYISAQNACK 285
Query: 149 --------QENYDVPYKKDLNFGAK---SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
N D+P K D K ++S++ NE IM EI +HGPV+ V D
Sbjct: 286 IRPSDTLITANCDLPTKVDRTNMYKMGPAFSLN-NETDIMIEIKKHGPVQAILRVHRDFF 344
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
YKSG + S G E A G H++R++
Sbjct: 345 SYKSGIYR----------------HSAASSAGDERA----------------GYHSVRLI 372
Query: 258 GWGEDEKSKE--KYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
GWGE+ E KYW+ NSW WG+NG F+I+RG++EC IES + A +P
Sbjct: 373 GWGEERNGYETTKYWVAVNSWGRWWGENGRFRIVRGQNECEIESYVLASLP 423
>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 85/192 (44%), Gaps = 40/192 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
C+PY C H +C + TP CQ Y Y+ D Y + ++E+
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCPSHPYATPARKPYCQYGYGKRYENDKIKARTWYWLPNDER 254
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I EI + GPV F +++D Y G +
Sbjct: 255 TIQLEIMQKGPVHATFNIYEDFEHYNGGVY------------------------------ 284
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG-DNGLFKILRGKD 293
++ +G GGH+I+I+GWG D+ K YWLIANSW+TDWG D G F+++RG +
Sbjct: 285 -------IHTAGAMEGGHSIKIIGWGVDKGVK--YWLIANSWSTDWGEDGGYFRVVRGIN 335
Query: 294 ECGIESSITAGV 305
C IE + AG
Sbjct: 336 NCDIEGGVLAGT 347
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
N LP I ++D+P +FDSR KW +CP++R I DQ +CGSCW
Sbjct: 83 NVLP--IANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWA 126
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 24/33 (72%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
+ CG+GC+GG+ AW++ +G+V+GGAY K
Sbjct: 160 KFCGYGCDGGYNARAWKWATIAGVVTGGAYKEK 192
>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
niloticus]
Length = 499
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 76/149 (51%), Gaps = 32/149 (21%)
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
Y D+ Y +SSNEK IMKEI ++GPV+ V +D +YK+G + + T +S
Sbjct: 356 YHNDIYQSTPPYRLSSNEKEIMKEIMDNGPVQAIMEVHEDFFVYKTGIY-----KHTDVS 410
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLI 272
K + G H++RI GWGED + KYW+
Sbjct: 411 FTK------------------------PPQYRKHGTHSVRITGWGEDRNVDGTSRKYWIA 446
Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
ANSW +WG+NG F+I+RG++EC IE+ +
Sbjct: 447 ANSWGKNWGENGYFRIVRGENECEIETFV 475
>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
Length = 112
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 78/158 (49%), Gaps = 53/158 (33%)
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
Y++D G SY+V E IM EI ++GPV+G
Sbjct: 1 YEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGI-------------------------- 34
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEK 268
F +F+D ++YKSG + +GGHAIR++GWG + + K
Sbjct: 35 ------------------FYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVE--NGVK 74
Query: 269 YWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
YWLIANSWN WG+ G F++ RG +ECGIE+ I AG+P
Sbjct: 75 YWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 112
>gi|239793607|dbj|BAH72912.1| ACYPI000019 [Acyrthosiphon pisum]
Length = 188
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 94/197 (47%), Gaps = 44/197 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGH---TPKCVREC-QENYDVPYKKDLNFGAKSYSV 169
GC+PY I PC+ +N P + H TP C ++C NY ++ D+ + K Y +
Sbjct: 31 GCQPYTIPPCKL-MNEKPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDI-YKGKYYKL 88
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
S MK+I+++GP+ F ++ DL+ YKSG + Q
Sbjct: 89 SP--YMAMKDIFDNGPITTQFYMYRDLVDYKSGVY----------------------QYD 124
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
+ F F H+++I GWGE+ + YWL+ANS+ TDWG NG FKI
Sbjct: 125 EQSDFDFFT------------VHSVKIFGWGEE--NGVPYWLVANSFGTDWGYNGTFKIS 170
Query: 290 RGKDECGIESSITAGVP 306
RG D C + + AG+P
Sbjct: 171 RGNDGCFFQEKMYAGLP 187
>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
latipes]
Length = 474
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 80/162 (49%), Gaps = 34/162 (20%)
Query: 143 KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
+ + C Y+ Y D+ Y +SSNEK IMKEI E+GPV+ V +D +YK+G
Sbjct: 320 QATQRCPNTYN--YHNDIYQSTPPYKLSSNEKEIMKEIMENGPVQAIMEVHEDFFVYKNG 377
Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED 262
+ + T +S K + G H++RI GWGED
Sbjct: 378 IY-----KHTDVSSTK------------------------PPQYRKHGTHSVRITGWGED 408
Query: 263 EK---SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
+ + KYW+ ANSW +WG+NG F+I RG +EC IE+ +
Sbjct: 409 KDYDGTPRKYWIAANSWGKNWGENGFFRIARGANECEIEAFV 450
>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
Length = 315
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 96/204 (47%), Gaps = 44/204 (21%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGH---TPKCVREC-QENYDVPYKKDLNF 162
G S GC+PY I PC+ +N P + H TP C ++C NY ++ D+ +
Sbjct: 151 GDYNSNQGCQPYTIPPCKL-MNEKPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDI-Y 208
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
K Y +S MK+I+++GP+ F ++ DL+ YKSG +
Sbjct: 209 KGKYYKLS--PYMAMKDIFDNGPITTQFYMYRDLVDYKSGVY------------------ 248
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
Q + F F H+++I GWGE+ + YWL+ANS+ TDWG
Sbjct: 249 ----QYDEQSDFDFFTV------------HSVKIFGWGEE--NGVPYWLVANSFGTDWGY 290
Query: 283 NGLFKILRGKDECGIESSITAGVP 306
NG FKI RG D C + + AG+P
Sbjct: 291 NGTFKISRGNDGCFFQEKMYAGLP 314
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 18/26 (69%), Positives = 21/26 (80%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSC 109
LP NFDSR KWPNCP+I I +QG+C
Sbjct: 61 LPINFDSRKKWPNCPSIGHIYNQGNC 86
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 25/45 (55%), Gaps = 4/45 (8%)
Query: 1 MYTQQI----RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
M QQI LCG GC+GG +W Y+ + G VSGG Y S Q
Sbjct: 114 MSAQQIISCCYLCGHGCDGGSLFESWDYYRRHGFVSGGDYNSNQG 158
>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 96
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/132 (40%), Positives = 69/132 (52%), Gaps = 43/132 (32%)
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I KEI ++GPVE F V++D + YKSG
Sbjct: 2 AIQKEIMKYGPVEANFIVYEDFLNYKSG-------------------------------- 29
Query: 235 TVFDDLILYK--SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+YK +GK HAIRI+GWGE+ + YWLI NSWN DWG+NG F+ILRG+
Sbjct: 30 -------IYKHITGKLFSWHAIRIIGWGEENNT--PYWLIPNSWNEDWGENGNFRILRGR 80
Query: 293 DECGIESSITAG 304
EC IES +TAG
Sbjct: 81 HECSIESEVTAG 92
>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 334
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/289 (26%), Positives = 111/289 (38%), Gaps = 99/289 (34%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPY-------------------- 118
E+D + FD+R +WP+C TI E+ + G+ W P
Sbjct: 85 EIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWAYVPTGVFADRMCIATNGTYNQLLS 144
Query: 119 --EIAPC------------EHHV------------------NGTRPSCDASKGHTPK--- 143
E+ C +++V NG +PS G+ P
Sbjct: 145 TEELISCSGIKEDEFGSVNDYYVWEYLKNHGLVSGGKYNTNNGCQPSKIPPIGNLPTGLY 204
Query: 144 ---CVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD-DLILY 199
C + C N + Y +D Y + + I +E+ +GPV AF VFD D LY
Sbjct: 205 ENTCEKRCYGNNTINYNQDHVKIKNHYDIEY--EDIQREVQNYGPVSMAFKVFDNDFFLY 262
Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
KSG + +TT I+W +++GW
Sbjct: 263 KSGVY----EKTTNSEFIQW--------------------------------QYAKLIGW 286
Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
G + + YWL+ N W +WG NGLFKI RG DEC IE+ + AG P+L
Sbjct: 287 GVE--NGVDYWLLVNFWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQL 333
>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
Length = 226
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 72/152 (47%), Gaps = 38/152 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G PSC TP+C R+CQ+ Y PY+ D ++G S +V NE
Sbjct: 109 GCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNE 168
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I KEI +GPVE +F+D + YKSG +
Sbjct: 169 SAIQKEIMMYGPVEAYLLIFEDFLNYKSGIY----------------------------- 199
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWG-EDEK 264
Y +G +G H +RI+GWG E+E+
Sbjct: 200 --------RYTTGSFVGEHYVRIIGWGIENER 223
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/27 (70%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC+GGFPG AW YWV GIV+GG+
Sbjct: 77 CGSGCDGGFPGPAWDYWVSHGIVTGGS 103
>gi|290984292|ref|XP_002674861.1| cathepsin C [Naegleria gruberi]
gi|284088454|gb|EFC42117.1| cathepsin C [Naegleria gruberi]
Length = 569
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/229 (31%), Positives = 99/229 (43%), Gaps = 31/229 (13%)
Query: 88 FDSRTKWPNCPTIRE---IRDQGSCG----SCWGCRPYEIAPCEHHVNGTRPSCDASKG- 139
+SR + + +RE ++D SC C G PY + N SC KG
Sbjct: 358 IESRIRIQSRNNVREPLAVQDIVSCSPYAQKCHGGIPYAVGRHLRDFNLVPESCFPYKGS 417
Query: 140 HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILY 199
C +C+ N + K Y SN ++MKEIYEHGP+ ++ ++ D Y
Sbjct: 418 ENVACSSKCK-NPEYIVKVTKYRYVSDYYGGSNYANMMKEIYEHGPISASYLIYPDFKYY 476
Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
G + G T R N G E H++ I GW
Sbjct: 477 SKGIYKHSGKGYPMK-----TDRINREMNGWEPT-----------------THSVVITGW 514
Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
GED K+ EKYW + NSW+ WG+NG F+I RG DEC IE+ A P++
Sbjct: 515 GEDPKTGEKYWNVLNSWSESWGENGRFRIKRGNDECAIEAEGVAFYPEV 563
>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
Length = 334
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/194 (31%), Positives = 96/194 (49%), Gaps = 45/194 (23%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS-YSVSSN 172
GC PY++ PC ++ G +C + C V + + KS YS++S
Sbjct: 182 GCMPYKVPPC-YNKQGKNTCGGQPMERNHQCPKTCYGKTTVQNR----YKTKSEYSINSI 236
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
K+I +++ +GPVE +F V+DD +YKSG I T + EG
Sbjct: 237 -KTIEQDLKTYGPVEASFDVYDDFSVYKSG------------------IYRKTPKAKYEG 277
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
H+I+I+GWG++ + YWL NSW+ WG++G FKI++G+
Sbjct: 278 R------------------HSIKIIGWGQENGT--TYWLAVNSWSKFWGEHGTFKIIKGR 317
Query: 293 DECGIESSITAGVP 306
+ECGIE ++TAG+P
Sbjct: 318 NECGIERAVTAGIP 331
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 24/34 (70%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
V+ D P FDSRT W +C I IRDQG+CGSCW
Sbjct: 81 VENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCW 114
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG+P AW+Y+ G+ +GG Y +K+
Sbjct: 150 CGQGCGGGYPIKAWKYFRTQGVTTGGDYDTKE 181
>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 334
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 113/289 (39%), Gaps = 99/289 (34%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPY-------------------- 118
E+D + FD+R +WP+C TI E+ + G+ W P
Sbjct: 85 EIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWAYVPTGVFADRMCIATNGTYNQLLS 144
Query: 119 --EIAPC--------------------EHH--VNG----TRPSCDASK----GHTP---- 142
E+ C ++H V+G T C SK G+ P
Sbjct: 145 TEELISCSGIKEDEFGSVNDDYVWEYLKNHGLVSGGKYNTNNGCQPSKIPPIGNLPTGLY 204
Query: 143 --KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD-DLILY 199
C + C N + Y +D Y + + I +E+ +GPV AF VFD D LY
Sbjct: 205 ENTCEKRCYGNNTINYNQDHVKIKNHYDIEY--EDIQREVQNYGPVSMAFRVFDNDFFLY 262
Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
KSG + +TT I+W +++GW
Sbjct: 263 KSGVY----EKTTNSEFIQW--------------------------------QYAKLIGW 286
Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
G + + YWL+ NSW +WG NGLFKI RG DEC IE+ + AG P+L
Sbjct: 287 GVE--NGVDYWLLVNSWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQL 333
>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
Length = 332
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 92/193 (47%), Gaps = 43/193 (22%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY+I PC G +C + C + V + + K+ V ++
Sbjct: 182 GCAPYKIPPCFDQ-KGKNTCAGKPLERNHQCPKTCYGSTTVQKR----YKVKNEYVLNSP 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
++ +++ ++GP+E +F +FDDL YKSG I T +
Sbjct: 237 NTMEQDLIKYGPIEASFNLFDDLSAYKSG------------------IYQKTPK------ 272
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
K L GH+I+I+GWG++ + YWL NSW+ WG+ G F+I++G++
Sbjct: 273 ------------AKFLSGHSIKIIGWGKE--NGVPYWLAVNSWSKFWGEQGTFRIIKGRN 318
Query: 294 ECGIESSITAGVP 306
ECGIE S TAG+P
Sbjct: 319 ECGIERSATAGIP 331
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 39/77 (50%), Gaps = 4/77 (5%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPA--NFDSRTKWPNC 97
+AE+ +N + ++ +G N + E+ Y + E+ + FDSR W +C
Sbjct: 41 KAERFFPANTSKEYIMGLLGSRGYTNYSSEV--EIKTYDPLYEENASVEQFDSRENWKSC 98
Query: 98 PTIREIRDQGSCGSCWG 114
I IRDQG+CGSCW
Sbjct: 99 KQIGRIRDQGNCGSCWA 115
Score = 38.9 bits (89), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG+P AW+Y+ G+ +GG Y SK+
Sbjct: 150 CGKGCEGGYPIKAWQYFRTQGVPTGGDYDSKE 181
>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
floridanus]
Length = 443
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 72/249 (28%), Positives = 105/249 (42%), Gaps = 43/249 (17%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTP 142
LP F+SRT+WP I +I DQG CG+ W ++A + + + S H
Sbjct: 203 LPREFNSRTRWPR--DISDIHDQGWCGASWAVSTADVASDRFAIMSKGAETVELSAQHLL 260
Query: 143 KCVRECQENYDVPYKKDLNFGAKSYSVSSNE---------------KSIMKEIYEHGPVE 187
C Q+ Y + + + E +S +K P
Sbjct: 261 SCNNRGQQGCKGGYLDRAWLFMRKFGLVDEECYPWTGRNDQCRLRKRSNLKTAGCQNPPN 320
Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG- 246
T LYK G + GNET M I + + V+ D +Y+SG
Sbjct: 321 SLRTE-----LYKVGPAYRLGNETDIMQEI-------LTSGPVQATMRVYQDFFVYQSGV 368
Query: 247 ---------KALGGHAIRILGWGEDEKSK---EKYWLIANSWNTDWGDNGLFKILRGKDE 294
G H++RI+GWGE+ + KYWL+ANSW +WG+NGLF+I +G +E
Sbjct: 369 YRHSRSAELHDSGYHSVRIIGWGEEPSYRGPPLKYWLVANSWGHNWGENGLFRIQKGTNE 428
Query: 295 CGIESSITA 303
C IES + A
Sbjct: 429 CEIESYVLA 437
>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
Length = 443
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/249 (29%), Positives = 105/249 (42%), Gaps = 43/249 (17%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSC-DASKGHTP 142
LP FDSRT+W I I DQG CG+ W ++A + + + S
Sbjct: 203 LPREFDSRTRWSR--DISGIHDQGWCGASWAVSTADVASDRYSIMSKGAEAPELSAQQLL 260
Query: 143 KCVRECQE---------------NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
C Q+ + + K+ + K+ ++S +K P
Sbjct: 261 SCNNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWSGKNDQCKLRKRSTLKAAGCRKPSH 320
Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG- 246
T LYK G + GNET M I + + V+ D +YKSG
Sbjct: 321 PLRTE-----LYKVGPAYRLGNETDIMQEI-------LTSGPVQATMRVYQDFFIYKSGI 368
Query: 247 ---------KALGGHAIRILGWGEDEKSK---EKYWLIANSWNTDWGDNGLFKILRGKDE 294
G H++RI+GWGE+ + KYWL+ANSW +WGDNGLFKI +G +E
Sbjct: 369 YRHSRSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVANSWGYNWGDNGLFKIQKGTNE 428
Query: 295 CGIESSITA 303
C IES + A
Sbjct: 429 CEIESYVLA 437
>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
Length = 325
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 83/253 (32%), Positives = 113/253 (44%), Gaps = 56/253 (22%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASK 138
E D+P NFD+RT+W C + IRDQ +CG+CW A ++V R C A+
Sbjct: 98 ETRMDIPMNFDARTQWRGC--VPAIRDQQTCGACW-------AFSANYVLAHR-LCIATN 147
Query: 139 GHTPKCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
G T + E Q D K G YS + +E T D I
Sbjct: 148 GQTNVVLSPEYQVQCDT-MNKACQGGYLKYSWTF--------------LENTGTPLDTCI 192
Query: 198 LYKSGR-FFVPGN-----ETTAMSLIKWTIRDNTSQLG-------------AEGAFTVFD 238
Y SGR F G + +MS+ K+ ++ G + FTV+
Sbjct: 193 PYASGRGTFSSGTCPTQCKIASMSMSKYKAKNTRYITGINNIKTAIMTYGSVQAGFTVYR 252
Query: 239 DLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
DL YKSG LGGHA+ ++G+G + S YWL ANSW +WG +G FKI +G
Sbjct: 253 DLTGYKSGVYKHVVSTVLGGHAVALIGFGVEGGSN--YWLAANSWGANWGMSGYFKIAQG 310
Query: 292 KDECGIESSITAG 304
E GIE+ + AG
Sbjct: 311 --EGGIENQVYAG 321
>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
Length = 392
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/242 (28%), Positives = 98/242 (40%), Gaps = 69/242 (28%)
Query: 83 DLPANFDSRTKWPNCP-----------------------TIREIRDQGSCGSCWGCRPYE 119
D+P +FD+R + C + I +GS + GC PY
Sbjct: 133 DIPNSFDARDAFKECKDVIGHVCCDGCTKGRPDAAWSFLNVYGIATEGSMSAADGCWPYN 192
Query: 120 IAPCEHHVNGTR-PSCDASKGHTPKCVREC-QENYDVPYKKDLNFGA--KSYSVSSNEKS 175
C HH ++ C TP C+ C +NY P KD +F A Y + + +
Sbjct: 193 FPKCGHHQQDSKYQPCPEKNYDTPPCLDRCPNKNYGTPLDKDRHFTAHFSPYQLKGTD-N 251
Query: 176 IMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
I KEI +GP AF+++DD + Y+SG +
Sbjct: 252 IKKEIMTNGPTSAAFSMYDDFLSYESGVY------------------------------- 280
Query: 236 VFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
+ SG +G H + I+GWG K YWL+ NSWN WG +G FKI +G +C
Sbjct: 281 ------KHTSGTLMGEHGVEIIGWG--TKQGVDYWLVMNSWNEGWGVHGTFKIAQG--DC 330
Query: 296 GI 297
GI
Sbjct: 331 GI 332
>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 244
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 91/207 (43%), Gaps = 60/207 (28%)
Query: 114 GCRPYEIAPCEHHVNGT--RPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSY-SV 169
GC PY C HH +G+ +P C TP C C Y + KD ++ + S
Sbjct: 87 GCWPYSFPKCAHHQDGSDYKP-CAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSR 145
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
+ SI KEI +GP A
Sbjct: 146 FGSTSSIKKEIMTNGPTSAA---------------------------------------- 165
Query: 230 AEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
F+V++D + YKSG LGGHA+ I+GWG ++ YWL+ NSWN +WGD
Sbjct: 166 ----FSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVD--YWLVMNSWNEEWGD 219
Query: 283 NGLFKILRGKDECGIESSITAGVPKLD 309
+G FKI++G +CGI+ +I AG P ++
Sbjct: 220 HGTFKIVQG--DCGIDDTILAGTPAMN 244
>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
Length = 334
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 94/208 (45%), Gaps = 59/208 (28%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G G+ GC PY++ PC ++ G +C + C V + + KS
Sbjct: 175 GDYGTKEGCMPYKVPPC-YNKQGKNTCGGQPMERNHQCPKTCYGKTTVQNR----YKTKS 229
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
V ++ K+I ++I +GPV
Sbjct: 230 EYVINSIKTIERDIMTYGPV---------------------------------------- 249
Query: 227 QLGAEGAFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
E +F V+DDL YKSG K GGH+I+I+GWG+ + YWL NSW+
Sbjct: 250 ----EASFDVYDDLSAYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTP--YWLAVNSWSK 303
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
WG++G FKI++G++ECGIE ++TAG+P
Sbjct: 304 FWGEHGTFKIIKGRNECGIERAVTAGIP 331
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 24/34 (70%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
V+ D P FDSRT W +C I IRDQG+CGSCW
Sbjct: 81 VENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCW 114
>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 520
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 89/193 (46%), Gaps = 36/193 (18%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFGAKSYSVSSN 172
C P+ H N P+C T + R+ P + ++ +Y +SSN
Sbjct: 343 CYPFSNQETNHSPNA--PACMMHSRSTGRGKRQAIARCPNPRSHANEIYQSTPAYRLSSN 400
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
EK IMKE+ E+GPV+ V +D +Y++G + R G
Sbjct: 401 EKEIMKELMENGPVQAILEVHEDFFMYRTGIY-----------------RHTAVAAGKPE 443
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLIANSWNTDWGDNGLFKIL 289
+ + G H+++I GWGE++ S +KYW+ ANSW DWG++G F+I
Sbjct: 444 QY------------RRHGTHSVKITGWGEEQMPDGSNQKYWIAANSWGKDWGEHGYFRIT 491
Query: 290 RGKDECGIESSIT 302
RG++EC IE+ +
Sbjct: 492 RGENECEIETFVV 504
>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
Length = 362
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 83/253 (32%), Positives = 113/253 (44%), Gaps = 56/253 (22%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASK 138
E D+P NFD+RT+W C + IRDQ +CG+CW A ++V R C A+
Sbjct: 135 ETRIDIPMNFDARTQWKGC--VPAIRDQQTCGACW-------AFSANYVLAHR-LCIATN 184
Query: 139 GHTPKCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
G T + E Q D K G YS + +E T D I
Sbjct: 185 GQTNVVLSPEYQVQCDT-MNKACQGGYLKYSWTF--------------LENTGTPLDSCI 229
Query: 198 LYKSGR-FFVPGN-----ETTAMSLIKWTIRDNTSQLG-------------AEGAFTVFD 238
Y SGR F G + +MS+ K+ ++ G + FTV+
Sbjct: 230 PYASGRGTFSSGTCPTQCKIASMSMSKYKAKNTVYISGINNIKTAIMTYGSVQAGFTVYR 289
Query: 239 DLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
DL YKSG LGGHA+ ++G+G + S YWL ANSW +WG +G FKI +G
Sbjct: 290 DLTGYKSGVYKHIENTVLGGHAVALIGFGVEGGSN--YWLAANSWGPNWGMSGYFKIAQG 347
Query: 292 KDECGIESSITAG 304
E GIE+ + AG
Sbjct: 348 --EGGIENQVYAG 358
>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
Length = 297
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 86/312 (27%), Positives = 119/312 (38%), Gaps = 96/312 (30%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+ N S++ + L + G Y +P+N+ + G + P NFD+R +W +
Sbjct: 39 ETTTNPFSDLTKEQLLAKCGT---YIVPSNK--QYPGSPLIS--TPDNFDARQQWGS--K 89
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
I IRDQ CG+CW P ++ C+ + G
Sbjct: 90 IHAIRDQQQCGACWAFGATEALSDRFTIASNGSVDVVFSPEDLVSCDTNDYGCNGGYMDM 149
Query: 130 ----------TRPSC---DASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
SC A G P C +C D +K + S S + I
Sbjct: 150 AWEFLDQHGVVADSCFPYSAGSGFAPACASKCA---DGSAEKKYSCVHGSIRQSQGVEQI 206
Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
EI HGPVEGAFTV+ D Y+SG + P A
Sbjct: 207 KSEIVAHGPVEGAFTVYTDFFNYQSG-VYTPTTSDVA----------------------- 242
Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
GGHAI+ILG+G + + YWL ANSW WG G FKI +G ECG
Sbjct: 243 -------------GGHAIKILGFGVENGT--PYWLCANSWGPSWGMQGFFKIKQG--ECG 285
Query: 297 IESSITAGVPKL 308
IE + + P+L
Sbjct: 286 IEDQVFSCDPQL 297
>gi|291236490|ref|XP_002738176.1| PREDICTED: cathepsin C-like [Saccoglossus kowalevskii]
Length = 438
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 73/297 (24%), Positives = 112/297 (37%), Gaps = 80/297 (26%)
Query: 54 LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
L +W+ P P P +++ +LPA FD R + +R+Q SCGSC+
Sbjct: 180 LVTWLPTPPIMQ-PPKPAPITSQSAQIAANLPAEFDWRNV-GGVNYVTPVRNQASCGSCF 237
Query: 114 -------------------------------------GCR---PYEIAPCEHHVNGTRPS 133
GC PY ++ +
Sbjct: 238 AFASAGMYESRLKVMTANEVNITISPQDVVQCCNYSQGCSGGFPYLVSKYSEDFGFVEET 297
Query: 134 CDASKGHTPKCVRE--CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFT 191
C CV E C+ +Y Y+ +F NE + E+ ++GP+ AF
Sbjct: 298 CLPYTAQDGPCVSEIKCKRHYGTKYRYVGDFYG-----GCNEALMKIELVKNGPMAVAFM 352
Query: 192 VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGG 251
V+DD + Y+ G + G + F F+ +
Sbjct: 353 VYDDFMSYQGGIY---------------------HHTGLQDKFNPFE----------ITN 381
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
HA+ ++G+G D +KEK+W++ NSW T WG+ G F+I RG DEC IES P L
Sbjct: 382 HAVLLVGYGYDHDTKEKFWIVKNSWGTGWGEEGYFRIRRGNDECSIESIAVESTPIL 438
>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 483
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 73/145 (50%), Gaps = 31/145 (21%)
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+F Y V +NE+ IM+EIY +GPV+ V +D LY+SG + + I +
Sbjct: 327 HFSTPPYRVPANEEDIMQEIYANGPVQALILVKEDFFLYRSGVY--------RHTRIAES 378
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNT 278
+R S+ G H++RILGWG D KYWL ANSW
Sbjct: 379 LRPQYSRSG---------------------WHSVRILGWGVDRSQYRPIKYWLCANSWGH 417
Query: 279 DWGDNGLFKILRGKDECGIESSITA 303
WG+NG F+I+RG+DE IES + A
Sbjct: 418 GWGENGYFRIVRGEDESQIESFVLA 442
>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
Length = 812
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 106/259 (40%), Gaps = 88/259 (33%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRP-------- 132
D+P+ F++ T+W ++ IRDQ CGSCW E+ + + N P
Sbjct: 339 DVPSEFNAVTQWKGL--VQPIRDQQQCGSCWAFSAAEVLSDRNAIQHNKAEPVLSPEDLV 396
Query: 133 SCD--------------------------------ASKGHTPKCVRECQENYD-VPYKKD 159
SCD A G PKC C++ YK
Sbjct: 397 SCDRVDQGCNGGNLGTAWTYLKNTGIVTDACFPYTAGGGDAPKCETSCKDGSSWTKYK-- 454
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
A +Y+V+ E ++ KEI HGP++ AF V+ + YKSG + KW
Sbjct: 455 ---AASAYAVNGVE-NMQKEIMTHGPIQVAFNVYKSFMSYKSGVY-----------AKKW 499
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+L EG GHA++I+GWG + + YWL+ANSWNT
Sbjct: 500 Y------ELMPEG------------------GHAVKIVGWGTE--GGKDYWLVANSWNTS 533
Query: 280 WGDNGLFKILRGKDECGIE 298
WGD G FKI G + ++
Sbjct: 534 WGDEGYFKIAVGAESISLD 552
>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 83/253 (32%), Positives = 113/253 (44%), Gaps = 56/253 (22%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASK 138
E D+P NFD+RT+W C + IRDQ +CG+CW A ++V R C A+
Sbjct: 83 ETRVDIPMNFDARTQWKGC--VPAIRDQQTCGACW-------AFSANYVLAHR-LCIATN 132
Query: 139 GHTPKCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
G T + E Q D K G YS + +E T D I
Sbjct: 133 GKTNVVLSPEYQVQCDT-MNKACQGGYLKYSWTF--------------LENTGTPLDTCI 177
Query: 198 LYKSGR-FFVPGN-----ETTAMSLIKWTIRDNTSQLG-------------AEGAFTVFD 238
Y SGR F G + +MS+ K+ ++ G + FTV+
Sbjct: 178 PYASGRGTFSSGTCPTQCKIASMSMSKYKAKNTVYISGINNIKTAIMTYGSVQAGFTVYR 237
Query: 239 DLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
DL YKSG LGGHA+ ++G+G + S YWL ANSW +WG +G FKI +G
Sbjct: 238 DLTGYKSGVYKHVVSTVLGGHAVALIGFGVEGGSN--YWLAANSWGPNWGMSGYFKIAQG 295
Query: 292 KDECGIESSITAG 304
E GIE+ + AG
Sbjct: 296 --EGGIENQVYAG 306
>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 82/249 (32%), Positives = 112/249 (44%), Gaps = 56/249 (22%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTP 142
D+P NFD+RT+W C + IRDQ +CG+CW A ++V R C A+ G T
Sbjct: 2 DIPMNFDARTQWRGC--VPAIRDQQTCGACW-------AFSANYVLAHRL-CIATNGQTN 51
Query: 143 KCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
+ E Q D K G YS + +E T D I Y S
Sbjct: 52 VVLSPEYQVQCDT-MNKACQGGYLKYSWTF--------------LENTGTPLDTCIPYAS 96
Query: 202 GR-FFVPGN-----ETTAMSLIKWTIRDNTSQLG-------------AEGAFTVFDDLIL 242
GR F G + +MS+ K+ ++ G + FTV+ DL
Sbjct: 97 GRGTFSSGTCPTQCKIASMSMSKYKAKNTRYITGINNIKTAIMTYGSVQAGFTVYRDLTG 156
Query: 243 YKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
YKSG LGGHA+ ++G+G + S YWL ANSW +WG +G FKI +G E
Sbjct: 157 YKSGVYKHVVSTVLGGHAVALIGFGVEGGSN--YWLAANSWGPNWGMSGYFKIAQG--EG 212
Query: 296 GIESSITAG 304
GIE+ + AG
Sbjct: 213 GIENQVYAG 221
>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 90/216 (41%), Gaps = 57/216 (26%)
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVREC-QENYDVPYKKDL 160
I +GS + GC PY C HH ++ C TP C+ C E Y +P KD
Sbjct: 150 IATEGSMSAADGCWPYNFPKCAHHQKKSKYEPCSKKLYDTPSCLDRCPNEKYGIPLDKDR 209
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
+F A S + +I KEI +GP F+
Sbjct: 210 HFTAHSPDLFEGTDNIKKEIMTNGPTSATFS----------------------------- 240
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIA 273
V++D + YKSG +G H++ I+GWG ++ YWL+
Sbjct: 241 ---------------VYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGTEKGVD--YWLVM 283
Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
NSWN WGD+G FKI +G +CGI+ ++ P ++
Sbjct: 284 NSWNEGWGDHGTFKIAQG--DCGIDDAVLGSPPAMN 317
>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
Length = 294
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 125/294 (42%), Gaps = 63/294 (21%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
+ N +N+ + L + G Y +PAN+ E G + +P NFD+R +W +
Sbjct: 39 ETTTNPFNNMTKEQLLAKCGT---YIVPANK--EYPGSKIMT--VPENFDARQQWGS--K 89
Query: 100 IREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKD 159
I IRDQ CGSCW E +NG +P+ + C N
Sbjct: 90 IHAIRDQQQCGSCWAFGATEAFSDRFAINGKDVIL------SPEDLVSCDTN-------- 135
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPG-----NETTAM 214
++G M +E+ GA T D Y +G F P + +AM
Sbjct: 136 -DYGCNG--------GYMDVAWEYLADHGAAT--DSCFPYSAGSGFAPACSDKCADGSAM 184
Query: 215 SLIKW---TIRDN----------TSQLGAEGAFTVFDDLILYKSG-------KALGGHAI 254
K ++R + S EGAFTV+ D Y+SG GGHAI
Sbjct: 185 QRFKCAPNSVRQSKGVAQIQSEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGGHAI 244
Query: 255 RILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+ILG+G + + YWL ANSW WG +G FKI +G ECGIE + + P+L
Sbjct: 245 KILGYGVENGTP--YWLCANSWGPAWGMSGFFKIKQG--ECGIEDQVFSCDPQL 294
>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
Length = 450
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 72/144 (50%), Gaps = 39/144 (27%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS E+ IM EI +GPV+ F V++D +Y G
Sbjct: 318 YRVSSREQDIMTEIITNGPVQATFLVYEDFFMYSGG------------------------ 353
Query: 227 QLGAEGAFTVFDDLILYK----SGKALGGHAIRILGWGEDEKS--KEKYWLIANSWNTDW 280
V+ L L++ K G H++RI+GWGED + + KYWL ANSW +W
Sbjct: 354 ---------VYQHLDLHEHKEEERKVQGYHSVRIIGWGEDYSTGPQVKYWLAANSWGNEW 404
Query: 281 GDNGLFKILRGKDECGIESSITAG 304
G++GLF+ILRG++ C IES +
Sbjct: 405 GEDGLFRILRGENHCEIESFVIGA 428
>gi|63115212|gb|AAY33830.1| cathepsin B, partial [Siniperca chuatsi]
Length = 69
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 43/62 (69%), Positives = 48/62 (77%), Gaps = 2/62 (3%)
Query: 246 GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
G A+GGHAI+ILGWGE++ YWL ANSWNTDWGDNG FK LRG D C IES I AG+
Sbjct: 10 GSAVGGHAIKILGWGEEDGVP--YWLCANSWNTDWGDNGFFKFLRGSDHCRIESEIVAGI 67
Query: 306 PK 307
PK
Sbjct: 68 PK 69
>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
Length = 193
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 58/187 (31%), Positives = 79/187 (42%), Gaps = 41/187 (21%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFGAKSYSVSS 171
GC+PY I PC+ S HTP C C N P YK+ +FG Y+V
Sbjct: 46 GCKPYTIYPCDKTYPNGTTSVPCPGYHTPVCEERCTSNITWPISYKQVKHFGKAHYNVGK 105
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
I EI +GPV +F ++DD YKSG +
Sbjct: 106 KMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIY--------------------------- 138
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
++ +G GG +I+GWG D + YWL + W TD+G+NG +ILRG
Sbjct: 139 ----------VHTAGDQEGGMDTKIIGWGVD--NGVPYWLCVHQWGTDFGENGFMRILRG 186
Query: 292 KDECGIE 298
+E IE
Sbjct: 187 VNEVHIE 193
>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
Length = 196
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 75/163 (46%), Gaps = 40/163 (24%)
Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY PC +H N T C TP C + CQ Y Y+KD + +Y VSS+E
Sbjct: 73 CKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYCQYGYGKRYEKDKIYAXDAYRVSSDE 132
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+I EI+ GPV+ +F ++D YKSG
Sbjct: 133 AAIRAEIFARGPVQASFATYEDFAHYKSG------------------------------- 161
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSW 276
+ ++ +GK GGHA++I+GWG + +K W++ANSW
Sbjct: 162 ------IYVHTAGKRRGGHAVKIIGWGVENGTKX--WIVANSW 196
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 20/32 (62%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG+GCNGG+ AW Y SG+ SGG Y K
Sbjct: 39 FCGYGCNGGYSARAWLYARNSGVCSGGRYQEK 70
>gi|281204808|gb|EFA79003.1| hypothetical protein PPL_08471 [Polysphondylium pallidum PN500]
Length = 322
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 74/249 (29%), Positives = 110/249 (44%), Gaps = 18/249 (7%)
Query: 71 LPELIGYSEVDE-DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-N 128
L ++ Y++ D ++PA+FD+RT+WPNC I +RDQGSC SCW I + +
Sbjct: 25 LDNVVSYTDQDRANIPASFDARTQWPNC--ISPVRDQGSCSSCWAMTSSSILADRLCIAS 82
Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEI---YEHGP 185
G S + C + C+ N FG S+ I E Y+
Sbjct: 83 GGAIKKLLSPQYMVDCAKNCKTNSQSDCNSGCKFGFLDISMEYLSNGISAESCLPYKESD 142
Query: 186 VEGAFTVFDD--LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI-- 241
D + LY GN A + I N L FT ++
Sbjct: 143 ATCPSQCKDGSPIQLYYGSGCISIGNLKDA----QLEIMKNGPILAVFQIFTSLYNIGSG 198
Query: 242 LYK-SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
LY+ +G GHA R++GWGE+ + YWL NSW T++G +G FK+ G++ G ES
Sbjct: 199 LYRGTGDPAEGHAARVIGWGEENGTP--YWLALNSWGTEFGMDGAFKVPMGENIAGFESQ 256
Query: 301 ITAGVPKLD 309
+ + P +D
Sbjct: 257 LLSVKPNVD 265
>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
Length = 134
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 81/171 (47%), Gaps = 43/171 (25%)
Query: 141 TPKCVREC-QENYDVPYKKDLNFGAKSY-SVSSNEKSIMKEIYEHGPVEGAFTVFDDLIL 198
TP C C Y + KD ++ + S + SI KEI +GP AF+V++D +
Sbjct: 5 TPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDFLS 64
Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILG 258
YKSG + + SG LGGHA+ I+G
Sbjct: 65 YKSGVY-------------------------------------KHTSGGFLGGHAVEIIG 87
Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
WG ++ YWL+ NSWN +WGD+G FKI++G +CGI+ I AG P ++
Sbjct: 88 WGTEKGV--DYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDMILAGTPAIN 134
>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
Length = 442
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 73/286 (25%), Positives = 109/286 (38%), Gaps = 101/286 (35%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG--------------CRPYEIAPC------ 123
LP +FD R +W + T++++RDQG CG+ W R +E+ P
Sbjct: 185 LPMSFDGRIEWRD--TLQDVRDQGWCGASWAFSTAAVAADRLAIQSRGHEVYPLSMQNLL 242
Query: 124 ------EHHVNGTR-------------------PSCDASKGHTPKC---------VRECQ 149
+ NG P G KC +CQ
Sbjct: 243 ACNNRGQQGCNGGHLDRAWNYMRRFGVVNEECYPYISGRTGQVEKCKVPRRGNLATMKCQ 302
Query: 150 ---------ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
+ D P +K L +Y ++ E IM EI +HGPV+ V D LY+
Sbjct: 303 LVNAAERKSDRSDKPPRKGLFRSPPAYRIAPFEDDIMNEILQHGPVQATMRVHPDFFLYR 362
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
G + G + S G H++RI+GWG
Sbjct: 363 GGVYRYSGTNSQQRS----------------------------------GYHSVRIVGWG 388
Query: 261 ED--EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
D +++ KYWL+ANSW WG++G F+I+RG++E IE + A
Sbjct: 389 VDSSKRNPTKYWLVANSWGRLWGEDGYFRIVRGENESDIEKFVLAA 434
>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 83/255 (32%), Positives = 113/255 (44%), Gaps = 56/255 (21%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASK 138
E D+P NFD+RT+W C + IRDQ +CG+CW A ++V R C A+
Sbjct: 83 ETRVDIPMNFDARTQWKGC--VPAIRDQQTCGACW-------AFSANYVLAHR-LCIATN 132
Query: 139 GHTPKCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
G T + E Q D K G YS + +E T D I
Sbjct: 133 GQTNVVLSPEYQVQCDT-MNKACQGGYLKYSWTF--------------LENTGTPLDTCI 177
Query: 198 LYKSGR-FFVPGN-----ETTAMSLIKWTIRDNTSQLG-------------AEGAFTVFD 238
Y SG F G + +MS+ K+ ++ G + FTV+
Sbjct: 178 PYASGGGTFSSGTCPTQCKIASMSMSKYKAKNTVYISGINNIKTAIMTYGSVQAGFTVYR 237
Query: 239 DLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
DL YKSG LGGHA+ ++G+G + S YWL ANSW +WG +G FKI +G
Sbjct: 238 DLTGYKSGVYKHLVSTVLGGHAVALIGFGVEGGSN--YWLAANSWGPNWGMSGYFKIAQG 295
Query: 292 KDECGIESSITAGVP 306
E GIE+ + AG P
Sbjct: 296 --EGGIENQVYAGEP 308
>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
Length = 231
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 96/257 (37%), Gaps = 92/257 (35%)
Query: 86 ANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIAPC 123
A FDSR KWPNC + IRDQG+CGSC+ P ++ C
Sbjct: 4 AEFDSRQKWPNC--VHPIRDQGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTC 61
Query: 124 EHH---VNGTRPSCDASKGHTP-------------------KCVRECQENYDVPYKKDLN 161
+ NG P H KC C N +K D +
Sbjct: 62 SWYSFGCNGGIPGLVFDYIHKDGLVSDACFPYLSYDGNTHVKCPDFCYNNKTKSFKSDKH 121
Query: 162 FGAKSYSV-------SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
F K Y V + I KEI HGPV F V+ D +YKSG +
Sbjct: 122 FADKVYHVGEFLEDKAKRVLEIQKEILTHGPVNADFMVYSDFTVYKSGVY---------- 171
Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
+++G G HA++I+GWG + + YWLIAN
Sbjct: 172 ---------------------------RHQTGSFEGIHAVKIIGWGTE--NGVDYWLIAN 202
Query: 275 SWNTDWGDNGLFKILRG 291
SW T +G G FKI+RG
Sbjct: 203 SWGTTFGLQGFFKIVRG 219
>gi|432892467|ref|XP_004075795.1| PREDICTED: dipeptidyl peptidase 1-like [Oryzias latipes]
Length = 453
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/307 (25%), Positives = 119/307 (38%), Gaps = 88/307 (28%)
Query: 50 PRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLP---ANFDSRTKWPNC---PTIREI 103
P + + +H PA+R+P + + V DL A W N + +
Sbjct: 181 PEHEMYTLQELHYRAGGPASRVPVRVRPAPVTADLAKVAAALPESWDWRNVGGVNFVSPV 240
Query: 104 RDQGSCGSCWGC----------------------RPYEIAPCEHHVNGTRPSCDASKGHT 141
R+Q +CGSC+ P ++ C + G CD G
Sbjct: 241 RNQAACGSCYSFATMGMLEARVRVLTNNSQTPVFSPQQVVSCSEYSQG----CD---GGF 293
Query: 142 PKCVRECQENYDV------PY-KKDLNFGA-----KSYSVS----------SNEKSIMKE 179
P + + +++ + PY KD G ++Y+ +E ++MKE
Sbjct: 294 PYLIGKYSQDFGIVEESCFPYIAKDSPCGVPQNCGRAYTAEYKYVGGFYGGCSEMAMMKE 353
Query: 180 IYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDD 239
+ HGP+ AF V+ D + Y G + G F F+
Sbjct: 354 LVHHGPMAVAFEVYPDFMHYAGGIY---------------------HHTGLADPFNPFE- 391
Query: 240 LILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
L HA+ ++G+G K+ EKYW++ NSW T WG+NG F+I RG DEC IES
Sbjct: 392 ---------LTNHAVLLVGYGRCHKTGEKYWIVKNSWGTSWGENGFFRIRRGSDECSIES 442
Query: 300 SITAGVP 306
A P
Sbjct: 443 IAVAATP 449
>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
impatiens]
Length = 445
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 76/254 (29%), Positives = 104/254 (40%), Gaps = 46/254 (18%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGH 140
E LP FD+R +WP I +I DQG CG+ W +A + S S H
Sbjct: 200 ESLPREFDARIRWPR--EISDIDDQGWCGASWAISTTRVASDRFALMSKGADSVLLSAQH 257
Query: 141 TPKC----VRECQENY-DVPYKKDLNFG----------AKSYSVSSNEKSIMKEIYEHGP 185
C + C Y D + FG + +++ +K P
Sbjct: 258 LLSCNNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGTNVQCKLRKRTDLKTAGCRPP 317
Query: 186 VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS 245
V T LYK G + GNET M I + + V+ D Y+S
Sbjct: 318 VNPLRTE-----LYKVGPAYRLGNETDIMYEI-------LTSGPVQATMKVYQDFFSYES 365
Query: 246 G----------KALGGHAIRILGWGEDEKSKE------KYWLIANSWNTDWGDNGLFKIL 289
G A G H++RI+GWGED + KYWL+ NSW WG++GLF+I
Sbjct: 366 GIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRYRNLPIKYWLVVNSWGQQWGESGLFRIQ 425
Query: 290 RGKDECGIESSITA 303
RG +EC IES + A
Sbjct: 426 RGTNECDIESFVVA 439
>gi|449485032|ref|XP_002188357.2| PREDICTED: dipeptidyl peptidase 1 [Taeniopygia guttata]
Length = 667
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 108/283 (38%), Gaps = 83/283 (29%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCR---------- 116
PA PEL+ + LP ++D R + +R+QGSCGSC+
Sbjct: 421 PAPLTPELL---KKVSSLPESWDWRNV-NGVNYVSPVRNQGSCGSCYAFSSMAMLEARIR 476
Query: 117 ------------PYEIAPCEHHVNG-------------------TRPSCDASKGHTPKCV 145
P ++ C + G C C+
Sbjct: 477 ILTNNTQKPVFSPQQVVSCSRYSQGCDGGFPYLIGGKYVQDFGVVEDDCFPYTAQDSPCL 536
Query: 146 --RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGR 203
R C Y Y F NE + E+ HGP+ AF V++D +LYK G
Sbjct: 537 FKRSCYHYYTSEYHYVGGFYG-----GCNEALMKLELVHHGPMAVAFEVYNDFMLYKEGI 591
Query: 204 FFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDE 263
+ G + DDL ++ L HA+ ++G+G+D
Sbjct: 592 YHHTGLQ---------------------------DDLNPFE----LTNHAVLLVGYGKDP 620
Query: 264 KSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+S EK+W++ NSW T WG++G F+I RG DEC IES A P
Sbjct: 621 ESGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATP 663
>gi|348565723|ref|XP_003468652.1| PREDICTED: dipeptidyl peptidase 1-like [Cavia porcellus]
Length = 463
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 76/270 (28%), Positives = 102/270 (37%), Gaps = 86/270 (31%)
Query: 83 DLPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRP 117
LPA++D W N I +R+QGSCGSC+ P
Sbjct: 230 QLPASWD----WRNVNGINFVTPVRNQGSCGSCYSFASVGMLEARIRILTNNTQTPILSP 285
Query: 118 YEIAPCEHHVNG-------------------TRPSCDASKGHTPKCV--RECQENYDVPY 156
EI C + G SC KG C ++C Y Y
Sbjct: 286 QEIVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEESCFPYKGIDVPCKVKKDCVRYYTSEY 345
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSL 216
F NE + E+ +HGP+ AF V+DD + Y G + G
Sbjct: 346 HYVGGFYG-----GCNEALMKLELVQHGPMAVAFEVYDDFLHYHKGIYHRTG-------- 392
Query: 217 IKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSW 276
+RD F F+ L HA+ ++G+G D S YW++ NSW
Sbjct: 393 ----LRD---------PFNPFE----------LTNHAVLLVGYGTDPVSGRDYWIVKNSW 429
Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGVP 306
T WG++G F+ILRG DEC IES A P
Sbjct: 430 GTGWGEDGYFRILRGTDECAIESIAMAATP 459
>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Ornithorhynchus anatinus]
Length = 327
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 82/184 (44%), Gaps = 37/184 (20%)
Query: 122 PCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
PC + +RP + T C + D Y D+ Y +SSNEK IMKEI
Sbjct: 158 PCRMY---SRPMGRGKRQATGPCPNNFHHSND--YSNDIYQSTPPYRLSSNEKDIMKEIM 212
Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
E+GPV+ V +D LYK G + R + G F
Sbjct: 213 ENGPVQALMEVHEDFFLYKDGIY-----------------RHTPASNGKPPQF------- 248
Query: 242 LYKSGKALGGHAIRILGWGEDEK---SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
+ G H+++I GWGE+ + + K+W ANSW WG+ G F+ILRG +EC IE
Sbjct: 249 -----RRQGTHSVKITGWGEELQPNGRRVKFWRAANSWGPTWGEGGSFRILRGCNECDIE 303
Query: 299 SSIT 302
S +
Sbjct: 304 SFVV 307
>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
terrestris]
Length = 445
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 76/254 (29%), Positives = 104/254 (40%), Gaps = 46/254 (18%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGH 140
E LP FD+R +WP I +I DQG CG+ W +A + S S H
Sbjct: 200 ESLPREFDARIRWPR--EISDIDDQGWCGASWAISATRVASDRFALMSKGADSVLLSAQH 257
Query: 141 TPKC----VRECQENY-DVPYKKDLNFG----------AKSYSVSSNEKSIMKEIYEHGP 185
C + C Y D + FG + +++ +K P
Sbjct: 258 LLSCNNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGTNAQCKLRKRTDLKTAGCRPP 317
Query: 186 VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS 245
V T LYK G + GNET M I + + V+ D Y+S
Sbjct: 318 VNPLRTE-----LYKVGPAYRLGNETDIMYEI-------LTSGPVQATMKVYQDFFSYES 365
Query: 246 G----------KALGGHAIRILGWGEDEKSKE------KYWLIANSWNTDWGDNGLFKIL 289
G A G H++RI+GWGED + KYWL+ NSW WG++GLF+I
Sbjct: 366 GIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRHHNLPIKYWLVVNSWGQQWGESGLFRIQ 425
Query: 290 RGKDECGIESSITA 303
RG +EC IES + A
Sbjct: 426 RGTNECDIESFVVA 439
>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
saltator]
Length = 443
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 103/244 (42%), Gaps = 33/244 (13%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTP 142
LP FD+RT+WP I I DQG CG+ W ++A + + S H
Sbjct: 203 LPREFDARTRWPR--DISGIHDQGWCGASWAVSTADVASDRFAIMSKGAEDVELSAQHLL 260
Query: 143 KCVRECQEN-----YDVPYKKDLNFG---AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD 194
C Q+ D + FG + Y + + V G +
Sbjct: 261 SCNNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWTGRNDQCRLRKRSNLNVAGCRKPPN 320
Query: 195 DLI--LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG------ 246
L LYK G + GNET M I + + V+ D +YK+G
Sbjct: 321 PLRQELYKVGPAYRLGNETDIMQEI-------LTSGPVQATMRVYQDFFVYKNGVYRHSR 373
Query: 247 ----KALGGHAIRILGWGEDEKSK---EKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
G H++RI+GWGE+ + KYWL+ANSW WG+NGLF+I RG +EC IES
Sbjct: 374 SAELHDSGYHSMRIIGWGEEPSYRGPPLKYWLVANSWGRHWGENGLFRIQRGTNECEIES 433
Query: 300 SITA 303
+ A
Sbjct: 434 YVLA 437
>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
Length = 257
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 72/268 (26%), Positives = 108/268 (40%), Gaps = 84/268 (31%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NG---------TRP 132
+P +FD+RT+WPNC I I +Q CGSCW E+ + NG
Sbjct: 31 IPQSFDARTQWPNC--IHPILNQEQCGSCWAFSASEVLSDRLCIASNGKTGVVLSPQALV 88
Query: 133 SCD-----ASKGHTPKCVRE------------------------CQENYDVPYKKDLNFG 163
SCD G P+ E C +N V ++ +
Sbjct: 89 SCDIFGNQGCNGGIPQLAWEYMELHGIPTYGCFPYTSGNGTDGSCVKNSCVDNEQYTLYR 148
Query: 164 AKSYSVSS--NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF-FVPGNETTAMSLIKWT 220
AK ++ + + + I ++I + GP++G V+ D + Y SG + PG+
Sbjct: 149 AKPLTLKTCASVECIQQDIMKFGPIQGTMEVYSDFMSYTSGVYTMTPGSSL--------- 199
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
LGGHAI+I+GWG D+ S + YW++ANSW W
Sbjct: 200 ----------------------------LGGHAIKIVGWGFDQASNQNYWIVANSWGPSW 231
Query: 281 GDNGLFKILRGKDECGIESSITAGVPKL 308
G +G F I D+CGI S A ++
Sbjct: 232 GIDGFFWI--AFDQCGINSDACAAQARI 257
>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
rubripes]
Length = 477
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 76/149 (51%), Gaps = 32/149 (21%)
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
Y+ D+ Y +S+NEK IMKEI ++GPV+ V +D +YKSG + + T +S
Sbjct: 334 YQNDIYQSTPPYRLSTNEKEIMKEIQDNGPVQAIMEVHEDFFVYKSGIY-----KHTDVS 388
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLI 272
K + G H+++I GWGE+ +K KYW+
Sbjct: 389 FTK------------------------PPQYRKHGTHSVKITGWGEERNVDGAKRKYWIA 424
Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
ANSW +WG+ G F+I RG++EC IE+ +
Sbjct: 425 ANSWGKNWGEEGYFRIARGENECEIEAFV 453
>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
harrisii]
Length = 467
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 71/139 (51%), Gaps = 32/139 (23%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y +SS+EK IMKE+ E+GPV+ V +D LYKSG + + +
Sbjct: 343 YRLSSHEKDIMKELMENGPVQALLEVHEDFFLYKSGIY-----------------KHTPA 385
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
LG + + G H+++I GWGE+ + K KYW ANSW WG+N
Sbjct: 386 SLGKPERY------------RQHGTHSVKITGWGEEIQPDGQKVKYWTAANSWGPTWGEN 433
Query: 284 GLFKILRGKDECGIESSIT 302
G F+I+RG +EC IES +
Sbjct: 434 GYFRIVRGANECDIESFVV 452
>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
Length = 180
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 69/148 (46%), Gaps = 38/148 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y CEHHV G P C TP+CV++C + DV Y +D SY++ ++E
Sbjct: 66 GCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQC-DTPDVGYLEDKTRANMSYNIYASE 124
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
SIMKEI GPVE FT+++D + Y SG +F
Sbjct: 125 ISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYF---------------------------- 156
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGE 261
+ G + GHA+RILGWGE
Sbjct: 157 ---------HALGAPMSGHAVRILGWGE 175
Score = 45.1 bits (105), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGC GG+P +AW YW GIV+GG+
Sbjct: 34 CGFGCRGGYPAVAWDYWKTHGIVTGGS 60
>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
Length = 484
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 112/283 (39%), Gaps = 92/283 (32%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE-------HHVNGTRP- 132
++ LP++F++ KWP + E DQG+C W +A H P
Sbjct: 218 NDILPSHFNAAEKWPG--LVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQ 275
Query: 133 ---SCDA-----------------------------------SKGHTPKCV--------- 145
SCD + GH+ C+
Sbjct: 276 NLLSCDTRNQHGCRGGRVDGAWWYLRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRG 335
Query: 146 -RECQENYDVPY--KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
R+ N Y ++ +Y ++S+EK IMKE+YE+GPV+ V +D +YKSG
Sbjct: 336 KRQATNNCPNQYYSSNEIYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSG 395
Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED 262
+ R ++ E + G H+++I GWGE+
Sbjct: 396 IYR----------------RTPVTEREPE-------------HHRRHGTHSVKITGWGEE 426
Query: 263 ---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
+ KYWL ANSW DWG++G F+I RG++EC IE+ I
Sbjct: 427 RGRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIV 469
>gi|300121294|emb|CBK21674.2| unnamed protein product [Blastocystis hominis]
Length = 561
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/164 (32%), Positives = 79/164 (48%), Gaps = 40/164 (24%)
Query: 146 RECQENYDV-PYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
R+C +Y P + + + Y S E+ +MKEIY GP+ A D+L+ YK G
Sbjct: 153 RDCGHDYPCHPVQNYTKYFVEEYGYVSGEERMMKEIYARGPITCALDATDELVAYKGG-- 210
Query: 205 FVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK 264
+F+D K+G HAI ++GWGE++
Sbjct: 211 -------------------------------IFED----KTGTTSLNHAISVVGWGEEDG 235
Query: 265 SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+KYW++ NSW T WG+NG F+I+RG + GIES T VP++
Sbjct: 236 --KKYWIVRNSWGTYWGENGWFRIVRGTNNLGIESECTWAVPRV 277
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/230 (25%), Positives = 87/230 (37%), Gaps = 45/230 (19%)
Query: 87 NFDSRTKWPNCP-TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKC- 144
N + KWP +++E+ + G+ GSC G + H +C + +C
Sbjct: 369 NIMRKGKWPTVELSVQEVINCGNTGSCNGGWDSGVYRYAHEEGIPDQTCQVYEARNKECN 428
Query: 145 ----VRECQENYDVPYKKDLN-FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILY 199
+C + D KD + Y S + + EI+ GP+ +V + + Y
Sbjct: 429 DMNRCMDCPPDRDCYAVKDYKRYKVGDYGYVSGKDKMKAEIFARGPISCYVSVSQEFLDY 488
Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
G F + LGGH I + GW
Sbjct: 489 TGGVF-------------------------------------VEHDHSMLGGHIIEVAGW 511
Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
G E +E YW+ NSW WG+NG F+I KD IESS T GVP +D
Sbjct: 512 GVTEDGQE-YWIGRNSWGEYWGENGWFRIQTDKDNLEIESSCTWGVPIID 560
>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
Length = 327
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/244 (31%), Positives = 105/244 (43%), Gaps = 38/244 (15%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTP 142
LP FDS KWP + EI+DQG CGS W +A + + R S H
Sbjct: 79 LPREFDSEFKWPGW--MSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLL 136
Query: 143 KCVRECQENYDVPY--------KKDLNFGAKSYSVS-SNEKSIMKEIYEHGPVEGAF--- 190
C R Q++ + Y +K + + S +NEK I G + A
Sbjct: 137 SCDRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPYSATNEKC---RIPRRGDLVTANCQL 193
Query: 191 -TVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG--- 246
T D YK + GNET M I + V+ D YK G
Sbjct: 194 PTNVDRRSKYKVAPAYRVGNETDIMYEI-------LHSGPVQATMKVYHDFFTYKRGIYR 246
Query: 247 -------KALGGHAIRILGWGEDEKSK--EKYWLIANSWNTDWGDNGLFKILRGKDECGI 297
G H++RI+GWGE+ + +KYW +ANSW +WG+NG F+ILRG +EC I
Sbjct: 247 HSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEI 306
Query: 298 ESSI 301
ES +
Sbjct: 307 ESFV 310
>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
Length = 179
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 69/149 (46%), Gaps = 38/149 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY C+HH G P C TPKCV+ C + + Y+KD SY+V +E
Sbjct: 66 GCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHC-DTPKIDYQKDKTRANTSYNVHQSE 124
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+IMKEI +GPVE F V +D YKSG +F
Sbjct: 125 VAIMKEILLNGPVEATFEVHEDFPEYKSGIYF---------------------------- 156
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGED 262
+ G ++GGHAIRILGWGE+
Sbjct: 157 ---------HAWGGSVGGHAIRILGWGEE 176
>gi|449269572|gb|EMC80333.1| Dipeptidyl-peptidase 1 [Columba livia]
Length = 412
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 117/335 (34%), Gaps = 97/335 (28%)
Query: 33 GGAYGSKQAEKNSLSNIPRAHLKSWMG-VHPDY-NLPANRLPELIG--YSEVDEDLPANF 88
G G + N AH KSW ++ +Y N L G YS V PA
Sbjct: 110 GSLSGRRYVHNFDFVNAINAHQKSWKATIYKEYENFALEELTRRSGGLYSRVPRPKPAPL 169
Query: 89 DSRT-----------KWPNCP---TIREIRDQGSCGSCWGCR------------------ 116
+ W N + IR+QGSCGSC+
Sbjct: 170 TAELLKKVSGLPDSWDWRNVNGVNYVSPIRNQGSCGSCYAFSSMGMLEARIRILTNNTQK 229
Query: 117 ----PYEIAPCEHHVNG-------------------TRPSCDASKGHTPKCV--RECQEN 151
P ++ C + G C C+ R C
Sbjct: 230 PIFSPQQVVSCSQYSQGCDGGFPYLIAGKYVQDFGVVEEDCFPYTAQDSPCLFKRSCYHY 289
Query: 152 YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
Y Y F NE + E+ HGP+ AF V++D I YK G + G
Sbjct: 290 YTSEYHYVGGFYG-----GCNEALMKLELVLHGPMAVAFEVYNDFIHYKEGIYHHTG--- 341
Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
+RD+ F F+ L HA+ ++G+G D +S EK+W+
Sbjct: 342 ---------LRDD---------FNPFE----------LTNHAVLLVGYGTDPQSGEKFWI 373
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+ NSW WG+NG F+I RG DEC IES + P
Sbjct: 374 VKNSWGILWGENGYFRIRRGTDECAIESIAVSATP 408
>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
domestica]
Length = 466
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 78/185 (42%), Gaps = 41/185 (22%)
Query: 121 APCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEI 180
APC H + H P Y Y +SS+EK IMKE+
Sbjct: 305 APCMMHSRSMGRGKRQATAHCPNSRAHANHIYQA---------TPPYRLSSDEKDIMKEL 355
Query: 181 YEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
E+GPV+ V +D LYKSG + + + LG +
Sbjct: 356 MENGPVQALMEVHEDFFLYKSGIY-----------------KHTPASLGKPARY------ 392
Query: 241 ILYKSGKALGGHAIRILGWGEDEK---SKEKYWLIANSWNTDWGDNGLFKILRGKDECGI 297
+ G H+++I GWGE+ + + KYW ANSW WG+ G F+ILRG +EC I
Sbjct: 393 ------RQHGTHSVKITGWGEERQPDGQRLKYWTAANSWGPTWGEKGHFRILRGANECDI 446
Query: 298 ESSIT 302
ES +
Sbjct: 447 ESFVV 451
>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
Length = 296
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/71 (57%), Positives = 52/71 (73%), Gaps = 1/71 (1%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHG 184
K IM EIY++G
Sbjct: 237 KDIMAEIYKNG 247
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 33/50 (66%), Positives = 39/50 (78%)
Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
E K+ YWL+ANSWNTDWGDNG FKILRG+D CGIES + AG+P+ D
Sbjct: 241 AEIYKNGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTD 290
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 45/84 (53%), Gaps = 14/84 (16%)
Query: 44 NSLSNIPRAHLKSWMGVHPDYNLPANRL-------------PELIGYSEVDEDLPANFDS 90
+ L N +W H YN+ + L P+ + ++E D LP +FD+
Sbjct: 28 DELVNYVNKQNTTWQAGHNFYNVDVSYLKRLCGTFLGGPKPPQRVMFTE-DLKLPESFDA 86
Query: 91 RTKWPNCPTIREIRDQGSCGSCWG 114
R +WP CPTI+EIRDQGSCGSCW
Sbjct: 87 REQWPQCPTIKEIRDQGSCGSCWA 110
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175
>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
Length = 576
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 109/249 (43%), Gaps = 35/249 (14%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDAS 137
E+ LP +FD+R +WP+ I +RDQG C S W ++ + +G + S
Sbjct: 306 EMSNFLPESFDARERWPS--FIHPVRDQGDCASSWAFSTTAVSADRLAIQSGGKFYNPLS 363
Query: 138 KGHTPKCVRECQENYDVPY--KKDLNFGAKSYSVSSNEKS------IMKEIYEHGPVEGA 189
C + Q + Y + + Y+ +S + + I + Y G +
Sbjct: 364 VQQLLSCNQARQRGCNGGYLDRAWCVVSDECYTYTSGQTNQPGECHIPRTAYLDGEIRCP 423
Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG--- 246
D+ + + + + NE M+ I + + F V +D +YKSG
Sbjct: 424 SGSADNRVYKMTPPYRISTNEREIMTEI-------MANGPVQATFLVHEDFFMYKSGVYQ 476
Query: 247 ------------KALGGHAIRILGWGEDEKS--KEKYWLIANSWNTDWGDNGLFKILRGK 292
G H++RILGWG D + KYWL ANSW +WG+NGLF+ILRG+
Sbjct: 477 HLPYANDKGPAYARSGYHSVRILGWGVDHSTGVPIKYWLCANSWGEEWGENGLFRILRGE 536
Query: 293 DECGIESSI 301
+ C IES I
Sbjct: 537 NHCDIESFI 545
>gi|326914532|ref|XP_003203579.1| PREDICTED: dipeptidyl peptidase 1-like [Meleagris gallopavo]
Length = 420
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 108/283 (38%), Gaps = 83/283 (29%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGC----------- 115
PA PEL+ + +LP ++D R + +R+Q SCGSC+
Sbjct: 174 PAPLTPELL---KKVSNLPESWDWRNV-NGVNYVSPVRNQASCGSCYAFASMGMLEARIR 229
Query: 116 -----------RPYEIAPCEHHVNG-------------------TRPSCDASKGHTPKCV 145
P ++ C + G C C+
Sbjct: 230 ILTNNTQKPVFSPQQVVSCSQYSQGCDGGFPYLIAGKYVQDFGVVEEDCFPYTAQDSPCL 289
Query: 146 --RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGR 203
R C Y Y F + NE + E+ GP+ AF V++D + YK G
Sbjct: 290 FKRSCYHYYTSEYHYVGGFYG-----ACNEALMKLELVLSGPMAVAFEVYNDFMFYKEGI 344
Query: 204 FFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDE 263
+ G ++DN F F+ L HA+ ++G+G+D
Sbjct: 345 YHHTG------------LKDN---------FNPFE----------LTNHAVLLVGYGKDP 373
Query: 264 KSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
KS EK+W++ NSW T WG++G F+I RG DEC IES A P
Sbjct: 374 KSGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATP 416
>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 451
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 114/283 (40%), Gaps = 88/283 (31%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIA----------------- 121
++ + +P +FD+R KW + I I DQG+C S W +A
Sbjct: 174 KMKKKIPKSFDARDKWGS--MITGILDQGNCASSWAFSTVGVASDRLAIQSSGETGMTLS 231
Query: 122 PCEHHVNGTRPSCDASKGHTPK----------------------------CVRECQENYD 153
P TR S GH + C+ + D
Sbjct: 232 PQHLLSCNTRGQRGCSGGHIDRAWWFMRKRGVVSNDCYPYTSGDQDKKGVCMMPGKLPSD 291
Query: 154 VPYKKD----LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFF---V 206
P ++ L+ Y +++NE+ I EI E+GPV+ +F V +D +Y SG + +
Sbjct: 292 CPTGRERNNELHHSTPPYRIAANEREIQVEIMENGPVQASFEVKEDFFMYGSGVYRHTPI 351
Query: 207 PGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK 266
N+ +W H++++LGWG + +
Sbjct: 352 ASNDAEQYHASEW--------------------------------HSVKLLGWGVE--NG 377
Query: 267 EKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
KYWL ANSW T WG++G FKILRG++EC IES + A K+D
Sbjct: 378 IKYWLGANSWGTKWGEDGYFKILRGENECNIESYVVAVWGKVD 420
>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 296
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 69/250 (27%), Positives = 99/250 (39%), Gaps = 76/250 (30%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRP-----------YEIAP 122
P ++D R ++P+C I E+ DQGSCGSCW CR +
Sbjct: 77 PESYDFRDEYPHC--ITEVVDQGSCGSCWAFSSIQTFADHRCRSGLDATGVSYSVQYVLD 134
Query: 123 CE---HHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS-----------YS 168
C+ H NG P+ H+ V +Y + F K+ +
Sbjct: 135 CDRKDHGCNGGEPTKAFDFLHSTGTVLTSCVDYTAGADNVVKFCPKTCDDGSAVENVFAA 194
Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
S S + + HGPV F V D + YKSG +
Sbjct: 195 SGSKSGSAIDVLLSHGPVVATFNVAQDFMYYKSGVY------------------------ 230
Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
++ G LGGHA+ ++G+G + S YW + NSW DWG++G F+I
Sbjct: 231 -------------QHRWGVWLGGHAVEVVGYGVTD-SGLDYWTVRNSWGPDWGEDGYFRI 276
Query: 289 LRGKDECGIE 298
+RG DECGIE
Sbjct: 277 VRGSDECGIE 286
>gi|114153242|gb|ABI52787.1| cathepsin B-like protein [Argas monolakensis]
Length = 91
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 40/59 (67%), Positives = 47/59 (79%), Gaps = 2/59 (3%)
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
+GGHAIRI+GWG +E YWL+ANSWN +WGDNG FKILRG +ECGIE I AG+PK
Sbjct: 34 MGGHAIRIIGWGVEEDVP--YWLVANSWNREWGDNGYFKILRGSNECGIEDDIVAGIPK 90
>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
Length = 260
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 82/177 (46%), Gaps = 45/177 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRE--CQENYDVPYKKDLNFGAKSYSVS- 170
GC+PY+I PC H+ NG +C + + RE +NY V Y+ DL+ + Y S
Sbjct: 124 GCQPYKIRPCNHYGNGNLKNCSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSW 183
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+N K I +EI +GPV V+++ + YK G
Sbjct: 184 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEG---------------------------- 215
Query: 231 EGAFTVFDDLILYKS--GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
+YKS G+ +G H ++++GWG D E YWL NSWN++WG NGL
Sbjct: 216 -----------IYKSTAGELIGYHHVKLIGWGVDGDGTE-YWLAMNSWNSNWGTNGL 260
>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
Length = 330
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 79/167 (47%), Gaps = 42/167 (25%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
C+PY + PCE + G SC TP C + CQ Y Y+KD ++ Y + +E
Sbjct: 194 CKPYHLHPCE--ITGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDE 251
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +E+ ++GPV+ AFT ++D Y+ G
Sbjct: 252 KAIQREMMKNGPVQAAFTTYEDFSFYRKG------------------------------- 280
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
+ ++ G+ G HA++++GWG + + KYW +ANSW+TDW
Sbjct: 281 ------IYVHSYGRQRGAHAVKVVGWGVENGT--KYWNVANSWSTDW 319
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 20/33 (60%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
R CG GCNGG AW Y + G+V+GG Y K
Sbjct: 159 RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEK 191
>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
Length = 432
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 114/294 (38%), Gaps = 92/294 (31%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
P R+ + + DLP +F++ KW I E+ DQG CG+ W +A
Sbjct: 170 PTFRVKSMTRLTNPSNDLPRSFNAVEKWST--FISEVPDQGWCGASWVLSTTSVASDRFA 227
Query: 123 -------------------------CE----------HHVNGT-----------RPSCDA 136
C+ H NG R +C
Sbjct: 228 IQSQGKEVVQLSAQNILSCTRRQQGCDGGHLDAAWRYMHKNGVLDANCYPYIQQRDTCKV 287
Query: 137 SKGHTPKCVRE--CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD 194
+ H + ++ CQ + V G +YS+S E IM EIY GPV+ TV+
Sbjct: 288 QR-HRGRSLKAYGCQPAHGVNRDNFYTVG-PAYSLS-READIMAEIYHSGPVQATMTVYR 344
Query: 195 DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAI 254
D Y SG + + TA + G A G H++
Sbjct: 345 DFFSYSSGVY-----QHTAAN-----------------------------RGAATGFHSV 370
Query: 255 RILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+++GWGE E + KYW+ ANSW WG+ G F+ILRG +ECGIE + A P +
Sbjct: 371 KLVGWGE-EHNGVKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWPHV 423
>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 363
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 72/250 (28%), Positives = 99/250 (39%), Gaps = 76/250 (30%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRP-----------YEIAP 122
P ++D R ++P+C I E+ DQGSCGSCW CR +
Sbjct: 144 PESYDFREEYPHC--ITEVVDQGSCGSCWAFSSIQTFADHRCRSGLDATGVSYSVQYVLD 201
Query: 123 CE---HHVNGTRPSCDASKGHTPKCVRECQENYDV---------PYKKDLNFGAKSYSVS 170
C+ H NG P + H V Y P K D ++ +
Sbjct: 202 CDRKDHGCNGGEPVNAFNFLHNTGTVLTSCVEYTAGDDAVVKFCPQKCDDGSAVENIVAT 261
Query: 171 SNEKS--IMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
S KS + + HGPV F V D + YKSG +
Sbjct: 262 SGAKSGSAIDVLLAHGPVVATFNVAQDFMYYKSGVY------------------------ 297
Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
++ G LGGHA+ I+G+G + S YW + NSW DWG++G F+I
Sbjct: 298 -------------QHRWGVWLGGHAVEIVGYGVTD-SGLDYWTVRNSWGPDWGEDGYFRI 343
Query: 289 LRGKDECGIE 298
+RG DECGIE
Sbjct: 344 VRGGDECGIE 353
>gi|33327024|gb|AAQ08887.1| cathepsin C [Homo sapiens]
Length = 463
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 72/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)
Query: 84 LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
LP ++D W N I +R+Q SCGSC+ P
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286
Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
E+ C H G +C G C + +E+ Y +
Sbjct: 287 EVVSCSQHAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
++ Y NE + E+ HGP+ AF V+DD + YK G + G
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+RD F F+ L HA+ ++G+G D S YW++ NSW T
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WG+NG F+I RG DEC IES A P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459
>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
cuniculus]
Length = 467
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 83/308 (26%), Positives = 116/308 (37%), Gaps = 101/308 (32%)
Query: 64 YNLPANRLPE-LIGYSEV------DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-- 114
Y L NR P ++ +E+ E LP F++ KWPN I E DQG+C W
Sbjct: 176 YRLGTNRPPSSVMNMNEIYTGLGSGEVLPTAFEASEKWPN--LIHEPLDQGNCAGSWAFS 233
Query: 115 --------------------CRPYEIAPCE-HHVNGTR------------------PSCD 135
P + C+ HH G R C
Sbjct: 234 TAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHHQQGCRGGRLDGAWWFLRRRGVVSDHCY 293
Query: 136 ASKGH-------TPKCVREC--------QENYDVP----YKKDLNFGAKSYSVSSNEKSI 176
GH P C+ Q P + D+ +Y + SNEK I
Sbjct: 294 PFSGHEQDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVHANDIYQVTPAYRLGSNEKEI 353
Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
MKE+ E+GPV+ V +D LY+ G + T +SL +
Sbjct: 354 MKELLENGPVQALMEVHEDFFLYQGGIY-----SHTPVSLER------------------ 390
Query: 237 FDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ + G H+++I GWGE+ + KYW ANSW WG+ G F+ILRG +
Sbjct: 391 ------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRILRGTN 444
Query: 294 ECGIESSI 301
EC IES +
Sbjct: 445 ECDIESFV 452
>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
Length = 349
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 71/271 (26%), Positives = 102/271 (37%), Gaps = 84/271 (30%)
Query: 78 SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------C 115
+++E +P +FDSR KWPNC I IRDQ CGSCW
Sbjct: 119 QDLNETIPESFDSRDKWPNC--IHGIRDQQLCGSCWAFASSAFLSDRFCIHSEGQINEDL 176
Query: 116 RPYEIAPCEHHVNG------------------TRPSCDASKGHTPKCVRECQENYDVPYK 157
P ++ C + G C C +CQ N PY
Sbjct: 177 SPQDLVSCSYENFGCSGGQLTESVDFLIYEGIVSEKCKPYMNQDTYCKFKCQ-NDKQPYT 235
Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
K KS + S+ + I E+ +GP+ +V++DL+ YK G +
Sbjct: 236 KYF-CEQKSMLILSDIEEIQLELMTNGPMMVGLSVYEDLMNYKEGVYE------------ 282
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
Y +G +GGHAI+I+GWG EK E +W N W
Sbjct: 283 -------------------------YTTGNQVGGHAIKIIGWGHTEKG-ELFWKCQNQWG 316
Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPKL 308
DWG G I G E G+++ + +P +
Sbjct: 317 KDWGMGGYINIKAG--ELGMDTMVLGCMPDI 345
>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
Length = 387
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 116/291 (39%), Gaps = 87/291 (29%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------------- 113
P R+ + + + LP +F+S KW + I ++ DQG CGS W
Sbjct: 125 PTYRVKAMSRLHNIVDHLPRSFNSIDKWAS--YISDVLDQGWCGSSWVISTASVASDRFA 182
Query: 114 ----GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDV-----PY-------- 156
G +++P ++ ++ TR + GH R + V PY
Sbjct: 183 IQSRGKEVIQLSP-QNILSCTRRQQGCNGGHLDAAWRYLHKQGVVDESCYPYVGYRDACK 241
Query: 157 -----KKDLNFGAKSYS--------------VSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
+ N G +SYS +NE IM EI+ GPV+ TV+ D
Sbjct: 242 IPHNSRSLRNNGCRSYSGVDRDELYTVGPAYSLNNETDIMAEIFMSGPVQATLTVYRDFF 301
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
Y G + TA S G +G H+++++
Sbjct: 302 SYSGGIY-----RHTAAS-----------------------------RGSPVGFHSVKLI 327
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
GWGE E KYW+ NSW T WG++G F+ILRG +ECGIE + A P +
Sbjct: 328 GWGE-EHDGNKYWIATNSWGTWWGEHGNFRILRGSNECGIEEYVLAAWPNV 377
>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
Length = 469
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 53/148 (35%), Positives = 73/148 (49%), Gaps = 40/148 (27%)
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
A Y VSS E +IMKEI + GPV+ V++D LYK G +
Sbjct: 357 ASHYRVSSKETNIMKEIMDKGPVQAIMKVYEDFFLYKEGIY------------------- 397
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG---EDEKSKEKYWLIANSWNTDW 280
SQ K+G H++++LGWG + K+K+W+ ANSW W
Sbjct: 398 RHSQ----------------KAGSKWKTHSVKLLGWGALADKNGQKQKFWIAANSWGKSW 441
Query: 281 GDNGLFKILRGKDECGIESSI--TAGVP 306
G+NG F+ILRG++EC IE I T+G P
Sbjct: 442 GENGYFRILRGQNECDIEKLILATSGQP 469
>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 288
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 117/291 (40%), Gaps = 96/291 (32%)
Query: 60 VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----- 114
+ PD +P R P+ ++ +P +++ ++P C + DQG CGSCW
Sbjct: 51 LRPD-TIPLARPPK------INISIPMSYNFTERFPQCDF--GVLDQGKCGSCWSFAVSK 101
Query: 115 ------CRPY---------EIAPCEHH---------VNGTR---------PSCDASKGHT 141
CR Y + C+ VN R SC G+
Sbjct: 102 SFSHRYCRKYNKPVLFSQSHLVACDRRNSGCGGGIEVNAWRYIDLRGLPLDSCQPYDGNI 161
Query: 142 PK--CVREC---QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
K C ++C E Y+ + + + A+ S+ + IM E GPV + V+ DL
Sbjct: 162 TKYNCSKKCTNESETYEAQFTEYWSV-ARYASIEEMQIGIMTE----GPVTTSLKVYSDL 216
Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRI 256
+ YKSG + + G+ LG HA+ I
Sbjct: 217 MYYKSG-------------------------------------IYTHTKGEFLGHHAVEI 239
Query: 257 LGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
+GWG K+ YW+I+NSWNT WG NGLF I RG +EC IE + AG K
Sbjct: 240 IGWGT--KNGIDYWIISNSWNTTWGMNGLFLIKRGVNECHIEDYVCAGKVK 288
>gi|126327832|ref|XP_001363345.1| PREDICTED: dipeptidyl peptidase 1-like [Monodelphis domestica]
Length = 462
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 100/248 (40%), Gaps = 76/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ +I C + G
Sbjct: 246 VSPVRNQASCGSCYAFASMAMLEARIRILTNNSKTPVLSTQQIVSCSEYSQGCDGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C GH C + Y Y D ++ Y + NE +
Sbjct: 306 IAGKYVQDFGVVEENCFPYLGHDSPCSPKNCTRY---YVSDYHYVGGFYG-ACNEALMKL 361
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ E+GP+ AF V++D I Y+ G + G +RD +F F+
Sbjct: 362 ELVENGPMAVAFEVYNDFIHYQKGVYHHTG------------LRD---------SFNPFE 400
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
+ HA+ ++G+G DEK+ E YW++ NSW + WG++G F+ILRG DECGIE
Sbjct: 401 ----------ITNHAVLLVGYGTDEKTGEHYWIVKNSWGSYWGEDGYFRILRGTDECGIE 450
Query: 299 SSITAGVP 306
S + P
Sbjct: 451 SIAVSATP 458
>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 105
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 75/140 (53%), Gaps = 39/140 (27%)
Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
K+Y + ++ K+I K+I ++GPV +TV++D Y+SG
Sbjct: 2 KAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSG---------------------- 39
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
+ +K+G+ G HA++++GWGE++ + YW++ANSW+ DWG+NG
Sbjct: 40 ---------------IYKHKAGRKTGLHAVKVIGWGEEKGTP--YWIVANSWHDDWGENG 82
Query: 285 LFKILRGKDECGIESSITAG 304
F++ RG ++CG E + AG
Sbjct: 83 FFRMHRGSNDCGFEERMAAG 102
>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
[Tribolium castaneum]
Length = 453
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 77/244 (31%), Positives = 105/244 (43%), Gaps = 38/244 (15%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTP 142
LP FDS KWP + EI+DQG CGS W +A + + R S H
Sbjct: 205 LPREFDSEFKWPG--WMSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLL 262
Query: 143 KCVRECQENYDVPY--------KKDLNFGAKSYSVS-SNEKSIMKEIYEHGPVEGAF--- 190
C R Q++ + Y +K + + S +NEK I G + A
Sbjct: 263 SCDRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPYSATNEKC---RIPRRGDLVTANCQL 319
Query: 191 -TVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG--- 246
T D YK + GNET M I + V+ D YK G
Sbjct: 320 PTNVDRRSKYKVAPAYRVGNETDIMYEI-------LHSGPVQATMKVYHDFFTYKRGIYR 372
Query: 247 -------KALGGHAIRILGWGEDEKSK--EKYWLIANSWNTDWGDNGLFKILRGKDECGI 297
G H++RI+GWGE+ + +KYW +ANSW +WG+NG F+ILRG +EC I
Sbjct: 373 HSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEI 432
Query: 298 ESSI 301
ES +
Sbjct: 433 ESFV 436
>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
Length = 289
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 76/249 (30%), Positives = 109/249 (43%), Gaps = 45/249 (18%)
Query: 60 VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE 119
VHP NLP +P + ++FD+RTKW C + IRDQ CGSCW E
Sbjct: 66 VHPINNLPKKTMP-------ANLKAASSFDARTKWGKC--VHPIRDQQQCGSCWAFSASE 116
Query: 120 IAPCEHHVNGTRPSCDASKGH-----TPKCVRECQENYDVPYKKDLNF--GAKSYSVSSN 172
+ + C AS G +P+ + +C Y D + A ++ +
Sbjct: 117 VL--------SDRFCIASNGSVDVVLSPEYMLQCDST---DYGCDGGYLNNAWAFLAGTG 165
Query: 173 EKSIMKEIYE--HGPVEGAFTVFDD---LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQ 227
S + Y +G V T D + LYK+ + +S I +D +
Sbjct: 166 IPSDKCDPYTSGNGDVGSCPTSCTDGSAIKLYKA-----KSSSVAQLSSIDDIQKDIQAN 220
Query: 228 LGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEK-YWLIANSWNTD 279
+ AF+V+ D YKSG GGHAI+I+GWG K+ YW++ANSWNT+
Sbjct: 221 GPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIKIVGWGVTSDGKDTPYWIVANSWNTN 280
Query: 280 WGDNGLFKI 288
WG G F I
Sbjct: 281 WGQEGFFWI 289
>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 313
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 64/236 (27%), Positives = 98/236 (41%), Gaps = 52/236 (22%)
Query: 73 ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP 132
ELI S + E N + R+ W + + G S GC+P++ P + + +
Sbjct: 129 ELISCSGIKET-NGNVNERSIWEYLKS-HGVVSGGKYNSNDGCQPFKFPPIANILTHLQH 186
Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
+CD C N + Y D Y++ + I KE+ +GPV F V
Sbjct: 187 TCD----------DHCYGNTSINYNHDHVRVRNYYTIRTG--YIQKEVQTYGPVAVQFKV 234
Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
DD +LYKSG + N K +
Sbjct: 235 CDDFLLYKSGVYVKSDN------------------------------------AKVIRTQ 258
Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+++GWG + + YWL+ NSW +WG GLFKI RG ++CG+ES + AGVP++
Sbjct: 259 YAKLIGWGVE--NGVDYWLVINSWGHEWGQKGLFKIKRGTNQCGVESVVYAGVPEI 312
>gi|339235559|ref|XP_003379334.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
gi|316978005|gb|EFV61034.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
Length = 465
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 109/268 (40%), Gaps = 78/268 (29%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV---NGTRPS------- 133
LP FD R N I ++RDQ +CGSC+ + +H+ N R +
Sbjct: 232 LPEKFDWRNNNGN-NFIGDVRDQKNCGSCYAFASASMLEARYHILTQNRERVTFSPQDVV 290
Query: 134 -------------------------------CDASKGHTPKCV--RECQENYDVPYKKDL 160
C A G +C C+ Y Y+
Sbjct: 291 NCSPYSQGCDGGFSYLIAGKYAEDYGMVSERCVAYTGKQQQCRTPSTCERYYATDYEY-- 348
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
Y +SNE +M+ + ++GP+ F V DD + Y G + + T+A+S +KW
Sbjct: 349 ---IGGYYGASNEILMMQALVKNGPIAVGFEVHDDFLSYSHGIY----HYTSAVSPLKWN 401
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
F ++ HA+ I+G+G DE +KEKYW++ NSW +
Sbjct: 402 ---------------PFVEV----------NHAVIIVGYGTDEMTKEKYWIVKNSWGRKF 436
Query: 281 GDNGLFKILRGKDECGIESSITAGVPKL 308
G++G F+I RG +ECGIES P +
Sbjct: 437 GEDGYFRIRRGTNECGIESLAFQATPII 464
>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 365
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 88/207 (42%), Gaps = 60/207 (28%)
Query: 114 GCRPYEIAPCEHHVNGT--RPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSY-SV 169
GC PY C HH + +P C TP C C Y + KD ++ + S
Sbjct: 208 GCWPYNFPKCAHHQKESDYKP-CAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSR 266
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
+ SI KEI +GP A
Sbjct: 267 FGSTSSIKKEIMTNGPTSAA---------------------------------------- 286
Query: 230 AEGAFTVFDDLILYKSGKA-------LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
F+V++D + YKSG LGGHA+ I+GWG ++ YWL+ NSWN +WGD
Sbjct: 287 ----FSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVD--YWLVMNSWNEEWGD 340
Query: 283 NGLFKILRGKDECGIESSITAGVPKLD 309
+G FKI++G +CGI+ I AG P ++
Sbjct: 341 HGTFKIVQG--DCGIDDMILAGTPAIN 365
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 26/46 (56%), Gaps = 1/46 (2%)
Query: 70 RLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
L E + +E D+P +FD+R + C I +RDQ +CGSCW
Sbjct: 86 ELEEKVYPAEELVDIPDSFDARDAFKECKDVIGHVRDQSACGSCWA 131
>gi|321476473|gb|EFX87434.1| hypothetical protein DAPPUDRAFT_221708 [Daphnia pulex]
Length = 464
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 67/271 (24%), Positives = 104/271 (38%), Gaps = 82/271 (30%)
Query: 82 EDLPANFDSRTKWPNCPTIREI---RDQGSCGSCWG----------------------CR 116
E LP +D W N + + ++QGSCGSC+
Sbjct: 228 EFLPEEWD----WRNVSGVNYVPVVKNQGSCGSCYAFSSMGMLESRLRVATKNQVQVNLS 283
Query: 117 PYEIAPCEHHVNG-------------------TRPSCDASKGHTPKC--VRECQENYDVP 155
P +I C + G C G C ++CQ +Y
Sbjct: 284 PQDIVSCSAYSQGCEGGFPYLIAGKYAQDHGVVAEECYPYTGRDSACSAAKKCQRSYVAK 343
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
Y+ Y + NE+ + + E GP+ +F V+ D + Y G +
Sbjct: 344 YRY-----VGGYYGACNEELMKMSLVESGPLSVSFEVYSDFMHYAGGVYH---------- 388
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
+G F ++ ++ L HA+ ++G+G D ++KEKYW++ NS
Sbjct: 389 -------------RTDGLFNKINEFNPFE----LTNHAVLLVGYGTDSQTKEKYWIVKNS 431
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
W T WG++G F+I RG DECGIES P
Sbjct: 432 WGTKWGEDGFFRIRRGVDECGIESIAVEVTP 462
>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
Length = 271
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 75/149 (50%), Gaps = 32/149 (21%)
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
Y+ D+ Y +S++EK IMKEI ++GPV+ V +D +Y SG + + T +S
Sbjct: 137 YQNDIYQSTPPYRLSTSEKEIMKEIQDNGPVQAIMEVHEDFFMYNSGIY-----KHTDVS 191
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLI 272
K + G H+++I GWGE+ + KYW+
Sbjct: 192 FTK------------------------PPHYRKHGTHSVKITGWGEERNFDGTTRKYWIA 227
Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
ANSW +WG+NG F+I RG++EC IE+ +
Sbjct: 228 ANSWGKNWGENGYFRIARGENECEIEAFV 256
>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
Length = 473
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/274 (29%), Positives = 104/274 (37%), Gaps = 88/274 (32%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTP 142
LP +FD+R KWP I DQG CG+ W +A + + D S H
Sbjct: 190 LPNSFDARNKWPG--WISGPADQGWCGASWAVSTASVASDRYAIMSKGLTKVDLSPQHLL 247
Query: 143 KC---VRECQ-----------------ENYDVPYK---------KDLNFGAKSY----SV 169
C R CQ ++Y P+ K NF A S S+
Sbjct: 248 SCNKGQRGCQGGHLSRAWTFIRKFGLVDDYCYPWTGTPTKCKIPKRPNFDALSSICPPSL 307
Query: 170 SSN----------------EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
SN EK IM+EI + GPV+ V+ D YKSG +
Sbjct: 308 GSNLRSELYRVGPAYKIQDEKDIMEEIMQSGPVQATMKVYQDFFSYKSGVY--------- 358
Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYW 270
+ NT + G H+++ILGWGE+ KYW
Sbjct: 359 -------TKSNTE-----------------RESSNFGYHSVKILGWGEETNIYGQPIKYW 394
Query: 271 LIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
L ANSW WG+NG FKI RG +EC IE + A
Sbjct: 395 LAANSWGQQWGENGFFKIRRGTNECEIEEFVLAA 428
>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
Length = 466
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/271 (25%), Positives = 101/271 (37%), Gaps = 89/271 (32%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
P FD R N + +R+QG+CGSC+ P ++
Sbjct: 235 FPKQFDWRNV-SNVNYVSPVRNQGACGSCYAFSSMAMYEARLRVLSKNSVKRVMSPQDVV 293
Query: 122 PCEHHVNG-------------------TRPSC-------DASKGHTPKCVRECQENYDVP 155
C + G SC + K KC R NY
Sbjct: 294 SCSEYAQGCAGGFPYLIAGKYGEDFGLVEESCFPYNGKDEPCKETKSKCRRHSTTNY--- 350
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
+ + + NE +M+E+ ++GP+ +F V+ D YK G + G S
Sbjct: 351 ------YYVGGFYGACNEYLMMRELVKNGPISISFEVYGDFKHYKGGIYQHTG---LGDS 401
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
W I + HA+ ++G+G D+KS + YW++ NS
Sbjct: 402 YNPWQITN----------------------------HAVLLVGYGTDQKSGKDYWIVKNS 433
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
W T WG+NG F+ILRG DEC IE+ A P
Sbjct: 434 WGTKWGENGFFRILRGVDECSIENEAVAVTP 464
>gi|301779281|ref|XP_002925058.1| PREDICTED: dipeptidyl peptidase 1-like [Ailuropoda melanoleuca]
gi|281337582|gb|EFB13166.1| hypothetical protein PANDA_014484 [Ailuropoda melanoleuca]
Length = 461
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 76/266 (28%), Positives = 104/266 (39%), Gaps = 74/266 (27%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
LPA++D R + +R+Q SCGSC+ P E+
Sbjct: 229 LPASWDWRNV-HGTNFVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQTPILSPQEVV 287
Query: 122 PCEHHVNGTR---PSCDASK-GHTPKCVRECQENY---DVP----------YKKDLNFGA 164
C + G P A K V E Y D P Y D ++
Sbjct: 288 SCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYMGADFPCKPKKDCFRYYSSDYHYVG 347
Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
Y NE + E+ HGP+ AF V+DD Y++G ++ G +RD
Sbjct: 348 GFYG-GCNEALMKLELVHHGPIAVAFQVYDDFFHYRTGIYYHTG------------LRD- 393
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
F F+ L HA+ ++G+G D S YW++ NSW WG+NG
Sbjct: 394 --------PFNPFE----------LTNHAVLLVGYGTDTASGMDYWIVKNSWGAGWGENG 435
Query: 285 LFKILRGKDECGIESSITAG--VPKL 308
F+I RG DEC IES A VPKL
Sbjct: 436 YFRIRRGTDECAIESIAVAATPVPKL 461
>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
Length = 341
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/219 (31%), Positives = 89/219 (40%), Gaps = 59/219 (26%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTR-PS--CDASKGHTPKCVRECQE-NYDVPY 156
R + G GS GC+P+ I PC H V R PS C K TP+C C NY P+
Sbjct: 171 RGLVTGGDYGSNEGCQPWLIPPCNHTVMDERSPSYMCGKYKSETPQCTLNCYNPNYSKPF 230
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSL 216
KD++ G + S I E+ +HGP
Sbjct: 231 LKDISKGIRIDWHCSG--MIRNELKKHGP------------------------------- 257
Query: 217 IKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKY 269
A V++D + YKSG K LG ++++GWG + Y
Sbjct: 258 -------------ATAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVYRGVQ--Y 302
Query: 270 WLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
WL ANSW T WGD G FKI RG +EC E +G P L
Sbjct: 303 WLAANSWGTSWGDKGFFKIRRGYNECLFEDYFISGRPVL 341
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/33 (60%), Positives = 26/33 (78%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GCNGG+ G AW+YW+K G+V+GG YGS +
Sbjct: 152 CGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEG 184
>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 360
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/250 (28%), Positives = 99/250 (39%), Gaps = 76/250 (30%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRP-----------YEIAP 122
P ++D R ++P+C I E+ DQG+CGSCW CR +
Sbjct: 141 PESYDFRDEYPHC--ITEVVDQGNCGSCWAFSSVQTFADHRCRSGLDATGVSYSVQYVLD 198
Query: 123 CE---HHVNGTRPSCDASKGHTPKCVRECQENYDV---------PYKKDLNFGAKSYSVS 170
C+ H NG P + H V Y P K D ++ +
Sbjct: 199 CDRKDHGCNGGEPVNAFNFLHNTGTVLASCVGYTAGDDAVVKFCPQKCDDGSAVENVVAT 258
Query: 171 SNEKS--IMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
S KS + + HGPV F V D + YKSG +
Sbjct: 259 SGSKSGSAIDVLLAHGPVVATFNVAQDFMYYKSGVY------------------------ 294
Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
++ G LGGHA+ I+G+G + S YW + NSW DWG++G F+I
Sbjct: 295 -------------QHRWGLWLGGHAVEIIGYGVTD-SGLDYWTVRNSWGPDWGEDGYFRI 340
Query: 289 LRGKDECGIE 298
+RG DECGIE
Sbjct: 341 VRGGDECGIE 350
>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
Length = 470
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 68/137 (49%), Gaps = 31/137 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS E+ I E+ +GPV+ F V +D +Y G + D +
Sbjct: 336 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY---------------QHSDLAA 380
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNG 284
Q GA S A G H++R+LGWG D + KYWL ANSW T WG++G
Sbjct: 381 QKGA--------------SSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDG 426
Query: 285 LFKILRGKDECGIESSI 301
FKILRG++ C IES +
Sbjct: 427 YFKILRGENHCEIESFV 443
>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
rotundata]
Length = 442
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 107/252 (42%), Gaps = 47/252 (18%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRPSCDASKG 139
E LP FDSRT+WP I +I DQG CG+ W ++A + GT + + S
Sbjct: 198 ESLPREFDSRTRWPR--DISKITDQGWCGASWAISSAQVASDRFAIMSKGT-DAVELSAQ 254
Query: 140 HTPKCVRECQE-----NYDVPYKKDLNFG----------AKSYSVSSNEKSIMKEIYEHG 184
H C Q+ + D + FG A + + +++ ++
Sbjct: 255 HLLSCNNRGQQGCSGGHLDRAWMFMRRFGLVDENCYPWKASTETCRLRKRTDLRSAGCAP 314
Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
P T LYK G + NET M I + + V+ D Y+
Sbjct: 315 PPNPLRTE-----LYKVGPAYRLANETDIMQEI-------LTSGPVQATMRVYQDFFSYE 362
Query: 245 SGKALGG----------HAIRILGWGED-----EKSKEKYWLIANSWNTDWGDNGLFKIL 289
SG H++RI+GWGE+ + KYWL+ANSW WG+NGLF+I
Sbjct: 363 SGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSRNTPLKYWLVANSWGQQWGENGLFRIQ 422
Query: 290 RGKDECGIESSI 301
+G +EC IES +
Sbjct: 423 KGTNECEIESFV 434
>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
Flags: Precursor
gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
Length = 452
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 69/137 (50%), Gaps = 31/137 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS E+ I E+ +GPV+ F V +D +Y G + D +
Sbjct: 318 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY---------------QHSDLAA 362
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNG 284
Q GA S A G H++R+LGWG D + + KYWL ANSW T WG++G
Sbjct: 363 QKGA--------------SSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDG 408
Query: 285 LFKILRGKDECGIESSI 301
FK+LRG++ C IES +
Sbjct: 409 YFKVLRGENHCEIESFV 425
>gi|341891358|gb|EGT47293.1| hypothetical protein CAEBREN_29072 [Caenorhabditis brenneri]
Length = 349
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 67/140 (47%), Gaps = 31/140 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS E+ I E+ +GPV+ F V +D +Y G + D +
Sbjct: 215 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY---------------QHSDLAA 259
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNG 284
Q GA S A G H++R+LGWG D + KYWL ANSW T WG++G
Sbjct: 260 QKGA--------------SSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDG 305
Query: 285 LFKILRGKDECGIESSITAG 304
FKILRG + C IES +
Sbjct: 306 YFKILRGDNHCEIESFVVGA 325
>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
Length = 526
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 68/137 (49%), Gaps = 31/137 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS E+ I E+ +GPV+ F V +D +Y G + D +
Sbjct: 392 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY---------------QHSDLAA 436
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNG 284
Q GA S A G H++R+LGWG D + KYWL ANSW T WG++G
Sbjct: 437 QKGA--------------SSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDG 482
Query: 285 LFKILRGKDECGIESSI 301
FKILRG++ C IES +
Sbjct: 483 YFKILRGENHCEIESFV 499
>gi|403340695|gb|EJY69640.1| Cathepsin B [Oxytricha trifallax]
Length = 247
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 113/263 (42%), Gaps = 51/263 (19%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHH 126
P +PE ++++ +P FDSR +W NC + IRDQ CGSCW E
Sbjct: 15 PVEGIPEPAQHNDI---VPKTFDSREQWGNC--VHPIRDQAQCGSCWAFGASETL----- 64
Query: 127 VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAK------SYSVSSNEKSIMKEI 180
+ C AS T + D+ N G ++S +N ++
Sbjct: 65 ---SDRICIASDKKTDVILSP----EDLVACDGWNMGCNGGILPWAWSYLTNTGAVEDSC 117
Query: 181 YEHGPVEGAFTVF--------DDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
+ + +GA D YK + V + + + IK I N E
Sbjct: 118 FPYSSDKGAVPTCAKKCQNDKDSFTKYKCKKNSVV--QASGVDKIKAEISKNGPM---ET 172
Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
FTV++D + Y+SG LGGHA++I+G+G+ YW+ ANSW+ WG+ G
Sbjct: 173 GFTVYEDFMNYESGVYHHTTGNQLGGHAVKIVGYGD------GYWICANSWSEKWGEKGF 226
Query: 286 FKILRGKDECGIESSITAGVPKL 308
F I G ECGI+S+ A P L
Sbjct: 227 FNI--GFGECGIDSAAYACTPDL 247
>gi|354459545|pdb|3PDF|A Chain A, Discovery Of Novel Cyanamide-Based Inhibitors Of Cathepsin
C
Length = 441
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/253 (26%), Positives = 98/253 (38%), Gaps = 77/253 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 222 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 281
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +E+ Y + ++ Y NE +
Sbjct: 282 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 338
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + YK G + G +RD F F+
Sbjct: 339 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 377
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 378 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIE 427
Query: 299 SSITAG--VPKLD 309
S A +PKL+
Sbjct: 428 SIAVAATPIPKLE 440
>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 298
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 113/252 (44%), Gaps = 62/252 (24%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPK 143
+P +FD R ++P+C I E+ DQG CGSCW S AS G
Sbjct: 74 VPDSFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVASVGD--- 111
Query: 144 CVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDDLI 197
R C D KK + + + Y VS + + + ++ G T D+ +
Sbjct: 112 --RRCVAGLD---KKAVRY-SPQYVVSCDRGDMACDGGWLPSVWRFLVKTGTTT--DECV 163
Query: 198 LYKSGRFFVPGN------ETTAMSLIKWT------------IRDNTSQLGAEGAFTVFDD 239
Y+SG G + + + + K T ++ + + AFTV+ D
Sbjct: 164 PYQSGSTGARGTCPTKCADGSELPIYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSD 223
Query: 240 LILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ Y+ G +A GGHA+ ++G+G DE + YW+I NSW DWG++G F+I+R
Sbjct: 224 FMYYQGGVYQHVYGRAEGGHAVEMVGYGTDEYDVD-YWIIRNSWGPDWGEDGYFRIIRMT 282
Query: 293 DECGIESSITAG 304
+ECGIE + G
Sbjct: 283 NECGIEEQVIGG 294
>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
Length = 198
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 73/164 (44%), Gaps = 39/164 (23%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CR YEI PC +H N S TP C + C+ Y Y D +G +Y + ++
Sbjct: 72 CRSYEIHPCGYHGNEPFYGHCHSMARTPPCKKRCRPGYKNSYMMDKRYGTSAYELPNSVX 131
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I ++I E+GPV F V++D YKSG
Sbjct: 132 AIQRDIMENGPVVAGFDVYEDFKYYKSG-------------------------------- 159
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGED--EKSKEKYWLIANSW 276
+ + +GK GGHA++++GWGE+ E YW+IANSW
Sbjct: 160 -----IYRHTAGKXTGGHAVKVIGWGEEXTENGTIPYWIIANSW 198
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC GG+P AW+Y V G+V+GG +G K+
Sbjct: 39 CGAGCEGGWPIEAWKYGVTEGVVTGGNFGRKE 70
>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
Length = 466
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 67/137 (48%), Gaps = 31/137 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS E+ I E+ +GPV+ F V +D +Y G + D +
Sbjct: 332 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY---------------QHSDLAA 376
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNG 284
Q GA S A G H++R+LGWG D + KYWL ANSW T WG++G
Sbjct: 377 QKGA--------------SSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDG 422
Query: 285 LFKILRGKDECGIESSI 301
FKILRG + C IES +
Sbjct: 423 YFKILRGDNHCEIESFV 439
>gi|62897637|dbj|BAD96758.1| cathepsin C isoform a preproprotein variant [Homo sapiens]
Length = 463
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)
Query: 84 LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
LP ++D W N I +R+Q SCGSC+ P
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286
Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
E+ C + G +C G C + +E+ Y +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
++ Y NE + E+ HGP+ AF V+DD + YK G + G
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+RD F F+ L HA+ ++G+G D S YW++ NSW T
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WG+NG F+I RG DEC IES A P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459
>gi|17933071|gb|AAL48192.1| cathepsin C [Homo sapiens]
Length = 463
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 94/248 (37%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +E+ Y + ++ Y NE +
Sbjct: 306 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + YK G + G +RD F F+
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>gi|119579767|gb|EAW59363.1| cathepsin C, isoform CRA_a [Homo sapiens]
Length = 316
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 94/248 (37%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 99 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 158
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +E+ Y + ++ Y NE +
Sbjct: 159 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 215
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + YK G + G +RD F F+
Sbjct: 216 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 254
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 255 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIE 304
Query: 299 SSITAGVP 306
S A P
Sbjct: 305 SIAVAATP 312
>gi|317373330|sp|P53634.2|CATC_HUMAN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|17933069|gb|AAL48191.1| cathepsin C [Homo sapiens]
Length = 463
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)
Query: 84 LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
LP ++D W N I +R+Q SCGSC+ P
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286
Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
E+ C + G +C G C + +E+ Y +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
++ Y NE + E+ HGP+ AF V+DD + YK G + G
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+RD F F+ L HA+ ++G+G D S YW++ NSW T
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WG+NG F+I RG DEC IES A P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459
>gi|60827947|gb|AAX36820.1| cathepsin C [synthetic construct]
gi|61368416|gb|AAX43175.1| cathepsin C [synthetic construct]
Length = 464
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 94/248 (37%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +E+ Y + ++ Y NE +
Sbjct: 306 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + YK G + G +RD F F+
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>gi|54696504|gb|AAV38624.1| cathepsin C [synthetic construct]
gi|54696506|gb|AAV38625.1| cathepsin C [synthetic construct]
gi|61368207|gb|AAX43130.1| cathepsin C [synthetic construct]
gi|61368212|gb|AAX43131.1| cathepsin C [synthetic construct]
Length = 464
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)
Query: 84 LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
LP ++D W N I +R+Q SCGSC+ P
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286
Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
E+ C + G +C G C + +E+ Y +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
++ Y NE + E+ HGP+ AF V+DD + YK G + G
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+RD F F+ L HA+ ++G+G D S YW++ NSW T
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WG+NG F+I RG DEC IES A P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459
>gi|1582221|prf||2118248A prepro-cathepsin C
Length = 463
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)
Query: 84 LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
LP ++D W N I +R+Q SCGSC+ P
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286
Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
E+ C + G +C G C + +E+ Y +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
++ Y NE + E+ HGP+ AF V+DD + YK G + G
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+RD F F+ L HA+ ++G+G D S YW++ NSW T
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WG+NG F+I RG DEC IES A P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459
>gi|189083844|ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens]
gi|1006657|emb|CAA60671.1| cathepsin C [Homo sapiens]
gi|1947071|gb|AAC51341.1| prepro dipeptidyl peptidase I [Homo sapiens]
gi|60816242|gb|AAX36375.1| cathepsin C [synthetic construct]
gi|119579768|gb|EAW59364.1| cathepsin C, isoform CRA_b [Homo sapiens]
gi|158257666|dbj|BAF84806.1| unnamed protein product [Homo sapiens]
gi|261858568|dbj|BAI45806.1| cathepsin C [synthetic construct]
Length = 463
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)
Query: 84 LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
LP ++D W N I +R+Q SCGSC+ P
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286
Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
E+ C + G +C G C + +E+ Y +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
++ Y NE + E+ HGP+ AF V+DD + YK G + G
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+RD F F+ L HA+ ++G+G D S YW++ NSW T
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WG+NG F+I RG DEC IES A P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459
>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 82/201 (40%), Gaps = 57/201 (28%)
Query: 114 GCRPYEIAPCEHH-VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY CEH G + C K TPKC C + +P K G +Y +
Sbjct: 179 GCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDK-SIPLVKYR--GNATYLLLHG 235
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
E+ +E+Y +GP FV
Sbjct: 236 EEDYKRELYFNGP-------------------FV-------------------------A 251
Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
F V+ DL YKSG LGG A+RI+GWG+ + YW +ANSW+TDWG NG
Sbjct: 252 VFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKLNGTP--YWKVANSWDTDWGMNGY 309
Query: 286 FKILRGKDECGIESSITAGVP 306
IL G +EC IE G P
Sbjct: 310 MLILGGNNECNIEHLGFTGFP 330
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 40/78 (51%), Gaps = 6/78 (7%)
Query: 39 KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
K + NI + K G + + +LP R E ++ +LP +FDS KWPN
Sbjct: 47 KAVYNGKMQNITFSEAKRLTGAWIQKNSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102
Query: 97 CPTIREIRDQGSCGSCWG 114
CPTIREI DQ +C + W
Sbjct: 103 CPTIREIADQSACRASWA 120
>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
Length = 256
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 76/172 (44%), Gaps = 40/172 (23%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY + PC + +G KC ++C + D+ + KD + Y ++
Sbjct: 125 GCEPYRVPPCPYDKDGKNTCSGQPMEPNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY-- 182
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ I K++ +GP+E +F V+DD YKSG + N +
Sbjct: 183 RGIQKDVINYGPIEASFDVYDDFPNYKSGIYVKSENASY--------------------- 221
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
LGGH+++++GWGE+ YWL+ NSWN DWGD GL
Sbjct: 222 ---------------LGGHSVKLIGWGEEYGV--LYWLMVNSWNADWGDKGL 256
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 30/52 (57%), Gaps = 8/52 (15%)
Query: 70 RLPELIGYSEVDED--------LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
++P+ + Y+ D +P FD+R KW C TI E+RDQG+CGS W
Sbjct: 6 QIPDKVNYNMYKNDDHADNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDW 57
Score = 38.1 bits (87), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 22/33 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
CG GCNGG+P AW+ + G+V+GG Y S +
Sbjct: 93 CGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEG 125
>gi|242001446|ref|XP_002435366.1| cysteine proteinase, putative [Ixodes scapularis]
gi|215498696|gb|EEC08190.1| cysteine proteinase, putative [Ixodes scapularis]
Length = 238
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 72/144 (50%), Gaps = 31/144 (21%)
Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
F Y V +NE+ IM+EIY +GPV+ V +D LY SG V + A +L
Sbjct: 95 FSTPPYRVPANEEDIMQEIYANGPVQALMLVKEDFFLYSSG---VYKHTRLAHNLPPEYQ 151
Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED--EKSKEKYWLIANSWNTD 279
+ + H++RILGWG D + +KYWL ANSW +
Sbjct: 152 KSDW--------------------------HSVRILGWGVDRTQYRPQKYWLCANSWGSG 185
Query: 280 WGDNGLFKILRGKDECGIESSITA 303
WG+NG F+I+RG+DE IES + A
Sbjct: 186 WGENGYFRIVRGEDESQIESFVLA 209
>gi|194382330|dbj|BAG58920.1| unnamed protein product [Homo sapiens]
Length = 446
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 94/248 (37%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 229 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 288
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +E+ Y + ++ Y NE +
Sbjct: 289 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 345
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + YK G + G +RD F F+
Sbjct: 346 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 384
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 385 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIE 434
Query: 299 SSITAGVP 306
S A P
Sbjct: 435 SIAVAATP 442
>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
garnettii]
Length = 464
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 70/145 (48%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y +SSNE IMKEI ++GPV+ V +D YKSG + R S
Sbjct: 343 YRISSNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIY-----------------RHVAS 385
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
G + + L HA+++LGWG + KEK+W+ ANSW WG+N
Sbjct: 386 THGESENY------------RKLRTHAVKLLGWGTLRGAQGRKEKFWIAANSWGKSWGEN 433
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 434 GYFRILRGVNESDIEKLIIAAWGQL 458
>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
Length = 196
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 66/152 (43%), Gaps = 38/152 (25%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY I PC HH N T C + TP C +C Y PY D ++G +Y+V+
Sbjct: 72 GCKPYPIPPCGHHKNQTYFGPCPTDEYDTPVCTNKCIAAYKTPYSDDKHYGTSAYNVAKT 131
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
I KEI +GPVE A+TV++D Y G +
Sbjct: 132 VAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVY---------------------------- 163
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEK 264
+ G +GGHA+RILGWG ++
Sbjct: 164 ---------THTGGAEVGGHAVRILGWGVRQQ 186
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 27/37 (72%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
+ CG GC GG+P AW+YWVK+GI +GG+Y S+ K
Sbjct: 38 KKCGNGCEGGYPIEAWKYWVKTGICTGGSYESQSGCK 74
>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
glaber]
Length = 467
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 111/285 (38%), Gaps = 98/285 (34%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYE 119
E LP F++ KWPN I + DQG+C W P
Sbjct: 201 EVLPKAFEASKKWPN--MIHDPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPVLSPQN 258
Query: 120 IAPCE-HHVNGTR------------------PSCDASKGH--------TP---------- 142
+ C+ HH G + C GH TP
Sbjct: 259 LLSCDTHHQQGCQGGRLDGAWWFLRRRGVVSDHCYPFSGHEQAEAGPATPCMMHSRAMGR 318
Query: 143 ---KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILY 199
+ R C ++D ++ +Y + S+EK IMKE+ E+GPV+ V++D LY
Sbjct: 319 GKRQATRRCPNSHDD--ANEIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVYEDFFLY 376
Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
KSG + + +L+ +G + + G H+++I GW
Sbjct: 377 KSGIY--------SHTLVS---------MGRPEQY------------RRHGTHSVKITGW 407
Query: 260 GED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
GE+ + KYW ANSW WG+ G F+ILRG +EC IES +
Sbjct: 408 GEEMLPDGRTLKYWTAANSWGPSWGERGYFRILRGSNECDIESFV 452
>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 355
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 90/217 (41%), Gaps = 50/217 (23%)
Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPS--------CDASKGHTPKCVREC-QEN 151
+ I G GS GC+P+ + PC PS C TPKC C
Sbjct: 180 KGIVTGGDYGSNEGCQPWLVQPCNASTTAADPSSVLGPHGVCGGDPATTPKCDLSCYNAR 239
Query: 152 YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
++ Y D+ K ++ + S K + +HGP V++D + YKSG +
Sbjct: 240 HEGKYLDDIIKAKKVFTF--DGCSARKNLRKHGPYVVTMRVYEDFLAYKSGVYH------ 291
Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
+ +G LG ++R++GWG + + +WL
Sbjct: 292 -------------------------------HVTGDYLGLLSVRMIGWGLE--GGQAFWL 318
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+ANSW T WGD G FKI R +EC IE+ AGVP L
Sbjct: 319 LANSWGTSWGDKGFFKIRRFVNECWIENFRYAGVPNL 355
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 19/32 (59%), Positives = 24/32 (75%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GC+GG+ AWRY +K GIV+GG YGS +
Sbjct: 161 CGNGCSGGYTAAAWRYILKKGIVTGGDYGSNE 192
>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 450
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 72/147 (48%), Gaps = 31/147 (21%)
Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
DL Y +++ E IM EIY++GPV+ F V +D +Y G + E TA
Sbjct: 320 SDLYLSTPPYRIAAREVDIMTEIYQNGPVQATFNVKNDFFVYNRGVYRNVKQEFTA---- 375
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLIAN 274
SQ ++ A G H+++I+GWG D + KYWL N
Sbjct: 376 --------SQSDSDQA----------------GWHSVKIVGWGIDRSDWYNPIKYWLCTN 411
Query: 275 SWNTDWGDNGLFKILRGKDECGIESSI 301
SW +WG+ G+F+I+RG +EC IES +
Sbjct: 412 SWGRNWGEQGMFRIVRGVNECEIESFV 438
>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
Schistosoma japonicum [Schistosoma japonicum]
Length = 312
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 67/148 (45%), Gaps = 38/148 (25%)
Query: 114 GCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+PY C HH SC+ TP+C + CQ +Y + Y+ D +G SY V+S+
Sbjct: 189 GCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSYYVTSD 248
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
E SIMKEI +GPVE F V+DD + YK+G +
Sbjct: 249 EVSIMKEILLNGPVEATFYVYDDFLNYKTGVY---------------------------- 280
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWG 260
Y +G LGGHAIRI G
Sbjct: 281 ---------KYVTGSLLGGHAIRITWLG 299
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/27 (74%), Positives = 22/27 (81%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGCNGG PGMAW YW GIV+GG+
Sbjct: 157 CGFGCNGGIPGMAWDYWKDEGIVTGGS 183
>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
Length = 432
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 111/294 (37%), Gaps = 92/294 (31%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
P R+ + + +DLP F++ KW + I E+ DQG CGS W +A
Sbjct: 170 PTYRVKAMTRLTNPSDDLPRKFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFA 227
Query: 123 -------------------------CE----------HHVNGT-----------RPSCDA 136
CE H G R SC
Sbjct: 228 IQSQGKEVVQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVLDEKCYPYTQHRDSCKI 287
Query: 137 SKGHTPKCVRE--CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD 194
+ H + ++ CQ Y V + L +YS+S E IM EIY GPV+ ++
Sbjct: 288 QR-HNSRSLKANGCQPAYGVN-RDSLYTVGPAYSLS-READIMAEIYHSGPVQATMRIYR 344
Query: 195 DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAI 254
D Y G + R + GA F H++
Sbjct: 345 DFFSYSGGIY-----------------RQTAANRGAPTGF-----------------HSV 370
Query: 255 RILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+++GWGE E KYW+ ANSW WG++G F+ILRG +ECGIE + A P +
Sbjct: 371 KLVGWGE-EHDGVKYWIAANSWGPWWGEHGYFRILRGSNECGIEEYVLASWPYV 423
>gi|294916338|ref|XP_002778359.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239886683|gb|EER10154.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 105
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 63/101 (62%), Gaps = 15/101 (14%)
Query: 219 WTIRDNTSQLGAEG----AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKE 267
+++ D + + +G +FTV++D + Y+SG LGGHA++I+GWGE KS +
Sbjct: 9 YSVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGE--KSGQ 66
Query: 268 KYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
YWL NSWN DWGD+GLFKI G CGI+ + G PK+
Sbjct: 67 AYWLAVNSWNEDWGDHGLFKIALGN--CGIDDDLLGGTPKV 105
>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
Length = 325
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 72/166 (43%), Gaps = 39/166 (23%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CR + PC HH N T + TPKC C Y Y D G +Y + ++ K
Sbjct: 192 CRSHPFPPCGHHGNETYYGECGGRARTPKCRTSCTPGYKNSYSDDKIRGKDAYELPNSVK 251
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I +EI ++GPV AFTV+ D YK G
Sbjct: 252 AIQREIMKNGPVVAAFTVYADFSYYKKG-------------------------------- 279
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
+ + +G+A G HA++++GWGE+ YW++ NSW+ DW
Sbjct: 280 -----IYKHTAGRARGSHAVKVIGWGEE--GDVPYWIVKNSWHNDW 318
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 32/46 (69%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
NR P + ++ +D+P +FD+RT WPNC ++ IRDQ +CGSCW
Sbjct: 79 NREPIVGDENDEGDDIPESFDARTHWPNCSSLTHIRDQANCGSCWA 124
>gi|268572247|ref|XP_002648914.1| Hypothetical protein CBG17827 [Caenorhabditis briggsae]
Length = 150
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 74/181 (40%), Gaps = 58/181 (32%)
Query: 103 IRDQGSCGSCWGCRPYEIAP---CEHHVNGTRP-----------------SCDAS-KGHT 141
IR+Q +CGSCW E+ C +P CD K T
Sbjct: 2 IRNQTNCGSCWAFGAAEVISDRICIVTKGARQPIISPTDMLDCCGEYCGYGCDGCPKAVT 61
Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
PKC CQ Y+ Y KD NFG+ +Y V N I EI +GPVE +FTV++D +YK
Sbjct: 62 PKCALSCQSKYNTEYAKDKNFGSSAYYVGRNFSVIQTEIMTNGPVEASFTVYEDFYIYKK 121
Query: 202 GRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE 261
G + Y +G+ LGGHAI+I+GWG
Sbjct: 122 GVY-------------------------------------QYTAGEVLGGHAIKIIGWGT 144
Query: 262 D 262
+
Sbjct: 145 E 145
>gi|307938279|ref|NP_001182763.1| dipeptidyl peptidase 1 precursor [Canis lupus familiaris]
Length = 459
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 93/244 (38%), Gaps = 68/244 (27%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNGTR---PSC 134
+ +R+Q SCGSC+ P EI C + G P
Sbjct: 243 VSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQYAQGCEGGFPYL 302
Query: 135 DASKGHTPKCVRE------------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYE 182
A K + E C+ N Y + + + NE + E+
Sbjct: 303 IAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVGGFYGACNEALMKLELVR 362
Query: 183 HGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242
HGP+ AF V+DD Y+ G ++ G +RD F F+
Sbjct: 363 HGPMAVAFEVYDDFFHYQKGIYYHTG------------LRD---------PFNPFE---- 397
Query: 243 YKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
L HA+ ++G+G D S YW++ NSW + WG++G F+I RG DEC IES
Sbjct: 398 ------LTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAV 451
Query: 303 AGVP 306
A P
Sbjct: 452 AATP 455
>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
Length = 425
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 69/147 (46%), Gaps = 36/147 (24%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IMKEI +GPV+ V +D YKSG R NE +
Sbjct: 304 YRVSSNETEIMKEIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKS------------ 351
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
+ + L HA+++ GWG + KEK+W++ANSW WG
Sbjct: 352 -------------------EKYQKLQTHAVKLTGWGTLRGAQGRKEKFWIVANSWGNSWG 392
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F+ILRG +E IE I A +L
Sbjct: 393 ENGYFRILRGVNESDIEKLIIAAWGQL 419
>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
Length = 350
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 51/168 (30%), Positives = 76/168 (45%), Gaps = 41/168 (24%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGH-TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
CRPY PC H +G R C TP C CQ Y Y+KD F +Y + ++E
Sbjct: 193 CRPYAFHPCGLH-HGRRYDCPWDHSFSTPACKPYCQFGYGKRYEKDKFFVKSTYILDNDE 251
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I +E+ ++GPV+ AF ++D YK G
Sbjct: 252 KVIQREMMKNGPVQAAFITYEDFSPYKGG------------------------------- 280
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
+ ++ G+ G HA++++GWG + + KYW +ANSW+ DWG
Sbjct: 281 ------IYVHVKGRERGAHAVKLIGWGVENGT--KYWTVANSWHDDWG 320
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 48 NIPRAHLKSWMGVHPDYNLPANRLPELIGYSE--VDEDLPANFDSRTKWPNCPTIREIRD 105
N +A + + DY A +L ++ E +ED+P +FDSR W NC +I +RD
Sbjct: 56 NTSKAEERMAHLMKTDYIRNARKLYKVKKAEEQTTNEDIPESFDSRIVWKNCSSITYVRD 115
Query: 106 QGSCGSCWG 114
Q CGSCW
Sbjct: 116 QSRCGSCWA 124
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
R+CG GC GG+ +AW + + G+V+GG Y K
Sbjct: 158 RMCGDGCEGGYDHLAWEWVQRFGVVTGGPYQQK 190
>gi|17933077|gb|AAL48195.1| cathepsin C [Homo sapiens]
Length = 463
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ HGP+ AF V+DD + YK G + G +RD
Sbjct: 356 NEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD-------- 395
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D S YW++ NSW T WG+NG F+I RG
Sbjct: 396 -PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRG 444
Query: 292 KDECGIESSITAGVP 306
DEC IES A P
Sbjct: 445 TDECAIESIAVAATP 459
>gi|111054118|gb|ABH04250.1| cathepsin B precursor [Sus scrofa]
Length = 61
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 39/59 (66%), Positives = 46/59 (77%), Gaps = 2/59 (3%)
Query: 243 YKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
+ +G +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D CGIES I
Sbjct: 5 HVTGDLMGGHAIRILGWGVENGTP--YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEI 61
>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
Length = 429
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 71/258 (27%), Positives = 109/258 (42%), Gaps = 25/258 (9%)
Query: 66 LPANRLPELIGYSEVDEDLP--ANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPC 123
P N +G D+D+P FD+RT+WP I I DQG CGS W +A
Sbjct: 170 FPLNAETRRMGPLRYDKDVPYPTQFDARTRWPG--FISPIVDQGWCGSDWAVSLAGVASD 227
Query: 124 EHHV--NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
+ NG + + VR Q + NF A+ + + + K
Sbjct: 228 RFAIQSNGAENMVLSPQTLLSCNVRAQQGCHGGHIDVAWNF-ARGHGLVDEKCFPYKASV 286
Query: 182 EHGPVEGAFTVFDD----LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVF 237
P + D L+ ++ R+ + +S K + D + TV+
Sbjct: 287 TRCPFRPRGNLIQDGCMPLVKRRTSRYKL--GPPAKLSHEKDIMYDIMESGPVQAVMTVY 344
Query: 238 DDLILYKSG----------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
D Y+ G + G H++RI+GWGED ++YW++ANSW WG+NG F+
Sbjct: 345 QDFFHYRDGVYRRSYHGNNELKGFHSVRIIGWGEDR--GDRYWVVANSWGRQWGENGYFR 402
Query: 288 ILRGKDECGIESSITAGV 305
I RG +E IES + G+
Sbjct: 403 IARGSNEADIESFVVTGL 420
>gi|22653678|sp|O97578.1|CATC_CANFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain 1; AltName: Full=Dipeptidyl peptidase I
heavy chain 1; Contains: RecName: Full=Dipeptidyl
peptidase 1 heavy chain 2; AltName: Full=Dipeptidyl
peptidase I heavy chain 2; Contains: RecName:
Full=Dipeptidyl peptidase 1 heavy chain 3; AltName:
Full=Dipeptidyl peptidase I heavy chain 3; Contains:
RecName: Full=Dipeptidyl peptidase 1 heavy chain 4;
AltName: Full=Dipeptidyl peptidase I heavy chain 4;
Contains: RecName: Full=Dipeptidyl peptidase 1 light
chain; AltName: Full=Dipeptidyl peptidase I light chain;
Flags: Precursor
gi|4106126|gb|AAD02704.1| dipeptidyl peptidase I [Canis lupus familiaris]
Length = 435
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 93/244 (38%), Gaps = 68/244 (27%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNGTR---PSC 134
+ +R+Q SCGSC+ P EI C + G P
Sbjct: 219 VSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQYAQGCEGGFPYL 278
Query: 135 DASKGHTPKCVRE------------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYE 182
A K + E C+ N Y + + + NE + E+
Sbjct: 279 IAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVGGFYGACNEALMKLELVR 338
Query: 183 HGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242
HGP+ AF V+DD Y+ G ++ G +RD F F+
Sbjct: 339 HGPMAVAFEVYDDFFHYQKGIYYHTG------------LRD---------PFNPFE---- 373
Query: 243 YKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
L HA+ ++G+G D S YW++ NSW + WG++G F+I RG DEC IES
Sbjct: 374 ------LTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAV 427
Query: 303 AGVP 306
A P
Sbjct: 428 AATP 431
>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
protease B1; Flags: Precursor
Length = 303
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 119/286 (41%), Gaps = 39/286 (13%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRT 92
K N+ +S M + PD +LP + E+ E+ + +P FD R
Sbjct: 32 KAGMPKRFENVTEDEFRS-MLIRPDRLRARSGSLPPISITEV---QELVDPIPPQFDFRD 87
Query: 93 KWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRE---C 148
++P C ++ DQGSCGSCW + G + + S+ H C E C
Sbjct: 88 EYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLENFGC 145
Query: 149 QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD---LILYKSGRFF 205
P L F ++ + + Y H V DD + LYK+ +
Sbjct: 146 DGGDFQPTWSFLTFTG-----ATTAECVKYVDYGHTVASPCPAVCDDGSPIQLYKAHGY- 199
Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA--------LGGHAIRIL 257
G + ++ I + + V+ DL Y+SG LG HA+ I+
Sbjct: 200 --GQVSKSVPAIMGMLVAGGP---LQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIV 254
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
G+G + + YW+I NSW DWG+NG F+I+RG +EC IE I A
Sbjct: 255 GYGTTDDGTD-YWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299
>gi|291384116|ref|XP_002708690.1| PREDICTED: cathepsin C [Oryctolagus cuniculus]
Length = 463
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 72/268 (26%), Positives = 101/268 (37%), Gaps = 82/268 (30%)
Query: 83 DLPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRP 117
DLPA++D W N I +R+Q SCGSC+ P
Sbjct: 230 DLPASWD----WRNVGGINFVSPVRNQESCGSCYSFASVGMLEARIRILTNNSQTPILSP 285
Query: 118 YEIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKK 158
EI C + G C G C + +E+ Y
Sbjct: 286 QEIVSCSQYAQGCNGGFPYLIAGKYAQDFGLVEEDCFPYTGTDSPC--KMKEDCFRYYSS 343
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
+ ++ Y NE + E+ HGP+ AF V+DD + Y G + G
Sbjct: 344 EYHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTG---------- 392
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+RD F F+ L HA+ ++G+G D + YW++ NSW T
Sbjct: 393 --LRD---------PFNPFE----------LTNHAVLLVGYGTDPATGVDYWIVKNSWGT 431
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
WG+NG F+I RG DEC IES A P
Sbjct: 432 SWGENGYFRIRRGTDECAIESIAVAATP 459
>gi|363729389|ref|XP_417207.2| PREDICTED: dipeptidyl peptidase 1 [Gallus gallus]
Length = 460
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 70/283 (24%), Positives = 104/283 (36%), Gaps = 83/283 (29%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGC----------- 115
PA PEL+ + LP ++D R + +R+Q SCGSC+
Sbjct: 214 PAPLTPELL---KKVSGLPESWDWRNV-NGVNYVSPVRNQASCGSCYAFASMGMLEARIR 269
Query: 116 -----------RPYEIAPCEHHVNG-------------------TRPSCDASKGHTPKCV 145
P ++ C + G C C+
Sbjct: 270 ILTNNTQKPVFSPQQVVSCSQYSQGCDGGFPYLIAGKYVQDFGVVEEDCFPYTAKDTPCL 329
Query: 146 --RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGR 203
R C Y Y F + NE + E+ GP+ AF V++D + YK G
Sbjct: 330 FKRSCYHYYTSEYHYVGGFYG-----ACNEALMKLELVLSGPMAVAFEVYNDFMFYKEGI 384
Query: 204 FFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDE 263
+ G + F F+ L HA+ ++G+G+D
Sbjct: 385 Y---------------------HHTGLKDEFNPFE----------LTNHAVLLVGYGKDP 413
Query: 264 KSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+S EK+W++ NSW T WG++G F+I RG DEC IES A P
Sbjct: 414 ESGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATP 456
>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
Length = 573
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 106/248 (42%), Gaps = 35/248 (14%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTP 142
LP++FD+ WP + E RDQG CGS W +A + + R +
Sbjct: 296 LPSHFDAADHWPR--LVGEARDQGWCGSSWALSTTTMASDRFAILSKGREQVQLAPQQLL 353
Query: 143 KCVRECQE----NYDVPYKKDLNFGAKS-----YSVSSNEKSIMK-EIYEHGPVEGAFTV 192
CVR Q + D ++ G + Y + N+ I + E V
Sbjct: 354 ACVRRQQACSGGHLDTAWQYLRRVGVVNDECYPYIAAKNQCKINDGDTLVSANCELPANV 413
Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG------ 246
+ +Y+ G + NET M+ IK + + V+ D Y++G
Sbjct: 414 -NRTAMYRMGPAYSLNNETDIMTEIK-------ERGTVQAILRVYRDFFSYQNGIYRHSA 465
Query: 247 ------KALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
+ H++R++GWGE+ + KYW+ NSW T WG+NG F+ILRG +EC IE
Sbjct: 466 AATPAEERSAYHSVRLIGWGEERVGYDMVKYWIAVNSWGTWWGENGRFRILRGTNECEIE 525
Query: 299 SSITAGVP 306
S + A P
Sbjct: 526 SYVLASNP 533
>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
Length = 362
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 114/298 (38%), Gaps = 101/298 (33%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
N + ++G EV LP F++ KWPN I + DQG+C W
Sbjct: 86 NEIHTVLGPGEV---LPRTFEASEKWPN--LIHDPLDQGNCAGSWAFSTAAVASDRVSIH 140
Query: 115 --------CRPYEIAPCEHH----VNGTR---------------PSCDASKGH------- 140
P + C+ H +G R C GH
Sbjct: 141 SLGHMSPVLSPQNLLSCDTHNQQGCHGGRLDGAWWFLRRRGVVSDHCYPFSGHGRDEAVP 200
Query: 141 TPKCVRE--------------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPV 186
P C+ C +Y + D+ +Y + SNEK IMKE+ E+GPV
Sbjct: 201 APPCMMHSRAMGRGKRQATARCPNSY--VHANDIYQVTPAYRLGSNEKEIMKELMENGPV 258
Query: 187 EGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG 246
+ V +D LY+SG + T +SL + +
Sbjct: 259 QALMEVHEDFFLYQSGIY-----SHTPVSLGR------------------------PERY 289
Query: 247 KALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
+ G H+++I GWGE+ + KYW ANSW WG+ G F+I+RG +EC IES +
Sbjct: 290 RRHGTHSVKITGWGEETLPDGRTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 347
>gi|431838501|gb|ELK00433.1| Dipeptidyl-peptidase 1 [Pteropus alecto]
Length = 460
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 90/248 (36%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 243 VTPVRNQASCGSCYSFASVGMLEARIRILTNNTQSPILSPQEVVSCSQYAQGCEGGFPYL 302
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +EN Y + ++ Y NE +
Sbjct: 303 IAGKYAQDFGLVEETCFPYTGTDSPC--KLKENCFRYYSSEYHYVGGFYG-GCNEALMKL 359
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + Y G + G + F F+
Sbjct: 360 ELVHHGPMAVAFEVYDDFLHYHKGIY---------------------HHTGLKDPFNPFE 398
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW + NSW T WG+NG F+I RG DEC IE
Sbjct: 399 ----------LTNHAVLLVGYGTDPASGLNYWTVKNSWGTSWGENGYFRIRRGTDECAIE 448
Query: 299 SSITAGVP 306
S A P
Sbjct: 449 SIAMAATP 456
>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
Length = 474
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 71/147 (48%), Gaps = 36/147 (24%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IMKEI ++GPV+ V +D YK+G R + NE +
Sbjct: 353 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEES------------ 400
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS---KEKYWLIANSWNTDWG 281
+ + L HA+++ GWG + + KEK+W+ ANSW WG
Sbjct: 401 -------------------EKYRKLQTHAVKLTGWGTLKGARGQKEKFWIAANSWGKSWG 441
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F+ILRG +E IE I A +L
Sbjct: 442 ENGYFRILRGVNESDIEKLIIAAWGQL 468
>gi|358254887|dbj|GAA56530.1| cathepsin C [Clonorchis sinensis]
Length = 362
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 109/281 (38%), Gaps = 82/281 (29%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------- 114
PEL+ E LP FD R + P+ + +R+Q CGSC+
Sbjct: 120 PELL---EASRYLPDEFDWRKQSPS--PVTPVRNQEVCGSCYAFASAAALEARIRLVSNF 174
Query: 115 -----CRPYEIAPCEHHVNG-------------------TRPSCDASKG-HTPKCVRE-- 147
P ++ C + G SCD G KC +
Sbjct: 175 TEEPILSPQDVVDCSPYSEGCDGGFPYLIAGKYAEDFGIPLESCDPYTGVKANKCPTKPG 234
Query: 148 CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVP 207
C+ Y Y+ Y + +E + E+ GP F V+DD + YKSG
Sbjct: 235 CRRYYATNYRY-----LGGYYGACSELLMRMELVHGGPFPIGFEVYDDFVHYKSG----- 284
Query: 208 GNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE 267
+ +T+ F F+ L HA+ ++G+G DE+SK
Sbjct: 285 -------------VYRHTNIRHPLKRFEPFE----------LTNHAVLLVGYGFDEESKL 321
Query: 268 KYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
YW++ NSW T+WG++G F+ILRG DEC +ES P L
Sbjct: 322 PYWIVKNSWGTEWGEDGFFRILRGSDECAVESLAVVFDPVL 362
>gi|114639716|ref|XP_508684.2| PREDICTED: dipeptidyl peptidase 1 isoform 2 [Pan troglodytes]
gi|397526223|ref|XP_003833035.1| PREDICTED: dipeptidyl peptidase 1 [Pan paniscus]
gi|410219182|gb|JAA06810.1| cathepsin C [Pan troglodytes]
gi|410260226|gb|JAA18079.1| cathepsin C [Pan troglodytes]
gi|410304128|gb|JAA30664.1| cathepsin C [Pan troglodytes]
gi|410353831|gb|JAA43519.1| cathepsin C [Pan troglodytes]
Length = 463
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 94/248 (37%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +E+ Y + ++ Y NE +
Sbjct: 306 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + YK G + G +RD F F+
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>gi|426370061|ref|XP_004051995.1| PREDICTED: dipeptidyl peptidase 1 [Gorilla gorilla gorilla]
Length = 463
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 70/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)
Query: 84 LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
LP ++D W N I +R+Q SCGSC+ P
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286
Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
E+ C + G +C G C + +E+ Y +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
++ Y NE + E+ HGP+ AF V+DD + YK G + G
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+RD F F+ L HA+ ++G+G D S YW++ NSW T
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WG++G F+I RG DEC IES A P
Sbjct: 433 WGEDGYFRIRRGTDECAIESIAVAATP 459
>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
Length = 428
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 113/296 (38%), Gaps = 97/296 (32%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
N + ++G EV LP F++ KWPN I + DQG+C W
Sbjct: 152 NEIHTVLGPGEV---LPRTFEASEKWPN--LIHDPLDQGNCAGSWAFSTAAVASDRVSIH 206
Query: 115 --------CRPYEIAPCE-HHVNGTR------------------PSCDASKGH------- 140
P + C+ H+ G R C GH
Sbjct: 207 SLGHMSPVLSPQNLLSCDTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHGRDEAVP 266
Query: 141 TPKCVREC--------QENYDVP----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
P C+ Q P + D+ +Y + SNEK IMKE+ E+GPV+
Sbjct: 267 APPCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKEIMKELMENGPVQA 326
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
V +D LY+SG + T +SL + + +
Sbjct: 327 LMEVHEDFFLYQSGIY-----SHTPVSLGR------------------------PERYRR 357
Query: 249 LGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
G H+++I GWGE+ + KYW ANSW WG+ G F+I+RG +EC IES +
Sbjct: 358 HGTHSVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 413
>gi|67867504|gb|AAH98085.1| Unknown (protein for MGC:107782) [Xenopus (Silurana) tropicalis]
Length = 458
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 109/286 (38%), Gaps = 84/286 (29%)
Query: 66 LPANRLPELIGYSEVDEDLPANFDSRTKWPNCP---TIREIRDQGSCGSCWG-------- 114
+P P + E + LP +D W N + +R+Q SCGSC+
Sbjct: 208 IPMRPRPAPLPTDEKYQGLPTEWD----WRNIAGYNFVTPVRNQASCGSCYAFSSMGMLE 263
Query: 115 --------------CRPYEIAPCEHHVNGTR---PSCDASK-----------------GH 140
P ++ C ++ G P A K
Sbjct: 264 SRIQIRSQLSQKPILSPQQVVSCSNYSQGCEGGFPYLIAGKYVSDYGIVEESDLPYTGSD 323
Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
+P +++ Q+ Y Y + ++ Y NE + E+ GP+ AF V+DD + Y+
Sbjct: 324 SPCTLKDSQQKY---YTAEYHYVGGFYG-GCNEAYMKLELVLGGPLSVAFEVYDDFMHYR 379
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
SG + G + F F L HA+ ++G+G
Sbjct: 380 SGVY---------------------HHTGLQDKFNPFQ----------LTNHAVLLVGYG 408
Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
D+++ EKYW++ NSW WG+ G F+I RG DEC IES + P
Sbjct: 409 TDQQTGEKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAVSAEP 454
>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
Length = 463
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 112/271 (41%), Gaps = 47/271 (17%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------------- 113
P R+ + S LP FD+ W + E RDQG CGS W
Sbjct: 170 PRFRVKAMKRLSNKGGHLPTRFDASEHWTG--LVAEARDQGWCGSSWAFSTATMASDRFA 227
Query: 114 ----GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV 169
G ++AP + + R S GH + + V + A++
Sbjct: 228 ILSKGREMVQLAP-QQMLACVRRQQGCSGGHLDTAWQYLRRTGVVNEECYPYIAAQNVCK 286
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
SN+ +++ E PV+ + ++YK G F NET M+ IK +
Sbjct: 287 ISNDDTLITANCEL-PVK-----VNRTLMYKMGPAFSLNNETDIMAEIK-------DRGT 333
Query: 230 AEGAFTVFDDLILYKSG------------KALGGHAIRILGWGEDEKSKE--KYWLIANS 275
+ V+ D Y+SG + H++R++GWGE+ + KYW+ NS
Sbjct: 334 VQAIMRVYRDFFSYRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVKYWIAINS 393
Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
W WG+NG F+ILRG +EC IES + A P
Sbjct: 394 WGQWWGENGRFRILRGSNECDIESYVLASNP 424
>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
Length = 202
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 77/175 (44%), Gaps = 42/175 (24%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQEN-YDVPYKKDLNFGA 164
G CG +GCRPY PC H + C TP+C + CQ + Y KD + A
Sbjct: 65 GPCGYKYGCRPYAFHPCGVHKDQVYYGECPRKSYDTPECRKICQRGCIQLQYGKDRYYAA 124
Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
+Y V ++ K+IM+EI GPV GA+ + D LYK G + E TA
Sbjct: 125 SAYFVKNDTKAIMREIMRGGPVHGAYDTYTDFRLYKGGVY-----EHTA----------- 168
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEK---YWLIANSW 276
G+ GGH+I+I+GWG + YWL+ANSW
Sbjct: 169 ---------------------GERTGGHSIKIMGWGNYKHPNGTVIPYWLVANSW 202
>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
domestica]
Length = 468
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/141 (35%), Positives = 69/141 (48%), Gaps = 32/141 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YKSG + N ++D +
Sbjct: 347 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIYRHINN-----------LKDESE 395
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 396 KY------------------RNLRTHAVKLTGWGVLRGAQGKKEKFWIAANSWGKSWGEN 437
Query: 284 GLFKILRGKDECGIESSITAG 304
G F+ILRG +E IE I A
Sbjct: 438 GYFRILRGVNESDIEKLIIAA 458
>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
protease B3; Flags: Precursor
gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
Length = 299
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 112/252 (44%), Gaps = 63/252 (25%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKC 144
P +FD R ++P+C I E+ DQG CGSCW S AS G
Sbjct: 75 PDSFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVASVGD---- 111
Query: 145 VRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDDLIL 198
R C D KK + + + Y VS + + + ++ G T D+ +
Sbjct: 112 -RRCFAGLD---KKAVKY-SPQYVVSCDRGDMACDGGWLPSVWRFLTKTG--TTTDECVP 164
Query: 199 YKSGRFFVPGNETTAMS-------LIKWTIR-----DNTSQLGA-------EGAFTVFDD 239
Y+SG G T + L K T D + + A + AFTV+ D
Sbjct: 165 YQSGSTGARGTCPTKCADGSDLPHLYKATKAVDYGLDAPAIMKALATGGPLQTAFTVYSD 224
Query: 240 LILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ Y+SG + GGHA+ ++G+G D+ + YW+I NSW DWG++G F+I+R
Sbjct: 225 FMYYESGVYQHTYGRVEGGHAVDMVGYGTDDDGVD-YWIIKNSWGPDWGEDGYFRIIRMT 283
Query: 293 DECGIESSITAG 304
+ECGIE + G
Sbjct: 284 NECGIEEQVIGG 295
>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 348
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 82/196 (41%), Gaps = 45/196 (22%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVSS 171
GC PY C H+ ++ C TP C+ C E Y P KD +F A++
Sbjct: 191 GCWPYNFPRCAHYQKKSKYGPCPKKSYETPSCLDRCPNEKYGTPLDKDRHFTARAVPYWF 250
Query: 172 NE-KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
N +SI KEI +HGP +F ++D YKSG +
Sbjct: 251 NGIRSIKKEIMKHGPTSASFFTYEDFFSYKSGVY-------------------------- 284
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
Y SG + H + ++GWG ++ YWL N WN +W D G FKI +
Sbjct: 285 -----------KYTSGAYVEFHTVELIGWGTEKGV--DYWLAKNDWNEEWADLGTFKIAQ 331
Query: 291 GKDECGIESSITAGVP 306
G +CGI + + G P
Sbjct: 332 G--DCGI-NDLVLGAP 344
>gi|294890224|ref|XP_002773108.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878009|gb|EER04924.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 109
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 64/102 (62%), Gaps = 15/102 (14%)
Query: 218 KWTIRDNTSQLGAEG----AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSK 266
++++ D + + +G +F V++D + Y+SG K LGGHA++I+GWGE+ +
Sbjct: 12 EYSVNDAKNAIRTDGPVSASFIVYEDFLAYRSGVYKHTSGKELGGHAVKIIGWGEE--TG 69
Query: 267 EKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+ YWL+ NSWN DWGDNGLFKI G C I+ + G PK+
Sbjct: 70 QAYWLVVNSWNEDWGDNGLFKIALGN--CEIDDDLLGGTPKV 109
>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
Length = 349
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 68/258 (26%), Positives = 100/258 (38%), Gaps = 90/258 (34%)
Query: 97 CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR------------PSCDAS------- 137
C + IR+Q CGSCW E+ + GTR SCD +
Sbjct: 136 CQQLHRIRNQEQCGSCWAFSISEMV-ADRFCIGTRGKINTIMSPQWMVSCDTADNGCNGG 194
Query: 138 -------------------------KGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVS 170
G P C C E+ +V Y+ ++++ V+
Sbjct: 195 EFPTAFQFVETTGLVSDGCVPYQSGNGFVPPCPNSCANGEDINVRYRTK---NSRNFDVN 251
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+ KS+ I +GPV F V+ D Y+SG V
Sbjct: 252 -DMKSVQASILANGPVISGFKVYRDFYNYRSGYKHV------------------------ 286
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
+G +GGHAI+++GWG + S YW++ANSW+ +WG NG F ILR
Sbjct: 287 --------------AGGLVGGHAIKVVGWGVTQ-SNVPYWIVANSWSDEWGMNGYFWILR 331
Query: 291 GKDECGIESSITAGVPKL 308
G +EC IE ++ +P L
Sbjct: 332 GTNECSIEENMWETIPAL 349
>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
Length = 474
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 31/145 (21%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + R T
Sbjct: 352 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIY-----------------RHITK 394
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ E + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 395 KANEESG-----------KYRKLQTHAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGEN 443
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 444 GYFRILRGVNESDIEKLIIAAWGQL 468
>gi|197101281|ref|NP_001125612.1| dipeptidyl peptidase 1 precursor [Pongo abelii]
gi|75061881|sp|Q5RB02.1|CATC_PONAB RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|55728636|emb|CAH91058.1| hypothetical protein [Pongo abelii]
Length = 463
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 94/248 (37%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +E+ Y + ++ Y NE +
Sbjct: 306 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + YK G + G +RD F F+
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>gi|311263676|ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa]
Length = 463
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 101/265 (38%), Gaps = 78/265 (29%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
LPA++D R + +R+Q SCGSC+ P E+
Sbjct: 231 LPASWDWRNV-RGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVV 289
Query: 122 PCEHHVNG-------------------TRPSCDASKGHTPKC-VRECQENYDVPYKKDLN 161
C + G +C G C V+E Y Y + +
Sbjct: 290 SCSQYAQGCAGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCTVKEGCFRY---YSSEYH 346
Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
+ Y NE + E+ HGP+ AF V+DD + Y+ G + G +
Sbjct: 347 YVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTG------------L 393
Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
RD F F+ L HA+ ++G+G D S YW++ NSW T WG
Sbjct: 394 RD---------PFNPFE----------LTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWG 434
Query: 282 DNGLFKILRGKDECGIESSITAGVP 306
++G F+I RG DEC IES A P
Sbjct: 435 EDGYFRIRRGTDECAIESIAVAATP 459
>gi|307548878|ref|NP_001182580.1| dipeptidyl peptidase 1 precursor [Macaca mulatta]
Length = 463
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 96/248 (38%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G+ C + +E+ Y + ++ Y NE +
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGNDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + Y++G + G +RD F F+
Sbjct: 363 ELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
Length = 346
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/297 (25%), Positives = 114/297 (38%), Gaps = 99/297 (33%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE---H 125
N + ++G EV LP F++ KWPN I E DQG+C W +A H
Sbjct: 70 NEIHTVLGPGEV---LPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIH 124
Query: 126 HVNGTRP--------SCD-------------------------------------ASKGH 140
+ P SCD G
Sbjct: 125 SLGHMTPVLSPQNLLSCDKRNQQGCQGGHLDSAWWFLRRRGVVSDHCYPFSGQGRTETGP 184
Query: 141 TPKCVRECQE-------------NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
P+C+ + N+ V + D+ +Y + S+EK IMKE+ E+GPV+
Sbjct: 185 APRCMMHSRAMGRGKRQATARCPNHQV-HANDIYQVTPAYRLGSSEKEIMKELMENGPVQ 243
Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
V +D LY++G + T +SL + + +
Sbjct: 244 ALMEVHEDFFLYQNGIY-----SHTPVSLGR------------------------PERYR 274
Query: 248 ALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
G H+++I GWGE+ + KYW ANSW WG+ G F+I+RG +EC IES +
Sbjct: 275 RHGTHSVKITGWGEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 331
>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 300
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/255 (28%), Positives = 115/255 (45%), Gaps = 63/255 (24%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT 141
+D+P +FD R ++P+C I E+ DQG CGSCW S A+ G
Sbjct: 73 DDVPESFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVATFGD- 112
Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDD 195
R C D KK + + + Y VS + + + +++ G T D+
Sbjct: 113 ----RRCVAGLD---KKPVKYSPQ-YVVSCDHGDMACNGGWLPNVWKFLTKTG--TTTDE 162
Query: 196 LILYKSGRFFVPG-------------NETTAMSL------IKWTIRDNTSQLGAEGAFTV 236
+ YKSG + G + TA S I ++ ++ + AF V
Sbjct: 163 CVPYKSGSTTLRGTCPTKCADGSSKVHLATATSYKDYGLDIPAMMKALSTSGPLQVAFLV 222
Query: 237 FDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
+ D + Y+SG GGHA+ ++G+G D+ + YW+I NSW DWG++G F+++
Sbjct: 223 YSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVD-YWIIRNSWGPDWGEDGYFRMI 281
Query: 290 RGKDECGIESSITAG 304
RG ++C IE AG
Sbjct: 282 RGINDCSIEEQAYAG 296
>gi|355752523|gb|EHH56643.1| hypothetical protein EGM_06098 [Macaca fascicularis]
Length = 463
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 96/248 (38%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G+ C + +E+ Y + ++ Y NE +
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGNDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + Y++G + G +RD F F+
Sbjct: 363 ELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>gi|383415299|gb|AFH30863.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
gi|384944880|gb|AFI36045.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
Length = 463
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 96/248 (38%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G+ C + +E+ Y + ++ Y NE +
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGNDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + Y++G + G +RD F F+
Sbjct: 363 ELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>gi|380808942|gb|AFE76346.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
Length = 463
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 96/248 (38%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G+ C + +E+ Y + ++ Y NE +
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGNDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + Y++G + G +RD F F+
Sbjct: 363 ELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 298
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 74/251 (29%), Positives = 110/251 (43%), Gaps = 62/251 (24%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKC 144
P +FD R ++P+C I E+ DQG CGSCW S AS G
Sbjct: 75 PDSFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVASVGD---- 111
Query: 145 VRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDDLIL 198
R C D KK + + + Y VS + + + ++ G T D+ +
Sbjct: 112 -RRCFAGLD---KKAVKY-SPQYVVSCDRGDMACDGGWLPSVWRFLTKTG--TTTDECVP 164
Query: 199 YKSGRFFVPGNETT------------AMSLIKWTIR-DNTSQLGAEG-----AFTVFDDL 240
Y+SG G T A + + + D + A G AFTV+ D
Sbjct: 165 YQSGSTGARGTCPTKCADGSDLPIYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSDF 224
Query: 241 ILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ Y+ G + GGHA+ ++G+G DE + YW+I NSW DWG++G F+I+R +
Sbjct: 225 MYYEGGVYQHTYGRVEGGHAVEMVGYGTDEYDVD-YWIIRNSWGPDWGEDGYFRIIRMTN 283
Query: 294 ECGIESSITAG 304
ECGIE + G
Sbjct: 284 ECGIEEQVIGG 294
>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
familiaris]
Length = 476
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 67/143 (46%), Gaps = 36/143 (25%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IMKEI ++GPV+ V +D YK+G R NE +
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHITRTNEES------------ 402
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
+ + L HA+++ GWG + KEK+W+ ANSW WG
Sbjct: 403 -------------------RKYQKLQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGISWG 443
Query: 282 DNGLFKILRGKDECGIESSITAG 304
+NG F+ILRG +E IE I A
Sbjct: 444 ENGYFRILRGVNESDIEKLIIAA 466
>gi|147902366|ref|NP_001080511.1| cathepsin C precursor [Xenopus laevis]
gi|33417162|gb|AAH56109.1| Ctsc protein [Xenopus laevis]
Length = 458
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 98/250 (39%), Gaps = 75/250 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHH---VNGTRPSC 134
+ +R+QGSCGSC+ P ++ C ++ +G P
Sbjct: 241 VSPVRNQGSCGSCYAFASMGMLESRIQIQSQLSQKPILSPQQVVSCSNYSQGCDGGFPYL 300
Query: 135 DASK----------------GHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
A K G C +++Y Y + ++ Y NE +
Sbjct: 301 IAGKYLNDFGIVEESDFPYIGSDSPCT--LKDSYQRYYTAEYHYVGGFYG-GCNEAYMKL 357
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ GP+ AF V+DD I Y+SG + G + F F
Sbjct: 358 ELVLGGPLSVAFEVYDDFIHYRSGVY---------------------HHTGLQDKFNPFQ 396
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D+++ EKYW++ NSW WG+ G F+I RG DEC IE
Sbjct: 397 ----------LTNHAVLLVGYGTDQQTGEKYWIVKNSWGESWGEKGFFRIRRGSDECAIE 446
Query: 299 SSITAGVPKL 308
S + P +
Sbjct: 447 SIAVSANPII 456
>gi|355566931|gb|EHH23310.1| hypothetical protein EGK_06753 [Macaca mulatta]
Length = 463
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 96/248 (38%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G+ C + +E+ Y + ++ Y NE +
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGNDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + Y++G + G +RD F F+
Sbjct: 363 ELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIHRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Equus caballus]
Length = 480
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 68/145 (46%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V DD YK G + R TS
Sbjct: 359 YRVSSNETEIMKEIMQNGPVQAIMQVHDDFFHYKKGIY-----------------RHVTS 401
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HAI++ GWG + KEK+W+ ANSW WG+N
Sbjct: 402 THEEPEKY------------RKLRTHAIKLAGWGTLRGAQGRKEKFWIAANSWGKSWGEN 449
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 450 GYFRILRGVNESDIEKLIIAAWGQL 474
>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 69/149 (46%), Gaps = 38/149 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y C HH G C TP C + C + +V Y D SY+V ++E
Sbjct: 66 GCRSYPFPKCNHHGKGPDAPCPEKIFPTPACNKTC-DTPEVNYILDKTKAKSSYNVPNSE 124
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+IMKEI ++GPVE AF V++D + Y+SG +F
Sbjct: 125 KAIMKEIMQNGPVEAAFEVYEDFLHYESGVYF---------------------------- 156
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGED 262
+ G+ +GGHAIR+LGWGE+
Sbjct: 157 ---------HSFGRMIGGHAIRMLGWGEE 176
Score = 45.4 bits (106), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 24/27 (88%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CGFGC+GGFP AW +W+++G+V+GG+
Sbjct: 34 CGFGCHGGFPPRAWDFWMENGLVTGGS 60
>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
Length = 339
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/138 (36%), Positives = 69/138 (50%), Gaps = 38/138 (27%)
Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
SY VSS E+ IM EI +GPV+ F V D FF+ G + +
Sbjct: 211 SYRVSSREQDIMSEILTNGPVQATFRVHGD--------FFIAG------------VYKHL 250
Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS--KEKYWLIANSWNTDWGDN 283
+G E G H++R+LGWGED + KYW+ ANSW T+WG+N
Sbjct: 251 PTVGEE----------------IEGYHSVRLLGWGEDYSTGIPVKYWIAANSWGTNWGEN 294
Query: 284 GLFKILRGKDECGIESSI 301
G F+ILRG++ C IES +
Sbjct: 295 GTFRILRGENHCEIESFV 312
>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
Length = 475
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 68/147 (46%), Gaps = 36/147 (24%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IM+EI ++GPV+ V +D YK+G R V NE
Sbjct: 354 YRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEP------------ 401
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
+ K L HA+++ GWG KEK+W+ ANSW WG
Sbjct: 402 -------------------EKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGKSWG 442
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F+ILRG +E IE I A +L
Sbjct: 443 ENGYFRILRGVNESDIEKLIIAAWGQL 469
>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
Length = 471
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 70/151 (46%), Gaps = 36/151 (23%)
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
Y D+ Y +S+NE IMKEI ++GPV+ V +D +YKSG F
Sbjct: 329 YHNDIYQSTPPYRLSTNENEIMKEIMDNGPVQAIMEVHEDFFVYKSGIF----------- 377
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSG--KALGGHAIRILGWGEDEK---SKEKYW 270
D+ +K + H++RI GWGE+ KYW
Sbjct: 378 --------------------RHTDVNYHKPSQYRKHATHSVRITGWGEERDYSGRTRKYW 417
Query: 271 LIANSWNTDWGDNGLFKILRGKDECGIESSI 301
+ ANSW +WG++G F+I RG +EC IE+ +
Sbjct: 418 IGANSWGKNWGEDGYFRIARGVNECDIETFV 448
>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
Length = 475
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 68/147 (46%), Gaps = 36/147 (24%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IM+EI ++GPV+ V +D YK+G R V NE
Sbjct: 354 YRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEP------------ 401
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
+ K L HA+++ GWG KEK+W+ ANSW WG
Sbjct: 402 -------------------EKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGKSWG 442
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F+ILRG +E IE I A +L
Sbjct: 443 ENGYFRILRGVNESDIEKLIIAAWGQL 469
>gi|348508181|ref|XP_003441633.1| PREDICTED: dipeptidyl peptidase 1-like isoform 1 [Oreochromis
niloticus]
Length = 455
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 111/293 (37%), Gaps = 88/293 (30%)
Query: 67 PANRLPELIGYSEVDED-------LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----- 114
PA+R+P + + V D LP +D R + +R+Q SCGSC+
Sbjct: 200 PASRIPVRVRPAPVKADVAKMASALPEQWDWRNV-DGVNFVSPVRNQESCGSCYSFATMG 258
Query: 115 -----------------CRPYEIAPCEHHVNG------------------TRPSCDASKG 139
P ++ C + G SC G
Sbjct: 259 MLEARIRILTNNSDAPTLSPQQVVSCSEYSQGCDGGFPYLIGKYTQDFGIVDESCFPYVG 318
Query: 140 HTPKC--VRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
C ++CQ Y Y N+ Y S E ++M E+ ++GP+ AF V+ D +
Sbjct: 319 QNTPCGVPQKCQRIYAAEY----NYVGGFYGGCS-EAAMMLELVKNGPMAVAFEVYPDFM 373
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
YK G + G F F+ L HA+ ++
Sbjct: 374 NYKEGIY---------------------HHTGLADPFNPFE----------LTNHAVLLV 402
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG--VPKL 308
G+G K+ + YW++ NSW T WG+ G F+I RG DEC IES A +PKL
Sbjct: 403 GYGRCHKTGQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPKL 455
>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
(Silurana) tropicalis]
Length = 494
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 72/281 (25%), Positives = 110/281 (39%), Gaps = 93/281 (33%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE-------HHVNGTRP- 132
++ LP++F++ KWP + E DQG+C W +A H P
Sbjct: 233 NDILPSHFNAAEKWPG--LVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQ 290
Query: 133 ---SCDA-----------------------------------SKGHTPKCV--------- 145
SCD + GH+ C+
Sbjct: 291 NLLSCDTRNQHGCRGGRVDGAWWYLRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRG 350
Query: 146 -RECQENYDVPY--KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
R+ N Y ++ +Y ++S+EK IMKE+YE+GPV+ V +D +YKSG
Sbjct: 351 KRQATNNCPNQYYSSNEIYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSG 410
Query: 203 RF-FVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE 261
+ P E + + G H+++I G G
Sbjct: 411 IYRHTPVTEREP------------------------------EHHRRHGTHSVKITG-GR 439
Query: 262 DEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
D ++ KYWL ANSW DWG++G F+I RG++EC IE+ I
Sbjct: 440 DGQT-HKYWLAANSWGRDWGEDGYFRIARGENECEIETFIV 479
>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
Length = 278
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 67/148 (45%), Gaps = 38/148 (25%)
Query: 114 GCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC PY C H V P C TPKC ++C Y+ Y++D G SY+V
Sbjct: 160 GCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQ 219
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
E IM EI ++GPV+G F +F+D ++YKSG +
Sbjct: 220 ETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYH--------------------------- 252
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWG 260
Y +G+ +GGHAIR++GWG
Sbjct: 253 ----------YTTGRLVGGHAIRVIGWG 270
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
K A +NI + +K +GV + N + + YS + DLP +FD+R KWPNCP
Sbjct: 20 KAAPSTRFNNIDQ--VKQNLGVLEETPEDRNTQRQTVRYSVSENDLPESFDARQKWPNCP 77
Query: 99 TIREIRDQGSCGSCWG 114
+I EIRDQ SC SCW
Sbjct: 78 SISEIRDQSSCSSCWA 93
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 21/27 (77%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG+GCNGG P M+W YW + G+V+GG
Sbjct: 128 CGYGCNGGIPAMSWDYWTREGVVTGGT 154
>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
Length = 426
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 72/258 (27%), Positives = 107/258 (41%), Gaps = 29/258 (11%)
Query: 66 LPANRLPELIGYSEVDEDLP--ANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPC 123
+P + +G D+D+P +FD+R +WPN I + DQG CGS W +A
Sbjct: 166 MPLSHETRRMGPIRYDKDIPYPRDFDARRRWPN--FISPVLDQGWCGSDWAVTIATVASD 223
Query: 124 EHHV--NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
+ NG + + +R Q NF A+ + + E K
Sbjct: 224 RFAIQSNGAERMVLSPQVLLSCNIRRQQGCRGGHIDVAWNF-ARGHGLVDEECFPYKAAT 282
Query: 182 EHGPVEGAFTVFDD----LILYKSGRFFV--PGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
P + +D + ++ R+ V PG T ++ D T
Sbjct: 283 TSCPFRPKANLIEDGCRPPVRQRTSRYKVGPPGKLATENDIMY----DIMESGPVHAVMT 338
Query: 236 VFDDLILYKSG----------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
V D Y G G H++RI+GWGED +KYW++ANSW DWG+NG
Sbjct: 339 VHQDFFHYHDGIYRRSPYGDNTLQGLHSVRIVGWGEDR--GDKYWVVANSWGCDWGENGY 396
Query: 286 FKILRGKDECGIESSITA 303
F+I RG +E GIES +
Sbjct: 397 FRIARGSNESGIESFVVT 414
>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 306
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 116/293 (39%), Gaps = 91/293 (31%)
Query: 59 GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG---- 114
+ P ++L A+ +P E +PA+FD R ++P C I + DQG CGSCW
Sbjct: 54 AMFPRHDLAAS-VPAECPRGEPSGSIPASFDFREEYPQC--ITPVYDQGHCGSCWAFSAT 110
Query: 115 -------------------CRPYEIAPCEH-------------------HVNGTR---PS 133
+ Y I+ C++ H T P
Sbjct: 111 SAFGDRRCMQGLDSAGVPYSQQYTIS-CDYLDLGCAGGLSFSVWTFLTEHGTTTLECVPY 169
Query: 134 CDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVF 193
DA+K + C C + ++ K G YS N +IM+ + GPV+ + V+
Sbjct: 170 TDANKDISSPCPDACADGSEIRLVK--ADGCLDYS--GNVTAIMQALANDGPVQASMAVY 225
Query: 194 DDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHA 253
D + Y+SG + + G + HA
Sbjct: 226 RDFLYYRSGVY-------------------------------------RHVYGSQISSHA 248
Query: 254 IRILGWGE-DEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
+ I+G+G D++ YW++ NS + WG+ G F I+RG +EC IES++ +G+
Sbjct: 249 VEIIGYGAADDEDSTPYWIVKNSLGSGWGEEGYFNIVRGSNECDIESAVYSGL 301
>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
gallopavo]
Length = 467
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 65/141 (46%), Gaps = 38/141 (26%)
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
A Y +SS E IM+EI GPV+ V++D LYK G +
Sbjct: 357 ASHYRISSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHS---------------- 400
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDW 280
YK+G H++++LGWG K+K+W+ ANSW W
Sbjct: 401 -------------------YKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYW 441
Query: 281 GDNGLFKILRGKDECGIESSI 301
G+NG F+ILRG++EC IE I
Sbjct: 442 GENGYFRILRGQNECDIEKLI 462
>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Bos taurus]
gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
Length = 534
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 73/149 (48%), Gaps = 32/149 (21%)
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
+ D+ +Y + SNEK IMKE+ E+GPV+ V +D LY+SG + T +S
Sbjct: 400 HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIY-----SHTPVS 454
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLI 272
L + + + G H+++I GWGE+ + KYW
Sbjct: 455 LGR------------------------PERYRRHGTHSVKITGWGEETLPDGRTIKYWTA 490
Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
ANSW WG+ G F+I+RG +EC IES +
Sbjct: 491 ANSWGPAWGERGHFRIVRGANECDIESFV 519
>gi|348508183|ref|XP_003441634.1| PREDICTED: dipeptidyl peptidase 1-like isoform 2 [Oreochromis
niloticus]
Length = 461
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 111/293 (37%), Gaps = 88/293 (30%)
Query: 67 PANRLPELIGYSEVDED-------LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----- 114
PA+R+P + + V D LP +D R + +R+Q SCGSC+
Sbjct: 206 PASRIPVRVRPAPVKADVAKMASALPEQWDWRNV-DGVNFVSPVRNQESCGSCYSFATMG 264
Query: 115 -----------------CRPYEIAPCEHHVNG------------------TRPSCDASKG 139
P ++ C + G SC G
Sbjct: 265 MLEARIRILTNNSDAPTLSPQQVVSCSEYSQGCDGGFPYLIGKYTQDFGIVDESCFPYVG 324
Query: 140 HTPKC--VRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
C ++CQ Y Y N+ Y S E ++M E+ ++GP+ AF V+ D +
Sbjct: 325 QNTPCGVPQKCQRIYAAEY----NYVGGFYGGCS-EAAMMLELVKNGPMAVAFEVYPDFM 379
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
YK G + G F F+ L HA+ ++
Sbjct: 380 NYKEGIY---------------------HHTGLADPFNPFE----------LTNHAVLLV 408
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG--VPKL 308
G+G K+ + YW++ NSW T WG+ G F+I RG DEC IES A +PKL
Sbjct: 409 GYGRCHKTGQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPKL 461
>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
Length = 331
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 83/196 (42%), Gaps = 51/196 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+P +I P C+ +K + CV C N + Y D Y
Sbjct: 185 GCQPSKIPPV----------CNLPTKINKRTCVDYCYGNDTIKYNHD--HVKVRYYYHVK 232
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
K I KE+ +GPV A ++DD+ L+KSG +
Sbjct: 233 PKDIQKEVQTYGPVTAALNLYDDIFLHKSGVY---------------------------- 264
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
L K+ K + ++++GWG + + YWL+ NSW +WG NGL KI RGK
Sbjct: 265 --------TLTKNAKYVRLQYVKLIGWGVE--NGVDYWLLVNSWGNEWGQNGLLKIKRGK 314
Query: 293 DECGIESSITAGVPKL 308
C +ES + A VPK+
Sbjct: 315 YGCAVESFVYAAVPKI 330
>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
Length = 475
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + + TI D+
Sbjct: 354 YRVSSNETQIMKEIMKNGPVQAIMQVHEDFFYYKTGIY----------RHVTSTIEDS-- 401
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 402 -----------------EKYQKLRTHAVKLTGWGTLRGAKGRKEKFWIAANSWGKSWGEN 444
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 445 GYFRILRGVNESDIEKLIIAAWGQL 469
>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
Length = 433
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 107/292 (36%), Gaps = 89/292 (30%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
P R+ + + LPA F++ KW + I E+ DQG CGS W +A
Sbjct: 172 PTYRVKAMSRLTNPTAGLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFA 229
Query: 123 -------------------------CE-----------HHVNGTRPSCDASKGHTPKC-V 145
CE H SC H C +
Sbjct: 230 IQSKGKEAVQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCKI 289
Query: 146 RE---------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
R C+ + +V +D + + E IM EIY GPV+ V+ D
Sbjct: 290 RHNSRSLKANGCRPSANV--DRDSFYTVGPAYTLNKESDIMAEIYHSGPVQATMRVYRDF 347
Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRI 256
Y SG + R + GA F H++++
Sbjct: 348 FSYSSGVY-----------------RQTAANRGAPTGF-----------------HSVKL 373
Query: 257 LGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+GWGE E + +KYW+ ANSW WG+ G F+ILRG +ECGIE + A P +
Sbjct: 374 VGWGE-EHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWPYV 424
>gi|402894881|ref|XP_003910570.1| PREDICTED: dipeptidyl peptidase 1 [Papio anubis]
Length = 463
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 95/248 (38%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +E+ Y + ++ Y NE +
Sbjct: 306 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + Y++G + G +RD F F+
Sbjct: 363 ELVYHGPLSVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
Length = 199
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 72/159 (45%), Gaps = 41/159 (25%)
Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
CRPYEI PC +H + CD TP+C R CQ Y Y D ++G +Y + +
Sbjct: 72 CRPYEIHPCGYHKDEPYYGECD-DLADTPRCKRRCQLGYPKSYPSDKHYGRTAYQLPMSV 130
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+SI +EI +GPV FTV++D YK G
Sbjct: 131 ESIQREIMRNGPVVAGFTVYEDFAHYKGG------------------------------- 159
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEK--YW 270
+ + SGK GGHA++++GWG ++K EK YW
Sbjct: 160 ------IYKHTSGKKTGGHAVKVIGWGSEQKGSEKIPYW 192
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 41/101 (40%), Gaps = 26/101 (25%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA-------------------EKNSLSNI 49
CG+GC GG+ AW Y+ + G+V+GG Y +K + E + L++
Sbjct: 39 CGYGCQGGWSIRAWYYFAEQGVVTGGNYNTKGSCRPYEIHPCGYHKDEPYYGECDDLADT 98
Query: 50 PRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDS 90
PR + +G Y P Y LP + +S
Sbjct: 99 PRCKRRCQLGYPKSY-------PSDKHYGRTAYQLPMSVES 132
>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
Length = 433
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 107/292 (36%), Gaps = 89/292 (30%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
P R+ + + LPA F++ KW + I E+ DQG CGS W +A
Sbjct: 172 PTYRVKAMSRLTNPTAGLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFA 229
Query: 123 -------------------------CE-----------HHVNGTRPSCDASKGHTPKC-V 145
CE H SC H C +
Sbjct: 230 IQSKGKEAVQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCKI 289
Query: 146 RE---------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
R C+ + +V +D + + E IM EIY GPV+ V+ D
Sbjct: 290 RHNSRSLKANGCRPSANV--DRDSFYTVGPAYTLNKESDIMAEIYHSGPVQATMRVYRDF 347
Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRI 256
Y SG + R + GA F H++++
Sbjct: 348 FSYSSGVY-----------------RQTAANRGAPTGF-----------------HSVKL 373
Query: 257 LGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+GWGE E + +KYW+ ANSW WG+ G F+ILRG +ECGIE + A P +
Sbjct: 374 VGWGE-EHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWPYV 424
>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
echinatior]
Length = 501
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 76/152 (50%), Gaps = 38/152 (25%)
Query: 155 PYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
P + +L +Y + NE IM+EI GPV+ V+ D +YK+G +
Sbjct: 379 PLRTELYKVGPAYRLG-NETDIMQEILTSGPVQATMRVYQDFFVYKNGIY---------- 427
Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK---EKYWL 271
R + S AE L+ SG H++RI+GWGE+ + KYWL
Sbjct: 428 -------RHSQS---AE----------LHDSGY----HSVRIIGWGEERSYRGPPLKYWL 463
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITA 303
+ NSW +WG+NGLFKI RG +EC IES + A
Sbjct: 464 VVNSWGYNWGENGLFKIQRGTNECEIESYVLA 495
>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 303
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 118/286 (41%), Gaps = 39/286 (13%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRT 92
K N+ +S M + PD +LP + E+ E+ + +P FD R
Sbjct: 32 KAGMPKRFENVTEDEFRS-MLIRPDRLRARSGSLPPISITEV---QELVDPIPPQFDFRD 87
Query: 93 KWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRE---C 148
++P C ++ DQGSCG CW + G + + S+ H C E C
Sbjct: 88 EYPQC--VKPALDQGSCGGCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLENFGC 145
Query: 149 QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD---LILYKSGRFF 205
P L F ++ + + Y H V DD + LYK+ +
Sbjct: 146 DGGDFQPTWSFLTFTG-----ATTAECVKYVDYGHTVASPCPAVCDDGSPIQLYKAHGY- 199
Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA--------LGGHAIRIL 257
G + ++ I + + V+ DL Y+SG LG HA+ I+
Sbjct: 200 --GQVSKSVPAIMGMLVAGGP---LQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIV 254
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
G+G + + YW+I NSW DWG+NG F+I+RG +EC IE I A
Sbjct: 255 GYGTTDDGTD-YWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299
>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 475
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 72/145 (49%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + T+ S
Sbjct: 354 YRVSSNETQIMKEIMQNGPVQAIMKVHEDFFSYKTGIY----RHVTSTS----------- 398
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS---KEKYWLIANSWNTDWGDN 283
+D Y+ L HA+++ GWG + + KEK+W+ ANSW WG+N
Sbjct: 399 -----------EDSEKYQK---LRTHAVKLTGWGTLKGARGKKEKFWIAANSWGKSWGEN 444
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G FKILRG +E IE I A +L
Sbjct: 445 GYFKILRGVNESDIEKLIIAAWGQL 469
>gi|403287831|ref|XP_003935129.1| PREDICTED: dipeptidyl peptidase 1 [Saimiri boliviensis boliviensis]
Length = 463
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 69/268 (25%), Positives = 102/268 (38%), Gaps = 82/268 (30%)
Query: 83 DLPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRP 117
+LP ++D W N I +R+Q SCGSC+ P
Sbjct: 230 NLPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSP 285
Query: 118 YEIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKK 158
E+ C + G +C G C + +E+ Y
Sbjct: 286 QEVVSCSKYAQGCEGGFPYLIAGKYAQDFGVVEEACFPYTGTDSPC--KMKEDCFRYYSS 343
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
+ ++ Y NE + E+ HGP+ AF V+DD + Y+ G + G
Sbjct: 344 EYHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTG---------- 392
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+RD F F+ L HA+ ++G+G D S YW++ NSW T
Sbjct: 393 --LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGIHYWIVKNSWGT 431
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
WG++G F+I RG DEC IES A P
Sbjct: 432 SWGEDGYFRIRRGTDECAIESIAVAATP 459
>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
Length = 476
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470
>gi|444728469|gb|ELW68926.1| Dipeptidyl peptidase 1 [Tupaia chinensis]
Length = 462
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 88/250 (35%), Gaps = 79/250 (31%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 245 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 304
Query: 130 -----------TRPSCDASKGHTPKCV--RECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
SC G C ++C Y Y F NE +
Sbjct: 305 IAGKYAQDFGLVEESCFPYTGTDAPCKMKKDCIRYYSSEYHYVGGFYG-----GCNEALM 359
Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
E+ HGP+ AF V+DD + Y+ G + G F
Sbjct: 360 KLELVHHGPMAVAFEVYDDFLHYQKGIY---------------------QHTGLRDPFNP 398
Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
F+ L HA+ ++G+G D S YW++ NSW T WG++G F+I RG DEC
Sbjct: 399 FE----------LTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGFFRIRRGIDECS 448
Query: 297 IESSITAGVP 306
IES A P
Sbjct: 449 IESIAMAATP 458
>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
Length = 303
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 118/286 (41%), Gaps = 39/286 (13%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRT 92
K N+ +S M + PD +LP + E+ E+ + +P FD R
Sbjct: 32 KAGMPKRFENVTEDEFRS-MLIRPDRLRARSGSLPPISITEV---QELVDPIPPQFDFRD 87
Query: 93 KWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRE---C 148
++P C ++ DQGSCG CW + G + + S+ H C E C
Sbjct: 88 EYPQC--VKPALDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLENFGC 145
Query: 149 QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD---LILYKSGRFF 205
P L F ++ + + Y H V DD + LYK+ +
Sbjct: 146 DGGDFQPTWSFLTFTG-----ATTAECVKYVDYGHTVASPCPAVCDDGSPIQLYKAHGY- 199
Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA--------LGGHAIRIL 257
G + ++ I + + V+ DL Y+SG LG HA+ I+
Sbjct: 200 --GQVSKSVPAIMGMLVAGGP---LQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIV 254
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
G+G + + YW+I NSW DWG+NG F+I+RG +EC IE I A
Sbjct: 255 GYGTTDDGTD-YWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299
>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Otolemur garnettii]
Length = 467
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 73/147 (49%), Gaps = 32/147 (21%)
Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
D+ +Y + SNEK IMKE+ E+GPV+ V +D LY+SG + T +SL
Sbjct: 335 NDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIY-----SHTPVSLQ 389
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIAN 274
+ EG + G H+++I GWGE+ + KYW AN
Sbjct: 390 R-----------PEGY-------------RRHGTHSVKITGWGEETLPDGRTLKYWTAAN 425
Query: 275 SWNTDWGDNGLFKILRGKDECGIESSI 301
SW WG+ G F+I+RG +EC IES +
Sbjct: 426 SWGPAWGERGHFRIVRGANECDIESFV 452
>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
Length = 476
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470
>gi|300121514|emb|CBK22033.2| unnamed protein product [Blastocystis hominis]
Length = 476
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 71/147 (48%), Gaps = 39/147 (26%)
Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
+ + Y ++IMKEIY HGPV + V DDL+ YK G
Sbjct: 82 YYVEEYGHVEGVENIMKEIYAHGPVTCSIDVPDDLLEYKGG------------------- 122
Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
+++D K+G A GH I ++GWGE+ YW++ NSW T WG
Sbjct: 123 --------------IYED----KTGIAGDGHDISVVGWGEENGIP--YWIVRNSWGTYWG 162
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+ G F+I+RGK+ GIE T G+P++
Sbjct: 163 EEGFFRIVRGKNNLGIEEGCTYGIPRI 189
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 44/66 (66%)
Query: 244 KSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
+ GK LG HA+ + GWG DE+++ YW++ NSW T WG+NG F+I G++ IE T
Sbjct: 410 REGKWLGKHAVEVTGWGVDEETRTPYWIVRNSWGTYWGENGWFRIAMGQNLLNIEQMCTW 469
Query: 304 GVPKLD 309
GVP +D
Sbjct: 470 GVPVID 475
>gi|349605750|gb|AEQ00879.1| Dipeptidyl-peptidase 1-like protein, partial [Equus caballus]
Length = 356
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE I E+ HGP+ AF V++D + Y G + G +RD
Sbjct: 249 NEALIKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTG------------LRD-------- 288
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D S + YW++ NSW T WG++G F+I RG
Sbjct: 289 -PFNPFE----------LTNHAVLLVGYGTDSASGQDYWIVKNSWGTSWGEDGYFRIRRG 337
Query: 292 KDECGIESSITAGVP 306
DEC IES A P
Sbjct: 338 TDECAIESIAMAATP 352
>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Otolemur garnettii]
Length = 436
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 73/147 (49%), Gaps = 32/147 (21%)
Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
D+ +Y + SNEK IMKE+ E+GPV+ V +D LY+SG + T +SL
Sbjct: 304 NDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIY-----SHTPVSLQ 358
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIAN 274
+ EG + G H+++I GWGE+ + KYW AN
Sbjct: 359 R-----------PEGY-------------RRHGTHSVKITGWGEETLPDGRTLKYWTAAN 394
Query: 275 SWNTDWGDNGLFKILRGKDECGIESSI 301
SW WG+ G F+I+RG +EC IES +
Sbjct: 395 SWGPAWGERGHFRIVRGANECDIESFV 421
>gi|159115721|ref|XP_001708083.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157436192|gb|EDO80409.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 305
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 74/168 (44%), Gaps = 43/168 (25%)
Query: 132 PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFT 191
P G + +C CQ+ P + ++ A S S SN IM + GPV+ F
Sbjct: 171 PYTSGETGKSGECPTTCQDG--TPVESAFHYKAASASRLSNYNEIMVSLLADGPVQTGFY 228
Query: 192 VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS-GKALG 250
V +D + Y G I +K G +LG
Sbjct: 229 VHEDFLYYVGG--------------------------------------IYHKVYGTSLG 250
Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
GHA+ I+G+G + YW++ NSW +DWG+NG F+ILRG +ECGIE
Sbjct: 251 GHAVLIVGYGS--MNNHDYWIVRNSWGSDWGENGYFRILRGTNECGIE 296
>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
Length = 475
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 69/147 (46%), Gaps = 36/147 (24%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y +SSNE IM+EI ++GPV+ V +D YK+G R V NE
Sbjct: 354 YRISSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEP------------ 401
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
+ + L HA+++ GWG + KEK+W+ ANSW WG
Sbjct: 402 -------------------EKYRKLRTHAVKLTGWGTLRGAQGKKEKFWIAANSWGKSWG 442
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F+ILRG +E IE I A +L
Sbjct: 443 ENGYFRILRGVNESDIEKLIIAAWGQL 469
>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Ailuropoda melanoleuca]
Length = 472
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 68/143 (47%), Gaps = 36/143 (25%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IMKEI ++GPV+ V +D YK+G R NE ++
Sbjct: 351 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEESSKY--------- 401
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS---KEKYWLIANSWNTDWG 281
+ L HAI++ GWG + + KEK+W+ ANSW WG
Sbjct: 402 ----------------------RKLQTHAIKLTGWGTLKGARGQKEKFWIAANSWGKSWG 439
Query: 282 DNGLFKILRGKDECGIESSITAG 304
+NG F+ILRG +E IE I A
Sbjct: 440 ENGYFRILRGVNESDIEKLIIAA 462
>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
Length = 467
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 64/138 (46%), Gaps = 38/138 (27%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS E IM+EI GPV+ V++D LYK G +
Sbjct: 360 YRVSSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHS------------------- 400
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
YK+G H++++LGWG K+K+W+ ANSW WG+N
Sbjct: 401 ----------------YKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGEN 444
Query: 284 GLFKILRGKDECGIESSI 301
G F+ILRG++EC IE I
Sbjct: 445 GYFRILRGQNECDIEKLI 462
>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
griseus]
Length = 475
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 69/147 (46%), Gaps = 36/147 (24%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IM+EI +GPV+ V +D YK+G R + NE +
Sbjct: 354 YRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTGIYRHVISTNEES------------ 401
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS---KEKYWLIANSWNTDWG 281
+ + L HA+++ GWG + KEK+W+ ANSW WG
Sbjct: 402 -------------------EKYRKLRSHAVKLTGWGTLRGAGGKKEKFWIAANSWGKSWG 442
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F+ILRG +E IE I A +L
Sbjct: 443 ENGYFRILRGVNESDIEKLIIAAWGQL 469
>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 463
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 78/174 (44%), Gaps = 37/174 (21%)
Query: 138 KGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
K +C + N D K L+ Y V++ E+ IM EI GPV+ V D
Sbjct: 301 KETMAQCPSRVRSNNDRTTKTRLHRVGPVYRVAT-EEGIMHEILTSGPVQAVMKVSRDFF 359
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
+YKSG + S L SG G H++RI+
Sbjct: 360 MYKSGVY-------------------KCSNLA---------------SGSRTGYHSVRIV 385
Query: 258 GWGEDEKSKE--KYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
GWGE+ + + KYW+ +NSW + WG+NG F+IL+G DEC IE + A +D
Sbjct: 386 GWGEEYQGGKIVKYWIASNSWGSWWGENGYFRILKGVDECEIEDFVIAAWADID 439
>gi|296216857|ref|XP_002754752.1| PREDICTED: dipeptidyl peptidase 1 [Callithrix jacchus]
Length = 460
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 69/268 (25%), Positives = 101/268 (37%), Gaps = 82/268 (30%)
Query: 83 DLPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRP 117
+LP ++D W N I +R+Q SCGSC+ P
Sbjct: 227 NLPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSP 282
Query: 118 YEIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKK 158
E+ C + G +C G C + +E+ Y
Sbjct: 283 QEVVSCSQYAQGCEGGFPYLIAGKYAQDFGVVEEACFPYTGTDSPC--KMKEDCFRYYSS 340
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
+ ++ Y NE + E+ HGP+ AF V+DD + Y G + G
Sbjct: 341 EYHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTG---------- 389
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+RD F F+ L HA+ ++G+G D S YW++ NSW T
Sbjct: 390 --LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGIHYWIVKNSWGT 428
Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
WG++G F+I RG DEC IES A P
Sbjct: 429 SWGEDGYFRIRRGTDECAIESIAVAATP 456
>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
Length = 269
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/279 (27%), Positives = 118/279 (42%), Gaps = 39/279 (13%)
Query: 46 LSNIPRAHLKSWMGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
N+ +S M + PD +LP + E+ E+ + +P FD R ++P C
Sbjct: 5 FENVTEDEFRS-MLIRPDRLRARSGSLPPISITEV---QELVDPIPPQFDFRDEYPQC-- 58
Query: 100 IREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRE---CQENYDVP 155
++ DQGSCG CW + G + + S+ H C E C P
Sbjct: 59 VKPALDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLENFGCDGGDFQP 118
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD---LILYKSGRFFVPGNETT 212
L F ++ + + Y H V DD + LYK+ + G +
Sbjct: 119 TWSFLTFTG-----ATTAECVKYVDYGHTVASPCPAVCDDGSPIQLYKAHGY---GQVSK 170
Query: 213 AMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA--------LGGHAIRILGWGEDEK 264
++ I + + + V+ DL Y+SG LG HA+ I+G+G +
Sbjct: 171 SVPAIMGML---VAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDD 227
Query: 265 SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
+ YW+I NSW DWG+NG F+I+RG +EC IE I A
Sbjct: 228 GTD-YWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 265
>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 300
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 108/249 (43%), Gaps = 51/249 (20%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT 141
+D+P +FD R ++P+C I E+ DQG CGSCW + G ++
Sbjct: 73 DDVPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCIAGLDKK---PVKYS 127
Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
P+ V C N + + K + K T D+ + Y+S
Sbjct: 128 PQYVVSCDHG---------NMACNGGWLPNAWKFLTK----------TGTTTDECVPYQS 168
Query: 202 GRFFVPG-------------NETTAMSL------IKWTIRDNTSQLGAEGAFTVFDDLIL 242
G + G + TTA S I ++ ++ + AF V+ D +
Sbjct: 169 GSTTLRGTCPTKCADGSSKVHLTTATSYKDYGLDIPAMMKALSTTGPLQVAFLVYSDFMY 228
Query: 243 YKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
Y+SG GGHA+ ++G+G D+ + YW+I NSW DWG++G F+++RG ++C
Sbjct: 229 YESGVYQHTYGYMEGGHAVEMVGYGTDDDGVD-YWIIRNSWGPDWGEDGYFRMIRGINDC 287
Query: 296 GIESSITAG 304
IE AG
Sbjct: 288 SIEEQAYAG 296
>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
Length = 194
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 55/103 (53%), Gaps = 1/103 (0%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPYEI PC HH TP+C R+CQ Y YKKD +G K+Y + ++ K
Sbjct: 72 CRPYEITPCGHHGREPYYGECYDDAQTPRCKRKCQSGYKTTYKKDKRYGRKAYQLPNSVK 131
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRF-FVPGNETTAMSL 216
+I +EI HGPV +TV++D Y G + G ET ++
Sbjct: 132 AIQREIMMHGPVVAGYTVYEDFSYYTKGIYKHTAGRETGGHAV 174
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 25/31 (80%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG+GC+GG+P AW+++ + G+V+GG YG +
Sbjct: 39 CGYGCDGGWPIKAWQFFAREGVVTGGNYGRQ 69
>gi|45708820|gb|AAH67941.1| LOC407938 protein, partial [Xenopus (Silurana) tropicalis]
Length = 470
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 68/279 (24%), Positives = 107/279 (38%), Gaps = 84/279 (30%)
Query: 66 LPANRLPELIGYSEVDEDLPANFDSRTKWPNCP---TIREIRDQGSCGSCWG-------- 114
+P P + E + LP +D W N + +R+Q SCGSC+
Sbjct: 208 IPMRPRPAPLPTDEKYQGLPTEWD----WRNIAGYNFVTPVRNQASCGSCYAFSSMGMLE 263
Query: 115 --------------CRPYEIAPCEHHVNGTR---PSCDASK-----------------GH 140
P ++ C ++ G P A K
Sbjct: 264 SRIQIRSQLSQKPILSPQQVVSCSNYSQGCEGGFPYLIAGKYVSDYGIVEESDLPYTGSD 323
Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
+P +++ Q+ Y Y + ++ Y NE + E+ GP+ AF V+DD + Y+
Sbjct: 324 SPCTLKDSQQKY---YTAEYHYVGGFYG-GCNEAYMKLELVLGGPLSVAFEVYDDFMHYR 379
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
SG + G + F F L HA+ ++G+G
Sbjct: 380 SGVY---------------------HHTGLQDKFNPFQ----------LTNHAVLLVGYG 408
Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
D+++ EKYW++ NSW WG+ G F+I RG DEC IES
Sbjct: 409 TDQQTGEKYWIVKNSWGESWGEKGYFRIRRGTDECAIES 447
>gi|332210919|ref|XP_003254561.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1 [Nomascus
leucogenys]
Length = 463
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 94/248 (37%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +E+ Y + ++ Y NE +
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + Y+ G + G +RD F F+
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYEKGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
Length = 476
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470
>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
gorilla]
Length = 476
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470
>gi|75812938|ref|NP_001028789.1| dipeptidyl peptidase 1 precursor [Bos taurus]
gi|115312125|sp|Q3ZCJ8.1|CATC_BOVIN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|73587261|gb|AAI02116.1| Cathepsin C [Bos taurus]
Length = 463
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 91/248 (36%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+QGSCGSC+ P E+ C + G
Sbjct: 246 VTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
C G C +E Y + ++ Y NE +
Sbjct: 306 IAGKYAQDFGLVEEDCFPYTGTDSPC--RLKEGCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ GP+ AF V+DD + Y+ G + G +RD F F+
Sbjct: 363 ELVHQGPMAVAFEVYDDFLHYRKGVYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIALAATP 459
>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
Length = 454
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 73/149 (48%), Gaps = 32/149 (21%)
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
+ D+ +Y + SNEK IMKE+ E+GPV+ V +D LY+SG + T +S
Sbjct: 320 HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIY-----SHTPVS 374
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLI 272
L + + + G H+++I GWGE+ + KYW
Sbjct: 375 LGR------------------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTA 410
Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
ANSW WG+ G F+I+RG +EC IES +
Sbjct: 411 ANSWGPAWGERGHFRIVRGANECDIESFV 439
>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
Length = 432
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 101/292 (34%), Gaps = 89/292 (30%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------------- 113
P R+ + S LP F++ +W + I E+ DQG CGS W
Sbjct: 170 PTYRVKAMTRLSNPSSGLPRKFNAVERWSS--YISEVPDQGWCGSSWVLSTTSVASDRFA 227
Query: 114 ---------GCRPYEIAPCEHHVNGT----------------------------RPSCDA 136
P I C G R SC
Sbjct: 228 IQSQGKEVVQLSPQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDETCYPYTQRRDSCKI 287
Query: 137 SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
C+ Y V + L +YS+ E IM EIY GPV+ V+ D
Sbjct: 288 RHNSRSLKANGCRPAYGVN-RDSLYTVGPAYSLKG-ETDIMAEIYHSGPVQATMRVYRDF 345
Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRI 256
Y G + R + GA F H+++I
Sbjct: 346 FSYSGGVY-----------------RQTAANRGAPTGF-----------------HSVKI 371
Query: 257 LGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+GWGE E KYW+ ANSW WG++G F+ILRG +ECGIE + A P +
Sbjct: 372 VGWGE-EHDGVKYWIAANSWGPWWGEHGYFRILRGSNECGIEEYVLASWPNV 422
>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
Length = 197
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 66/136 (48%), Gaps = 39/136 (28%)
Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
TPKC + CQ Y Y++D +F ++Y + +NE+SI +EIY++GPV AF V+ D YK
Sbjct: 101 TPKCRKTCQRKYYKSYQEDKHFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYK 160
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
G + ++K G G HA++++GWG
Sbjct: 161 KG-------------------------------------IYVHKWGGQTGAHAVKVVGWG 183
Query: 261 EDEKSKEKYWLIANSW 276
+ + YWLIANSW
Sbjct: 184 RENAT--DYWLIANSW 197
>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
Length = 476
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470
>gi|344293788|ref|XP_003418602.1| PREDICTED: dipeptidyl peptidase 1 [Loxodonta africana]
Length = 463
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 76/268 (28%), Positives = 106/268 (39%), Gaps = 78/268 (29%)
Query: 84 LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
LPA++D W N I +R+Q SCGSC+ P
Sbjct: 231 LPASWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARLRILTNNSQTPVLSPQ 286
Query: 119 EIAPCEHHVNGTR---PSCDASKGHT------PKCVRECQENYDVPYKKD-LNFGAKSYS 168
E+ C + G P A K C + KKD + + Y
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTATDSPCKVKKDCFRYYSSEYH 346
Query: 169 V------SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
NE + E+ HGPV +F V+DD I Y G + G +R
Sbjct: 347 YVGGFYGGCNEALMKLELVNHGPVVVSFEVYDDFIHYHKGIYHHTG------------LR 394
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
D F F+ L HA+ ++G+G D S YW++ NSW+ WG+
Sbjct: 395 D---------PFNPFE----------LTNHAVLLVGYGTDSASGLDYWIVKNSWSATWGE 435
Query: 283 NGLFKILRGKDECGIES-SITAG-VPKL 308
+G F+I RG DECGIES ++TA +PKL
Sbjct: 436 DGYFRIRRGTDECGIESIALTATPIPKL 463
>gi|428168267|gb|EKX37214.1| hypothetical protein GUITHDRAFT_78289 [Guillardia theta CCMP2712]
Length = 224
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 71/161 (44%), Gaps = 38/161 (23%)
Query: 139 GHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLIL 198
G P C C D K + G + N + I EI +GPV AF V+ D +
Sbjct: 100 GGGPACSDVCSLGPDYSVKAS-SLGV----IQDNVRQIQSEILSNGPVFAAFWVYSDFMA 154
Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILG 258
Y +G + E A GK GGHA+ ++G
Sbjct: 155 Y-TGGVYSASKEALAQ-------------------------------GKT-GGHAVMMVG 181
Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
WG D+++ + YWL+ NSW+ WGD G FKI RG DECGIES
Sbjct: 182 WGTDKETGQDYWLLQNSWSEKWGDKGRFKIKRGVDECGIES 222
>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
Length = 476
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470
>gi|185135783|ref|NP_001117966.1| prepro-cathepsin C precursor [Oncorhynchus mykiss]
gi|51038277|gb|AAT94060.1| prepro-cathepsin C [Oncorhynchus mykiss]
Length = 457
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 116/295 (39%), Gaps = 92/295 (31%)
Query: 67 PANRLPELIGYSEVDEDLP---ANFDSRTKWPNCPTIR---EIRDQGSCGSCWGC----- 115
PA+ +P +G + V L A R W + + +R+Q SCGSC+
Sbjct: 202 PASHIPRRVGPAPVTSTLAKMAAGLPERWDWRDVNGVNYLSPVRNQASCGSCYSFALMGM 261
Query: 116 -----------------RPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY------ 152
P ++ C + G CD G P + + +++
Sbjct: 262 LEARVRLQTNNTETPIFSPQQVVSCSQYSQG----CD---GGFPYLIGKYVQDFGIVEES 314
Query: 153 -----------DVP------YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD 195
DVP Y D ++ Y S E ++M E+ ++GP+ AF V+ D
Sbjct: 315 CYPYAGTDSPCDVPDGCLRHYTSDYSYVGGFYGGCS-ESAMMLELVKNGPMGVAFEVYPD 373
Query: 196 LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIR 255
+ YK G + G ++ F+ L HA+
Sbjct: 374 FMHYKEGIY---------------------HHTGLHDSYNPFE----------LTNHAVL 402
Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG--VPKL 308
++G+G+ + +K+W++ NSW T WG+ G FK+ RG DEC IES A +PKL
Sbjct: 403 LVGYGQCHVTGQKFWVVKNSWGTKWGEEGFFKVRRGSDECAIESIAVAAKPIPKL 457
>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
protease B2; Flags: Precursor
gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
Length = 300
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 72/255 (28%), Positives = 114/255 (44%), Gaps = 63/255 (24%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT 141
+D+P +FD R ++P+C I E+ DQG CGSCW S A+ G
Sbjct: 73 DDVPESFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVATFGD- 112
Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDD 195
R C D KK + + + Y VS + + + +++ G T D+
Sbjct: 113 ----RRCVAGLD---KKPVKYSPQ-YVVSCDHGDMACNGGWLPNVWKFLTKTG--TTTDE 162
Query: 196 LILYKSGRFFVPG-------------NETTAMSL------IKWTIRDNTSQLGAEGAFTV 236
+ YKSG + G + TA S I ++ ++ + AF V
Sbjct: 163 CVPYKSGSTTLRGTCPTKCADGSSKVHLATATSYKDYGLDIPAMMKALSTSGPLQVAFLV 222
Query: 237 FDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
D + Y+SG GGHA+ ++G+G D+ + YW+I NSW DWG++G F+++
Sbjct: 223 HSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVD-YWIIKNSWGPDWGEDGYFRMI 281
Query: 290 RGKDECGIESSITAG 304
RG ++C IE AG
Sbjct: 282 RGINDCSIEEQAYAG 296
>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
Length = 476
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWVAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470
>gi|351712812|gb|EHB15731.1| Dipeptidyl-peptidase 1 [Heterocephalus glaber]
Length = 462
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 93/248 (37%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+QG CGSC+ P E+ C + G
Sbjct: 245 VSPVRNQGYCGSCYSFASMGMLEARIRILTNNTQTPILSPQEVVSCSQYAQGCEGGFPYL 304
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
SC G C + +E+ Y + ++ Y NE +
Sbjct: 305 IAGKYAQDFGFVEESCFPYTGTDAPC--KMKEDCMRYYTSEYHYVGGFYG-GCNEALMKL 361
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ +HGP+ AF V DD + Y G + G +RD F F+
Sbjct: 362 ELVQHGPMAVAFEVCDDFMHYHKGIYHHTG------------LRD---------PFNPFE 400
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D + YW++ NSW T WG+ G F+ILRG DEC IE
Sbjct: 401 ----------LTNHAVLLVGYGTDSANGMDYWIVKNSWGTSWGEKGYFRILRGTDECAIE 450
Query: 299 SSITAGVP 306
S A P
Sbjct: 451 SIAMAATP 458
>gi|296471940|tpg|DAA14055.1| TPA: dipeptidyl peptidase 1 [Bos taurus]
gi|440894445|gb|ELR46895.1| Dipeptidyl peptidase 1 [Bos grunniens mutus]
Length = 463
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 91/248 (36%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+QGSCGSC+ P E+ C + G
Sbjct: 246 VTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
C G C +E Y + ++ Y NE +
Sbjct: 306 IAGKYAQDFGLVEEDCFPYTGTDSPC--RLKEGCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ GP+ AF V+DD + Y+ G + G +RD F F+
Sbjct: 363 ELVHQGPMAVAFEVYDDFLHYRKGVYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIALAATP 459
>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 468
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 71/146 (48%), Gaps = 32/146 (21%)
Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
D+ +Y + S+EK IMKE+ E+GPV+ V +D LYK G + T +S+ +
Sbjct: 337 DIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPLSMAR 391
Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANS 275
+ + G H+++I GWGE+ + KYW ANS
Sbjct: 392 ------------------------PEQYRRHGTHSVKITGWGEETLPDGRTLKYWTAANS 427
Query: 276 WNTDWGDNGLFKILRGKDECGIESSI 301
W WG+ G F+ILRG +EC IES +
Sbjct: 428 WGPSWGERGHFRILRGSNECDIESFV 453
>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
gallus]
Length = 464
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 73/150 (48%), Gaps = 32/150 (21%)
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
+ D+ +Y ++ +EK IMKE+ E+GPV+ V +D LYKSG
Sbjct: 330 HANDIYQSTPAYRLAPSEKEIMKELMENGPVQAILEVHEDFFLYKSG------------- 376
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK---EKYWLI 272
I +T+ +G + G H+++I GWGE++ +KYW
Sbjct: 377 -----IYRHTAVAEGKG-----------PKHQQHGTHSVKITGWGEEQLPDGQVQKYWTA 420
Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSIT 302
ANSW WG++G F+I RG +EC +ES +
Sbjct: 421 ANSWGRAWGEDGHFRIARGVNECEVESFVV 450
>gi|30038325|dbj|BAC75711.1| cathepsin C [Bos taurus]
Length = 458
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 91/248 (36%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+QGSCGSC+ P E+ C + G
Sbjct: 241 VTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQYAQGCEGGFPYL 300
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
C G C +E Y + ++ Y NE +
Sbjct: 301 IAGKYAQDFGLVEEDCFPYTGTDSPC--RLKEGCFRYYSSEYHYVGGFYG-GCNEALMKL 357
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ GP+ AF V+DD + Y+ G + G +RD F F+
Sbjct: 358 ELVHQGPMAVAFEVYDDFLHYRKGVYHHTG------------LRD---------PFNPFE 396
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 397 ----------LTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIE 446
Query: 299 SSITAGVP 306
S A P
Sbjct: 447 SIALAATP 454
>gi|432108509|gb|ELK33225.1| Dipeptidyl peptidase 1 [Myotis davidii]
Length = 466
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 91/248 (36%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 249 VTPVRNQASCGSCYSFASMGMLEARIRILTNNTQSPILSPQEVVSCSQYAQGCEGGFPYL 308
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +E+ Y + ++ Y NE +
Sbjct: 309 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCIRYYTSEYHYVGGFYG-GCNEALMKL 365
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + Y G + G + F F+
Sbjct: 366 ELVHHGPMAVAFEVYDDFLHYNQGIY---------------------HHTGLKDPFNPFE 404
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D K+ YW++ NSW T WG+ G F+I RG DEC IE
Sbjct: 405 ----------LTNHAVLLVGYGTDPKTGLDYWIVKNSWGTSWGEQGYFRIRRGTDECAIE 454
Query: 299 SSITAGVP 306
S A P
Sbjct: 455 SIAMAATP 462
>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
Length = 476
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 67/141 (47%), Gaps = 32/141 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G I + +
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTG------------------IYRHVT 396
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ E + + HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 397 RTNEEAS-----------KYRKFQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAG 304
G F+ILRG +E IE I A
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466
>gi|62510425|sp|Q60HG6.1|CATC_MACFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|52782205|dbj|BAD51949.1| cathepsin C [Macaca fascicularis]
Length = 463
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ HGP+ AF V+DD + Y++G + G +RD
Sbjct: 356 NEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD-------- 395
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D S YW++ NSW T WG++G F+I RG
Sbjct: 396 -PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRG 444
Query: 292 KDECGIESSITAGVP 306
DEC IES A P
Sbjct: 445 TDECAIESIAVAATP 459
>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
Length = 362
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 112/296 (37%), Gaps = 97/296 (32%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
N + ++G EV LP F++ KWPN I + DQG+C W
Sbjct: 86 NEIHTVLGPGEV---LPRAFEASEKWPN--LIHDPLDQGNCAGSWAFSTAAVASDRVSIH 140
Query: 115 --------CRPYEIAPCEHH----VNGTR---------------PSCDASKGH------- 140
P + C+ H G R C GH
Sbjct: 141 SLGHMTPVLSPQNLLSCDTHNQQGCQGGRLDGAWWFLRRRGVVSDHCYPFSGHERNEAGP 200
Query: 141 TPKCVREC--------QENYDVP----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
P+C+ Q P + D+ +Y + SNEK IMKE+ E+GPV+
Sbjct: 201 APRCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKDIMKELMENGPVQA 260
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
V +D LY+SG + T +S + + +
Sbjct: 261 LMEVHEDFFLYQSGIY-----SHTPVSHGR------------------------PERYRR 291
Query: 249 LGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
G H+++I GWGE+ + KYW ANSW WG+ G F+I+RG +EC IES +
Sbjct: 292 HGTHSVKITGWGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANECDIESFV 347
>gi|194213370|ref|XP_001492720.2| PREDICTED: dipeptidyl peptidase 1-like [Equus caballus]
Length = 478
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE I E+ HGP+ AF V++D + Y G + G +RD
Sbjct: 371 NEALIKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTG------------LRD-------- 410
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D S + YW++ NSW T WG++G F+I RG
Sbjct: 411 -PFNPFE----------LTNHAVLLVGYGTDSASGQDYWIVKNSWGTSWGEDGYFRIRRG 459
Query: 292 KDECGIESSITAGVP 306
DEC IES A P
Sbjct: 460 TDECAIESIAMAATP 474
>gi|47550737|ref|NP_999887.1| dipeptidyl peptidase 1 precursor [Danio rerio]
gi|39794586|gb|AAH64286.1| Cathepsin C [Danio rerio]
Length = 455
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/291 (24%), Positives = 113/291 (38%), Gaps = 91/291 (31%)
Query: 67 PANRLPELIGYSEVDED------LPANFDSRTKWPNCPTIREIRDQGSCGSCWGC----- 115
PA+R+P + V D LP ++D R + +R+Q CGSC+
Sbjct: 201 PASRIPRRVRPVTVAADSKAASGLPQHWDWRNV-NGVNFVSPVRNQAQCGSCYSFATMGM 259
Query: 116 -----------------RPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY------ 152
P ++ C + G CD G P + + +++
Sbjct: 260 LEARVRIQTNNTQQPVFSPQQVVSCSQYSQG----CD---GGFPYLIGKYIQDFGIVEED 312
Query: 153 -------DVP----------YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD 195
D P Y D ++ Y S E ++M E+ ++GP+ A V+ D
Sbjct: 313 CFPYTGSDSPCNLPAKCTKYYASDYHYVGGFYGGCS-ESAMMLELVKNGPMGVALEVYPD 371
Query: 196 LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIR 255
+ YK G + G +RD + L HA+
Sbjct: 372 FMNYKEGIYHHTG------------LRDANNPF-------------------ELTNHAVL 400
Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
++G+G+ K+ EKYW++ NSW + WG+NG F+I RG DEC IES A P
Sbjct: 401 LVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAATP 451
>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
Length = 260
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 84/177 (47%), Gaps = 45/177 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVS- 170
GC+PY+ PC+H+ + +C + + C ++C +NY V Y+ DL+ + Y S
Sbjct: 124 GCQPYKNRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSW 183
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+N K I +EI +GPV V+++ + YK G
Sbjct: 184 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEG---------------------------- 215
Query: 231 EGAFTVFDDLILYKS--GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
+YKS G+ +G H ++++GWG D E YWL NSWN++WG++GL
Sbjct: 216 -----------IYKSTTGELIGYHHVKLIGWGVDGDGTE-YWLAMNSWNSNWGNDGL 260
>gi|256074073|ref|XP_002573351.1| dipeptidyl-peptidase I (C01 family) [Schistosoma mansoni]
gi|360043488|emb|CCD78901.1| putative cathepsin C [Schistosoma mansoni]
Length = 455
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 76/282 (26%), Positives = 106/282 (37%), Gaps = 92/282 (32%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPT-----IREIRDQGSCGSCWG------- 114
P+ L L G +LP FD W + P + IR+QG CGSC+
Sbjct: 208 PSKELISLTG------NLPLEFD----WTSPPDGSRSPVTPIRNQGICGSCYAFASAAAL 257
Query: 115 ---------------CRPYEIAPCEHH---VNGTRP----------------SCDASKGH 140
P + C + NG P +CD G
Sbjct: 258 EARIRLVSNFSEQPILSPQAVVDCSPYSEGCNGGFPFLIAGKYGEDFGFVSENCDPYTGE 317
Query: 141 -TPKCV--RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
T KC + C Y Y Y ++NEK + E+ +GP F V++D
Sbjct: 318 DTGKCTVSKNCTRYYTTDYSY-----IGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQ 372
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
YK G I +T+ F F+ L HA+ ++
Sbjct: 373 FYKEG------------------IYHHTTVQNDHYNFNPFE----------LTNHAVLLV 404
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
G+G D+ S E YW + NSW +WG+ G F+ILRG DECG+ES
Sbjct: 405 GYGVDKLSGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVES 446
>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
scrofa]
Length = 368
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 68/147 (46%), Gaps = 36/147 (24%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IM+EI ++GPV+ V +D YK+G R NE +
Sbjct: 247 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNEES------------ 294
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
+ L HA+++ GWG + KEK+W+ ANSW WG
Sbjct: 295 -------------------DKYRKLRTHAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWG 335
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F+ILRG +E IE I A +L
Sbjct: 336 ENGYFRILRGVNESDIEKLIIAAWGQL 362
>gi|344250687|gb|EGW06791.1| Dipeptidyl-peptidase 1 [Cricetulus griseus]
Length = 483
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G +RD
Sbjct: 376 NEALMKLELVQHGPMAVAFEVQDDFLHYHSGIYHHTG------------LRD-------- 415
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D + YW + NSW T+WG++G F+I RG
Sbjct: 416 -PFNPFE----------LTNHAVLLVGYGRDPDTGTDYWTVKNSWGTEWGESGYFRIRRG 464
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 465 TDECAIESIAVAAIP 479
>gi|343459017|gb|AEM37667.1| cathepsin C subunit [Epinephelus bruneus]
Length = 106
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/136 (33%), Positives = 69/136 (50%), Gaps = 33/136 (24%)
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
++M E+ ++GP+ AF V+ D ++YK G + G +F
Sbjct: 2 AMMLELVKNGPMAVAFEVYPDFMIYKEGIY---------------------HHTGLADSF 40
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
F+ L HA+ ++G+G K+ +KYW++ NSW TDWG++G F+I RG DE
Sbjct: 41 NPFE----------LTNHAVLLVGYGRCHKTGQKYWIVKNSWGTDWGEDGYFRIRRGSDE 90
Query: 295 CGIESSITAG--VPKL 308
C IES A +PKL
Sbjct: 91 CSIESIAVAANPIPKL 106
>gi|395815757|ref|XP_003781389.1| PREDICTED: dipeptidyl peptidase 1 [Otolemur garnettii]
Length = 575
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 66/266 (24%), Positives = 95/266 (35%), Gaps = 80/266 (30%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
LPA++D R + +R+Q SCGSC+ P E+
Sbjct: 343 LPASWDWRNVH-GVNYVSPVRNQESCGSCYSFASVGMLEARIRILTNNTQTPILSPQEVV 401
Query: 122 PCEHHVNG-------------------TRPSCDASKGHTPKCVRE--CQENYDVPYKKDL 160
C + G +C G C + C+ Y Y
Sbjct: 402 SCSQYAQGCEGGFPYLVAGKHAQDFGLVEEACFPYTGTDAPCTMKEGCRRYYSSEYHYVG 461
Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
F NE + E+ HGP+ AF V+DD + Y G +
Sbjct: 462 GFYG-----GCNEALMKLELVHHGPMAVAFEVYDDFLHYHRGIY---------------- 500
Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
G F F+ L HA+ ++G+G D + +YW++ NSW T W
Sbjct: 501 -----HHTGLTDPFNPFE----------LTNHAVLLVGYGTDSATGIQYWIVKNSWGTGW 545
Query: 281 GDNGLFKILRGKDECGIESSITAGVP 306
G++G F+I RG DEC IES A P
Sbjct: 546 GEDGYFRIRRGTDECAIESIAVAATP 571
>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 476
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 71/140 (50%), Gaps = 32/140 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y +SS + IMKEI E+GPV+ V+DD FF+ + + W++ T
Sbjct: 361 YRISSQDADIMKEIKENGPVQAVMQVYDD--------FFL---YKSGIYKHIWSLEGKTQ 409
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG---EDEKSKEKYWLIANSWNTDWGDN 283
+ H+I+I+GWG + E ++K+W+ ANSW WG+N
Sbjct: 410 NRHQKKP------------------HSIKIVGWGTLRDAEGQRQKFWIAANSWGNSWGEN 451
Query: 284 GLFKILRGKDECGIESSITA 303
G F+ILRG++EC IE ++ A
Sbjct: 452 GYFRILRGQNECDIEKTVIA 471
>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
jacchus]
Length = 476
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS+E IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
F + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 398 TNKESEKF------------QKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470
>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
Length = 467
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 72/149 (48%), Gaps = 32/149 (21%)
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
+ D+ +Y + SNEK IMKE+ E+GPV+ V +D LY+ G + T +S
Sbjct: 333 HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQGGIY-----SHTPVS 387
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLI 272
L K + + G H+++I GWGE+ + KYW
Sbjct: 388 LGK------------------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTA 423
Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
ANSW WG+ G F+I+RG +EC IES +
Sbjct: 424 ANSWGPAWGERGHFRIVRGTNECDIESFV 452
>gi|354498051|ref|XP_003511129.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1-like
[Cricetulus griseus]
Length = 470
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G +RD
Sbjct: 363 NEALMKLELVQHGPMAVAFEVQDDFLHYHSGIYHHTG------------LRD-------- 402
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D + YW + NSW T+WG++G F+I RG
Sbjct: 403 -PFNPFE----------LTNHAVLLVGYGRDPDTGTDYWTVKNSWGTEWGESGYFRIRRG 451
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 452 TDECAIESIAVAAIP 466
>gi|26340150|dbj|BAC33738.1| unnamed protein product [Mus musculus]
Length = 462
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D + +YW+I NSW ++WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 443
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 444 TDECAIESIAVAAIP 458
>gi|160707990|ref|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus musculus]
gi|3023454|sp|P97821.1|CATC_MOUSE RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|1881656|gb|AAB49457.1| preprodipeptidyl peptidase I [Mus musculus]
gi|7609786|gb|AAB58400.3| dipeptidyl peptidase I precursor [Mus musculus]
gi|45219895|gb|AAH67063.1| Cathepsin C [Mus musculus]
gi|74147157|dbj|BAE27487.1| unnamed protein product [Mus musculus]
gi|74178079|dbj|BAE29829.1| unnamed protein product [Mus musculus]
gi|148674849|gb|EDL06796.1| cathepsin C, isoform CRA_b [Mus musculus]
Length = 462
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D + +YW+I NSW ++WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 443
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 444 TDECAIESIAVAAIP 458
>gi|407196042|gb|AFT64209.1| putative cathepsin C3, partial [Eimeria tenella]
Length = 595
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 85/195 (43%), Gaps = 38/195 (19%)
Query: 120 IAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+APC H+ N R +++ G P+ D Y ++ N+ Y NE+ IM+
Sbjct: 407 VAPCLMHLGNFLRSPAESAPGCAPE---------DRWYAQEYNYVGGFYE-GCNEEKIME 456
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
EIY HGPV A D L LY+ G F V ++ + D
Sbjct: 457 EIYNHGPVVAALDAPDALFLYEDGFFDVKPSDHGKLC----------------------D 494
Query: 239 DLILYKSGKALGGHAIRILGWGEDEK-----SKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+G HAI I+GWGED + K+W++ N+W DWG NG K+ RG++
Sbjct: 495 SPNKGLTGWEYTNHAIAIVGWGEDPPRMPGMTTRKFWVVRNTWGNDWGRNGYIKMKRGEN 554
Query: 294 ECGIESSITAGVPKL 308
IES A P L
Sbjct: 555 LAAIESQAVAIDPDL 569
>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
Length = 276
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 84/196 (42%), Gaps = 51/196 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
GC+P +I P C+ +K + CV C N + Y D Y
Sbjct: 130 GCQPSKIPPV----------CNLPTKINKRTCVDYCYGNDTIKYNHD--HVKVRYYYHVK 177
Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
K I KE+ +GPV A ++DD+ L+KS G
Sbjct: 178 PKDIQKEVQTYGPVTAALNLYDDIFLHKS------------------------------G 207
Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+T L K+ K + ++++GWG + + YWL+ NSW +WG NGL KI RGK
Sbjct: 208 VYT------LTKNAKYVRLQYVKLIGWGVE--NGVDYWLLVNSWGNEWGQNGLLKIKRGK 259
Query: 293 DECGIESSITAGVPKL 308
C +ES + A VPK+
Sbjct: 260 YGCAVESFVYAAVPKI 275
>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
guttata]
Length = 469
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 78/285 (27%), Positives = 116/285 (40%), Gaps = 44/285 (15%)
Query: 54 LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
K +G P + N + E+ G S +E PA F + +WP I + DQ +CG+ W
Sbjct: 193 FKKRLGTFPPSHSLLN-MREVPGKSLPEEKFPAIFSAIYEWPE--WIHDPLDQRNCGASW 249
Query: 114 GCRPYEIAP--CEHHVNGTRP---------SCDASKGH------TPKCVRECQENYDVPY 156
+A H G SCD H R + + V Y
Sbjct: 250 AFSTASVAADRIAIHSKGQITDNLSAQNLISCDTRNQHGCNGGSIDGAWRYLKTHGVVSY 309
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY----EHGPVEGAFTVFDDLILYKSGRFFVPGNETT 212
+F K S+ + + Y +GP AF + L S + V ET
Sbjct: 310 ACYPSFWNKHLGPSAENQCYVSNEYGKNHTNGPCPNAFEKSNRLYRCAS-HYRVSSKETD 368
Query: 213 AMSLIKWTIRDNTSQLGAEGAFTVFDDLILY---------KSGKALGGHAIRILGWG--- 260
M IK + + V++D LY K+G H++++LGWG
Sbjct: 369 IMKEIK-------DRGPVQAIMKVYEDFFLYKEGIYQHSQKAGSKWKTHSVKLLGWGALP 421
Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
+ K+K+W+ ANSW WG+NG F+ILRG++EC IE I A +
Sbjct: 422 DKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILATL 466
>gi|74199074|dbj|BAE30750.1| unnamed protein product [Mus musculus]
Length = 447
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 340 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 378
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D + +YW+I NSW ++WG++G F+I RG
Sbjct: 379 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 428
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 429 TDECAIESIAVAAIP 443
>gi|74191569|dbj|BAE30359.1| unnamed protein product [Mus musculus]
Length = 462
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D + +YW+I NSW ++WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 443
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 444 TDECAIESIAVAAIP 458
>gi|256086900|ref|XP_002579622.1| cathepsin B (C01 family) [Schistosoma mansoni]
Length = 204
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 94/236 (39%), Gaps = 86/236 (36%)
Query: 73 ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP 132
+ I + V+ +P FD+R W NC TI++I D+ C + W
Sbjct: 55 QTISHRNVNMVIPHTFDARDHWVNCSTIKQIHDECCCRADW------------------- 95
Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
K Y+V ++++ I KEI +GPV + V
Sbjct: 96 -----------------------------VSEKIYNVYADQEDIQKEILMNGPVIASILV 126
Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
D ++YKSG +F P +++ LG
Sbjct: 127 KVDFLVYKSGVYF-PTPKSSN-----------------------------------LGWI 150
Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+RI+GWG + K+ YWL ANSW+ +WG+NG K+ RG IES + A +PK+
Sbjct: 151 NLRIIGWGYEGKTP--YWLCANSWSKEWGENGYVKVRRGVQAGYIESYVRAPIPKI 204
>gi|74204274|dbj|BAE39895.1| unnamed protein product [Mus musculus]
Length = 462
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D + +YW+I NSW ++WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 443
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 444 TDECAIESIAVAAIP 458
>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
Length = 476
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 68/147 (46%), Gaps = 36/147 (24%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IM+EI ++GPV+ V +D YK+G R NE +
Sbjct: 355 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDS------------ 402
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
+ + HA+++ GWG + KEK+W+ ANSW WG
Sbjct: 403 -------------------EKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWG 443
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F+ILRG +E IE I A +L
Sbjct: 444 ENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
Length = 476
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 68/147 (46%), Gaps = 36/147 (24%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IM+EI ++GPV+ V +D YK+G R NE +
Sbjct: 355 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDS------------ 402
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
+ + HA+++ GWG + KEK+W+ ANSW WG
Sbjct: 403 -------------------EKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWG 443
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F+ILRG +E IE I A +L
Sbjct: 444 ENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|12832450|dbj|BAB22112.1| unnamed protein product [Mus musculus]
Length = 461
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 354 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 392
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D + +YW+I NSW ++WG++G F+I RG
Sbjct: 393 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 442
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 443 TDECAIESIAVAAIP 457
>gi|353228747|emb|CCD74918.1| cathepsin B (C01 family) [Schistosoma mansoni]
Length = 229
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 94/236 (39%), Gaps = 86/236 (36%)
Query: 73 ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP 132
+ I + V+ +P FD+R W NC TI++I D+ C + W
Sbjct: 80 QTISHRNVNMVIPHTFDARDHWVNCSTIKQIHDECCCRADW------------------- 120
Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
K Y+V ++++ I KEI +GPV + V
Sbjct: 121 -----------------------------VSEKIYNVYADQEDIQKEILMNGPVIASILV 151
Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
D ++YKSG +F P +++ LG
Sbjct: 152 KVDFLVYKSGVYF-PTPKSSN-----------------------------------LGWI 175
Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+RI+GWG + K+ YWL ANSW+ +WG+NG K+ RG IES + A +PK+
Sbjct: 176 NLRIIGWGYEGKTP--YWLCANSWSKEWGENGYVKVRRGVQAGYIESYVRAPIPKI 229
>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 324
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 82/195 (42%), Gaps = 49/195 (25%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+P +I P + K + C C N + Y D SY+
Sbjct: 179 GCQPSKIPPIFNL---------PKKIYNRTCDNFCYGNSLIDYNHD--HVKVSYTYHVLY 227
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I +E+ +GPV F+++DDL LY SG + A
Sbjct: 228 KNIQREVQTYGPVSAYFSLYDDLFLYTSGVY----------------------------A 259
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
T + Y+S K ++GWG + + YWL+ NSW +WG NGLFKI RG D
Sbjct: 260 RTEKSKFVRYQSAK--------LIGWGVE--NGVDYWLLVNSWGNEWGQNGLFKIKRGTD 309
Query: 294 ECGIESSITAGVPKL 308
EC AGVPK+
Sbjct: 310 ECQFGRHTYAGVPKM 324
>gi|74212565|dbj|BAE31022.1| unnamed protein product [Mus musculus]
Length = 191
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 84 NEALMELELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 122
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D + +YW+I NSW ++WG++G F+I RG
Sbjct: 123 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 172
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 173 TDECAIESIAVAAIP 187
>gi|253742295|gb|EES99137.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 315
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 70/293 (23%), Positives = 109/293 (37%), Gaps = 98/293 (33%)
Query: 61 HPDYNLPANRLPELIGYSE--VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGC--- 115
H + L AN L G +E ++ D P +FD R ++P C + DQG CGSCW
Sbjct: 56 HLVHFLDANAHSHLAGRTEKNINYDYPESFDFREEYPQC--LLPTYDQGHCGSCWAFASS 113
Query: 116 -------------------RPYEIAPCEHHVNGTR----------------------PSC 134
P + C G P
Sbjct: 114 RAFGDTRCMQGLDPVPVLYSPQYLVSCSLQNMGCTGGTMEDVGDFLRDTGIATDTCVPYV 173
Query: 135 DASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD 194
D H C C + + + ++F N +++M+ I +GP+ + +++
Sbjct: 174 D-EDAHWEPCPVSCVDGSPIRTVQLMDF----VRYDGNLEAMMEAIAMNGPIHASMMIYE 228
Query: 195 DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAI 254
D + Y+SG + +Y SG G HAI
Sbjct: 229 DFMYYQSGIYH-----------------------------------FIYGSG--CGMHAI 251
Query: 255 RILGWGED--------EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
++G+G D E+ + YW+ NSW DWG+NG F+I+RG +ECGIE+
Sbjct: 252 ELVGYGTDISGDSEAGEEVRVDYWIARNSWGEDWGENGYFRIVRGNNECGIEN 304
>gi|110456454|gb|ABG74712.1| cathepsin B preproprotein-like protein [Diaphorina citri]
Length = 125
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 52/165 (31%), Positives = 75/165 (45%), Gaps = 55/165 (33%)
Query: 151 NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNE 210
+Y+ Y+ DL G K++ V + M++IYEHGP+ F+
Sbjct: 6 SYESTYRFDLKKGKKAHMVP--RCNAMRQIYEHGPLVAIFS------------------- 44
Query: 211 TTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDE 263
V+ D + YKSG ++G HA+R+LGWG +
Sbjct: 45 -------------------------VYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE- 78
Query: 264 KSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+ YWL+ANSWN WGD+G FKILRG++E IE G P+
Sbjct: 79 -NDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNVGYPQF 122
>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
abelii]
Length = 362
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 75/283 (26%), Positives = 104/283 (36%), Gaps = 94/283 (33%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE---HHVNGTRP------ 132
E LP F++ KWPN I E DQG+C W +A H + P
Sbjct: 96 EVLPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153
Query: 133 --SCDASK-------------------------------------GHTPKCVREC----- 148
SCD + G TP C+
Sbjct: 154 LLSCDTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPTPPCMMHSRAMGR 213
Query: 149 ---QENYDVPY----KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
Q P D+ Y + SN+K IMKE+ E+GPV+ V +D LYK
Sbjct: 214 GKRQATASCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKG 273
Query: 202 GRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE 261
G + T +SL + + + G H+++I GWGE
Sbjct: 274 GIY-----SHTPVSLGR------------------------PERYRRHGTHSVKITGWGE 304
Query: 262 D---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
+ + KYW ANSW WG+ G F+I+RG +EC IES +
Sbjct: 305 ETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 347
>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Nasonia vitripennis]
Length = 481
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 68/248 (27%), Positives = 102/248 (41%), Gaps = 40/248 (16%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHT 141
DLP FDSR +W N I ++DQG CG+ W ++A + S H
Sbjct: 233 DLPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQHL 290
Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNE-------KSIMKEIYEHGPVEGA----- 189
C Q Y + + V + +S I G + A
Sbjct: 291 ISCNNRGQRGCKGGYLDRAWLFMRKFGVVDEDCYPWLSGRSDKCRIPRRGKLSDAGCQRR 350
Query: 190 --FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG- 246
+ + +++ YK G + GNET M I + + V D Y+SG
Sbjct: 351 NSYNLRNEM--YKVGPAYRLGNETDIMQEI-------LTSGPVQATMRVHRDFFHYESGI 401
Query: 247 ---------KALGGHAIRILGWGEDEKSKE----KYWLIANSWNTDWGDNGLFKILRGKD 293
+ G H++RI+GWGE+ K+W +ANSW DWG++G F+I+RG +
Sbjct: 402 YVHSRPFDTRQSGYHSVRIVGWGEEPSPYNGKPIKFWRVANSWGRDWGEDGYFRIVRGNN 461
Query: 294 ECGIESSI 301
EC IES +
Sbjct: 462 ECEIESFV 469
>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
Length = 404
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 72/250 (28%), Positives = 108/250 (43%), Gaps = 37/250 (14%)
Query: 74 LIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPS 133
+I YS+ D P FD+R +W I I DQ CGS W I G R S
Sbjct: 176 VISYSK-DGQYPDEFDARREWYG--YISPIADQDWCGSDWAVSIASIV-------GDRFS 225
Query: 134 CDASKGHTPKCVRECQENYDVPYKKDLNFGAK--SYSVSSNEKSIMKEIYEHGPVEGAFT 191
+ + + + + ++ N G ++ + ++ + P EGA T
Sbjct: 226 IQSFGTENVRMSSQTLLSCHLKGQRGCNGGNLDIAFDFVKTHGLVSEQCF---PYEGAVT 282
Query: 192 ---VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-- 246
+ +D Y+ G F E M D + A G TV+ D Y+ G
Sbjct: 283 QCRIGNDCRRYRVGVPFSISKEEDIM-------YDIMTSGPALGIMTVYQDFFHYREGIY 335
Query: 247 --------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
G H++RI+GWGED +++KYW++ANSW T WG+ G F+I RG GIE
Sbjct: 336 RHTRHGDQLMRGLHSVRIVGWGED--AEDKYWIVANSWGTSWGEKGYFRIARGHSGTGIE 393
Query: 299 SSITAGVPKL 308
SS+ +P +
Sbjct: 394 SSVLTVLPYV 403
>gi|2330009|gb|AAB66719.1| cysteine protease [Giardia muris]
Length = 301
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 70/239 (29%), Positives = 106/239 (44%), Gaps = 44/239 (18%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRPYEIAPCEHHVNGT 130
++LP ++D R + +C + E+ DQ SCGSCW C + H+
Sbjct: 75 KELPKDYDPRVERAHC--LPEVADQASCGSCWAFSAVATFADRRCAYGLDSKQVHYSEQY 132
Query: 131 RPSCD----ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPV 186
SCD A G V + VP L + + ++ + +S + + PV
Sbjct: 133 VVSCDFGDGACNGGWLSNVWKFLTKTGVPKLDCLKYFS---GMTGDRESCITHCTDGSPV 189
Query: 187 EGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG 246
E LY++ G + M ++ + D Q+ AF V+ D Y SG
Sbjct: 190 E----------LYQASHVINYGMDLDRM--MEALVYDGPLQV----AFVVYSDFGYYSSG 233
Query: 247 -------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
GGHA+ ++G+G DE S KYW+I NSW DWG+ G F+I+R +ECGIE
Sbjct: 234 VYQHVNGMMEGGHAVEMVGYGIDE-SGLKYWIIRNSWGPDWGEGGYFRIIRRVNECGIE 291
>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 82/204 (40%), Gaps = 59/204 (28%)
Query: 115 CRPYEIAPCEH-HVNGTRPSCDAS----KGHTPKCVRECQENYDVPYKKD-LNFGAKSYS 168
C+PY PC H + +G C+ TP C ++C + Y D + Y
Sbjct: 174 CKPYSFPPCSHGNDSGKYSKCENDFFMLTEVTPSCTKKCHPQFSRTYDVDKIRSRENPYK 233
Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
+ +++ I EIY +GPV+ F
Sbjct: 234 LIKDQEQIKNEIYLNGPVQAVF-------------------------------------- 255
Query: 229 GAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
TVFDD + YKSG + G HA++I+GWG + + YW NSWN WG
Sbjct: 256 ------TVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGWGTE--NGVPYWEAINSWNDGWG 307
Query: 282 DNGLFKILRGKDECGIESSITAGV 305
NG FKILRG + IE + A +
Sbjct: 308 INGKFKILRGFNHLDIEGEVYASI 331
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 28/43 (65%)
Query: 72 PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
P Y E E+LP +F ++ KWP CP+I I DQG+CGSCW
Sbjct: 59 PVEYKYHEKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWA 101
>gi|344287520|ref|XP_003415501.1| PREDICTED: tubulointerstitial nephritis antigen isoform 2
[Loxodonta africana]
Length = 437
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 111/296 (37%), Gaps = 97/296 (32%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
N + ++G EV LP F++ KWPN I E DQG C W
Sbjct: 161 NEIHTVLGPGEV---LPMAFEASKKWPN--LIHEPLDQGDCAGSWAFSTAAVASDRVSIH 215
Query: 115 --------CRPYEIAPCE-HHVNGTR------------------PSCDASKGH------- 140
P + C+ H+ G R C GH
Sbjct: 216 SLGHMTPILSPQNLLSCDTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHERDKAGP 275
Query: 141 TPKCVREC--------QENYDVP----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
P C+ Q P + D+ +Y + +NEK IMKE+ E+GPV+
Sbjct: 276 VPPCMMHSRAMGRGKRQATSRCPNSHVHGNDIYQVTPAYRLGTNEKEIMKELMENGPVQA 335
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
V +D LY+ G + T +S ++ Q +
Sbjct: 336 LMEVHEDFFLYQGGIY-----SHTPVS------QERPEQY------------------RR 366
Query: 249 LGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
G H+++I GWGE+ + KYW ANSW WG+ G F+I+RG +EC IES +
Sbjct: 367 HGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 422
>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
boliviensis boliviensis]
Length = 476
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 68/145 (46%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS+E IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
F L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 398 TNKESEKFL------------KLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470
>gi|410909768|ref|XP_003968362.1| PREDICTED: dipeptidyl peptidase 1-like [Takifugu rubripes]
Length = 455
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/291 (24%), Positives = 110/291 (37%), Gaps = 86/291 (29%)
Query: 68 ANRLPELIGYSEVDEDLP---ANFDSRTKWPNCP---TIREIRDQGSCGSCWG------- 114
A+R+P + + VD +L A W N + +R+QGSCGSC+
Sbjct: 201 ASRIPIRVHPTNVDPELAKKAAALPELWDWRNVEGVNFVSPVRNQGSCGSCYCFATMGML 260
Query: 115 ---------------CRPYEIAPCEHHVNG------------------TRPSCDASKGHT 141
P ++ C + G SC G
Sbjct: 261 EARLRILTNNSQSPVLSPQQVVSCSEYSQGCDGGFPYLTGKYVQDFGIVDESCFPYMGKD 320
Query: 142 PKC--VRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILY 199
C + C+ Y YK F +E ++M E+ ++GP+ A V+ D + Y
Sbjct: 321 SPCGISQSCRRGYAAEYKYVGGFYG-----GCSEAAMMVELVKNGPMAVALEVYSDFMSY 375
Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
K G + G + D+ + L HA+ ++G+
Sbjct: 376 KGGIYHHTG------------LTDHVNPF-------------------ELTNHAVLLVGY 404
Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG--VPKL 308
G + +KYW++ NSW + WG++G F+I RG DEC IES A +PKL
Sbjct: 405 GRCHMTGQKYWIVKNSWGSSWGEDGYFRIRRGSDECAIESIAVAASPIPKL 455
>gi|390348202|ref|XP_001201161.2| PREDICTED: dipeptidyl peptidase 1-like [Strongylocentrotus
purpuratus]
Length = 458
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 66/273 (24%), Positives = 98/273 (35%), Gaps = 78/273 (28%)
Query: 78 SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCR--------------------- 116
S+ LP +FD R + +R+Q CGSC+
Sbjct: 217 SKAAFSLPESFDWR-DLNGQNFVSPVRNQAQCGSCFSFAALAMLEARLRIATNNTVQKVF 275
Query: 117 -PYEIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRE---CQENYD 153
P ++ C + G SC +G C +E C+ Y
Sbjct: 276 APQDVVDCSEYAQGCEGGFPYLIAGKYAEDFGVVEESCYPYQGVDSACSKEQPGCRRYYA 335
Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
Y+ F + NE+ + + +GP+ F V+ D + YK G +
Sbjct: 336 TNYQYIGGFYG-----ACNEELMRLALVNNGPIAVGFQVYGDFMSYKGGVY--------- 381
Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
G + + FD L HA+ ++G+G DE S +W +
Sbjct: 382 ------------HHTGVKNSMLKFDPF-------ELTNHAVLVVGYGVDEASGMSFWTVK 422
Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
NSW T WG+ G F+ILRG DECGIES P
Sbjct: 423 NSWGTGWGEGGYFRILRGTDECGIESMAMQSFP 455
>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 156
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/171 (28%), Positives = 82/171 (47%), Gaps = 42/171 (24%)
Query: 132 PSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAF 190
P C + P C EC E+Y ++DL+ + ++ + I +EI+++G V G
Sbjct: 21 PKCPSEALSQPACQTECINESYKTSLQQDLHRAKSWGRLPTSPQKIKQEIFDNGTVLGVI 80
Query: 191 TVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALG 250
++++D LYKSG + ++ +G +G
Sbjct: 81 SMYEDFRLYKSGVY-------------------------------------VHTTGGLVG 103
Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
H+++I+GWG + S + YWL NSWN +WGD+G+ K+ G E GIE+SI
Sbjct: 104 VHSLKIIGWGVE--SGQDYWLAVNSWNEEWGDHGMIKLAVG--ETGIENSI 150
>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
Length = 563
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/296 (27%), Positives = 125/296 (42%), Gaps = 93/296 (31%)
Query: 67 PANRLPELIGYSEV-----DEDLPANFDSR-TKWPNCPTIREIRDQGS-CGSCWG----- 114
P ++PEL+ ++ DE LP ++D R N T+ + + CGSCW
Sbjct: 21 PNAKVPELVKTAQPYTFLGDEVLPKSYDPRDIDGRNYVTVTKNQHIPQYCGSCWSFASVS 80
Query: 115 ------------------CRPYEIAPCEHHVNGTR---PSCDASKGH---TPK--CVREC 148
P I C+H+ NG + P H P+ C+R
Sbjct: 81 SVSDRLKLMTKGKWPVHDLSPQVILNCDHNSNGCQGGHPLTAFKYMHDHGVPEEGCMRYM 140
Query: 149 QENY---DVPYKKDLN-----FGAKSYS--------VSSNEKSIMKEIYEHGPVEGAFTV 192
+N D+ +D + F K+Y+ + EK++MKEIY GP+ + V
Sbjct: 141 AKNMECTDINICRDCDSEKGCFAVKNYTKYYVDEYGSVAGEKNMMKEIYARGPITCSIAV 200
Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
DDL+ YK G + RD T GA T+ H
Sbjct: 201 PDDLMEYKGGIY-----------------RDTT------GAKTL--------------DH 223
Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
AI ++GWGE++ +KYW+ NSW T WG+ G F+I+RG++ GIE+ VP++
Sbjct: 224 AISVVGWGEEDG--QKYWIARNSWGTFWGEKGWFRIVRGENNLGIEADCQWAVPRV 277
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 88/232 (37%), Gaps = 49/232 (21%)
Query: 87 NFDSRTKWPNCP-TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP--SCDASKGHTPK 143
N + KWP + +E+ + + G+C G ++ E+ N P +C + +
Sbjct: 371 NLMRKGKWPTVELSAQEVINCSNAGTCDGGSDADVF--EYAFNEGIPDQTCQVYEAIDKE 428
Query: 144 C-----VRECQENYDV-PYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
C +C D P K + Y E I EI+ GPV + V ++ +
Sbjct: 429 CNDMARCMDCPPGEDCYPVKDYKRYKVSEYGEVKGEMEIKAEIFARGPVSCSMIVTEEFL 488
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
Y+ G F DD G +G HA+ +
Sbjct: 489 AYQGGIFV--------------------------------DD-----RGHIVGYHAVEVA 511
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
GWGE E KYW+ NSW WG++G F+++ G + I GVP +D
Sbjct: 512 GWGETEDGT-KYWIARNSWGPYWGEHGWFRMIVGVSKGLITGYCNWGVPVID 562
>gi|410972493|ref|XP_003992693.1| PREDICTED: dipeptidyl peptidase 1 isoform 1 [Felis catus]
Length = 463
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 66/264 (25%), Positives = 101/264 (38%), Gaps = 76/264 (28%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
LPA++D R + +R+Q SCGSC+ P E+
Sbjct: 231 LPASWDWRNVH-GTNFVTPVRNQASCGSCYSFASMGMLEARIRILTNNTQTPILSPQEVV 289
Query: 122 PCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
C + G +C G C + +E+ Y + ++
Sbjct: 290 SCSQYAQGCDGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KPKEDCVRYYSSEYHY 347
Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
Y NE + E+ HGP+ AF V++D + Y+ G ++ G +R
Sbjct: 348 VGGFYG-GCNEALMKLELVHHGPMAVAFEVYNDFLHYRKGIYYHTG------------LR 394
Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
D F F+ L HA+ ++G+G D S YW++ NSW WG+
Sbjct: 395 D---------PFNPFE----------LTNHAVLLVGYGTDPVSGMDYWIVKNSWGIGWGE 435
Query: 283 NGLFKILRGKDECGIESSITAGVP 306
+G F+I RG DEC IES A P
Sbjct: 436 DGYFRIRRGTDECAIESIAVAATP 459
>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
leucogenys]
Length = 476
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 69/145 (47%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS+E IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 398 ANKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470
>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
Length = 470
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 94/267 (35%), Gaps = 80/267 (29%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEI 120
LP +FD R K + IRDQG CGSC+ P EI
Sbjct: 236 QLPESFDWR-KVMGLNFVSPIRDQGQCGSCYAFASMGMLEARLRVLTNNTQQFVLSPQEI 294
Query: 121 APCEHHVNG-------------------TRPSCDASKGHTPKC--VRECQENYDVPYKKD 159
C + G C +G C C Y Y+
Sbjct: 295 VSCGKYSQGCEGGFPYLIAGKYAEDFGVVLEECYPYEGKDSSCKDTSRCGRGYATNYRYV 354
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
F NE+ + E+ ++GP+ AF V+ D + YK G +
Sbjct: 355 GGFYG-----GCNEELMQLELVKNGPMAVAFEVYSDFMHYKGGVY--------------- 394
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
G F F+ + HA+ ++G+G D ++ K+W + NSW
Sbjct: 395 ------EHTGLSDPFNPFE----------ITNHAVLLVGYGRDPETGAKFWTVKNSWGEK 438
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WG+ G F+I RG DEC IES A P
Sbjct: 439 WGEEGFFRIRRGTDECAIESIAVAADP 465
>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
Length = 298
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 65/139 (46%), Gaps = 39/139 (28%)
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
A +S E +IM I E GPVE AFTV++D Y G +
Sbjct: 184 AGDVQTASGEAAIMAMIAEGGPVETAFTVYEDFENYAGGIYH------------------ 225
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
+ +G+ GGHA++ +GWG + + KYW +ANSWN WG+
Sbjct: 226 -------------------HVTGEEAGGHAVKFVGWGVENGT--KYWKVANSWNPYWGEA 264
Query: 284 GLFKILRGKDECGIESSIT 302
G F+ILRG +E GIE +T
Sbjct: 265 GYFRILRGSNEGGIEDQVT 283
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 1/50 (2%)
Query: 73 ELIGYSEVDEDLPANFDSRTKWPNCPT-IREIRDQGSCGSCWGCRPYEIA 121
+++ Y P FDS +WP C I +IRDQ +CG CW E A
Sbjct: 13 DVVDYVPRGGAAPEAFDSAARWPECAKLIGDIRDQSNCGCCWAFAGAEAA 62
>gi|1584943|prf||2123443A cathepsin C
Length = 482
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/282 (26%), Positives = 104/282 (36%), Gaps = 92/282 (32%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPT-----IREIRDQGSCGSCWG------- 114
P+ L L G +LP FD W + P + IR+QG CGSC+
Sbjct: 207 PSKELISLTG------NLPLEFD----WTSPPDGSRSPVTPIRNQGICGSCYASPSAAAL 256
Query: 115 ---------------CRPYEIAPCEHH---VNGTRPSCDASK-----------------G 139
P + C + NG P A K
Sbjct: 257 EARIRLVSNFSEQPILSPQTVVDCSPYSEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTGE 316
Query: 140 HTPKCV--RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
T KC + C Y Y Y ++NEK + E+ +GP F V++D
Sbjct: 317 DTGKCTVSKNCTRYYTTDYSY-----IGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQ 371
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
YK G I +T+ F F+ L HA+ ++
Sbjct: 372 FYKEG------------------IYHHTTVQTDHYNFNPFE----------LTNHAVLLV 403
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
G+G D+ S E YW + NSW +WG+ G F+ILRG DECG+ES
Sbjct: 404 GYGVDKLSGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVES 445
>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
Length = 476
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 67/147 (45%), Gaps = 36/147 (24%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IM+EI ++GPV+ V +D YK+G R NE +
Sbjct: 355 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDS------------ 402
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
+ + HA+++ GWG KEK+W+ ANSW WG
Sbjct: 403 -------------------EKYRKFRTHAVKLTGWGTLRGAHGQKEKFWIAANSWGKSWG 443
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F+ILRG +E IE I A +L
Sbjct: 444 ENGYFRILRGVNESDIEKLIIAAWGQL 470
>gi|73696355|gb|AAZ80953.1| cathepsin C [Macaca mulatta]
Length = 118
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ HGP+ AF V+DD + Y++G + G +RD
Sbjct: 11 NEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD-------- 50
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D S YW++ NSW T WG++G F+I RG
Sbjct: 51 -PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRG 99
Query: 292 KDECGIESSITAGVP 306
DEC IES A P
Sbjct: 100 TDECAIESIAVAATP 114
>gi|290988628|ref|XP_002677000.1| predicted protein [Naegleria gruberi]
gi|284090605|gb|EFC44256.1| predicted protein [Naegleria gruberi]
Length = 158
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 74/171 (43%), Gaps = 42/171 (24%)
Query: 139 GHTPKC-VRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
G P C ++ C V +K + KS +M ++ +GP++ V+ D
Sbjct: 29 GAVPACNIKSCA----VSGEKSPFYKVKSARKLKGMVDMMADLKANGPLQATMIVYKDFF 84
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
YKSG + + SG+ +G HAI+I+
Sbjct: 85 SYKSGVYH-------------------------------------HVSGRMVGAHAIKIV 107
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
GWG D SK YW+ ANSW DWG +G F I RG+ ECG+ ++ +G P L
Sbjct: 108 GWGVDSASKLPYWICANSWGEDWGLDGYFWIARGRGECGLGKTVWSGKPAL 158
>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
Length = 261
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 83/174 (47%), Gaps = 41/174 (23%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVS- 170
GC+PY+ PC+H+ + + +C + + C +C +NY V Y+ DL + Y S
Sbjct: 126 GCQPYKNRPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW 185
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+N K I +EI +GPV V+++ + YK G + ++TA
Sbjct: 186 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGVY-----KSTA----------------- 223
Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
G+ +G H ++++GWG DE E YWL NSWN++WG NG
Sbjct: 224 ---------------GELIGYHHVKLIGWGVDEAGIE-YWLAMNSWNSNWGTNG 261
>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
familiaris]
Length = 467
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 72/149 (48%), Gaps = 32/149 (21%)
Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
+ D+ +Y + +NEK IMKE+ E+GPV+ V +D LY+ G + T +S
Sbjct: 333 HANDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHEDFFLYQGGIY-----SHTPVS 387
Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLI 272
L + + + G H+++I GWGE+ + KYW
Sbjct: 388 LGR------------------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTA 423
Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
ANSW WG+ G F+I+RG +EC IES +
Sbjct: 424 ANSWGPAWGERGHFRIVRGANECDIESFV 452
>gi|403364285|gb|EJY81901.1| Cathepsin H [Oxytricha trifallax]
Length = 363
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 69/266 (25%), Positives = 105/266 (39%), Gaps = 84/266 (31%)
Query: 73 ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP 132
E + S + +DLPAN+D W + ++DQGSCGSCW + E H
Sbjct: 124 EAVDLSHIVKDLPANWD----WREHNGVTPVKDQGSCGSCWTFST--VGTLEAHF---LI 174
Query: 133 SCDASKGHTPKCVRECQENYD---------------------------VPY--------- 156
S+ + + + +C YD PY
Sbjct: 175 KYQQSRNLSEQQLVDCAGAYDNYGCNGGLPSHAFQYISDNGGIATEAAYPYFAKDRPCTI 234
Query: 157 ---KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
+K + S +++ +E + I++HGPV A+ V DD + Y SG +
Sbjct: 235 QQSQKSVGVVGGSVNLTKSEDELAIAIFQHGPVSIAYEVIDDFMDYHSGVY--------- 285
Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
T +D K+G HA+ +G+G + + YWL+
Sbjct: 286 ------TTKD-------------------CKNGPDDVNHAVVAVGFGTE--NGVDYWLVK 318
Query: 274 NSWNTDWGDNGLFKILRGKDECGIES 299
NSW+T WGDNG FKI RG + CGI +
Sbjct: 319 NSWSTKWGDNGYFKIQRGVNMCGINN 344
>gi|2499875|sp|Q26563.1|CATC_SCHMA RecName: Full=Cathepsin C; Flags: Precursor
gi|1262412|emb|CAA83543.1| cathepsin C [Schistosoma mansoni]
Length = 454
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/282 (26%), Positives = 104/282 (36%), Gaps = 92/282 (32%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCP-----TIREIRDQGSCGSCWG------- 114
P+ L L G +LP FD W + P + IR+QG CGSC+
Sbjct: 207 PSKELISLTG------NLPLEFD----WTSPPDGSRSPVTPIRNQGICGSCYASPSAAAL 256
Query: 115 ---------------CRPYEIAPCEHH---VNGTRPSCDASK-----------------G 139
P + C + NG P A K
Sbjct: 257 EARIRLVSNFSEQPILSPQTVVDCSPYSEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTGE 316
Query: 140 HTPKCV--RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
T KC + C Y Y Y ++NEK + E+ +GP F V++D
Sbjct: 317 DTGKCTVSKNCTRYYTTDYSY-----IGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQ 371
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
YK G I +T+ F F+ L HA+ ++
Sbjct: 372 FYKEG------------------IYHHTTVQTDHYNFNPFE----------LTNHAVLLV 403
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
G+G D+ S E YW + NSW +WG+ G F+ILRG DECG+ES
Sbjct: 404 GYGVDKLSGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVES 445
>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
Length = 475
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 67/141 (47%), Gaps = 32/141 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS+E IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 354 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 396
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 397 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 444
Query: 284 GLFKILRGKDECGIESSITAG 304
G F+ILRG +E IE I A
Sbjct: 445 GYFRILRGVNESDIEKLIIAA 465
>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
Length = 313
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/272 (26%), Positives = 105/272 (38%), Gaps = 87/272 (31%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGS-CGSCWG----------------------CRPY 118
+LPA+FDSR KW +C +RDQG C SCW P
Sbjct: 31 SNLPASFDSRQKWSDC--FSPVRDQGQKCSSCWAMTATGVLADRLCVASGGKVKKVLSPQ 88
Query: 119 EIAPCEHHVN----GTR---------------PSCDASKG-HTPKCVRECQENYDVPYKK 158
E+ C+ + N G R C++ K C C + +
Sbjct: 89 ELIDCDRNGNLGCGGGRLDTPLAYFRDNGVVTEKCESYKATQASSCSNTCDDG--TSFSN 146
Query: 159 DLNFGAKS-YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
+ +K Y +SS E++ +IY +GP+ F ++ D+ YKSG + + T
Sbjct: 147 TTKYHSKDCYRLSSIEQA-KADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSAT------ 199
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
YK HA R++GWG ++ + YWL ANSW
Sbjct: 200 -------------------------YKET-----HAGRVIGWGVEDGVQ--YWLAANSWG 227
Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
T WG GLFKI G +E G E++ + D
Sbjct: 228 TGWGQQGLFKIRSGTNEVGFEANFFSTTADFD 259
>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
Length = 475
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 67/141 (47%), Gaps = 32/141 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS+E IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 354 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 396
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 397 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 444
Query: 284 GLFKILRGKDECGIESSITAG 304
G F+ILRG +E IE I A
Sbjct: 445 GYFRILRGVNESDIEKLIIAA 465
>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
africana]
Length = 476
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 69/141 (48%), Gaps = 32/141 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + + IR +
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTG-------------IYRHVIRTSEE 401
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK---EKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + +K EK+W+ ANSW WG++
Sbjct: 402 S----------------EKYQKLRTHAVKLTGWGMMKGAKGRKEKFWVAANSWGKSWGED 445
Query: 284 GLFKILRGKDECGIESSITAG 304
G F+ILRG +E IE I A
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466
>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
Length = 475
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 67/141 (47%), Gaps = 32/141 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS+E IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 354 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 396
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 397 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 444
Query: 284 GLFKILRGKDECGIESSITAG 304
G F+ILRG +E IE I A
Sbjct: 445 GYFRILRGVNESDIEKLIIAA 465
>gi|255209|gb|AAB23200.1| preprocathepsin C, dipeptidylaminopeptidase I [rats, kidney,
Peptide, 462 aa]
Length = 462
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G+D + YW++ NSW + WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRG 443
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 444 TDECAIESIAMAAIP 458
>gi|6449324|gb|AAF08932.1|AF195117_1 tubulointerstitial nephritis antigen isoform TIN2 [Homo sapiens]
Length = 333
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 68/145 (46%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G I + +
Sbjct: 212 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTG------------------IYRHVT 253
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
E + + L HA+++ GWG + KEK+W+ AN W WG+N
Sbjct: 254 STNKES-----------EKYRKLQTHAVKLTGWGTRRGAQGQKEKFWIAANFWGKSWGEN 302
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE + A +L
Sbjct: 303 GYFRILRGVNESDIEKLVIAAWGQL 327
>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 463
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 67/141 (47%), Gaps = 32/141 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS+E IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 342 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 384
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 385 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 432
Query: 284 GLFKILRGKDECGIESSITAG 304
G F+ILRG +E IE I A
Sbjct: 433 GYFRILRGVNESDIEKLIIAA 453
>gi|24987409|pdb|1JQP|A Chain A, Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric
Cysteine Protease Of The Papain Family
Length = 438
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 331 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 369
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G+D + YW++ NSW + WG++G F+I RG
Sbjct: 370 DPFNPFE----------LTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRG 419
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 420 TDECAIESIAMAAIP 434
>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
jacchus]
Length = 467
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 76/185 (41%), Gaps = 41/185 (22%)
Query: 120 IAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKE 179
+ PC H T + H P Y V +Y + SN+ IMKE
Sbjct: 306 VPPCMMHSRATGRGKRQATAHCPNGHVNNNNIYQV---------TPAYRLGSNDTEIMKE 356
Query: 180 IYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDD 239
+ E+GPV+ V +D LYK G + LG +
Sbjct: 357 LMENGPVQALMEVHEDFFLYKGGIY-----------------SHTPVNLGRPERY----- 394
Query: 240 LILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
+ G H+++I GWGE+ + K KYW ANSW WG+ G F+I+RG +EC
Sbjct: 395 -------RRHGTHSVKITGWGEETWPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECD 447
Query: 297 IESSI 301
IES +
Sbjct: 448 IESFV 452
>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
Length = 476
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/145 (33%), Positives = 68/145 (46%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ AN W WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANFWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE + A +L
Sbjct: 446 GYFRILRGVNESDIEKLVIAAWGQL 470
>gi|8393218|ref|NP_058793.1| dipeptidyl peptidase 1 precursor [Rattus norvegicus]
gi|114152780|sp|P80067.3|CATC_RAT RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|220686|dbj|BAA14400.1| cathepsin C precursor [Rattus norvegicus]
gi|149069035|gb|EDM18587.1| cathepsin C, isoform CRA_a [Rattus norvegicus]
Length = 462
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G+D + YW++ NSW + WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRG 443
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 444 TDECAIESIAMAAIP 458
>gi|149635146|ref|XP_001512140.1| PREDICTED: dipeptidyl peptidase 1-like [Ornithorhynchus anatinus]
Length = 469
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ HGP+ AF V++D + Y+ G + G +RD
Sbjct: 362 NEALMKLELVRHGPMAVAFEVYNDFLHYREGVYHHTG------------LRD-------- 401
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D + YW++ NSW T WG++G F+I RG
Sbjct: 402 -PFNPFE----------LTNHAVLLVGYGTDPATGLDYWIVKNSWGTAWGEDGYFRIRRG 450
Query: 292 KDECGIESSITAGVP 306
DEC IES A P
Sbjct: 451 SDECAIESIAVAATP 465
>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Saimiri boliviensis boliviensis]
Length = 467
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/140 (34%), Positives = 68/140 (48%), Gaps = 34/140 (24%)
Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF-FVPGNETTAMSLIKWTIRDN 224
+Y + SN+ IMKE+ E+GPV+ V +D LYK G + P N
Sbjct: 343 AYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVN--------------- 387
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLIANSWNTDWG 281
LG + + G H+++I GWGE+ + K KYW ANSW WG
Sbjct: 388 ---LGRPERY------------RRHGTHSVKITGWGEETRPDGRKLKYWTAANSWGPAWG 432
Query: 282 DNGLFKILRGKDECGIESSI 301
+ G F+I+RG +EC IES +
Sbjct: 433 ERGHFRIVRGVNECDIESFV 452
>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
Length = 431
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/293 (26%), Positives = 113/293 (38%), Gaps = 91/293 (31%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
P R+ + + LP++F++ KW + I E+ DQG CG+ W +A
Sbjct: 170 PTYRVKAMTRLRNPTDGLPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFA 227
Query: 123 -------------------------CE-----------HHVNGTRPSCDASKGHTPKC-V 145
CE H +C H C +
Sbjct: 228 IQSKGKEAVQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDENCYPYTQHRDTCKI 287
Query: 146 RE---------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
R CQ +V + L +YS++ E IM EI+ GPV+ V D
Sbjct: 288 RHNSRSLRANGCQTPVNVD-RDTLYTVGPAYSLN-READIMAEIFHSGPVQATMRVNRDF 345
Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGG-HAIR 255
Y G + ET A + KAL G H+++
Sbjct: 346 FAYSGGVY----RETAA-------------------------------NRKALTGFHSVK 370
Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
++GWGE E + EKYW+ ANSW + WG++G F+ILRG +ECGIE + A P +
Sbjct: 371 LVGWGE-EHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEDYVLASWPYV 422
>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
Length = 466
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 76/154 (49%), Gaps = 33/154 (21%)
Query: 151 NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNE 210
N+ V + D+ +Y + S+EK IMKE+ E+GPV+ V +D LY++G +
Sbjct: 328 NHQV-HANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQNGIY-----S 381
Query: 211 TTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKE 267
T +SL + + + G H+++I GWGE+ +
Sbjct: 382 HTPVSLGR------------------------PERYRRHGTHSVKITGWGEESLPDGRTL 417
Query: 268 KYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
KYW ANSW WG+ G F+I+RG +EC IES +
Sbjct: 418 KYWTAANSWGPAWGERGHFRIVRGANECDIESFV 451
>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Saimiri boliviensis boliviensis]
Length = 436
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 66/139 (47%), Gaps = 32/139 (23%)
Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
+Y + SN+ IMKE+ E+GPV+ V +D LYK G +
Sbjct: 312 AYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIY-----------------SHTP 354
Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLIANSWNTDWGD 282
LG + + G H+++I GWGE+ + K KYW ANSW WG+
Sbjct: 355 VNLGRPERY------------RRHGTHSVKITGWGEETRPDGRKLKYWTAANSWGPAWGE 402
Query: 283 NGLFKILRGKDECGIESSI 301
G F+I+RG +EC IES +
Sbjct: 403 RGHFRIVRGVNECDIESFV 421
>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 303
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 116/266 (43%), Gaps = 34/266 (12%)
Query: 58 MGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGS 111
M + PD +LP + + E+ E + +P+ FD R ++P C + + DQGSCG
Sbjct: 50 MLIRPDILGAGSGSLPPSSVTEI---QEPADPIPSQFDFRDEYPQC--VTPVMDQGSCGG 104
Query: 112 CWGCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
CW + V G + S+ + C E + +F + + +
Sbjct: 105 CWAFSAIGVFGDRRCVAGIDKEGVPYSQQYLISCSTENHGCDGGDFWPTWSF--LTLTGA 162
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDD---LILYKS-GRFFVPGNETTAMSLIKWTIRDNTS 226
+ + + Y + V DD + LYK+ G V N M ++ +
Sbjct: 163 TTAECVKYIDYPNIVASPCPAVCDDGSQIQLYKAHGYGQVSKNVQAIMHML-------AT 215
Query: 227 QLGAEGAFTVFDDLILYKSGK--------ALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
+ V+ DL Y+SG +LG HA+ ++G+G + + YW+I NSW
Sbjct: 216 GGPVQTMIVVYSDLSYYESGVYKHTYGTISLGLHALEMVGYGTTDDGTD-YWIIRNSWGA 274
Query: 279 DWGDNGLFKILRGKDECGIESSITAG 304
DWG+NG F+I+RG +EC IE I A
Sbjct: 275 DWGENGYFRIVRGVNECRIEDEIYAA 300
>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
Length = 431
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/290 (26%), Positives = 109/290 (37%), Gaps = 89/290 (30%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
P R+ + + LP++F++ KW + I E+ DQG CG+ W +A
Sbjct: 170 PTYRVKAMTRLKNPTDGLPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFA 227
Query: 123 -------------------------CE-----------HHVNGTRPSC-------DASK- 138
CE H SC D K
Sbjct: 228 IQSKGKEAVQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDESCYPYTQQRDTCKI 287
Query: 139 GHTPKCVRE--CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
H + +R CQ Y+V G +YS++ E IM EI+ GPV+ V D
Sbjct: 288 RHNSRSLRANGCQTPYNVDRDTFYTVGP-AYSLN-READIMAEIFHSGPVQATMRVNRDF 345
Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRI 256
Y G + +T A + G H++++
Sbjct: 346 FAYAGGVY----RQTAANRMA------------------------------PTGFHSVKL 371
Query: 257 LGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+GWGE E + EKYW+ ANSW WG+ G F+ILRG +ECGIE + A P
Sbjct: 372 VGWGE-EHNGEKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWP 420
>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
Length = 415
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 113/296 (38%), Gaps = 97/296 (32%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
N + ++G EV LP F++ KWPN I E DQG+C W
Sbjct: 139 NEIYTVLGQGEV---LPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIH 193
Query: 115 --------CRPYEIAPCE-HHVNGTR------------------PSCDASKGH------- 140
P + C+ HH G R +C G
Sbjct: 194 SLGHMTPILSPQNLLSCDTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASP 253
Query: 141 TPKCVREC--------QENYDVPYKK----DLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
TP+C+ Q P + D+ +Y + S+EK IMKE+ E+GPV+
Sbjct: 254 TPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQA 313
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
V +D LY+ G + T +S + + +
Sbjct: 314 LMEVHEDFFLYQRGIY-----SHTPVSQGR------------------------PEQYRR 344
Query: 249 LGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
G H+++I GWGE+ + KYW ANSW WG+ G F+I+RG +EC IE+ +
Sbjct: 345 HGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFV 400
>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
Length = 467
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 32/138 (23%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y + SN+K IMKE+ E+GPV+ V +D LYK G + T +SL +
Sbjct: 344 YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR-------- 390
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
+ + G H+++I GWGE+ + KYW ANSW WG+
Sbjct: 391 ----------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 434
Query: 284 GLFKILRGKDECGIESSI 301
G F+I+RG +EC IES +
Sbjct: 435 GHFRIVRGVNECDIESFV 452
>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
sapiens]
gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; AltName:
Full=Oxidized LDL-responsive gene 2 protein;
Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TIN Ag-related protein;
Short=TIN-Ag-RP; Flags: Precursor
gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
[Homo sapiens]
gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
Length = 467
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 32/138 (23%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y + SN+K IMKE+ E+GPV+ V +D LYK G + T +SL +
Sbjct: 344 YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR-------- 390
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
+ + G H+++I GWGE+ + KYW ANSW WG+
Sbjct: 391 ----------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 434
Query: 284 GLFKILRGKDECGIESSI 301
G F+I+RG +EC IES +
Sbjct: 435 GHFRIVRGVNECDIESFV 452
>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 71/297 (23%), Positives = 110/297 (37%), Gaps = 96/297 (32%)
Query: 58 MGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGS 111
M ++PD ++P+ L E+ ++ + LPA FD R ++P+C + + DQGSCG
Sbjct: 50 MLINPDRLKARSGSMPSAPLKEI---NDPTDPLPAQFDFRDEYPHC--VSPVFDQGSCGG 104
Query: 112 CW-------------------------------------GCRPYEIAPC---EHHVNGTR 131
CW GC + P T
Sbjct: 105 CWAFSAIGMFGSRRCAVGIDKAAVLYSQQHLISCSTENFGCSGGDFFPTWSFLTQTGATT 164
Query: 132 PSC----DASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
C D C C + + + K +G S SV +IM+ + GPV+
Sbjct: 165 AECVKYVDYGSSVAAACPTTCDDGSQIQFYKAHGYGQVSKSV----PAIMQMLVSGGPVQ 220
Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
V+ DL+ Y G + R +
Sbjct: 221 TMIVVYADLLYYAGGVY-----------------RHTYGPISN----------------- 246
Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
G HA+ ++G+G + + YW I NSW +DWG++G F+I+RG +EC IE I A
Sbjct: 247 --GLHALEMVGYGTTDDGTD-YWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300
>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 322
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 81/183 (44%), Gaps = 41/183 (22%)
Query: 122 PCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
PC H +R + T +C N D+ + Y + SN+K IMKE+
Sbjct: 163 PCMMH---SRAMGRGKRQATARCPNSHVNNNDIYQVTPV------YRLGSNDKEIMKELM 213
Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
E+GPV+ V +D LYK G + T +SL +
Sbjct: 214 ENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR----------------------- 245
Query: 242 LYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
+ + G H+++I GWGE+ + KYW ANSW WG+ G F+I+RG +EC IE
Sbjct: 246 -PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIE 304
Query: 299 SSI 301
S +
Sbjct: 305 SFV 307
>gi|28804799|dbj|BAC57943.1| cathepsin C [Marsupenaeus japonicus]
Length = 449
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 86/197 (43%), Gaps = 38/197 (19%)
Query: 112 CWGCRPYEIA-PCEHHVNGTRPSCDASKGHTPKCVRE-CQENYDVPYKKDLNFGAKSYSV 169
C G P+ IA V +C +G C R C ++Y Y+ Y
Sbjct: 287 CEGGFPFLIAGRYAQDVGVVLENCYPYEGKDDTCTRSSCTKHYTAYYRY-----VGGYYG 341
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
+ NE+ + + + GP+ V+DD + YKSG + G +RD+ + L
Sbjct: 342 ACNEEEMKIALIKGGPLIVGLEVYDDFLHYKSGIYHHTG------------LRDSFNPL- 388
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
L HA+ ++G+GEDE + EKYW + NSW WG++G F+I
Sbjct: 389 ------------------ELTNHAVLLVGYGEDETTGEKYWSVKNSWGEGWGEDGYFRIR 430
Query: 290 RGKDECGIESSITAGVP 306
RG DEC IES VP
Sbjct: 431 RGVDECAIESMAVEAVP 447
>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Adrenocortical zonation factor 1; Short=AZ-1;
AltName: Full=Androgen-regulated gene 1 protein;
AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TARP; Flags: Precursor
gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
musculus]
gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
Length = 466
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 113/296 (38%), Gaps = 97/296 (32%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
N + ++G EV LP F++ KWPN I E DQG+C W
Sbjct: 190 NEIYTVLGQGEV---LPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIH 244
Query: 115 --------CRPYEIAPCE-HHVNGTR------------------PSCDASKGH------- 140
P + C+ HH G R +C G
Sbjct: 245 SLGHMTPILSPQNLLSCDTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASP 304
Query: 141 TPKCVREC--------QENYDVPYKK----DLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
TP+C+ Q P + D+ +Y + S+EK IMKE+ E+GPV+
Sbjct: 305 TPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQA 364
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
V +D LY+ G + T +S + + +
Sbjct: 365 LMEVHEDFFLYQRGIY-----SHTPVSQGR------------------------PEQYRR 395
Query: 249 LGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
G H+++I GWGE+ + KYW ANSW WG+ G F+I+RG +EC IE+ +
Sbjct: 396 HGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFV 451
>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like 1 [Pan troglodytes]
Length = 472
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 32/138 (23%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y + SN+K IMKE+ E+GPV+ V +D LYK G + T +SL +
Sbjct: 349 YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR-------- 395
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
+ + G H+++I GWGE+ + KYW ANSW WG+
Sbjct: 396 ----------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 439
Query: 284 GLFKILRGKDECGIESSI 301
G F+I+RG +EC IES +
Sbjct: 440 GHFRIVRGVNECDIESFV 457
>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
gorilla gorilla]
Length = 462
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 32/138 (23%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y + SN+K IMKE+ E+GPV+ V +D LYK G + T +SL +
Sbjct: 339 YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR-------- 385
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
+ + G H+++I GWGE+ + KYW ANSW WG+
Sbjct: 386 ----------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 429
Query: 284 GLFKILRGKDECGIESSI 301
G F+I+RG +EC IES +
Sbjct: 430 GHFRIVRGVNECDIESFV 447
>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
paniscus]
Length = 467
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 32/138 (23%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y + SN+K IMKE+ E+GPV+ V +D LYK G + T +SL +
Sbjct: 344 YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR-------- 390
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
+ + G H+++I GWGE+ + KYW ANSW WG+
Sbjct: 391 ----------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 434
Query: 284 GLFKILRGKDECGIESSI 301
G F+I+RG +EC IES +
Sbjct: 435 GHFRIVRGVNECDIESFV 452
>gi|14290553|gb|AAH09048.1| TINAGL1 protein [Homo sapiens]
Length = 218
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 77/183 (42%), Gaps = 41/183 (22%)
Query: 122 PCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
PC H + H P + Y V Y + SN+K IMKE+
Sbjct: 59 PCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQV---------TPVYRLGSNDKEIMKELM 109
Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
E+GPV+ V +D LYK G + T +SL +
Sbjct: 110 ENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR----------------------- 141
Query: 242 LYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
+ + G H+++I GWGE+ + KYW ANSW WG+ G F+I+RG +EC IE
Sbjct: 142 -PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIE 200
Query: 299 SSI 301
S +
Sbjct: 201 SFV 203
>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
Length = 422
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 84/335 (25%), Positives = 123/335 (36%), Gaps = 72/335 (21%)
Query: 18 PGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGY 77
P W+ V +YG K + H++ + + + L EL Y
Sbjct: 79 PETTWKAKFNKFGVKNRSYGFKYTRNQTAVEEYMEHIRKFF----ESDAMKRHLEELDNY 134
Query: 78 SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEH--HVNGTRPSCD 135
DLP FD+R KWPNCP+I + +QG CGSC+ +A H NGT +
Sbjct: 135 KS--SDLPKAFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKAL- 191
Query: 136 ASKGHTPKCVRECQENYD-----------------------VPYKKDLNFGA----KSYS 168
S+ C C Y PY DL+ G ++
Sbjct: 192 LSEEDIIGCCSVCGNCYGGDPLKALTYWVNQGLVTGGRDGCRPYSFDLSCGVPCSPATFF 251
Query: 169 VSSNEKSIMKE---IYEHGPVE--GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
+ +++ M+ IY E F F + +S G E + I D
Sbjct: 252 EAEEKRTCMRRCQNIYYQQRYEEDKHFATFAYSLYPRSMTVSPDGKERVKVPTIIGHFND 311
Query: 224 -NTSQLGAEG-----------------AFTVFDDLILYKSG------------KALGGHA 253
NT +L AF V ++ + Y SG + + H
Sbjct: 312 KNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDRIVYWHV 371
Query: 254 IRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
+R++GWG+ E YWL NS+ + WGDNGLFKI
Sbjct: 372 VRLIGWGQSEDGTH-YWLAVNSFGSHWGDNGLFKI 405
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.137 0.447
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,609,389,746
Number of Sequences: 23463169
Number of extensions: 251284525
Number of successful extensions: 508762
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4034
Number of HSP's successfully gapped in prelim test: 1988
Number of HSP's that attempted gapping in prelim test: 491662
Number of HSP's gapped (non-prelim): 15870
length of query: 309
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 167
effective length of database: 9,027,425,369
effective search space: 1507580036623
effective search space used: 1507580036623
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)