BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy15353
(344 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
Length = 333
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 137/349 (39%), Positives = 183/349 (52%), Gaps = 22/349 (6%)
Query: 1 MIHILVFLL----GCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQF 54
MI FLL G RG + S YID IN+++ TW AG NF +S Y+R
Sbjct: 1 MILKFAFLLTVYAGAAYSRGAVSNGILSKDYIDSINKDSKTWRAGSNFDEEISTSYIRGL 60
Query: 55 LIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
+ + D LP T +P+ FD+R++WP+C TI + D G+C +
Sbjct: 61 MGVLPNHKDYLPPALPTLLGT------EQIPENFDSRQKWPHCPTISLIRDQGSCGSCWA 114
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
F AV A SDR CI S N +S E + SCC C + C+ G W+F K+G
Sbjct: 115 FGAVEAMSDRLCIHSNKIVN--VSAENLLSCCYSCGF----GCNGGFPGAAWSFWKKKGL 168
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
V+GG YG GCQP I+PC HH + P + PK CHT C N Y + +DK
Sbjct: 169 VSGGLYGSHKGCQPYAIAPCEHHANGTRPPCSGGGRTPK--CHTFCENEDYSLPYEKDKS 226
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V + I+ EI+ +GP A F++Y DF +YKSGVY+H + L H+ ++
Sbjct: 227 FGRSSYSVKSDPKQIQLEIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLLGG--HAIRI 284
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+GWG ENGTPYWLV N+W WGD GT KIL+G C E I AG P+
Sbjct: 285 LGWGVENGTPYWLVANSWNTDWGDNGTFKILKGSDHCGIEGSIVAGLPQ 333
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
Length = 335
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 135/353 (38%), Positives = 192/353 (54%), Gaps = 29/353 (8%)
Query: 1 MIHILVFLLGCTLVRG---ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
M+ + V ++ T G + Y S +ID+IN +A+TW AGRNF ++S Y+R +
Sbjct: 1 MVLLAVAVVSGTTAAGSGNKKYALSAKFIDEINSKASTWRAGRNFHPDVSLSYIRGLMGV 60
Query: 58 DAKYFDQSDRPLPGDRKTYDPEY----SATV---PDRFDAREQWPNCGTIGHVPDTGACA 110
+ K +PE+ SA V P+ FD+REQWPNC TI + D G+C
Sbjct: 61 HQDAY-----------KFREPEFVHDLSADVDDLPENFDSREQWPNCPTIREIRDQGSCG 109
Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
+ F AV A SDR CI S G+ + S E + SCC C + C+ G W++
Sbjct: 110 SCWAFGAVEAMSDRVCIASGGKIHFRFSAEDLVSCCHTCGF----GCNGGFPGAAWSYWV 165
Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
+G V+GG +G GCQP I+PC HH + T PSCE + KC +C + +Y +
Sbjct: 166 HKGLVSGGPFGSNLGCQPYAIAPCEHHVNG-TRPSCEGEGGKTPKCVKKCQD-SYTVPYA 223
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
+DK + +Y + +ED I+KEI+ +GP F +Y+D HYK GVY+H + L H
Sbjct: 224 KDKRYGSKSYSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGG--H 281
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+ +++GWG EN T YWL+ N+W WGD G KILRG+ E IAAG PK
Sbjct: 282 AIRILGWGVENNTKYWLIANSWNSDWGDNGFFKILRGEDHLGIESSIAAGLPK 334
>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
Length = 333
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 138/348 (39%), Positives = 185/348 (53%), Gaps = 21/348 (6%)
Query: 1 MIHILVF-LLGCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
M ++ F LL C + + SD +ID IN TW AGRNF N ++YL+ L
Sbjct: 1 MKELIPFSLLICGIFSASIPTDPLSDEFIDYINSLQTTWRAGRNFAPNTPKKYLKS-LAG 59
Query: 58 DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
++ LP D T+PD FDAR+QWPNC TIG + D G+C + F A
Sbjct: 60 GVHKNTKNGFTLP----IRDVSLDITLPDEFDARKQWPNCSTIGEIRDQGSCGSCWAFGA 115
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
V A SDR CI S G+ LS E + SCC C C GS W + HK G V+G
Sbjct: 116 VEAMSDRLCIHSNGKLQVHLSAENLLSCCDSC----GDGCLGGSPESAWEYWHKFGIVSG 171
Query: 178 GDYGDRTGCQPSTISPCSH--HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
G+YG + GCQP +I+PC H HGS+P + K +C + P Y + F+ +
Sbjct: 172 GNYGSKQGCQPYSIAPCEHSIHGSSPACGGVTDTPKCKKQCEKGYSIP-YDKAFYYGQP- 229
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
Y + ++ I+ EIL +GP A+F +Y+D + YK GVY+H + L H K+
Sbjct: 230 ---GYAIPNDAQKIQAEILKNGPIVASFLVYEDLFSYKEGVYQHVAGEFLGG--HVIKIF 284
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
GWG ENGTPYWLV N+W WG+ G KI RGK EC E ++AG P+
Sbjct: 285 GWGIENGTPYWLVANSWNTDWGNNGFFKIPRGKDECGIEIDVSAGLPR 332
>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
Length = 334
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 134/344 (38%), Positives = 187/344 (54%), Gaps = 18/344 (5%)
Query: 1 MIHILVFLLGCT-LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
+ I+ + C L L SD I IN A TW A R FPAN SEEY L+
Sbjct: 4 FVTIVCAIFVCVYLTEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSR 62
Query: 60 KYFDQSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
Y + ++ + K YDP Y P +FD+RE W +C IGH+ D G C + F+
Sbjct: 63 GYKNYTNE---AEIKKYDPLYVENDSPQQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTT 119
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
GAF+DR C+ + G+ N LS E +A CCK C C G + W + +G TGG
Sbjct: 120 GAFADRLCVSTGGKFNELLSPEELAFCCKDC----GNGCEGGYPIKAWRYFRTQGVTTGG 175
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
DY + GC+P ++PC + T C + + + + +C YG+ Q +++T
Sbjct: 176 DYDTKEGCKPYKVAPCYNKQGKNT---CGGKPMER---NHQCPKTCYGKTTDQKRYKTKS 229
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
Y V ++ I+++I +GP A+F +YDDF YKSG+Y+ T NAK +N HS K+IGWG
Sbjct: 230 EY-VINSIKTIEQDIKTYGPVEASFDVYDDFSVYKSGIYRKTPNAKYQN-GHSVKIIGWG 287
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
ENGTPYWL +N+W WGD GT KI++GK EC E + AG P
Sbjct: 288 QENGTPYWLAVNSWSKFWGDHGTFKIIKGKNECGIERAVTAGIP 331
>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
Length = 326
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 140/346 (40%), Positives = 185/346 (53%), Gaps = 24/346 (6%)
Query: 1 MIHILVF-LLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
M + VF LL T R +L+ D I IN +TWTAG NF N+ +EYL+
Sbjct: 1 MWRVCVFVLLSVTCARPQLHTH-DEMISFINAARSTWTAGVNF-DNVPKEYLKSLC---- 54
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
L G R + ++S V PD FD R+QWPNC T+ + D G+C + F A
Sbjct: 55 ------GTVLKGPRLPHTVKHSTNVKLPDSFDLRDQWPNCKTLSQIRDQGSCGSCWAFGA 108
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
V + SDR CI SKG+Q+ +S E + SCC C + CS G W++ + G VTG
Sbjct: 109 VESISDRICIHSKGKQSPEISAEDLLSCCDQCGF----GCSGGFPAEAWDYWRRSGLVTG 164
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
G Y GC+P +I+PC HH + P Q PK C C P Y + QDKH +
Sbjct: 165 GLYNSDVGCRPYSIAPCEHHVNGTRPPCSGEQDTPK--CTGVCI-PKYSVPYKQDKHFGS 221
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
Y V ++ I E+ +GP A F +Y+DF YKSGVY+H + + L H+ K++GW
Sbjct: 222 KVYNVPSDQQQIMTELYTNGPVEAAFTVYEDFPLYKSGVYQHLTGSALGG--HAVKILGW 279
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G ENGTP+WLV N+W WGD G KILRG EC E + AG PK
Sbjct: 280 GEENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIESEMVAGLPK 325
>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
Length = 332
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 183/343 (53%), Gaps = 19/343 (5%)
Query: 4 ILVFLLGCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
IL LL C + SD +ID IN TW AGRNF N ++YL+ +A
Sbjct: 5 ILFSLLICGTFSASIPTDPLSDEFIDYINSLQTTWRAGRNFAPNTPKKYLKS--LAGVHK 62
Query: 62 FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
+ LP + + D TVPD FDAR+ WPNC +I + D G+C + F AV A
Sbjct: 63 DANNAFTLPKRQVSVD----VTVPDEFDARKHWPNCSSITEIRDQGSCGSCWAFGAVEAM 118
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
SDR CI S G+ LS E + SCC C Y C GS W + HK G V+GG+YG
Sbjct: 119 SDRICIHSNGKLQVHLSAENLLSCCDSCGY----GCLGGSAENAWEYWHKFGIVSGGNYG 174
Query: 182 DRTGCQPSTISPCSHHGSAP-TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
+ GCQP +I+PC H S P + P+CE + KC +C YG + D Y
Sbjct: 175 SKQGCQPYSIAPCEH--SIPGSRPACEGVR-DTPKCKKQCEK-GYGIPYGDDLCYGQPGY 230
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
++++ I+ EIL +GP A+ +Y+D + YK+GVY+H + L H K++GWG E
Sbjct: 231 TIENDAQKIQAEILKNGPIVASILVYEDLFSYKAGVYQHVAGEVLGG--HVIKILGWGVE 288
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
N TPYWLV N+W WG+ G KILRG EC E I AG P+
Sbjct: 289 NDTPYWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAGIPR 331
>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
Length = 332
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 136/344 (39%), Positives = 180/344 (52%), Gaps = 18/344 (5%)
Query: 2 IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DA 59
+ + + L+ V + SD +I + E +TW AGRNF +LS Y R+ + D+
Sbjct: 3 VIVGLLLVAAVAVSANNHFLSDKFIKMLQSEDSTWEAGRNFNRHLSIRYFRRLMGVHPDS 62
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
KY +PG PE + +P FD+R WP C TIG + D G+C + F AV
Sbjct: 63 KYH------MPGYEAHKIPE-NFDMPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVE 115
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
SDR+CI SKG+ N S+E + SCC +C + N G+ F+ W +H G V+GG
Sbjct: 116 VMSDRQCIHSKGKSNFHYSSENLVSCCHLCGFGCNGGFP-GAAFKYW--VHS-GIVSGGS 171
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ GCQP I+PC HH P E PK C RC N Y + D H
Sbjct: 172 FNSTQGCQPYEIAPCEHHVPGPRPKCSEGGGTPK--CVKRCEN-GYTVDYESDLHHGGKA 228
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + +ED IK EI+ +GP F +Y DF HYKSGVY+H L H+ +++GWG
Sbjct: 229 YSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGG--HAIRILGWGE 286
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
ENGTPYWL N+W WGD G KILRG C E I+AG PK
Sbjct: 287 ENGTPYWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLPK 330
>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
Length = 338
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 126/325 (38%), Positives = 171/325 (52%), Gaps = 14/325 (4%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
+Y S+ +I+ +N + TWTAGRNFPAN +++ + A + D L + T+D
Sbjct: 23 VYPLSEDFINILNSKPKTWTAGRNFPANTPFAHIKMLMGAL-----KDDNILKLPKMTHD 77
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
E A++P+ FD R++WPNC T+ + D G+C + F AV A +DR C S G ++
Sbjct: 78 AELIASLPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCTYSDGTKHFHF 137
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S E + SCC IC C+ G W + G V+GG Y GC P + PC HH
Sbjct: 138 SAEDLLSCCPICGL----GCNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPYEVPPCEHH 193
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
LP + K P KC C Y F +DKH Y V NED IK E+ +G
Sbjct: 194 VPGNRLPCNGDTKTP--KCQKTC-EAGYNVPFKKDKHYGKHVYSVSGNEDNIKAELFKNG 250
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P F +Y D YKSGVY+HT + L H+ K++GWG ENG+ YWL+ N+W WG
Sbjct: 251 PVEGAFTVYSDLLSYKSGVYQHTDGSALGG--HAVKILGWGVENGSKYWLIANSWNSDWG 308
Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
D G KILRG+ C E I G+P
Sbjct: 309 DNGFFKILRGEDHCGIESSIVTGEP 333
>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
Length = 349
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 130/340 (38%), Positives = 182/340 (53%), Gaps = 17/340 (5%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
+ + L L SD I IN A TW A R FPAN SEEY L+ Y +
Sbjct: 8 VCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSRGYKN 66
Query: 64 QSDRPLPGDRKTYDPEY-SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
++ + K YDP Y P +FD+RE W +C IGH+ D G C + F+ GAF+
Sbjct: 67 YTNEV---EIKKYDPLYVENNSPKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFA 123
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR C+ + G+ N+ LS E +A CC C K C G + W + +G TGGDY
Sbjct: 124 DRLCVSTGGKFNQLLSPEELAFCCMDC----GKGCGGGYPIKAWKYFRTQGVTTGGDYDT 179
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
+ GC P + PC T C + + + + +C YG+ QD+++T Y +
Sbjct: 180 KEGCMPYKVPPCYDEQGKNT---CGGKPMER---NHQCPKTCYGKTTVQDRYKTKNEYVI 233
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
+ E I+++++ +GP A+F +YDDF YKSG+Y+ T AK E HS K+IGWG ENG
Sbjct: 234 NSIE-TIEQDLMTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEG-GHSIKIIGWGEENG 291
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
TPYWL +N+W WGD GT KI++G+ EC E + AG P
Sbjct: 292 TPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIP 331
>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
Length = 331
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 137/350 (39%), Positives = 177/350 (50%), Gaps = 28/350 (8%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
M IL L+ V + SD +I Q+ E +TW AGRNF +LS +Y R+ +
Sbjct: 1 MRVILGLLVAAVAVNASSHFLSDKFIRQLQSEDSTWEAGRNFNKHLSIKYFRRLMGVHP- 59
Query: 61 YFDQSDRPLPGDRKTYDPEYSA-------TVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
D K + P+Y A +P FD+R WP C TIG + D G+C +
Sbjct: 60 -----------DSKFHMPKYEAHQIPENFEMPKEFDSRAAWPMCPTIGEIRDQGSCGSCW 108
Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
F AV SDR+CI SKG+ N S E + SCC +C + N G+ F+ W +H G
Sbjct: 109 AFGAVEVMSDRQCIHSKGKSNFHYSAENLVSCCHLCGFGCNGGFP-GAAFKYW--VHS-G 164
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
V+GG + GCQP I+PC HH S P E PK C C Y + D
Sbjct: 165 IVSGGSFNSTQGCQPYEIAPCEHHVSGPRPKCSEGGGTPK--CAKTCEK-GYIVDYESDL 221
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
H Y + +ED IK EI+ +GP F +Y DF HYKSGVY+H L H+ +
Sbjct: 222 HHGGKAYSIMKDEDQIKYEIMNNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGG--HAIR 279
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
++GWG ENGTPYWL N+W WGD G KILRG C E I+AG PK
Sbjct: 280 VLGWGEENGTPYWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLPK 329
>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
Length = 344
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 171/321 (53%), Gaps = 15/321 (4%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP-GDRKTYDPEYSA 82
A ID +N WTAG E+ +++ + DAKY +P D E S
Sbjct: 31 ALIDYVNSAQKLWTAGHQVVPK--EKIMKKLM--DAKYV------VPHKDEDIVATEVSD 80
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+PDRFDAREQWP+C +I ++ D C + FAA A SDR CI S G N LS+E +
Sbjct: 81 AIPDRFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDL 140
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC + C G + W + K G VTGG Y + GC+P +I+PC + T
Sbjct: 141 LSCCT-GIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVT 199
Query: 203 LPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
P C P KC CT N TY + QDKH Y V + I+ EIL +GP
Sbjct: 200 WPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEV 259
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
F +Y+DFY Y +GVY HT+ A L H+ K++GWG +NGTPYWLV N+W +WG++G
Sbjct: 260 AFTVYEDFYQYTTGVYVHTAGASLGG--HAVKILGWGVDNGTPYWLVANSWNINWGEKGY 317
Query: 322 VKILRGKYECAFEYLIAAGKP 342
+I+RG EC E+ AG P
Sbjct: 318 FRIIRGLNECGIEHSAVAGIP 338
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
Length = 337
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 130/350 (37%), Positives = 188/350 (53%), Gaps = 22/350 (6%)
Query: 1 MIHILVFLL---GCTLVRG--ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL 55
+ H++V L G G + Y S +I++IN +A TW AG+NF + S Y+R +
Sbjct: 2 LFHLVVIALAAVGTNAAAGGSKKYPLSSKFIEEINTKATTWRAGQNFHPDTSLTYIRGLM 61
Query: 56 IA--DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
DA F + + +D +P+ FD+REQWPNC TI + D G+C +
Sbjct: 62 GVHPDADKFREPE-------ILHDLSDGDELPENFDSREQWPNCPTIREIRDQGSCGSCW 114
Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
F AV A SDR C+ S G+ + S E + SCC C + C+ G W++ ++G
Sbjct: 115 AFGAVEAMSDRVCVASGGKIHFRFSAEDLVSCCHTCGF----GCNGGFPGAAWSYWVRKG 170
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
V+GG +G GCQP I+PC HH + T PSCE + KC +C +Y + +DK
Sbjct: 171 LVSGGPFGSNLGCQPYAIAPCEHHVNG-TRPSCEGEGGKTPKCVKKCQE-SYNVPYQKDK 228
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
+Y + +E I+KEI+ +GP F +Y+D HYK GVY+H + L H+ +
Sbjct: 229 RFGASSYSIARHEAQIQKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGG--HAIR 286
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
++GWG ENGT YWL+ N+W WGD G KILRG+ E I+AG PK
Sbjct: 287 ILGWGVENGTKYWLIANSWNSDWGDNGFFKILRGEDHLGIESSISAGLPK 336
>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 132/344 (38%), Positives = 182/344 (52%), Gaps = 15/344 (4%)
Query: 1 MIHILVFLLGCTL-VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
++ +L+F GC +R +L SD +ID IN W+AGRNF N YL+ +
Sbjct: 9 LVGLLIFSFGCCDDIRVDLDPLSDEFIDHINSIQYYWSAGRNFHKNTPMSYLKGLM---G 65
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ + P +Y + +P+ FDARE WPNC TI V D G+C + F AV
Sbjct: 66 VHESNAHYPKLEQLVSYT-DTPTDLPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVE 124
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR CI SKG +N S E + SCC+ C + C+ G W++ +G V+GG
Sbjct: 125 AMSDRVCIHSKGAKNFHFSAENLVSCCRTCGF----GCNGGFPGAAWHYWKTKGIVSGGP 180
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
YG + GC P I+PC HH + P E K P C +C + Y + QD HR
Sbjct: 181 YGSKMGCIPYEIAPCEHHVNGTRGPCKEGGKTP--ACVKKCED-GYKVPYAQDLHRGKSA 237
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + ++ D I++EI +GP F +Y+DF Y++GVYKH + L H+ +++GWG
Sbjct: 238 YSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGG--HAIRILGWGV 295
Query: 300 ENG-TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+NG PYWLV N+W WG G KILRG EC E I AG P
Sbjct: 296 QNGEIPYWLVANSWNSDWGSDGFFKILRGSDECGIEGQINAGLP 339
>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
Length = 332
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/346 (37%), Positives = 184/346 (53%), Gaps = 18/346 (5%)
Query: 1 MIHILVF-LLGCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
M ++ F LL C + + SD +ID IN TW AGRNF N ++YL+ +A
Sbjct: 1 MKELIPFSLLICGIFSASIPTDPLSDEFIDYINSLQTTWRAGRNFAPNTPKKYLKS--LA 58
Query: 58 DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
+ LP + + D T+P FDAR+ WPNC +I + D G+C + F A
Sbjct: 59 GVHKDANNAFTLPKRQVSLD----VTLPKEFDARKHWPNCTSIAEIRDQGSCGSCWAFGA 114
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
V A SDR CI S G+ LS E + SCC C + C G W++ G V+G
Sbjct: 115 VEAMSDRICIHSNGKLQVHLSAENLVSCCDSCGF----GCDGGYPASAWDYWQNVGIVSG 170
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
G+YG + GCQP +I+PC HH P P+C + C +C + G + +D +
Sbjct: 171 GNYGSKQGCQPYSIAPCEHHVPGPR-PACSGEGSTP-DCRNQCDKRS-GISYDKDLYYGE 227
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
Y ++D I+ EIL +GP A F +Y+D +YK GVY+H + + L H+ K++GW
Sbjct: 228 SAYSLEDEAKQIQAEILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGG--HAIKILGW 285
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G EN TPYWLV N+W WG+ G KILRGK EC E ++AG P+
Sbjct: 286 GVENDTPYWLVANSWNTDWGNNGFFKILRGKDECGIEIDVSAGLPR 331
>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
Length = 340
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 124/329 (37%), Positives = 181/329 (55%), Gaps = 13/329 (3%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
R ++ S +IDQIN +A TW AG NF S ++R + +D+ +P
Sbjct: 24 RQRIHPLSQKFIDQINSKATTWKAGPNFSPETSMSFIRGLM----GVHKDADKFMP-PVY 78
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
++ E P+ FD+R QWPNC TIG + D G+C + F AV A SDR CI S+G+ +
Sbjct: 79 LHEMEADDDFPENFDSRTQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRICIHSEGKVH 138
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S+E + SCC C + C+ G W++ ++G V+GG +G GCQP I+PC
Sbjct: 139 FRVSSEDLVSCCHTCGF----GCNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAPC 194
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + + PSCE + KC +C +Y + +DK +Y + ++E I+KEI+
Sbjct: 195 EHHVNG-SRPSCEGEGGKTPKCVKKC-QASYNVPYAKDKMYGKSSYSIANHEKQIQKEIM 252
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F +Y+D +YK GVY H L H+ +++GWG E+GT YWL+ N+W
Sbjct: 253 TNGPVEGAFTVYEDLLNYKEGVYHHVHGKMLGG--HAIRILGWGVEDGTKYWLIANSWNS 310
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ E IAAG PK
Sbjct: 311 DWGDNGFFKILRGEDHLGIESSIAAGLPK 339
>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
Length = 331
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 137/350 (39%), Positives = 180/350 (51%), Gaps = 34/350 (9%)
Query: 4 ILVFLLGCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
IL LL C + SD +ID IN TW AGRNF N ++YL+ +A
Sbjct: 5 ILFSLLICGTFSASIPTDPLSDEFIDYINTLQTTWRAGRNFAPNTPKKYLKS--LAGVHK 62
Query: 62 FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
+ LP + + D T+PD FDAR+QWPNC +I + D G+C + F AV A
Sbjct: 63 NANNAFTLPKRKVSLD----VTIPDEFDARKQWPNCPSITDIRDQGSCGSCWAFGAVEAM 118
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
SDR CI S G+ LS E + SCC C Y C G W++ G V+GG+YG
Sbjct: 119 SDRICIHSNGKLQVHLSAENLVSCCDSCGY----GCDGGFPASAWDYWQNEGIVSGGNYG 174
Query: 182 DRTGCQPSTISPCSHH--GSAPT------LPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
+ GCQP +I+PC HH GS P P C NQ + G + QD
Sbjct: 175 SKQGCQPYSIAPCEHHVPGSRPACSGGGDTPDCRNQ-----------CDEGSGISYDQDH 223
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
+ Y +D+ + I+ EIL +GP A F +Y+D +YK GVY+H + L H+ K
Sbjct: 224 YYGETVYTLDEAKQ-IQAEILKNGPVEAAFTVYEDLLNYKEGVYQHVAGEALGG--HAIK 280
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
++GWG EN TPYWLV N+W WG+ G KILRG EC E I AG P+
Sbjct: 281 ILGWGVENDTPYWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAGLPR 330
>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
Length = 342
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 130/352 (36%), Positives = 187/352 (53%), Gaps = 21/352 (5%)
Query: 1 MIHILVFLLGCTLVRG-----ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL 55
M +LV + C L G ++ SD +I+ + + TW AGRNF +SEEY+R +
Sbjct: 1 MKLLLVATVACLLAMGSCEENKIPLLSDEFIELVKTKTRTWQAGRNFDEGVSEEYIRGLM 60
Query: 56 IA--DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
DA F D+ + Y + +P FDARE+WPNC TI + D G+C +
Sbjct: 61 GVHPDAYKFALPDKQ---EVLGYLSQKVDDIPKEFDAREKWPNCPTINEIRDQGSCGSCW 117
Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
F AV A SDR CI S G N S + + SCC C + C+ G W++ ++G
Sbjct: 118 AFGAVEAMSDRVCIHSNGNVNFRFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKG 173
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
V+GG YG +TGC+P I+PC HH + P + K P KC +C Y + +DK
Sbjct: 174 IVSGGRYGSKTGCRPYEIAPCEHHVNGTRAPCNHDSKTP--KCQHQC-EAGYNVEYSKDK 230
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
H + +Y V N I++EI+ +GP F +Y+D YKSGVY+H +L H+ +
Sbjct: 231 HFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGG--HAIR 288
Query: 294 LIGWGT--ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
++GWG + PYWL+ N+W WGD+G +ILRG+ C E I+AG PK
Sbjct: 289 ILGWGVWGKEEVPYWLIANSWNDDWGDKGFFRILRGEDHCGIESSISAGLPK 340
>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
Length = 334
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 129/326 (39%), Positives = 173/326 (53%), Gaps = 16/326 (4%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
++ S+ I+ +N TW AGRNF ++ +Y+R L + D LP R
Sbjct: 24 IHPLSEKMIEYVNFMNTTWKAGRNFHEGVTMKYIRGLL---GVHKDNHKYRLPSIRHAV- 79
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+P+ FD+REQWPNC TI + D G+C + F A A SDR CI S G+ N +
Sbjct: 80 ---PGDLPESFDSREQWPNCPTISEIRDQGSCGSCWAFGAAEAMSDRHCIHSNGKVNVEI 136
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S E + +CC C C+ G W + +G VTGG Y GCQP TI+ C HH
Sbjct: 137 SAEDLLTCCDSC----GMGCNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIASCEHH 192
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+ LP C V +C C Y + DK+ +Y +D+ ED IK EI +G
Sbjct: 193 -TKGKLPPC-GDIVDTPQCVHMCEK-GYNVSYRADKYFGKKSYSIDEQEDQIKTEISTNG 249
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P A F +Y DF YKSGVY+H + ++ H+ +++GWGTE+GTPYWLV N+W WG
Sbjct: 250 PVEAAFTVYADFVTYKSGVYRHVTGEEMGG--HAVRILGWGTESGTPYWLVANSWNTDWG 307
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
D+G KILRG EC E I AG PK
Sbjct: 308 DKGYFKILRGSDECGIESSIVAGLPK 333
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
Length = 334
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 128/345 (37%), Positives = 179/345 (51%), Gaps = 18/345 (5%)
Query: 2 IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DA 59
+ IL + + ++ S +I QIN + +TW AG NF N+ Y+R+ + ++
Sbjct: 4 LPILTIICTAASLSVAVHPLSKEFIQQINEKQSTWKAGPNFAENVPMSYIRRLMGVPPNS 63
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
KY S + R D + +PD FDAR+QWPNC TI + D G+C + F AV
Sbjct: 64 KYHMPSVK-----RHLLD---AMEIPDDFDARKQWPNCPTIREIRDQGSCGSCWAFGAVE 115
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR CI SKG N LS + + SCC C C+ G W++ +G V+GG
Sbjct: 116 AMSDRVCIHSKGAVNVRLSADDLVSCCYSC----GMGCNGGFPGAAWHYWVNKGIVSGGS 171
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+G GC+P I+PC HH + T P C C +C Y + +DK+
Sbjct: 172 FGSNQGCRPYEIAPCEHHVNG-TRPPCTGDDNKTPSCKQQCEK-GYNVPYKKDKNFGKEA 229
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + I+KEI+ +GP F +Y+D YK GVY+H L H+ +++GWGT
Sbjct: 230 YSISSEVQQIQKEIMTNGPVEGAFEVYEDLLSYKKGVYQHVKGEALGG--HAIRILGWGT 287
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
E GTPYWL+ N+W WGD GT KILRG+ C E I AG PK+
Sbjct: 288 EKGTPYWLIANSWNSDWGDNGTFKILRGEDHCGIESSIVAGIPKD 332
>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
Length = 344
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 128/321 (39%), Positives = 170/321 (52%), Gaps = 15/321 (4%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP-GDRKTYDPEYSA 82
A ID +N WTAG E+ +++ + DAKY +P D E S
Sbjct: 31 ALIDYVNSAQKLWTAGHQVVPK--EKIMKKLM--DAKYV------VPHKDEDIVATEVSD 80
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+PD FDAREQWP+C +I ++ D C + FAA A SDR CI S G N LS+E +
Sbjct: 81 AIPDHFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDL 140
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC + C G + W + K G VTGG Y + GC+P +I+PC + T
Sbjct: 141 LSCCT-GIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVT 199
Query: 203 LPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
P C P KC CT N TY + QDKH Y V + I+ EIL +GP
Sbjct: 200 WPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEV 259
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
F +Y+DFY Y +GVY HT+ A L H+ K++GWG +NGTPYWLV N+W +WG++G
Sbjct: 260 AFTVYEDFYQYTTGVYVHTAGASLGG--HAVKILGWGVDNGTPYWLVANSWNINWGEKGY 317
Query: 322 VKILRGKYECAFEYLIAAGKP 342
+I+RG EC E+ AG P
Sbjct: 318 FRIIRGLNECGIEHSAVAGIP 338
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
Length = 340
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 194/344 (56%), Gaps = 16/344 (4%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKY 61
+L+ ++ ++ + + SD +I+ + +ANTWT GRNF ++SE+Y+R + DA
Sbjct: 8 LLLMVVYLSMFEAKDHLLSDEFIELVRGKANTWTVGRNFHESVSEKYIRGLMGVHPDADK 67
Query: 62 FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
F D+ + D + + +P FDARE+W NC TIG + D G+C + F AV A
Sbjct: 68 FALPDKMEVLGKLVEDSD--SDIPTEFDAREKWSNCPTIGEIRDQGSCGSCWAFGAVEAM 125
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
SDR CI S+G+ N LS + + SCC C + C+ G W++ ++G V+GG++G
Sbjct: 126 SDRVCIHSQGKVNFHLSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGNFG 181
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
+ GC+P I PC HH + T P C + P +C C + +Y + +DK+ + +Y
Sbjct: 182 SQQGCRPYEIEPCEHHVNG-TRPPCSSGSTP--RCQHVCES-SYKVDYKKDKNFGSKSYS 237
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-- 299
+ +N I+KEI+ +GP F +Y+D YKSGVY+H +L H+ +++GWG
Sbjct: 238 IKNNVLDIQKEIMNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGG--HAIRILGWGVWG 295
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+ PYWL+ N+W WGD G +I+RGK C E I+AG PK
Sbjct: 296 DEKIPYWLIANSWNTDWGDNGFFRIVRGKDHCGIESSISAGLPK 339
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
Length = 338
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 125/328 (38%), Positives = 182/328 (55%), Gaps = 18/328 (5%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
SD +I+ + +A+TW GRNF ++SEEY+R + + D LP R Y
Sbjct: 23 LSDEFIELVRSKASTWQVGRNFKESVSEEYIRGLM---GVHPDAHKFALPEKRIVLGDLY 79
Query: 81 S---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+ +P+ FDAR+ WPNC TIG + D G+C + F AV A SDR CI S+G+ N L
Sbjct: 80 ADDGVDIPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHL 139
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S + + SCC IC + C+ G W++ ++G V+GG YG GC+P I+PC HH
Sbjct: 140 SADDLVSCCHICGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEHH 195
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+ T P C + P C +C +Y + +DK+ + +Y V N I++EI+ +G
Sbjct: 196 VNG-TRPPCSHGSTP--SCQHKC-QASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNG 251
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYWLVINTWGPH 315
P F +Y+D YKSGVY+H +L H+ +++GWG E+ PYWL+ N+W
Sbjct: 252 PVEGAFTVYEDLILYKSGVYQHEHGKELGG--HAIRILGWGVWGESKVPYWLIGNSWNTD 309
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G +ILRG+ C E I+AG PK
Sbjct: 310 WGDNGFFRILRGQDHCGIESSISAGLPK 337
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 135/346 (39%), Positives = 179/346 (51%), Gaps = 20/346 (5%)
Query: 4 ILVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++V LL E++ SD I+ IN+ TW AGRNF ++S Y+R +
Sbjct: 6 LVVGLLAAVCFGREIHPKKWHPLSDQMINFINKINTTWKAGRNFDKSISMSYIRGLMGVH 65
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
K + D E +P+ FDARE+W +C +I + D C + F A
Sbjct: 66 PKSKEYRLAEFVHD------EIPDDLPESFDAREKWSHCASIHLIRDQSTCGSCWAFGAA 119
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI SKG+ +S E + CC C C+ G W + + G VTGG
Sbjct: 120 EAMSDRVCIHSKGKIQVDISAEDLLDCCDSC----GAGCNGGYPAAAWEYWKESGLVTGG 175
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
YG GC+P +++PC HH + +LP+C VP KC C YG+ + DKH
Sbjct: 176 LYGTSDGCKPYSLAPCEHH-TKGSLPNCTGT-VPTPKCVHLCRK-GYGKDYQDDKHFGRK 232
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
Y + +E I+ EI +GP A F +Y DF YKSGVY+H S L H+ +++GWG
Sbjct: 233 VYSISSDEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHQSGDVLGG--HAIRILGWG 290
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
TENGTPYWLV N+W WGD G KILRGK EC E I AG PKN
Sbjct: 291 TENGTPYWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGIPKN 336
>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
Length = 332
Score = 230 bits (586), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 133/345 (38%), Positives = 177/345 (51%), Gaps = 21/345 (6%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+I L +L L R L S+ ++ IN+ +TW AG NF N+ YLR+
Sbjct: 5 VIPFLAAILSVGLARPPLKTLSNEMVNHINKVNSTWKAGLNF-QNVDYSYLRRL------ 57
Query: 61 YFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
L G + +++A V P FDAR QWP C T+ V D G+C + F A
Sbjct: 58 ----CGTMLKGPKLPVKLQFTADVQLPVDFDARVQWPQCPTLKEVRDQGSCGSCWAFGAA 113
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI S G N +S E + SCC C C+ G W F G V+GG
Sbjct: 114 EAISDRLCIHSNGLMNVEISAEDLLSCCDSC----GMGCNGGYPSAAWEFWTTDGLVSGG 169
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y GC+P +I+PC HH + + P C + +C +C Y G+ QDKH L
Sbjct: 170 LYDSHIGCRPYSIAPCEHHVNG-SRPPCTGEGGDTPQCTKKC-EAGYTPGYTQDKHYGKL 227
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y VDD+E I+ EI +GP F +Y+DF YK+GVY+H + + + H+ K++GWG
Sbjct: 228 SYSVDDSEKEIQLEIYKNGPVEGAFTVYEDFLLYKTGVYQHVTGSAVGG--HAIKVLGWG 285
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
ENGTPYWL N+W WGD G KILRG C E I AG PK
Sbjct: 286 EENGTPYWLCANSWNTDWGDNGFFKILRGSDHCGIESEIVAGIPK 330
>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 328
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 127/333 (38%), Positives = 177/333 (53%), Gaps = 26/333 (7%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
EL+ SD +I+ IN +TWTAGRNF + S +Y+ + + LP D K Y
Sbjct: 16 ELHPLSDEFINSINAAKSTWTAGRNFAQDKSMDYIIKLMGV-----------LP-DHKNY 63
Query: 77 DPEY------SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
P + +P FDAR+QWP+C TI + D G+C + F AV A SDR CI S
Sbjct: 64 MPPVLTHKLEALEIPADFDARQQWPHCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSN 123
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
G+ N S++ + SCC C C+ G W++ ++G V+GG YG + GC+P
Sbjct: 124 GESNFHFSSDDLVSCCWTC----GMGCNGGYPGAAWHYWVRKGLVSGGQYGTKQGCRPYE 179
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
I PC HH + + P+C+ + KC C + Y + D H + Y + + I+
Sbjct: 180 IPPCEHHTNG-SRPACDASEGNTPKCAKSCES-NYKINYSNDLHFGSKAYSISSDVKQIQ 237
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
EIL +GP F++Y DF +YK+GVY+H L H+ ++ GWG EN TPYWL+ N
Sbjct: 238 AEILQNGPVEGAFSVYADFVNYKTGVYQHIKGQFLGG--HAIRIFGWGVENNTPYWLIAN 295
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+W WGD GT KILRG C E I AG PK
Sbjct: 296 SWNTDWGDSGTFKILRGSDHCGIESGIVAGLPK 328
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 133/346 (38%), Positives = 180/346 (52%), Gaps = 20/346 (5%)
Query: 4 ILVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++V LL E++ SD I+ IN+ TW AGRNF ++S Y+R +
Sbjct: 6 LVVGLLAAVCFGREIHPKKWHPLSDQMINFINKINTTWKAGRNFDKSISMSYIRGLMGVH 65
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
K + D E +P+ FDARE+WP+C +I + D C + F A
Sbjct: 66 PKSKEYRLAEFVHD------EIPDDLPESFDAREKWPHCNSIHLIRDQSTCGSCWAFGAA 119
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI SKG+ +S E + CC C C+ G+ W + + G VTGG
Sbjct: 120 EAMSDRVCIHSKGKIQVNISAEDLLDCCDSC----GAGCNGGTPAAAWEYWKESGLVTGG 175
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
YG GC+P +++PC HH + +LP+C VP KC C YG+ + DKH
Sbjct: 176 LYGTNDGCKPYSLAPCEHH-TKGSLPNCTGT-VPTPKCVHLCRK-GYGKDYQDDKHFGKK 232
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
Y + +E I+ EI +GP A F + DF YKSGVY+H S+ + H+ +++GWG
Sbjct: 233 VYSISSDEKQIQTEIFKNGPVEADFIVLADFLSYKSGVYQHHSDDVIGG--HAIRILGWG 290
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
TENGTPYWL N+W WGD G KILRGK EC E I AG PKN
Sbjct: 291 TENGTPYWLAANSWNEDWGDHGYFKILRGKDECGIEEDINAGIPKN 336
>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
Length = 334
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 126/323 (39%), Positives = 175/323 (54%), Gaps = 14/323 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
S +IDQIN +A TW AGRNF + Y+R + +D+ +P +D +
Sbjct: 25 LSGKFIDQINAKATTWRAGRNFHPDTPMSYIRGLM----GVHKDADKFMP-PVMLHDLDE 79
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FDAREQWPNC TI + D G+C + F AV A SDR CI SKG+ + +S E
Sbjct: 80 GDDLPENFDAREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRICIHSKGKVHFRVSAE 139
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC C + C+ G W++ ++G V+GG YG GCQP ISPC HH +
Sbjct: 140 DLVSCCHTCGF----GCNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNG 195
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P K P KC +C +Y + +DK +Y + +E I+KE+ +GP
Sbjct: 196 TRGPCNGEGKTP--KCVKKC-QASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGPVE 252
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y+D +YK GVY+HT+ L H+ +++GWG EN T +WL+ N+W WGD G
Sbjct: 253 GAFTVYEDLLNYKEGVYQHTAGKMLGG--HAIRILGWGVENDTKFWLIANSWNSDWGDNG 310
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
KILRG E IAAG PK
Sbjct: 311 YFKILRGSDHLGIESSIAAGLPK 333
>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 132/346 (38%), Positives = 180/346 (52%), Gaps = 17/346 (4%)
Query: 1 MIHILVFLLG---CTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
++ +L+F G VR +L SD +ID IN W+AGRNF + Y++ +
Sbjct: 9 LVGLLIFSFGRVDGATVRVDLNPLSDEFIDHINSIQYYWSAGRNFHKDTPISYIKGLMGV 68
Query: 58 DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
K ++ P TY+ + S +P+ FDARE+WPNC TI V D G+C + F A
Sbjct: 69 HEK---NAEYPKLEQLLTYN-DASTDLPETFDARERWPNCPTIREVRDQGSCGSCWAFGA 124
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
V A SDR CI S G +N S E + SCC C + C+ G WN+ +G V+G
Sbjct: 125 VEAMSDRVCIHSNGTKNFHFSAENLVSCCWTCGF----GCNGGFPGAAWNYWKTKGIVSG 180
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
G YG GC P I+PC HH + P E K P C +C Y + QD H
Sbjct: 181 GPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTP--TCVKKCEE-GYKVPYAQDLHHGK 237
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
Y + ++ D I++EI +GP F +Y+DF Y++GVYKH + L H+ +++GW
Sbjct: 238 SAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGG--HAIRILGW 295
Query: 298 GTENG-TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G +NG PYWLV N+W WG G KILRG EC E I AG P
Sbjct: 296 GVQNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 341
>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
Length = 366
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 132/323 (40%), Positives = 168/323 (52%), Gaps = 12/323 (3%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
SD I IN+ +W AG+NF E+ L I Y D P D E
Sbjct: 54 LSDEMIWFINKVNTSWKAGQNFHHIKQEDRLDHVKIMCGTYLD---VPPHLQLPVRDIEP 110
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+PD FDAR QW NC TI + D G+C + F AV + SDR CIKS GQQN +S E
Sbjct: 111 RKDLPDTFDARTQWSNCPTIKEIRDQGSCGSCWAFGAVESMSDRICIKSNGQQNAHISAE 170
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC+ C C+ G + W + + G VTGG Y GCQP T+ C HH
Sbjct: 171 DLTSCCRSC----GNGCNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPYTVKACDHHVVG 226
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P C ++ C C + Y + +DKH Y V + I EI+ +GP
Sbjct: 227 KLQP-CSKKEEHTPVCKHECES-GYNVSYTKDKHYGATAYSVRGVQQ-IMTEIMTNGPVE 283
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y DF YKSGVYKHT+ + L H+ K++GWGTE G YWLV N+W P WG++G
Sbjct: 284 GAFTVYADFPQYKSGVYKHTTGSPLGG--HAIKIMGWGTEGGDDYWLVANSWNPDWGNQG 341
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
T KILRG+ EC E IAAG+PK
Sbjct: 342 TFKILRGRDECGIESQIAAGEPK 364
>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
Length = 340
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 130/352 (36%), Positives = 174/352 (49%), Gaps = 22/352 (6%)
Query: 1 MIHILVFLLGCTLVRGELYKF---------SDAYIDQINREANTWTAGRNFPANLSEEYL 51
M +L + LV + K SD +I+ IN +TW AGRNF N L
Sbjct: 1 MKIVLSIIFAVVLVTSQAKKLKSNKYFNPLSDEFINHINSMKSTWKAGRNFGKNFPMGAL 60
Query: 52 RQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAA 111
Q + S+ +P + + +P+ FDAREQWP+C TI + D G+C +
Sbjct: 61 TQMM----GVHPDSNLYMPPLKNVSQMYSNQAIPEAFDAREQWPDCPTIQEIRDQGSCGS 116
Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHK 171
F AV A SDR CI SKG+ N LS E + SCC C + C+ G W+ K
Sbjct: 117 CWAFGAVEAMSDRICIHSKGEVNAHLSAENLVSCCYTCGF----GCNGGFPGAAWSHWVK 172
Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ 231
+G VTGG++ GCQP I C HH + P E P KC C + Y + Q
Sbjct: 173 KGIVTGGNFNSSQGCQPYIIPACEHHTTGDRPPCSEGGGTP--KCLKTCED-GYTVDYTQ 229
Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
D H +Y V + I+ EI+ +GP +Y+DF YKSGVY+H L H+
Sbjct: 230 DLHYGASSYSVHKRMEDIQLEIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGG--HA 287
Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+++GWG E G PYWL+ N+W WGD G +K+LRGK C E I AG PK
Sbjct: 288 IRILGWGVEEGVPYWLIANSWNTDWGDNGYIKLLRGKDHCGIESQITAGLPK 339
>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
Length = 341
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 124/339 (36%), Positives = 171/339 (50%), Gaps = 14/339 (4%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
+ L V L +D +I+ IN + N+W AGRNFP N ++++ D
Sbjct: 12 VCTLALASASVEDLLNPLTDEFINLINTKQNSWKAGRNFPVNTPLTHIKKLT---GVLVD 68
Query: 64 QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
LP + +D + A +P+ FD R++WPNC T+ V D G+C + F AV A +D
Sbjct: 69 THLSKLP--KVEHDADLIADLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTD 126
Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
R C S G ++ S E + SCC +C C+ G W + G V+GG Y
Sbjct: 127 RYCTYSNGTKHFHFSAEDLLSCCPVCGL----GCNGGMPTLAWEYWKHFGLVSGGSYNSS 182
Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
GC+P I PC HH +P + K PK CH C + +Y + +DK Y V
Sbjct: 183 QGCRPYEIPPCEHHVPGNRMPCNGDSKTPK--CHKTCES-SYNVDYHKDKRYGKHVYSVS 239
Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
ED IK E+ +GP F +Y D +YK+GVYKHT L H+ K++GWG ENG
Sbjct: 240 SKEDHIKAELYKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGG--HAIKILGWGVENGN 297
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
YWL+ N+W WGD G KILRG+ C E I AG+P
Sbjct: 298 KYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 336
>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
Length = 340
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 126/337 (37%), Positives = 181/337 (53%), Gaps = 17/337 (5%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
L GE SD +I+ + +A TWT GRNF A+++E ++R+ + + D LP
Sbjct: 15 ALTSGEPSLLSDEFIEVVRSKAKTWTVGRNFDASVTEGHIRRLM---GVHPDAHKFALPD 71
Query: 72 DRKTYDPEYSATV---PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
R+ Y +V P+ FD+R+QWPNC TIG + D G+C + F AV A SDR CI
Sbjct: 72 KREVLGDLYVNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 131
Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
S G+ N S + + SCC C + C+ G W++ ++G V+GG YG GC+P
Sbjct: 132 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRP 187
Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
ISPC HH + P + P KC C + Y + +DKH + +Y V N
Sbjct: 188 YEISPCEHHVNGTRPPCAHGGRTP--KCSHVCQS-GYTVDYAKDKHFGSKSYSVRRNVRE 244
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYW 306
I++EI+ +GP F +Y+D YK GVY+H +L H+ +++GWG E PYW
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGG--HAIRILGWGVWGEEKIPYW 302
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
L+ N+W WGD G +ILRG+ C E I+AG PK
Sbjct: 303 LIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPK 339
>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
Length = 334
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 124/340 (36%), Positives = 182/340 (53%), Gaps = 17/340 (5%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
+ + L L SD I IN A TW A R FPAN SEEY L+ Y +
Sbjct: 8 VCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSRGYKN 66
Query: 64 QSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
++ + K YDP Y P +FD+R W +C IGH+ D G C + F+ GAF+
Sbjct: 67 YTNE---FEIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFA 123
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR C+ + G+ N+ LS E + CCK C + C G+ + W + +G TGGDY
Sbjct: 124 DRLCVSTGGKFNQLLSPEELTFCCKDC----GQGCGGGNPMKAWEYFRTQGVTTGGDYNT 179
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
+ GC P + PC + C+ Q + + + +C YG+ Q++++T Y++
Sbjct: 180 KEGCMPYKVPPCRNKQGENI---CDEQPMER---NHQCPKTCYGKTTVQNRYKTKSEYYI 233
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
+ + I+++I +GP A+F YDD YKSG+Y+ + NAK + HS K+IGWG E+G
Sbjct: 234 NSIK-TIEQDIKTYGPVEASFDCYDDLSVYKSGIYRKSPNAKYKG-GHSIKIIGWGQEDG 291
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
TPYWL +N+W WGD GT KI++G+ EC E + AG P
Sbjct: 292 TPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIP 331
>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
Length = 341
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 137/351 (39%), Positives = 175/351 (49%), Gaps = 24/351 (6%)
Query: 5 LVFLLGCTLVRGELYKF------------SDAYIDQINREANTWTAGRNFPANLSEEYLR 52
+ L+ C LV G + SD I IN+ TW AG+NF ++ L
Sbjct: 1 MKVLVLCALVAGAMSALVEFRDKDIFEPLSDEMIWFINKLNTTWKAGQNFHHIAKDDRLA 60
Query: 53 QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
+ Y + ++K E +P FD+R QWPNC T+ V D GAC +
Sbjct: 61 HVKMMCGTYLNTPPELRLPEKKM---EPLKDLPASFDSRTQWPNCPTLKEVRDQGACGSC 117
Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
F AV A SDR CIKS+G++N +S E + SCC+ C C G W++ +
Sbjct: 118 WAFGAVEAMSDRICIKSQGKENVHISAEDLTSCCRTC----GNGCEGGFPSAAWSYYKRD 173
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
G VTGG Y GCQP TI C HH P C P KC C Y + +D
Sbjct: 174 GLVTGGQYNSHQGCQPYTIKACDHHVVGKLQP-CSKDIGPTPKCKHTC-EAGYNVTYEKD 231
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
KH Y V E I EI+ +GP F +Y DF YKSGVYKHT+ L H+
Sbjct: 232 KHYGMSAYSVHGVEK-IMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGG--HAI 288
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
K++GWGTENG YWLV N+W P WGD+G KILRG+ EC E I+AG+PK
Sbjct: 289 KILGWGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEPK 339
>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
Length = 335
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 135/346 (39%), Positives = 179/346 (51%), Gaps = 23/346 (6%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L LL T R LY SD ++ +N++ TW AG NF N+ Y+++ A
Sbjct: 4 LLATLSCLLVLTSARSSLYFPPLSDELVNFVNKQNTTWKAGHNF-YNVDLSYVKKLCGAI 62
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
L G + ++A V P+ FDAREQWPNC TI + D G+C + F
Sbjct: 63 ----------LGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFG 112
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
AV A SDR CI S G+ N +S E + +CC + C+ G WNF K+G V+
Sbjct: 113 AVEAISDRICIHSNGRVNVEVSAEDMLTCCD---GECGDGCNGGFPSGAWNFWTKKGLVS 169
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
GG Y GC+P +I PC HH + P PK C C P Y + +DKH
Sbjct: 170 GGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKTC-EPGYSPSYKEDKHFG 226
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
+Y V +NE I EI +GP F++Y DF YKSGVY+H S + H+ +++G
Sbjct: 227 CSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGG--HAIRILG 284
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 285 WGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
Length = 341
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 137/351 (39%), Positives = 175/351 (49%), Gaps = 24/351 (6%)
Query: 5 LVFLLGCTLVRGELYKF------------SDAYIDQINREANTWTAGRNFPANLSEEYLR 52
+ L+ C LV G + SD I IN+ TW AG+NF ++ L
Sbjct: 1 MKVLVLCALVAGAMSALVEFRDKDIFEPLSDEMIWFINKMNTTWKAGQNFHHIAKDDRLA 60
Query: 53 QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
+ Y + ++K E +P FD+R QWPNC T+ V D GAC +
Sbjct: 61 HVKMMCGTYLNTPPELRLPEKKM---EPLKDLPATFDSRTQWPNCPTLKEVRDQGACGSC 117
Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
F AV A SDR CIKS+G++N +S E + SCC+ C C G W++ K
Sbjct: 118 WAFGAVEAMSDRICIKSQGKENTHISAEDLTSCCRTC----GNGCEGGFPSAAWSYYKKD 173
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
G VTGG Y GC P TI C HH P C P KC C Y + +D
Sbjct: 174 GLVTGGQYNSHQGCLPYTIKACDHHVVGKLQP-CSKSIGPTPKCKHTC-EAGYNVTYEKD 231
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
KH + Y V E I EI+ +GP F +Y DF YKSGVYKHT+ L H+
Sbjct: 232 KHYGSSAYSVHGVEK-IMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGG--HAI 288
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
K++GWGTENG YWLV N+W P WGD+G KILRG+ EC E I+AG+PK
Sbjct: 289 KILGWGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEPK 339
>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
Full=Cysteine protease-related 5; Flags: Precursor
gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
Length = 344
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 125/321 (38%), Positives = 169/321 (52%), Gaps = 15/321 (4%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP-GDRKTYDPEYSA 82
A ID +N WTAG + +E + + L+ D KY +P D E S
Sbjct: 31 ALIDYVNSAQKLWTAGHQV---IPKEKITKKLM-DVKYL------VPHKDEDIVATEVSD 80
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+PD FDAR+QWPNC +I ++ D C + FAA A SDR CI S G N LS+E +
Sbjct: 81 AIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDL 140
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC + C G + W + K G VTGG Y + GC+P +I+PC +
Sbjct: 141 LSCCT-GMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVK 199
Query: 203 LPSCENQKVPKLKCHTRCTNP-TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
P+C P KC CT+ Y + QDKH + Y V + I+ EIL +GP
Sbjct: 200 WPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEV 259
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
F +Y+DFY Y +GVY HT+ A L H+ K++GWG +NGTPYWLV N+W WG++G
Sbjct: 260 AFTVYEDFYQYTTGVYVHTAGASLGG--HAVKILGWGVDNGTPYWLVANSWNVAWGEKGY 317
Query: 322 VKILRGKYECAFEYLIAAGKP 342
+I+RG EC E+ AG P
Sbjct: 318 FRIIRGLNECGIEHSAVAGIP 338
>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
Length = 333
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 129/343 (37%), Positives = 182/343 (53%), Gaps = 15/343 (4%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ V ++ + R + S+ +IDQIN +A TW AGRNF + Y R +
Sbjct: 5 LLTATVIVVLWAMYRVSINPLSEKFIDQINAKATTWHAGRNFHPDTPLSYFRGLM----G 60
Query: 61 YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
+D+ +P +D + +P+ FD+REQWPNC TI + D G+C + F AV A
Sbjct: 61 VHKDADKFMP-PVMLHDLDEGDDLPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEA 119
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
SDR CI SKG+ +S E + +CC C + C G+ W ++G V+GG +
Sbjct: 120 MSDRVCIHSKGKVLFRVSAEDLLTCCTNCGH----GCDGGAPGAGWKHWIEKGLVSGGPF 175
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
G GC+P TI PC H + P C++ P KC +C P Y + +DK TY
Sbjct: 176 GSDQGCRPYTIEPCVHVENGAQSP-CKDSITP--KCIKKCL-PGYNVPYAKDKSFGKSTY 231
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
+ ++E I+KEI +GP ATF ++DDF YK G+Y+HTS H+ +++GWG E
Sbjct: 232 SIANDERQIRKEIFTNGPVEATFTVFDDFASYKHGIYQHTSGNLAGE--HAVRILGWGVE 289
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
NGT YWL N+W WGD G KILRG E I AG PK
Sbjct: 290 NGTKYWLAANSWNSDWGDNGYFKILRGSNHVDIESAIVAGLPK 332
>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
Length = 335
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 134/346 (38%), Positives = 178/346 (51%), Gaps = 23/346 (6%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L LL T R L+ SD ++ +N++ TW AG NF N+ Y+++ A
Sbjct: 4 LLATLSCLLVLTSARSSLHFPPLSDEMVNYVNKQNTTWKAGHNF-YNVDLSYVKKLCGA- 61
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
L G + ++A +PD FDAREQWPNC TI + D G+C + F
Sbjct: 62 ---------ILGGPKLPQRDAFAADMVLPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFG 112
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
AV A SDR CI SKG+ N +S E + +CC + C+ G WNF K+G V+
Sbjct: 113 AVEAISDRICIHSKGRVNVEVSAEDMLTCCG---SECGDGCNGGFPSGAWNFWTKKGLVS 169
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
GG Y GC+P +I PC HH + P P KC C P Y + DKH
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTP--KCSKIC-EPGYSPSYKDDKHFG 226
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
+Y V NE I EI +GP F++Y DF YKSGVY+H S + H+ +++G
Sbjct: 227 CSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGG--HAIRILG 284
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG EN TPYWLV N+W WGD+G KILRG+ C E I AG P
Sbjct: 285 WGVENDTPYWLVGNSWNTDWGDKGFFKILRGQDHCGIESEIVAGMP 330
>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
Length = 340
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 133/345 (38%), Positives = 177/345 (51%), Gaps = 24/345 (6%)
Query: 5 LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
L+ L C V R E SD ++ +N++ TW AG NF N+ Y+++
Sbjct: 4 LLATLSCLAVLTTARSRLEFQPLSDELVNYVNKQNTTWKAGHNF-YNVDLSYVKKLCGTK 62
Query: 59 AKYFDQSDR-PLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
R L GD +P+ FDAREQWP C TI + D G+C + F A
Sbjct: 63 LGGPKLPQRLSLAGD---------IALPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGA 113
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
V A SDR CI+S G QN +S E + +CC + + C+ G WNF K+G V+G
Sbjct: 114 VEAISDRICIRSNGLQNVEVSAEDLLTCCG---FQCGEGCNGGFPSGAWNFWKKQGLVSG 170
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
G Y GC+P +I PC HH + + P C + KC C P Y + +DKH
Sbjct: 171 GLYDSHVGCRPYSIPPCEHHVNG-SRPPCSGEGGDTPKCSKIC-EPGYSPSYKEDKHFGC 228
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
TY V +E I EI +GP A F++Y DF YKSGVY+H + + H+ +++GW
Sbjct: 229 DTYSVPSDEKEIMVEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMVGG--HAVRILGW 286
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 287 GVENGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIP 331
>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
Length = 340
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 126/337 (37%), Positives = 184/337 (54%), Gaps = 17/337 (5%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSD-RP 68
L GE SD +I+ + +A TWT GRNF A+++E ++R+ + DA F +D R
Sbjct: 15 ALTAGEPSLLSDEFIELVRSKAKTWTVGRNFDASVTEGHIRRLMGVHPDAHKFALADKRE 74
Query: 69 LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
+ GD + +P+ FD+R+QWPNC TIG + D G+C + F AV A SDR CI
Sbjct: 75 VLGDLYMNSVD---EIPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 131
Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
S G+ N S + + SCC C + C+ G W++ ++G V+GG YG GC+P
Sbjct: 132 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRP 187
Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
ISPC HH + P P KC C + +Y + +DKH + +Y V N
Sbjct: 188 YEISPCEHHVNGTRPPCAHGGATP--KCSHVCQS-SYTVDYAKDKHFGSKSYSVRRNVRD 244
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYW 306
I++EI+ +GP F +Y+D YK GVY+H +L H+ +++GWG + PYW
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGG--HAIRILGWGVWGDEKIPYW 302
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
L+ N+W WGD+G +ILRG+ C E I+AG PK
Sbjct: 303 LIGNSWNTDWGDQGFFRILRGQDHCGIESSISAGLPK 339
>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
Length = 334
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 126/340 (37%), Positives = 181/340 (53%), Gaps = 17/340 (5%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
+ + L L SD I IN A TW A R FPAN SEEY L+ Y +
Sbjct: 8 VCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSRGYKN 66
Query: 64 QSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
++ + K YDP Y P +FD+R W +C IGH+ D G C + F+ GAF+
Sbjct: 67 YTNEF---EIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFA 123
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR C+ + G+ N+ LS E +A CCK C + C G + W + +G TGGDY
Sbjct: 124 DRLCVSTGGKFNQLLSPEELAFCCKDC----GQGCGGGYPIKAWKYFRTQGVTTGGDYDT 179
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
+ GC P + PC + T C Q + + + +C YG+ Q++++T Y +
Sbjct: 180 KEGCMPYKVPPCYNKQGKNT---CGGQPMER---NHQCPKTCYGKTTVQNRYKTKSEYSI 233
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
+ + I++++ +GP A+F +YDDF YKSG+Y+ T AK E HS K+IGWG ENG
Sbjct: 234 NSIK-TIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEG-RHSIKIIGWGQENG 291
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
T YWL +N+W WG+ GT KI++G+ EC E + AG P
Sbjct: 292 TTYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIP 331
>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
Length = 335
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/346 (38%), Positives = 178/346 (51%), Gaps = 23/346 (6%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L LL T R L+ SD ++ +N++ TW AG NF N+ Y+++ A
Sbjct: 4 LLATLSCLLVLTSARSSLHFPPLSDEMVNYVNKQNTTWKAGHNF-YNVDLSYVKKLCGA- 61
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
L G + ++A +PD FDAREQWPNC TI + D G+C + F
Sbjct: 62 ---------ILGGPKLPQRDAFAADMVLPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFG 112
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
AV A SDR CI SKG+ N +S E + +CC + C+ G WNF K+G V+
Sbjct: 113 AVEAISDRICIHSKGRVNVEVSAEDMLTCCG---SECGDGCNGGFPSGAWNFWTKKGLVS 169
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
GG Y GC+P +I PC HH + P P KC C P Y + DKH
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTP--KCSKIC-EPGYSPSYKDDKHFG 226
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
+Y V NE I EI +GP F++Y DF YKSGVY+H S + H+ +++G
Sbjct: 227 CSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGG--HAIRILG 284
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG EN TPYWLV N+W WGD+G KILRG+ C E I AG P
Sbjct: 285 WGVENDTPYWLVGNSWNTDWGDKGFFKILRGQDHCGIESEIVAGMP 330
>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
Length = 345
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 126/320 (39%), Positives = 168/320 (52%), Gaps = 13/320 (4%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
A ID +N WTAG + +E + + L+ D KY D E
Sbjct: 32 ALIDYVNSAQKLWTAGHQV---IPKEKITKKLM-DVKYLVPHK-----DEDIVATEVFDA 82
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FDAR+QWP+C +I ++ D C + FAA A SDR CI S G N LS++ +
Sbjct: 83 IPDHFDARDQWPSCVSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSQDLL 142
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC N C G + W + K G VTGG Y + GC+P +I+PC + T
Sbjct: 143 SCCTGLLSCGN-GCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTW 201
Query: 204 PSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P C + P KC CT N TY + QDKH Y V + I+ EIL +GP
Sbjct: 202 PKCPDDTEPTPKCVEACTSNNTYPTPYLQDKHFGATAYAVGKKVEQIQTEILKNGPVEVA 261
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+DFY Y +GVY HTS A L H+ K++GWG +NGTPYWLV N+W +WG++G
Sbjct: 262 FTVYEDFYQYTTGVYVHTSGASLGG--HAVKILGWGVDNGTPYWLVANSWNVNWGEKGYF 319
Query: 323 KILRGKYECAFEYLIAAGKP 342
+I+RG EC E+ AG P
Sbjct: 320 RIIRGLNECGIEHSAVAGIP 339
>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
Length = 331
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 124/323 (38%), Positives = 167/323 (51%), Gaps = 16/323 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
S+++I +N EA W AG NF S Y+R + + D PLP T
Sbjct: 25 LSESFIASVNEEAQIWKAGPNFHPETSSNYIRSLMGVLPNHRDYLPPPLPNLLGT----- 79
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
++PD FDARE WPNC +I + D G+C + F A A SDR CI + +N +S E
Sbjct: 80 -ESIPDTFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRVCIHT--HKNVNISAE 136
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC C + C+ G W F +G V+GG YG GCQP I PC HH +
Sbjct: 137 NLLSCCYTCGF----GCNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNG 192
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P E + PK CH C N Y + +D +Y + + I+ +I+ +GP
Sbjct: 193 TRKPCAEGGRTPK--CHKTCDNKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTNGPVE 250
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A F++Y DF YKSGVY+H + L H+ +++GWG E GTPYWLV N+W WGD G
Sbjct: 251 AAFSVYSDFMSYKSGVYRHVKGSLLGG--HAIRILGWGMEKGTPYWLVANSWNTDWGDNG 308
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
T KILRG C E + AG P+
Sbjct: 309 TFKILRGSDHCGIEDSVVAGLPR 331
>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
Length = 339
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 132/341 (38%), Positives = 172/341 (50%), Gaps = 18/341 (5%)
Query: 2 IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
+ LV L G R SD +D +N+ TW AG NF N+ YLR+ +
Sbjct: 8 LSCLVMLTG-AQSRLPFRALSDELVDYVNKRNTTWKAGHNF-HNVDPSYLRRLC---GTF 62
Query: 62 FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
P + + +P+ FDAREQWPNC TI + D G+C + F AV A
Sbjct: 63 LGGPKLP-----QRVQFAKNLILPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAI 117
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
SDR CI++ G N +S E + +CC D C+ G WNF K+G V+GG Y
Sbjct: 118 SDRICIRTNGHVNVEVSAEDMLTCCGDQCGD---GCNGGFPAEAWNFWTKQGLVSGGLYD 174
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
GC+P +I PC HH + P PK C C P Y + +DKH +Y
Sbjct: 175 SHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPSYKEDKHYGCSSYS 231
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
V DNE I EI +GP A F +Y DF YKSGVY+H + + H+ +++GWG E+
Sbjct: 232 VSDNEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGG--HAVRILGWGVED 289
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
GTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 290 GTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIP 330
>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
Length = 340
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 124/337 (36%), Positives = 180/337 (53%), Gaps = 17/337 (5%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
L GE SD +I+ + +A TW GRNF A+++E ++R+ + + D LP
Sbjct: 15 ALTSGEPSLLSDEFIEVVRSKAKTWKVGRNFDASVTEGHIRRLM---GVHPDAHKFALPD 71
Query: 72 DRKTYDPEYSATV---PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
R+ Y +V P+ FD+R+QWPNC TIG + D G+C + F AV A SDR CI
Sbjct: 72 KREVLGDLYMNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 131
Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
S G+ N S + + SCC C + C+ G W++ ++G V+GG YG GC+P
Sbjct: 132 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRP 187
Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
ISPC HH + P P KC C + +Y + +DKH + +Y V N
Sbjct: 188 YEISPCEHHVNGTRPPCAHGGGTP--KCSHVCQS-SYTVDYAKDKHFGSKSYSVKRNVRE 244
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYW 306
I++EI+ +GP F +Y+D YK GVY+H +L H+ +++GWG + PYW
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGG--HAIRILGWGVWGDEKIPYW 302
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
L+ N+W WGD G +ILRG+ C E I+AG PK
Sbjct: 303 LIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPK 339
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
Length = 337
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 133/346 (38%), Positives = 179/346 (51%), Gaps = 20/346 (5%)
Query: 4 ILVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++V LL E++ SD I+ IN+ TW AGRNF ++S Y+R + +
Sbjct: 6 LVVGLLAAVCFGREIHPKRWHPLSDQMINFINKINTTWKAGRNFDKSISMSYIRGLMGVN 65
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
K + LP + E +P+ FDARE+W +C +I + D C + F A
Sbjct: 66 PK---SKEYRLP---EFVHEEIPDDLPESFDAREKWSHCASINLIRDQSTCGSCWAFGAA 119
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI S+G +S E + CC C C G W + + G V+ G
Sbjct: 120 EAMSDRVCIHSEGGIQVNISAEDLLDCCDSC----GAGCDGGYPAAAWEYWKESGLVSDG 175
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
YG GC+P +++PC HH + +LP+C VP KC C YG+ + DKH
Sbjct: 176 LYGTPDGCKPYSLAPCEHH-TKGSLPNCTGT-VPTPKCVHLCRK-GYGKDYQHDKHFGKK 232
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
Y + NE I+ EI +GP A F +Y DF YKSGVY+H S L H+ +++GWG
Sbjct: 233 VYSISSNEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHHSGDVLGG--HAIRILGWG 290
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
TENGTPYWLV N+W WGD G KILRGK EC E I AG PK+
Sbjct: 291 TENGTPYWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGIPKD 336
>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
Length = 340
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 127/339 (37%), Positives = 181/339 (53%), Gaps = 21/339 (6%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPL 69
L GE SD +I+ + +A TWT GRNF ++++E Y+R+ + DA F +D+
Sbjct: 15 ALTSGEPSFLSDEFIELVRSKAKTWTVGRNFDSSVTEGYIRRLMGVHPDAHKFALADK-- 72
Query: 70 PGDRKTYDPEYSATV---PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRC 126
R+ Y TV P+ FD+R+QWPNC TIG + D G C + F AV A SDR C
Sbjct: 73 ---REVLGDLYMNTVDQIPEEFDSRKQWPNCPTIGEIRDQGECGSCWAFGAVEAMSDRVC 129
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGC 186
I S G+ N S + + SCC C + C+ G W++ ++G V+GG YG GC
Sbjct: 130 IHSGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGC 185
Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNE 246
+P I+PC HH + P P KC C + Y + +DKH + +Y V N
Sbjct: 186 RPYEIAPCEHHVNGTRPPCGHGGGTP--KCSHVCES-GYTVDYAKDKHFGSKSYSVKRNV 242
Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTP 304
I++EI+ +GP F +Y+D YK GVY+H +L H+ +++GWG E P
Sbjct: 243 RDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHQHGKELGG--HAIRILGWGVWGEEKIP 300
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
YWL+ N+W WGD G +ILRG+ C E I+AG PK
Sbjct: 301 YWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAGLPK 339
>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
Length = 331
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 131/325 (40%), Positives = 169/325 (52%), Gaps = 18/325 (5%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPLPGDRKTYDP 78
SD +I + E +TW AGRNF +LS Y R+ + D+KY +P P
Sbjct: 21 LSDKFIKLLQSEDSTWEAGRNFNKHLSIRYFRRLMGVHPDSKYH------MPKYEVHQIP 74
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
E + +P FD+R WP C TIG + D G+C + F AV SDR+CI SKG+ N S
Sbjct: 75 E-NFELPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYS 133
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
E + SCC +C + N G+ F+ W +H G V+GG + GCQP I+PC HH
Sbjct: 134 AENLVSCCHLCGFGCNGGFP-GAAFKYW--VHS-GIVSGGSFNSTQGCQPYEIAPCEHHV 189
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P E PK C C Y + D H Y + +ED IK EI+ +GP
Sbjct: 190 PGPRPKCSEGGGTPK--CAKTCEK-GYIVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGP 246
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +Y DF HYKSGVY+H L H+ +++GWG ENGTPYWL N+W WGD
Sbjct: 247 VEGAFTVYVDFLHYKSGVYQHRHGLPLGG--HAIRVLGWGEENGTPYWLCANSWNTDWGD 304
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G KILRG C E I+AG PK
Sbjct: 305 NGLFKILRGSDHCGIESEISAGLPK 329
>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
Length = 337
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 123/325 (37%), Positives = 172/325 (52%), Gaps = 16/325 (4%)
Query: 19 YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL-IADAKYFDQSDRPLPGDRKTYD 77
Y SD +I+ IN + N+W AGRNFP + S +L++ + + + ++F + P+ KT+
Sbjct: 23 YPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHF--ATLPI----KTHK 76
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+ A +P+ FD R++WP+C T+ V D G+C + F AV A +DR C S G ++
Sbjct: 77 IDLIAGLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHF 136
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S E + SCC IC CS G W + G V+GG Y GC+P I PC HH
Sbjct: 137 SAEDLLSCCPIC----GLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHH 192
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+P + K P KC +C + Y + QDK Y V +ED I+ E+ +G
Sbjct: 193 VPGNRMPCSGDTKTP--KCTKKCES-GYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNG 249
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P F +Y D YKSGVYKHT L H+ K++GWG EN YWL+ N+W WG
Sbjct: 250 PVEGAFTVYSDLLSYKSGVYKHTQGDALGG--HAVKILGWGVENDNKYWLIANSWNSDWG 307
Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
D G KILRG+ C E I G+P
Sbjct: 308 DNGFFKILRGEDHCGIESSIVTGEP 332
>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
Length = 340
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 123/337 (36%), Positives = 179/337 (53%), Gaps = 17/337 (5%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
L GE SD +I+ + +A TW GRNF A+++E ++R+ + + D LP
Sbjct: 15 ALTSGEPSLLSDEFIEVVRSKAKTWKVGRNFDASVTEGHIRRLM---GVHPDAHKFALPD 71
Query: 72 DRKTYDPEYSATV---PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
R+ Y ++ P+ FD+R+QWPNC TIG + D G+C + F AV A SDR CI
Sbjct: 72 KREVLGDLYMNSLDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 131
Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
S G+ N S + + SCC C + C+ G W++ ++G V+GG YG GC+P
Sbjct: 132 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRP 187
Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
ISPC HH + P P KC C + +Y + +DKH + +Y V N
Sbjct: 188 YEISPCEHHVNGTRPPCANGSGTP--KCSHVCQS-SYTVDYAKDKHFGSKSYSVKRNVRE 244
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYW 306
I++EI+ +GP F +Y+D YK GVY+H +L H+ +++GWG PYW
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGG--HAIRILGWGVWGNEKIPYW 302
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
L+ N+W WGD G +ILRG+ C E I+AG PK
Sbjct: 303 LIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPK 339
>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 333
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 129/346 (37%), Positives = 184/346 (53%), Gaps = 25/346 (7%)
Query: 4 ILVFLLGCTLVRGELYK----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL--IA 57
I++ L +G + S ID +N + +W AG NF A L Y++ +
Sbjct: 6 IVITLFAVFSAQGAYFPNHQPLSQDLIDYVNLVSTSWKAGTNF-AGLPVSYVKYLCGALE 64
Query: 58 DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
D +F LP + E ++ +P FD+R++W C +I + D G+C + F A
Sbjct: 65 DPNHFQ-----LP----IHVHEDTSDLPKSFDSRDKWRMCPSIREIRDQGSCGSCWSFGA 115
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
V + +DR CI S G+ +S E + +CC C C+ G + + W++ G VTG
Sbjct: 116 VESITDRICIHSNGKVKVHISAEDLMTCCTSC----GMGCNGGFLPQAWHYWVNNGIVTG 171
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
G Y GCQP I C HH P +C +++P KC +C P Y + F QDKH
Sbjct: 172 GQYHSHKGCQPYEIPKCEHHVKGP-FKAC-GKELPTPKCSQKC-QPGYNKTFNQDKHFGK 228
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
+Y + +N I+KEI+ +GP A F +Y DF YKSGVY+HT+ L H+ K++GW
Sbjct: 229 KSYSITNNIQQIQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGG--HAVKILGW 286
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
GTEN TPYWL+ N+W P WGD+G KI+RGK EC E I AG PK
Sbjct: 287 GTENNTPYWLIANSWNPTWGDKGYFKIIRGKDECGIESSIVAGMPK 332
>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
Length = 330
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 126/334 (37%), Positives = 173/334 (51%), Gaps = 21/334 (6%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
+L R L+ S ++ IN+ TW AG NF N+ Y+R+ L G
Sbjct: 16 SLARPHLHPLSSEMVNHINKLNTTWKAGHNF-HNVDYSYVRKLC----------GTMLKG 64
Query: 72 DRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
+ +Y+ V P FDAR+QWPNC T+ + D G+C + F A A SDR CI S
Sbjct: 65 PKLPVMVQYAGDVKLPKEFDARQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
G+ N +S+E + +CC C C+ G W+F G V+GG Y GC+P
Sbjct: 125 NGKVNVEISSEDLLTCCDSC----GMGCNGGYPSAAWDFWASEGLVSGGLYESHIGCRPY 180
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
TI+PC HH + + P C + +C +C + Y + QDKH +Y V +E I
Sbjct: 181 TIAPCEHHVNG-SRPPCTGEGGDTPECVRQCES-GYTPSYIQDKHYGKTSYSVPSDEQQI 238
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
+ EI +GP F +Y+DF YK+GVY+H S + + H+ K++GWG ENGTPYWL
Sbjct: 239 QTEIYKNGPVEGAFTVYEDFLLYKTGVYQHVSGSAVGG--HAIKVLGWGEENGTPYWLCA 296
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
N+W WGD G KILRG C E I AG PK
Sbjct: 297 NSWNTDWGDNGYFKILRGSDHCGIESEIVAGIPK 330
>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
Length = 334
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 122/322 (37%), Positives = 168/322 (52%), Gaps = 14/322 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
SD +I+ IN + +TW AGRNFP + +++++ + + DR ++ +
Sbjct: 24 LSDDFINLINSKQDTWKAGRNFPVDTPVKHIQKLMGTL-----KDDRFTTLVTLQHEVDL 78
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
A++P+ FD R++WPNC T+ V D G+C + F AV A +DR C S G ++ S E
Sbjct: 79 IASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 138
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC IC C+ G W + G V+GG Y GC+P I PC HH
Sbjct: 139 DLLSCCPICGL----GCNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPYEIPPCEHHVPG 194
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
LP + K PK C +C + Y + QDKH Y V ED IK E+ +GP
Sbjct: 195 NRLPCSGDTKTPK--CIKKCED-NYNVAYKQDKHYGKHIYSVRGGEDHIKAELYKNGPVE 251
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y D YKSGVYKH + L H+ K++GWG ENG YWL+ N+W WGD G
Sbjct: 252 GAFTVYADLLSYKSGVYKHVAGDALGG--HAIKIMGWGVENGNKYWLIANSWNSDWGDNG 309
Query: 321 TVKILRGKYECAFEYLIAAGKP 342
KILRG+ C E I AG+P
Sbjct: 310 FFKILRGEDHCGIESSIVAGEP 331
>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
Length = 332
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/325 (40%), Positives = 166/325 (51%), Gaps = 20/325 (6%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
+ ID +N TW AG NF + Y++ D ++ LP + +
Sbjct: 24 LTQEIIDYVNTIDTTWKAGWNF-QGATVSYVKGLC---GVIRDPNNHKLPLKLHELNAQ- 78
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+PD FD+R QW NC TI V D G+C + AAV A SDR C+ SKG +S E
Sbjct: 79 --DIPDTFDSRTQWANCPTIKEVRDQGSCGSCWALAAVEAMSDRICVASKGSTMAHISAE 136
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH--G 198
+ SCCK C C+ G W + + G VTGG YG GCQP I PC HH G
Sbjct: 137 DLNSCCKSC----GNGCNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPYEIKPCEHHING 192
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
S P E P +C C + Y F +DKH Y V I+ EI+ +GP
Sbjct: 193 SRPACGKLE----PTPRCKKSCES-GYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTNGP 247
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y DF HYKSGVY+H S A+L H+ K+IGWGTE TPYWL+ N+W WG+
Sbjct: 248 VEAAFTVYADFPHYKSGVYQHESGAELGG--HAVKMIGWGTEGSTPYWLIANSWNTDWGN 305
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G KILRG+ EC E I AG+PK
Sbjct: 306 MGFFKILRGQDECGIERDIVAGEPK 330
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 121/328 (36%), Positives = 171/328 (52%), Gaps = 16/328 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
SD +++ + ++A TWT GRNF + RQ + + D + LP R E
Sbjct: 23 LSDKFMEIVRQKAKTWTVGRNFHKLTPMSHYRQLM---GVHPDAHNYALPDKRMVLREEE 79
Query: 81 -----SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
+ +P FD+R+QWP+C TI + D G+C + F AV A SDR CI S G N
Sbjct: 80 LVGLGNNMIPKDFDSRKQWPHCPTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNF 139
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
S + + SCC C + C+ G W++ ++G V+GG YG GC+P I+PC
Sbjct: 140 HFSADDLVSCCHTCGF----GCNGGFPGAAWSYWVRKGIVSGGPYGSSQGCRPYEIAPCE 195
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HH + T P CE + +C +C +Y + DKH + Y + N I++EI+
Sbjct: 196 HHVNG-TRPPCEKEYGKTPRCQHKC-QASYKVDYKTDKHFGSRAYSISKNVHDIQEEIMT 253
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
HGP F +Y+D YK GVY+H +L H+ ++IGWG E PYWLV N+W
Sbjct: 254 HGPVEGAFTVYEDLILYKDGVYEHVHGKELGG--HAIRIIGWGVEKDIPYWLVANSWNTD 311
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WG+ G KILRGK C E I+AG PK
Sbjct: 312 WGNNGFFKILRGKDHCGIESSISAGLPK 339
>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
Length = 331
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 129/328 (39%), Positives = 170/328 (51%), Gaps = 18/328 (5%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL--IADAKYFDQSDRPLPGDRKT 75
++ SD +I + E TW AGRNF NL YL+ + AD+K+ + K
Sbjct: 18 IHPLSDKFIQLLQNEKTTWKAGRNFNKNLPMRYLKSLMGVHADSKFH------MSPVHKH 71
Query: 76 YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
PE +P FD+R W C TI + D G+C + F AV +DR CI S G +N
Sbjct: 72 KIPE-GFKIPKEFDSRTAWSMCPTISEIRDQGSCGSCWAFGAVEVMTDRDCIHSNGTKNF 130
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
S E + SCC +C + N G+ F+ W +H G V+GG + GCQP I+PC
Sbjct: 131 HYSAENLVSCCHLCGFGCNGGFP-GAAFQYW--VHS-GIVSGGAFNSTQGCQPYEIAPCE 186
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HH S P E PK CH C + Y + D H + Y VD +E IK +I+
Sbjct: 187 HHVSGPRPKCAEGGSTPK--CHKNCES-NYVVDYESDLHHGSKHYSVDKDETQIKYDIMT 243
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP F +Y DF HYKSGVY+HT L H+ +++GWG E+GTPYWL N+W
Sbjct: 244 NGPVEGAFTVYVDFLHYKSGVYQHTHGLPLGG--HAIRVLGWGEEDGTPYWLCANSWNTD 301
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG C E I+AG PK
Sbjct: 302 WGDNGYFKILRGSDHCGIESEISAGLPK 329
>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
Length = 335
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 166/323 (51%), Gaps = 15/323 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
SD I+ IN+ TW AGRNF N YL+ + + D + LP Y +
Sbjct: 27 LSDEMINFINKLNTTWKAGRNFDKNTPVSYLKGLM---GVHPDSKNYRLP---LFYHEDI 80
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FDARE+W +C +I + D C + F A A SDR CI SKG+ +S E
Sbjct: 81 PKDLPESFDAREKWSHCNSIHVIRDQSTCGSCWAFGATEAMSDRVCIHSKGKVQVNISAE 140
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ +CC C C+ G W F G VTGG YG GCQP PC HH
Sbjct: 141 DLLTCCDSC----GAGCNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPPCEHHTVG 196
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P LP+C K P +C C Y + + +DKH Y + +E IK EI +GP
Sbjct: 197 P-LPNCTGIK-PTPQCVRDCRK-GYEKSYSEDKHYAKKVYTLSADETQIKTEIFKNGPVE 253
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A F +Y DF YKSGVY+ S+ L H+ +++GWGTENG PYWLV N+W WGD+G
Sbjct: 254 ADFTVYADFVSYKSGVYQRHSDDALGG--HAIRILGWGTENGVPYWLVANSWNEDWGDKG 311
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
KILRG EC E I AG PK
Sbjct: 312 YFKILRGNDECGIEDDINAGIPK 334
>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
Length = 334
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 124/337 (36%), Positives = 172/337 (51%), Gaps = 14/337 (4%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ L + L SD +I+ IN + ++W AGRNFP++ +++++ + +
Sbjct: 9 LLLCAFAVTADTLDPLSDDFINLINSKQDSWKAGRNFPSDTPFKHIKKLMGTL-----RD 63
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
DR ++ E A++P+ FD R++WPNC T+ V D G+C + F AV A +DR
Sbjct: 64 DRFTTLVTMQHEVELIASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRI 123
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
C S G ++ S E + SCC IC C+ G W + G V+GG Y G
Sbjct: 124 CTYSNGTKHFHFSAEDLLSCCPIC----GLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQG 179
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
C+P I PC HH LP + K PK C C + Y + QDKH Y V
Sbjct: 180 CRPYEIPPCEHHVPGNRLPCSGDTKTPK--CVKECES-GYKVPYKQDKHYGKHVYSVRGG 236
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
ED IK E+ +GP F +Y D YKSGVYKH + L H+ K++GWG ENG Y
Sbjct: 237 EDHIKAELYKNGPVEGAFTVYADLLSYKSGVYKHVTGDALGG--HAIKIMGWGVENGNKY 294
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WL+ N+W WGD G KILRG+ C E I AG+P
Sbjct: 295 WLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331
>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 119/325 (36%), Positives = 170/325 (52%), Gaps = 10/325 (3%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPLPGDRKTYDP 78
SD +++ + ++A TWT GRNF + RQ + DA Y+ D+ + +
Sbjct: 23 LSDRFMEIVRQKAKTWTVGRNFHKLTPMSHYRQLMGVHPDAHYYALPDKRMVLREEELVG 82
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ +P FD+R QWP+C TI + D G+C + F AV A SDR CI S G N S
Sbjct: 83 LGNDMIPKEFDSRNQWPHCPTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFS 142
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ + SCC C + C+ G W + ++G V+GG YG GC+P I+PC HH
Sbjct: 143 ADDLVSCCHTCGF----GCNGGFPGAAWGYWVRKGIVSGGPYGSSQGCRPYEIAPCEHHV 198
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ T P CE + +C +C +Y + DKH + Y + N I+ EI+ +GP
Sbjct: 199 NG-TRPPCEKEYGKTPRCQHKC-QASYKVDYKTDKHFGSRAYSISKNVRDIQGEIMTNGP 256
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +Y+D YK GVY+H +L H+ ++IGWG E TPYWL+ N+W WG+
Sbjct: 257 VEGAFTVYEDLILYKDGVYEHVHGKELGG--HAIRIIGWGVEKDTPYWLIANSWNTDWGN 314
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G KILRGK C E I+AG PK
Sbjct: 315 NGFFKILRGKDHCGIESSISAGLPK 339
>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
pulchellus]
Length = 338
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 129/333 (38%), Positives = 174/333 (52%), Gaps = 17/333 (5%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL-IADAKYFDQSDRPLP 70
+V ++ SD ID IN+ TW AGRNF N+ Y++ + +A K R LP
Sbjct: 18 VMVPPSVHPLSDEMIDFINKLNTTWKAGRNFDKNVPFSYIKGLMGVARNK-----TRRLP 72
Query: 71 GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
+ P+ +P+ FDAR+ W C +I + D +C A F AV A SDR CI +K
Sbjct: 73 TLMHSSIPD---NLPESFDARQHWRKCNSIHVIRDQSSCGACWAFGAVEAISDRICIHTK 129
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
G +S + + +CC CR C G W F ++G VTGG YG GCQP +
Sbjct: 130 GSVQVNISAQDLLTCCDYCR----TGCKGGVPSYAWMFYKEKGIVTGGLYGTEDGCQPYS 185
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
I + + + LP N P C C +YG+ + +DKH Y + +E IK
Sbjct: 186 IH-TTRYTTTGLLPPPINDLSPMPPCKRECRK-SYGKKYSEDKHYGEKVYTLSGDEAQIK 243
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
EI +GP A FA+Y DFY YKSGVY+ S + + H+ +++GWGTENG PYWL N
Sbjct: 244 TEIFKNGPVEADFAVYADFYSYKSGVYQAHSRVRCGS--HAIRILGWGTENGVPYWLAAN 301
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+W HWGD+G KI RG EC E I AG PK
Sbjct: 302 SWTEHWGDKGYFKIRRGNNECGIEEDINAGIPK 334
>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
Length = 347
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 134/354 (37%), Positives = 179/354 (50%), Gaps = 26/354 (7%)
Query: 1 MIHILVFLLGCTLVRGELYKF-------SDAYIDQINREA--NTWTAGRNFP--ANLSEE 49
++ + V+ + + EL+KF S+ I+ +N TW AG NFP NL ++
Sbjct: 7 LLLLGVWTVSAIPPKDELFKFIRVFRPMSEEMINFLNMPGPGATWKAGNNFPFIRNLDDK 66
Query: 50 YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
L + K + P P K +P +P FDAR QWPNC T+ V D G C
Sbjct: 67 LLYAKRLCGTKL----NNPNPLPVKNIEPLRD--LPTNFDARTQWPNCPTVKEVRDQGDC 120
Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
+ F AV A SDR CI S G+ N +S E + +CC C + C G W +
Sbjct: 121 GSCWAFGAVEAMSDRICIASNGKVNAEISAEDLLACCSSC----GEGCQGGFPAEAWRYY 176
Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF 229
+ G VTGG Y GCQP I C HH P C ++ KC +C Y +
Sbjct: 177 EREGLVTGGLYNSSQGCQPYMIPACDHHVVGHLQP-CPKEEAKTPKCSKKC-EANYNVTY 234
Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
DKH +Y VD E I EI+ +GP A F +Y+DF YKSGVY+H + +L
Sbjct: 235 KDDKHYGKNSYSVDSVEK-IMTEIMTNGPVEAAFTVYEDFLSYKSGVYQHRTGQELGG-- 291
Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
H+ K++GWG +NGTPYW+V N+W P WG++G ILRGK EC E I AG PK
Sbjct: 292 HAVKILGWGEDNGTPYWIVANSWNPDWGNQGFFNILRGKDECGIESQIVAGLPK 345
>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
Length = 341
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 120/322 (37%), Positives = 166/322 (51%), Gaps = 14/322 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
+D +I+ IN + N+W AGRNFP N ++++ D LP + +D +
Sbjct: 29 LTDEFINLINSKQNSWKAGRNFPVNTPLTHIKKLT---GVLVDTHLSKLP--KAEHDMDL 83
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
A++P+ FD R++WPNC T+ V D G+C + F AV A +DR C S G ++ S E
Sbjct: 84 IASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAE 143
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC +C C+ G W + G V+GG Y GC+P I PC HH
Sbjct: 144 DLLSCCPVCGL----GCNGGMPTLAWEYWKHFGLVSGGSYNSGQGCRPYEIPPCEHHVPG 199
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
+P + K P KCH C +Y + +DK Y V ED IK E+ +GP
Sbjct: 200 NRVPCNGDSKTP--KCHKTC-EASYSVDYHKDKRYGKHVYSVSSKEDHIKAELFKNGPVE 256
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y D +YK+GVYKHT L H+ K++GWG ENG Y L+ N+W WGD G
Sbjct: 257 GAFTVYSDLLNYKNGVYKHTVGNALGG--HAIKILGWGVENGNKYRLIANSWNSDWGDNG 314
Query: 321 TVKILRGKYECAFEYLIAAGKP 342
KILRG+ C E I AG+P
Sbjct: 315 FFKILRGEDHCGIESSIVAGEP 336
>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
Length = 339
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 126/338 (37%), Positives = 174/338 (51%), Gaps = 17/338 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ LG R + SD ++ +N++ TW AG NF N+ YL++ +
Sbjct: 11 LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P+ FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
CI + + +S E + +CC I D C+ G WNFL ++G V+GG Y G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFLTRKGLVSGGLYDSHVG 178
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
C+P +I PC HH + P PK C C P Y + QDKH +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
Length = 335
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 187/348 (53%), Gaps = 25/348 (7%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+I I V LL + + S YI++IN A TW A +NFP N +E + + L+ +
Sbjct: 5 VILISVILLSVYFTE-QAHFLSKDYINKINEVAKTWKAKQNFPENTPKEQIVR-LLGSKR 62
Query: 61 YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
S P+ + + Y ++ VP+ FD+R +W C TIGHV + G C + GA
Sbjct: 63 LLGVSKSPIKENDELYMD--NSEVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGA 120
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
F+DR C+ + G+ N +S E + CC C + C+ G + W + + G VTGGDY
Sbjct: 121 FADRLCVATNGEFNELISAEELTFCCHRCVF----GCNGGYPLKAWQYFKRHGVVTGGDY 176
Query: 181 GDRTGCQPSTISPCSH----HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
GCQP + PC H S P+ N K K KC+ T + ++ ++T
Sbjct: 177 DTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSK-KCYGDDT-----IDYKKNHYKT 230
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKL 294
Y++ + ++K+ + +GP A+F +YDDF +Y+SGVY+ T NA +YL H+ K+
Sbjct: 231 KDAYYLKNT--TMQKDTMVYGPIEASFDVYDDFMNYESGVYQRTGNA---SYLGGHAVKM 285
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
IGWG E GTPYWL++N+WG WGD+G KILRG EC E AG P
Sbjct: 286 IGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGIESSCTAGVP 333
>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
Length = 332
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 126/344 (36%), Positives = 181/344 (52%), Gaps = 18/344 (5%)
Query: 1 MIHILVFLLGCT-LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
I I+ + C L + L SD I IN A TW A R FPAN S+EY+ L+
Sbjct: 4 FITIVCAIFVCVYLAKPTLQFLSDERIKYINEVAKTWKAERFFPANTSKEYIMG-LLGSR 62
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVP-DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
Y + S + KTYDP Y ++FD+RE W +C IG + D G C + F
Sbjct: 63 GYTNYSSEV---EIKTYDPLYEENASVEQFDSRENWKSCKQIGRIRDQGNCGSCWAFGTT 119
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
GAF+DR C+ + G+ N LS E VA CC+ C K C G + W + +G TGG
Sbjct: 120 GAFADRLCVSTGGKFNELLSPEDVAFCCQNC----GKGCEGGYPIKAWQYFRTQGVPTGG 175
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
DY + GC P I PC T C + + + + +C YG Q +++
Sbjct: 176 DYDSKEGCAPYKIPPCFDQKGKNT---CAGKPLER---NHQCPKTCYGSTTVQKRYKVKN 229
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
Y V ++ + ++++++ +GP A+F L+DD YKSG+Y+ T AK + HS K+IGWG
Sbjct: 230 EY-VLNSPNTMEQDLIKYGPIEASFNLFDDLSAYKSGIYQKTPKAKFLS-GHSIKIIGWG 287
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
ENG PYWL +N+W WG++GT +I++G+ EC E AG P
Sbjct: 288 KENGVPYWLAVNSWSKFWGEQGTFRIIKGRNECGIERSATAGIP 331
>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
EGFP fusion protein [synthetic construct]
Length = 578
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 128/329 (38%), Positives = 173/329 (52%), Gaps = 23/329 (6%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
+ SD I+ IN++ TW AGRNF N+ YL++ ++ K LP +R
Sbjct: 23 FHPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LP-ERV 72
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + + +P+ FDAREQW NC TI + D G+C + F AV A SDR CI + G+ N
Sbjct: 73 GFSEDIN--LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVN 130
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CC I D C+ G WNF ++G V+GG Y GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPC 187
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + P PK C+ C Y + +DKH +Y V D+E I EI
Sbjct: 188 EHHVNGSRPPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIY 244
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F ++ DF YKSGVYKH + + H+ +++GWG ENG PYWLV N+W
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNV 302
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
Length = 335
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 132/350 (37%), Positives = 185/350 (52%), Gaps = 29/350 (8%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+I I V LL L + + S +Y+D+IN A TW A +NFP +++E + + L +K
Sbjct: 5 IILISVVLLSVYLTE-QAHFLSKSYVDKINEVAKTWKAKQNFPEYMTKEQIVRLL--GSK 61
Query: 61 YFDQSDRPLPGDRKTYDPEY--SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ L K D EY + +P+ FDAR QW +C TIG V + G C +
Sbjct: 62 NLTSVPKSLI---KENDSEYINDSEIPNFFDARIQWSHCKTIGEVRNQGNCGSCWAHGTT 118
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
GAF+DR CI + G N +S E + CC C + C+ G+ + W + + G VTGG
Sbjct: 119 GAFADRLCIATNGDFNELISAEELTFCCHRCGF----GCNGGNPLKAWQYFKRHGVVTGG 174
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKV-PKLKCHTRCTNPT---YGRGFFQDKH 234
+Y GCQP + PC SC Q P KC C Y +G ++ K+
Sbjct: 175 NYNTTDGCQPYKVPPCVKDEEGHN--SCSGQPTEPNHKCSRSCYGDKTCDYKKGHYKTKN 232
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSG 292
L N D ++K+ +A+GP A+F +YDDF +Y+SGVY+ T +AK YL H+
Sbjct: 233 AYYL------NIDTMQKDTIAYGPIEASFDVYDDFVNYESGVYQKTEDAK---YLGGHAV 283
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
K+IGWG E+GTPYWL++N+WG WG G KILRG EC E AG P
Sbjct: 284 KMIGWGEEDGTPYWLMVNSWGEQWGANGMFKILRGTNECGIEGSPTAGVP 333
>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 335
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 187/348 (53%), Gaps = 25/348 (7%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+I I V LL + + S YI++IN A TW A +NFP N +E + + L+ +
Sbjct: 5 VILISVVLLSVYFTE-QAHFLSKDYINKINEVAKTWKAKQNFPENTPKEQIVR-LLGSKR 62
Query: 61 YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
S P+ + + Y ++ VP+ FD+R +W C TIGHV + G C + GA
Sbjct: 63 LLGVSKSPIKENDELYMD--NSEVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGA 120
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
F+DR C+ + G+ N +S E + CC C + C+ G + W + + G VTGGDY
Sbjct: 121 FADRLCVATNGEFNELISAEELTFCCHRCGF----GCNGGYPLKAWQYFKRHGVVTGGDY 176
Query: 181 GDRTGCQPSTISPCSH----HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
GCQP + PC H S P+ N K K KC+ T + ++ ++T
Sbjct: 177 DTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSK-KCYGDDT-----IDYKKNHYKT 230
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKL 294
Y++ + ++K+ + +GP A+F +YDDF +Y+SGVY+ T NA +YL H+ K+
Sbjct: 231 KDAYYLKNT--TMQKDTMVYGPIEASFDVYDDFMNYESGVYQRTGNA---SYLGGHAVKM 285
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
IGWG E GTPYWL++N+WG WGD+G KILRG EC E AG P
Sbjct: 286 IGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGIESSCTAGVP 333
>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
Length = 322
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 128/329 (38%), Positives = 173/329 (52%), Gaps = 23/329 (6%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
+ SD I+ IN++ TW AGRNF N+ YL++ ++ K LP +R
Sbjct: 6 FHPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LP-ERV 55
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + + +P+ FDAREQW NC TI + D G+C + F AV A SDR CI + G+ N
Sbjct: 56 GFSEDIN--LPESFDAREQWSNCPTIAQIRDQGSCGSSWAFGAVEAMSDRICIHTNGRVN 113
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CC I D C+ G WNF ++G V+GG Y GC P TI PC
Sbjct: 114 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPC 170
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + P PK C+ C Y + +DKH +Y V D+E I EI
Sbjct: 171 EHHVNGARPPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIY 227
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F ++ DF YKSGVYKH + + H+ +++GWG ENG PYWLV N+W
Sbjct: 228 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNA 285
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E I AG P+
Sbjct: 286 DWGDNGFFKILRGENHCGIESEIVAGIPR 314
>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
Full=Cysteine protease-related 4; Flags: Precursor
gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
Length = 335
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 128/343 (37%), Positives = 180/343 (52%), Gaps = 16/343 (4%)
Query: 4 ILVFLLGCT--LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
IL L+ T LV + K +A + +N + + W A P +++ E +++ L+
Sbjct: 5 ILAALVAVTAGLVIPLVPKTQEAITEYVNSKQSLWKA--EIPKDITIEQVKKRLMRT--- 59
Query: 62 FDQSDRPLPGDRKTYDPEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
+ P D + + + T+P FDAR QWPNC +I ++ D C + FAA A
Sbjct: 60 --EFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEA 117
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
SDR CI S G N LS E V SCC C Y C G W +L K G TGG Y
Sbjct: 118 ASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCEGGYPINAWKYLVKSGFCTGGSY 173
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
+ GC+P +++PC T PSC + C +CTN Y + DKH + Y
Sbjct: 174 EAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAY 233
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
V I+ EI+AHGP A F +Y+DFY YK+GVY HT+ +L H+ +++GWGT+
Sbjct: 234 AVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGG--HAIRILGWGTD 291
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
NGTPYWLV N+W +WG+ G +I+RG EC E+ + G PK
Sbjct: 292 NGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334
>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
Length = 335
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 130/344 (37%), Positives = 176/344 (51%), Gaps = 19/344 (5%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L L+ T R L+ SD ++ IN++ TWTAG NF N+ Y+++
Sbjct: 4 LLATLSCLVLLTSARESLHFQPLSDELVNFINKQNTTWTAGHNF-YNVDLSYVKKLC--- 59
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ P R + + +P FDAREQWPNC TI + D G+C + F AV
Sbjct: 60 GTFLGGPKLP---QRAAFAAD--MILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI+S G+ N +S E + +CC + C+ G WNF K+G V+GG
Sbjct: 115 EAISDRICIRSNGRVNVEVSAEDMLTCCGD---ECGDGCNGGFPSGAWNFWTKKGLVSGG 171
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y GC+P +I PC HH + P PK C C P Y + +DKH
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYTPSYKEDKHFGCS 228
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y + NE I EI +GP F +Y DF YKSGVY+H + + H+ +++GWG
Sbjct: 229 SYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGG--HAIRILGWG 286
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 287 VENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 330
>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
Length = 335
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 130/344 (37%), Positives = 176/344 (51%), Gaps = 19/344 (5%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L L+ T R L+ SD ++ IN++ TWTAG NF N+ Y+++
Sbjct: 4 LLATLSCLVLLTSARESLHFQPLSDELVNFINKQNTTWTAGHNF-YNVDLSYVKKLC--- 59
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ P R + + +P FDAREQWPNC TI + D G+C + F AV
Sbjct: 60 GTFLGGPKLP---QRAAFAAD--MILPKGFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI+S G+ N +S E + +CC + C+ G WNF K+G V+GG
Sbjct: 115 EAISDRICIRSNGRVNVEVSAEDMLTCCGD---ECGDGCNGGFPSGAWNFWTKKGLVSGG 171
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y GC+P +I PC HH + P PK C C P Y + +DKH
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYTPSYKEDKHFGCS 228
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y + NE I EI +GP F +Y DF YKSGVY+H + + H+ +++GWG
Sbjct: 229 SYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGG--HAIRILGWG 286
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 287 VENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 330
>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
Length = 340
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 129/343 (37%), Positives = 175/343 (51%), Gaps = 22/343 (6%)
Query: 5 LVFLLGCTLVRGE--LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
L FLL T + SD ++ IN++ TW AG NF N+ Y+++
Sbjct: 8 LCFLLALTGAYNAPWFHPLSDELVNYINKQNTTWQAGHNF-HNVHLSYVKRLC------- 59
Query: 63 DQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
L G R +++ V P+ FDAR+QWPNC TI + D G+C + F AVGA
Sbjct: 60 ---GTYLGGPRLPQRIKFAEIVDLPESFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVGA 116
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
SDR CI + G N +S E + SCC + + C+ G W + K+G V+GG Y
Sbjct: 117 MSDRVCIHTNGHVNVEVSAEDLLSCCGL---ECGDGCNGGYPSAAWKYWTKKGLVSGGLY 173
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
GC+P +I PC HH + T P C + KC C P Y + +DKH +Y
Sbjct: 174 DSHVGCRPYSIPPCEHHVNG-TRPQCTGEGGDTPKCSKTC-EPGYSPSYKEDKHFGYDSY 231
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
V NE I EI +GP F ++ DF YK+GVYKH + L H+ +++GWG E
Sbjct: 232 SVSSNEKEIMAEIYKNGPVEGAFTVFSDFLMYKTGVYKHLAGEMLGG--HAIRILGWGKE 289
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
NG PYWLV N+W WGD G KI+RG+ C E I AG P+
Sbjct: 290 NGVPYWLVGNSWNVDWGDSGFFKIVRGEDHCGIESEIVAGIPR 332
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
Length = 340
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 120/345 (34%), Positives = 176/345 (51%), Gaps = 16/345 (4%)
Query: 2 IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
+ +L + E + SD +I+ + +A TWT GRNF A +SE ++R + +
Sbjct: 8 VSLLALVAMTKATESEPHMLSDEFIELVKSKATTWTPGRNFDAAVSEHHIRALM---GVH 64
Query: 62 FDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
D LP R+ + +P+ FD+ + WPNC TI + D G+C + F AV A
Sbjct: 65 PDSHKFTLPEKRELLGADGEDKDLPEEFDSSKNWPNCPTIREIRDQGSCGSCWAFGAVEA 124
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
SDR CI S N S + + +CC C + C+ G W++ RG V+GG Y
Sbjct: 125 MSDRVCIHSNATVNFHFSADDLVTCCHTCGF----GCNGGFPGAAWSYWTTRGIVSGGSY 180
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
GC+P + PC HH P P C + P C +C P Y + +DKH +Y
Sbjct: 181 NSTEGCRPYEVEPCEHHVDGPR-PPCHSGSTP--HCKHQC-QPNYSVDYEKDKHFGASSY 236
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT- 299
++ N I++EI+ +GP F +Y+D YK+GVY+H +L H+ ++IGWG
Sbjct: 237 SINRNPRNIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGKQLGG--HAIRIIGWGVW 294
Query: 300 -ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
E+ PYWL+ N+W WGD G +ILRGK C E I+AG PK
Sbjct: 295 GESKVPYWLIANSWNTDWGDNGFFRILRGKDHCGIESQISAGLPK 339
>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
Full=RSG-2; Contains: RecName: Full=Cathepsin B light
chain; Contains: RecName: Full=Cathepsin B heavy chain;
Flags: Precursor
gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
Length = 339
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 126/325 (38%), Positives = 170/325 (52%), Gaps = 17/325 (5%)
Query: 19 YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
+ SD I+ IN++ TW AGRNF N+ YL++ + P +R +
Sbjct: 24 HPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLC---GTVLGGPNLP---ERVGFSE 76
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P+ FDAREQW NC TI + D G+C + F AV A SDR CI + G+ N +S
Sbjct: 77 DIN--LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVS 134
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
E + +CC I D C+ G WNF ++G V+GG Y GC P TI PC HH
Sbjct: 135 AEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHV 191
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P PK C+ C Y + +DKH +Y V D+E I EI +GP
Sbjct: 192 NGSRPPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGP 248
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F ++ DF YKSGVYKH + + H+ +++GWG ENG PYWLV N+W WGD
Sbjct: 249 VEGAFTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNVDWGD 306
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G KILRG+ C E I AG P+
Sbjct: 307 NGFFKILRGENHCGIESEIVAGIPR 331
>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
Length = 339
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 128/329 (38%), Positives = 173/329 (52%), Gaps = 23/329 (6%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
+ SD I+ IN++ TW AGRNF N+ YL++ ++ K LP +R
Sbjct: 23 FHPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LP-ERV 72
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + + +P+ FDAREQW NC TI + D G+C + F AV A SDR CI + G+ N
Sbjct: 73 GFSEDIN--LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVN 130
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CC I D C+ G WNF ++G V+GG Y GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPC 187
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + P PK C+ C Y + +DKH +Y V D+E I EI
Sbjct: 188 EHHVNGSRPPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIY 244
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F ++ DF YKSGVYKH + + H+ +++GWG ENG PYWLV N+W
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNV 302
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/341 (37%), Positives = 174/341 (51%), Gaps = 25/341 (7%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKY 61
L+FL + + I+ +N TWTAG NF ++E Y++ L
Sbjct: 6 LLFLFAGVGALPQHRGLFNEEINIVNSLKTTWTAGVNFGPEVTESYIKGLCGTLEEKENI 65
Query: 62 FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQW-PNCGTIGHVPDTGACAAPHIFAAVGA 120
+ P+ AT+PD +D RE+W C + + D G+C + F AV A
Sbjct: 66 LEVKQIPV-----------IATLPDSYDTREKWGSTCPSTTEIRDQGSCGSCWAFGAVEA 114
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
F+DR CI+S G +N +S E + +CC C + C+ G + WNF G+VTGG
Sbjct: 115 FTDRICIQSNGAKNPHISAEDLLTCCGFWCGF----GCNGGRLGPAWNFFKYAGAVTGGQ 170
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y GCQP I C HH S P CE + P KC C Y + DKH+ +
Sbjct: 171 YNSSEGCQPYEIPSCEHHTSGSKKP-CEGSE-PTPKCKRSCRE-GYNVSYSDDKHKVSSH 227
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + ++E+ IK EI +GP A F +Y DF +YKSGVYK+T+ L H+ K++GWG
Sbjct: 228 YSIANDEEQIKNEIYLNGPVEAAFTVYSDFPNYKSGVYKYTTGNALGG--HAIKILGWGV 285
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
EN PYWLV N+W P WGD+G KILRG EC E + AG
Sbjct: 286 ENNVPYWLVANSWNPDWGDKGFFKILRGSNECGIEASVVAG 326
>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 335
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/348 (36%), Positives = 185/348 (53%), Gaps = 25/348 (7%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+I I V LL L + + S Y+++IN A TW A +NFP N E + + L+ +
Sbjct: 5 VILISVVLLSVYLTE-QAHFLSKEYVNKINEVAKTWKAKQNFPENTPREDIVR-LLGSKR 62
Query: 61 YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
+ P+ + Y + VP+ FD+R +W NC TIG V + G C + GA
Sbjct: 63 LLGLNKSPIKENDILYVD--NGEVPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGA 120
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
F+DR CI + G+ N +S E + CC C + C+ G+ + W + + G VTGG+Y
Sbjct: 121 FADRLCIATDGEFNELISAEELTFCCHTCGF----GCNGGNPLKAWKYFKRHGVVTGGNY 176
Query: 181 GDRTGCQPSTISPCSH----HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
GCQP + PC H S P+ N K K KC+ T + ++ ++T
Sbjct: 177 NTTDGCQPYRVPPCVRDDEGHNSCSGQPTERNHKCSK-KCYGDET-----INYKKNHYKT 230
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKL 294
Y++ + ++K+ + +GP A+F +YDDF Y+SGVY+ T NA +YL H+ K+
Sbjct: 231 KDAYYLSNT--TMQKDTMVYGPIEASFDVYDDFTSYESGVYQKTENA---SYLGGHAVKM 285
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
IGWG E GTPYWL++N+WG WGD+G KILRG EC E AG P
Sbjct: 286 IGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGVESSCTAGVP 333
>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
Length = 334
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 134/336 (39%), Positives = 179/336 (53%), Gaps = 28/336 (8%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
+ Y + YI+QIN A TW AG NF LS + + L +K + + P KT+
Sbjct: 17 QAYFLEEDYINQINTNAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQTSPDMFKTH 74
Query: 77 DPEYSAT---VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
D Y++ +P FDAR++W C TIG V D G C + F AF+DR CI + G+
Sbjct: 75 DEAYNSLPNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEF 134
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N LS E +A CC C + C G + W + K G VTGGDY GCQP + P
Sbjct: 135 NELLSAEELAFCCHKCGF----GCHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPP 190
Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
C +G+ +C + P K H RCT YG F +D H T Y++
Sbjct: 191 CPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQELDFKEDHHWTRDAYYL--TYTT 241
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
I+K+++A+GP A+F +YDDF +YKSGVY T NA +YL H+ KLIGWG E G PYW
Sbjct: 242 IQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENA---SYLGGHAVKLIGWGEEYGVPYW 298
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
L++N+W WGD+G KILRG EC + G P
Sbjct: 299 LLVNSWNDQWGDQGLFKILRGTNECGIDNSTTGGVP 334
>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
Length = 339
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 125/338 (36%), Positives = 173/338 (51%), Gaps = 17/338 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ LG R + SD ++ +N++ TW AG NF N+ YL++ +
Sbjct: 11 LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P+ FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
CI + + +S E + +CC I D C+ G WNF ++G V+GG Y G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
C+P +I PC HH + P PK C C P Y + QDKH +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
Length = 339
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 125/338 (36%), Positives = 173/338 (51%), Gaps = 17/338 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ LG R + SD ++ +N++ TW AG NF N+ YL++ +
Sbjct: 11 LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P+ FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
CI + + +S E + +CC I D C+ G WNF ++G V+GG Y G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
C+P +I PC HH + P PK C C P Y + QDKH +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
Length = 341
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 129/349 (36%), Positives = 184/349 (52%), Gaps = 20/349 (5%)
Query: 2 IHILVFLLGCTLVRGELYKF-SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
I+ L LL L + F SDA+++++ R+A TW GRNF ++SE+YLR + +
Sbjct: 5 IYFLWLLLVTFLTINDAADFLSDAFMEKVRRKAKTWNLGRNFHESISEKYLRGLMGVHEE 64
Query: 61 YFDQSDRPLPGDRKTY---DPEYS-ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
+ PLP ++ D E S A +P FDAR +W +C TI + + G+C + A
Sbjct: 65 SYKY---PLPDKQEVLGESDDEISLADLPVDFDARLRWTSCPTISEIREQGSCGSCWAIA 121
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
SDR CI S G N LS + SCC IC + +C G W + ++G V+
Sbjct: 122 TTSVMSDRLCIGSNGVMNFRLSGLDMLSCCAICGF----ACQGGYPGAAWAYWARKGLVS 177
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
GGDYG + GCQP TI PC H G+ + P C ++C C P+Y F +DK+
Sbjct: 178 GGDYGSQQGCQPYTIEPCDHSGNG-SRPVCTVGG--GVRCQHLC-EPSYKVDFQRDKNFA 233
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
+ Y + ++ I+KEI+ +GP A +Y+DF YK+GVY H K+ H+ +++G
Sbjct: 234 SKVYSISNDVLEIQKEIMTNGPVQAILTVYEDFLSYKTGVYYHLEGEKVGP--HAVRILG 291
Query: 297 WGT--ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WG PYWLV N+WG WGD G I RG+ C E I AG PK
Sbjct: 292 WGVWGTKKVPYWLVANSWGSDWGDNGFFHIFRGENHCDIEGYIMAGLPK 340
>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
Length = 339
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 125/338 (36%), Positives = 173/338 (51%), Gaps = 17/338 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ LG R + SD ++ +N++ TW AG NF N+ YL++ +
Sbjct: 11 LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P+ FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
CI + + +S E + +CC I D C+ G WNF ++G V+GG Y G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
C+P +I PC HH + P PK C C P Y + QDKH +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
Length = 343
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 122/323 (37%), Positives = 170/323 (52%), Gaps = 16/323 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
SD +ID IN TW A RNF ++ +++ L+ + + P K+ + +
Sbjct: 36 LSDDFIDHINSLNTTWKAHRNFGNDIPLREIKK-LMGVRRSLENFRLP----EKSME-DI 89
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FD REQWP C T+ + D G+C + F AV A SDR CI SKG+ + S E
Sbjct: 90 DIEIPEEFDPREQWPECPTLKEIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAE 149
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ +CC C + C+ G W++ G V+GG Y GCQP I PC HH +
Sbjct: 150 DLLTCCSSCGF----GCNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNG 205
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P C P+ C RC Y + +D+H Y V + AI+KE+L +GP
Sbjct: 206 TRKP-CGEGDTPR--CVKRCEE-GYDVPYGKDRHFGKSAYAVPGSVKAIQKELLLNGPAE 261
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A +YDDF HY++GVY+H S L H+ +L+GWG E+GTPYWL+ N+W WGD G
Sbjct: 262 AALTVYDDFLHYRTGVYQHVSGGALGG--HAVRLLGWGVEDGTPYWLLANSWNYDWGDNG 319
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
+ILRG+ EC E I G PK
Sbjct: 320 YFRILRGQDECGIESDINGGLPK 342
>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
Length = 344
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 118/325 (36%), Positives = 173/325 (53%), Gaps = 12/325 (3%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPLPGDRKTYDP 78
SD +++ + +A TWT GRN+ ++ + R+ + DA F ++ L +
Sbjct: 29 LSDEFLEIVRSKAKTWTPGRNYDKSVPRSHFRRLMGVHPDAHKFTLHEKSLVLGEEVGLA 88
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + VP+ FDAR+ WPNC TIG + D G+C + F AV A SDR CI S + S
Sbjct: 89 D--SDVPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRLCIHSNATIHFHFS 146
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ + SCC C + C+ G W + ++G V+GG YG GC+P I+PC HH
Sbjct: 147 ADDLVSCCHTCGF----GCNGGFPGAAWAYWTRKGIVSGGPYGSSQGCRPYEIAPCEHHV 202
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ T P C+ + C C +Y + DKH + +Y V N I+KEI+ +GP
Sbjct: 203 NG-TRPPCDGEHGKTPSCRHECQK-SYDVDYKTDKHFGSKSYSVKRNVKDIQKEIMQNGP 260
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +Y+D YK GVY+H +L H+ +++GWG EN TPYWL+ N+W WG+
Sbjct: 261 VEGAFTVYEDLILYKDGVYQHVHGRELGG--HAIRILGWGVENKTPYWLIANSWNTDWGN 318
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G K+LRG+ C E IAAG PK
Sbjct: 319 NGFFKMLRGEDHCGIESAIAAGLPK 343
>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
Length = 330
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/334 (37%), Positives = 172/334 (51%), Gaps = 21/334 (6%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
+L R L S ++ IN+ TW AG NF N+ Y+R+ L G
Sbjct: 16 SLARPHLQPLSSEMVNYINKLNTTWKAGHNF-HNVDYSYVRRLC----------GTMLKG 64
Query: 72 DRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
+ +Y+ +P FDAREQWP C T+ + D G+C + F A A SDR CI S
Sbjct: 65 PKLPIMVQYAGGLKLPAEFDAREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
G+ + +S+E + +CC C C+ G W+F K G V+GG Y GC+P
Sbjct: 125 GGKISVEISSEDLLTCCDSC----GMGCNGGYPSSAWDFWTKEGLVSGGLYNSHIGCRPY 180
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
TISPC HH + + P C + +C +RC Y + QDKH +Y V+ + + I
Sbjct: 181 TISPCEHHVNG-SRPPCTGEGGDTPECISRC-EAGYSPSYKQDKHYGKSSYSVEGSVEQI 238
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
+ EI +GP F +Y+DF YKSGVY+H S + L H+ K++GWG E+G PYWL
Sbjct: 239 QAEISKNGPVEGAFTVYEDFVMYKSGVYQHVSGSVLGG--HAIKVLGWGEEDGIPYWLCA 296
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
N+W WGD G KILRG C E I AG PK
Sbjct: 297 NSWNTDWGDNGFFKILRGSNHCGIESEIVAGIPK 330
>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
Length = 335
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/344 (36%), Positives = 177/344 (51%), Gaps = 14/344 (4%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ + L LV + K +A + +N + + W A P +++ E +++ L+
Sbjct: 4 VVFASLLALATGLVIPVVPKTPEAITEYVNSKQSLWKA--EIPKHITIEQVKKRLMRT-- 59
Query: 61 YFDQSDRPLPGDRKTYDPEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ P D + + T+PD FDAR QWP+C +I ++ D C + FAA
Sbjct: 60 ---EFVAPHTPDVEVIKHDIQEDTIPDTFDARTQWPSCVSINNIRDQSDCGSCWAFAAAE 116
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR CI S G N LS E V SCC C Y C G W +L K G TGG
Sbjct: 117 AASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCEGGYPINAWKYLVKSGFCTGGS 172
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y + GC+P +++PC T P C C +CTN Y + DKH +
Sbjct: 173 YVSQFGCKPYSLAPCGETVGNTTWPDCPQDGYNTPSCVNKCTNNNYNIAYKDDKHFGSTA 232
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V I+ EILAHGP A F +Y+DFY YKSGVY HT+ +L H+ +++GWGT
Sbjct: 233 YAVGKKVAQIQAEILAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGG--HAIRILGWGT 290
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+NGTPYWLV N+W +WG+ G +I+RG EC E+ + G PK
Sbjct: 291 DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334
>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 337
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 121/324 (37%), Positives = 165/324 (50%), Gaps = 14/324 (4%)
Query: 19 YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
+ SDA+I IN + NTW AGRNFP ++ + + A Q D + +D
Sbjct: 23 HPLSDAFIRLINSKQNTWRAGRNFPTTTPFAHINKLMGAL-----QDDNVAKMPKVEHDA 77
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ A++P+ FD R++WP+C T+ + D G+C + F AV A +DR C S G ++ S
Sbjct: 78 DLIASLPENFDPRDKWPDCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFS 137
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+E + SCC IC C+ G W + G V+GG+Y GC+P I PC HH
Sbjct: 138 SEDLLSCCPICGL----GCNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPYEIPPCEHHV 193
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+P + K P KC C N Y + +DK Y V ED I+ E+ +GP
Sbjct: 194 PGNRMPCSGDTKTP--KCQKNCEN-GYNVMYKKDKRYGKHVYSVSAGEDHIRAELYKNGP 250
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +Y D YKSGVYKH L H+ K++GWG EN YWLV N+W WGD
Sbjct: 251 VEGAFTVYADLLAYKSGVYKHIQGDALGG--HAIKILGWGVENDNKYWLVANSWNTDWGD 308
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
G KILRG+ C E I AG+P
Sbjct: 309 NGFFKILRGENHCGIEGSIIAGEP 332
>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
Length = 335
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 127/344 (36%), Positives = 176/344 (51%), Gaps = 14/344 (4%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+I + + LV K +A + +N + + W A P LS E +++ L+
Sbjct: 4 VIFAALVAVATGLVIPVAPKTPEAITEYVNSKQSLWKA--EIPKGLSIEQVKKRLMRT-- 59
Query: 61 YFDQSDRPLPGDRKTYDPEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ P D + + + T+P FDAR QWPNC +I ++ D C + FAA
Sbjct: 60 ---EFVAPHTPDVEVVEHDIQEDTIPATFDARTQWPNCVSINNIRDQSDCGSCWAFAAAE 116
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR CI S G N LS E V SCC C Y C G W +L K G TGG
Sbjct: 117 AASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCDGGYPINAWKYLVKSGFCTGGS 172
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y + GC+P +++PC T P C + C +CTN Y + DKH +
Sbjct: 173 YEAQFGCKPYSLAPCGETVGNVTWPDCPDDGYNTPACVNKCTNTKYNTAYKDDKHFGSTA 232
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V I+ EI+AHGP A F +Y+DFY YKSGVY HT+ +L H+ +++GWGT
Sbjct: 233 YAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGG--HAIRILGWGT 290
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+NGTPYWLV N+W +WG+ G +I+RG EC E+ + G PK
Sbjct: 291 DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334
>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
Length = 337
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 125/341 (36%), Positives = 174/341 (51%), Gaps = 21/341 (6%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYF 62
+ +L R + SD ++ IN+ TW AG NF N Y+++ + AK
Sbjct: 11 LVVLTSAKSRLSIPPLSDEMVNHINKLNTTWQAGHNF-LNADMSYVKKLCGTFMGGAKLL 69
Query: 63 DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
Q R + D + +P+ FDAREQWPNC TI + D G+C + F AV A S
Sbjct: 70 PQ--RMILAD--------NMKLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAIS 119
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR C+ S G N +S E + SCC + C+ G WNF K+G V+GG Y
Sbjct: 120 DRICVHSNGNANVEVSAEDLLSCCG---SECGDGCNGGFPAGAWNFWTKKGLVSGGLYDS 176
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
GC+P +I PC HH + + P+C ++ C +C Y + DK+ + +Y V
Sbjct: 177 HVGCRPYSIPPCEHHVNG-SRPACTGEEGDTPTCRKKCEE-GYSTQYKDDKNYGSTSYSV 234
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
+E I EI +GP F++Y+DF HYKSGVY+H + L H+ +++GWG ENG
Sbjct: 235 PSSEQEIMAEIYKNGPVEGAFSVYEDFLHYKSGVYQHVAGEMLGG--HAIRILGWGVENG 292
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
YWL N+W WGD G K LRGK C E I AG P+
Sbjct: 293 IRYWLAANSWNIDWGDNGFFKFLRGKNHCGIESEIIAGIPR 333
>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
Length = 332
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 123/323 (38%), Positives = 164/323 (50%), Gaps = 16/323 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
S YI IN + W AGRNF S YLR + + D PLP T
Sbjct: 26 LSSEYIHSINEASEIWKAGRNFHPETSSNYLRSLMGVLPNHKDHLPPPLPSLLGT----- 80
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P FDARE WPNC +I + D G+C + F A A SDR CI + +N +S E
Sbjct: 81 -EALPSDFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRICIHT--NKNVNISAE 137
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC C + C+ G W + +G V+GG YG +GCQP I PC HH +
Sbjct: 138 NLLSCCYSCGF----GCNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPYDIEPCEHHVNG 193
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P E + PK CH C N Y + +D +Y + + I+ EI+ +GP
Sbjct: 194 TRQPCAEGGRTPK--CHRTCENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDNGPVE 251
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A F++Y DF + KSGVY+H + L H+ +++GWG E GTPYWLV N+W WGD+G
Sbjct: 252 AAFSVYSDFMNDKSGVYRHVKGSLLGG--HAIRILGWGVEKGTPYWLVANSWNTDWGDKG 309
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
T KILRG C E + G P+
Sbjct: 310 TFKILRGSDHCGIEGSVVTGLPR 332
>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
Length = 332
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 139/350 (39%), Positives = 176/350 (50%), Gaps = 30/350 (8%)
Query: 5 LVFLLGCTLVR----GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
L FLL L E Y + YI+QIN A TW AG NF LS E + L +K
Sbjct: 1 LAFLLSVVLFSVYQTEEAYFLEEDYINQINENAKTWKAGINFDPKLSVENFVKLL--GSK 58
Query: 61 YFDQSDRPLPGDRKTYDPEY-SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ + P KT D Y + +P FDAR++W C TIG V D G C + F
Sbjct: 59 GVQAAKKASPDMFKTDDKTYENQRIPKFFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSS 118
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
AF+DR CI + G N LS E + CC C Y C G + W K G VTGG+
Sbjct: 119 AFADRLCIATDGDFNELLSAEELTFCCHTCGY----GCHGGYPIKAWERFKKHGLVTGGN 174
Query: 180 YGDRTGCQPSTISPC--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG---RGFFQDKH 234
Y GCQP +SPC +G+ +C + P K H RCT YG R F +D
Sbjct: 175 YDSSEGCQPYRVSPCPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGDQDRDFKEDHR 227
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSG 292
T Y++ I+K+++ +GP A++ +YDDF YKSGVY T NA YL H+
Sbjct: 228 FTRDAYYL--TYGTIQKDVMTYGPIEASYEVYDDFPSYKSGVYVRTENA---TYLGGHAV 282
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
KLIGWG E G PYWL++N+W WGDRG KI RG EC + G P
Sbjct: 283 KLIGWGEEYGVPYWLMVNSWNDQWGDRGLFKIRRGTNECGIDNSTTGGVP 332
>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
Length = 338
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 120/348 (34%), Positives = 184/348 (52%), Gaps = 18/348 (5%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--D 58
+ +L F+ + + S+ +++ + +A TWT GRNF A++SE ++R + D
Sbjct: 3 LFLLLAFVAIAAATEDDPHMLSEEFMELVRGKAKTWTVGRNFDASVSEHHIRGLMGVHPD 62
Query: 59 AKYFDQSDRP-LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
A F ++ + G+ D +P+ FDAR WP+C TIG + D G+C + F A
Sbjct: 63 AHKFTLPEKSQVLGNLMEAD---GGDLPEEFDARTAWPDCPTIGEIRDQGSCGSCWAFGA 119
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
V A SDR CI S N S + + SCC C + C+ G W++ +G V+G
Sbjct: 120 VEAMSDRVCIHSNATVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTHKGIVSG 175
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
G YG + GC+P + PC HH + T P C + P +C +C + Y + +DKH
Sbjct: 176 GSYGSKEGCRPYEVEPCEHHVNG-TRPPCHSGSTP--RCMHKCES-GYSVDYAKDKHFGA 231
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
Y V+ N I++EI+ +GP F +Y+D YK+GVY+H +L H+ +++GW
Sbjct: 232 KAYSVNRNPLDIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGG--HAIRILGW 289
Query: 298 GT--ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G +N PYWL+ N+W WGD G +ILRG+ C E I+AG PK
Sbjct: 290 GVWGDNKVPYWLIGNSWNTDWGDNGFFRILRGEDHCGIESAISAGLPK 337
>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
Length = 334
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 137/352 (38%), Positives = 179/352 (50%), Gaps = 28/352 (7%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ +L +L + Y + YI+QIN A TW AG NF LS + + L +K
Sbjct: 1 LVILLSVVLFSVYRTEQAYFLEEDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSK 58
Query: 61 YFDQSDRPLPGDRKTYDPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
+ + P KT+D Y S +P FDAR++W C TIG V D G C + F
Sbjct: 59 GVQAAKQASPDMFKTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGT 118
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
AF+DR CI + G+ N LS E +A CC C + CS G R W K G VTG
Sbjct: 119 SSAFADRLCIATDGEFNELLSAEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTG 174
Query: 178 GDYGDRTGCQPSTISPC--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQD 232
G+Y GCQP + PC +G+ +C + P K H RCT YG F +D
Sbjct: 175 GNYDSGEGCQPYRVPPCPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQDLDFKED 227
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--H 290
H T Y++ I+ +ILA+GP A+F +YDDF YKSGVY NA YL H
Sbjct: 228 HHYTRDAYYL--TYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGH 282
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ KLIGWG E G PYWL++N+W WGD+G KI RG EC + G P
Sbjct: 283 AVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
Length = 342
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 122/329 (37%), Positives = 180/329 (54%), Gaps = 18/329 (5%)
Query: 19 YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRP-LPGDRKT 75
+ SD +I+ + +A TWT GRNF A++SE ++R + DA F ++ + G+
Sbjct: 25 HMLSDEFIELVRSKAKTWTPGRNFDASVSEGHIRGLMGVHPDAHKFTLPEKSQVLGNLVG 84
Query: 76 YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
D + +P+ FDAR WPNC TIG + D G+C + F AV A SDR CI S G N
Sbjct: 85 DDGD---DLPESFDARTAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNF 141
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
S E + SCC C + C+ G W++ +G V+GG Y GC+P I PC
Sbjct: 142 HFSAEDLVSCCHTCGF----GCNGGFPGAAWSYWTHKGIVSGGSYNSNEGCRPYEIEPCE 197
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HH + T P C+N + P C +C + +Y + +DKH + +Y + N I++EI+
Sbjct: 198 HHVNG-TRPPCKNGRTP--SCKHQCES-SYSVDYAKDKHFGSKSYSIRRNPREIQREIMT 253
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYWLVINTWG 313
+GP F +Y+D YKSGVYKH +L H+ +++GWG ++ PYWL+ N+W
Sbjct: 254 NGPVEGAFTVYEDLILYKSGVYKHVHGKELGG--HAIRILGWGVWGDSKVPYWLIGNSWN 311
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WGD G +I+RG+ C E I+AG P
Sbjct: 312 TDWGDNGFFRIVRGEDHCGIESAISAGLP 340
>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
Length = 334
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 122/343 (35%), Positives = 168/343 (48%), Gaps = 15/343 (4%)
Query: 1 MIHILVFLLGCTLVRGEL-YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
+I + + C + E+ + SD +ID IN + NTW AGRNF + + +++ + A
Sbjct: 3 LIRAICLVFLCGIAVSEIPHPLSDKFIDLINSKQNTWIAGRNFDIGRTLKSIKKLMGALE 62
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ + D T + +P+ FD R++WPNC T+ + D G+C + F AV
Sbjct: 63 DKYLHKLYTVEHDDDTIN-----NLPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVE 117
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A +DR C S G ++ S E + SCC +C C+ G W + G V+GG+
Sbjct: 118 AMTDRYCTYSNGTKHFHFSAEDLLSCCPVCGL----GCNGGIPSFAWEYWKHFGIVSGGN 173
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y GC P I PC HH +P PK CH C Y + DK
Sbjct: 174 YNSSQGCLPYEIPPCEHHVPGNRIPCNGETSTPK--CHRSCRK-EYTNSYKSDKKYGKHV 230
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V E+ IK EI +GP F +Y D YKSGVYKHT L H+ K++GWG
Sbjct: 231 YSVGGGEEHIKAEIFKNGPVEGAFTVYADLLTYKSGVYKHTEGEALGG--HAIKIMGWGV 288
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
ENG YWL+ N+W WGD G KILRG+ C E I AG+P
Sbjct: 289 ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331
>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
Length = 338
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 119/324 (36%), Positives = 167/324 (51%), Gaps = 18/324 (5%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQF--LIADAKYFDQSDRPLPGDRKTYDP 78
SD +I+ IN + N+W AGRNFP + ++++ ++ D S + ++
Sbjct: 26 LSDDFINLINTKQNSWKAGRNFPEHTPFAHIKKLAGVLPDYHLSKLS-------KVEHED 78
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
E A++P+ FD R++WPNC T+ V D G+C + F AV A +DR C S G Q+ S
Sbjct: 79 ELIASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFS 138
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
E + SCC IC C+ G W + G V+GG Y GC+P I PC HH
Sbjct: 139 AEDLLSCCPICGL----GCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHV 194
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+P + K P KC C + Y + +DK + V ED I+ E+ +GP
Sbjct: 195 PGNRMPCNGDSKTP--KCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGP 251
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +Y D +YK+GVYKHT L H+ K++GWG ENG YWL+ N+W WGD
Sbjct: 252 VEGAFTVYSDLLNYKTGVYKHTIGDALGG--HAVKILGWGVENGNKYWLIANSWNSDWGD 309
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
G KILRG+ C E I AG+P
Sbjct: 310 NGFFKILRGEDHCGIESSIVAGEP 333
>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
Length = 340
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 124/345 (35%), Positives = 179/345 (51%), Gaps = 18/345 (5%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L L+ T + LY SD ++ +N+ TW AG NF ++ Y+++
Sbjct: 4 LLATLCCLVVLTSAQSRLYFKPLSDELVNHVNKLNTTWQAGHNF-YDVDMSYVKRLC--- 59
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ P ++ + E +P+ FDARE WPNC TI + D G+C + F AV
Sbjct: 60 GTLLNGPKLP----QRVHLAE-EMDLPENFDARENWPNCPTIKEIRDQGSCGSCWAFGAV 114
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI + G N +S E + +CC + + C+ G WNF K+G V+GG
Sbjct: 115 EAISDRVCIHTNGNVNVEVSAEDLLTCCHM---ECGDGCNGGFPAGAWNFWTKKGLVSGG 171
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y GC+P +I PC HH + + P C+ + KC C P Y + +DKH
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNG-SRPPCKGEGGETPKCSKTC-EPGYSPSYKEDKHYGYS 229
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y V +E I EI +GP F++Y DF YKSGVY+H + ++ H+ +++GWG
Sbjct: 230 SYGVPSSEQEIMAEIYKNGPVEGAFSVYTDFLVYKSGVYQHVTGEEVGG--HAIRILGWG 287
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
ENGTPYWL N+W WGD G KILRG+ C E I AG P+
Sbjct: 288 VENGTPYWLAANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPR 332
>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
Length = 339
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 124/339 (36%), Positives = 172/339 (50%), Gaps = 19/339 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R + SD ++ +N+ TW AG NF N+ YL++ A
Sbjct: 11 LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLCGAFL------ 63
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P+ FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 64 GGPKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
CI + + +S E + +CC +C C+ G WNF ++G V+GG Y
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
GC+P +I PC HH + P PK C C P Y + QDKH +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
+E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
YWLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
Length = 332
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 127/325 (39%), Positives = 165/325 (50%), Gaps = 20/325 (6%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
+ ID +N TW AG NF + Y++ D ++ LP + +
Sbjct: 24 LTQEIIDYVNSIDTTWKAGWNF-QGATVSYVKGLC---GVIRDPNNHKLPLKLHELNAQ- 78
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+PD FD+R QW NC TI V D G+C + AA A SDR C+ S G+ LS+E
Sbjct: 79 --DIPDTFDSRTQWANCPTIKEVRDQGSCGSCWAEAAAEAMSDRTCVASNGKVQVHLSSE 136
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH--G 198
+ +CC+ C C G W + + G VTGG YG GCQP I+PC HH G
Sbjct: 137 NLMACCETC----GMGCHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPYEIAPCEHHING 192
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
S P E P +C C + Y F +DKH Y V I+ EI+ +GP
Sbjct: 193 SRPACGKIE----PTPRCKKTCES-GYNVTFNKDKHYAKSAYSVSSKVQQIQMEIMTNGP 247
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y DF HYKSGVY+H S A+L H+ K+IGWG E TPYWL+ N+W WGD
Sbjct: 248 VEAAFTVYADFPHYKSGVYQHESGAELGG--HAVKMIGWGMEGSTPYWLIANSWNSDWGD 305
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G KILRG+ EC E I AG+P+
Sbjct: 306 MGFFKILRGQDECGIERDIVAGEPR 330
>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
Length = 332
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 137/350 (39%), Positives = 179/350 (51%), Gaps = 30/350 (8%)
Query: 5 LVFLLGCTLVR----GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
L FLL L+ + Y + YI+QIN A TW AG NF LS E + L +K
Sbjct: 1 LAFLLSVVLLSVYQTEQAYFLEEDYINQINENAKTWKAGINFDPKLSIENFVKLL--GSK 58
Query: 61 YFDQSDRPLPGDRKTYDPEY-SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ + P KT D Y + +P FDAR++W C TIG V D G C + F
Sbjct: 59 GVQAAKKASPDMFKTIDKAYENQKIPKFFDARKKWRKCFTIGEVRDQGKCGSCWAFGTSS 118
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
AF+DR CI + G+ N LS E + CC C + C G + W K G VTGGD
Sbjct: 119 AFADRLCIATNGEFNELLSAEELTFCCHKCGF----GCHGGYPIKAWERFQKHGLVTGGD 174
Query: 180 YGDRTGCQPSTISPC--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKH 234
Y GCQP +SPC +G+ +C + P K H RCT YG F +D H
Sbjct: 175 YDSGEGCQPYRVSPCPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQDLDFKKDHH 227
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSG 292
T Y++ I+++++A+GP A++ +YDDF YKSGVY T NA YL H+
Sbjct: 228 FTRDAYYL--TFGIIQRDVMAYGPIEASYDVYDDFPSYKSGVYVRTENA---TYLGGHAV 282
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
KLIGWG E G PYWL++N+W WGD+G KI RG EC + G P
Sbjct: 283 KLIGWGEEYGVPYWLMVNSWNDQWGDKGLFKIRRGTNECGIDNSTTGGVP 332
>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
Length = 340
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 124/327 (37%), Positives = 171/327 (52%), Gaps = 18/327 (5%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
+ SD ++ +N+ TW AGRNF N+ Y+++ Y P R +
Sbjct: 23 FHPLSDELVNYVNKLNTTWQAGRNF-HNVDISYVKRLC---GTYLGGPRLP---QRVQFA 75
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+ +P+ FDAREQWPNC TI + D G+C + F AV A SDR CI + G N +
Sbjct: 76 EDLD--LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAMSDRLCIHTNGHVNVEV 133
Query: 138 STEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
S E + SCC +C + C+ G W + ++G V+GG YG GC+P +I PC H
Sbjct: 134 SAEDLLSCCGPLC----GEGCNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPYSIPPCEH 189
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
H + T P C + KC C P Y + +DK+ +Y V E I EI +
Sbjct: 190 HVNG-TRPKCTGEGGDTPKCSKTC-EPGYSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKN 247
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
GP A F+++ DF YKSGVYKH + L H+ +++GWG ENG PYWLV N+W W
Sbjct: 248 GPVEAAFSVFSDFLTYKSGVYKHVAGEVLGG--HAIRILGWGKENGVPYWLVGNSWNVDW 305
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPK 343
GD G KILRG+ C E + AG P+
Sbjct: 306 GDNGFFKILRGEDHCGIESEVVAGIPR 332
>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
Length = 338
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 119/324 (36%), Positives = 167/324 (51%), Gaps = 18/324 (5%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQF--LIADAKYFDQSDRPLPGDRKTYDP 78
SD +I+ IN + N+W AGRNFP + ++++ ++ D S + ++
Sbjct: 26 LSDDFINLINTKQNSWKAGRNFPEHTPFAHIKRLAGVLPDYHLSKLS-------KVEHED 78
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
E A++P+ FD R++WPNC T+ V D G+C + F AV A +DR C S G Q+ S
Sbjct: 79 ELIASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFS 138
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
E + SCC IC C+ G W + G V+GG Y GC+P I PC HH
Sbjct: 139 AEDLLSCCPICGL----GCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHV 194
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+P + K P KC C + Y + +DK + V ED I+ E+ +GP
Sbjct: 195 PGNRMPCNGDSKTP--KCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGP 251
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +Y D +YK+GVYKHT L H+ K++GWG ENG YWL+ N+W WGD
Sbjct: 252 VEGAFTVYSDLLNYKTGVYKHTIGDALGG--HAVKILGWGVENGNKYWLIANSWNSDWGD 309
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
G KILRG+ C E I AG+P
Sbjct: 310 NGFFKILRGEDHCGIESSIVAGEP 333
>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
Length = 330
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 123/340 (36%), Positives = 173/340 (50%), Gaps = 21/340 (6%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
++ L + R L S ++ IN+ TWTAG NF ++ Y+++
Sbjct: 9 VISALSVSWARPRLAPLSHEMVNFINKANTTWTAGHNF-RDVDYSYVKRLC--------- 58
Query: 65 SDRPLPGDRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
L G + +Y+ +P FDAREQWPNC T+ + D G+C + F A A S
Sbjct: 59 -GTFLKGPKLPVMVQYTEGLKLPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR CI+S + + +S++ + +CC C C+ G W+F G VTGG Y
Sbjct: 118 DRVCIQSNAKVSVEISSQDLLTCCDSC----GMGCNGGYPSAAWDFWTTDGLVTGGLYNS 173
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
GC+P TI PC HH + + P C + C +C P Y + +DKH +Y V
Sbjct: 174 HIGCRPYTIEPCEHHVNG-SRPPCTGEGGDTPNCDMKC-EPGYSPLYKEDKHFGKTSYSV 231
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
N++ I E+ +GP A F +Y+DF YKSGVY+H S + L H+ K++GWG ENG
Sbjct: 232 PSNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGG--HAIKILGWGEENG 289
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
PYWL N+W WGD G KILRG+ C E I AG P
Sbjct: 290 VPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329
>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 339
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 129/347 (37%), Positives = 179/347 (51%), Gaps = 19/347 (5%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ +L +L + Y ++YI+ IN A TWTAG NF + E+ L + L +K
Sbjct: 4 LVILLSVVLFSVYQTEQAYFLEESYIEMINDVATTWTAGVNFDPSTPEKDLIKML--GSK 61
Query: 61 YFDQSDRPLPGDRKTYDPEYSAT--VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ + KT+D Y+ +P FDAR +W +C TIG V D G C + F
Sbjct: 62 GVEAAKNASAHMFKTHDVAYNNNGYIPRTFDARRRWRHCKTIGEVRDQGYCGSCWAFGTS 121
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
AF+DR C+ + G N LS E + CC C C+ G + W + G VTGG
Sbjct: 122 SAFADRLCVATDGDFNELLSAEELTFCCHTC----GNGCNGGYPIKAWKYFSSHGLVTGG 177
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRT 236
+Y GC+P + PC + + SC Q + K + RCT YG + D HR
Sbjct: 178 NYKSGEGCEPYRVPPCPRNEDGTS--SCAGQPIEK---NHRCTRMCYGNQDLDYNDDHRF 232
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA-KLENYLHSGKLI 295
T Y+ +I+K+++ +GP A+F +YDDFY YKSGVY+ T NA KL H+ KLI
Sbjct: 233 TRDYYYL-TYGSIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGG--HAVKLI 289
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
GWG E G PYWL++N+W WGD G KI RG EC + AG P
Sbjct: 290 GWGVEEGIPYWLMVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVP 336
>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
Length = 330
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 126/326 (38%), Positives = 165/326 (50%), Gaps = 18/326 (5%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
L S + IN TWTAG+NF N+ Y++ K P +
Sbjct: 22 LPLLSPEMVQYINNADTTWTAGQNF-HNVDISYVKSLCGTLLKG--------PRLPELVQ 72
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+ ++PD FDAR QWPNC TI + D G+C + F A A SDR CI S G+ + +
Sbjct: 73 SDEDMSLPDSFDARLQWPNCPTIKEIRDQGSCGSCWAFGAAEAISDRYCIHSNGKVSVEI 132
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S E + SCC C C G W++ + G VTGG YG GC+P +I+PC HH
Sbjct: 133 SAEDLLSCCDAC----GMGCMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAPCEHH 188
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+ P PK C + C N Y + +DK TY V E I E+ +G
Sbjct: 189 VNGTRPPCTGEGDTPK--CVSEC-NAGYTPSYKKDKRFGKQTYSVPPKEQQIMTELYKNG 245
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P A F++Y+DF YK+GVY+H + L H+ K++GWG EN TPYWLV N+W WG
Sbjct: 246 PVEAAFSVYEDFLLYKTGVYQHVTGQMLGG--HAIKILGWGKENNTPYWLVANSWNTDWG 303
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
D G KILRGK EC E I AG P+
Sbjct: 304 DNGFFKILRGKDECGIESEIVAGIPR 329
>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
Length = 340
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 127/349 (36%), Positives = 181/349 (51%), Gaps = 22/349 (6%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ +L +L + Y +YID IN A TWTAG NF ++ E++ + L +K
Sbjct: 4 LVILLSVVLFSVYQTEQAYFLEKSYIDMINEVATTWTAGVNFDPSIPEDHFIKML--GSK 61
Query: 61 YFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
+ + + + KT D Y +P FDAR++W +C TIG V D G C + F
Sbjct: 62 GVESAKQASAHEFKTNDVAYDNHFGHIPRTFDARKKWRHCRTIGEVRDQGHCGSCWAFGT 121
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
AF+DR C+ + G N LS E + CC C + C G + W + K G VTG
Sbjct: 122 SSAFADRLCVATDGDFNELLSAEEITFCCHTCGF----GCHGGYPIKAWKYFSKHGLVTG 177
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHR 235
G+Y GC+P + PC +C + + K + RCT YG + D HR
Sbjct: 178 GNYKSGEGCEPYRVPPCPRDDKGNN--TCAGKPIEK---NHRCTRMCYGDQDLDYNDDHR 232
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGK 293
T ++ +I+K+++ +GP A+F +YDDF YKSGVY+ T NA +YL H+ K
Sbjct: 233 FTRDFYYL-TYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYEKTENA---SYLGGHAVK 288
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
LIGWG E GTPYWL++N+W WGD+G KI RG EC + AG P
Sbjct: 289 LIGWGVEEGTPYWLMVNSWNAQWGDKGLFKIRRGTNECGIDNSTTAGVP 337
>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
Length = 350
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 134/368 (36%), Positives = 185/368 (50%), Gaps = 45/368 (12%)
Query: 1 MIHILVFLLGCTLV-----RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL 55
M L+F +V L+ SD +I+ IN NTW AGRNFP +Y+ +
Sbjct: 1 MFRTLLFTCAICVVCVVASNVHLHPLSDEFIESINFNQNTWIAGRNFPKKTPLKYIYNLM 60
Query: 56 --IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
++D++ + R RKT P++FDARE W NC T+ + D G C +
Sbjct: 61 GTLSDSRMDNLPQRNYTFSRKT-------KYPNQFDAREHWKNCPTLKDIRDQGGCGSCW 113
Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
AAV A +DR CI SKG+++ S + V SCC C C G + R W + K G
Sbjct: 114 AVAAVSAMTDRMCILSKGKEHFYFSIKDVLSCCGYC----GNGCEGGVLTRAWIYYKKIG 169
Query: 174 SVTGGDYGDRTGCQPSTISPCSHH--------GSAPTLPSCENQKVPKL----------- 214
V+GG Y + GCQP TI PC+H + P P C+N +P +
Sbjct: 170 IVSGGGYKSKQGCQPYTIPPCNHLVWGEIEQCKNIPMTPKCKN--IPVIPEQCKYIPITP 227
Query: 215 KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
+C +C N Y + +DKHR Y V +E I KEI +GP T+ F +Y+DF +YK
Sbjct: 228 ECEKKC-NKNYKVCYSKDKHRGKSVYRVKKSE--IFKEIYEYGPVTSYFTVYEDFLNYKE 284
Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILR-GKYECAF 333
G+Y +TS KL LHS K+IGWG E G YWL N++ WGD+G KI+R G C
Sbjct: 285 GIYNYTSGQKLG--LHSVKIIGWGEERGIKYWLAANSFNTDWGDKGFFKIIREGVGSCGI 342
Query: 334 EYLIAAGK 341
+ AG+
Sbjct: 343 SDNVVAGR 350
>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
Length = 342
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 129/345 (37%), Positives = 180/345 (52%), Gaps = 23/345 (6%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
++ LLG V + Y + +ID IN +A TW AG NF N +EY+ + L +K
Sbjct: 11 VILLLG-VCVTEQAYFLEEDFIDSINEKAKTWKAGINFDPNTPKEYIVKLL--GSKGVQV 67
Query: 65 SDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
+ KT D Y +P +FDAR++W C TIG V D G C + A AF
Sbjct: 68 PHKLNLKMYKTDDEAYVNLFGRIPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAF 127
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
+DR CI + + N LS E + CC +C + +C G + W++ + G VTGGDY
Sbjct: 128 ADRLCIATNYEFNELLSAEELTFCCHLCGF----ACHGGYPIKAWSYFRRHGIVTGGDYQ 183
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTTLT 239
GC P + PC +C Q + K H RCT YG + D HR T
Sbjct: 184 SGEGCAPYRVPPCFSEEDGNN--TCRGQPMEK---HHRCTRMCYGDQEIDYDDDHRFTRD 238
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGW 297
Y+ +I+K+++ +GP A+ +YDDF YKSGVY+ + NA YL H+ KLIGW
Sbjct: 239 YYYL-TYASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENA---TYLGGHAVKLIGW 294
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G E+G PYWL++N+W WGD+G KI RG EC+ + + AG P
Sbjct: 295 GEEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVDNSMTAGVP 339
>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
Length = 339
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 172/339 (50%), Gaps = 19/339 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R + SD ++ +N+ TW AG NF N+ YL++ +
Sbjct: 11 LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P+ FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
CI + + +S E + +CC +C C+ G WNF ++G V+GG Y
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
GC+P +I PC HH + P PK C C P Y + QDKH +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
+E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
YWLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
Length = 335
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 124/343 (36%), Positives = 174/343 (50%), Gaps = 12/343 (3%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ + L LV + K +A + +N + + W A P +++ E +++ L+
Sbjct: 4 VVFASLVALATGLVIPIVPKTPEAITEYVNSKQSLWKA--EIPKHITIEQVKKRLMRTEF 61
Query: 61 YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
S P T+P FDAR QWP+C +I ++ D C + FAA A
Sbjct: 62 VAPHS----PDAEFVKHDIQEDTIPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEA 117
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
SDR CI S G N LS E V SCC C Y C G W +L K G TGG Y
Sbjct: 118 ASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCEGGYPINAWKYLVKSGFCTGGSY 173
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
+ GC+P +++PC T P+C C +CTN Y + DKH + Y
Sbjct: 174 EAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPACVNKCTNSNYNVAYKDDKHFGSTAY 233
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
V I+ EI+AHGP A F +Y+DFY YKSGVY HT+ +L H+ +++GWGT+
Sbjct: 234 AVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGEELGG--HAIRILGWGTD 291
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
NGTPYWLV N+W +WG+ G +I+RG EC E+ + G PK
Sbjct: 292 NGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334
>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
Length = 330
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 172/340 (50%), Gaps = 21/340 (6%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
++ L + R L S ++ IN+ TWTAG NF ++ Y+++
Sbjct: 9 VISALSVSWARPRLPPLSHEMVNFINKANTTWTAGHNF-RDVDYSYVKKLC--------- 58
Query: 65 SDRPLPGDRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
L G + +Y+ +P FDAREQWPNC T+ + D G+C + F A A S
Sbjct: 59 -GTFLKGPKLPVMVQYTEGLKLPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR CI S + + +S++ + +CC C C+ G W+F G VTGG Y
Sbjct: 118 DRVCIHSDAKVSVEISSQDLLTCCDSC----GMGCNGGYPSAAWDFWATEGLVTGGLYNS 173
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
GC+P TI PC HH + + P C + C +C P Y + QDKH +Y V
Sbjct: 174 HIGCRPYTIEPCEHHVNG-SRPPCSGEGGDTPNCDMKC-EPGYSPSYKQDKHFGKTSYSV 231
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
N+++I E+ +GP F +Y+DF YKSGVY+H S + + H+ K++GWG ENG
Sbjct: 232 PSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSPVGG--HAIKILGWGEENG 289
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
PYWL N+W WGD G KILRG+ C E I AG P
Sbjct: 290 VPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329
>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
Length = 330
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 123/330 (37%), Positives = 171/330 (51%), Gaps = 19/330 (5%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
R + SD ++ +N++ TW AG NF N+ YL++ + P P R
Sbjct: 11 RPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDLSYLKRLC---GTFLGG---PKPPQRV 63
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + + +P+ FDAREQWP C TI + D G+C + F AV A SDR CI + +
Sbjct: 64 KFAEDLN--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVS 121
Query: 135 RPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
+S E + +CC +C C+ G WNF ++G V+GG Y GC+P +I P
Sbjct: 122 VEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPP 177
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C HH + P PK C C P Y + QDKH +Y V +NE I EI
Sbjct: 178 CEHHVNGSRPPCTGEGDTPK--CSKSC-EPGYSPTYKQDKHYGYDSYSVSNNERDIMAEI 234
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
+GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W
Sbjct: 235 YKNGPVEGAFSVYADFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVGNSWN 292
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E + AG P+
Sbjct: 293 TDWGDNGFFKILRGQDHCGIESEVVAGIPR 322
>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
Length = 328
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 124/324 (38%), Positives = 175/324 (54%), Gaps = 16/324 (4%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
EL SD +++ + + TW AGRNF ++S+++L+ L K D PL K
Sbjct: 16 ELDPLSDEFLELLQSKQMTWKAGRNFAKDISKDFLKS-LNCVRKNPDIPKLPL----KNV 70
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
P + +P FDAREQWP+C I + D G C + +A +DR CI ++G +
Sbjct: 71 TP--TKEIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFR 128
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
S+E VA+CC C +C G + +G V+GG + GCQP ++ C H
Sbjct: 129 FSSENVAACCTEC----GNACYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVEECEH 184
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
H P P CE +P+L C C + YG+ + +D Y + + I++EI+ +
Sbjct: 185 HIEGPR-PPCEGD-MPELVCSETC-HEEYGKTYEEDLEYGLEAYVLPQDVTQIQEEIMTN 241
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
GP TA FA+YDDF YKSGVY+H + L+ Y H+ ++IGWG E GTPYWLV N+W W
Sbjct: 242 GPVTAAFAVYDDFLSYKSGVYQHETGL-LDGY-HAVRVIGWGEEEGTPYWLVANSWNTDW 299
Query: 317 GDRGTVKILRGKYECAFEYLIAAG 340
GD G KILRG EC FE +AA
Sbjct: 300 GDNGLFKILRGSDECEFEGDMAAA 323
>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
Length = 339
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 130/347 (37%), Positives = 177/347 (51%), Gaps = 29/347 (8%)
Query: 5 LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---L 55
L+ L C +V R L SD +D +N+ TW AG NF ++ YLR+ +
Sbjct: 4 LLATLSCLVVLTNAQSRPPLQLLSDELVDYVNKRNTTWKAGHNF-YHVEPSYLRRLCGTI 62
Query: 56 IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
+ K LP R ++ + +P+ FDARE WPNC TI + D G+C + F
Sbjct: 63 LGGPK--------LP-QRVSFAED--MVLPENFDAREHWPNCPTIKEIRDQGSCGSCWAF 111
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
AV A SDR CI + G N +S E + +CC D C+ G WNF K+G V
Sbjct: 112 GAVEAISDRICILTNGHVNVEVSAEDMLTCCGDQCGD---GCNGGFPAEAWNFWTKQGLV 168
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
+GG Y GC+P +I PC HH + P PK C C P Y + +DKH
Sbjct: 169 SGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYTPSYKEDKHY 225
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
+Y V ++E I EI +GP A F+++ DF YKSGVY+H + + H+ +++
Sbjct: 226 GCNSYSVSNSEKEIMAEIYKNGPVEAAFSVFSDFLQYKSGVYQHVTGEMMGG--HAVRIL 283
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
GWG EN TPYWLV N+W WGD G KILRG+ C E + AG P
Sbjct: 284 GWGVENDTPYWLVGNSWNTDWGDHGFFKILRGRDHCGIESEVVAGIP 330
>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
Length = 337
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 121/347 (34%), Positives = 185/347 (53%), Gaps = 21/347 (6%)
Query: 1 MIHILVFL--LGCTLVRGELYKFSDAY-IDQINREANTWTAGRNFPANLS-EEYLRQFLI 56
M ++++F T++ L++ D + I+ IN + WTAG P+ + +++ R+ +
Sbjct: 1 MEYLILFFAYFSTTVLASNLHQLLDFHEINHINSIQSLWTAG---PSKFAFQKFQRRLMR 57
Query: 57 ADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
++ +S+ L DRK + T+P+ +D R+ W C ++ ++ D C + A
Sbjct: 58 SEHVKSHKSEDIL--DRKVLE-----TIPESYDVRDHWSKCISVDNIRDQSDCGSCWAVA 110
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
A SDR CI S G N +S E + SCC C C G + W + K+G V+
Sbjct: 111 AAETISDRLCIASNGSINTFVSAEDLLSCCTSC----GDGCDGGYPLQAWRYWVKQGLVS 166
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTN-PTYGRGFFQDKHR 235
GG Y + GC+P +I+PC + T P C Q+ +C + CT+ +Y + +DKH
Sbjct: 167 GGSYESQYGCKPYSIAPCGQTVNGVTWPKCPAQEEATPECASHCTSKSSYSVAYEKDKHY 226
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
Y V E I+ EIL HGP A F +Y DFY YKSG+Y H S +L H+ K++
Sbjct: 227 GLSAYPVGRKEAQIQTEILQHGPVEAGFLVYSDFYRYKSGIYTHVSGQELGG--HAVKIL 284
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
GWG ENGT YWLV N+W +WG++G +ILRG+ EC E + AG P
Sbjct: 285 GWGVENGTKYWLVANSWNINWGEKGYFRILRGRNECGIESAVVAGIP 331
>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
Length = 339
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/339 (35%), Positives = 172/339 (50%), Gaps = 19/339 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R + SD ++ +N+ TW AG NF N+ YL++ +
Sbjct: 11 LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P+ FDAREQWP C T+ + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTVKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
CI + + +S E + +CC +C C+ G WNF ++G V+GG Y
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
GC+P +I PC HH + P PK C C P Y + QDKH +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
+E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
YWLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
[Rhipicephalus pulchellus]
Length = 346
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/333 (39%), Positives = 166/333 (49%), Gaps = 16/333 (4%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD 72
L+ E SD I IN TW AGRN P Y+R L + LP
Sbjct: 27 LIPAETDASSDKMIQYINYLNTTWQAGRN-PGFEDPAYVRGLLGVSP---ENHRYRLP-- 80
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK-- 130
+ D +P+ FD+RE WP C TIG + D G+C + F AV A SDR CI S
Sbjct: 81 ERRLDLSSLGPLPENFDSRENWPECTTIGEIRDQGSCGSCWAFGAVEAMSDRTCIHSPSG 140
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
G + LS + + SCC+ C C+ G W+F K G VTGG+Y GC P
Sbjct: 141 GPKRVHLSADDLLSCCRTC----GNGCNGGFPGSAWSFWVKTGIVTGGNYDSDDGCMPYP 196
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
I C HH + TL C+ + P +C C Y + DKH +Y V E I+
Sbjct: 197 IKACDHHVNG-TLGPCDKKIPPTPRCVHMCRK-GYDVDYHDDKHYGKSSYSVPSEEKQIQ 254
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
EI+ +GP A F +Y DF HYKSGVY+ ++ L H+ +L+GWG ENG PYWL N
Sbjct: 255 AEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGG--HAIRLLGWGVENGVPYWLAAN 312
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+W WGD+G KILRG EC E + AG PK
Sbjct: 313 SWNTEWGDKGFFKILRGSDECGIEDDVVAGLPK 345
>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
Length = 339
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/344 (38%), Positives = 175/344 (50%), Gaps = 23/344 (6%)
Query: 5 LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
L+ L C +V R SD ++ +N+ TW AG NF N+ YLR+
Sbjct: 4 LLACLSCLVVLAGAQSRPPFQLLSDELVNYVNKRNTTWKAGHNF-HNVDPSYLRRLC--- 59
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ P ++ + E + +P+ FDAREQWPNC TI + D G+C + F AV
Sbjct: 60 GTFLGGPKLP----QRVWFAE-NMVLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI++ G N +S E + +CC D C+ G WNF K+G V+GG
Sbjct: 115 EAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGD---GCNGGFPAEAWNFWTKQGLVSGG 171
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y GC+P +I PC HH + P PK C C P Y + +DKH
Sbjct: 172 LYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKFC-EPGYTPSYKEDKHYGCS 228
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y V +E I EI +GP A F +Y DF YKSGVY+H + + H+ +++GWG
Sbjct: 229 SYSVSSSEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGG--HAVRILGWG 286
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 287 VENGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIP 330
>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
Length = 334
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 174/336 (51%), Gaps = 28/336 (8%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
+ Y + YI+QIN A TW AG NF LS + + L +K + + P KT+
Sbjct: 17 QAYFLEEDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPVMFKTH 74
Query: 77 DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
D Y S +P FDAR++W C TIG V D G C + F AF+DR CI + G+
Sbjct: 75 DEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEF 134
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N LS E +A CC C + CS G R W K G VTGG+Y GCQP +SP
Sbjct: 135 NELLSPEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSP 190
Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
C +G+ +C + P K H RCT YG F +D H T Y++
Sbjct: 191 CPLDEYGNN----TCSGK--PAEKNH-RCTQMCYGNQNLDFKEDHHYTRDAYYL--TYGT 241
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
I+ ++LA+GP A+F +YDDF YKSGVY NA YL H+ KLIGWG E G PYW
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 298
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
L++N+W WGD+G KI RG EC + G P
Sbjct: 299 LLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334
>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
Length = 330
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 121/331 (36%), Positives = 171/331 (51%), Gaps = 21/331 (6%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
R L S ++ IN+ TW AG NF N+ Y+++ L G +
Sbjct: 19 RPRLQPLSSEMVNYINKFNTTWKAGHNF-HNVDYSYIQRLC----------GTMLKGPKL 67
Query: 75 TYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
+Y+ +P+ FDAREQWPNC T+ + D G+C + F A A SDR CI S +
Sbjct: 68 PVMVQYTGDLKLPEEFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAK 127
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
+ +S+E + +CC C C+ G W+F K G V+GG Y GC+P TI+
Sbjct: 128 VSVEISSEDLLTCCMSC----GMGCNGGYPSAAWDFWTKEGLVSGGLYDSHIGCRPYTIA 183
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
PC HH + + PSC + +C T+C Y + +DKH +Y V +E+ I+ E
Sbjct: 184 PCEHHVNG-SRPSCTGEGGDTPQCITKC-EAGYTPSYKEDKHFGKTSYTVLSDEEQIQSE 241
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
I +GP F +Y+DF YKSGVY+H S + + H+ K++GWG E+G PYWL N+W
Sbjct: 242 IFKNGPVEGAFIVYEDFVLYKSGVYQHVSGSAVGG--HAIKILGWGVEDGVPYWLCANSW 299
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G K LRG C E + AG PK
Sbjct: 300 NTDWGDNGFFKFLRGSDHCGIESEVVAGIPK 330
>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
Length = 333
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 176/348 (50%), Gaps = 24/348 (6%)
Query: 2 IHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
+ +L FL R + S ++ IN+ TW AG NF AN Y+++
Sbjct: 5 VVVLCFLASIASARHLPFFAPLSGDMVNYINKMNTTWKAGHNF-ANADLHYVKRLC---- 59
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ P +K + +PD FD+R WPNC TI V D G+C + F AV
Sbjct: 60 ----GTHLNGPQLQKRFGFADGMELPDSFDSRAAWPNCPTIREVRDQGSCGSCWAFGAVE 115
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C+ + G+ N +S E + SCC ++ C+ G W F + G V+GG
Sbjct: 116 AISDRVCVHTNGKVNVEVSAEDLLSCCG---FECGMGCNGGYPSGAWKFWTETGLVSGGL 172
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTN---PTYGRGFFQDKHRT 236
Y GC+P +I PC HH + + P+C+ ++ KC +C + P YG DKH
Sbjct: 173 YDSHLGCRPYSIPPCEHHVNG-SRPACKGEEGDTPKCVKQCEDGYAPVYG----SDKHFG 227
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
+Y V +E I EI +GP F +Y DF YKSGVY+H + +L H+ K++G
Sbjct: 228 ATSYGVPSSEKEIMAEIYKNGPVEGAFLVYADFPMYKSGVYQHETGEELGG--HAIKILG 285
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG ENGTPYWL N+W WGD G KILRGK C E I AG PKN
Sbjct: 286 WGVENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAGIPKN 333
>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
Length = 339
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/330 (38%), Positives = 171/330 (51%), Gaps = 25/330 (7%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
+ SD I+ IN+ TW AGRNF N+ YL++ ++ K LP +R
Sbjct: 23 FHPLSDDLINYINKRNTTWQAGRNF-HNVDISYLKRLCGTIMGGPK--------LP-ERV 72
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P+ FDAREQW NC TI + D G+C + F AVGA SDR CI + G N
Sbjct: 73 AFAEDME--LPENFDAREQWSNCPTIKQIRDQGSCGSCWAFGAVGAMSDRLCIHTNGHVN 130
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CC D C+ G WNF K+G V+GG Y GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGSQCGD---GCNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPC 187
Query: 195 SHHGSAPTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
HH + + P C + PK T+ Y + +DKH +Y V +NE I EI
Sbjct: 188 EHHVNG-SRPQCTGEGDTPKC---TKSCEAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEI 243
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
+GP F ++ DF YKSGVYKH + + H+ +++GWG EN PYWLV N+W
Sbjct: 244 YKNGPVEGAFTVFSDFLTYKSGVYKHEAGDIMGG--HAIRILGWGVENSVPYWLVANSWN 301
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E I AG P+
Sbjct: 302 VDWGDNGLFKILRGEDHCGIESEIVAGIPR 331
>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
Length = 359
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/347 (36%), Positives = 180/347 (51%), Gaps = 28/347 (8%)
Query: 5 LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---L 55
L+ L C +V R SD ++ +N+ TW AG NF N+ Y+++ +
Sbjct: 27 LLTTLSCLVVLTSARNRPNFPPLSDELVNYVNKRNTTWKAGHNF-HNVDLSYVKRLCGTI 85
Query: 56 IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
+ K LP ++ + E +P+ FDAREQWPNC TI + D G+C + F
Sbjct: 86 LGGPK--------LP--QRVWLAE-DLVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAF 134
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
AV A SDR CI + G N +S E + +CC + + C+ G WNF K+G V
Sbjct: 135 GAVEAISDRICILTNGNVNVEVSAEDLLTCCG---FQCGEGCNGGFPSGAWNFWTKKGLV 191
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
+GG Y GC+P +I PC HH + + P C + KC +R Y + +DKH
Sbjct: 192 SGGLYDSHVGCRPYSIPPCEHHVNG-SRPPCTGEGGSTPKC-SRICEAGYTPSYKEDKHF 249
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
+Y V +E I EI +GP A F++Y DF YKSGVY+H + + H+ +++
Sbjct: 250 GCSSYSVPSSETEIMAEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMMGG--HAVRIL 307
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
GWG E+GTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 308 GWGVEDGTPYWLVGNSWNTDWGDSGFFKILRGQDHCGIESEIVAGLP 354
>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
AltName: Full=Cathepsin B1; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R + SD ++ +N+ TW AG NF N+ YL++ +
Sbjct: 11 LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
CI + + +S E + +CC +C C+ G WNF ++G V+GG Y
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
GC+P +I PC HH + P PK C C P Y + QDKH +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
+E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
YWLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/329 (37%), Positives = 173/329 (52%), Gaps = 18/329 (5%)
Query: 14 VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDR 73
R +L S ID +N+ TWTAG+NF N +++ K LP
Sbjct: 18 ARPQLPLLSLEMIDFVNKLNTTWTAGQNF-HNKDSSFVKGLCGTILK-----GPKLP--E 69
Query: 74 KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
+D E +PD FD REQWPNC T+ + D G C + F A A SDR CI+S G+
Sbjct: 70 LAHDVE-GIKLPDSFDPREQWPNCPTLKQIRDQGNCGSCWAFGAAEAISDRICIQSGGKI 128
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
+ +S E + +CC C C G W F +G VTGG + + GC+P T++P
Sbjct: 129 SLEISAEDLLTCCDEC----GMGCFGGFPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAP 184
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C HH + + P C+ + V KC T+C N Y + +DKH +Y + ++ I E+
Sbjct: 185 CEHHVNG-SRPPCQGE-VETPKCVTQCNN-GYSLSYPKDKHFGQRSYSIPSQQEQIMTEL 241
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
+GP A F++Y DF YK+GVY+H + L H+ K++GWG ENGTPYWLV N+W
Sbjct: 242 YKNGPVEAAFSVYADFLLYKNGVYQHVTGDMLGG--HAVKILGWGEENGTPYWLVANSWN 299
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WGD+G KI RG EC E + AG P
Sbjct: 300 SDWGDKGFFKIKRGNDECGIESEMVAGAP 328
>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
Length = 339
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 122/330 (36%), Positives = 169/330 (51%), Gaps = 19/330 (5%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
R + SD ++ +N+ TW AG NF N+ YL++ + P P R
Sbjct: 20 RPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDVSYLKKLC---GTFLGG---PKPPQRV 72
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P+ FDAREQWP C TI + D G+C + F AV A SDR CI + +
Sbjct: 73 MFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVS 130
Query: 135 RPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
+S E + +CC +C C+ G WNF ++G V+GG Y GC+P +I P
Sbjct: 131 VEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPP 186
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C HH + P PK C C P Y + QDKH +Y V ++E I EI
Sbjct: 187 CEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSERDIMAEI 243
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
+GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W
Sbjct: 244 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWN 301
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E + AG P+
Sbjct: 302 TDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 328
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 121/324 (37%), Positives = 165/324 (50%), Gaps = 18/324 (5%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
S ID IN+ TWTAG+NF N+ Y++ + P +
Sbjct: 23 LSSEMIDFINKVNTTWTAGQNF-HNVDSSYVKGLC---GTFLKGPKLP-----QVLHNTE 73
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+PD FDAR+QWP+C TI + D G+C + F A A SDR CI S + + +S E
Sbjct: 74 GIRLPDSFDARKQWPDCRTIQQIRDQGSCGSCWAFGAAEAISDRLCIHSGSKISLEISAE 133
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC C CS G W F K+G VTGG G GC+P +I+PC HH +
Sbjct: 134 DLLSCCDEC----GMGCSGGYPSSAWEFWTKKGLVTGGLCGSEVGCRPYSIAPCEHHVNG 189
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P Q+ PK C +C + Y + +DKH +Y + ++ I E+ +GP
Sbjct: 190 TRPPCQGTQETPK--CEKKCID-GYLTSYLKDKHFGKRSYSLPSQQEQIMTELYKNGPVE 246
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A F +Y DF YK+GVY+H + L H+ K++GWG E+GTPYWL N+W WGD+G
Sbjct: 247 AAFTVYADFLLYKTGVYQHVTGEVLGG--HAIKILGWGEESGTPYWLAANSWNGDWGDKG 304
Query: 321 TVKILRGKYECAFEYLIAAGKPKN 344
KI RG EC E + AG P N
Sbjct: 305 FFKIKRGNDECGIESEMVAGTPLN 328
>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R + SD ++ +N+ TW AG NF N+ YL++ +
Sbjct: 11 LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
CI + + +S E + +CC +C C+ G WNF ++G V+GG Y
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
GC+P +I PC HH + P PK C C P Y + QDKH +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
+E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
YWLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
Length = 335
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 132/348 (37%), Positives = 175/348 (50%), Gaps = 27/348 (7%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L LL T R LY SD ++ +N++ TW AG NF N+ Y+++ A
Sbjct: 4 LLATLSCLLVLTSARSSLYFPPLSDELVNFVNKQNTTWKAGHNF-YNVDLSYVKKLCGAI 62
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
L G + ++A V P+ FDAREQWPNC TI + D G+C + F
Sbjct: 63 ----------LGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFG 112
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT--WNFLHKRGS 174
AV A SDR CI S G+ N +S E + + F + WNF K+G
Sbjct: 113 AVEAISDRICIHSNGRVNVEVSAEDM-----LTCCGGECGDGCNGGFPSGAWNFWTKKGL 167
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
V+GG Y GC+P +I PC HH + P PK C C P Y + +DKH
Sbjct: 168 VSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKTC-EPGYSPSYKEDKH 224
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V +NE I EI +GP F++Y DF YKSGVY+H S + H+ ++
Sbjct: 225 FGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGG--HAIRI 282
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+GWG ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 283 LGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
Length = 334
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 173/336 (51%), Gaps = 28/336 (8%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
+ Y + YI+QIN A TW AG NF LS + + L +K + + P KT+
Sbjct: 17 QAYFLEEDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 74
Query: 77 DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
D Y S +P FDAR++W C TIG V D G C + F AF+DR CI + G+
Sbjct: 75 DEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEF 134
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N LS E +A CC C + CS G R W K G VTGG+Y GCQP + P
Sbjct: 135 NELLSPEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPP 190
Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
C +G+ +C + P K H RCT YG F +D H T Y++
Sbjct: 191 CPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQDLDFKEDHHYTRDAYYL--TYGT 241
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
I+ +ILA+GP A+F +YDDF YKSGVY NA YL H+ KLIGWG E G PYW
Sbjct: 242 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 298
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
L++N+W WGD+G KI RG EC + G P
Sbjct: 299 LLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
Length = 339
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 126/329 (38%), Positives = 171/329 (51%), Gaps = 23/329 (6%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
+ SD I+ IN++ TW AGRNF N+ YL++ ++ K LPG R
Sbjct: 23 FHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-RV 72
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P+ FDAREQW NC TIG + D G+C + F AV A SDR CI + G+ N
Sbjct: 73 AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CC I D C+ G W+F K+G V+GG Y GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPC 187
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + P P +C+ C Y + +DKH +Y V ++ I EI
Sbjct: 188 EHHVNGSRPPCTGEGDTP--RCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F ++ DF YKSGVYKH + + H+ +++GWG ENG PYWL N+W
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILGWGVENGVPYWLAANSWNL 302
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
Length = 340
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 173/336 (51%), Gaps = 28/336 (8%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
+ Y + YI+QIN A TW AG NF LS + + L +K + + P KT+
Sbjct: 20 QAYFLEEDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 77
Query: 77 DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
D Y S +P FDAR++W C TIG V D G C + F AF+DR CI + G+
Sbjct: 78 DEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEF 137
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N LS E +A CC C + CS G R W K G VTGG+Y GCQP + P
Sbjct: 138 NELLSPEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPP 193
Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
C +G+ +C + P K H RCT YG F +D H T Y++
Sbjct: 194 CPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQDLDFKEDHHYTRDAYYL--TYGT 244
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
I+ +ILA+GP A+F +YDDF YKSGVY NA YL H+ KLIGWG E G PYW
Sbjct: 245 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 301
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
L++N+W WGD+G KI RG EC + G P
Sbjct: 302 LLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 337
>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R + SD ++ +N+ TW AG NF N+ YL++ +
Sbjct: 11 LLVLANARSRPSFHPVSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
CI + + +S E + +CC +C C+ G WNF ++G V+GG Y
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
GC+P +I PC HH + P PK C C P Y + QDKH +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
+E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
YWLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 338
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 122/325 (37%), Positives = 167/325 (51%), Gaps = 16/325 (4%)
Query: 21 FSDAYIDQINREAN-TWTAGR--NFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
FS+ ++++ N+ N TW A R F + E L+ L A + P+ +T D
Sbjct: 27 FSEKFVEEFNKRYNSTWRAARYQKF-EEMDPETLQGHLGALIDEPLWAKLPIKNVEQTND 85
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
P +P+ FD+REQWPNC +I + D C + FAA +SDR CI S + +
Sbjct: 86 P-----IPESFDSREQWPNCNSIKTIRDQSTCGSCWAFAATETYSDRICIASNQELQTSI 140
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S+E + CC C C G W ++ G TGG YGD + C+P PC HH
Sbjct: 141 SSEDLLECCATC----GNGCQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPPCDHH 196
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
P C K P KC +C + + + QD H + Y + +N +AI++EI+AHG
Sbjct: 197 -VVGQYPPCGPIK-PTPKCVKQCNSQYTEKTYQQDLHHPSKVYQLPNNAEAIQREIMAHG 254
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P A+F + DF YKSGVY K E HS K+IGWG E GTPYWL+ N+W WG
Sbjct: 255 PVQASFRVASDFLTYKSGVYIRDPKLKYEGG-HSVKIIGWGVEQGTPYWLIANSWNEDWG 313
Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
+ G K+LRGK EC E + AG P
Sbjct: 314 ENGLFKMLRGKNECGIEAEVVAGLP 338
>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 120/344 (34%), Positives = 178/344 (51%), Gaps = 19/344 (5%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+I ++ + T+ + +++ ++I IN TW AG NF +++ +Y+R A
Sbjct: 4 IIFGVLIAMVFTMPKNSMFQ---SHIHTINNMKTTWEAGENFGPHITSDYIRNLCGALKT 60
Query: 61 YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPN-CGTIGHVPDTGACAAPHIFAAVG 119
+ ++ +D +P FDAR++W + C ++ V D G C + F A
Sbjct: 61 PLSKKLPIKDLSKEVHD------LPIEFDARKEWGSICPSLLEVRDQGECGSCWAFGAAE 114
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A +DR CI +KG+ +STE + +CC C + C+ G W F +G VTGG
Sbjct: 115 AMTDRICIATKGKNQVRISTEDLLTCCDSCGF----GCNGGYPQSAWEFFKTKGIVTGGP 170
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y GCQP I C HH P N +P KC C Y + DKH +
Sbjct: 171 YNSHKGCQPYAIPACDHHVPHSKNPC--NGSLPTPKCEKVCEK-GYNITYKNDKHYGVTS 227
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y ++++++ I +EI+ +GP A F ++ DF +YKSGVY+H S +L H+ K++GWG
Sbjct: 228 YSINNDQNEIMREIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGG--HAIKILGWGV 285
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
EN TPYWLV N+W P WGD G KILRG EC E + AG PK
Sbjct: 286 ENNTPYWLVANSWNPSWGDNGFFKILRGSDECGIEDEVVAGLPK 329
>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
Length = 334
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 128/340 (37%), Positives = 181/340 (53%), Gaps = 17/340 (5%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
+ + L L SD I IN A TW A R FPAN SEEY L+ Y +
Sbjct: 8 VCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSRGYKN 66
Query: 64 QSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
++ + K YDP Y P +FD+R W +C IGH+ D G C + F+ GAF+
Sbjct: 67 YTNEV---EIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFA 123
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR C+ + G+ N+ LS E +A C D K C G + W + +G TGGDYG
Sbjct: 124 DRLCVSTGGKFNQLLSPEELA----FCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGT 179
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
+ GC P + PC + T C Q P + H +C YG+ Q++++T Y V
Sbjct: 180 KEGCMPYKVPPCYNKQGKNT---CGGQ--PMERNH-QCPKTCYGKTTVQNRYKTKSEY-V 232
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
++ I+++I+ +GP A+F +YDD YKSG+Y+ T AK + HS K+IGWG +NG
Sbjct: 233 INSIKTIERDIMTYGPVEASFDVYDDLSAYKSGIYRKTPKAKYQG-GHSIKIIGWGQQNG 291
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
TPYWL +N+W WG+ GT KI++G+ EC E + AG P
Sbjct: 292 TPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIP 331
>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
Length = 335
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 123/327 (37%), Positives = 169/327 (51%), Gaps = 24/327 (7%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
SD I+ IN+ A TW A R FPAN+S+EY+ L + ++ + D DP Y
Sbjct: 25 LSDERIEYINKIAKTWKAERYFPANMSKEYITGLLGSRGYKNYLNEVEIKKD----DPLY 80
Query: 81 SAT--VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ FDARE W C IGHV D G C + F GAF+DR C+ + G N LS
Sbjct: 81 TKNNNKIKHFDARENWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGGGFNEQLS 140
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
E + CC C C G+ + W + +RG TGGDYG GC P + PC
Sbjct: 141 AEKLTFCCWTC----GLGCQGGNPIKAWKYFKRRGITTGGDYGSNEGCAPYKVPPCYDDQ 196
Query: 199 S---APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
P+ N K P+ YG +++++ Y V D+ I+++I
Sbjct: 197 GEFLCQGKPTEHNHKCPR---------ACYGNSTVENRYKVESIY-VLDSFKTIEQDIRT 246
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP A+F +YDDF YKSG+Y+ T NA L HS KLIGWG E+G PYWL++N+W
Sbjct: 247 YGPVEASFDVYDDFITYKSGIYQKTPNA-LYVGGHSVKLIGWGEEDGIPYWLLVNSWSKF 305
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG++GT +I++G+ EC E AG P
Sbjct: 306 WGEQGTFRIIKGRNECGIERSATAGIP 332
>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
Length = 340
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R + SD ++ +N+ TW AG NF N+ YL++ +
Sbjct: 11 LLVLANARSRPSFHPVSDELVNYVNKRNTTWQAGHNF-YNVDMGYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
CI + + +S E + +CC +C C+ G WNF ++G V+GG Y
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
GC+P +I PC HH + P PK C C P Y + QDKH +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
+E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
YWLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
Length = 339
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R + SD ++ +N+ TW AG NF N+ YL++ +
Sbjct: 11 LLVLANARSRPSFHPVSDELVNYVNKRNTTWQAGHNF-YNVDMGYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
CI + + +S E + +CC +C C+ G WNF ++G V+GG Y
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
GC+P +I PC HH + P PK C C P Y + QDKH +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
+E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
YWLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
Length = 339
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 124/338 (36%), Positives = 171/338 (50%), Gaps = 17/338 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R + SD ++ +N+ TW AG NF N+ YL++ +
Sbjct: 11 LLVLANARSRPSFHPVSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
CI + + +S E + +CC R D C+ G WNF ++G V+GG Y G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCG-SRCGDG--CNGGYPAEAWNFWTRKGLVSGGLYESHVG 178
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
C+P +I PC HH + P PK C C P Y + QDKH +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
Length = 340
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 172/336 (51%), Gaps = 28/336 (8%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
+ Y YI+QIN A TW AG NF LS + + L +K + + P KT+
Sbjct: 20 QAYFLEKDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 77
Query: 77 DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
D Y S +P FDAR++W C TIG V D G C + F AF+DR CI + G+
Sbjct: 78 DEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEF 137
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N LS E +A CC C + CS G R W K G VTGG+Y GCQP + P
Sbjct: 138 NELLSPEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPP 193
Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
C +G+ +C + P K H RCT YG F +D H T Y++
Sbjct: 194 CPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQDLDFKEDHHYTRDAYYL--TYGT 244
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
I+ +ILA+GP A+F +YDDF YKSGVY NA YL H+ KLIGWG E G PYW
Sbjct: 245 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 301
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
L++N+W WGD+G KI RG EC + G P
Sbjct: 302 LLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 337
>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
3.2 Angstrom Resolution
gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
Resolution
gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
Angstrom Resolution
Length = 317
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 122/330 (36%), Positives = 168/330 (50%), Gaps = 19/330 (5%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
R + SD ++ +N+ TW AG NF N+ YL++ + P P R
Sbjct: 4 RPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG---PKPPQRV 56
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FDAREQWP C TI + D G+C + F AV A SDR CI + +
Sbjct: 57 MFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVS 114
Query: 135 RPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
+S E + +CC +C C+ G WNF ++G V+GG Y GC+P +I P
Sbjct: 115 VEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPP 170
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C HH + P PK C C P Y + QDKH +Y V ++E I EI
Sbjct: 171 CEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 227
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
+GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W
Sbjct: 228 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWN 285
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E + AG P+
Sbjct: 286 TDWGDNGFFKILRGQDHCGIESEVVAGIPR 315
>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
Length = 340
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 128/344 (37%), Positives = 176/344 (51%), Gaps = 22/344 (6%)
Query: 5 LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
L+ L C +V R SD ++ +N+ TW AG NF N+ Y+++
Sbjct: 4 LLATLSCLVVLTNARSRPYFQPLSDELVNYVNKRNTTWKAGHNF-HNVDLSYVKRLC--- 59
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ P ++ + E +P+ FDAREQWPNC TI + D G+C + F AV
Sbjct: 60 GTFLGGPKLP----QRVWFAE-DVVLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI++ G + +S E + +CC D C+ G WNF K+G V+GG
Sbjct: 115 EAISDRICIRTNGHVSVEVSAEDMLTCCGDQCGD---GCNGGFPAEAWNFWTKQGLVSGG 171
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y GC+P +I PC HH + + P C + KC C P Y + +DKH
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNG-SRPPCTGEGGDTPKCSKIC-EPGYSPSYKEDKHYGCS 229
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y V +E I EI +GP A F +Y DF YKSGVY+H + + H+ +++GWG
Sbjct: 230 SYSVSSSEKEIMAEIFKNGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGG--HAVRILGWG 287
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 288 VENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 331
>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
Length = 330
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 122/342 (35%), Positives = 172/342 (50%), Gaps = 19/342 (5%)
Query: 4 ILVFLLGCTLVRGE--LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
+LV ++ RG ++ S I+ IN+ TW AG NF ++ Y++
Sbjct: 6 LLVLAASLSVSRGRPHIHPLSSDMINYINKLNTTWKAGHNF-HDVDYGYVKNLC---GTL 61
Query: 62 FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
P+ +P +FDAREQWP C T+ + D G+C + F A A
Sbjct: 62 LKGPKLPI-----MVQSAGGMKLPKQFDAREQWPECPTLKEIRDQGSCGSCWAFGAAEAI 116
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
SDR CI +KG+ + +S++ + +CC C C+ G W F ++G VTGG Y
Sbjct: 117 SDRICIHTKGKVSVEISSQDLLTCCDSC----GMGCNGGYPANAWEFWTEQGLVTGGLYN 172
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
GC+P TI PC HH + + P C + +C T+C Y + +DKH +Y
Sbjct: 173 SHIGCRPYTIEPCEHHVNG-SRPPCTGEGGDTPECVTQC-EAGYTPSYQKDKHYGKTSYG 230
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
V E+ I+ EI +GP F +Y+DF YKSGVY+H + + L H+ K+IGWG EN
Sbjct: 231 VPSEEEQIQSEIYKNGPVEGAFIVYEDFPSYKSGVYQHVTGSALGG--HAIKMIGWGEEN 288
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G PYWL N+W WGD G KILRG C E + AG PK
Sbjct: 289 GVPYWLCANSWNTDWGDNGFFKILRGSNHCGIESEVVAGIPK 330
>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 128/345 (37%), Positives = 179/345 (51%), Gaps = 23/345 (6%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
++ LLG V + Y + +ID IN +A TW AG NF N +EY+ + L +K
Sbjct: 11 VILLLG-VCVTEQAYFLEEDFIDSINEKAKTWKAGINFDPNTPKEYIVKLL--GSKGVQV 67
Query: 65 SDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
+ KT D Y +P +FDAR++W C TIG V D G C + A AF
Sbjct: 68 PHKLNLKMYKTDDEAYVNLFGRIPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAF 127
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
+DR CI + + N LS E + CC +C + +C G + W++ + G VTGG Y
Sbjct: 128 ADRLCIATNYEFNELLSAEELTFCCHLCGF----ACHGGYPIKAWSYFRRHGIVTGGGYQ 183
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTTLT 239
GC P + PC +C Q + K H RCT YG + D HR T
Sbjct: 184 SGEGCAPYRVPPCFSEEDGNN--TCRGQPMEK---HHRCTRMCYGDQEIDYDDDHRFTRD 238
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGW 297
Y+ +I+K+++ +GP A+ +YDDF YKSGVY+ + NA YL H+ KLIGW
Sbjct: 239 YYYL-TYASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENA---TYLGGHAVKLIGW 294
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G E+G PYWL++N+W WGD+G KI RG EC+ + + AG P
Sbjct: 295 GEEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVDNSMTAGVP 339
>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
Length = 334
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 133/336 (39%), Positives = 174/336 (51%), Gaps = 28/336 (8%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
+ Y + YI+QIN A TW AG NF LS + + L +K + + P KT+
Sbjct: 17 QAYFLEEDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 74
Query: 77 DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
D Y S +P FDAR++W C TIG V D G C + F AF+DR CI + G+
Sbjct: 75 DEAYNNWSNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEF 134
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N LS E +A CC C + CS G+ + W K G VTGG+Y GCQP + P
Sbjct: 135 NELLSPEELAFCCHKCGF----GCSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPP 190
Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
C +G+ +C + P K H RCT YG F +D H T Y++
Sbjct: 191 CPLDEYGNN----TCSGK--PAEKNH-RCTRMCYGNQNLDFKEDHHYTRDAYYL--TYGT 241
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
I+ ++LA+GP A+F +YDDF YKSGVY NA YL H+ KLIGWG E G PYW
Sbjct: 242 IQYDVLAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 298
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
L++N+W WGD+G KI RG EC + G P
Sbjct: 299 LLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
Length = 333
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 131/349 (37%), Positives = 176/349 (50%), Gaps = 24/349 (6%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L FL R Y S ++ IN+ TW AG NF AN Y+++
Sbjct: 4 LVVALCFLASIASSRHLPYFAPLSHDMVNYINKVNTTWKAGHNF-ANADLHYVKRLCGTL 62
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
K P +K + +PD FD+R WPNC TI + D G+C + F AV
Sbjct: 63 LKG--------PQLQKRFGFADGLELPDSFDSRAAWPNCPTIREIRDQGSCGSCWAFGAV 114
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR C+ + G+ N +S E + SCC + C+ G W F + G V+GG
Sbjct: 115 EAISDRVCVHTNGKVNVEVSAEDLLSCCGD---ECGMGCNGGYPSGAWQFWTETGLVSGG 171
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT---NPTYGRGFFQDKHR 235
Y GC+P +I PC HH + + P+C+ ++ KC +C +P YG DKH
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNG-SRPACKGEEGDTPKCVKQCEEGYSPAYG----TDKHF 226
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
T +Y V +E I EI +GP F +Y DF YKSGVY+H + +L H+ K++
Sbjct: 227 GTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYADFPLYKSGVYQHETGEELGG--HAIKIL 284
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
GWG ENGTPYWL N+W WGD G KILRGK C E I AG PKN
Sbjct: 285 GWGVENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAGVPKN 333
>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
Length = 330
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/341 (36%), Positives = 167/341 (48%), Gaps = 21/341 (6%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
++ L + R S ++ IN+ TW AG NF ++ Y+++
Sbjct: 9 VISALSVSWARPRFAPLSREMVNFINKANTTWKAGHNF-HDVDYSYVKRLC--------- 58
Query: 65 SDRPLPGDRKTYDPEYS--ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
L G R +Y+ +P FDAREQWPNC T+ + D G+C + F A A S
Sbjct: 59 -GTLLKGPRLPVMVQYADDLKLPTNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR CI S + + +S + + +CC C C+ G W+F G VTGG Y
Sbjct: 118 DRVCIHSNAKVSVEISAQDLLTCCDGC----GMGCNGGYPSAAWDFWSSDGLVTGGLYNS 173
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
GC+P TI PC HH + + P C + C C P Y + QDKH +Y V
Sbjct: 174 HIGCRPYTIEPCEHHVNG-SRPPCTGEGGDTPNCDMSC-EPGYSPSYKQDKHFGKTSYSV 231
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
N+ I KE+ +GP F +Y+DF YKSGVY+H S L H+ K++GWG ENG
Sbjct: 232 PSNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGVYQHVSGPALGG--HAIKILGWGEENG 289
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
PYWL N+W WGD G KILRG+ C E I AG P+
Sbjct: 290 VPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIPQ 330
>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
Length = 339
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 126/329 (38%), Positives = 170/329 (51%), Gaps = 23/329 (6%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
+ SD I+ IN++ TW AGRNF N+ YL++ ++ K LPG R
Sbjct: 23 FHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-RV 72
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P+ FDAREQW NC TIG + D G+C + F AV A SDR CI + G+ N
Sbjct: 73 AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CC I D C+ G WNF K+G V+GG Y GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPC 187
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + P P +C+ C Y + +DKH +Y V ++ I EI
Sbjct: 188 EHHVNGSRPPCTGEGDTP--RCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F ++ DF YKSGVYKH + + H+ +++ WG ENG PYWL N+W
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILVWGVENGVPYWLAANSWNL 302
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
Length = 334
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/340 (37%), Positives = 181/340 (53%), Gaps = 17/340 (5%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
+ + L L SD I IN A TW A R FPAN SEEY L+ Y +
Sbjct: 8 VCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSRGYKN 66
Query: 64 QSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
++ + K YDP Y P +FD+R W +C IGH+ D G C + F+ GAF+
Sbjct: 67 YTNEV---EIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFA 123
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR C+ + G+ N+ LS E +A C D K C G + W + +G TGGDYG
Sbjct: 124 DRLCVSTGGKFNQLLSPEELA----FCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGT 179
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
+ GC P + PC + T C Q P + H +C YG+ Q++++T Y V
Sbjct: 180 KEGCMPYKVPPCYNKQGKNT---CGGQ--PMERNH-QCPKTCYGKTTVQNRYKTKSEY-V 232
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
++ I++++ +GP A+F +YDDF YKSG+Y+ T AK + HS K+IGWG +NG
Sbjct: 233 MNSIKTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYQG-GHSIKIIGWGQQNG 291
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
TPYWL +N+W WG+ GT KI++G+ EC E + AG P
Sbjct: 292 TPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIP 331
>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
Length = 340
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 172/344 (50%), Gaps = 18/344 (5%)
Query: 2 IHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
+ IL L+ R Y S ++ IN+ TW AG NF N Y+++
Sbjct: 5 VSILCVLVAFANARSIPYYPPLSSDLVNHINKLNTTWKAGHNF-HNTDMSYVKKLC---G 60
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ P + D +PD FD+R+QWPNC TI + D G+C + F AV
Sbjct: 61 TFLGGPKLP-----ERVDFAADIDLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVE 115
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C+ + + + +S E + SCC ++ C+ G W + +RG V+GG
Sbjct: 116 AISDRICVHTNAKVSVEVSAEDLLSCCG---FECGMGCNGGYPSGAWRYWTERGLVSGGL 172
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y GC+P TI PC HH + + P C + +C C P Y + +DKH +
Sbjct: 173 YDSHVGCRPYTIPPCEHHVNG-SRPPCTGEGGETPRCSRHC-EPGYSPSYKEDKHYGITS 230
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V +E I EI +GP F +Y+DF YKSGVY+H S ++ H+ +++GWG
Sbjct: 231 YGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGG--HAIRILGWGV 288
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
ENGTPYWL N+W WGD G KILRG+ C E I AG P+
Sbjct: 289 ENGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAGVPR 332
>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 125/340 (36%), Positives = 170/340 (50%), Gaps = 21/340 (6%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
LV L + L S +D IN+ TW AG NF N+ Y+++
Sbjct: 9 LVSGLSVSWAWPRLPPLSHQMVDYINKANTTWKAGPNF-HNVDYSYVKRLC--------- 58
Query: 65 SDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
L G + +Y+ V PD FD R+QWPNC T+ + D G+C + F A A S
Sbjct: 59 -GTLLKGPKLPTMVQYAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR CI S + + +S+E + SCC C C+ G W+F G VTGG Y
Sbjct: 118 DRVCIHSNAKVSVEISSEDLLSCCDSC----GMGCNGGYPSAAWDFWTTEGLVTGGLYDS 173
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
GC+P +I PC HH + T P C ++ +C +C Y G+ QDKH +Y +
Sbjct: 174 HVGCRPYSIPPCEHHVNG-TRPPCTGEEGDTPQCSNQCET-GYTPGYKQDKHFGKNSYSL 231
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
E I E+L +GP F +Y+DF YKSGVY+H S + + H+ K++GWG E G
Sbjct: 232 PSEEQQIMAELLKNGPVEGAFTVYEDFLLYKSGVYQHVSGSAVGG--HAIKVLGWGEEGG 289
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
TPYWL N+W WG+ G KILRGK C E + AG P
Sbjct: 290 TPYWLAANSWNTDWGENGFFKILRGKDHCGIESEMVAGVP 329
>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 122/331 (36%), Positives = 169/331 (51%), Gaps = 21/331 (6%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
R + S ++ IN+ TW AG NF N Y+++ L G +
Sbjct: 19 RPRFHPLSSDMVNYINKLNTTWKAGHNF-KNADYSYVQKLC----------GTMLKGPKL 67
Query: 75 TYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
+Y+ V P FDAR QWPNC T+ + D G+C + F A A SDR CI S +
Sbjct: 68 PIMVQYAGDVKLPTEFDARAQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAR 127
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
+ +S+E + +CC+ C C+ G W+F K G VTGG Y GC+P TI
Sbjct: 128 VSVEISSEDLLTCCESC----GMGCNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIP 183
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
PC HH + T P C + +C +C + Y + +DKH +Y V+ NE+ I+ E
Sbjct: 184 PCEHHVNG-TRPPCTGEGGDTPQCINQCES-GYTPSYKKDKHYGKTSYSVEANENQIQTE 241
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
I +GP F +Y+DF YKSGVY+H S + + H+ K++GWG E+G PYWL N+W
Sbjct: 242 IYKNGPVEGAFMVYEDFPMYKSGVYQHVSGSLIGG--HAIKILGWGVEDGVPYWLCANSW 299
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG C E + AG PK
Sbjct: 300 NTDWGDNGYFKILRGSDHCGIESEVVAGIPK 330
>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
Length = 346
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/324 (35%), Positives = 169/324 (52%), Gaps = 21/324 (6%)
Query: 26 IDQINREANTWTAG-----RNFPANLSEEYL-RQFLIADAKYFDQSDRPLPGDRKTYDPE 79
++ IN+ +TA N P ++ + +++ AKY + KT++
Sbjct: 38 VNYINKAQKLFTAKLSPRFANLPRDIKHRLMGSKYVALPAKY--------RMNEKTHNDI 89
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
++T+P FDAR WP C ++ V D AC + AAVGA DR CI S+G+Q LS
Sbjct: 90 DNSTIPKSFDARTNWPKCASLRTVRDQSACGSGWAVAAVGAIMDRICIASEGKQQVILSA 149
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SCC C Y C G ++ WN+ G VTG +Y ++GC+P PC H+
Sbjct: 150 DDILSCCTECGY----GCEGGDTYKAWNYWTTDGIVTGSNYTTKSGCKPYPYPPCEHYID 205
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
A C P C +C + Y + +DKH Y + + I++EI+ HGP
Sbjct: 206 AGRYKKCPKDLYPTNTCEYKCQD-NYTISYDEDKHYGAYPYVLVGDASFIQQEIMNHGPV 264
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
TF +Y+DF HY SG+YKH + + +H+ K++GWGTENG YW+ N+W WG+
Sbjct: 265 EVTFDVYEDFEHYSSGIYKHMAGEYVG--VHAVKMLGWGTENGVDYWICANSWNSDWGEN 322
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G +ILRG+ EC E + AGKPK
Sbjct: 323 GFFRILRGENECGIESNVVAGKPK 346
>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
Length = 339
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 126/329 (38%), Positives = 173/329 (52%), Gaps = 23/329 (6%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
+ SD I+ IN++ TW AGRNF N+ YL++ ++ K LPG R
Sbjct: 23 FHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-RV 72
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P+ FDAREQW NC TIG + D G+C + F AV A SDR CI + G+ N
Sbjct: 73 AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CC I D C+ G W+F K+G V+GG Y GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPC 187
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + + P C + +C+ C Y + +DKH +Y V ++ I EI
Sbjct: 188 EHHVNG-SRPPCTGEG-DTHRCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F ++ DF YKSGVYKH + + H+ +++GWG ENG PYWL N+W
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILGWGVENGVPYWLAANSWNL 302
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 340
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 181/348 (52%), Gaps = 24/348 (6%)
Query: 1 MIHILVFLLGCTLVRGELYK-FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIAD 58
+ ++ FL V+ E ++ SD I IN N W A E+ R + D
Sbjct: 8 IASLITFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRA---------EKSNRFHSLDD 58
Query: 59 AKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
A+ + R P R+T P +++ +P FD+R++WP C +I + D C +
Sbjct: 59 ARIQMGARREEPDLRRTRRPTVDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGSCW 118
Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
F AV A SDR CI+S G+QN LS + SCC+ C C G + W++ K G
Sbjct: 119 AFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCESC----GLGCEGGILGPAWDYWVKEG 174
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
VTG + TGC+P C HH + P C ++ +C C Y + QDK
Sbjct: 175 IVTGSSKENHTGCEPYPFPKCEHH-TKGKYPPCGSKIYKTPRCKQTCQK-KYKTPYTQDK 232
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
HR +Y V ++E AI+KEI+ +GP A F +Y+DF +YKSG+YKH + L H+ +
Sbjct: 233 HRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGG--HAIR 290
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
+IGWG EN TPYWL+ N+W WG+ G +I+RG+ EC+ E + AG+
Sbjct: 291 IIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAGR 338
>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
Length = 335
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 170/324 (52%), Gaps = 18/324 (5%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
SD I+ IN+ A TW A R FPAN+S+EY+ L + ++ + D DP Y
Sbjct: 25 ISDERIEYINKIAKTWKAERYFPANMSKEYIMGLLGSRGYKNYLNEVEIKKD----DPLY 80
Query: 81 SAT--VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ FDARE W C IGHV D G C + F GAF+DR C+ + G N LS
Sbjct: 81 TKNNDTIKHFDAREDWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGGGFNEQLS 140
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
E + CC C C G+ + W + + G TGGDYG GC P + PC +
Sbjct: 141 AEKLTFCCWTC----GLGCQGGNPIKAWKYFKRHGITTGGDYGSNEGCAPYKVPPC-YDD 195
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
L C+ + P H +C YG +++++ Y V D+ I+++I +GP
Sbjct: 196 QGEFL--CQGK--PTEHNH-KCPRACYGNSTVENRYKVKSIY-VLDSSKTIEQDIRKYGP 249
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A+F +YDDF YKSG+Y+ T NA HS KLIGWG E+G PYWL++N+W WG+
Sbjct: 250 VEASFDVYDDFITYKSGIYQKTPNAFYVG-GHSVKLIGWGEEDGIPYWLLVNSWSKFWGE 308
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
+GT +I++G+ EC E AG P
Sbjct: 309 QGTFRIIKGRNECGIERSATAGVP 332
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
Length = 351
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/355 (36%), Positives = 170/355 (47%), Gaps = 42/355 (11%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
+L R L S ++ IN+ +TWTAG NF N+ Y+++ L G
Sbjct: 16 SLARPHLKPLSSEMVNYINKLNSTWTAGHNF-HNVDYSYVKKLC----------GTLLKG 64
Query: 72 DRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
+ Y+ + P FD+REQWPNC T+ + D G+C + F A A SDR CI S
Sbjct: 65 PKLPLMIRYAGDIKLPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHS 124
Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG---- 185
+ + LS + + +CC C C+ G WNF G V+GG Y G
Sbjct: 125 NAKVSVELSAQDLLTCCNSC----GMGCNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQV 180
Query: 186 -----------------CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG 228
C+P TI PC HH + + PSC + +C RC Y
Sbjct: 181 SLCVLLLAVDRDFVSPGCRPYTIPPCEHHVNG-SRPSCSGEGGDTPECIFRC-EAGYSPS 238
Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
+ QDKH +Y V ED IK+EI +GP F +Y+DF YKSGVY+H S + L
Sbjct: 239 YKQDKHFGKTSYSVSSEEDEIKQEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSALGG- 297
Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
H+ K++GWG ENG PYWL N+W WGD G KILRG C E I AG PK
Sbjct: 298 -HAIKMLGWGEENGVPYWLCANSWNTDWGDNGFFKILRGADHCGIESEIVAGNPK 351
>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
Length = 351
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 123/340 (36%), Positives = 175/340 (51%), Gaps = 25/340 (7%)
Query: 11 CTLVRGELYK------FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
C LV + ++ S+ ++ +N++ TW AG NF N+ YL++ +
Sbjct: 22 CLLVLADSWRGPSFHPLSEELVNYVNKQNTTWQAGHNF-YNVDLSYLKRLC---GTFLGG 77
Query: 65 SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
P P R + + + +P+ FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 78 ---PKPPQRVKFAEDLN--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDR 132
Query: 125 RCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
CI + + +S E + +CC +C C+ G WNF ++G V+GG Y
Sbjct: 133 ICIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYDSH 188
Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
GC+P +I PC HH + P PK C C P Y + QDKH +Y V
Sbjct: 189 VGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKSC-EPGYTPTYKQDKHYGYNSYSVS 245
Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
++E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGT
Sbjct: 246 NSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGT 303
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
PYWLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 304 PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 343
>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
Length = 335
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 130/348 (37%), Positives = 174/348 (50%), Gaps = 27/348 (7%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L LL T R LY SD ++ +N++ TW AG NF N+ Y+++
Sbjct: 4 LLATLSCLLVLTSARSSLYFPPLSDELVNFVNKQNTTWKAGHNF-YNVDLSYVKKLC--- 59
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
L G + ++A V P+ FDAR+QWPNC TI + D G+C + F
Sbjct: 60 -------GTILGGPKLPQRDAFAADVVLPESFDARKQWPNCPTIKEIRDQGSCGSCWAFG 112
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT--WNFLHKRGS 174
AV A SDR CI S G+ N +S E + + F + WNF K+G
Sbjct: 113 AVEAISDRICIHSNGRVNVEVSAEDM-----LTCCGGECGDGCNGGFPSGAWNFWTKKGL 167
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
V+GG Y GC+P +I PC HH + P PK C C P Y + +DKH
Sbjct: 168 VSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKTC-EPGYSPSYKEDKH 224
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V +NE I EI +GP F++Y DF YKSGVY+H S + H+ ++
Sbjct: 225 FGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGG--HAIRI 282
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+GWG ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 283 LGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
Length = 321
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 169/340 (49%), Gaps = 28/340 (8%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
+LV +L +L E+ S +ID INR ++W AGRNFP N + EYL + + D
Sbjct: 9 VLVAVLSASL--AEIDVLSSEFIDSINRIQSSWVAGRNFPENTTNEYLYKLNGFIGLHPD 66
Query: 64 QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
+ +P P T++ VP+ FDAR +WPNC ++ + D GAC + FA++ + SD
Sbjct: 67 PNYKP-PVLVHTFNAR---DVPESFDARTKWPNCDSLNRIRDQGACGSCWAFASIESMSD 122
Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
R CI S G S E + SCC C C G + +F G V+GGD
Sbjct: 123 RICIHSSGSAQFMFSPEDLLSCCTSC-----GDCGGGYMMSALDFYINEGIVSGGDVNSN 177
Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
GC+P T + G P C C N Y + DKH + Y V
Sbjct: 178 EGCRPYT-ADAHDQGQTPA-------------CTKSCRN-GYSTSYSADKHYGSNDYVVS 222
Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
D I+ E++ +GP F ++ DFY+Y SGVY+H S + H K++GWG ENG
Sbjct: 223 SVIDQIQYEVMTNGPIIVNFEVFQDFYNYVSGVYRHVSGESVG--FHVVKIVGWGVENGV 280
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
PYWL+ N+WG WGD G K+LRG+ EC E A P+
Sbjct: 281 PYWLIANSWGSSWGDHGFFKMLRGQNECGIENYPYAVMPR 320
>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 170/346 (49%), Gaps = 18/346 (5%)
Query: 1 MIHILVFLLGCTLVRG--ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
+H V L LV G L+ SD +I++IN +TW AGRNF + +++Q L
Sbjct: 4 FLHFAVVLATVALVYGGVHLHPLSDDFINRINSRKSTWKAGRNFDIDTPISHIKQLLGVL 63
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAA 117
+ + P K + +PD FDARE WP+C IG++ D C + F A
Sbjct: 64 PETENTPKLP-----KKIHSINAQEIPDSFDAREAWPDCAPIIGNIRDQSTCGSCWAFGA 118
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
V A SDR CI S +S E CC IC C+ G W G VTG
Sbjct: 119 VEAMSDRICIHSNATVKVNISAEDPLDCCTIC----GMGCNGGMPAMAWLHWTVNGIVTG 174
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
G+Y D GC+ + +PC HH LP C K P C C + + +Q+
Sbjct: 175 GNYEDTNGCKAYSFAPCEHHVDG-DLPPCGPTK-PTPDCKKECDSGS--SLTYQNDLTHG 230
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
Y +D I+ EI+ +GP A+F++Y+DF YKSGVY+H H+ K++GW
Sbjct: 231 SNYGIDPYPKQIQTEIMTNGPVEASFSVYEDFLSYKSGVYQHLEGEYAGG--HAIKILGW 288
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G EN TPYWLV N+W WGD+G KILRG EC E I AG P+
Sbjct: 289 GVENDTPYWLVANSWNEDWGDKGYFKILRGSNECGIEGSIVAGIPE 334
>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
Length = 364
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 124/349 (35%), Positives = 176/349 (50%), Gaps = 19/349 (5%)
Query: 1 MIHILVFLLGCTLVRGELYKFS-----DAYIDQINREANTWTAGRNF-PANLSEEYLRQF 54
++++ + +L V E Y S +A + +N+ TW A NF P E L+
Sbjct: 28 VLNMKLLVLLSAFVLSECYVISKEDNFNAIVKTVNKANTTWKASLNFDPTYYVPEDLK-- 85
Query: 55 LIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
L+ K L +Y +P++FD+R+QWP+C +I ++ D G+C +
Sbjct: 86 LLCGVKEDKHGYSKL---ETSYHNLEGIKIPNQFDSRKQWPHCPSISYIRDQGSCGSCWA 142
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
F AV A SDR CI+S G+ +S E + SCC ++ C+ G W + + G
Sbjct: 143 FGAVEAMSDRYCIRSNGKIQVEISAEDLLSCCG---FECGDGCNGGFPGSAWKYWNSDGL 199
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG YG +TGC P I PC HH E P C ++C T + QDKH
Sbjct: 200 VTGGLYGSKTGCLPYQIKPCEHHVPGDRPKCSEGGGTPS--CVSKCKGNTTIH-YNQDKH 256
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V + I+ EI+ HGP F +Y DF YKSGVYKH + L H+ ++
Sbjct: 257 YGLSSYAVGSDPTQIQTEIMTHGPVEGAFTVYADFPTYKSGVYKHVTGGVLGG--HAIRI 314
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+GWG+ENG YWLV N+W WGD+G KILRG EC E + AG P+
Sbjct: 315 LGWGSENGVAYWLVANSWNTDWGDKGYFKILRGSDECGIESSVVAGIPQ 363
>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
Length = 342
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 125/352 (35%), Positives = 181/352 (51%), Gaps = 25/352 (7%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIA 57
+I + L L E+ SD I IN+ + WTA R+ R +
Sbjct: 8 IISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSD---------RFKSLK 58
Query: 58 DAKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
DA+ + R RK P + S +P FD+R++WP C +I ++ D C A
Sbjct: 59 DARILLGAMREDEELRKKRRPTVDHQDVSLEIPTSFDSRKEWPQCKSISNIRDQSRCGAG 118
Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
FAAV A SDR CI+SKG+++ LS + SCC C C G W++ +
Sbjct: 119 WAFAAVQAMSDRICIESKGKKSVELSAVDLLSCCIEC----GLGCQMGFPGIAWDYWVQE 174
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
G VTGG + TGCQP C HH + P C K KCH +C Y + +D
Sbjct: 175 GIVTGGSKENHTGCQPYPFPKCEHH-TKGRYPECGEIIYMKPKCHQKCQK-GYKTPYEKD 232
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
K+ ++Y + NED+IKKEI+ HGP A+F ++ DF +YKSG+YKH + + + H
Sbjct: 233 KYYGKVSYNLLKNEDSIKKEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIGS--HVV 290
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
++IGWG E TPYWL+ N+W WG++G ++LRGK EC E + +G P++
Sbjct: 291 RIIGWGVEKETPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGLPRD 342
>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
Length = 342
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 119/326 (36%), Positives = 166/326 (50%), Gaps = 20/326 (6%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRKTYD 77
SD ++ +N+ TW AG NF N+ Y+++ ++ AK Q D K
Sbjct: 26 LSDEMVNYVNKLNTTWKAGHNF-RNVDMSYVKKLCGTVMGGAKQLPQRVMLADDDMK--- 81
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+P+ FDAREQWP C TI + D G+C + F AV A SDR C+ + G +
Sbjct: 82 ------LPENFDAREQWPKCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHTNGYITIEV 135
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S E + SCC + + C+ G W + K+G V+GG Y GC+P +I PC HH
Sbjct: 136 SAEDLLSCCGL---QCGEGCNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPYSIPPCEHH 192
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+ + P+C + KC+ +C Y + DKH T Y V +E I EI +G
Sbjct: 193 VNG-SRPACTGEGGDTPKCNKKC-EAGYSPDYKDDKHYGTTAYNVPSSEKEIMAEIYKNG 250
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P F +Y DF YKSGVY+H + L H+ +++GWG E+G PYWL N+W WG
Sbjct: 251 PVEGAFIVYADFLQYKSGVYQHVTGDMLGG--HAIRVLGWGVEDGVPYWLAANSWNTDWG 308
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
D G KILRGK C E + AG P+
Sbjct: 309 DNGFFKILRGKDHCGIESEMVAGIPR 334
>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
Length = 330
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 122/337 (36%), Positives = 175/337 (51%), Gaps = 27/337 (8%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
L GE SD +I+ GRNF A+++E ++R+ + + D LP
Sbjct: 15 ALTSGEPSLLSDEFIE----------VGRNFDASVTEGHIRRLM---GVHPDAHKFALPD 61
Query: 72 DRKTYDPEYSATV---PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
R+ Y +V P+ FD+R+QWPNC TIG + D G+C + F AV A SDR CI
Sbjct: 62 KREVLGDLYVNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 121
Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
S G+ N S + + SCC C + C+ G W++ ++G V+GG YG GC+P
Sbjct: 122 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRP 177
Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
ISPC HH + P + P KC C + Y + +DKH + +Y V N
Sbjct: 178 YEISPCEHHVNGTRPPCAHGGRTP--KCSHVCQS-GYTVDYAKDKHFGSKSYSVRRNVRE 234
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYW 306
I++EI+ +GP F +Y+D YK GVY+H +L H+ +++GWG E PYW
Sbjct: 235 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGG--HAIRILGWGVWGEEKIPYW 292
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
L+ N+W WGD G +ILRG+ C E I+AG PK
Sbjct: 293 LIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPK 329
>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
Length = 329
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 122/325 (37%), Positives = 167/325 (51%), Gaps = 23/325 (7%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL--IADAKYFDQSDRPLPGDRKTYDP 78
S I INR TW AG+NF N+ Y++ + + + + P
Sbjct: 25 LSSEMIQYINRLNTTWKAGQNF-YNVDLSYVQGLCGTLQNKPTLPELEHPA--------- 74
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+PD FDAR+QWPNC TI + D G+C + F A A SDR CI S + +S
Sbjct: 75 --GVKLPDTFDARQQWPNCPTIQDIRDQGSCGSCWAFGAAEAISDRLCIHSNAKITVEIS 132
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
E + SCC+ C C G W + K G VTGG YG GC+P +I PC HH
Sbjct: 133 AEDLLSCCEEC----GMGCFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPYSIPPCEHHV 188
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ T P C+ + KC T+C + Y + +DK+ TY V ++ I E+ +GP
Sbjct: 189 NG-TRPPCQGEG-DTPKCQTKCID-GYTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGP 245
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F++Y+DF YKSGVY+H + L H+ K++GWG EN TPYWL N+W WG+
Sbjct: 246 VEAAFSVYEDFLLYKSGVYQHLTGDMLGG--HAIKILGWGKENNTPYWLAANSWNTDWGN 303
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
+G KILRG EC E + AG P+
Sbjct: 304 QGFFKILRGGDECGIESEVVAGIPQ 328
>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
Length = 331
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 162/317 (51%), Gaps = 19/317 (5%)
Query: 29 INREANTWTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPD 86
IN+ TW AG N F + RQ + D LP K P VPD
Sbjct: 28 INKLGTTWKAGVNKRFEGLSEVDIRRQMGVLQGGPLDIK---LP--EKDITP--LKDVPD 80
Query: 87 RFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC 146
FDAR QWP+C TI + D GAC + F AV + SDR CI Q+ +S E + +CC
Sbjct: 81 MFDARMQWPDCPTIKEIRDQGACGSCWAFGAVESMSDRFCIHF--NQSAHISAEDLMACC 138
Query: 147 KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC 206
+ C C+ G + W + G VTGG Y + GCQP I+ C HH P C
Sbjct: 139 ETC----GMGCNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQP-C 193
Query: 207 ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALY 266
+++ +C C Y F +DKH Y V + +AI+ EI+ +GP F +Y
Sbjct: 194 ASKEEHTPRCSKTC-EAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNGPVEGAFTVY 252
Query: 267 DDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILR 326
DF YKSGVY+HTS A L H+ +++GWGTENGTPYWLV N+W WG G KI+R
Sbjct: 253 ADFPTYKSGVYQHTSGAMLGG--HAIRILGWGTENGTPYWLVANSWNEDWGAMGYFKIIR 310
Query: 327 GKYECAFEYLIAAGKPK 343
GK +C E I AG PK
Sbjct: 311 GKDDCGIESQITAGMPK 327
>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
Length = 340
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 122/347 (35%), Positives = 174/347 (50%), Gaps = 24/347 (6%)
Query: 2 IHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LI 56
+ IL L+ R Y S ++ IN+ TW AG NF N Y++Q +
Sbjct: 5 VSILCVLVAFANARSVPYYRPLSSDLVNHINKLNTTWKAGHNF-YNTDMSYVKQLCGTFL 63
Query: 57 ADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
K ++ D GD + +PD FD+R QWPNC TI + D G+C + F
Sbjct: 64 GGPKLPERVD--FAGDME---------LPDSFDSRTQWPNCPTISEIRDQGSCGSCWAFG 112
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
AV A SDR C+ + + + +S E + SCC ++ C+ G W + ++G V+
Sbjct: 113 AVEAISDRICVHTNAKVSVEVSAEDLLSCCG---FECGMGCNGGYPSGAWRYWTEKGLVS 169
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
GG Y GC+P +I PC HH + + P C + +C C P Y + +DKH
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNG-SRPPCTGEGGETPRCSRHC-EPGYSPSYKEDKHYG 227
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
+Y V +E I EI +GP F +Y+DF YKSGVY+H + ++ H+ +L+G
Sbjct: 228 ITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVTGEQVGG--HAIRLLG 285
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WG +NGTPYWL N+W WGD G KILRG+ C E I AG P
Sbjct: 286 WGVDNGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAGIPS 332
>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
Length = 344
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 129/323 (39%), Positives = 162/323 (50%), Gaps = 13/323 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
S I IN EANT P S +R+ L A D + LP Y P
Sbjct: 33 LSSELIHFINHEANTTWKAAPSPRFKSVSDIRRMLGALP---DPNGGHLPTLCTGYTPSL 89
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P FDAR+ WP+C +I + D +C + F AV A SDR CI+SKG LS E
Sbjct: 90 D-ELPKEFDARKYWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSAE 148
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ +CC C C+ G W++ + G VTG Y GCQP PC HH
Sbjct: 149 NLVACCSSC----GMGCNGGFPHSAWSYWKRSGIVTGDLYNPTDGCQPYEFPPCEHHVVG 204
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P PSCE V KC T C P Y + +DK Y V N++AI KE+ HGP
Sbjct: 205 PR-PSCEGD-VETPKCKTTC-QPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVKEHGPVE 261
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y DF +YKSGVY+H S L H+ +L+GWG ENG PYWL+ N+W WGD G
Sbjct: 262 VDFEVYADFPNYKSGVYQHVSGGLLGG--HAVRLLGWGEENGVPYWLIANSWNSDWGDNG 319
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
KI+RG+ EC E + AG PK
Sbjct: 320 YFKIIRGRNECGIESDVNAGIPK 342
>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
Length = 333
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 128/349 (36%), Positives = 173/349 (49%), Gaps = 24/349 (6%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L FL R Y S ++ IN+ TW AG NF AN Y+++
Sbjct: 4 LVVALCFLASIANSRHLPYFAPLSHDMVNYINKVNTTWKAGHNF-ANADVHYVKRLC--- 59
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ P +K + +PD FD+R WPNC TI + D G+C + F AV
Sbjct: 60 -----GTHLNGPQLQKRFGFADDLDLPDSFDSRAAWPNCPTIREIRDQGSCGSCWAFGAV 114
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR C+ + G+ N +S E + SCC + C+ G W F + G V+GG
Sbjct: 115 EAISDRVCVHTNGKVNVEVSAEDLLSCCG---FKCGMGCNGGYPSGAWRFWTETGLVSGG 171
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTN---PTYGRGFFQDKHR 235
Y GC+P +I PC HH + + PSC+ ++ KC C P YG DKH
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNG-SRPSCKGEEGDTPKCMKTCEEGYTPAYG----SDKHF 226
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
+Y V +E I +I +GP F +Y DF YKSGVY+H + +L H+ K++
Sbjct: 227 GATSYGVPSSEKEIMADIYKNGPVEGAFVVYADFPLYKSGVYQHETGEELGG--HAIKIL 284
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
GWG ENGTPYWL N+W WGD G KILRGK C E + AG PKN
Sbjct: 285 GWGVENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEVVAGIPKN 333
>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 345
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 123/345 (35%), Positives = 178/345 (51%), Gaps = 24/345 (6%)
Query: 4 ILVFLLGCTLVRGELYK-FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKY 61
++ L ++ E +K SD I IN N W A E+ R + DA+
Sbjct: 16 LITHLDAHISIKNEKFKPLSDDIISYINEHPNAGWRA---------EKSNRFHSLDDARI 66
Query: 62 FDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
+ R P R+ P E++ +P FD+R++WP C +I + D C + F
Sbjct: 67 QMGARREEPDLRRKRRPTVDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWAFG 126
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
AV A SDR CI+S G+QN LS + SCC+ C C G + W+F K G VT
Sbjct: 127 AVEAMSDRSCIQSGGKQNVELSAVDLLSCCESC----GLGCEGGILGPAWDFWVKEGIVT 182
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
G + TGC+P C HH + P C ++ +C C Y + QDKHR
Sbjct: 183 GSSKENHTGCEPYPFPKCEHH-TKGKYPPCGSKIYKTPRCKQTCQK-KYKTPYTQDKHRG 240
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
+Y V ++E AI+KEI+ +GP A+F +Y+DF +YKSG+YKH + L H+ ++IG
Sbjct: 241 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGG--HAIRIIG 298
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
WG EN TPYWL+ N+W WG+ G +I+RG+ EC E + AG+
Sbjct: 299 WGVENKTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIAGQ 343
>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
Length = 337
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 169/324 (52%), Gaps = 17/324 (5%)
Query: 22 SDAYIDQINREANTWTAG-RNFPAN-LSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
++ I IN + W AG ++ P + ++ +A K+++ P+
Sbjct: 22 TELLIQHINSVQSLWRAGYQDVPKEKMMGNLMKPEHVAPHKFYEV--EPI---------S 70
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +PD FDAREQWPNC +I ++ D C + AA SDR CI S G+ N +S
Sbjct: 71 VAENIPDHFDAREQWPNCVSIDNIRDQSDCGSCWAVAAAETISDRTCIASNGEVNVLISA 130
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
E + SCC Y+ C G + W + G VTGG Y + GC+P +I+PC +
Sbjct: 131 EDLLSCCT-GGYNCGDGCEGGYPIQAWRYWVHNGLVTGGSYESQYGCKPYSIAPCGQTVN 189
Query: 200 APTLPSCENQKVPKLKCHTRCTNPT-YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
T P C +V +C +CT+ + Y + QDKH + Y + N I+ EI+ +GP
Sbjct: 190 GVTWPKCAADEVATPECVKQCTSKSDYAVPYDQDKHYGSSAYAIRQNVAQIQTEIMRNGP 249
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +Y DFY YKSG+YKH + +L H+ K++GWG ENGTPYWL N+W +WG+
Sbjct: 250 VEVGFLVYSDFYQYKSGIYKHVAGRELGG--HAVKILGWGVENGTPYWLAANSWNVNWGE 307
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
+G +I RG EC E + AG P
Sbjct: 308 KGYFRIRRGTNECGIESSVVAGIP 331
>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
Length = 335
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 117/343 (34%), Positives = 168/343 (48%), Gaps = 15/343 (4%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
M+ +L LL + + +I+ IN WTA E Y F + +
Sbjct: 1 MLKLLPSLLFILAASAVVLPRNKLFINHINSAQKLWTA---------EHYTTPFEVKNLM 51
Query: 61 YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
+ L D K E + ++PD +D R+ WP C ++ ++ D C + AA A
Sbjct: 52 KVEHVAAHLDKDIKL--AETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEA 109
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
SDR CI S G N LS E + +CC +++ C G + W + K G VTGG +
Sbjct: 110 ISDRTCIASNGDVNTLLSAEDILTCCT-GKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSF 168
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLT 239
+ GC+P +I+PC T P C + KC CT N +Y + QDKH
Sbjct: 169 ESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFGASA 228
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + + I+ EILAHGP F +Y+DFY YK+G+Y H + +L H+ K++GWG
Sbjct: 229 YAIGRSAKQIQTEILAHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGG--HAVKMLGWGV 286
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+NGTPYWL N+W WG++G +ILRG EC E AG P
Sbjct: 287 DNGTPYWLAANSWNTVWGEKGYFRILRGVDECGIESAAVAGMP 329
>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 351
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 134/330 (40%), Positives = 167/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANTWTA--GRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
S A ID +NR TW A R F S +RQ L A D R LP
Sbjct: 39 LSSAIIDYVNRINTTWKAEPSRRF---TSPSQVRQQLGA---LPDPMGRRLPVLYSL--S 90
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP-- 136
E ++P FD R++WPNC T+ + D G+C + F A A SDR CI+ + R
Sbjct: 91 ENYKSLPASFDPRKKWPNCKTLFEIRDQGSCGSCWAFGAAEAMSDRLCIQQQTVSGRAVM 150
Query: 137 --LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + + SCC+ C C+ G + WNF G V+GG YG + C+ I PC
Sbjct: 151 VRLSADDLLSCCRDC----GMGCNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPC 206
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + T P CE P KC C Y + +DKH Y V NEDAIK E++
Sbjct: 207 EHHVNG-TRPPCEGD-APTPKCKNVCQE-EYKVPYKKDKHYAVKVYSVHSNEDAIKHELI 263
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A F +Y DF YKSGVY+H S A L H+ KL+GWG E+G PYWL N+W
Sbjct: 264 THGPVEADFEVYADFPTYKSGVYQHVSGALLGG--HAIKLMGWGEEDGVPYWLCANSWNT 321
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG+ G KILRGK C E I AG P+N
Sbjct: 322 DWGEGGFFKILRGKNHCGIESDIVAGIPQN 351
>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 126/352 (35%), Positives = 178/352 (50%), Gaps = 28/352 (7%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ +L +L + + Y YID IN +A TW AG NFP + +E + + L +
Sbjct: 4 VLILLSVILFSVYMTEQAYFLEKDYIDSINAQATTWKAGVNFPPSTPKEAILRLLGSRGV 63
Query: 61 YF-DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
++++ + R + +P +FDAR++W C TIG V D G C + A
Sbjct: 64 QIPNKANYKMYKSRDSNYDNLFGRIPKKFDARKKWRKCKTIGAVRDQGNCGSCWALATSS 123
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
AF+DR C+ + N LS E + CC C Y C+ G + W G VTGGD
Sbjct: 124 AFADRLCVATDADFNEFLSPEELTFCCHTCGY----GCNGGYPIKAWERFKSHGLVTGGD 179
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK--HRTT 237
Y GC+P + PC HH SC ++ + K + RCT YG HR T
Sbjct: 180 YKSGEGCEPYRVPPCRHHAEGNN--SCSDKPMEK---NHRCTRMCYGDQDLDFDDDHRYT 234
Query: 238 -----LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--H 290
LTY +I+K+++ +GP A+F +YDDF YKSGVY + NA +YL H
Sbjct: 235 RDSYYLTY------GSIQKDVMNYGPIEASFDVYDDFPSYKSGVYIRSDNA---SYLGGH 285
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ KLIGWG E+G PYWL++N+W WGD+G KI RG EC + AG P
Sbjct: 286 AVKLIGWGEESGVPYWLMVNSWNTDWGDKGLFKIQRGTNECGVDNSTTAGVP 337
>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
Length = 338
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 128/344 (37%), Positives = 174/344 (50%), Gaps = 23/344 (6%)
Query: 5 LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
L+ L C +V R SD + +N++ TW AG NF N+ + YL++
Sbjct: 4 LLATLSCLVVLTSAQRRPPFQPLSDELVHYVNKQNTTWKAGHNF-HNVDQSYLKKLC--- 59
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ P P R + + +P+ FD+REQWPNC TI + D G+C + F AV
Sbjct: 60 GTFLGG---PKPPQRLWF--AENMILPESFDSREQWPNCPTIKEIRDQGSCGSCWAFGAV 114
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI++ G + +S E + +CC D C+ G WNF G V+GG
Sbjct: 115 EAISDRICIRTNGHVSVEVSAEDMLTCCGDQCGD---GCNGGFPAEAWNFWTXXGLVSGG 171
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y GC+P +I PC HH + P PK C C P Y + +DKH
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYTPSYKEDKHYGCS 228
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y V +E I EI +GP A F++Y DF YKSGVY+H + + H+ +++GWG
Sbjct: 229 SYSVSSSEKEIMAEIYKNGPVEAAFSVYSDFLMYKSGVYQHVTGEMMGG--HAVRILGWG 286
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 287 VENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 330
>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 340
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 127/347 (36%), Positives = 175/347 (50%), Gaps = 20/347 (5%)
Query: 4 ILVFLLGCTL-VRGELYKFSDA---YIDQINREAN-TWTAGRNFPANLSEEYLRQFLIAD 58
+ + +LGC +KF + + ++N N TW A R +P E+ R+ L+
Sbjct: 6 LSILILGCLFSTSANCFKFGEMSPFIVFEVNSNPNSTWKAAR-YPH--FEKMTREQLLGH 62
Query: 59 AKYFDQSD-RPLPGDRKTYDPEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
D+ D LP K +DP +A +P+ FDAREQWPNC +I + D C + FA
Sbjct: 63 LGSLDEPDWVKLP--TKEFDPNANADPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWAFA 120
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
A FSDR CI S +S+E + CC C C G W ++ ++G
Sbjct: 121 ATETFSDRICIASNQTLQTSISSEDLLECCADYC----GMGCKGGYPSAAWGYMKRQGVS 176
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
TGG YGD T C+P PC HH + P Q P+ C C + + +D H
Sbjct: 177 TGGLYGDDTSCKPYIFPPCDHHVTGQYQPCGPIQPTPQ--CVKECNSEYTQNTYEKDLHF 234
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
+ TY + N AI++EI+AHGP A+F + DF YKSGVY K E HS K+I
Sbjct: 235 ASQTYSIKQNVQAIQREIMAHGPVQASFKVAADFLTYKSGVYIRNPKLKYEGG-HSVKII 293
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
GWG E TPYWL+ N+W WG++G ++LRG+ EC E I AG P
Sbjct: 294 GWGKEGNTPYWLIANSWNEDWGEKGLFRMLRGRNECGIEAQIVAGLP 340
>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
Length = 334
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 128/334 (38%), Positives = 170/334 (50%), Gaps = 24/334 (7%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
+ Y + YI+QIN A TW AG NF LS + + L +K + + P KT+
Sbjct: 17 QAYFLEEDYINQINANAKTWKAGANFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 74
Query: 77 DPEYSAT---VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
D Y++ +P FDAR++W C T+G V D G C F AF+DR CI + G+
Sbjct: 75 DEAYNSLPNRIPSNFDARKKWRKCSTVGKVRDQGNCGTCWAFGTSSAFADRLCIATNGEF 134
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N LS E +A CC C C G + W K G VTGGDY GCQP + P
Sbjct: 135 NELLSAEELAFCCHKC----GSGCHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPP 190
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIK 250
C +C + P K H RCT YG F +D T Y++ N I+
Sbjct: 191 CPFDEYGNN--TCRGK--PAEKNH-RCTRMCYGNQNLDFKEDHRYTRDAYYL--NYQIIQ 243
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWLV 308
+++ +GP A++ +YDDF +YKSGVY T NA +YL H+ KLIGWG E G PYWL+
Sbjct: 244 NDLMTYGPIEASYDVYDDFPNYKSGVYMKTENA---SYLGGHAVKLIGWGEEYGVPYWLL 300
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+N+W WGD+G KI RG EC + G P
Sbjct: 301 VNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 121/346 (34%), Positives = 175/346 (50%), Gaps = 16/346 (4%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIADA 59
++ +L +L + Y ++YI+ IN A TWTAG NF P+ +++++
Sbjct: 4 LVILLSVVLFSVYQTEQAYFLEESYIEMINDVATTWTAGVNFDPSTPEKDFIKMLGSKGV 63
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ + + + + +P FDAR +W +C TIG V D G C + A
Sbjct: 64 EAAKNASAHMFKTHDVANDNNNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSS 123
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
AF+DR C+ + G N LS E + CC C + C+ G + W + G VTGG+
Sbjct: 124 AFADRLCVATNGDFNELLSAEEITFCCHTCGF----GCNGGYPIKAWKYFSSHGIVTGGN 179
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTT 237
Y GC+P + PC + SC + + K + RCT YG + D HR T
Sbjct: 180 YKSGEGCEPYRVPPCPQDEEGKS--SCAGKPIEK---NHRCTRMCYGNQDLDYNDDHRFT 234
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA-KLENYLHSGKLIG 296
Y+ +I+K+++ +GP A+F +YDDF YKSGVY+ T NA KL H+ KLIG
Sbjct: 235 RDYYYL-TYGSIQKDVMNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGG--HAVKLIG 291
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG E GTPYWL++N+W WGD G KI RG EC + AG P
Sbjct: 292 WGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECGIDSAATAGVP 337
>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 125/332 (37%), Positives = 164/332 (49%), Gaps = 15/332 (4%)
Query: 8 LLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDR 67
L G T+ FS A ID++N WTAG NF + E +R +L A + D
Sbjct: 25 LFGFTIGIAAASDFS-AIIDEVNTANAGWTAGENFHEQTTLEDVRSWLGA----WSNKDY 79
Query: 68 PLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCI 127
P +K + +P FD+R W +C IG + D G C + F A A SDR CI
Sbjct: 80 DWP--QKYPHDDLVGDIPATFDSRSNWSDCSVIGKIRDQGGCGSCWAFGAAEAISDRICI 137
Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ 187
SKG + + E V SCC C C+ G + RG VTGG YG + CQ
Sbjct: 138 ASKGATDVMYAAEDVLSCCLTC----GNGCNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQ 193
Query: 188 PSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED 247
P T+ C HH P E PK C +C + + DK Y V ++
Sbjct: 194 PYTLEACEHHVPGDRPPCTEGGGTPK--CSHQCIPDYTTKAYKDDKVHGHKAYSVPNDVG 251
Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWL 307
I++EI+ +GP A F +Y DF YKSGVY+HTS ++L H+ K+IGWGTE G YWL
Sbjct: 252 KIQQEIMHYGPVEAAFTVYSDFPSYKSGVYRHTSGSELGG--HAIKIIGWGTEGGDDYWL 309
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
+ N+W WGD+GT KILRG EC E + A
Sbjct: 310 INNSWNSDWGDKGTFKILRGSNECGIEGEVVA 341
>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
Length = 342
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 120/328 (36%), Positives = 173/328 (52%), Gaps = 19/328 (5%)
Query: 21 FSDAYIDQINREANT-WTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
SD I IN+ + WTA R+ F + L + D K + RP T D
Sbjct: 30 LSDEIIAYINQHPDAGWTASRSDRFKSVEDARILLGVMREDEK-LRKKRRP------TVD 82
Query: 78 PE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
+ S +P FD+R++W C +I + D C + FAAV SDR CI+SKG+++
Sbjct: 83 HQNVSLEIPSTFDSRKKWSQCKSISSIHDQSRCGSGWAFAAVEVMSDRICIQSKGEKSVE 142
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
LS + SCC+ C C G W++ + G VTG + TGCQP C H
Sbjct: 143 LSAVDLLSCCREC----GLGCLGGFPGSAWDYWVEEGVVTGSSGENHTGCQPYPFPKCEH 198
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
+ + P+C + KC +C Y + +DKH + Y V +NED+IKKEI+ H
Sbjct: 199 NTTG-KYPACGQKIYETPKCQKKCQK-GYKTPYKKDKHYGKVAYNVPNNEDSIKKEIMMH 256
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
GP + F +Y DF +YKSG+YKH ++ +H+ +++GWG E GTPYWL+ N+W W
Sbjct: 257 GPVGSFFTVYSDFLNYKSGIYKHMKGTEIG--VHTVRIVGWGVEKGTPYWLIANSWNEGW 314
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPKN 344
G++G +ILRGK EC E L+ G P+N
Sbjct: 315 GEKGYFRILRGKDECDIESLVIGGLPRN 342
>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
Length = 339
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 124/329 (37%), Positives = 169/329 (51%), Gaps = 23/329 (6%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
+ SD I+ IN++ TW AGRNF N+ YL++ ++ K LPG R
Sbjct: 23 FHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-RV 72
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P+ FDAREQW NC TIG + D G+C + F AV A SDR CI + G+ N
Sbjct: 73 AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CC I D C+ G W+F K+G V+GG Y GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPC 187
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + P P +C+ C Y + +DKH +Y V ++ I EI
Sbjct: 188 EHHVNGSRPPCTGEGDTP--RCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+ P F ++ DF YKSGVYKH + + H+ +++GWG NG PYWL N+W
Sbjct: 245 KNDPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILGWGVGNGVPYWLAANSWNL 302
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|227293|prf||1701299A cathepsin B
Length = 339
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 125/329 (37%), Positives = 172/329 (52%), Gaps = 23/329 (6%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
+ SD I+ IN++ TW AGRN P N+ YL++ ++ K LPG R
Sbjct: 23 FHPLSDDLINYINKQNTTWQAGRN-PYNVDISYLKKLCGTVLGGPK--------LPG-RV 72
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P+ FDAREQW NC TIG + D G+C + F AV A SDR CI + G+ N
Sbjct: 73 AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CC I D C+ G WNF K+G V+GG Y GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTKKGLVSGGYYDSHIGCLPYTIPPC 187
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + + P C + + +C+ C Y + +DKH +Y V ++ I EI
Sbjct: 188 EHHVNG-SRPPCTGEGDTR-RCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKKIMAEIY 244
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F ++ DF YKSGVYKH + + H+ +++ WG ENG PYW N+W
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILVWGVENGVPYWAAANSWNL 302
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331
>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 333
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 121/329 (36%), Positives = 165/329 (50%), Gaps = 23/329 (6%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
+ SD ID +N TWTA R+ FP+ + + D K+ LP K
Sbjct: 23 DFQALSDDVIDYVNSLNTTWTAARSPRFPSGNEVDVKDLCGVLDVKH------TLPYKEK 76
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+PD FDAR++W +C +I + D G+C + AV A SDR C+ Q+N
Sbjct: 77 VS----VGAIPDTFDARQKWSDCPSISDIRDQGSCGSCWALGAVEAMSDRYCVSF--QEN 130
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CCK C C+ G + + W + K G VTGG YG GCQP I C
Sbjct: 131 VHISAENLMTCCKFC----GNGCAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLIPKC 186
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
+HH P K P+ C C + Y + D H Y V +AI+ EI+
Sbjct: 187 NHHEPGPYENCTGEGKTPQ--CERTCRS-GYTTSYEADLHYGEKAYAVHREVEAIQTEIM 243
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F +Y DF YKSGVY+H L H+ +++GWGTENG PYWL+ N+W P
Sbjct: 244 TNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGG--HAIRILGWGTENGVPYWLIANSWNP 301
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD+G K++RGK +C E I AG PK
Sbjct: 302 SWGDKGYFKMIRGKDDCGIESNIVAGTPK 330
>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
pisum]
gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
Length = 339
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 129/347 (37%), Positives = 180/347 (51%), Gaps = 19/347 (5%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ +L +L + Y ++YI+ IN A TW AG NF + E + L +K
Sbjct: 4 LVILLSVVLFSVYQTEQAYFLEESYIEMINDVATTWKAGVNFDPSTPETDFIKML--GSK 61
Query: 61 YFDQSDRPLPGDRKTYDPEYS--ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ + KT+D Y+ + +P FDAR++W +C TIG V D G C + F
Sbjct: 62 GVEAAKNASAHMFKTHDVAYNKFSYIPRTFDARKRWRHCKTIGEVRDQGHCGSCWAFGTS 121
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
AF+DR C+ + G N LS E + CC C + C+ G + W + G VTGG
Sbjct: 122 SAFADRLCVATDGDFNELLSAEELTFCCHACGH----GCNGGYPIKAWKYFSTHGLVTGG 177
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRT 236
+Y GC+P + PC + + SC + PK K H RCT YG + D HR
Sbjct: 178 NYKSGKGCEPYRVPPCPRNEDGKS--SCAGK--PKEKNH-RCTRMCYGNQDLDYDDDHRF 232
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA-KLENYLHSGKLI 295
T ++ +I+K++L +GP A+F +YDDF YKSGVY+ T NA KL H+ KLI
Sbjct: 233 TRDFYYL-TYGSIQKDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGG--HAVKLI 289
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
GWG E GTPYWL++N+W WGD G KI RG EC + AG P
Sbjct: 290 GWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECRIDSATTAGVP 336
>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
Length = 374
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 168/324 (51%), Gaps = 17/324 (5%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S +D +N +A+T W A + S + L K D + LP R +
Sbjct: 65 LSQEIVDYVNTKADTTWKA--EVTSKWSSVAEVKNLCGSLK--DPNGSRLPIMRHKLE-- 118
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +PD FDAR++W C TI V D G+C + F AV A SDR CI SKG + +S+
Sbjct: 119 -AVNLPDDFDARKEWTGCPTIKEVRDQGSCGSCWAFGAVEAMSDRICIASKGNVHAHISS 177
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
E + SCC C C+ G W + G V+GG YG GC+P +I+PC HH +
Sbjct: 178 EDLLSCCSSC----GMGCNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAPCEHHVN 233
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
LP C + P KC C Y + DK+ Y VD++E I EI+ +GP
Sbjct: 234 GTRLP-CSGEG-PTPKCERTCEK-GYKVKYEDDKNFGYTAYSVDNDEKQIMTEIMTNGPV 290
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
F +Y DF YKSGVY+H S +L H+ +++GWG E+GTPYWLV N+W WGD
Sbjct: 291 EGAFTVYADFPTYKSGVYQHVSGGELGG--HAIRVLGWGVEDGTPYWLVANSWNSDWGDN 348
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G KILRG+ EC E I AG PK
Sbjct: 349 GFFKILRGQNECGIEGEIVAGLPK 372
>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 116/260 (44%), Positives = 142/260 (54%), Gaps = 9/260 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
VPD FD+REQWP+C TI V D GAC + F AV A SDR CIKS+G+ +S E +
Sbjct: 4 VPDHFDSREQWPHCPTIKEVRDQGACGSCWAFGAVEAMSDRYCIKSEGKVMPHISAEDLL 63
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC+ C C+ G W+ +G VTGG Y GCQP I+ C HH
Sbjct: 64 SCCETC----GMGCNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPYKIAACDHHVVGKLK 119
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P C+ P KC +C Y + DKH Y V + I+KEI+ +GP F
Sbjct: 120 P-CKGDS-PTPKCERKC-EAGYNVSYSDDKHFGQSAYSVRSDPAEIQKEIMTNGPVEGAF 176
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y DF YKSGVY+HTS + L H+ K++GWG ENGTPYWLV N+W WGD G K
Sbjct: 177 TVYADFPTYKSGVYQHTSGSALGG--HAIKILGWGEENGTPYWLVANSWNSDWGDEGFFK 234
Query: 324 ILRGKYECAFEYLIAAGKPK 343
I RG EC E I G PK
Sbjct: 235 IKRGNDECGIESGIVGGLPK 254
>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
Length = 329
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 118/325 (36%), Positives = 173/325 (53%), Gaps = 27/325 (8%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
SD +I+ + +A+TW GRNF ++SEEY+R + + D LP R Y
Sbjct: 23 LSDEFIELVRSKASTWQVGRNFKESVSEEYIRGLM---GVHPDAHKFALPEKRIVLGDLY 79
Query: 81 S---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+ +P+ FDAR+ WPNC TIG + D G+C + F AV A SDR CI S+G+ N L
Sbjct: 80 ADDGIDIPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHL 139
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S + + SCC IC + C+ G W++ ++G V+GG YG GC+P I+PC HH
Sbjct: 140 SADDLVSCCHICGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEHH 195
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+ T P C + P C +C +Y + +DK+ + +Y V N I++EI+ +G
Sbjct: 196 VNG-TRPPCSHGSTP--SCQHKC-QASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNG 251
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYWLVINTWGPH 315
P F +Y+D YKSGVY+H +L H+ +++GWG E+ PYWL+ N+W
Sbjct: 252 PVEGAFTVYEDLILYKSGVYQHEHGKELGG--HAIRILGWGVWGESKVPYWLIGNSWNTD 309
Query: 316 WGDRGTVKILRGKYECAFEYLIAAG 340
WGD C E I+AG
Sbjct: 310 WGDND---------HCGIESSISAG 325
>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 340
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 131/353 (37%), Positives = 175/353 (49%), Gaps = 32/353 (9%)
Query: 2 IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
+ +L + V + Y +ID IN A TW AG NF N +EY + L +K
Sbjct: 5 LMLLSVIFVSVYVTEQAYFLQKDFIDNINNHATTWKAGVNFDPNTPKEYFLKML--GSKG 62
Query: 62 FDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
D+ KT+D Y +P FDAR++W C TIG V D G C + A
Sbjct: 63 VQIPDKHNIHMYKTHDAAYDNLFGRIPKHFDARKKWKRCHTIGKVRDQGNCGSCWAMATS 122
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
AF+DR C+ + N LS E + CC C Y C+ G + W + RG VTGG
Sbjct: 123 SAFADRLCVATNADFNELLSAEEITFCCSSCGY----GCNGGYPIKAWESFNNRGLVTGG 178
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRT 236
DY GC+P + PC + A +C + P+ K H RCT YG + D HR
Sbjct: 179 DYQSGEGCEPYRVPPCPY--DAEGHNTCAGK--PREKNH-RCTRTCYGNQDLDYNDDHRF 233
Query: 237 T-----LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL-- 289
T LTY +I+K+++ +GP A+F +YDDF YKSGVY + NA +YL
Sbjct: 234 TRDSYYLTY------SSIQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENA---SYLGG 284
Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
H+ KLIGWG E+G YWL++N+W WGD G KI RG EC + G P
Sbjct: 285 HAVKLIGWGEEHGVLYWLMVNSWNEGWGDNGLFKIRRGTNECGIDNSTTGGVP 337
>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
Length = 330
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 120/334 (35%), Positives = 170/334 (50%), Gaps = 21/334 (6%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
+L R L S ++ IN+ TW AG NF ++ Y+R+ L G
Sbjct: 16 SLARPHLQPLSKEMVNYINKMNTTWKAGHNF-RDVDYSYVRRLC----------GTMLKG 64
Query: 72 DRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
+ +Y+ +P +FD+REQWP C T+ + D G+C + F A A SDR CI S
Sbjct: 65 PKLPIMVQYAGGLKLPAQFDSREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124
Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
+ + +S+E + +CC C C+ G W+F K G V+GG Y GC+P
Sbjct: 125 GSKVSVEISSEDLLTCCDAC----GMGCNGGYPSAAWDFWTKEGLVSGGLYNSHIGCRPY 180
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
TI PC HH + + P C + KC C Y + +DKH +Y V+ + + I
Sbjct: 181 TIPPCEHHVNG-SRPHCSGEGGDTPKCVHSC-EAGYSPTYTKDKHYGKSSYSVEASVEQI 238
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
+ EI +GP F +Y+DF YKSGVY+HT+ + L H+ K++GWG E+G PYWL
Sbjct: 239 QAEISQNGPVEGAFIVYEDFVMYKSGVYQHTTGSALGG--HAIKVLGWGEEDGVPYWLCA 296
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
N+W WG+ G KILRG C E I AG PK
Sbjct: 297 NSWNTDWGENGFFKILRGSDHCGIESEIVAGIPK 330
>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
Length = 334
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 132/336 (39%), Positives = 172/336 (51%), Gaps = 28/336 (8%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
+ Y + YI+ IN A TW AG NF LS + + L +K + + P KT+
Sbjct: 17 QAYFLEEDYINHINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 74
Query: 77 DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
D Y S +P FDAR++W C TIG V D G C + F AF+DR CI + G+
Sbjct: 75 DEAYNNWSNRIPSYFDARKKWRKCLTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEF 134
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N LS E +A CC C + CS G + W K G VTGG+Y GCQP + P
Sbjct: 135 NELLSPEELAFCCHKCGF----GCSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPP 190
Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
C +G+ +C + P K H RCT YG F +D H T Y++
Sbjct: 191 CPLDEYGNN----TCSGK--PTEKNH-RCTRMCYGNQDLDFKEDHHYTRDAYYL--TYGT 241
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
I+ ++LA+GP A+F +YDDF YKSGVY NA YL H+ KLIGWG E G PYW
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 298
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
L++N+W WGD+G KI RG EC + G P
Sbjct: 299 LLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
Length = 334
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 133/336 (39%), Positives = 171/336 (50%), Gaps = 28/336 (8%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
+ Y YI+QIN A TW AG NF LS + + L +K + + KT+
Sbjct: 17 QAYFLEVDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASLVMFKTH 74
Query: 77 DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
D Y S +P FDAR++W C TIG V D G C + F AF+DR CI + G+
Sbjct: 75 DEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEF 134
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N LS E +A CC C + CS G R W K G VTGG+Y GCQP + P
Sbjct: 135 NELLSPEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPP 190
Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
C +G+ +C + P K H RCT YG F +D H T Y++
Sbjct: 191 CPLDEYGNN----TCSGK--PAEKNH-RCTQMCYGNQNLDFKEDHHYTRDAYYL--TYGT 241
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
I+ ++LA+GP A+F +YDDF YKSGVY NA YL H+ KLIGWG E G PYW
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 298
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
L++N+W WGD+G KI RG EC + G P
Sbjct: 299 LLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334
>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
Length = 383
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 118/336 (35%), Positives = 169/336 (50%), Gaps = 16/336 (4%)
Query: 12 TLVRGELYKFSDA-YIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
T + E SD ID +N W A N NL ++ L+ D
Sbjct: 54 TKIAPEAENLSDQELIDYVNSHQTLWKAEMN-KFNLYSNTVKYGLLGVNNMKQSVD---- 108
Query: 71 GDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
+K P +T+ P+ FDAR+ WP C ++ +V D +C + AAV A SDR CI
Sbjct: 109 -GKKNLSPTRHSTIFIPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVEAMSDRICIM 167
Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
SKG++ LS + + SCCK C + C G W + RG VTG +Y + +GC+P
Sbjct: 168 SKGKKQVTLSADDLLSCCKTCGF----GCFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRP 223
Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
PC HH + C++ P KC +C + YG+ + DK+ Y V+ N ++
Sbjct: 224 YPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKC-DKNYGKSYKADKYYGEQVYNVESNVES 282
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLV 308
I+KEI+ GP A+F +Y DF +Y G+YKH + + H+ K++GWG + G PYWL
Sbjct: 283 IQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGG--HAVKVLGWGIDQGVPYWLA 340
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
N+W WG+ G +ILRG EC E I AG PK
Sbjct: 341 ANSWNTDWGEDGYFRILRGVNECGIESGIIAGIPKQ 376
>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
Length = 340
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 122/346 (35%), Positives = 177/346 (51%), Gaps = 24/346 (6%)
Query: 1 MIHILVFLLGCTLVRGELYK-FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIAD 58
+ ++ L ++ E +K SD I IN N W A E+ R + D
Sbjct: 8 IASLITHLDAHISIKNEKFKPLSDDIISYINEHPNAGWRA---------EKSNRFHSLDD 58
Query: 59 AKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
A+ + R P R+ P E++ +P FD+R++WP C +I + D C +
Sbjct: 59 ARIQMGARREEPDLRRKRRPTVDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCW 118
Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
F AV A SDR CI+S G+QN LS + SCC+ C C G + W+F K G
Sbjct: 119 AFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCESC----GLGCEGGILGPAWDFWVKEG 174
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
VTG + TGC+P C HH + P C ++ +C C Y + QDK
Sbjct: 175 IVTGSSKENHTGCEPYPFPKCEHH-TKGKYPPCGSKIYKTPRCKQTCQK-KYKTPYTQDK 232
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
HR +Y V ++E AI+KEI+ +GP A+F +Y+DF +YKSG+YKH + L H+ +
Sbjct: 233 HRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGG--HAIR 290
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
+IGWG EN TPYWL+ N+W WG+ G +I+RG+ EC E + A
Sbjct: 291 IIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIA 336
>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 365
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 122/333 (36%), Positives = 173/333 (51%), Gaps = 28/333 (8%)
Query: 24 AYIDQINREANTWTA----GRNFPANL--SEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
+ +D++N + N WTA GR + +L +++ FL + + K Y
Sbjct: 44 SLVDEVNSKQNLWTASTEQGRFYGRSLGDAKKLCGTFLNGTEEL----------EEKVYP 93
Query: 78 PEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
E +PD FDAR+ + C IGHV D AC + F V AF+ R CIKS G+ N+
Sbjct: 94 AEELVDIPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNARVCIKSGGKLNQL 153
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT------GCQPST 190
LS + +CC I + + CS G+ +W FLH G V+GG + GC P
Sbjct: 154 LSAADMLACCNIGHFCLSFGCSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPYN 213
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAI 249
C+HH C + C + C N YG F +D+H T + + +I
Sbjct: 214 FPKCAHHQKESDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSI 273
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
KKEI+ +GPT+A F++Y+DF YKSGVYKHTS L H+ ++IGWGTE G YWLV+
Sbjct: 274 KKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGG--HAVEIIGWGTEKGVDYWLVM 331
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
N+W WGD GT KI++G +C + +I AG P
Sbjct: 332 NSWNEEWGDHGTFKIVQG--DCGIDDMILAGTP 362
>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sm31; Flags: Precursor
gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
Length = 340
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 120/348 (34%), Positives = 180/348 (51%), Gaps = 24/348 (6%)
Query: 1 MIHILVFLLGCTLVRGELYK-FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIAD 58
+ ++ FL V+ E ++ SD I IN N W A E+ R + D
Sbjct: 8 IASLITFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRA---------EKSNRFHSLDD 58
Query: 59 AKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
A+ + R P R+ P +++ +P FD+R++WP C +I + D C +
Sbjct: 59 ARIQMGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCW 118
Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
F AV A SDR CI+S G+QN LS + +CC+ C C G + W++ K G
Sbjct: 119 SFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCESC----GLGCEGGILGPAWDYWVKEG 174
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
VT + TGC+P C HH + P C ++ +C C Y + QDK
Sbjct: 175 IVTASSKENHTGCEPYPFPKCEHH-TKGKYPPCGSKIYNTPRCKQTCQR-KYKTPYTQDK 232
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
HR +Y V ++E AI+KEI+ +GP A+F +Y+DF +YKSG+YKH + L H+ +
Sbjct: 233 HRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGG--HAIR 290
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
+IGWG EN TPYWL+ N+W WG+ G +I+RG+ EC+ E + AG+
Sbjct: 291 IIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVIAGR 338
>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 333
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 173/348 (49%), Gaps = 35/348 (10%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
+ VF C+ + Y + YI IN A TW AG NF +++ L +
Sbjct: 10 VFVFFSSCSE---QTYFLNKDYISTINSVAKTWKAGINFHPETPLKFILGLLGSKG---- 62
Query: 64 QSDRPLPGDRKTYDPEYS--ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
D G K++DP YS +P+ FDAR++W NC TIG + D G C + F+ GAF
Sbjct: 63 -VDVSSAGPFKSHDPLYSPAGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAF 121
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
+DR CI S G N+ LS E+V SCC C C G R W + K G VTGG++
Sbjct: 122 ADRLCIASNGSFNQLLSAEHVTSCCYRC----GLGCQGGYPIRAWRYYSKHGLVTGGNFN 177
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC---TNPTY-GRGFFQDKHRTT 237
GCQP PC+ + SC Q KC +C T+ +Y G + ++
Sbjct: 178 SFEGCQPYMFPPCTGNN------SCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYV 231
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLI 295
L Y D ++ +I+ +GP ++F +YDDF YKSGVY + NA YL HS K I
Sbjct: 232 LAY------DNMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNA---TYLGGHSVKCI 282
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
GWG E YWL++N+W WGD G KI RG EC E AG P+
Sbjct: 283 GWGVERNVSYWLMMNSWNNTWGDGGNFKIRRGTNECQVEDSSTAGMPE 330
>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 338
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 165/323 (51%), Gaps = 21/323 (6%)
Query: 26 IDQINREANT-WTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
ID IN +ANT W AG+N F LS + L F+ LP A
Sbjct: 29 IDYINNKANTTWRAGKNKRFTDALSAKSQMGSL------FNPGGSMLPTKSFYLSSTQKA 82
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P FDAR+ WP+C TIG + D G C + F A A SDR CI S+G++ +S + +
Sbjct: 83 ALPSEFDARKAWPDCPTIGEIRDQGTCGSCWAFGATEAMSDRICIHSEGKEVVRISADDL 142
Query: 143 ASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
SCC + C + C+ G W + G V+GG YG GC+P I PC HH S
Sbjct: 143 LSCCGLFCGF----GCNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPYEIPPCEHHTSG- 197
Query: 202 TLPSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P C+ N K PK C +C G+ + DKH + Y V +E+ I EIL +GP
Sbjct: 198 NRPDCKGNSKTPK--CQRQCVESFDGK-YQADKHFASNVYNVRASEEDIMNEILVYGPVE 254
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A F +Y DF YKSGVY+H L H+ K++GWG ENG PYWL N+W WGD G
Sbjct: 255 ADFIVYADFLTYKSGVYQHVKGGFLGG--HAVKILGWGEENGVPYWLCANSWNTDWGDGG 312
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
KILRG C E I AG PK
Sbjct: 313 FFKILRGYNHCKIEADINAGIPK 335
>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
Length = 344
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 129/324 (39%), Positives = 164/324 (50%), Gaps = 15/324 (4%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S I IN EANT W A + S +R+ L A D + LP Y P
Sbjct: 33 LSSELIHFINHEANTTWKAAPSSRFK-SVSDIRRMLGALP---DPNGGYLPTLCTGYTPS 88
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P FDAR+ WP+C +I + D +C + F AV A SDR CI+SKG LS
Sbjct: 89 LD-ELPKEFDARKHWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSA 147
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
E + +CC C C+ G W++ + G VTG Y GCQP PC HH
Sbjct: 148 ENLVACCSSC----GMGCNGGFPHSAWSYWKRSGIVTGDLYNTTDGCQPYEFPPCEHHVV 203
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
P PSC V KC T C P Y + +DK Y V N++AI KE++ HGP
Sbjct: 204 GPR-PSC-GGDVETPKCKTTC-QPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVMDHGPV 260
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
F +Y DF +YKSGVY+H S L H+ +L+GWG ENG PYWL+ N+W WGD
Sbjct: 261 EVDFEVYADFPNYKSGVYQHVSGGLLGG--HAVRLLGWGEENGVPYWLIANSWNSDWGDN 318
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G KI+RG+ EC E + AG PK
Sbjct: 319 GYFKIIRGRNECGIESDVNAGIPK 342
>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
Length = 333
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 128/335 (38%), Positives = 170/335 (50%), Gaps = 27/335 (8%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
+ Y + YI QIN A TW AG NF LS + L +K + + P KT
Sbjct: 17 QAYFLEEDYIKQINANAKTWEAGVNFDPKLSIDSFVNLL--GSKGVQAAKKASPDMFKTG 74
Query: 77 DPEYSAT--VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
D Y+ +P FDAR++W C +IG V D G C + F AF+DR CI ++G+ N
Sbjct: 75 DKAYNLAQRIPSNFDARKKWKKCLSIGEVRDQGHCGSCWAFGTSSAFADRLCIATEGEFN 134
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS E + CC C + C+ G R W K G VTGG+Y GCQP + PC
Sbjct: 135 ELLSAEELTFCCHKCGF----GCNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPC 190
Query: 195 --SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAI 249
+G+ +C + + K + RCT YG F D H T Y++ I
Sbjct: 191 PLDEYGNN----TCHGKPMEK---NHRCTRMCYGDQDLDFNNDHHYTRDAYYL--TYGTI 241
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWL 307
+ ++L +GP A+F +YDDF YKSGVY T NA +YL H+ KLIGWG E G PYWL
Sbjct: 242 QNDVLTYGPIEASFEVYDDFPSYKSGVYVKTENA---SYLGGHAVKLIGWGEEYGVPYWL 298
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
++N+W WGD+G KI RG EC + G P
Sbjct: 299 LVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 333
>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
Length = 373
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 162/321 (50%), Gaps = 17/321 (5%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
I+ INR TW AG N + E+ + + + S LP +T D +P
Sbjct: 66 IEYINRLNTTWKAGHNSGYDNPEDVIPLLGVRP----ENSRYRLP--ERTLDVSALRVLP 119
Query: 86 DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP---LSTEYV 142
+ FDARE WP+C TI + D G+C + F AV A SDR CI S + R L+ + V
Sbjct: 120 ENFDAREHWPDCPTIREIRDQGSCGSCWAFGAVEAISDRTCIHSPEGKPRVIAHLAADDV 179
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C C+ G W++ +G VTGG+Y GC P I C HH + T
Sbjct: 180 LSCCTEC----GAGCNGGFPGSAWSYWVHKGIVTGGNYDSDEGCMPYPIKACDHHVNG-T 234
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
L C+ P +C C Y F DKH Y V I+ EI+ +GP A
Sbjct: 235 LGPCDKTIPPTPRCVRMCRK-GYDVDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVEAD 293
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+DF HYKSGVY+ +++ L H+ +L+GWG ENG PYWL N+W WGD+G
Sbjct: 294 FTVYEDFLHYKSGVYQRHTDSALGG--HAIRLLGWGVENGVPYWLAANSWNTEWGDKGFF 351
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KILRG EC E I AG PK
Sbjct: 352 KILRGSDECGIESDIVAGLPK 372
>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
Length = 332
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 122/342 (35%), Positives = 169/342 (49%), Gaps = 17/342 (4%)
Query: 4 ILVFLLGCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
IL LL C + SD +ID IN TW AGRNF N ++YL+ +A
Sbjct: 5 ILFSLLICGTFSASIPTDPLSDEFIDYINTLQTTWRAGRNFAPNTPKKYLKS--LAGVHK 62
Query: 62 FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
+ LP + + D T+PD FDAR+QWPNC +I + D G+C + +
Sbjct: 63 NANNAFTLPKRKVSLD----VTIPDEFDARKQWPNCPSITDIRDQGSCGSCWALELLRLC 118
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
S G+ LS E + +CC C C G W + G V+GG+YG
Sbjct: 119 LIVFVSHSNGKLQVHLSAENLVTCCGSC----GAGCFGGDPGSAWEYWRDVGIVSGGNYG 174
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
+ GCQP +I+PC HH + P C + C +C Y + +D H Y
Sbjct: 175 SKEGCQPYSIAPCEHHIPG-SRPPCRGEG-HTADCRKQCEK-GYSIPYDKDLHYAEFVYS 231
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
+ + I+ EIL +GP A F +Y+D YK GVYKH + A + H+ K++GWG EN
Sbjct: 232 TERDVKEIQTEILKNGPVEAAFFVYEDLLTYKEGVYKHVAGAPVGG--HAIKILGWGVEN 289
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
GTPYWL+ N+W WG+ G KILRG EC E ++AG P+
Sbjct: 290 GTPYWLIANSWNTDWGNNGFFKILRGSDECGIEIDVSAGLPR 331
>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
Length = 330
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 120/332 (36%), Positives = 166/332 (50%), Gaps = 21/332 (6%)
Query: 14 VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDR 73
R L S ++ IN+ TW AG NF N+ Y+++ L G +
Sbjct: 18 ARPRLKPLSSEMVNYINKVNTTWKAGHNF-HNVDFSYVQRLC----------GTMLKGPK 66
Query: 74 KTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
+Y+ +P FD+REQWPNC T+ + D G+C + F A A SDR CI S
Sbjct: 67 LPIMVQYAGDMKLPKAFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAISDRLCIHSNA 126
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
+ + +S E + +CC C C+ G W+F K G V+GG Y GC+P TI
Sbjct: 127 KVSVEISAEDLLTCCDSC----GMGCNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTI 182
Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKK 251
PC HH + + P C + +C ++C Y + +DKH +Y V +E I+
Sbjct: 183 PPCEHHVNG-SRPPCTGEGGDTPQCLSQC-EAGYTPSYREDKHYGKTSYSVLSDEAEIQY 240
Query: 252 EILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINT 311
EI +GP F +Y+DF YKSGVY+H S + + H+ K++GWG ENG PYWL N+
Sbjct: 241 EIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSAVGG--HAIKVLGWGEENGVPYWLCANS 298
Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
W WGD G K LRG C E I AG PK
Sbjct: 299 WNTDWGDNGFFKFLRGSDHCGIESEIVAGIPK 330
>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
Length = 333
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 174/348 (50%), Gaps = 35/348 (10%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
+ VF C+ + Y + YI IN A TW AG NF +++ L +
Sbjct: 10 VFVFFSSCSE---QTYFLNKDYISTINSVAKTWKAGINFHPETPLKFILGLLGSKGVEVS 66
Query: 64 QSDRPLPGDRKTYDPEYSAT--VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
+ G K++DP YS T +P+ FDAR++W NC TIG + D G C + F+ GAF
Sbjct: 67 SA-----GPFKSHDPLYSPTGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAF 121
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
+DR CI S G N+ LS E+V SCC C C G R W + K G VTGG++
Sbjct: 122 ADRLCIASNGSFNQLLSAEHVTSCCYRC----GLGCQGGYPIRAWRYYSKHGLVTGGNFN 177
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC---TNPTY-GRGFFQDKHRTT 237
GCQP PC+ + SC Q KC +C T+ +Y G + ++
Sbjct: 178 SFEGCQPYMFPPCTGNN------SCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYV 231
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLI 295
L Y D ++ +I+ +GP ++F +YDDF YKSGVY + NA YL HS K I
Sbjct: 232 LAY------DNMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNA---TYLGGHSVKCI 282
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
GWG E YWL++N+W WGD G KI RG EC E AG P+
Sbjct: 283 GWGVERNVSYWLMMNSWNSTWGDGGYFKIRRGTNECQVEDSSTAGVPE 330
>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 331
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 166/346 (47%), Gaps = 35/346 (10%)
Query: 4 ILVFLLGCTLV-RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
I+ LL L +G S+ +I+ IN + +TW AG+NF NLS + ++ L A
Sbjct: 6 IITLLLPIVLSYKGSPNPLSNDFINYINSKQSTWVAGKNFDENLSIQEIKNLLGAKKGKL 65
Query: 63 DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAF 121
G K + VP+ FDARE W C I V D C + AA A
Sbjct: 66 --------GVAKEFTHSEDIQVPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAM 117
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
SDRRCI S+G+ P+S E + SCC C Y C G W++ G TGG YG
Sbjct: 118 SDRRCIASQGKLKVPVSAENLLSCCDSCGY----GCEGGYPTMAWSYWIDTGITTGGLYG 173
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNP--------TYGRGFFQDK 233
+ GCQP ++ PC HH + C C +C + T+G G ++
Sbjct: 174 SKQGCQPYSLQPCEHHTEGNKV-QCSTLDYDTPSCKHKCDDSALNYKSELTFGSGSVRNF 232
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
+ I+KEIL +GP A F +Y DF +YKSGVY+H + L H+ +
Sbjct: 233 YSVA----------NIQKEILTNGPVEAAFDVYSDFVNYKSGVYQHVAGEYLGG--HAVR 280
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
++GWG E+G PYWLV N+W WGD+G KI RG E FE I A
Sbjct: 281 ILGWGEESGVPYWLVANSWNEDWGDKGLFKIRRGNNESGFEDSIVA 326
>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
Length = 338
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/347 (35%), Positives = 177/347 (51%), Gaps = 20/347 (5%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ +L + + + Y +ID IN +A TW AG NF S+E++ + L ++
Sbjct: 4 VLMLLSVIFVSVYMTEQAYFLEKDFIDNINAQATTWKAGVNFDPKTSKEHIMKLL--GSR 61
Query: 61 YFDQSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
++ K+ D EY T +P FDAR +W +C TIG V D G C + A
Sbjct: 62 GVQIPNKNNMNLYKSEDAEYDNTYIPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSS 121
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
AF+DR C+ + N LS E + CC C + C+ G + W K+G VTGGD
Sbjct: 122 AFADRLCVATNADFNELLSAEEITFCCHTCGF----GCNGGYPIKAWKRFSKKGLVTGGD 177
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTT 237
Y GC+P + PC + +C + ++ + RCT YG F + HR T
Sbjct: 178 YKSGEGCEPYRVPPCPNDDQGNN--TCAGKP---MESNHRCTRMCYGDQDLDFDEDHRYT 232
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLI 295
Y+ +I+K+++ +GP A+F +YDDF YKSGVY + NA +YL H+ KLI
Sbjct: 233 RDYYYL-TYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENA---SYLGGHAVKLI 288
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
GWG E G PYWL++N+W WGD G KI RG EC + AG P
Sbjct: 289 GWGEEYGVPYWLMVNSWNEDWGDHGFFKIQRGTNECGVDNSTTAGVP 335
>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
Length = 340
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 121/346 (34%), Positives = 178/346 (51%), Gaps = 22/346 (6%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
+L +L + + Y + YI++IN +A TW AG NF +E++ + L +
Sbjct: 7 LLSVILFSVYMTEQAYFLEEDYINKINEQATTWKAGVNFDPKTPKEHILKLLGSKGVQIP 66
Query: 64 Q--SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
+ + + + YD + +P +FDAR++W NC TIG + D G C + A AF
Sbjct: 67 SKLNHKMYKSEDENYDNLF-GRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAF 125
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
+DR C+ S N+ LS E + CC C + C+ G + W K G VTGGDY
Sbjct: 126 ADRLCVVSNEDFNQLLSAEELTFCCHKCGF----GCNGGYPIKAWEHFKKHGLVTGGDYK 181
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTL 238
GC+P + PC + S +C + ++ + RCT YG F +D T
Sbjct: 182 SGEGCEPYRVPPCPYDESGNN--TCAGKP---MEANHRCTRMCYGDQDLDFDEDHRYTRD 236
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIG 296
+Y++ +I+K++L +GP A+F +YDDF YKSGVY + NA +YL H+ KLIG
Sbjct: 237 SYYL--TYGSIQKDVLTYGPVEASFDVYDDFPSYKSGVYIRSENA---SYLGGHAAKLIG 291
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG E G PYWL++N+W WGD G KI RG EC + G P
Sbjct: 292 WGEEYGVPYWLMVNSWNADWGDNGLFKIQRGTNECGIDNSTTGGVP 337
>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
Length = 319
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 127/349 (36%), Positives = 174/349 (49%), Gaps = 38/349 (10%)
Query: 2 IHILVFLL--GCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF----- 54
++ L+FLL ++ R E+ S +ID IN++ + W A RNFP N + EYL +
Sbjct: 1 MYFLIFLLLASISVSRAEIDIQSQDFIDSINQKQSHWVARRNFPENTTNEYLYKLNGFLG 60
Query: 55 LIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
L D Y + + ++P+ +P FDAR++WP C ++ + D G+C +
Sbjct: 61 LHPDPNYMPEKIK------HNFNPQ---DIPKTFDARKKWPKCDSLNRIRDQGSCGSCWA 111
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
FAAV SDR CI S G + S E + SCC C SCS G + ++F K+G
Sbjct: 112 FAAVETMSDRICIHSSGAKKFFFSAEDLLSCCTAC-----GSCSGGYMMAAFDFYIKQGV 166
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
V+GGD GC+P T H T PSC T+ Y + DKH
Sbjct: 167 VSGGDLNSNEGCRPYTADA---HDKGVT-PSC-----------TKSCRKGYPTSYSSDKH 211
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+ Y VD I+ EI+ +GP +F +Y DFY+Y SGVY H S N H K+
Sbjct: 212 YGSKDYIVDAGVSNIQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGN--HIVKI 269
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+GWGTE YWL+ N+WG WG+ G KILRGK EC E A PK
Sbjct: 270 VGWGTEKEQDYWLIANSWGSSWGEHGFFKILRGKNECGIENNPYAVLPK 318
>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/344 (35%), Positives = 174/344 (50%), Gaps = 25/344 (7%)
Query: 5 LVFLLGCTL----VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
L +LG L R L S ++ IN+ TW AG NF N+ Y+++
Sbjct: 5 LFLVLGSGLSISWARPHLPPLSHEMVNFINKANTTWKAGHNF-HNVDYSYVKRLC----- 58
Query: 61 YFDQSDRPLPGDRKTYDPEYS--ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
L G + + +Y+ +P FD R QWPNC T+ V D G+C + F A
Sbjct: 59 -----GTLLKGPKLSTMVQYTEDMELPKNFDPRLQWPNCPTLKEVRDQGSCGSCWAFGAA 113
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI S + + +S+E + SCC+ C C+ G +F K G V+GG
Sbjct: 114 EAISDRVCIHSNAKVSVEISSEDLLSCCESC----GMGCNGGYPSAACDFWTKEGLVSGG 169
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y GC+P +I PC HH + T P C+ ++ +C +C P Y G+ QDKH
Sbjct: 170 LYDSHIGCRPYSIPPCEHHVNG-TRPPCKGEEGDTPQCTNQC-EPGYTPGYKQDKHFGKR 227
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y V +E I KE+ +GP F +Y+DF YKSGVY+H S + + H+ K++GWG
Sbjct: 228 SYSVPSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSGVYRHVSGSAVGG--HAIKVLGWG 285
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
E G PYWL N+W WG+ G KI+RG+ C E + AG P
Sbjct: 286 EEGGIPYWLAANSWNTDWGENGFFKIVRGEDHCGIESEMVAGIP 329
>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
Length = 378
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 115/323 (35%), Positives = 160/323 (49%), Gaps = 10/323 (3%)
Query: 23 DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
D ID +N N WTA R F + E ++ + + S + KT D +
Sbjct: 43 DDLIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 102
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FD+R+ WP C +I + D +C + F AV A SDR CI S G+ LS +
Sbjct: 103 D--IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSAD 160
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCCK C + C+ G W + K G VTG +Y GC+P PC HH
Sbjct: 161 DLLSCCKSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKK 216
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
C + P KC +C + + + +DK Y V D+ +AI+KE++ HGP
Sbjct: 217 THFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLE 276
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y+DF +Y GVY HT KL H+ KLIGWG ++G PYW V N+W WG+ G
Sbjct: 277 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIDDGIPYWTVANSWNTDWGEDG 334
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
+ILRG EC E + G PK
Sbjct: 335 FFRILRGVDECGIESGVVGGIPK 357
>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
Full=Cysteine protease-related 6; Flags: Precursor
gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
Length = 379
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 115/323 (35%), Positives = 160/323 (49%), Gaps = 10/323 (3%)
Query: 23 DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
D ID +N N WTA R F + E ++ + + S + KT D +
Sbjct: 44 DDLIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 103
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FD+R+ WP C +I + D +C + F AV A SDR CI S G+ LS +
Sbjct: 104 D--IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSAD 161
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCCK C + C+ G W + K G VTG +Y GC+P PC HH
Sbjct: 162 DLLSCCKSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKK 217
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
C + P KC +C + + + +DK Y V D+ +AI+KE++ HGP
Sbjct: 218 THFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLE 277
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y+DF +Y GVY HT KL H+ KLIGWG ++G PYW V N+W WG+ G
Sbjct: 278 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIDDGIPYWTVANSWNTDWGEDG 335
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
+ILRG EC E + G PK
Sbjct: 336 FFRILRGVDECGIESGVVGGIPK 358
>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
Length = 369
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 115/323 (35%), Positives = 160/323 (49%), Gaps = 10/323 (3%)
Query: 23 DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
D ID +N N WTA R F + E ++ + + S + KT D +
Sbjct: 34 DDLIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 93
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FD+R+ WP C +I + D +C + F AV A SDR CI S G+ LS +
Sbjct: 94 D--IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSAD 151
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCCK C + C+ G W + K G VTG +Y GC+P PC HH
Sbjct: 152 DLLSCCKSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKK 207
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
C + P KC +C + + + +DK Y V D+ +AI+KE++ HGP
Sbjct: 208 THFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLE 267
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y+DF +Y GVY HT KL H+ KLIGWG ++G PYW V N+W WG+ G
Sbjct: 268 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIDDGIPYWTVANSWNTDWGEDG 325
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
+ILRG EC E + G PK
Sbjct: 326 FFRILRGVDECGIESGVVGGIPK 348
>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
Length = 356
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/348 (34%), Positives = 177/348 (50%), Gaps = 17/348 (4%)
Query: 1 MIHILVFLLGC--TLVRGELYK------FSDAYIDQINREANTWTAGRNFPANLSEEYLR 52
+ +IL+F L C + V G + + + IN TW AGRN ++
Sbjct: 12 LFYILLFSLPCFYSTVFGIPFGSRNQRLYFNKMATYINNLQTTWKAGRNPYFETVPSHVI 71
Query: 53 QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
Q ++ + +P +Y+ +P FD+R+QWP C TIG + D C +
Sbjct: 72 QGMMGVRRSSKLETNSIPLPVISYE-HIDMEIPVEFDSRKQWPYCPTIGEIRDQSNCGSC 130
Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
F AV A SDR CI + G+Q +S+ + SCCKIC + C G + W+F K
Sbjct: 131 WAFGAVEAISDRICIATDGRQKPHISSTDLLSCCKICGF----GCQGGDPHQAWSFWVKY 186
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
G VTGG+Y GC+P +PC+HH + P C + P C C + TY + +D
Sbjct: 187 GLVTGGNYTTHDGCRPYPFAPCNHHSNGTYGP-CSHDLEPTPVCKKACQS-TYKIQYNKD 244
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
K+ Y + + ++KE++ +GP F +Y+DF YK+GVY+H + + L H+
Sbjct: 245 KYYGLKAYSLHNKASDLQKELMMNGPMEVAFEVYEDFLLYKTGVYQHHTGSVLGG--HAV 302
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+L+GWG ENG PYWL+ N+W WGD+G KI RG+ EC E AG
Sbjct: 303 RLLGWGEENGVPYWLLANSWNTEWGDKGFFKIYRGRNECGIESEAVAG 350
>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
Length = 346
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 112/349 (32%), Positives = 171/349 (48%), Gaps = 20/349 (5%)
Query: 1 MIHILVFLLGCTLVRGELYKFS-DAYIDQINREANTWTAG-----RNFPANLSEEYLRQF 54
++ + F+ + G+ + + D +D +N+ N +TA +P + +
Sbjct: 11 LVAVAAFVPQSERILGKNVELTGDDLVDYVNKAQNLFTAKLSPRFSEYPTAIKRRLMGSK 70
Query: 55 LIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
+A + ++ T+D + +P FD+R QWPNC +I + D +C +
Sbjct: 71 YVAIPSKYRVNEV-------THDDIDDSAIPSSFDSRTQWPNCPSIKSIRDQSSCGSCWA 123
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
F A A +DR CI SKG +S + + SCC C + C G + WN+ ++G
Sbjct: 124 FGAAEAMTDRICIASKGAIQFTVSADDLLSCCDECGF----GCDGGFPYAAWNYWVEKGI 179
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
V+GG Y ++GC+P PC HH + C P C +C + Y + DK
Sbjct: 180 VSGGSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTCEHKCQS-GYATAYTNDKR 238
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
Y V AI+KEI+ HGP + +Y+DF HY G+YKHT+ + L H+ K+
Sbjct: 239 YGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKGIYKHTAGSYLGG--HAVKM 296
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
IGWGTENG PYW+ N+W WG+ G +ILRG EC E + AG PK
Sbjct: 297 IGWGTENGIPYWICSNSWNSDWGENGFFRILRGTDECGIESGVVAGLPK 345
>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
Length = 287
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 111/299 (37%), Positives = 158/299 (52%), Gaps = 14/299 (4%)
Query: 36 WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
W AGRNFP + ++++ + D + LP + T+D + A++P+ FD R++WP
Sbjct: 1 WRAGRNFPIHTPFAHIKKLM---GSLKDDNILKLP--KVTHDADLIASLPENFDPRDKWP 55
Query: 96 NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
+C T+ + D G+C + F AV A +DR CI S ++ S E + SCC IC
Sbjct: 56 DCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGL---- 111
Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
C+ G W + G V+GG+Y GC+P I PC HH +P + K P K
Sbjct: 112 GCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTP--K 169
Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
C C + +Y F +DK Y V +ED IK E+ +GP F +Y D YKSG
Sbjct: 170 CEKTCES-SYTVPFKKDKRYGKHVYSVSGHEDNIKAELFKNGPVEGAFTVYSDLLSYKSG 228
Query: 276 VYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
VY+HT L H+ K++GWG ENG+ YWL+ N+W WGD G +KILRG+ C E
Sbjct: 229 VYQHTHGNALGG--HAIKILGWGVENGSKYWLIANSWNSDWGDNGFLKILRGEDHCGIE 285
>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
[Tribolium castaneum]
gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 124/342 (36%), Positives = 173/342 (50%), Gaps = 22/342 (6%)
Query: 6 VFLLGCTLVRGEL--YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
V L L G L + SD +I+ IN + TW AGRNF + +++ L K +
Sbjct: 9 VVLATIALSYGGLNPHPLSDEFINAINSKKTTWKAGRNFDIHTPLANIKKLLGVLPKKAN 68
Query: 64 QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI-GHVPDTGACAAPHIFAAVGAFS 122
L K + + +A +P+ FDARE WP C +I G + D +C + F A A S
Sbjct: 69 ARQLEL----KVHSVDVNA-IPESFDAREAWPECASIIGDIRDQASCGSCWAFGAAEAMS 123
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR CI S +STE + +CC Y+ C+ G W + + G VTGG Y
Sbjct: 124 DRICIHSNATVKVSISTEDLNTCC----YECGDGCNGGWPAEAWAYWAETGIVTGGKYET 179
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
+ GC+ T+ PC HH + LP+C VP +C C ++ R Y
Sbjct: 180 KDGCKAYTVPPCEHH-TEGDLPAC-GDIVPTPQCKKECDAGVDIE--YKSDLRKGSAYQT 235
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTE 300
+E I+ EI+ +GP A F +Y+DF +YKSGVY+ T+ NY H+ K++GWG E
Sbjct: 236 SSDESQIQTEIMTNGPVEADFDVYEDFLNYKSGVYQQTTG----NYAGGHAIKILGWGVE 291
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+GTPYWL N+W WGD+G KILRG+ EC E I G P
Sbjct: 292 DGTPYWLAANSWNEDWGDKGYFKILRGQNECGIESDIIGGIP 333
>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
Length = 342
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 180/346 (52%), Gaps = 17/346 (4%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIA 57
+I + L L E+ SD I IN+ + WTA R+ E+ ++
Sbjct: 8 IISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLED---ARILL 64
Query: 58 DAKYFDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
A + D+ R R T D + S +P FD+R++W C +I ++ D C + FA
Sbjct: 65 GAMHEDEELRK--KRRPTVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCGSCWAFA 122
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
AV A SDR CI+SKG+++ LS + SCC C C G W++ + G VT
Sbjct: 123 AVEAMSDRICIESKGKKSVELSAVDLLSCCTEC----GLGCQGGFPGAAWDYWVEDGIVT 178
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
G + TGCQP C HH + P C + KCH +C Y + +DK+
Sbjct: 179 GSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQK-GYKTPYGKDKYYG 236
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
++Y V +NE+AIKKEI+ HGP A F ++ DF +YKSG+YK+ + A++ H+ ++IG
Sbjct: 237 RMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGG--HAVRIIG 294
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG E TPYWL+ N+W WG++G +ILRGK EC E + G P
Sbjct: 295 WGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340
>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
Length = 342
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 180/346 (52%), Gaps = 17/346 (4%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIA 57
+I + L L E+ SD I IN+ + WTA R+ E+ ++
Sbjct: 8 IISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLED---ARILL 64
Query: 58 DAKYFDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
A + D+ R R T D + S +P FD+R++W C +I ++ D C + FA
Sbjct: 65 GAMHEDEELRK--KRRPTVDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFA 122
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
AV A SDR CI+SKG+++ LS + SCC C C G W++ + G VT
Sbjct: 123 AVEAMSDRICIESKGKKSVELSAVDLLSCCTEC----GLGCQGGFPGAAWDYWVEDGIVT 178
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
G + TGCQP C HH + P C + KCH +C Y + +DK+
Sbjct: 179 GSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQK-GYKTPYKKDKYYG 236
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
++Y V +NE+AIKKEI+ HGP A F ++ DF +YKSG+YK+ + A++ H+ ++IG
Sbjct: 237 RMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGG--HAVRIIG 294
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG E TPYWL+ N+W WG++G +ILRGK EC E + G P
Sbjct: 295 WGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340
>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
Length = 335
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 171/345 (49%), Gaps = 14/345 (4%)
Query: 1 MIHILVFLLGCTL-VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
M IL+ L+G G D I +N + TWTAG PA LS + + L+ DA
Sbjct: 1 MRKILICLIGVLFQADGVPPSEIDRIIHYVNSQKTTWTAG--IPA-LSRNSMLKTLVTDA 57
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ + D + FDARE+WP C +I + D C FAA
Sbjct: 58 ATIGFKIQNFGVSQANSD------LSPSFDARERWPECMSIPQINDISECKTSWAFAAAE 111
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
+ SDR CI S G +N LS E + SCC + + C G+ F+ W ++ K G TGG
Sbjct: 112 SMSDRLCINSGGFKNTILSAEELLSCCT-GMFSCGEGCEGGNPFKAWQYIQKHGIPTGGS 170
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT-YGRGFFQDKHRTTL 238
Y + GC+P +I PC T P+C N P C +CT+ Y +D+H
Sbjct: 171 YESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYGVS 230
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+ +++ I+ +++ +GP ATF +YDDF Y +G+Y H + K + +L S ++IGWG
Sbjct: 231 VDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTGIYVHLTGNK-QGHL-SVRIIGWG 288
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G PYWL N+WG WG+ GT ++LRG EC E +G PK
Sbjct: 289 VWQGVPYWLCANSWGRQWGENGTFRVLRGTNECGLESNCVSGMPK 333
>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
Length = 342
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 121/348 (34%), Positives = 182/348 (52%), Gaps = 17/348 (4%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIA 57
+I + L L E+ SD I IN+ + WTA R+ E+ ++
Sbjct: 8 IISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLED---ARILL 64
Query: 58 DAKYFDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
A + D+ R R T D + S +P FD+R++W C +I ++ D C + FA
Sbjct: 65 GAMHEDEELRK--KRRPTVDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFA 122
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
AV A SDR CI+SKG+++ LS + SCC C C G W++ + G VT
Sbjct: 123 AVEAMSDRICIESKGKKSVELSAVDLLSCCTEC----GLGCQGGFPGAAWDYWVEDGIVT 178
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
G + TGCQP C HH + P C + KCH +C Y + +DK+
Sbjct: 179 GSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQK-GYKTPYKKDKYYG 236
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
++Y V +NE+AIKKEI+ HGP F ++ DF +YKSG+YK+ + A++ H+ ++IG
Sbjct: 237 RMSYNVLNNENAIKKEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGE--HAVRIIG 294
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG E TPYWL+ N+W WG++G ++LRGK EC E + +G P++
Sbjct: 295 WGVEKKTPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGLPRD 342
>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
Length = 324
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 124/347 (35%), Positives = 176/347 (50%), Gaps = 36/347 (10%)
Query: 4 ILVFLLG-CTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
++VF+L + + + SD +I+ IN + +TWTAGRNFP + E+L++ A
Sbjct: 6 VVVFVLTFSSALSAQNPILSDEFINSINAQQSTWTAGRNFPEDTPIEHLKRLNGALIT-- 63
Query: 63 DQSDRPLPGDRKTY----DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
L G +T+ PE +P+ FD R W C ++ ++ + G C + F +V
Sbjct: 64 ----PDLVGKNQTHVINVIPE---AIPETFDGRTHWSQCPSLKNIRNQGNCGSCWAFGSV 116
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
+DR CI SKG+ S + + +CC C K C G+ +R + + +G V+GG
Sbjct: 117 EVMTDRLCIASKGKTKFEFSADDLLACCTAC----GKGCDGGAPYRAFEYWVAKGIVSGG 172
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR-TT 237
DY GCQP + GSA N P KC T+C N Y + +DKH T
Sbjct: 173 DYNSNEGCQP-------YEGSAFL-----NSVTP--KCSTKCLNSKYTTPYAKDKHYGTD 218
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
Y N I+ EI+ +GP +Y+DFY YKSGVY+H S + H+ K+IGW
Sbjct: 219 FIYMTSKNVAEIQTEIMNNGPVVTHMDVYEDFYSYKSGVYQHVSGNSMGG--HAVKIIGW 276
Query: 298 GTENGTPYWLVINTWGPHWGDR-GTVKILRGKYECAFEYLIAAGKPK 343
GTE G PYWL+ N+WG W D G KILRGK C E I G P+
Sbjct: 277 GTEKGVPYWLIANSWGAKWADLDGFYKILRGKNHCKIETYIYGGTPQ 323
>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
Length = 342
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 125/348 (35%), Positives = 174/348 (50%), Gaps = 17/348 (4%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIA 57
++ ++ L L E+ SD I IN+ + WTA R+ S E R L A
Sbjct: 8 IVSLMSILTAHILTDNEVQFEPLSDEMIAYINQHPDAGWTASRSDRFK-SVEDARILLGA 66
Query: 58 DAKYFDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
+ + R T D + S +P FD+R++W C +I ++ D C FA
Sbjct: 67 ----MSEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCGPCWAFA 122
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
AV A SDR CI+SKG+++ LS + SCC C C G W++ + G VT
Sbjct: 123 AVEAMSDRICIQSKGKKSVELSAVDLLSCCTEC----GLGCQGGFPGAAWDYWVEEGIVT 178
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
G + TGCQP C HH + P+C + KC +C Y + +DK+
Sbjct: 179 GSSKENHTGCQPYPFPKCEHH-TKGKYPACGEKIYKTPKCQQKCQK-GYKTPYKKDKYYG 236
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
L+Y V EDAIKKEI+ HGP A F +Y DF +YKSG+YKH + H+ ++IG
Sbjct: 237 KLSYNVLSKEDAIKKEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIGG--HAVRIIG 294
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG E TPYWL+ N+W WG++G +ILRGK C E + AG P N
Sbjct: 295 WGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDVCGIESAVTAGLPHN 342
>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 340
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 183/351 (52%), Gaps = 26/351 (7%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIADA 59
++ +L +L + + Y YI++IN +A+TWTAG NF P+ E+ LR + +
Sbjct: 4 VLILLSVILFSVYMTEQAYFLEKDYINKINEKASTWTAGFNFDPSTPKEDILR---LLGS 60
Query: 60 KYFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
K + K+ D EY +P +FDAR++W +C TIG V D G C + A
Sbjct: 61 KGVQTPSKINHKMYKSEDKEYDNLFGRIPKKFDARKKWRHCTTIGAVRDQGNCGSCWAIA 120
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
AF+DR C+ + N+ LS E + CC C Y C+ G + W K G VT
Sbjct: 121 TSSAFADRLCVATNADFNQLLSAEEITFCCHKCGY----GCNGGYPIKAWERFKKHGLVT 176
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK--H 234
GG+Y GC+P + PC + S +C + + + + RCT YG H
Sbjct: 177 GGEYKSGEGCEPYRVPPCPYDESGNN--TCSGKPMEQ---NHRCTRMCYGDQDLDFDDDH 231
Query: 235 RTTL-TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HS 291
R T +Y++ +I+K+++ +GP A+F +YDDF YKSGVY + NA +YL H+
Sbjct: 232 RHTRDSYYLTIG--SIQKDVMTYGPIEASFDVYDDFLSYKSGVYVRSENA---SYLGGHA 286
Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
KLIGWG E GTPYWL++N+W WGD G KI RG EC + AG P
Sbjct: 287 VKLIGWGEEYGTPYWLMMNSWNADWGDEGLFKIRRGTNECGVDNSTTAGVP 337
>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
Length = 260
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 109/260 (41%), Positives = 141/260 (54%), Gaps = 8/260 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAREQW NC TI + D G+C + F AV A SDR CI + G+ N +S E +
Sbjct: 7 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 66
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
+CC I D C+ G WNF ++G V+GG Y GC P TI PC HH +
Sbjct: 67 TCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGARP 123
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P PK C+ C Y + +DKH +Y V D+E I EI +GP F
Sbjct: 124 PCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF 180
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
++ DF YKSGVYKH + + H+ +++GWG ENG PYWLV N+W WGD G K
Sbjct: 181 TVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNADWGDNGFFK 238
Query: 324 ILRGKYECAFEYLIAAGKPK 343
ILRG+ C E I AG P+
Sbjct: 239 ILRGENHCGIESEIVAGIPR 258
>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
Length = 283
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 112/298 (37%), Positives = 154/298 (51%), Gaps = 15/298 (5%)
Query: 36 WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
W+AGRNFP + S +++ + +Y+ + T+D E AT+P+ FD R++WP
Sbjct: 1 WSAGRNFPTHTSFAHIKILREHERRYY------MEVAYVTHDVELIATLPEIFDPRDKWP 54
Query: 96 NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
C T+ + D G+C + F AV A +DR CI S ++ S E + SCC IC
Sbjct: 55 ECLTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPIC----GL 110
Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
C+ G W + G V+GG+Y GC+P I PC HH +P + K P K
Sbjct: 111 GCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTP--K 168
Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
C C + +Y F +DK Y V +ED IK E+ +GP A F +Y D YK+G
Sbjct: 169 CQKNCES-SYNVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNG 227
Query: 276 VYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAF 333
VYKHT L H+ K+IGWG EN YWL+ N+W WGD G KILRG+ C
Sbjct: 228 VYKHTEGNALGG--HAIKIIGWGVENNNKYWLIANSWNSDWGDNGFFKILRGEDHCGI 283
>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
Length = 254
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 109/260 (41%), Positives = 141/260 (54%), Gaps = 8/260 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAREQW NC TI + D G+C + F AV A SDR CI + G+ N +S E +
Sbjct: 1 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 60
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
+CC I D C+ G WNF ++G V+GG Y GC P TI PC HH +
Sbjct: 61 TCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGARP 117
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P PK C+ C Y + +DKH +Y V D+E I EI +GP F
Sbjct: 118 PCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF 174
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
++ DF YKSGVYKH + + H+ +++GWG ENG PYWLV N+W WGD G K
Sbjct: 175 TVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNADWGDNGFFK 232
Query: 324 ILRGKYECAFEYLIAAGKPK 343
ILRG+ C E I AG P+
Sbjct: 233 ILRGENHCGIESEIVAGIPR 252
>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 341
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 117/270 (43%), Positives = 151/270 (55%), Gaps = 15/270 (5%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
E A +PD FDAR++WP+C TIG V D GAC + F AV A SDR CI K Q N +S
Sbjct: 81 EVPAVIPDTFDARQKWPDCPTIGTVRDQGACGSCWAFGAVEAMSDRYCISFKEQVN--IS 138
Query: 139 TEYVASCCKICRYDDNKSCSHG---SVFRTW-NFLHKRGSVTGGDYGDRTGCQPSTISPC 194
E + SCC+ C C G + +R W + L G VTGG Y GCQP TI C
Sbjct: 139 AENLLSCCETC----GSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKC 194
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH P +Q P C C + +Y + + DKH +Y + + +I+ EI+
Sbjct: 195 DHHEPGPYENCSGSQSTPS--CKRSCIS-SYDKSYRSDKHYGKNSYSISSDVSSIQTEIM 251
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F++Y DF Y SGVY+HT+ + L H+ K++GWGTENG PYWLV N+W P
Sbjct: 252 TNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGG--HAIKILGWGTENGVPYWLVANSWNP 309
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WGD G KI+RGK EC E I AG P+
Sbjct: 310 SWGDSGFFKIIRGKDECGIESSIVAGMPEQ 339
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
Length = 302
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 108/259 (41%), Positives = 145/259 (55%), Gaps = 10/259 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR +W C +IG V D G C + + + A SDR CI S G LS + +
Sbjct: 53 LPKSFDARAKWYMCPSIGMVYDQGNCKSSYAISVASAVSDRICIHSNGTVKPKLSAQQIL 112
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC +C CS G F +W+F + G V+GG+YG GCQP TI PC H +A
Sbjct: 113 SCCYLC----GDGCSGGQHFESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTETA-VE 167
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
+C N+ + +C +C NP YG + +D H+ T Y V KEI +GP TA+F
Sbjct: 168 NACSNKTLFTPECKVQCYNPDYGTRYVKDNHQGT-HYRVP--AYTAMKEIYENGPITASF 224
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y DF +Y+SGVY + S + + K++GWG ENGTPYWL N++ +WGD G VK
Sbjct: 225 YMYQDFVNYQSGVYAYNSGKYVTT--QAVKILGWGEENGTPYWLAANSFNTYWGDNGFVK 282
Query: 324 ILRGKYECAFEYLIAAGKP 342
ILRG EC E + AG P
Sbjct: 283 ILRGANECYIEEFMYAGLP 301
>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
Length = 339
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 125/328 (38%), Positives = 172/328 (52%), Gaps = 21/328 (6%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
L+ SD ++ IN++ TW AG NF N+ YL++ L G +
Sbjct: 23 LHPLSDELVNFINKQNTTWQAGHNF-FNVEVSYLKKLC----------GTFLGGPKLPRR 71
Query: 78 PEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
E++ + P+ FDAREQWPNC TI + D G+C + F AV A SDR CI + G N
Sbjct: 72 VEFADDIKLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNV 131
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
+S E + +CC D WNF K+G V+GG Y GC+P +I PC
Sbjct: 132 EVSAEDMLTCCGGQCGDGCNGGYPSGA---WNFWTKKGLVSGGLYDSHVGCKPYSIPPCE 188
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HH + + P+C + +C C P Y + +DKH +Y V +E+ IK EI
Sbjct: 189 HHVNG-SRPACTGEG-DTPRCSKTC-EPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYK 245
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP F +Y DF YKSGVY+HT+ + H+ +++GWG ENG PYWLV N+W
Sbjct: 246 NGPVEGAFTVYSDFLMYKSGVYQHTTGDIMGG--HAIRILGWGEENGVPYWLVANSWNTD 303
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD+G KILRG+ C E I AG P+
Sbjct: 304 WGDKGFFKILRGQDHCGIESEIVAGIPR 331
>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
Length = 337
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 128/348 (36%), Positives = 176/348 (50%), Gaps = 28/348 (8%)
Query: 2 IHILVFLLGCTLVRGELYK--FSDAYIDQIN-REANTWTAG----RNFPANLSEEYLRQF 54
+ +LVF+ R + + FS+A+++ N R+ +W A +N P +Y++
Sbjct: 4 LLVLVFVGAAWSYRFDFHDDYFSEAFVNYHNSRDDVSWKATTENFKNVPYKGRMDYVKSL 63
Query: 55 LIADAKYFDQSDRPLPGDRK--TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
A+ P P + K + E +PD FDAR QWP+C ++ V D GAC +
Sbjct: 64 CGAN---------PAPPEMKFPVKEIEVPKDLPDTFDARTQWPDCPSLKEVRDQGACGSC 114
Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
F V A +DR CI+SKG N LS E + SCC+ C C+ G + WN+L +
Sbjct: 115 WAFGCVEAATDRLCIQSKGIVNAHLSAEDLTSCCRTC----GNGCNGGFLEGAWNYLKRD 170
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
G VTGG Y GC P I C HH P C+ P +C C + Y + +D
Sbjct: 171 GIVTGGPYNSHQGCLPYEIKACDHHVVGKLQP-CKGDG-PTPRCKKECES-GYNNTYSKD 227
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
+H + V+ E I EI+ +GP A F +Y DF YKSGVY+H S L H+
Sbjct: 228 EHHAKTVHAVEGVEQ-IMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGG--HAI 284
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
K +GWG E+G YWLV N+W P WGD G KILRG+ EC E I AG
Sbjct: 285 KTLGWGNEDGKDYWLVANSWNPDWGDNGFFKILRGRDECGIESNIVAG 332
>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
Length = 271
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 109/261 (41%), Positives = 141/261 (54%), Gaps = 8/261 (3%)
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ FDAREQW NC TI + D G+C + F AV A SDR CI + G+ N +S E +
Sbjct: 11 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 70
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
+CC I D C+ G WNF ++G V+GG Y GC P TI PC HH +
Sbjct: 71 LTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSR 127
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P PK C+ C Y + +DKH +Y V D+E I EI +GP
Sbjct: 128 PPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGA 184
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F ++ DF YKSGVYKH + + H+ +++GWG ENG PYWLV N+W WGD G
Sbjct: 185 FTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNVDWGDNGFF 242
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KILRG+ C E I AG P+
Sbjct: 243 KILRGENHCGIESEIVAGIPR 263
>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
Length = 344
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 116/319 (36%), Positives = 162/319 (50%), Gaps = 17/319 (5%)
Query: 23 DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
DA +D +N + + A PA EE + I +K+ +S +P R E
Sbjct: 42 DALVDYVNNQQQLFKAE---PAAAIEEL--RMKIMKSKFISRSKKP----RVDEIGEEGF 92
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+PD FDAR QWP+C +I ++ D C + F + A SDR CI S G + LS + +
Sbjct: 93 KIPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDI 152
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC YD C G W + + G VTGG YG + C+P I PC HH +
Sbjct: 153 LSCC----YDCGDGCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPPCGHHRNETF 208
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
+C Q C T C Y + DK +Y ++ + AI+KEI+ +GP TA
Sbjct: 209 YGNC-TQIADTPDCVTTC-QAGYPISYDDDKTFGKDSYTIESSVTAIQKEIMTYGPVTAA 266
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+DF+HY G+YKH S E H+ +++GWG E GT YWLV N+W WG+ G
Sbjct: 267 FIVYEDFFHYHRGIYKHVSGG--EEGGHAVRILGWGEEKGTAYWLVANSWNTDWGENGYF 324
Query: 323 KILRGKYECAFEYLIAAGK 341
+ILRG EC E + AG+
Sbjct: 325 RILRGSNECGIEENVVAGR 343
>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
Length = 337
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 108/260 (41%), Positives = 144/260 (55%), Gaps = 9/260 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAREQWP+C TIG + D +C + F AV A SDR CI S G + LS+ +
Sbjct: 80 IPKTFDAREQWPHCPTIGQIRDQSSCGSCWAFGAVEAMSDRLCIHSNGTFTKSLSSIDLV 139
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C + C G W+F G VTGG D GC+ CSHHGS
Sbjct: 140 SCCGYCGF----GCQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHHGSK-KY 194
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P C ++ KC +C P + DK R +TY V ++ AI KEI+ +GP A F
Sbjct: 195 PPCPHRIYDTPKCVPKCDTPNID--YETDKTRANITYNVQRSQMAIMKEIMINGPVEAAF 252
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+DF+ YK GVY H++ + H+ +++GWG ENGTPYWL+ N+W WG+ G K
Sbjct: 253 EVYEDFFGYKQGVYFHSTGEFIGG--HAIRILGWGEENGTPYWLIANSWNEGWGEDGYFK 310
Query: 324 ILRGKYECAFEYLIAAGKPK 343
+LRGK EC E + AG P+
Sbjct: 311 MLRGKNECGIEDEVTAGLPE 330
>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
Length = 343
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 114/323 (35%), Positives = 164/323 (50%), Gaps = 15/323 (4%)
Query: 22 SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG-DRKTYDPEY 80
++ I+Q+N WTAG ++ ++ ++ KY +++ P + +
Sbjct: 28 TELLINQVNSAQQLWTAGH-------QDAPKERIL---KYLMKAEHVKPHREEDVVQVDV 77
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ +PD +D R+ + C ++ ++ D C + AA A SDR CI S G N LS E
Sbjct: 78 ADVIPDHYDVRDDFSQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGVVNTLLSAE 137
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ +CC I Y C G + W + K G VTGG Y + GC+P +I+PC +
Sbjct: 138 DILTCC-IGEYYCGDGCEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNG 196
Query: 201 PTLPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
T P C N KC CT N +Y + +DKH Y V D I+ EIL +GP
Sbjct: 197 VTWPKCPNSDADTPKCVDHCTSNSSYPIPYEKDKHYGATAYAVSRKVDQIQSEILKNGPV 256
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
F +Y DFY YKSGVY H + +L H+ KL+GWG +NGTPYWL N+W +WG+
Sbjct: 257 EVGFTVYADFYQYKSGVYVHVAGPELGG--HAVKLLGWGVDNGTPYWLAANSWNTNWGEN 314
Query: 320 GTVKILRGKYECAFEYLIAAGKP 342
G +ILRG EC E + AG P
Sbjct: 315 GYFRILRGVNECGIESQVVAGMP 337
>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 126/349 (36%), Positives = 174/349 (49%), Gaps = 24/349 (6%)
Query: 2 IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
+ +L + V + Y +ID IN +A TW AG NF + +E+ + L +K
Sbjct: 5 LMLLSVIFVSVYVTEQTYFLQKDFIDNINNQATTWKAGVNFDPDTPKEHFLKML--GSKG 62
Query: 62 FDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
++ KT+D Y +P FDAR +W C TIG V D G C + A
Sbjct: 63 VQIPNKHNIHMYKTHDAAYDKLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATS 122
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
AF+DR C+ + N LS E + CC C + C+ G + W KRG VTGG
Sbjct: 123 SAFADRLCVATNADFNELLSAEEITFCCHSCGF----GCNGGYPIKAWERFKKRGLVTGG 178
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHR 235
DY GC+P + PC + A +C + P+ H RCT YG F +D
Sbjct: 179 DYQSGEGCEPYRVPPCPY--DAEGHNTCAGK--PRESNH-RCTRMCYGNQDLDFDEDHRY 233
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGK 293
T +Y++ +I+K+++ +GP A+F +YDDF YKSGVY + NA YL H+ K
Sbjct: 234 TRDSYYL--TYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENA---TYLGGHAVK 288
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
LIGWG E G PYWL++N+W WGD G KI RG EC + AG P
Sbjct: 289 LIGWGEEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGIDNSTTAGVP 337
>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
Length = 351
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 109/328 (33%), Positives = 163/328 (49%), Gaps = 28/328 (8%)
Query: 26 IDQINREANTWTAG-----RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD--- 77
+D +N++ ++ A ++P + ++ + +I +P + + ++
Sbjct: 41 VDYVNKQQTSFKAKLGSYFSSYPDTIKKQLMGAKMIE-----------IPDEYRVFEMTH 89
Query: 78 PE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
PE A +PD FD+R QWPNC +I + D +C + +A SDR CI S G+
Sbjct: 90 PEVLDAAIPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQLS 149
Query: 137 LSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
+S + + +CC +C C+ G W K+G VTGG Y ++TGC+P PC
Sbjct: 150 ISADDINACCGMVC----GNGCNGGYPIEAWRHYVKKGYVTGGSYQEKTGCKPYPYPPCE 205
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HH + C + P KC C Y + QD H Y V I+KEI+
Sbjct: 206 HHVNGTHYKPCPSNMYPTDKCERSC-QAGYALTYTQDLHFGQSAYAVSKKVTEIQKEIMT 264
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
HGP F++Y+DF HY GVY HT+ A L H+ K++GWG +NGTPYWL N+W
Sbjct: 265 HGPVEVAFSVYEDFEHYSGGVYVHTAGASLGG--HAVKMLGWGVDNGTPYWLCANSWNED 322
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WG+ G +I+RG EC E + G PK
Sbjct: 323 WGENGYFRIIRGVNECGIESGVVGGIPK 350
>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
Length = 339
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 116/323 (35%), Positives = 163/323 (50%), Gaps = 15/323 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
F+D++++Q+ A TWT F + +F Y D LP R
Sbjct: 31 FNDSFLEQVLARAKTWTPDTAFRGGIR---FGEFRSIKGIYESPLDFTLPSKRLHASSLD 87
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+PDRFDARE+WP C +I V + G C + A V SDR CI S G+ N L+TE
Sbjct: 88 EVVIPDRFDAREKWPFCQSIHSVRNQGTCGSCWAVATVSVMSDRLCIHSDGEVNLELATE 147
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ CCK C N G+ F+ W G V+G Y GC+P PCS+
Sbjct: 148 DLMGCCKDCGNGCNGGFLDGTAFQYWV---DAGLVSGAPYNSSEGCKPYPFEPCSY---- 200
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P + +K P KC C N Y R + +DK Y + ++ I+ EI+ +GP
Sbjct: 201 PFVGCHHEKKNP--KCLHHCIN-GYDRKYRKDKFFGATAYKIPNDARMIQLEIMTNGPVA 257
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +++DFY Y SGVYKH K+ +H+ +++GWGTENGTPYWL+ N++G WGD+G
Sbjct: 258 TGFEVFEDFYFYHSGVYKHVVGKKVG--MHAIRIVGWGTENGTPYWLIANSYGDTWGDKG 315
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
K+LRG E + AG P+
Sbjct: 316 FFKMLRGSNHLGIESTVIAGLPQ 338
>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
Length = 347
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 111/263 (42%), Positives = 142/263 (53%), Gaps = 9/263 (3%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDAR +WP+C +I + D +C + F AV A SDR CIKSKG+ LS E
Sbjct: 91 SDELPKSFDARVEWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIKSKGKHKPFLSAE 150
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC C C+ G W + +G VTG Y GCQP PC HH
Sbjct: 151 NLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHVIG 206
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P LPSC+ V C T C P Y + +DK Y + N +AI E++ +GP
Sbjct: 207 P-LPSCDGD-VETPSCKTNC-QPGYNIPYEKDKWYGEKVYRIHSNPEAIMLELMRNGPVE 263
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y DF +YKSGVY+H S A L H+ +L+GWG EN PYWL+ N+W WGD+G
Sbjct: 264 VDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLIANSWNSDWGDKG 321
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
KI+RGK EC E + AG PK
Sbjct: 322 YFKIVRGKNECGIESDVNAGIPK 344
>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
Length = 351
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 109/320 (34%), Positives = 158/320 (49%), Gaps = 12/320 (3%)
Query: 26 IDQINREANTWTAGR-NFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATV 84
+D +N++ T+TA ++ ++ + +Q + A + R T+ V
Sbjct: 41 VDYVNKQQTTFTAKLGSYFSSYPDTIKKQLMGAKMVEIPEEYRVF---EMTHPEVLDTAV 97
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
PD FD+R QWPNC +I + D +C + +A SDR CI S G+ +S + + +
Sbjct: 98 PDSFDSRTQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQISISADDINA 157
Query: 145 CC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
CC +C C+ G W K+G VTGG Y +++GC+P PC HH +
Sbjct: 158 CCGMVC----GNGCNGGYPIEAWRHYVKKGYVTGGSYQEKSGCKPYPYPPCEHHVNGTHY 213
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C + P KC C Y + QD H Y V I+KEI+ HGP F
Sbjct: 214 KPCPSNMYPTDKCEHSC-QAGYPLTYTQDLHFGQSAYAVSKKPAEIQKEIMTHGPVEVAF 272
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+DF HY GVY HT+ A L H+ K++GWG +NGTPYWL N+W WG+ G +
Sbjct: 273 TVYEDFEHYSGGVYVHTAGASLGG--HAVKMLGWGVDNGTPYWLCANSWNEDWGENGYFR 330
Query: 324 ILRGKYECAFEYLIAAGKPK 343
I+RG EC E + G PK
Sbjct: 331 IIRGVNECGIESGVVGGTPK 350
>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
Length = 339
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 128/344 (37%), Positives = 170/344 (49%), Gaps = 25/344 (7%)
Query: 5 LVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADA 59
L LL T R Y SD ++ IN++ TW AG NF N Y+R+ +
Sbjct: 8 LCCLLALTSARNRPYFHPLSDDLVNYINKQNTTWQAGHNF-RNADMSYVRKLCGTFLGGP 66
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
K LP K + +P+ FDAREQW +C TI + D G+C + F AV
Sbjct: 67 K--------LPHRIKFAE---DMNLPESFDAREQWSSCPTIKEIRDQGSCGSCWAFGAVE 115
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
+ SDR CI + G N +S E + +CC + + WNF K+G V+GG
Sbjct: 116 SISDRICIHTNGHVNVEVSAEDMLTCCGGQCGEGCNGGYPSAA---WNFWTKKGLVSGGL 172
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y GC+P +I PC HH + P PK C C P Y + +DKH +
Sbjct: 173 YDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKSC-EPGYSSSYKEDKHYGYSS 229
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWGT
Sbjct: 230 YSVPGIEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGT 287
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
ENGTPYWLV N+W WGD G KILRG+ C E I AG P+
Sbjct: 288 ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPR 331
>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
Length = 339
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 131/350 (37%), Positives = 168/350 (48%), Gaps = 26/350 (7%)
Query: 4 ILVFLLGCTL-VRGE---------LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQ 53
+ FLLG VR E L SD +D IN TW AG N E R+
Sbjct: 5 VAFFLLGVLASVRAEEGRLMVPTYLAPLSDKMVDYINFINTTWKAGHNEGHRDLETVRRK 64
Query: 54 FLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
++ D LP + +P +FD+R+QW +C TI + D GAC +
Sbjct: 65 LGVSR----DNHKYRLP---ELVHDTLEMDIPAQFDSRQQWQDCPTIREIRDQGACGSCW 117
Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
F AV + SDR CI S + L+ + V SCC C C+ G W++ ++G
Sbjct: 118 AFGAVESMSDRHCIHSGAKNIVHLAADDVLSCCWGC----GSGCNGGFPGAAWSYWVEKG 173
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
VTGG+Y GC P + C HH + TL C Q P KC R Y F DK
Sbjct: 174 IVTGGNYDTDEGCMPYPVPSCDHHVNG-TLGPC-GQDPPTPKC-VRLCRKGYNIDFKDDK 230
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
H +Y V NE I+ EI+ +GP F +Y DF YKSGVYK S L H+ +
Sbjct: 231 HYGKSSYSVSSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGG--HAIR 288
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
++GWG ENG P+WLV N+W WGD+G KILRG EC E I AG PK
Sbjct: 289 ILGWGVENGVPFWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAGIPK 338
>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
Length = 351
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 109/327 (33%), Positives = 161/327 (49%), Gaps = 26/327 (7%)
Query: 26 IDQINREANTWTAG-----RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
+D IN++ T+TA ++P + ++ + ++ +P + + ++ E+
Sbjct: 41 VDYINKKQTTFTAKLGAYFSDYPDTIKKQLMGAKMVE-----------IPEEYRVFEMEH 89
Query: 81 ----SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
A +PD FD+R QWPNC +I + D +C + +A SDR CI SKGQ
Sbjct: 90 PEVLDAAIPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASKGQTQVS 149
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
+S + + +CC + C+ G W K G VTGG Y ++TGC+P PC H
Sbjct: 150 ISADDINACCGMAC---GNGCNGGYPIEAWRHYVKNGYVTGGSYQEKTGCKPYPYPPCEH 206
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
H + C + P KC C Y + QD H Y V I+KEI+ +
Sbjct: 207 HVNGTHYKPCPSDMYPTDKCERSC-QAGYSLTYKQDLHFGQSAYAVSKKATEIQKEIMTN 265
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
GP F +Y DF Y GVY HT+ A L H+ K++GWG +NGTPYWL N+W W
Sbjct: 266 GPVEVAFTVYADFEVYSGGVYVHTAGASLGG--HAVKMLGWGVDNGTPYWLCANSWNEDW 323
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPK 343
G+ G +I+RG EC E+ + G PK
Sbjct: 324 GENGYFRIIRGVNECGIEHGVVGGIPK 350
>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
Length = 387
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 112/323 (34%), Positives = 159/323 (49%), Gaps = 10/323 (3%)
Query: 23 DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
D I+ +N + W A R F + E ++ + + S + KT D +
Sbjct: 44 DELINYVNNNQDLWRAKKQRRFTSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDM 103
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FD+RE WP C +I ++ D +C + F AV A SDR CI S G+ LS +
Sbjct: 104 D--IPENFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSAD 161
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC+ C + C+ G W + K G VTG +Y +GC+P PC HH
Sbjct: 162 DLLSCCRSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKK 217
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
C + P KC +C + + +DK Y V D+ +AI+KE++ HGP
Sbjct: 218 THFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHGPLE 277
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y+DF +Y GVY HT KL H+ KL+GWG ENG PYW N+W WG+ G
Sbjct: 278 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLVGWGIENGIPYWTCANSWNTDWGEDG 335
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
+ILRG EC E + G PK
Sbjct: 336 FFRILRGVDECGIESGVVGGVPK 358
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
Length = 347
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 117/327 (35%), Positives = 165/327 (50%), Gaps = 12/327 (3%)
Query: 20 KFSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
+ ++ +ID IN +TW AG NF + YL+ L + +D + + +
Sbjct: 27 EIANKWIDAINNNPKSTWKAGHNFHPDTPMSYLQGLLGVSELESNLADLDKYEEMEENEE 86
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
VP FDAR++W C ++ + D G C + + AF+DR CI S + N +S
Sbjct: 87 NKKIKVPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHIS 146
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH- 197
+ + SCC C + C G W F+ + G VTGGDY GCQP I+PC HH
Sbjct: 147 SRELMSCCSYCGF----GCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEHHM 202
Query: 198 -GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
GS P + + P C T CT+ + + +D+ + Y V E + EI +
Sbjct: 203 EGSKPNCSASPTEPTPA--CETTCTHGS-SLAYQKDRQKGKSAYLVPVGEKQTQLEIFKN 259
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
GP A F +Y+DF+ YKSGVYK + H+ K+IGWG +NG PYWLV N+W W
Sbjct: 260 GPIVAAFKVYEDFFMYKSGVYKRHPESPFRGR-HAVKVIGWGEQNGLPYWLVQNSWDYDW 318
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPK 343
GD+G KI RG EC FE + AG PK
Sbjct: 319 GDKGLFKIARGN-ECDFEKSMTAGLPK 344
>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
Length = 341
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 126/353 (35%), Positives = 168/353 (47%), Gaps = 33/353 (9%)
Query: 4 ILVFLLGC--------TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL 55
+L F++G LV ++ F D I+ IN TW AGRN Y+R L
Sbjct: 8 LLAFVIGVWGDVLEDRYLVPVDMDNFPDKMIEYINYLNTTWQAGRNLGYE-DPRYVRTLL 66
Query: 56 IADAKYFDQSDRPLPGDRKTYDPEY-----SATVPDRFDAREQWPNCGTIGHVPDTGACA 110
P + K PE + +PD FD+R +W +C TI + D G+C
Sbjct: 67 GVH-----------PNNHKYRLPEIEIDTSNVQIPDHFDSRHRWHDCPTIREIRDQGSCG 115
Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
+ F AV A SDR CI S + L+ + V SCC C C+ G W++
Sbjct: 116 SCWAFGAVEAMSDRHCIHSGAKNIVHLAADDVLSCCMSC----GSGCNGGFPGAAWSYWV 171
Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
+G VTGG+Y GC P I C HH + TL C+ P +C R Y F
Sbjct: 172 HKGIVTGGNYDSDEGCMPYPIKACDHHVNG-TLGPCDKSIPPTPRC-VRMCRKGYNVDFA 229
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
DKH +Y V N I+ EI+ +GP A F +Y DF YKSGVY+ ++ L H
Sbjct: 230 DDKHYGKKSYSVPSNVTQIQVEIMTNGPVEADFTVYADFPLYKSGVYQRHTDQALGG--H 287
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+ +L+GWG E G PYWL N+W WGD+G KILRG EC E + AG P+
Sbjct: 288 AIRLLGWGVEKGVPYWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVAGIPR 340
>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
Length = 398
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 113/323 (34%), Positives = 158/323 (48%), Gaps = 10/323 (3%)
Query: 23 DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
D I+ +N W A R F E ++ + + S + KT D +
Sbjct: 59 DELINYVNNNQQLWKAKKQRRFSMYKGENDKHKWGLMGVNHVRLSVKGKQHLSKTKDLDM 118
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FD+RE WP C +I + D +C + F AV A SDR CI S G+ LS +
Sbjct: 119 D--IPESFDSRENWPKCESIKAIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSAD 176
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC+ C + C+ G W + K G VTG ++ +GC+P PC HH
Sbjct: 177 DLLSCCRSCGF----GCNGGDPLAAWRYWVKDGIVTGSNFTANSGCKPYPFPPCEHHSKK 232
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
C + P KC RC + + +DK + Y V D+ +AI+KE++ HGP
Sbjct: 233 THFDPCPHDLYPTPKCEKRCNAEYTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHGPLE 292
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y+DF +Y GVY HT KL H+ KLIGWG E+G PYW V N+W WG+ G
Sbjct: 293 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIEDGIPYWTVANSWNTDWGEDG 350
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
+ILRG EC E + G PK
Sbjct: 351 FFRILRGVDECGIESGVVGGIPK 373
>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 116/322 (36%), Positives = 165/322 (51%), Gaps = 13/322 (4%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS--DRPLPGDRKTYDPEYS 81
+ +D+IN + N WTA + +E + DAK + + +++ Y P
Sbjct: 3 SLVDEINSKQNLWTASTD------QERFYGRSLGDAKKLCGTLLEETEGLEKRVYPPGEL 56
Query: 82 ATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
A +P+ FDAR+ + C IGHV D ACA+ A V AF+ R CIKS G+ N+ LS
Sbjct: 57 ADIPNSFDARDAFKECKDVIGHVWDQSACASCWAIAPVEAFNARLCIKSGGKFNQLLSAG 116
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ +CC + C G + W+FL G T G GC P C+HH
Sbjct: 117 EMIACCNSTHSWQPRGCKGGMILNAWSFLKTHGIATEGSMSAADGCWPYNFPKCAHHQKK 176
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
C + C RC N YG +D+H T + + + D IKKEI+ +GPT+
Sbjct: 177 SKYEPCSKKLYDTPSCLDRCPNEKYGIPLDKDRHFTAHSPDLFEGTDNIKKEIMTNGPTS 236
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
ATF++Y+DF YKSGVYKHT+ + +HS ++IGWGTE G YWLV+N+W WGD G
Sbjct: 237 ATFSVYEDFVSYKSGVYKHTNGTLMG--IHSVEIIGWGTEKGVDYWLVMNSWNEGWGDHG 294
Query: 321 TVKILRGKYECAFEYLIAAGKP 342
T KI +G +C + + P
Sbjct: 295 TFKIAQG--DCGIDDAVLGSPP 314
>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
Length = 342
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 118/328 (35%), Positives = 169/328 (51%), Gaps = 23/328 (7%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN+ + WTA R+ R + DA+ + R RK P
Sbjct: 30 LSDEMIAYINQHPDAGWTASRSD---------RFKSLEDARILLGAMREDEELRKKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
S +P FD+R++W C +I ++ D C + F AV A SDR CI+SKG+++
Sbjct: 81 VDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFTAVEAMSDRICIESKGKKS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCC C C G W++ + G VTG + TGCQP C
Sbjct: 141 VELSAVDLLSCCTEC----GLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + P C + KCH +C Y + +DK+ ++Y V +NE+AIKKEI+
Sbjct: 197 EHHTTG-KYPECGEKIYKTPKCHQKCQK-GYKTPYKKDKYYGRMSYNVLNNENAIKKEIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A F ++ DF +YKSG+YK+ + A++ H+ ++IGWG E TPYWL+ N+W
Sbjct: 255 MHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGG--HAVRIIGWGVEKKTPYWLIANSWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG++G +ILRGK EC E + G P
Sbjct: 313 DWGEKGYFRILRGKDECGIESEVTGGLP 340
>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 319
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 103/263 (39%), Positives = 149/263 (56%), Gaps = 8/263 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+++ +P FD+R++WP C +I + D C + F AV A SDR CI+S G+QN LS
Sbjct: 62 DWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSSWAFGAVEAMSDRSCIQSGGKQNVELS 121
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCC+ C G W++ K G VTG + T CQP C HH
Sbjct: 122 AVDLLSCCEHC----GDGFEGGFPALAWDYWVKEGIVTGSSKENHTSCQPYPFPKCEHH- 176
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + C C +Y + QDKHR Y V ++E AI+KEI+ +GP
Sbjct: 177 TKGKYPACFEEIYKTPNCENTCQK-SYKTPYAQDKHRGKSRYNVKNDEKAIQKEIMKYGP 235
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y+DF +YKSG+YKH + KL ++ H+ ++IGWG EN TPYWL+ N+W WG+
Sbjct: 236 VEANFIVYEDFLNYKSGIYKHIT-GKLVSW-HAIRIIGWGVENNTPYWLIPNSWNEDWGE 293
Query: 319 RGTVKILRGKYECAFEYLIAAGK 341
G +ILRG++EC+ E + AG+
Sbjct: 294 NGNFRILRGRHECSIESEVTAGR 316
>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 123/325 (37%), Positives = 160/325 (49%), Gaps = 17/325 (5%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S I IN EANT W AG + R +Q + G T +
Sbjct: 36 LSKELIHFINYEANTTWKAGPTRRFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLN-- 93
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P FDAR++W +C +I + D +C + F AV A SDR CI+SKG+ LS
Sbjct: 94 ---ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSA 150
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
E + SCC C C+ G W + +G VTG Y GCQP PC HH
Sbjct: 151 ENLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTL 206
Query: 200 APTLPSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P LP C+ + + P K R Y + DK + Y V N++AI KE++ HGP
Sbjct: 207 GP-LPVCDGDVETPPCK---RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGP 262
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +Y DF +YKSGVY+H S A L H+ +L+GWG EN PYWL+ N+W WGD
Sbjct: 263 VEVDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLIANSWNTDWGD 320
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G KI+RGK EC E + AG PK
Sbjct: 321 NGYFKIIRGKNECGIESDVNAGIPK 345
>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
(Schistosoma japonicum)
Length = 316
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 117/331 (35%), Positives = 174/331 (52%), Gaps = 25/331 (7%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN+ N W A ++ R + DA+ R P R+ P
Sbjct: 4 LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLRQKRRPT 54
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ +P FD+R++WP C +I + D CA+ +AVGA SDR CI+S G+Q+
Sbjct: 55 VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQS 114
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCC+ C C G W++ G VTGG + TGCQP C
Sbjct: 115 VELSAIDLISCCENC----GSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKC 170
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH S PSC ++ +C +C Y + DKH ++ V NE AI+KEI+
Sbjct: 171 EHH-SKGKYPSCGDKMYKTPQCKRKCQK-GYKTPYEHDKHYGGISINVIKNESAIQKEIM 228
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTENGTPYWLVINTWG 313
+GP A +++DF +YKSG+Y++T+ + + E+Y+ ++IGWG ENGT YWL NTW
Sbjct: 229 MYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYV---RIIGWGIENGTAYWLAANTWN 285
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E ++ AG+ K+
Sbjct: 286 EDWGEKGYFRIVRGRNECSVESVVVAGRLKS 316
>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 123/324 (37%), Positives = 158/324 (48%), Gaps = 15/324 (4%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S I IN EANT W AG + R +Q + G T +
Sbjct: 36 LSKELIHFINYEANTTWKAGPTRRFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLN-- 93
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P FDAR++W +C +I + D +C + F AV A SDR CI+SKG+ LS
Sbjct: 94 ---ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSA 150
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
E + SCC C C+ G W + +G VTG Y GCQP PC HH
Sbjct: 151 ENLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTL 206
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
P LP C+ V C C Y + DK + Y V N++AI KE++ HGP
Sbjct: 207 GP-LPVCDGD-VETPPCKRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPV 263
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
F +Y DF +YKSGVY+H S A L H+ +L+GWG EN PYWL+ N+W WGD
Sbjct: 264 EVDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLIANSWNTDWGDN 321
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G KI+RGK EC E + AG PK
Sbjct: 322 GYFKIIRGKNECGIESDVNAGIPK 345
>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 123/324 (37%), Positives = 158/324 (48%), Gaps = 15/324 (4%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S I IN EANT W AG + R +Q + G T +
Sbjct: 36 LSKELIHFINYEANTTWKAGPTRRFKTVSDIRRMLGALPDPNGEQLETLCTGYELTVN-- 93
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P FDAR++W +C +I + D +C + F AV A SDR CI+SKG+ LS
Sbjct: 94 ---ELPKSFDARKEWTHCPSISEIRDQSSCGSYWAFGAVEAMSDRICIESKGKYKPFLSA 150
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
E + SCC C C+ G W + +G VTG Y GCQP PC HH
Sbjct: 151 ENLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTL 206
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
P LP C+ V C C Y + DK + Y V N++AI KE++ HGP
Sbjct: 207 GP-LPVCDGD-VETPPCKRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPV 263
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
F +Y DF +YKSGVY+H S A L H+ +L+GWG EN PYWL+ N+W WGD
Sbjct: 264 EVDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLIANSWNTDWGDN 321
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G KI+RGK EC E + AG PK
Sbjct: 322 GYFKIIRGKNECGIESDVNAGIPK 345
>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 398
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 158/322 (49%), Gaps = 13/322 (4%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE--YS 81
A + +NR+ N W A N + ++ L+ + R +K P Y
Sbjct: 64 ALANYVNRKQNLWKAKFNNKFRNYSDRVKYGLMGV-----NNVRLSVKAKKNLSPTRFYD 118
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P+ FDARE+W C ++ ++ D +C + F AV A SDR CI S G+ LS +
Sbjct: 119 IYIPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADD 178
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SCCK C + C G W + K G VTG ++ + GC+P PC HH +
Sbjct: 179 LLSCCKSCGF----GCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKT 234
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
C++ P KC +C + + + +DK Y V+D+ +I+KEIL HGP
Sbjct: 235 HYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEV 294
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
F +Y+DF Y G+Y HT H+ K++GWG E G PYWLV N+W WG+ G
Sbjct: 295 AFEVYEDFLMYDGGIYVHTGGKIGGG--HAVKMLGWGVEQGVPYWLVANSWNTDWGEDGF 352
Query: 322 VKILRGKYECAFEYLIAAGKPK 343
+I+RG EC E + G PK
Sbjct: 353 FRIIRGIDECGIESSVVGGLPK 374
>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
Length = 340
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 125/350 (35%), Positives = 174/350 (49%), Gaps = 24/350 (6%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ +L + + + Y +ID IN A TW AG NF + +E+ + L +K
Sbjct: 4 VLMLLSVIFVSFYLTEQAYFLQKDFIDNINERATTWKAGVNFDPDTPKEHFLKML--GSK 61
Query: 61 YFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
++ KT+D Y +P FDAR +W C TIG V D G C + A
Sbjct: 62 GVQIPNKHNIHMYKTHDAAYDNLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMAT 121
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
AF+DR C+ + N LS E + CC C + C+ G + W KRG VTG
Sbjct: 122 SSAFADRLCVATNADFNELLSAEEITFCCHSCGF----GCNGGYPIKAWERFKKRGLVTG 177
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKH 234
GDY GC+P + PC + A +C + P+ H RCT YG F +D
Sbjct: 178 GDYQSGEGCEPYRVPPCPY--DAEGHNTCAGK--PRESNH-RCTRMCYGNQDLDFDEDHR 232
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSG 292
T +Y++ +I+K+++ +GP A+F +YDDF YKSGVY + NA YL H+
Sbjct: 233 YTRDSYYL--TYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENA---TYLGGHAV 287
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
KLIGWG E G PYWL++N+W WGD G KI RG EC + AG P
Sbjct: 288 KLIGWGEEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGIDNSTTAGVP 337
>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 119/324 (36%), Positives = 164/324 (50%), Gaps = 17/324 (5%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
SD I+ IN+ TW AG NF +S Y+R L K + + E
Sbjct: 28 LSDQMINYINKINTTWKAGSNFDKCISMSYIRGLLGVHPKSEEYRLAEFVHE------EI 81
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FDAR +W +C +I + D C + F A A SDR CI SKG+ +S E
Sbjct: 82 PDDLPESFDARAKWSHCDSIHLIRDQSTCGSCWAFGATEAMSDRICIHSKGKMQVNISAE 141
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ CC C + C G W +RG V+GG YG GC+P +++PC +H +
Sbjct: 142 DLLDCCDTCGH----GCKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEYH-TK 196
Query: 201 PTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
+P+C P+ H R Y + + +DKH Y + +E I+ EI +GP
Sbjct: 197 CRIPNCIPIVHTPECVHHCR---KGYDKDYQEDKHFGQKVYSISRDEKQIQTEIFTNGPV 253
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
A F +Y DF YKSGVY+ SN +H+ +++GWGTENGTPYWL N+W +WGD+
Sbjct: 254 EADFHVYGDFLCYKSGVYQRHSNDG--RGMHAIRILGWGTENGTPYWLAANSWNENWGDK 311
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G KILR EC E I AG PK
Sbjct: 312 GYFKILRRTNECGIEEHIYAGIPK 335
>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 352
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 158/322 (49%), Gaps = 13/322 (4%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE--YS 81
A + +NR+ N W A N + ++ L+ + R +K P Y
Sbjct: 23 ALANYVNRKQNLWKAKFNNKFRNYSDRVKYGLMGV-----NNVRLSVKAKKNLSPTRFYD 77
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P+ FDARE+W C ++ ++ D +C + F AV A SDR CI S G+ LS +
Sbjct: 78 IYIPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADD 137
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SCCK C + C G W + K G VTG ++ + GC+P PC HH +
Sbjct: 138 LLSCCKSCGF----GCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKT 193
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
C++ P KC +C + + + +DK Y V+D+ +I+KEIL HGP
Sbjct: 194 HYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEV 253
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
F +Y+DF Y G+Y HT H+ K++GWG E G PYWLV N+W WG+ G
Sbjct: 254 AFEVYEDFLMYDGGIYVHTGGKIGGG--HAVKMLGWGVEQGVPYWLVANSWNTDWGEDGF 311
Query: 322 VKILRGKYECAFEYLIAAGKPK 343
+I+RG EC E + G PK
Sbjct: 312 FRIIRGIDECGIESSVVGGLPK 333
>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 169/330 (51%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C+ C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E IAAG+ K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGRIKS 342
>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 181/350 (51%), Gaps = 14/350 (4%)
Query: 1 MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
M+ I V+++ TL+ + ++ I+ ++ E ++ +++ R + DA
Sbjct: 1 MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60
Query: 60 KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
++ + P R+ P + + +P FD+R++WP C +I + D C +
Sbjct: 61 RFLLGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+AVGA SDR CI+S G+Q+ LS + SCCK C C G + +W++ RG
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG + TGC+P C H +C ++ +C+ C Y + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V E I+K+I+ HGP A +Y+DF +YKSG+Y++T+ + H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRL 292
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
IGWG ENGT YWL NTW WG++G +I+RG+ EC E IAAG K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
Length = 311
Score = 200 bits (509), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 116/318 (36%), Positives = 161/318 (50%), Gaps = 17/318 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ LG R + SD ++ +N++ TW AG NF N+ YL++ +
Sbjct: 11 LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P+ FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
CI + + +S E + +CC I D C+ G WNF ++G V+GG Y G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
C+P +I PC HH + P P KC C P Y + QDKH +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTP--KCSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293
Query: 306 WLVINTWGPHWGDRGTVK 323
WLV N+W WGD G K
Sbjct: 294 WLVANSWNTDWGDNGFFK 311
>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 347
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 111/324 (34%), Positives = 161/324 (49%), Gaps = 21/324 (6%)
Query: 26 IDQINREANTWTAG-----RNFPANLSEEYL-RQFLIADAKYFDQSDRPLPGDRKTYDPE 79
+D IN+ +TA NFP + + +++ AKY + KT+
Sbjct: 39 VDYINKAQKLFTAKLSPRFANFPNEIKRRLMGSKYVALPAKY--------RVNEKTHSDI 90
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
T+P FD+R WP C ++ + D +C + AV A +DR CI SKG Q +S
Sbjct: 91 DDTTIPKSFDSRTNWPECPSLYSIRDQSSCGSCWAVGAVEAMTDRICIASKGNQKVTISA 150
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SCC C + C G + W++ G VTG +Y ++GC+P PC HH
Sbjct: 151 DDLLSCCDECGF----GCDGGDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIP 206
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
C P C +C + Y + DKH Y V + +I+KEI+ +GP
Sbjct: 207 EHHYKKCPKDIYPTNTCEYKCQD-GYSISYNSDKHYGASVYAVAQDVASIQKEIMTNGPV 265
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
F +Y+DF HY SG+YKHT+ L H+ K++GWGTENGT YW+ N+W WG+
Sbjct: 266 EVAFDVYEDFEHYSSGIYKHTTGDYLGG--HAVKMLGWGTENGTDYWICANSWNSDWGEN 323
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G +ILRG EC E + AG+PK
Sbjct: 324 GFFRILRGVDECQIESSVVAGEPK 347
>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
Length = 340
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 114/323 (35%), Positives = 161/323 (49%), Gaps = 16/323 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
S ++ IN+ T AG NF N Y+++ + P + D
Sbjct: 26 LSSDLVNHINKLNTTGRAGHNF-HNTDMSYVKKLC---GTFLGGPKAP-----ERVDFAE 76
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+PD FD R+QWPNC TI + D G+C + F AV A SDR C+ + + + +S E
Sbjct: 77 DMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAE 136
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC ++ C+ G W + +RG V+GG Y GC+ TI PC HH +
Sbjct: 137 DLLSCCG---FECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHVNG 193
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
+ P C + +C C P Y + +DKH +Y V +E I EI +GP
Sbjct: 194 -SRPPCTGEGGETPRCSRHC-EPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVE 251
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y+DF YKSGVY+H S ++ H+ +++GWG ENGTPYWL N+W WG G
Sbjct: 252 GAFIVYEDFLMYKSGVYQHVSGEQVGG--HAIRILGWGVENGTPYWLAANSWNTDWGITG 309
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
KILRG+ C E I AG P+
Sbjct: 310 FFKILRGEDHCGIESEIVAGVPR 332
>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 117/331 (35%), Positives = 172/331 (51%), Gaps = 25/331 (7%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN+ N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCGSSWAVSAVGAISDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCC+ C C G W++ G VTGG + TGCQP C
Sbjct: 141 VELSAIDLISCCENC----GSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH S PSC ++ +C +C Y + DKH + V NE AI+KEI+
Sbjct: 197 EHH-SIGKYPSCGDKMYKTPQCKRKCQK-GYTTPYEHDKHYGGIAINVIKNELAIQKEIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTENGTPYWLVINTWG 313
+GP A +++DF +YKSG+YK+T+ + + E+Y+ ++IGWG ENGT YWL NTW
Sbjct: 255 MYGPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHYV---RIIGWGIENGTAYWLAANTWN 311
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E ++ AG+ K+
Sbjct: 312 EDWGEKGYFRIVRGRNECSIESVVVAGRLKS 342
>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 182/350 (52%), Gaps = 14/350 (4%)
Query: 1 MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
M+ I V+++ TL+ + ++ I+ ++ E ++ +++ R + DA
Sbjct: 1 MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60
Query: 60 KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
++ + P R+ P + + +P FD+R++WP C +I + D C +
Sbjct: 61 RFLLGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+AVGA SDR CI+S G+Q+ LS + SCCK C C G + +W++ RG
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG + TGC+P C H +C ++ +C+ C Y + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V E I+K+I+ HGP A +Y+DF +YKSG+Y++T+ + H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRL 292
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
IGWG ENGT YWL NTW WG++G +I+RG+ EC+ + IAAG K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIDSEIAAGLIKS 342
>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 181/350 (51%), Gaps = 14/350 (4%)
Query: 1 MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
M+ I V+++ TL+ + ++ I+ ++ E ++ +++ R + DA
Sbjct: 1 MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60
Query: 60 KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
++ + P R+ P + + +P FD+R++WP C +I + D C +
Sbjct: 61 RFLLGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+AVGA SDR CI+S G+Q+ LS + SCCK C C G + +W++ RG
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG + TGC+P C H +C ++ +C+ C Y + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V E I+K+I+ HGP A +Y+DF +YKSG+Y++T+ + H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRL 292
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
IGWG ENGT YWL NTW WG++G +I+RG+ EC E IAAG K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
Length = 330
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 111/306 (36%), Positives = 161/306 (52%), Gaps = 16/306 (5%)
Query: 38 AGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNC 97
AG NF N+ YL++ Y P +R + + +PD FD+R+QWP+C
Sbjct: 33 AGHNF-HNVDMSYLKKLC---GTYLHGPKLP---ERFAFADD--VELPDSFDSRKQWPSC 83
Query: 98 GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSC 157
TI + D G+C + F AV A SDR C+ + G+ N +S E + SCC ++ C
Sbjct: 84 PTINEIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEISAEDLLSCCG---FECGMGC 140
Query: 158 SHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCH 217
+ G W + ++G V+GG Y GC+P +I PC HH + T P C + +C
Sbjct: 141 NGGYPSGAWKYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNG-TRPPCSGEGGETPECV 199
Query: 218 TRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVY 277
+C + Y + QDKH +Y + +E I EI +GP F +Y DF YKSGVY
Sbjct: 200 KKCED-GYTPAYKQDKHYGVTSYGIPRSEKEIMAEIYKNGPVEGAFVVYSDFLMYKSGVY 258
Query: 278 KHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
+H S ++ H+ +++GWG +NGTPYWL N+W WG+ G +ILRG+ C E I
Sbjct: 259 QHVSGEEVGG--HAIRILGWGVDNGTPYWLAANSWNTDWGEDGFFRILRGQDHCGIESEI 316
Query: 338 AAGKPK 343
AG PK
Sbjct: 317 VAGIPK 322
>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
Length = 376
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 161/324 (49%), Gaps = 11/324 (3%)
Query: 23 DAYIDQINREANTWTAGRN--FPANLSE-EYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
D ID IN N WTA + F + E + ++ + + S + KT D +
Sbjct: 44 DELIDYINDNQNLWTAKKQKRFTSVYGETDDKAKWGLMGVNHVRLSVKGKQHLSKTKDLD 103
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P+ FD+RE WP C +I ++ D +C + F AV A SDR CI S G+ LS
Sbjct: 104 LD--IPESFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSA 161
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SCC+ C + C+ G W + K G VTG +Y +GC+P PC HH
Sbjct: 162 DDLLSCCRSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSK 217
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
C + P KC +C + + +DK Y V D+ +AI+KE++ HGP
Sbjct: 218 KTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGHSAYGVKDDVEAIQKELMTHGPL 277
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
F +Y+DF +Y GVY HT KL H+ KLIGWG E+G PYW N+W WG+
Sbjct: 278 EIAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIEDGIPYWTCANSWNTDWGED 335
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G +ILRG EC E + G PK
Sbjct: 336 GFFRILRGVDECGIESGVVGGIPK 359
>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 181/350 (51%), Gaps = 14/350 (4%)
Query: 1 MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
M+ I V+++ TL+ + ++ I+ ++ E ++ +++ R + DA
Sbjct: 1 MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60
Query: 60 KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
++ + P R+ P + + +P FD+R++WP C +I + D C +
Sbjct: 61 RFLLGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+AVGA SDR CI+S G+Q+ LS + SCCK C C G + +W++ RG
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG + TGC+P C H +C ++ +C+ C Y + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKG-KYRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V E I+K+I+ HGP A +Y+DF +YKSG+Y++T+ + H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRL 292
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
IGWG ENGT YWL NTW WG++G +I+RG+ EC E IAAG K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 168/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN+ N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLRQKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C+ C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 181/350 (51%), Gaps = 14/350 (4%)
Query: 1 MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
M+ I V+++ TL+ + ++ I+ ++ E ++ +++ R + DA
Sbjct: 1 MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60
Query: 60 KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
++ + P R+ P + + +P FD+R++WP C +I + D C +
Sbjct: 61 RFLLGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+AVGA SDR CI+S G+Q+ LS + SCCK C C G + +W++ RG
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAIDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG + TGC+P C H +C ++ +C+ C Y + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKG-KYRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V E I+K+I+ HGP A +Y+DF +YKSG+Y++T+ + H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRL 292
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
IGWG ENGT YWL NTW WG++G +I+RG+ EC E IAAG K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 168/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN+ N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYKTPQCKQIC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
Length = 342
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 180/350 (51%), Gaps = 14/350 (4%)
Query: 1 MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
M+ I V+++ TL+ + ++ ++ ++ E ++ +++ R + DA
Sbjct: 1 MLKIAVYIVSLFTLLEAHVTTRNNERVEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60
Query: 60 KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
+ R P R+ P + + +P FD+R++WP C +I + D C +
Sbjct: 61 RILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+AVGA SDR CI+S G+Q+ LS + SCCK C C G + +W++ RG
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG + TGC+P C H +C ++ +C+ C Y + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V E I+K+I+ HGP A +Y+DF +YKSG+Y++T+ + H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRL 292
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
IGWG ENGT YWL NTW WG++G +I+RG+ EC E IAAG K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
Length = 342
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 168/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN+ N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C+ C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 106/301 (35%), Positives = 159/301 (52%), Gaps = 15/301 (4%)
Query: 42 FPANLS-EEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI 100
F A+++ Y Q + D ++ +Q+ +P + + + +P+ FDAR WPNC +I
Sbjct: 55 FEADVTPHSYNVQHKLMDLRFVNQNRKPAVEN----EDDEGDDIPESFDARTHWPNCTSI 110
Query: 101 GHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHG 160
H+ D C + + A SDR CI+S G+ +S+ SCC+ C Y C G
Sbjct: 111 RHIRDQANCGSCWAVSTASALSDRICIESNGETQMHISSIDFVSCCESCSY----GCDGG 166
Query: 161 SVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTR 219
++F G+VTGGDYG + GC+P PC HHG+ C + K P KC R
Sbjct: 167 WPILAFDFYTYEGAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAKTP--KCRRR 224
Query: 220 CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH 279
C +Y + ++ DK Y V + AI++EI+ +GP F +Y+DF +YK G+YKH
Sbjct: 225 CQR-SYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKH 283
Query: 280 TSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
T+ H+ K+IGWG EN PYWL+ N+W WG+ G +++RG EC E + A
Sbjct: 284 TAGQARGG--HAIKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGINECGIEQEVVA 341
Query: 340 G 340
G
Sbjct: 342 G 342
>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 167/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYETPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
Length = 342
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 168/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN+ N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C+ C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLGIESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 180/350 (51%), Gaps = 14/350 (4%)
Query: 1 MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
M+ I V+++ TL+ + ++ ++ ++ E ++ +++ R + DA
Sbjct: 1 MLKIAVYIVSLFTLLEAHVTTRNNERVEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60
Query: 60 KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
+ R P R+ P + + +P FD+R++WP C +I + D C +
Sbjct: 61 RILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+AVGA SDR CI+S G+Q+ LS + SCCK C C G + +W++ RG
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG + TGC+P C H +C ++ +C+ C Y + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V E I+K+I+ HGP A +Y+DF +YKSG+Y++T+ + H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSGIYRYTTGKYISG--HAVRL 292
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
IGWG ENGT YWL NTW WG++G +I+RG+ EC E IAAG K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 167/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN+ N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYKTPQCKQIC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 158/324 (48%), Gaps = 15/324 (4%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S I IN EANT W AG + R +Q + G T +
Sbjct: 36 LSKELIHFINYEANTTWKAGPTRRFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLN-- 93
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P FDAR++W +C +I + D +C + F AV A SDR CI+SKG+ LS
Sbjct: 94 ---ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSA 150
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
E + SCC C C+ G W + +G VTG Y GCQP PC H+
Sbjct: 151 ENLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHNTL 206
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
P LP C+ V C C Y + DK + Y V N++AI KE++ HGP
Sbjct: 207 GP-LPVCDGD-VETPPCKRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPV 263
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
F +Y DF +YKSGVY+H S A L H+ +L+GWG EN PYWL+ N+W WGD
Sbjct: 264 EVDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLIANSWNTDWGDN 321
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G KI+RGK EC E + AG PK
Sbjct: 322 GYFKIIRGKNECGIESDVNAGIPK 345
>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 106/301 (35%), Positives = 159/301 (52%), Gaps = 15/301 (4%)
Query: 42 FPANLS-EEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI 100
F A+++ Y Q + D ++ +Q+ +P + + + +P+ FDAR WPNC +I
Sbjct: 55 FEADVTPHSYNVQHKLMDLRFVNQNRKPAVEN----EDDEGDDIPESFDARTHWPNCTSI 110
Query: 101 GHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHG 160
H+ D C + + A SDR CI+S G+ +S+ SCC+ C Y C G
Sbjct: 111 RHIRDQANCGSCWAVSTASALSDRICIESNGETQMHISSIDFVSCCESCGY----GCDGG 166
Query: 161 SVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTR 219
++F G+VTGGDYG + GC+P PC HHG+ C + K P KC R
Sbjct: 167 WPILAFDFYTYEGAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAKTP--KCRRR 224
Query: 220 CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH 279
C +Y + ++ DK Y V + AI++EI+ +GP F +Y+DF +YK G+YKH
Sbjct: 225 CQR-SYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKH 283
Query: 280 TSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
T+ H+ K+IGWG EN PYWL+ N+W WG+ G +++RG EC E + A
Sbjct: 284 TAGQARGG--HAIKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGINECGIEQEVVA 341
Query: 340 G 340
G
Sbjct: 342 G 342
>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
Length = 342
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 167/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN+ N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARNLLGGRREDPNLRQKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
Length = 342
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 166/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
Length = 342
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 115/330 (34%), Positives = 167/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN+ N W A ++ R + DA+ + P R+ P
Sbjct: 30 LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRKEDPNLRQKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 166/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C+ C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
Length = 342
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 115/329 (34%), Positives = 168/329 (51%), Gaps = 21/329 (6%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP-- 78
SD I IN+ N + A+ S+ + + DA+ R P R+ P
Sbjct: 30 LSDEMILFINKHPNA-----GWKADKSDRF---HSVDDARILLGGRREDPNLRQKRRPTV 81
Query: 79 ---EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
+ + +P FD+R++WP C +I + D CA+ +AV A SDR CI+S G+Q+
Sbjct: 82 DHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQSV 141
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
LS + SCCK C C G +W++ K G VTGG + TGC+P C
Sbjct: 142 ELSAIDLISCCKNC----GSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCD 197
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H +C ++ +C C Y + QDKH +Y V E AI+KEI+
Sbjct: 198 HFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYSVIGVESAIQKEIMM 255
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 256 YGPVEAYLQIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTSYWLAANTWNED 313
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E I AG+ K+
Sbjct: 314 WGEKGYFRIVRGRDECLIESFIVAGQIKS 342
>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
Length = 342
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 166/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 IDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
Length = 346
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 118/326 (36%), Positives = 167/326 (51%), Gaps = 14/326 (4%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
SD I IN++ N W A R S + + + DQ P + +
Sbjct: 32 LSDELITFINKQPNIEWKADRTTRFT-SIHHAKSMMGVLLNSVDQHKLHHP---IIHHND 87
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +P FD+R+ W NC +I + D +C + F AV + SDR CI SKG+ + LS
Sbjct: 88 INIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSA 147
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ SCC C + C+ G W++ G VTGG TGCQP C HH +
Sbjct: 148 VNLLSCCSRCGF----GCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHST 203
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
+ SCE + +C+ C P Y + DK+ +Y+V +E +I KEIL +GP
Sbjct: 204 SINHSSCEVKYYSTPECYQTC-QPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPV 262
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG--TENGTPYWLVINTWGPHWG 317
ATF ++DDF +YK+GVYK+ + + L H+ ++IGWG T N TPYWL N+W WG
Sbjct: 263 EATFYVFDDFLNYKTGVYKYVTGSLLGG--HAIRIIGWGVSTLNHTPYWLCANSWNKQWG 320
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
D+G KILRG EC E ++ AG PK
Sbjct: 321 DKGYFKILRGSNECGIESMVTAGLPK 346
>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
Length = 254
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 104/261 (39%), Positives = 141/261 (54%), Gaps = 10/261 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAREQWP C TI + D G+C + F AV A SDR CI + + +S E +
Sbjct: 1 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 60
Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
+CC +C C+ G WNF ++G V+GG Y GC+P +I PC HH +
Sbjct: 61 TCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSR 116
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P PK ++ P Y + QDKH +Y V ++E I EI +GP
Sbjct: 117 PPCTGEGDTPKC---SKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGA 173
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W WGD G
Sbjct: 174 FSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFF 231
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KILRG+ C E + AG P+
Sbjct: 232 KILRGQDHCGIESEVVAGIPR 252
>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
Length = 342
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 115/330 (34%), Positives = 167/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C+ C Y + QDKH +Y V E +K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSGESVFQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 118/343 (34%), Positives = 164/343 (47%), Gaps = 14/343 (4%)
Query: 2 IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
I +L F+ C +L+ SD YI IN +A TW AG+NF + ++ R IA
Sbjct: 7 IAVLAFVAVCHGTSLDLHPLSDEYIASINEKATTWKAGKNFEVD---DWERVKKIAAGVL 63
Query: 62 FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
++ +D S VP+ FDARE WP C ++ + D +C + F AV A
Sbjct: 64 PRKAALRFVTQNNPHDE--SEEVPESFDARENWPRCDSLKQIRDQSSCGSCWAFGAVEAM 121
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
SDR CI S +S E + SCC + C G V W++ G VTGG Y
Sbjct: 122 SDRICIHSDQSNQVYVSAEDLNSCC-FGLFACGLGCDGGYVAEPWDYWRTDGIVTGGAYN 180
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC--TNPTYGRGFFQDKHRTTLT 239
GC+ ++ PC HH + P C + +C C ++ Y + +T T
Sbjct: 181 SSQGCKDYSLEPCEHHVEVGSRPQCSSLNFDTPECVRSCYESSLDYTESLTFGQQVSTFT 240
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
NE ++ EIL +GP A F +Y+DF YKSGVY+ T+ + H+ K++GWG
Sbjct: 241 -----NEKQMQLEILKNGPIEAAFTVYNDFLSYKSGVYQATAQDESVGG-HAIKVLGWGV 294
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
E GT YWL+ N+W WGD G K LRG C E AA P
Sbjct: 295 EEGTKYWLIANSWNTDWGDNGYFKFLRGVDHCGIESETAASLP 337
>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
Length = 342
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 166/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 IDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLGIESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
Length = 261
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 104/261 (39%), Positives = 141/261 (54%), Gaps = 10/261 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAREQWP C TI + D G+C + F AV A SDR CI + + +S E +
Sbjct: 2 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 61
Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
+CC +C C+ G WNF ++G V+GG Y GC+P +I PC HH +
Sbjct: 62 TCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSR 117
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P PK ++ P Y + QDKH +Y V ++E I EI +GP
Sbjct: 118 PPCTGEGDTPKC---SKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGA 174
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W WGD G
Sbjct: 175 FSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFF 232
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KILRG+ C E + AG P+
Sbjct: 233 KILRGQDHCGIESEVVAGIPR 253
>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
Length = 342
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 114/329 (34%), Positives = 168/329 (51%), Gaps = 21/329 (6%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP-- 78
SD I IN+ N + A+ S+ + + DA+ + P R+ P
Sbjct: 30 LSDEMISFINKHPNA-----GWKADKSDRF---HSVDDARILLGGRKEDPNLRQKRRPTV 81
Query: 79 ---EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
+ +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 82 DHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSV 141
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 142 ELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCD 197
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H +C ++ +C C Y + QDKH +Y V E I+K+I+
Sbjct: 198 HFVKG-KYRACGDKLYKTPQCKQIC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNED 313
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E IAAG K+
Sbjct: 314 WGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
Length = 342
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 115/329 (34%), Positives = 168/329 (51%), Gaps = 21/329 (6%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP-- 78
SD I IN+ N + A+ S+ + + DA+ R P R+ P
Sbjct: 30 LSDEMILFINKHPNA-----GWKADKSDRF---HSVDDARILLGGRREDPNLREKRRPTV 81
Query: 79 ---EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
+ + +P FD+R++WP C +I + D CA+ +AVGA SDR CI+S G+Q+
Sbjct: 82 DHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSV 141
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
LS + SCCK C C G +W++ K G VTGG + TGC+P C
Sbjct: 142 ELSAIDLISCCKNC----GSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCD 197
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H +C ++ +C C Y + QDKH +Y V E I+KEI+
Sbjct: 198 HFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGEFSYNVIGVESVIQKEIMM 255
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 256 YGPVEAYLHIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTSYWLAANTWNED 313
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E I AG+ K+
Sbjct: 314 WGEKGYFRIVRGRDECLIESFIVAGQIKS 342
>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
Length = 256
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 104/261 (39%), Positives = 141/261 (54%), Gaps = 10/261 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAREQWP C TI + D G+C + F AV A SDR CI + + +S E +
Sbjct: 3 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 62
Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
+CC +C C+ G WNF ++G V+GG Y GC+P +I PC HH +
Sbjct: 63 TCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSR 118
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P PK ++ P Y + QDKH +Y V ++E I EI +GP
Sbjct: 119 PPCTGEGDTPKC---SKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGA 175
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W WGD G
Sbjct: 176 FSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFF 233
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KILRG+ C E + AG P+
Sbjct: 234 KILRGQDHCGIESEVVAGIPR 254
>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
Length = 384
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 111/323 (34%), Positives = 168/323 (52%), Gaps = 18/323 (5%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPLPGDRKTYDPEYSAT 83
ID +N W AG N NL + ++ L+ + K + + L R + +
Sbjct: 67 IDYVNSHQTLWKAGMN-KFNLYSDTVKYGLLGVNNRKKSVEHKKNLSPIRHS-----NIF 120
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAR+ WP C ++ ++ D +C + AAV A SDR CI SKG++ LS + +
Sbjct: 121 IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLL 180
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCCK C + C G W + G VTG DY + +GC+P PC HH +
Sbjct: 181 SCCKTCGF----GCFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHY 236
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C++ P KC+ +C + Y + + DK+ Y V+++ ++I+KEI+ GP A+F
Sbjct: 237 EPCKHDLYPTPKCYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASF 295
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD---RG 320
+Y DF HY SG+YKH + + H+ K++GWG + G YWL N+W WG+ G
Sbjct: 296 EVYTDFLHYTSGIYKHVAGSVGGG--HAVKILGWGIDQGVSYWLAANSWNNDWGEDVFSG 353
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
+ILRG EC E I AG P+
Sbjct: 354 YFRILRGADECGIESGIVAGIPR 376
>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 117/322 (36%), Positives = 167/322 (51%), Gaps = 17/322 (5%)
Query: 22 SDAYIDQINREANTWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSD-RPLPGDRKTYDPE 79
S + D +N + +TW +G N +E L+ + + D+ D LP ++
Sbjct: 17 SQTFYDFVNSQQSTWVSGHNQRWEQFNEATLKTQM---GTFLDEPDFMKLPESTVQFE-- 71
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +P+ FDAR+QWPNC +I V D C + F A A SDR CI + G+Q R +ST
Sbjct: 72 -NLEIPESFDARQQWPNCESIKEVRDQSTCGSCWAFGAAEAMSDRLCIAT-GKQTR-IST 128
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
E + +CC I C+ G WN+ +G VTG +GD + C+P T PC HH
Sbjct: 129 EDLLTCCGITC---GMGCNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPYTFPPCDHHVD 185
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
C + + P C CT + GR + DK R+ +Y V + I+ EI+ GP
Sbjct: 186 DGKYGPCGDSQ-PTPACVKSCTAQS-GRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPV 243
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
A+F +Y+DF YKSGVY++ + A L H+ K+IGWG E PYWLV+N+W WG+
Sbjct: 244 EASFTVYEDFLTYKSGVYQNVAGANLGG--HAVKIIGWGVEKNVPYWLVVNSWNEGWGEN 301
Query: 320 GTVKILRGKYECAFEYLIAAGK 341
G KILRG E I AG+
Sbjct: 302 GLFKILRGSNHVGIEGGIYAGR 323
>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
Length = 279
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 98/260 (37%), Positives = 149/260 (57%), Gaps = 8/260 (3%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ +P +FD+R++WP+C +I + D C + F AV A +DR CI+S GQQ+ LS
Sbjct: 24 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 83
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC+ C + C G W++ KRG VTGG + TGCQP C HH +
Sbjct: 84 DLISCCEDC----GQGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH-TK 138
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P+C + +C C Y + QDKH +Y V +NE I+++I+ +GP
Sbjct: 139 GKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGEESYNVQNNEKVIQRDIMMYGPVE 197
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E TPYWL+ N+W WG++G
Sbjct: 198 AAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKRTPYWLIANSWNEDWGEKG 255
Query: 321 TVKILRGKYECAFEYLIAAG 340
+I+RG+ EC+ E + AG
Sbjct: 256 LFRIVRGRDECSIESNVVAG 275
>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
Length = 317
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 126/342 (36%), Positives = 178/342 (52%), Gaps = 28/342 (8%)
Query: 2 IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
I +L LL T + A ++ +N + +TA + N++EE+ ++F + D KY
Sbjct: 2 ILVLAVLLEATSAFVPIT--GQALVNYVNSAQSMFTAEYS---NVTEEF-KKFRVMDVKY 55
Query: 62 FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
P R + ++P FDAR +WPNC +I + + C + F A
Sbjct: 56 AAPHS---PELRASQVNTVLPSIPTYFDARTRWPNCRSIKMIRNQATCGSCWAFGAAEVM 112
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
SDR CI S G + +S + SCC C Y K S FR WN K+G VTGGDY
Sbjct: 113 SDRICIASMGTKQPIISPTDLLSCCGNFCGYG-CKGASPLQAFRWWN---KKGVVTGGDY 168
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
+GC+P +PC+ LP C + P+ C C P Y + + +DK+ T Y
Sbjct: 169 -RGSGCKPYPFAPCT------ALP-CTKSETPR--CSLNC-QPAYSKAYSKDKYFGTPAY 217
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
V + AI+ EI +GP A F +YDDF HY+SGVY+H + + H+ K+IGWG +
Sbjct: 218 IVGMDVAAIQTEI-TNGPVEAAFIVYDDFNHYRSGVYRHVAGKLVGG--HAVKIIGWGIQ 274
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
NG PYWL+ N+WGP+WG+ G K+LRG EC E I AGKP
Sbjct: 275 NGAPYWLMANSWGPYWGENGFFKMLRGVDECGIESTIVAGKP 316
>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
Length = 341
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 107/291 (36%), Positives = 155/291 (53%), Gaps = 10/291 (3%)
Query: 50 YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
Y +Q L+ D KY DQ++ P E + +P+ +D R QW NC ++ H+PD C
Sbjct: 58 YFKQRLM-DLKYIDQNNIPDEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIPDQANC 116
Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
+ ++ A SDR CI SKG + +S + V SCC C C G + F
Sbjct: 117 GSCWAVSSAAAMSDRICIASKGAKQVLISAQDVVSCCTWC----GDGCEGGWPISAFRFH 172
Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF 229
G VTGGDY + C+P I PC HHG+ C +C RC Y + +
Sbjct: 173 ADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGECVGM-ADTPRCKRRCL-LGYPKSY 230
Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
D++ Y + ++ AI+K+I+ +GP AT+ +Y+DF HY+SG+YKH + K L
Sbjct: 231 PSDRYYKK-AYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTG--L 287
Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
H+ K+IGWG E GTPYW+V N+W WG+ G ++ RG +C FE +AAG
Sbjct: 288 HAVKVIGWGEEKGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 338
>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With Ca074 Inhibitor
gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11017 Inhibitor
gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
Length = 254
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 100/258 (38%), Positives = 144/258 (55%), Gaps = 8/258 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FD+R++WP C +I + D C + F AV A SDR CI+S G+QN LS +
Sbjct: 3 IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 62
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC+ C C G + W++ K G VTG + GC+P C HH +
Sbjct: 63 SCCESC----GLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHH-TKGKY 117
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P C ++ +C C Y + QDKHR +Y V ++E AI+KEI+ +GP A F
Sbjct: 118 PPCGSKIYKTPRCKQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGF 176
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+DF +YKSG+YKH + L H+ ++IGWG EN PYWL+ N+W WG+ G +
Sbjct: 177 TVYEDFLNYKSGIYKHITGETLGG--HAIRIIGWGVENKAPYWLIANSWNEDWGENGYFR 234
Query: 324 ILRGKYECAFEYLIAAGK 341
I+RG+ EC+ E + AG+
Sbjct: 235 IVRGRDECSIESEVTAGR 252
>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 101/267 (37%), Positives = 155/267 (58%), Gaps = 10/267 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + +AVGA SDR CI+S G+Q+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCC+ C C G W++ G VTGG + TGCQP C HH
Sbjct: 145 AIDLISCCENC----GSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHH- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
S PSC ++ +C +C Y + DKH ++ V NE AI+KEI+ +GP
Sbjct: 200 SIGKYPSCGDKIYKTPQCKRKCQK-GYTTPYEHDKHYGGISINVIKNESAIQKEIMMYGP 258
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
A +++DF +YKSG+Y++T+ + + E+Y+ ++IGWG ENGT YWL NTW WG
Sbjct: 259 VEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYV---RIIGWGIENGTAYWLAANTWNEDWG 315
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPKN 344
++G +I+RG+ EC+ E ++ AG+ K+
Sbjct: 316 EKGYFRIVRGRNECSIESVVVAGRLKS 342
>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
Complex
Length = 253
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 111/259 (42%), Positives = 140/259 (54%), Gaps = 8/259 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAREQWPNC TI + D G+C + F AV A SDR CI S G+ N +S E +
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
+CC D WNF K+G V+GG Y GC+P +I PC HH +
Sbjct: 61 TCCGGECGDGCNGGEPSG---AWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRP 117
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P PK C C P Y + +DKH +Y V +NE I EI +GP F
Sbjct: 118 PCTGEGDTPK--CSKTC-EPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF 174
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
++Y DF YKSGVY+H S + H+ +++GWG ENGTPYWLV N+W WGD G K
Sbjct: 175 SVYSDFLLYKSGVYQHVSGEIMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFFK 232
Query: 324 ILRGKYECAFEYLIAAGKP 342
ILRG+ C E I AG P
Sbjct: 233 ILRGQDHCGIESEIVAGMP 251
>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 115/330 (34%), Positives = 166/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + T C+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTSCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYETPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 337
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 119/347 (34%), Positives = 169/347 (48%), Gaps = 21/347 (6%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+ +L + + Y + +I+ IN +A TW AG NF N + + + L ++
Sbjct: 4 VFMLLSVIFVSVYATEQAYFLQEDFINNINEQATTWKAGMNFDPNTPHDDIIKLL--GSR 61
Query: 61 YFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
D+ KT+D Y +P+ FDAR +W C TIG V D G C + A
Sbjct: 62 GVQNPDKVNHKLYKTHDEAYDNLFGRIPEHFDARNKWVYCDTIGRVRDQGNCGSCWAVAT 121
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
AF+DR C+ + G N LS E + CC C + C G + W G VTG
Sbjct: 122 SSAFADRLCVATTGDFNELLSAEEITFCCHTCGF----GCHGGYPIKAWKRFSTHGLVTG 177
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
GDY GC+P + P + S+ + + + C+ + F D HR T
Sbjct: 178 GDYNSGEGCEPYRVPPSNDGNSSSSDQPLAINHICRRHCYGNQSID------FNDDHRYT 231
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLI 295
Y+ +I+K++L +GP A+F +YDDF YKSGVY + NA +YL H+ KLI
Sbjct: 232 RDYYYL-TYGSIQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNA---SYLGGHAVKLI 287
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
GWG E+GTPYWL++N+W WGD G KI RG EC + AG P
Sbjct: 288 GWGEEDGTPYWLMVNSWNTQWGDNGFFKIRRGTNECGVDNSTTAGVP 334
>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
Length = 342
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 114/330 (34%), Positives = 167/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN+ N W A ++ R + DA+ + P R+ P
Sbjct: 30 LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRKEDPNLRQKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D CA+ +AV A SDR CI+S G+Q+
Sbjct: 81 VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCC+ C C G +W++ K G VTGG + TGC+P C
Sbjct: 141 VELSAIDLISCCENC----GSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C C Y + QDKH +Y V E AI+KEI+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYSVIGVESAIQKEIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MYGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E I AG+ K+
Sbjct: 313 DWGEKGYFRIVRGRDECLIESFIVAGQIKS 342
>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
Length = 309
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 107/294 (36%), Positives = 156/294 (53%), Gaps = 13/294 (4%)
Query: 56 IADAKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
+ DA+ R P R+ P + + +P FD+R++WP C +I + D C
Sbjct: 24 VDDARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCG 83
Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
+ +AVGA SDR CI+S G+Q+ LS + SCCK C C G + +W++
Sbjct: 84 SSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWV 139
Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
RG VTGG + TGC+P C H +C ++ +C+ C Y +
Sbjct: 140 LRGIVTGGSKENHTGCRPYPFPKCDHFVKG-KYRACGDKLYKTPQCNQTC-QKGYNTSYE 197
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
QDKH +Y V E I+K+I+ HGP A +Y+DF +YKSG+Y++T+ + H
Sbjct: 198 QDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--H 255
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
+ +LIGWG ENGT YWL NTW WG++G +I+RG+ EC E IAAG K+
Sbjct: 256 AVRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 309
>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
Length = 342
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 99/265 (37%), Positives = 149/265 (56%), Gaps = 8/265 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S GQQ+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCC+ C C G + W++ KRG VTGG + TGCQP C H
Sbjct: 145 ALDLISCCEDC----GDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C C Y + QDKH Y V NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 258
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E G PYWL+ N+W WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
+G +++RG+ EC+ E + AG K
Sbjct: 317 KGLFRMVRGRDECSIESHVVAGLIK 341
>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
Length = 342
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 98/262 (37%), Positives = 148/262 (56%), Gaps = 8/262 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S GQQ+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCC+ C C G W++ KRG VTGG + TGCQP C HH
Sbjct: 145 ALDLISCCEDC----GDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C +C Y + QDK+ Y V NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQKCQK-GYKTPYEQDKNYGDQRYNVISNEKAIQREIMMYGP 258
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E G PYWL+ N+W WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVAGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316
Query: 319 RGTVKILRGKYECAFEYLIAAG 340
G +++RG+ EC+ E + AG
Sbjct: 317 NGLFRMVRGRDECSIESHVVAG 338
>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
Length = 266
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 105/261 (40%), Positives = 140/261 (53%), Gaps = 10/261 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAREQWP C TI + D G+C + F AV A SDR CI + + +S E +
Sbjct: 7 LPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 66
Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
+CC +C C+ G WNF ++G V+GG Y GC+P +I PC H +
Sbjct: 67 TCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGAR 122
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P PK C C P Y + QDKH +Y V ++E I EI +GP
Sbjct: 123 PPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGA 179
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W WGD G
Sbjct: 180 FSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFF 237
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KILRG+ C E + AG P+
Sbjct: 238 KILRGQDHCGIESEVVAGIPR 258
>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
Length = 342
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 178/350 (50%), Gaps = 14/350 (4%)
Query: 1 MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
M+ I V+++ TL+ + K ++ I+ ++ E ++ +++ R + DA
Sbjct: 1 MLKIAVYIVSLFTLLEAHVTKRNNQRIEPLSDEMISFINKHPNAGWKADKSDRFHSVDDA 60
Query: 60 KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
+ + R+ P + + +P FD+R++WP C +I + D CA+
Sbjct: 61 RILLGGRKEDSNLRQKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
++VGA SDR CI+S G+Q+ LS + SCCK C C G +W++ G
Sbjct: 121 VSSVGAMSDRICIQSGGKQSVELSAIDLISCCKNC----GSGCDGGYFLPSWDYWVSHGI 176
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG + TGC+P C H +C ++ +C C Y + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYETPQCKQTC-QKGYNTSYEQDKH 234
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V E I+K+I+ HGP A +Y+DF +YKSG+Y++T+ + H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRL 292
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
IGWG ENGT YWL NTW WG++G +I+RG+ EC E IAAG K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
cantonensis]
Length = 394
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 158/330 (47%), Gaps = 31/330 (9%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY--------- 76
ID +NR+ N W A ++ R+F+ Y D++ L G +
Sbjct: 66 IDYVNRKQNLWKAKKH----------RRFV----HYPDRTKWGLMGVNNVHLSVKAKQHL 111
Query: 77 --DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ +P+ FDAR+ W NC +I ++ D +C + F AV A SDR CI S +
Sbjct: 112 SSTKDLDIDIPETFDARQHWSNCQSIKNIRDQSSCGSCWAFGAVEAMSDRICIASNEKIQ 171
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + + SCC+ C + C G W + G VTG ++ GC+P PC
Sbjct: 172 VTLSADDLLSCCRTCGF----GCEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPYPFPPC 227
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + C + P KC +C + + D+ Y V ++ AI+KEIL
Sbjct: 228 EHHSNKTRFDPCRHDLYPTPKCSKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEIL 287
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP F +Y+DF HY G+Y HT KL H+ KLIGWG + GTPYWL+ N+W
Sbjct: 288 THGPVEVAFEVYEDFLHYAGGIYVHTG-GKLGGG-HAVKLIGWGIDQGTPYWLIANSWNT 345
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG+ G +ILRG EC E + G PK+
Sbjct: 346 DWGEEGFFRILRGVDECGIESGVVGGIPKS 375
>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 323
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 105/273 (38%), Positives = 148/273 (54%), Gaps = 11/273 (4%)
Query: 72 DRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
+RKT D Y +P FDAR+ + +C IG V D G CA+ A F+DR CI S
Sbjct: 52 NRKTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASN 111
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
GQ LS + + SC + C GS F+ W +G VTGG++ GCQP
Sbjct: 112 GQFTDNLSAQNLMSCGD----GEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYK 167
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDA 248
PC H+G + L +C + + ++ C +C N Y + D H+T++ Y N
Sbjct: 168 NRPCDHYGDSR-LTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQ 226
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWL 307
I++EI+ HGP TA +Y++F YK G+YK T+ +L Y H KLIGWG + +GT YWL
Sbjct: 227 IQQEIMTHGPVTAFMYVYENFMGYKEGIYKSTT-GELIGY-HHVKLIGWGVDGDGTEYWL 284
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+N+W +WG+ G KILRG C+ E L+ AG
Sbjct: 285 AMNSWNSNWGNDGLFKILRGYNFCSIELLVMAG 317
>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 316
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 104/293 (35%), Positives = 157/293 (53%), Gaps = 13/293 (4%)
Query: 49 EYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGA 108
E+LR+ ++ +K+ +++++P D + + +PD FDAR WP+C +I ++ D
Sbjct: 36 EHLRRKVMK-SKFINRNNKPREDDTEID----GSKIPDSFDARVTWPHCPSISYIRDQSQ 90
Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
C + F++ SDR CI S G + LS + + SCC D C G W +
Sbjct: 91 CGSCWAFSSAEVMSDRVCIASHGHKKVELSADDILSCCT----DGGYGCDGGWPVSAWQY 146
Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG 228
+ G VTGG YG + C+P I PC H + +C Q++ C T C Y
Sbjct: 147 FVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSNC-TQEIDTPDCKTTC-QAGYPIS 204
Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
+ DK Y V ++ AI+KEI+ +GP A F +YDDF+HYK+G+YKH S A+
Sbjct: 205 YDDDKTYGKTAYSVSNSVHAIQKEIMTYGPVVAAFTVYDDFFHYKTGIYKHVSGAEAGG- 263
Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
H+ +++GWG + G PYWLV N+W WG+ G +ILRG EC E + AG+
Sbjct: 264 -HAVRILGWGQQGGVPYWLVANSWNTDWGENGYFRILRGSDECGIEDGVVAGQ 315
>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
E64c Complex
gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca073 Complex
gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca042 Complex
gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca059 Complex
gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca074me Complex
gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca075 Complex
gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca076 Complex
gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca077 Complex
gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca078 Complex
Length = 256
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 109/261 (41%), Positives = 139/261 (53%), Gaps = 12/261 (4%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAREQWPNC TI + D G+C + F AV A SDR CI S G+ N +S E +
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDM- 59
Query: 144 SCCKICRYDDNKSCSHGSVFRT--WNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ F + WNF K+G V+GG Y GC+P +I PC HH +
Sbjct: 60 ----LTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGS 115
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
P PK C C P Y + +DKH +Y V +NE I EI +GP
Sbjct: 116 RPPCTGEGDTPK--CSKTC-EPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEG 172
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
F++Y DF YKSGVY+H S + H+ +++GWG ENGTPYWLV N+W WGD G
Sbjct: 173 AFSVYSDFLLYKSGVYQHVSGEIMGG--HAIRILGWGVENGTPYWLVGNSWNTDWGDNGF 230
Query: 322 VKILRGKYECAFEYLIAAGKP 342
KILRG+ C E I AG P
Sbjct: 231 FKILRGQDHCGIESEIVAGMP 251
>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 189/351 (53%), Gaps = 16/351 (4%)
Query: 1 MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
M+ I V+++ TL+ + ++ I+ ++ E ++ +++ R + DA
Sbjct: 1 MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINKHPNAGWKADKSDRFHSVDDA 60
Query: 60 KYF---DQSDRPLPGDRK-TYDP-EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
+ + D + R+ T D + + +P +FD+R++WP+C +I + D C +
Sbjct: 61 RILLGGGKEDAEMKWKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSQCGSSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+AVGA SDR CI+S G+Q+ LS + SCC+ C C G W++ G
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAIDLISCCENC----GSGCDGGFPGPAWDYWVSHGI 176
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG + TGCQP C HH S PSC ++ +C +C Y + DKH
Sbjct: 177 VTGGSKENHTGCQPYPFPKCEHH-SIGKYPSCGDKIYKTPQCKRKCQK-GYTTPYEHDKH 234
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGK 293
++ V NE AI+ EI+ +GP A +++DF +YKSG+Y++T+ + + E+Y+ +
Sbjct: 235 YGGISINVIKNESAIQNEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYV---R 291
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
+IGWG ENGT YWL NTW WG++G +I+RG+ EC+ E ++ AG+ K+
Sbjct: 292 IIGWGIENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAGRLKS 342
>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sj31; Flags: Precursor
gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
Length = 342
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 98/265 (36%), Positives = 149/265 (56%), Gaps = 8/265 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S G Q+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCCK C C G W++ KRG VTGG + TGCQP C HH
Sbjct: 145 ALDLISCCKDC----GDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C C Y + QDKH +Y V +NE I+++I+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGP 258
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E TPYWL+ N+W WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKRTPYWLIANSWNEDWGE 316
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
+G +++RG+ EC+ E + AG K
Sbjct: 317 KGLFRMVRGRDECSIESDVVAGLIK 341
>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
Length = 329
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 117/321 (36%), Positives = 163/321 (50%), Gaps = 23/321 (7%)
Query: 23 DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
A +D +N + + + ++EE ++ F + D KY + R T A
Sbjct: 31 QALVDYVNSAQSLF---KTEHVEITEEEMK-FKLMDGKYAAAHSDEI---RATEQEVVLA 83
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+VP FD+R QW C +I + D C + F A SDR CI++KG Q +S + +
Sbjct: 84 SVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDL 143
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C G + + +G VTGGDY GC+P I+PC T
Sbjct: 144 LSCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------T 192
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
+C K P C C + Y + +DKH Y V N +I+ EI A+GP A
Sbjct: 193 SGNCPESKTPS--CSMSCQS-GYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAA 249
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F++Y+DFY YKSGVYKHT+ L H+ K+IGWGTE+G+PYWLV N+WG +WG+ G
Sbjct: 250 FSVYEDFYKYKSGVYKHTAGKYLGG--HAIKIIGWGTESGSPYWLVANSWGVNWGESGFF 307
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KI RG +C E + AGK K
Sbjct: 308 KIYRGDDQCGIESAVVAGKAK 328
>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
Length = 342
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 113/330 (34%), Positives = 167/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN N W A ++ R + DA+ + P R+ P
Sbjct: 30 LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRKEDPNLRQRRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P FD+R++WP C +I + D C + +A+GA SDR CI+S G+Q+
Sbjct: 81 VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAIGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCC+ C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VKLSAVDLISCCENC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C+ C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 304
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 99/262 (37%), Positives = 147/262 (56%), Gaps = 8/262 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S G Q+ LS
Sbjct: 47 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELS 106
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCCK C C G + W++ KRG VTGG + TGCQP C H
Sbjct: 107 ALDLISCCKDC----GDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 161
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C C Y + QDKH Y V NE AI++EI+ +GP
Sbjct: 162 TKGKYPACGTKIYKTPQCKQTC-QKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 220
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E TPYWL+ N+W WG+
Sbjct: 221 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKRTPYWLIANSWNEDWGE 278
Query: 319 RGTVKILRGKYECAFEYLIAAG 340
+G +I+RG+ EC+ E + AG
Sbjct: 279 KGLFRIVRGRDECSIESHVVAG 300
>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
Length = 319
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 106/310 (34%), Positives = 162/310 (52%), Gaps = 15/310 (4%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPLPGDRKTYDPEYSAT 83
ID +N W AG N NL + ++ L+ + K + + L R + +
Sbjct: 23 IDYVNSHQTLWKAGMN-KFNLYSDTVKYGLLGVNNRKKSVEHKKNLSPIRHS-----NIF 76
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAR+ WP C ++ ++ D +C + AAV A SDR CI SKG++ LS + +
Sbjct: 77 IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLL 136
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCCK C + C G W + G VTG DY + +GC+P PC HH +
Sbjct: 137 SCCKTCGF----GCFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHY 192
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C++ P KC+ +C + Y + + DK+ Y V+++ ++I+KEI+ GP A+F
Sbjct: 193 EPCKHDLYPTPKCYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASF 251
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y DF HY SG+YKH + + H+ K++GWG + G YWL N+W WG+ G +
Sbjct: 252 EVYTDFLHYTSGIYKHVAGSVGGG--HAVKILGWGIDQGVSYWLAANSWNNDWGEDGYFR 309
Query: 324 ILRGKYECAF 333
ILRG EC
Sbjct: 310 ILRGADECGM 319
>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 348
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 121/336 (36%), Positives = 168/336 (50%), Gaps = 28/336 (8%)
Query: 16 GELYK-----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
ELY+ + +++IN + N WTA + E + + + + L
Sbjct: 19 AELYEDTRPAIMQSLVNEINSKQNLWTASTD-----QERFYGRLKLCGT--LHEGTEGL- 70
Query: 71 GDRKTYDPEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
+ K Y P A +P FDAR+ + C IGHV D ACA+ A V AFS R CIKS
Sbjct: 71 -EEKVYPPGELADIPSSFDARDAFKECKDVIGHVWDQSACASCWAIAPVQAFSARLCIKS 129
Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT----- 184
G+ N+ LS + +CC + + + C G W FL+K G TGGD+ ++
Sbjct: 130 GGKFNQLLSAGELLACCNLAHSCEARGCKGGVARDAWVFLNKHGIATGGDFVPKSSMEAV 189
Query: 185 -GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT--TLTYW 241
GC P C+H+ C + C RC N YG +D+H T + YW
Sbjct: 190 DGCWPYNFPRCAHYQKKSKYGPCPKKSYETPSCLDRCPNEKYGTPLDKDRHFTARAVPYW 249
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
+ +IKKEI+ HGPT+A+F Y+DF+ YKSGVYK+TS A +E H+ +LIGWGTE
Sbjct: 250 FNGIR-SIKKEIMKHGPTSASFFTYEDFFSYKSGVYKYTSGAYVE--FHTVELIGWGTEK 306
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
G YWL N W W D GT KI +G +C L+
Sbjct: 307 GVDYWLAKNDWNEEWADLGTFKIAQG--DCGINDLV 340
>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
Length = 337
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 104/260 (40%), Positives = 140/260 (53%), Gaps = 9/260 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR+QWP+C TIG + D +C + F AV A SDR CI + G + +S +
Sbjct: 80 IPKAFDARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLI 139
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C + C G W+F G VTGG + TGC+ CSHHGS
Sbjct: 140 SCCGYCGF----GCQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHGSK-KY 194
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P C ++ C +C P + DK R +TY V ++AI KEI+ +GP A F
Sbjct: 195 PPCSHRIYDTPNCVQKCDTPD--TDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAF 252
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+DF YKSGVY H+ L H+ +++GWG ENG YWL+ N+W WG+ G K
Sbjct: 253 QVYEDFLGYKSGVYFHSDGTLLGG--HAIRILGWGEENGVAYWLIANSWNDGWGEDGYFK 310
Query: 324 ILRGKYECAFEYLIAAGKPK 343
+LRGK EC E + AG P+
Sbjct: 311 MLRGKNECGIEDEVTAGLPE 330
>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 337
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 104/260 (40%), Positives = 140/260 (53%), Gaps = 9/260 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR+QWP+C TIG + D +C + F AV A SDR CI + G + +S +
Sbjct: 80 IPKAFDARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLI 139
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C + C G W+F G VTGG + TGC+ CSHHGS
Sbjct: 140 SCCGYCGF----GCQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHGSK-KY 194
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P C ++ C +C P + DK R +TY V ++AI KEI+ +GP A F
Sbjct: 195 PPCSHRIYDTPNCVQKCDTPD--TDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAF 252
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+DF YKSGVY H+ L H+ +++GWG ENG YWL+ N+W WG+ G K
Sbjct: 253 QVYEDFLGYKSGVYFHSDGTLLGG--HAIRILGWGEENGVAYWLIANSWNDGWGEDGCFK 310
Query: 324 ILRGKYECAFEYLIAAGKPK 343
+LRGK EC E + AG P+
Sbjct: 311 MLRGKNECGIEDEVTAGLPE 330
>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
Length = 323
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 104/273 (38%), Positives = 148/273 (54%), Gaps = 11/273 (4%)
Query: 72 DRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
+RKT D Y +P FDAR+ + +C IG V D G CA+ A F+DR CI S
Sbjct: 52 NRKTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASN 111
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
GQ LS + + SC + C GS F+ W +G VTGG++ GCQP
Sbjct: 112 GQFTDNLSAQNLMSCGD----GEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYK 167
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDA 248
PC H+G + L +C + + ++ C +C N Y + D H+T++ Y N
Sbjct: 168 NRPCDHYGDSR-LTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQ 226
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWL 307
I++EI+ +GP TA +Y++F YK G+YK T+ +L Y H KLIGWG + +GT YWL
Sbjct: 227 IQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTT-GELIGY-HHVKLIGWGVDGDGTEYWL 284
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+N+W +WG+ G KILRG C+ E L+ AG
Sbjct: 285 AMNSWNSNWGNDGLFKILRGYNFCSIELLVMAG 317
>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
Length = 341
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 122/345 (35%), Positives = 162/345 (46%), Gaps = 16/345 (4%)
Query: 3 HILVFLLGCTLVRGELYKFSDAYI-DQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
IL+ LL C + F +I D +NR W AG N L + +
Sbjct: 6 EILLLLLFCNIWLSCNANFKLQHIVDHVNRANVPWEAGIN---QLGTSDYKNIVGTWGFQ 62
Query: 62 FDQSDRPLPGDR-KTYD-PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ D + G + YD + S +P+ FDAR +W C +I H+ + G CAA +
Sbjct: 63 KNGKDIDIIGHKVHNYDLDDGSNDMPETFDARNKWFECVSIAHIWNQGNCAADWAISVTS 122
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A +DR CIKSK S + + SCC C C+ G W + KRG VTGGD
Sbjct: 123 AINDRICIKSKKNITAFYSPQKMLSCCDDC----GDGCNGGYSGAAWQYWMKRGLVTGGD 178
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPS--CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
YG GCQP I PC+H PS C K +C C NP Y + F +D +
Sbjct: 179 YGSNEGCQPWLIPPCNHTVMDERSPSYMCGKYKSETPQCTLNCYNPNYSKPFLKDISKGI 238
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
W I+ E+ HGP TA +Y+DF YKSG+Y+H + L + K+IGW
Sbjct: 239 RIDW--HCSGMIRNELKKHGPATAIMRVYEDFLTYKSGIYQHVTGKLLGQI--TVKVIGW 294
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G G YWL N+WG WGD+G KI RG EC FE +G+P
Sbjct: 295 GVYRGVQYWLAANSWGTSWGDKGFFKIRRGYNECLFEDYFISGRP 339
>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 115/330 (34%), Positives = 165/330 (50%), Gaps = 23/330 (6%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN N W A ++ R + DA+ R P R+ P
Sbjct: 30 LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 81 VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCCK C C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H +C ++ +C+ C Y + QDKH +Y V E I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIG G ENGT YWL NTW
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGCGVENGTAYWLAANTWNE 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC E IAAG K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
Full=Cysteine protease-related 3; Flags: Precursor
gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
Length = 370
Score = 194 bits (492), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 119/352 (33%), Positives = 171/352 (48%), Gaps = 33/352 (9%)
Query: 4 ILVFLLGCT-LVRGELYKFS------DAYIDQINREANTWTAGRNFPANLSEEYLRQFLI 56
+ +FL GC+ V E+ + +D +N +W A N + E+ +F +
Sbjct: 7 LALFLAGCSAFVLDEIRGINIGQSPQKVLVDHVNTVQTSWVAEHNEIS----EFEMKFKV 62
Query: 57 ADAKYFD--QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
D K+ + + D + + +PD FDARE+WP+C TI + + C +
Sbjct: 63 MDVKFAEPLEKDSDVASELFVRGEIVPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWA 122
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRG 173
F A SDR CI+S G Q +S E + SCC C Y C G F G
Sbjct: 123 FGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGY----GCKGGYSIEALRFWASSG 178
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
+VTGGDYG GC P + +PC+ + T PSC+ T C + + +DK
Sbjct: 179 AVTGGDYGGH-GCMPYSFAPCTKNCPESTTPSCK----------TTCQSSYKTEEYKKDK 227
Query: 234 HRTTLTYWVDDNEDA--IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
H Y V + I+ EI +GP A++ +Y+DFYHYKSGVY +TS + H+
Sbjct: 228 HYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGG--HA 285
Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
K+IGWG ENG YWL+ N+WG +G++G KI RG EC E + AG K
Sbjct: 286 VKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAK 337
>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
Length = 339
Score = 193 bits (491), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 165/330 (50%), Gaps = 16/330 (4%)
Query: 17 ELYKFSDAYIDQINREAN-TWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRP-LPGDR 73
+ FSD I +N E+ +W A R+ +N+ L +++ + RP + D
Sbjct: 22 QFEAFSDELIRFVNEESGASWKAARSTRFSNVDHFKLHLGALSETPEERNALRPTIKHDI 81
Query: 74 KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
D +P+ FDAR QWP C TI + D +C + AA A SDR CI S GQ
Sbjct: 82 SKND------LPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQM 135
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
L+ SCC C + C G + W++ + G VTGG + +RTGCQP +
Sbjct: 136 RPRLAAADPLSCCTYC----GQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTK 191
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C H G + C + P C C Y + + QDK +Y V ++E I +EI
Sbjct: 192 CDHVGDSRKYSRCPHYTYPTPPCARACQT-GYNKTYEQDKFYGNSSYNVGEHESYIMQEI 250
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
+ +GP TFA++ DF Y+SG+Y H + + H+ ++IGWG ENG YWL+ N+W
Sbjct: 251 MKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGR--HAVRMIGWGVENGVNYWLMANSWN 308
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WG+ G +++RG+ EC E + AG P+
Sbjct: 309 EEWGENGYFRMVRGRNECGIESEVVAGMPR 338
>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
Length = 332
Score = 193 bits (491), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 115/346 (33%), Positives = 173/346 (50%), Gaps = 22/346 (6%)
Query: 2 IHILVFLLGCTLV-RGELYK---FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
+ +LVF +G ++ R E F+D ++ Q+ R A TWT F + E +
Sbjct: 4 VKLLVFAIGVVVIARSERLGDDPFNDGFLAQVQRHAKTWTPDATFRDGIRFENFQ----- 58
Query: 58 DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
+ K +S K +D Y+ +P+ FDARE+WP C +I + + G C A AA
Sbjct: 59 NMKGIFESKIGFRLPTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAA 118
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
V SDR CI S+G+ + L+ E + CCK C N G+ F+ W + G V+G
Sbjct: 119 VSVMSDRLCIHSEGKFDVELAAEDLMGCCKDCGNGCNGGFLDGTSFQYWVDV---GLVSG 175
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
Y GC+P PC + C +K P C CT Y + +DK+ +
Sbjct: 176 AAYNSTDGCKPYPFKPCLY-----PFVGCHPEKTP--SCTHHCTE-GYDGTYRRDKYYGS 227
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
Y + ++E I+ EI+ +GP + F++Y D Y YK+GVY+H ++ H+ +LIGW
Sbjct: 228 AAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGK--HAVRLIGW 285
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G E G PYWL+ N++G WG+ G K LRG E ++ AG PK
Sbjct: 286 GKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLPK 331
>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
Length = 319
Score = 193 bits (491), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 108/313 (34%), Positives = 154/313 (49%), Gaps = 21/313 (6%)
Query: 28 QINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDR 87
++N++ N+W A N P ++ ++ +PLP E +P
Sbjct: 26 RVNKQQNSWVANENTPLRDYSSFIGTL---------KNKKPLPIRSIPIKRE----LPKE 72
Query: 88 FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
FD+ E+WP C +I V D +CA+ F V +DR CI+SKG+ LS E V CCK
Sbjct: 73 FDSSEKWPECPSILEVRDQSSCASCWAFGVVEVATDRICIESKGKNQVRLSAEDVLECCK 132
Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
C + C G W +L + G VTGG Y C+ PCS HG P C
Sbjct: 133 DCGFQ----CQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSYPFPPCS-HGIEGQYPQCS 187
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
+ KC T C Y + +D+++ + Y +++N D IK EI+ +GP A+F +Y+
Sbjct: 188 TKPPVVPKCETTCQE-GYPIEYEKDRYKFSNVYQLENNVDQIKNEIMENGPVDASFQVYE 246
Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
DF YKSG+Y H K N LH+ K+IGWG ENG YW +N+W WG+ G +I G
Sbjct: 247 DFMTYKSGIYHHVE-GKFMN-LHTVKIIGWGEENGEAYWKAVNSWNSEWGENGLFRIRLG 304
Query: 328 KYECAFEYLIAAG 340
EC E + G
Sbjct: 305 TNECTIESQVEGG 317
>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
Length = 374
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 116/321 (36%), Positives = 162/321 (50%), Gaps = 23/321 (7%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG-DRKTYDPEYSA 82
A +D IN+ ++W A N ++EE ++ F + D ++ D P D PE
Sbjct: 43 ALVDYINKAQSSWVAEHN---EMTEEEMK-FKVMDERFADPLQDGEPELDWGEIVPE--- 95
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+PD FD+REQWP C +I + + C + F A SDR CI+S Q +S E +
Sbjct: 96 PLPDTFDSREQWPECKSIKLIRNQATCGSCWAFGAAEIISDRICIQSNATQTPIISVEDI 155
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC + K C G F G+VTGGDY + GC P + +PC
Sbjct: 156 LSCCGV---SCGKGCQGGYSIEALRFWKSSGAVTGGDY-NGAGCMPYSFAPCKKD----- 206
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
SC P C T C + + +DKH T Y + ++ AI+ EI +GP A+
Sbjct: 207 --SCAQGTTPS--CKTTCQSSYKTAEYTKDKHFGTTAYKITNSVAAIQTEIYHNGPVEAS 262
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+DFY YKSGVY++TS + H+ K+IGWGTENG YWL+ N+WG +GD G
Sbjct: 263 FKVYEDFYKYKSGVYQYTSGKLVGG--HAVKIIGWGTENGVDYWLIANSWGTTFGDSGFF 320
Query: 323 KILRGKYECAFEYLIAAGKPK 343
K+ RG E E + AG K
Sbjct: 321 KMRRGTNEVGIEGNVVAGTAK 341
>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
Length = 330
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 115/320 (35%), Positives = 164/320 (51%), Gaps = 23/320 (7%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
A +D +N + +T +SEE+++ ++ D KY + R T A+
Sbjct: 33 ALVDYVNSAQSLFTTEH---VEVSEEFMKSRVM-DVKYAAAHSDEI---RATEVNTVLAS 85
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FD+R QW C +I + + C + F A SDR CI++KG Q +S + +
Sbjct: 86 IPASFDSRTQWSECKSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQPIISPDDLL 145
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C G + + +G VTGGDY GC+P I+PC T
Sbjct: 146 SCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------TS 194
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
+C K P C C + Y + +DKH Y V + AI+ EI+ +GP A F
Sbjct: 195 GNCPESKTPA--CSLSCQSG-YSTAYAKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAF 251
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+DFY YKSGVYKHT+ L H+ K+IGWGTE+G+PYWLV N+WG +WG+ G K
Sbjct: 252 TVYEDFYKYKSGVYKHTAGKALGG--HAIKIIGWGTESGSPYWLVANSWGTNWGESGFFK 309
Query: 324 ILRGKYECAFEYLIAAGKPK 343
ILRG +C E + AGK +
Sbjct: 310 ILRGDDQCGIEGAVVAGKAR 329
>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
Length = 346
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 171/359 (47%), Gaps = 44/359 (12%)
Query: 2 IHILVFLLGCTLVRGELYKFSDA--YIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIAD 58
I I++ + VR ++A ID +N + NTW A + D
Sbjct: 7 IFIVLATMVAVAVRESSAVTNEATFIIDSVNADPGNTWRASDT-----------NVIPGD 55
Query: 59 AKYFDQSDRPLPGD---------RKTYDPEYSATVPDRFDAREQWPNCGTI-GHVPDTGA 108
K F+Q LP + +K+ + E + +P+ FDARE+WP C ++ G + D
Sbjct: 56 GKNFNQLMGVLPRNFNSFRFAPIKKSAEDESNEALPENFDARERWPECSSLLGSIKDQSN 115
Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
C + +A FSDR CI + G R LS E + +CC C C GS W F
Sbjct: 116 CGSCWAVSAASVFSDRLCIATGGAVARNLSAEQLNTCCYRC----GNGCDGGSPESAWYF 171
Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA-----PTLPSCENQKVPKLKCHTRCTNP 223
+ G VTGGDYG GCQP +I PC + P P C + CTN
Sbjct: 172 FMRHGIVTGGDYGSEDGCQPYSIYPCGKGRNTCIEDDPDTPDCSIKT---------CTNS 222
Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
Y + + D H Y + +E+ I K++ +GP A F +Y DF +YKSGVY +T
Sbjct: 223 NYSKNYRADLHYVDTVYSLSRSEEDIMKDLYKNGPVQAAFYVYTDFMYYKSGVYSYT-RG 281
Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
++E H+ K++GWG ++GT YWL N+W WG+ G +ILRG EC E + AG P
Sbjct: 282 QIEGG-HAIKILGWGVDDGTKYWLCANSWSRSWGENGLFRILRGNNECHIEDRVIAGMP 339
>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 341
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 123/358 (34%), Positives = 174/358 (48%), Gaps = 37/358 (10%)
Query: 2 IHILVFLLGCTL--------VRGELYKFSD---AYIDQINREANTWTAGRNFPANLSEEY 50
I I+ LL L ++ + +K+SD +++N TW AG N
Sbjct: 6 IFIVAALLSAALTGFYTYEALKHKEFKYSDRLKQLAEEVNNANTTWKAGENI-------- 57
Query: 51 LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT---VPDRFDAREQWPN-CGTIGHVPDT 106
+++ AD L GD P +A +P FDAR+QW + C ++ V D
Sbjct: 58 --KWINADIAGVKAHLGALEGDNGENLPVSNAVKADLPTAFDARQQWGDKCTSLWEVRDQ 115
Query: 107 GACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTW 166
C + F AV + +DR CI GQ R LS + + +CC C + C+ G
Sbjct: 116 SNCGSCWAFGAVESLTDRHCIH-LGQDIR-LSAQNMLTCCATC----GQGCNGGYPASAM 169
Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
++ K G VTG Y CQ + +PC+HH P P+C + +P KC C + G
Sbjct: 170 SYYVKTGLVTGDLYNTTGWCQAYSFAPCAHHVDTPLYPACTGE-LPTPKCAKTCDS---G 225
Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
G H+ + Y V ++AI EI +GP A F +Y+DF +YKSGVYKH + L
Sbjct: 226 SGQTYTVHKGSKAYSVGKTQEAIMTEIQTNGPVEAAFTVYEDFLNYKSGVYKHVTGKALG 285
Query: 287 NYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
H+ K++GWG EN TPYW+V+N+W WGD GT KILRGK EC E + P N
Sbjct: 286 G--HAIKIVGWGVENNTPYWIVVNSWNQTWGDNGTFKILRGKNECGIEAQVVTALPLN 341
>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
Length = 320
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 159/326 (48%), Gaps = 31/326 (9%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
S +I+ IN++ +W AG NFP N +LR A + D D +T +
Sbjct: 22 LSQQFINAINQKHPSWLAGPNFPPNTPHSHLRSLNGA------RDDPAFFTDTETKNVTI 75
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P FDAR WP C +I + + G+C + F AV SDR CI S + S +
Sbjct: 76 PEQIPQNFDARIVWPQCESIRKIRNQGSCGSCWAFGAVETMSDRLCIASNATKKFEFSAQ 135
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ +CCK C + C G R W + G V+GGD+ GC P ++
Sbjct: 136 DLLACCKECGH----GCGGGYSSRAWQYWVTDGIVSGGDFNTSQGCHPYSVQAFRDS--- 188
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
T P+C + CTNP Y + + +DK +Y + N + I+ EI+ GP
Sbjct: 189 -TTPNCS----------SFCTNPKYQKNYSEDKRYGARSYRIAKNIEQIQAEIMTSGPVQ 237
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENY--LHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A++ +YDDFY Y++GVY+H L N HS K++GWG ENGT YWLV N+WG WG
Sbjct: 238 ASYVVYDDFYSYQNGVYQHV----LGNVSGRHSVKILGWGRENGTDYWLVANSWGRDWGR 293
Query: 319 RGT-VKILRGKYECAFEYLIAAGKPK 343
G K LRG+ C E I G PK
Sbjct: 294 LGGFFKFLRGENHCDIESNILGGDPK 319
>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
Length = 332
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 173/346 (50%), Gaps = 22/346 (6%)
Query: 2 IHILVFLLGCTLV-RGELYK---FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
+ +LVF +G ++ R E F+D ++ Q+ R A TWT F + E +
Sbjct: 4 VKLLVFAIGVVVIARSERLGDDPFNDGFLAQVQRHAKTWTPDATFRDGIRFENFQ----- 58
Query: 58 DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
+ K +S K +D Y+ +P+ FDARE+WP C +I + + G C A A
Sbjct: 59 NMKGIFESKIGFRLPTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAT 118
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
V SDR CI S+G+ + L+ E + CCK C N G+ F+ W + G V+G
Sbjct: 119 VSVMSDRLCIHSEGKFDVELAAEDLMGCCKDCGNGCNGGFLDGTSFQYWVDV---GLVSG 175
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
Y + GC+P PC + C +K P C CT Y + +DK+ +
Sbjct: 176 AAYNNTDGCKPYPFKPCLY-----PFVGCHPEKTP--SCTHHCTE-GYDGTYRRDKYYGS 227
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
Y + ++E I+ EI+ +GP + F++Y D Y YK+GVY+H ++ H+ +LIGW
Sbjct: 228 AAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGK--HAVRLIGW 285
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G E G PYWL+ N++G WG+ G K LRG E ++ AG PK
Sbjct: 286 GKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLPK 331
>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 332
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 115/318 (36%), Positives = 151/318 (47%), Gaps = 16/318 (5%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
+D IN TW A R +E R + D+ LP + +P
Sbjct: 29 VDHINSLKTTWVAERPTRFGSFDEVARLCGALETP----EDQRLPLKVA----PIAEAIP 80
Query: 86 DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASC 145
D FD+R WP C TI V D AC + F AV + SDR CI S + LS + SC
Sbjct: 81 DTFDSRTNWPACPTIKEVRDQSACGSCWAFGAVESMSDRICIASNATKIVRLSASDLLSC 140
Query: 146 CKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPS 205
C C C G + +W++ +G VTG Y C+P C+HH ++P P
Sbjct: 141 CTSC----GDGCDGGQLGPSWDYYKNKGIVTGYLYNTTGYCKPYDFPACAHHEASPDYPD 196
Query: 206 CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFAL 265
C + KC C + D H +Y V + AI+ EIL HGP A F +
Sbjct: 197 CPSTDYSTPKCTKSCVAGYTANTYTADLHYGQSSYSVGRTDAAIQTEILNHGPVEAAFTV 256
Query: 266 YDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKIL 325
Y DF Y+SGVYKHTS + L H+ ++GWGTE+G+PYWLV N+W P WGD G KIL
Sbjct: 257 YSDFPTYRSGVYKHTSGSVLGG--HAISIVGWGTESGSPYWLVKNSWNPSWGDGGFFKIL 314
Query: 326 RGKYECAFEYLIAAGKPK 343
RG +C + G PK
Sbjct: 315 RG--DCGINNDVVGGLPK 330
>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
Length = 350
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 114/347 (32%), Positives = 176/347 (50%), Gaps = 18/347 (5%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINR--EANTWTAGRNFP-ANLSEEYLRQFL--IAD 58
+L+ L+ ++ + F+ ++++N +TW AG N +S + ++ + IA
Sbjct: 6 LLIALIVASVQAFDFKLFTSEIMEEVNNYNTGSTWKAGYNKRFEGMSFDQIQAMMGTIAT 65
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ +R P ++ + ++P+ FD RE +P C ++ V D C + F V
Sbjct: 66 PVHMIPDERYTP-----FETIQNLSLPESFDLREAYPKCESLQQVRDQSNCGSCWAFGTV 120
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI S + +S+E + SCC+ + C+ G WN+ K G V+G
Sbjct: 121 EAISDRICIASGQKDQTRISSENLLSCCR-GTFACGMGCNGGYTAGAWNYYVKTGLVSGN 179
Query: 179 DYGD-----RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
Y D +T CQP + PCSHH + + KC+T C + + QD
Sbjct: 180 LYTDDNQNSKTECQPYSFPPCSHHVQGEYQACTDLPQFNTPKCYTECNSQYTQNSYEQDL 239
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
H+ +Y V +E+ IK EI +G TTA+F +Y DF Y SGVY++TS + + H+ K
Sbjct: 240 HKGVSSYSVPKSEEQIKAEIYQYGSTTASFNVYSDFLTYSSGVYQNTSGSYMGG--HAIK 297
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
++GWG ENGTPYWL N+W WG+ G KILRG EC E + AG
Sbjct: 298 MLGWGVENGTPYWLCANSWNSSWGENGFFKILRGSNECGIESGMVAG 344
>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
Length = 337
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 119/331 (35%), Positives = 164/331 (49%), Gaps = 26/331 (7%)
Query: 21 FSDAYIDQINREAN-TWTAGRNFPAN--LSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
FSD I IN ++ +W A P++ ++ E+ +Q L +++ P +R+T
Sbjct: 16 FSDELIHYINEKSGASWKAA---PSSRFINIEHFKQHL----GLLEET----PEERQTRR 64
Query: 78 PEYSATV-----PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
P V P+ FDARE+WP C +I +PD +C + A VGA SDR CI S G
Sbjct: 65 PTVRYNVSDNDLPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGM 124
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
LS + SCC C C GS W++ + G VTGG + TGC P
Sbjct: 125 MQPELSAIDLVSCCSYC----GNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFP 180
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
C H GS L C P C+ C Y + + +DK +Y VD +E I +E
Sbjct: 181 QCRHPGSRSQLNPCPRYTYPTPSCYPYC-QAGYDKTYEKDKVYGKTSYNVDRHEYTIMEE 239
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
I+ +GP A F +Y DF YKSG+Y H S H+ ++IGWG ENG YWL N+W
Sbjct: 240 IMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGK--HAIRIIGWGVENGVKYWLTANSW 297
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WG+ G +ILRG EC E ++ AG P+
Sbjct: 298 NVGWGENGYFRILRGTDECRIESIVVAGMPR 328
>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
Length = 356
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 117/320 (36%), Positives = 159/320 (49%), Gaps = 15/320 (4%)
Query: 23 DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
D I ++N +W AG NF +N + ++ +A D LP + D +
Sbjct: 41 DDIIAKVNSADLSWKAGANFNSNYAPKH-----VAGLCGTIMGDDRLPVNHLLNDADLE- 94
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P FD+RE WP+C +I V D G+C + F A A SDR CI S LS+E +
Sbjct: 95 -LPANFDSREAWPDCPSISEVRDQGSCGSCWAFGASEAISDRTCIHSNAAFTFDLSSEDL 153
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC Y C+ G W + + G V+GG Y TGCQP I PC HH +
Sbjct: 154 LSCCG---YVCGNGCNGGFPQAAWEYWVQNGLVSGGLY-HGTGCQPYAIEPCEHH-TEGD 208
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P C ++ KC +C + Y F QDKH ++ Y + NE AI EI +GP
Sbjct: 209 RPPCTGEEGTTPKCSHKCVD-GYTGNFAQDKHYGSVAYRIPANEKAIMNEIYKNGPVEGA 267
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+DF YKSGVY H + + L H+ +++GWG ENG YWL N+W WG+ G
Sbjct: 268 FIVYEDFPTYKSGVYSHHTGSALGG--HAIRVLGWGEENGEKYWLCGNSWNTDWGNNGFF 325
Query: 323 KILRGKYECAFEYLIAAGKP 342
KI RG EC E + G P
Sbjct: 326 KIKRGVNECGIESEMVGGIP 345
>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
Length = 309
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 106/294 (36%), Positives = 153/294 (52%), Gaps = 13/294 (4%)
Query: 56 IADAKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
+ DA+ R P R+ P + + +P FD+R++WP C +I + D CA
Sbjct: 24 VDDARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCA 83
Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
+ +AVGA SDR CI+S G+Q+ LS + SCCK C C G +W++
Sbjct: 84 SSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKNC----GSGCDGGVTGYSWDYWV 139
Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
G VTGG + TGC+P C H +C ++ +C C Y +
Sbjct: 140 SHGIVTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYE 197
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
QDKH +Y V E I+K+I+ HG A +Y+DF +YKSG+Y++T+ + H
Sbjct: 198 QDKHYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYEDFLNYKSGIYRYTTGQFISG--H 255
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
+ +LIGWG ENGT YWL NTW WG++G +I+RG+ EC E IAAG K+
Sbjct: 256 AVRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 309
>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
Length = 342
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 105/320 (32%), Positives = 161/320 (50%), Gaps = 13/320 (4%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
+++ + + WTAG +S ++ L+ D + +D T+ PE S
Sbjct: 34 GMFEELIPKNSFWTAGI---PKVSRSFMLSTLVKDPEIIGFNDL-----GPTFSPENSDL 85
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
P FDARE+WP C +I + D C + FAA + SDR CI S G + LS + +
Sbjct: 86 SP-FFDARERWPECSSIPLINDISECKSSWAFAAAESMSDRLCINSGGMIDTILSAQELL 144
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC + C+ G+ + W + K G TGG Y + GC+P +I+PC T
Sbjct: 145 SCCT-GVLSCGEGCAGGNPLKAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTY 203
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P C N +P C +C P Y +D+H + + + I+ +++ +GP AT
Sbjct: 204 PPCTNTTLPTPTCEKKC-KPGYPVDLDKDRHYGVSVDQLPNRQIEIQSDVMLNGPVEATM 262
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+YDDF Y +G+Y H + K + +L S +++GWG G PYWL+ N+WG WG+ GT +
Sbjct: 263 EIYDDFLQYTTGIYVHLAGNK-QGHL-SVRILGWGMFEGVPYWLLANSWGKEWGENGTFR 320
Query: 324 ILRGKYECAFEYLIAAGKPK 343
+LRG EC E +G PK
Sbjct: 321 VLRGVNECGLEANCISGMPK 340
>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 105/262 (40%), Positives = 137/262 (52%), Gaps = 9/262 (3%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDAR WP+C +I + D +C + F AV A SDR CI S G N+ LS
Sbjct: 83 SKLIPKSFDARATWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSSGAFNKSLSAV 142
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCCK C C G W+F G VTGG + TGC+P C HH S
Sbjct: 143 DLLSCCKDC----GDGCDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQHH-SQ 197
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P C + P KC C P + +DK R +Y V +E AI KEIL +GP
Sbjct: 198 GHYPPCPRRIYPTPKCVKHCDTPKID--YQKDKTRANTSYNVHQSEVAIMKEILLNGPVE 255
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
ATF +++DF YKSG+Y H + H+ +++GWG ENG PYWL+ N+W WG++G
Sbjct: 256 ATFEVHEDFPEYKSGIYFHAWGGSVGG--HAIRILGWGEENGVPYWLIANSWNEDWGEKG 313
Query: 321 TVKILRGKYECAFEYLIAAGKP 342
++ LRG EC E AG P
Sbjct: 314 YLRFLRGHNECGIEEEATAGLP 335
>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
Length = 351
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 104/280 (37%), Positives = 142/280 (50%), Gaps = 12/280 (4%)
Query: 69 LPGDRKTYD---PEYS-ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
+P + + ++ PE A VPD FD+R WPNC +I + D +C + +A SDR
Sbjct: 78 IPEEYRVFEMTHPEVEDAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDR 137
Query: 125 RCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
CI S + +S + + +CC +C C+ G W K+G VTGG Y D+
Sbjct: 138 ICIASNAKTILSISADDINACCGMVC----GNGCNGGYPIEAWRHYVKKGYVTGGSYQDK 193
Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
TGC+P PC HH + C + P KC C Y + QD H Y V
Sbjct: 194 TGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSC-QAGYALTYQQDLHFGQSAYAVS 252
Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
I+KEI+ HGP F +Y+DF HY GVY HT+ A L H+ K++GWG +NGT
Sbjct: 253 KKAAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGG--HAVKMLGWGVDNGT 310
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
PYWL N+W WG+ G +I+RG EC E + G PK
Sbjct: 311 PYWLCANSWNEDWGENGYFRIIRGVNECGIEGGVVGGIPK 350
>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 348
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 114/318 (35%), Positives = 160/318 (50%), Gaps = 13/318 (4%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
A++D IN++ + + A + A EE++R I D K+ ++ P + + E
Sbjct: 39 AFVDYINQQQSFFRAEYSPDA---EEFVRN-RIMDVKFAVDPEKTEP-NYVLANTEMKVD 93
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FDAR++WPNC ++ H+ D +C + AA A SDR C + G+ NR LS V
Sbjct: 94 IPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVAAASAMSDRVCALTNGRINRILSDTEVL 153
Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C + C G R + + + G TGG YG++ CQP PC +H P
Sbjct: 154 SCCFGSCGF----GCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGNHAHEPY 209
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
C ++ P C C Y F +DK TY++ NE IK EI+ GP AT
Sbjct: 210 YGPCPDELWPTPTCRRTC-QLGYPIPFEKDKIFNDQTYYIFGNETEIKYEIMTRGPVVAT 268
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
+ +Y DF +YK GVY H LH+ K+IGWG N PYWLV N+W WGD G
Sbjct: 269 YKVYRDFDYYKKGVYIHREGEVTG--LHAVKIIGWGKGNDVPYWLVANSWNTDWGDNGYF 326
Query: 323 KILRGKYECAFEYLIAAG 340
+I+RG C E + G
Sbjct: 327 RIVRGTDNCEIERQMVGG 344
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
Length = 337
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 165/324 (50%), Gaps = 14/324 (4%)
Query: 21 FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
FSD I IN E+ +W A + N ++ + + + D++ + R+T
Sbjct: 26 FSDELIHYINEESGASWKAAPSTRFNNIDQVKQNLGVLEETPEDRNTQ-----RQTVRYS 80
Query: 80 YSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
S +P+ FDAR++W NC +I + D +C++ ++ A +DR CI S GQ+ LS
Sbjct: 81 VSENDLPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLS 140
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCC C Y C+ G +W++ + G VTGG + TGC P CSH
Sbjct: 141 AIDIVSCCAYCGY----GCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGV 196
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P LP C P KC +C + Y + + QDK + +Y V E I EI+ +GP
Sbjct: 197 VTPGLPPCPRDIYPTPKCEKKC-HAGYNKTYEQDKVKGKSSYNVGGQETDIMMEIMKNGP 255
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +++DF YKSG+Y +T+ + H+ ++IGWG ENG YWL+ N+W WG+
Sbjct: 256 VDGIFYMFEDFLVYKSGIYHYTTGRLVGG--HAIRVIGWGVENGVKYWLIANSWNEGWGE 313
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
+G ++ RG EC E I AG P
Sbjct: 314 KGYFRMRRGNNECGIEARINAGLP 337
>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 323
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 103/271 (38%), Positives = 142/271 (52%), Gaps = 9/271 (3%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
RKT D Y +P FDAR+ + +C IG V D G CA+ A F+DR CI S G
Sbjct: 53 RKTADINYKTDIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNG 112
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
+ LS + + SC D+ C GS ++ W F +G VTGG Y GCQP
Sbjct: 113 KFTDNLSAQNLMSCGD----DEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKN 168
Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAIK 250
PC H+G + ++ + C +C N Y + D ++T++ Y N I+
Sbjct: 169 RPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSWTNVKQIQ 228
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVI 309
+EI+ +GP TA +Y++F YK GVYK T+ +L Y H KLIGWG E G YWL +
Sbjct: 229 QEIMTYGPVTAFMYVYENFMGYKEGVYKSTA-GELIGY-HHVKLIGWGVDEAGIEYWLAM 286
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
N+W +WG+ G KILRG C+ E L+ AG
Sbjct: 287 NSWNSNWGNDGLFKILRGYNFCSIELLVMAG 317
>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
Length = 335
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 161/328 (49%), Gaps = 24/328 (7%)
Query: 21 FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
FSD I +N E+ +W A R+ N E++ + + P +R T P
Sbjct: 26 FSDELIRYVNEESGASWKAARSTRFNNIEQFKKHLGALEET---------PEERNTRRPT 76
Query: 79 -EYSAT---VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
YS + +P+ FDARE+WPNC +I +PD +C++ A +DR CI S G++
Sbjct: 77 VRYSVSENDLPESFDAREKWPNCSSISEIPDQSSCSSCWAVGTASAMTDRICIHSNGEKK 136
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCC C Y C G W++ + G V+GG + TGC P C
Sbjct: 137 PRLSAVDLVSCCPYCGY----GCEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPYPFPKC 192
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
SH P L C + KC +C Y + +DK + +Y V D E I EI+
Sbjct: 193 SHLEETPGLAPCPRELYATPKCEKQC-QAGYSKTSEEDKIKGKSSYNVGDRETDIMMEII 251
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP + + +++DF YKSG+Y++TS + + + +IGWG ENG YWL N+W
Sbjct: 252 TNGPVSTIYYIFEDFTVYKSGIYQYTSGSLMGGH----GIIGWGVENGVKYWLAANSWNE 307
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G +I RG EC E I AG P
Sbjct: 308 GWGENGYFRIRRGTNECGIESRINAGLP 335
>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
Length = 333
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 125/349 (35%), Positives = 175/349 (50%), Gaps = 25/349 (7%)
Query: 1 MIHILVFLLGCTLVR----GELYKFSDAYIDQINREANT-WTAGRNF-PANLSE-EYLRQ 53
++ ILV + G V + SDA I IN ANT W AGRNF PA + L
Sbjct: 3 IMRILVAICGLLAVALATPFHIEPLSDAEIFYINHVANTTWKAGRNFHPAEIKRARALLG 62
Query: 54 FLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
+A+ K +++ + K P +PD FD R +WP+C ++ + D C +
Sbjct: 63 VNMAENKAYNR----IHLKYKQVQPRND--LPDNFDPRTKWPDCASLNEIRDQANCGSCW 116
Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
F + A +DR CI KG N +S E + CCK C C+ G W + G
Sbjct: 117 AFGSAEAMTDRICIAGKG--NIHISAEDINDCCKSC----GMGCNGGYPAAAWEWYVDTG 170
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
V+GG YG GC P ++ C HH + P VP KC +C Y + + DK
Sbjct: 171 VVSGGQYGTNEGCMPYSLPHCDHHTTGKYQPC--PAVVPTPKCEKKCLT-GYPKSYSNDK 227
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
R +Y V + +I +E++ +GP TA F +Y DF YK+GVY+HT+ + H+ K
Sbjct: 228 TRGKKSYGVRGVQ-SIMQELVDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGG--HAVK 284
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+IG+GTE+G YWLV N+W WGD+G KI +GK EC E I AG P
Sbjct: 285 IIGYGTESGQDYWLVANSWNEDWGDKGFFKIAKGKDECGIESSIVAGDP 333
>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 107/318 (33%), Positives = 162/318 (50%), Gaps = 17/318 (5%)
Query: 27 DQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPD 86
+++N TW AG N +++ + + G + + +P
Sbjct: 42 EKVNNSNTTWKAGENI------KWINSDIAGVKAHMGTLLNQKSGVKLEKVNRQANNLPS 95
Query: 87 RFDAREQWPN-CGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASC 145
FD+R QW + C ++ V D C + F A + SDR CI GQ R LST+ + +C
Sbjct: 96 EFDSRVQWGDKCSSLWEVRDQSNCGSCWAFGAAESLSDRHCIH-LGQDIR-LSTQNLVTC 153
Query: 146 CKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPS 205
C C + C G ++ G VTG YG+ + CQ +++PC+HH ++ P
Sbjct: 154 CDECGF----GCDGGWPEAAMDYYVNNGLVTGDLYGNNSWCQAYSLAPCAHHVTSDVYPP 209
Query: 206 CENQKVPKLKCHTRC-TNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
C + +P C C +N TY + +D H+ + Y +D NE AI EI +GP F
Sbjct: 210 CTGE-LPTPPCVKSCDSNSTYTIPYPKDLHKGSKAYSIDQNEQAIMTEIQTNGPIEVAFT 268
Query: 265 LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKI 324
+Y+DF YKSGVY+H + ++L H+ K++GWG ENGTPYW+++N+W WGD+GT KI
Sbjct: 269 VYEDFLTYKSGVYQHVTGSELGG--HAVKMVGWGVENGTPYWIIVNSWNESWGDKGTFKI 326
Query: 325 LRGKYECAFEYLIAAGKP 342
LRG+ EC E P
Sbjct: 327 LRGQNECGIESECVTALP 344
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
Length = 337
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 165/324 (50%), Gaps = 14/324 (4%)
Query: 21 FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
FSD I IN E+ +W A + N ++ + + + D++ + R+T
Sbjct: 26 FSDELIHYINEESGASWKAAPSTRFNNIDQVKQNLGVLEETPEDRNTQ-----RQTVRYS 80
Query: 80 YSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
S +P+ FDAR++W NC +I + D +C++ ++ A +DR CI S GQ+ LS
Sbjct: 81 VSENDLPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLS 140
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCC C Y C+ G +W++ + G VTGG + TGC P CSH
Sbjct: 141 AIDIVSCCAYCGY----GCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGV 196
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P LP C P KC +C + Y + + QDK + +Y V + E EI+ +GP
Sbjct: 197 VTPGLPPCPRDIYPTPKCEKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDFMMEIMKNGP 255
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +++DF YKSG+Y +T+ + H+ ++IGWG ENG YWL+ N+W WG+
Sbjct: 256 VDGIFYMFEDFLVYKSGIYHYTTGRLVGG--HAIRVIGWGVENGVKYWLIANSWNEGWGE 313
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
+G ++ RG EC E I AG P
Sbjct: 314 KGYFRMRRGNNECGIEARINAGLP 337
>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
Length = 339
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 120/345 (34%), Positives = 174/345 (50%), Gaps = 23/345 (6%)
Query: 5 LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
L+ L C LV + + SD ++ IN++ +TW AG NF N+ YL++
Sbjct: 4 LLASLCCLLVLTSAWSKPYFHPLSDELVNFINKQNSTWQAGHNF-RNVDMSYLKRLC--- 59
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ P R + + + +P FDAREQW +C TI + D G+C + F AV
Sbjct: 60 GSFLGGPKLP---QRVKFAKDMN--LPKSFDAREQWSHCPTIKEIRDQGSCGSCWAFGAV 114
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
+ SDR CI + G + +S E + +CC D WNF ++G V+GG
Sbjct: 115 ESISDRICIHTNGHVSVEVSAEDLLTCCGGQCGDGCNGGYPA---EAWNFWTRKGLVSGG 171
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y GC+P +I PC HH + + P+C + KC C P Y + +DKH
Sbjct: 172 LYESHVGCRPYSIPPCEHHVNG-SRPACTGEG-DTPKCSKTC-EPGYSPTYKEDKHFGYT 228
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y + NE I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG
Sbjct: 229 SYSLPTNEWEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDMMGG--HAIRILGWG 286
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
ENG PYWLV N+W WGD G +ILRG+ C E + AG P+
Sbjct: 287 EENGVPYWLVANSWNTDWGDGGFFRILRGQDHCGIESEVVAGIPR 331
>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
Length = 351
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 120/342 (35%), Positives = 166/342 (48%), Gaps = 31/342 (9%)
Query: 15 RGELYKFSDAYIDQINREANTWTAG-----RNFPANLSEEYLRQFLIADAKYFDQSDRPL 69
R + SD ++ +N+ TW G NF N+ YL++ + P
Sbjct: 20 RPSFHPLSDELVNYVNKRNTTWQVGCGAASYNF-YNVDVSYLKRLC---GTFLGG---PK 72
Query: 70 PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHV---PDTGAC----AAPHIFAAVGAFS 122
P R T+ + + +P+ F AREQWP C TI P G + F AV A S
Sbjct: 73 PPQRVTFTEDLN--LPESFYAREQWPQCPTIXXXRAQPGRGGLTRWGSFLQAFGAVEAIS 130
Query: 123 DRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
DR CI + + +S E + +CC +C C+ G WNF ++G V+GG Y
Sbjct: 131 DRICIHTNAHISVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYD 186
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
GC+P +I PC HH + P PK C C P Y + QDKH +Y
Sbjct: 187 SHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYS 243
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
V ++E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG EN
Sbjct: 244 VSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEMMGG--HAIRILGWGVEN 301
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
GTPYWLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 302 GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 343
>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
Length = 332
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 110/334 (32%), Positives = 164/334 (49%), Gaps = 22/334 (6%)
Query: 18 LYKFSDAY----IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG-- 71
+ SD++ ID +N + WTAG P E L+ + D L G
Sbjct: 11 FFAISDSFDPLIIDYVNSQNTLWTAG--IPKIPRESMLKTLV---------KDPHLAGFR 59
Query: 72 DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
D P ++ + FDARE+WP C +I + D C + FAA + SDR CI S G
Sbjct: 60 DHGPSVPTENSDLSQFFDARERWPECTSIPQINDISECKSSWAFAAAESMSDRLCINSGG 119
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
N LS + + SCC + C G+ F+ W + K G TGG Y + GC+P +I
Sbjct: 120 MINTILSAQELLSCCTGV-LSCGEGCGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSI 178
Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT-YGRGFFQDKHRTTLTY-WVDDNEDAI 249
+PC T P+C N +P C +CT+ Y +D+H + + + + I
Sbjct: 179 APCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASSVDQLPNRQIEI 238
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
+ +++ +GP TF +YDDF Y +G+Y H + K + +L S +++GWG G PYWL+
Sbjct: 239 QSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNK-QGHL-SVRILGWGMYEGVPYWLLA 296
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
N+WG WG+ GT + LRG EC E + PK
Sbjct: 297 NSWGKEWGENGTFRALRGTNECGLEANCVSAMPK 330
>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 337
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 117/348 (33%), Positives = 172/348 (49%), Gaps = 23/348 (6%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+I V L+ L + Y +ID IN++A TW AG N N +E++ + L ++
Sbjct: 5 IILASVILISVYLTE-QAYFLEKDFIDNINKQATTWKAGVNSAPNTPKEHILRLL--GSR 61
Query: 61 YFDQSDRPLPGDRKTYD-PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
D+ K D + +P +FDAR++W C TIG V D G C + +
Sbjct: 62 GVQIPDKVNYNMYKNDDHADNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSS 121
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
AF+DR C+ + G N+ LS E + CC C C+ G R W G VTGG+
Sbjct: 122 AFADRLCVATNGDFNQLLSAEEITFCCHKC----GNGCNGGYPIRAWKRFKNHGLVTGGN 177
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRT 236
Y GC+P + PC + +C Q ++ + +C+ YG F +D T
Sbjct: 178 YKSGEGCEPYRVPPCPYDKDGKN--TCSGQP---MESNHKCSKKCYGDEDIDFNKDHRYT 232
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKL 294
Y++ I+K+++ +GP +F +YDDF +YKSG+Y + NA +YL HS KL
Sbjct: 233 RDDYYL--TYRGIQKDVINYGPIETSFDVYDDFPNYKSGIYVKSENA---SYLGGHSVKL 287
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
IGWG E G YWL++N+W WGD+G KI RG EC + G P
Sbjct: 288 IGWGEEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECRVDNSTTGGVP 335
>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
sinensis]
gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/259 (38%), Positives = 136/259 (52%), Gaps = 9/259 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR +WP+C +I + D C + F AV A SDR CI S G N+ LS +
Sbjct: 86 LPKNFDARTKWPHCPSISEIRDQSGCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLL 145
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC+ C Y CS G W++ G VTGG D +GC+ C HH
Sbjct: 146 SCCENCGY----GCSGGYPAVAWDYWGAHGIVTGGSKEDPSGCRSYPFPKCEHHVQG-HY 200
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P C +Q P +C C P G + +DK R ++Y + +E I KEI+ GP A F
Sbjct: 201 PPCPHQYYPTPECVQHCDTP--GIDYVKDKTRANMSYNIYSSEILIMKEIMLRGPVEAVF 258
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+DF YK GVY H+ A L H+ +++GWG E PYWL+ N+W WG++G +K
Sbjct: 259 TVYEDFLQYKFGVYFHSWGAPLSE--HAIRILGWGEEGDVPYWLIANSWNEDWGEKGYMK 316
Query: 324 ILRGKYECAFEYLIAAGKP 342
LRG EC E + AG P
Sbjct: 317 FLRGLNECGIEDDVTAGLP 335
>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 344
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 122/351 (34%), Positives = 171/351 (48%), Gaps = 44/351 (12%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK----YFDQSDRPLPGDRKTYDPE 79
+ +D++N + N WTA + +E + DAK + + L ++K Y E
Sbjct: 3 SLVDEVNSKQNLWTASTD------QERFYGRSLGDAKKLCGTLPEETKGL--EKKVYPTE 54
Query: 80 YSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
A +P FDAR+ + C IGHV D AC + A V AF+ R CIKS G+ N+ LS
Sbjct: 55 ELADIPSSFDARDAFKECKDVIGHVWDQSACGSCWAIAPVEAFNARLCIKSGGKFNQLLS 114
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR------TGCQPSTIS 192
+ +CC ++ C G W+FL G VTGGD+ + GC P +
Sbjct: 115 AGEMLACCNSVHSCNSHGCQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFP 174
Query: 193 PCSHHGSAPTLPSCENQKVPKL--------------------KCHTRCTNPTYGRGFFQD 232
C+H C +VP L C RC N YG +D
Sbjct: 175 KCAHDQEDSKYEPCPEVRVPPLGERHQRGAGASIHQKLYDTPSCLDRCPNEKYGTPRDKD 234
Query: 233 KHRTTLTY-WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
+H T ++ + D IKKEI+ +GPT+A+F+ Y+DF YKSGVYKHTS L + HS
Sbjct: 235 RHFTARALPYLFEGTDNIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGD--HS 292
Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
++IGWGTE G YWLV+N+W WGD GT KI +G +C + + P
Sbjct: 293 VEIIGWGTEKGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLP 341
>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 324
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 126/351 (35%), Positives = 163/351 (46%), Gaps = 40/351 (11%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+ + + LL C L E K S + Q N E NT A N N ++E + L+ K
Sbjct: 5 LFLMSIMLLSCYLT--EQAKLSRDNMIQTNIETNTLKALDNIDLNSAKE---EHLMLLGK 59
Query: 61 YFDQSDRPLPGDRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ KT DP Y A + FDAR+ W C TIG V + G +A
Sbjct: 60 RGVAATFKSKLLYKTRDPRYVAYGKISKEFDARKHWSQCKTIGEVYNDGNSDLSWAYATT 119
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKI---CRYDDNKSCSHGSVFRTWNFLHKRGSV 175
GAF+DR C+ + G N+ LSTE + SC I DD + W F K+G V
Sbjct: 120 GAFADRMCVATNGSYNQLLSTEQLISCSGIKSNAMADD----------QAWKFFKKQGLV 169
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH- 234
+GG Y GCQPS I P + +PK + C N YG H
Sbjct: 170 SGGKYNTNDGCQPSKIPPIFN--------------LPKKIYNRTCDNFCYGNSLIDYNHD 215
Query: 235 --RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
+ + TY V I++E+ +GP +A F+LYDD + Y SGVY T +K Y S
Sbjct: 216 HVKVSYTYHVLYKN--IQREVQTYGPVSAYFSLYDDLFLYTSGVYARTEKSKFVRY-QSA 272
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
KLIGWG ENG YWL++N+WG WG G KI RG EC F AG PK
Sbjct: 273 KLIGWGVENGVDYWLLVNSWGNEWGQNGLFKIKRGTDECQFGRHTYAGVPK 323
>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
Length = 569
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 104/267 (38%), Positives = 147/267 (55%), Gaps = 16/267 (5%)
Query: 84 VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
VP FDAR +P C +GHV D G C + FA+ AF+DR CI+S+G++ PLS ++
Sbjct: 274 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHT 333
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHGS 199
SCC + + C+ G W + ++G VTGGD+ G T C P + C+HH
Sbjct: 334 TSCCNAI-HCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAK 392
Query: 200 APTLPSCENQKVPKL--KCHTRCTNPTYGRG---FFQDKHRTTLTYWVDDNEDAIKKEIL 254
AP P C+ VP+ KC C Y F QD H+ T Y + +D +K++++
Sbjct: 393 AP-FPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDD-VKRDMM 450
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP + F +Y+DF YKSGVYKH S + H+ K+IGWGTENG YW +N+W
Sbjct: 451 THGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGG--HAIKIIGWGTENGEEYWHAVNSWNT 508
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGK 341
+WGD G KI G +C + + AG+
Sbjct: 509 YWGDGGQFKIAMG--QCGIDGEMVAGE 533
>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
Length = 341
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 155/310 (50%), Gaps = 23/310 (7%)
Query: 32 EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAR 91
E AG NF L D + +Q+ +P+ D+ + +P+ FDAR
Sbjct: 52 EVEATPAGHNFDRKL----------MDLSFINQNRKPVFDDKN----DKGEDIPESFDAR 97
Query: 92 EQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICR 150
+WP C ++ H+ D C + + A SDR CI S G++ +S + SCC C
Sbjct: 98 TKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQCG 157
Query: 151 YDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQK 210
Y C+ G + +N+ K+G+VTGGDY +GC+P PC HHG C N+
Sbjct: 158 Y----GCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEA 213
Query: 211 VPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFY 270
KC +C + +D+ Y V ++E AI++EI+ +GP F +Y+DF
Sbjct: 214 TTP-KCVRKCQKSYKKS-YKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFS 271
Query: 271 HYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYE 330
+YK G+YKHT+ H+ K+IGWG E G PYWL+ N+W WG+ G +ILRG
Sbjct: 272 YYKKGIYKHTAGKARGG--HAIKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILRGSNH 329
Query: 331 CAFEYLIAAG 340
C E + AG
Sbjct: 330 CGIEENVVAG 339
>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
Length = 324
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 114/322 (35%), Positives = 164/322 (50%), Gaps = 31/322 (9%)
Query: 22 SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS-DRPLPGDRKTYDPEY 80
++A+I IN +A TWTA +NF E+ +AD ++ + LP E
Sbjct: 28 TEAFIQSINEKATTWTARKNFEGRTPEQLK---ALADVIGINRDPNVTLP----VVFHEA 80
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ +PD FDAREQWP C +I + D GAC + FAAV SDR C+ S+G++ S E
Sbjct: 81 ISGIPDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAE 140
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
V SCC C C G + + + G +GGDYG + GC+P T + G
Sbjct: 141 EVVSCCTAC----GGGCRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKPYT---AAVSGET 193
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P +C C + Y + + +D T Y V+ I++EIL +GP T
Sbjct: 194 P-------------QCQKACVS-GYEKSWEKDLRHATSAYQVNGGVLQIQREILDNGPVT 239
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A +Y+DFY Y +G+Y+HTS + + H+ K+IGWG+EN PYW+ N+WG +G+ G
Sbjct: 240 AYMEVYEDFYSYGTGIYQHTSGSFVGG--HAVKIIGWGSENDVPYWIAANSWGTGFGEDG 297
Query: 321 TVKILRGKYECAFEYLIAAGKP 342
+ILRG E I AG P
Sbjct: 298 FFRILRGSNCAGIESYIVAGYP 319
>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
Length = 569
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 104/267 (38%), Positives = 147/267 (55%), Gaps = 16/267 (5%)
Query: 84 VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
VP FDAR +P C +GHV D G C + FA+ AF+DR CI+S+G++ PLS ++
Sbjct: 274 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHT 333
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHGS 199
SCC + + C+ G W + ++G VTGGD+ G T C P + C+HH
Sbjct: 334 TSCCNAI-HCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAK 392
Query: 200 APTLPSCENQKVPKL--KCHTRCTNPTYGRG---FFQDKHRTTLTYWVDDNEDAIKKEIL 254
AP P C+ VP+ KC C Y F QD H+ T Y + +D +K++++
Sbjct: 393 AP-FPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDD-VKRDMM 450
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP + F +Y+DF YKSGVYKH S + H+ K+IGWGTENG YW +N+W
Sbjct: 451 THGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGG--HAIKIIGWGTENGEEYWHAVNSWNT 508
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGK 341
+WGD G KI G +C + + AG+
Sbjct: 509 YWGDGGQFKIAMG--QCGIDGEMVAGE 533
>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
Length = 366
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 114/320 (35%), Positives = 159/320 (49%), Gaps = 23/320 (7%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
A +D +N + +T +SEE ++ ++ D KY + R T T
Sbjct: 69 ALVDYVNSAQSLFTTEH---VEVSEEVMKSRVM-DVKYAAAHSDEI---RATEVDTVLDT 121
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FD+R W C +I + D C + F A SDR CI++KG Q +S + +
Sbjct: 122 IPASFDSRTHWSECKSIKLIRDQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 181
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C G + + +G VTGGDY GC+P I+PC T
Sbjct: 182 SCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------TS 230
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
+C K P C C + Y + +DKH T Y V +I+ EI+ +GP A F
Sbjct: 231 GNCPESKTPS--CSLSCQSG-YTTAYAKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAF 287
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+DFY YKSGVYKHT+ L H+ K+IGWGTE+G+PYWLV N+WG WG+ G +
Sbjct: 288 TVYEDFYKYKSGVYKHTAGKALGG--HAIKIIGWGTESGSPYWLVANSWGNSWGESGFFR 345
Query: 324 ILRGKYECAFEYLIAAGKPK 343
I RG +C E + AGK K
Sbjct: 346 IFRGDDQCGIESAVVAGKAK 365
>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 393
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 116/326 (35%), Positives = 161/326 (49%), Gaps = 20/326 (6%)
Query: 23 DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD---RKTYDPE 79
D+ D +N+ TW A +E + + D K + P + +
Sbjct: 69 DSLADALNQGQKTWVASSK------QERFKGASVFDVKALCGTILNGPSKLPKKPASEST 122
Query: 80 YSATVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR-PL 137
+ +PDRFDARE + NC T IGHV D C + FA AFSDR CI+S G+ + PL
Sbjct: 123 ALSNLPDRFDAREHFKNCATVIGHVRDQSTCGSCWAFATSEAFSDRLCIRSSGEFDLVPL 182
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S + A+CC + C G W + + G V+ D +GC P CSHH
Sbjct: 183 SAGHTAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVSELD----SGCWPYNFPECSHH 238
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+ C+ P C T C N + F D+H T + D D IKKEI+ +G
Sbjct: 239 VETKGMEPCKGNS-PSPVCSTTCRNHHFKPSFESDRHFTEDEGYSLDEVDEIKKEIIDNG 297
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P A F +Y+DF +YKSGVYKH + ++L H+ K+IGWGT+ YWLV+N+W +WG
Sbjct: 298 PVAAAFTVYEDFLYYKSGVYKHVNGSELGG--HAVKIIGWGTDQNEQYWLVMNSWNVNWG 355
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
D+G KI G EC + + AG PK
Sbjct: 356 DQGIFKIAIG--ECGIDSEVTAGIPK 379
>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
Length = 396
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 119/335 (35%), Positives = 168/335 (50%), Gaps = 42/335 (12%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS--DRPLPGDRKTYDPEYS 81
+ +D+IN + N W A ++ +E + ++DAK + ++P K Y +
Sbjct: 83 SLVDEINSKQNAWMA------SIEQERFKGASMSDAKRLCGTWLEKPENIREKLYTADEL 136
Query: 82 ATVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P F+A E++ C + IGH+ D AC + FA AF+DR CIKS G LS
Sbjct: 137 KDLPVSFNATEEFKECSSVIGHIRDQSACGSCWAFAPTEAFNDRLCIKSAGNFTSLLSPG 196
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG------DRTGCQPSTISPC 194
VA+C K C GS W +LH G VTGGDY + GC P I PC
Sbjct: 197 NVAACSK------TSGCHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPYDIPPC 250
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNE-------D 247
+H+ ++ P C K C C N Y +D+H +V++ D
Sbjct: 251 AHYTNSTLYPKCPKTKYDFPTCQESCPNKKYDTPMEKDRH------FVEEESLSALRSID 304
Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWL 307
AIKKEI+ +GP +A++ +YDDF YKSGVYK TS+ L H+ K+IGWG + YWL
Sbjct: 305 AIKKEIMTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGG--HAVKIIGWGED----YWL 358
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
V+N+W +WGD G KI G +C E + AG P
Sbjct: 359 VVNSWNKNWGDNGMFKI--GCGQCGIEDNVLAGTP 391
>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
Length = 343
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 97/259 (37%), Positives = 134/259 (51%), Gaps = 9/259 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR+ WP+C +I + D +C + F AV A SDR CI S G N+ LS +
Sbjct: 86 LPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLL 145
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCCK C + C G W++ G VTGG D +GC+ C HH
Sbjct: 146 SCCKDCGF----GCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQG-HY 200
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P C + P +C +C P G+ +DK R ++Y + +E +I KEI+ GP A F
Sbjct: 201 PPCPRELYPTPECVQQCDTPDV--GYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIF 258
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+DF Y SGVY H A + H+ +++GWG PYWL+ N+W WG+ G +K
Sbjct: 259 TMYEDFLRYSSGVYFHALGAPMSG--HAVRILGWGELGNVPYWLIANSWNEDWGEEGYMK 316
Query: 324 ILRGKYECAFEYLIAAGKP 342
LRG EC E + AG P
Sbjct: 317 FLRGYNECGIEDDVTAGLP 335
>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
Length = 572
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 104/267 (38%), Positives = 146/267 (54%), Gaps = 16/267 (5%)
Query: 84 VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
VP FDAR +P C +GHV D G C + FA+ AF+DR CI+S+G+ PLS ++
Sbjct: 277 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKGLMPLSAQHT 336
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHGS 199
SCC + + C+ G W + ++G VTGGD+ G T C P + C+HH
Sbjct: 337 TSCCNAI-HCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAK 395
Query: 200 APTLPSCENQKVPKL--KCHTRCTNPTYGRG---FFQDKHRTTLTYWVDDNEDAIKKEIL 254
AP P C+ VP+ KC C Y F QD H+ T Y + +D +K++++
Sbjct: 396 AP-FPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDD-VKRDMM 453
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
HGP + F +Y+DF YKSGVYKH S + H+ K+IGWGTENG YW +N+W
Sbjct: 454 THGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGG--HAIKIIGWGTENGEEYWHAVNSWNT 511
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGK 341
+WGD G KI G +C + + AG+
Sbjct: 512 YWGDGGQFKIAMG--QCGIDGEMVAGE 536
>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
Length = 287
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 95/267 (35%), Positives = 141/267 (52%), Gaps = 4/267 (1%)
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
P ++ + FDARE+WP C +I + D C + FAA + SDR CI S G N L
Sbjct: 22 PTENSDLSQFFDARERWPECMSIPQINDISECKSSWAFAAAESMSDRLCINSGGTINTIL 81
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S + + SCC + C G+ F+ W + K G TGG Y + GC+P +I+PC
Sbjct: 82 SAQELLSCCTGV-LSCGEGCGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSIAPCGKT 140
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPT-YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
T P+C N +P C +CT+ Y +D+H + + + I+ +++ +
Sbjct: 141 VGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASVDQLPNRQIEIQSDVMLN 200
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
GP TF +YDDF Y +G+Y H + K + +L S +++GWG G PYWL+ N+WG W
Sbjct: 201 GPIETTFEVYDDFLQYTTGIYVHLTGNK-QGHL-SVRILGWGMYEGVPYWLLANSWGKEW 258
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPK 343
G+ GT + LRG EC E +G PK
Sbjct: 259 GENGTFRALRGTNECGLEANCVSGMPK 285
>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
Length = 330
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 103/261 (39%), Positives = 137/261 (52%), Gaps = 16/261 (6%)
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
T+P FD+R W C +I + + C + F A SDR CI++KG Q +S + +
Sbjct: 85 TIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDL 144
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C G + + +G VTGGDY GC+P I+PC T
Sbjct: 145 LSCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------T 193
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
SC K P C C P Y + +DKH T Y V +I+ EI+ +GP A
Sbjct: 194 SGSCPESKTPA--CSLSC-QPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAA 250
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+DFY YKSGVYKHT+ L H+ K+IGWGTE+G+PYWLV N+WG WG+ G
Sbjct: 251 FTVYEDFYKYKSGVYKHTAGKALGG--HAIKIIGWGTESGSPYWLVANSWGTSWGESGFF 308
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KI RG +C E + AGK +
Sbjct: 309 KIFRGDDQCGIESAVVAGKAR 329
>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
Length = 346
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 158/300 (52%), Gaps = 13/300 (4%)
Query: 42 FPANLS-EEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI 100
F A+++ Y Q + D ++ +Q+ +P+ D + +P+ FDAR +WPNC +I
Sbjct: 55 FEADVTPHSYNVQHKLMDLRFVNQNRKPVVEDAS----DKGDDIPESFDARTKWPNCTSI 110
Query: 101 GHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHG 160
H+ D C + + SDR CI SK ++ +S+ SCC C + C G
Sbjct: 111 KHIRDQANCGSCWAVSTASVLSDRICIASKQKKQVHISSIDFVSCCDSCGF----GCEGG 166
Query: 161 SVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC 220
+ + +G VTGGDYG +TGC+P PC HHG+ C ++ +C +C
Sbjct: 167 WPIDAFEYYSYQGVVTGGDYGSKTGCRPYPFHPCGHHGNETYYGECPKEESTP-ECVKQC 225
Query: 221 TNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT 280
Y + +DK Y V+++ AI++EI+ GP ++F +YDDF +Y G+YKHT
Sbjct: 226 -QKGYKNSYRRDKTWGEDYYEVENSVKAIQREIMRSGPVVSSFTVYDDFSYYVKGIYKHT 284
Query: 281 SNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+ + H+ K+IGWGTE PYW++ N+W WG++G +++RG C E + AG
Sbjct: 285 AGKARGS--HAIKIIGWGTEKNVPYWIIANSWHNDWGEKGFFRMVRGTNHCGIEEDVVAG 342
>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 118/336 (35%), Positives = 164/336 (48%), Gaps = 28/336 (8%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD-- 58
M+ + ++ R F A+++ I TWTA Y R +D
Sbjct: 1 MLQFICLIISLVSARN---PFITAFVNSIK---TTWTA---------TNYERWNEKSDGF 45
Query: 59 -AKYFDQ-SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
+KYF+ D P + K + E +P F A+E+WP C +I +PD G C + +
Sbjct: 46 YSKYFNVIVDHSEPVEYKYH--EKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVS 103
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSV 175
A SDR CI S R +S E + SCC I C D N C G + W +L G V
Sbjct: 104 AASTMSDRLCIASGQTDKRQISAEDLLSCCGINCELDGNGGCDGGYPYGAWKYLRVDGIV 163
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT---NPTYGRGFFQD 232
TGG Y D + C+P + PCSH + CEN + CT +P + R + D
Sbjct: 164 TGGTYNDFSLCKPYSFPPCSHGNDSGKYSKCENDFFMLTEVTPSCTKKCHPQFSRTYDVD 223
Query: 233 KHRTTLT-YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
K R+ Y + +++ IK EI +GP A F ++DDF +YKSGVY+ T+ + H+
Sbjct: 224 KIRSRENPYKLIKDQEQIKNEIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGK--HA 281
Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
K+IGWGTENG PYW IN+W WG G KILRG
Sbjct: 282 VKIIGWGTENGVPYWEAINSWNDGWGINGKFKILRG 317
>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 337
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/258 (39%), Positives = 133/258 (51%), Gaps = 8/258 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FD+REQW NC +I + D C + A+V A SDR CI++ G LS +
Sbjct: 84 LPDYFDSREQWKNCPSIKRIYDQSQCYSSWAMASVAAISDRICIQTNGTVKVELSAIELV 143
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C C+ G W + + G VTG G+ +GC P C H GS+ +
Sbjct: 144 SCCSKCAV----GCNFGYSESAWYYWVENGLVTGESNGNNSGCLPYPFPKCDH-GSSDSY 198
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P C C+ C P Y + DKH Y V NE I++EI+ +GP A+
Sbjct: 199 PMCGYVVYTPPVCNGTC-RPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYGPVEASI 257
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+YDDF YKSGVYKH + + + S ++IGWG ENG PYWL N+W WG G K
Sbjct: 258 FIYDDFVDYKSGVYKHLTGRLIT--IQSVRIIGWGIENGIPYWLCANSWNEEWGLNGFFK 315
Query: 324 ILRGKYECAFEYLIAAGK 341
ILRG EC E + AG+
Sbjct: 316 ILRGSNECEIEAFVNAGR 333
>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
Length = 315
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/265 (38%), Positives = 146/265 (55%), Gaps = 9/265 (3%)
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+++ +P FD+R++WPNC +IGH+ + G C + + AA A SDR CI+S G +N +S
Sbjct: 57 FTSGLPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSA 116
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SCC +C + C GS+F +W++ + G V+GGDY GCQP TI PC
Sbjct: 117 QQIISCCYLCGH----GCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNE 172
Query: 200 APTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P SC + C +C NP Y F D ++ + + K+I +GP
Sbjct: 173 KPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYKGK---YYKLSPYMAMKDIFDNGP 229
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENY-LHSGKLIGWGTENGTPYWLVINTWGPHWG 317
T F +Y D YKSGVY++ + + + +HS K+ GWG ENG PYWLV N++G WG
Sbjct: 230 ITTQFYMYRDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWGEENGVPYWLVANSFGTDWG 289
Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
GT KI RG C F+ + AG P
Sbjct: 290 YNGTFKISRGNDGCFFQEKMYAGLP 314
>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
Length = 463
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 110/286 (38%), Positives = 156/286 (54%), Gaps = 18/286 (6%)
Query: 65 SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSD 123
S PLP KT + VP FDAR +P C +GHV D G C + FA+ AF+D
Sbjct: 151 SGVPLPA--KTVFENANEPVPANFDARTAFPVCKDVVGHVRDQGDCGSCWAFASTEAFND 208
Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--- 180
R CI+S+G+ PLST++ SCC + + C+ G W + ++G VTGGD+
Sbjct: 209 RLCIRSQGKGVMPLSTQHTTSCCNAI-HCASFGCNGGQPGMAWRWFERKGVVTGGDFDTL 267
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL--KCHTRCTNPTYGR---GFFQDKHR 235
G T C P I C+HH AP P+C+ P+ KC C Y F +D H+
Sbjct: 268 GKGTTCWPYEIPFCAHHAKAP-FPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFDKDVHK 326
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
+ +Y + + DA+K++++AHG T F +Y+DF +YKSGVYKH L H+ K+I
Sbjct: 327 ASSSYSL-RSRDAVKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGG--HAIKII 383
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
GWGTE+G YW +N+W +WGD G KI G +C + + AG+
Sbjct: 384 GWGTEDGEEYWHAVNSWNTYWGDSGHFKIEMG--QCGVDNEMVAGE 427
>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
Length = 356
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/262 (38%), Positives = 144/262 (54%), Gaps = 11/262 (4%)
Query: 84 VPDRFDAREQW-PNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P FD+R+QW C ++ V D C + FAA + SDR CI + G+ R LSTE +
Sbjct: 97 LPKNFDSRKQWGSKCPSLNEVRDQSTCGSCWAFAAAESLSDRICIHT-GEDVR-LSTENL 154
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C C+ G + K G VTG +GD CQ + PC+HH ++
Sbjct: 155 VSCCSSC----GDGCNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAYSFPPCAHHVASTK 210
Query: 203 LPSCENQKVPKLKCHTRCTNPT-YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
P C+ + VP +C +C + + R + +D ++ +Y V + AI EI+ +GP
Sbjct: 211 YPPCKGE-VPTPECKKKCDDDSKVKRPYNEDLYKGQKSYSVSSDPKAIMTEIMNNGPVEV 269
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
F +Y+DF YKSGVY+H + +L H+ K+IGWG EN TPYWL++N+W WGD+GT
Sbjct: 270 AFTVYEDFVTYKSGVYQHVTGEQLGG--HAVKMIGWGVENDTPYWLIVNSWNETWGDQGT 327
Query: 322 VKILRGKYECAFEYLIAAGKPK 343
KILRG EC E + P+
Sbjct: 328 FKILRGSNECGIEDEVVTALPQ 349
>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
Length = 247
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 131/255 (51%), Gaps = 9/255 (3%)
Query: 89 DAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKI 148
D+REQWP+C +I + D G+C + F AV A SDR CI S G+ +S E + SCC
Sbjct: 1 DSREQWPDCPSISEIRDQGSCGSCWAFGAVEAMSDRHCIHSNGKVKIEVSPEDLLSCCSS 60
Query: 149 CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCEN 208
C C G W F +G TGG + GCQP I C HH + P +
Sbjct: 61 C----GMGCDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPYEIPACEHHTTGDRPPCSDI 116
Query: 209 QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDD 268
PK C C Y + DKH +Y ++ E I+ EI +GP F++Y D
Sbjct: 117 VDTPK--CVHLCEK-GYNTSYRDDKHFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSD 173
Query: 269 FYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
F +YKSGVY+H S L H+ +++GWG EN PYWL N+W WGD+G KILRG
Sbjct: 174 FINYKSGVYQHHSGESLGG--HAIRVLGWGYENDVPYWLCANSWNTDWGDKGYFKILRGS 231
Query: 329 YECAFEYLIAAGKPK 343
EC E I AG PK
Sbjct: 232 DECGIESSIVAGIPK 246
>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
Length = 347
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 115/330 (34%), Positives = 161/330 (48%), Gaps = 19/330 (5%)
Query: 21 FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFL--IADAKYFDQSDRPLPGDRKTYD 77
SD +D +N + + TW A ++ EE +R L + + + + RP
Sbjct: 26 LSDELVDYVNSQVDATWKAAKSERFKTLEE-IRSVLGTMREDQNVKEFRRPTISHE---- 80
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS-KGQQNRP 136
+ + +P FDARE WP C TI + D C + FAAV A SDR CI S + N
Sbjct: 81 -DITLELPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQ 139
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
LS + +CC C + W++ G VTGG+Y D C P PC H
Sbjct: 140 LSATDLLACCTTCGFGCVGG----WGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPPCRH 195
Query: 197 HGS-APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HG+ P C + +C + C Y + DK R + +Y + + AI+KEI
Sbjct: 196 HGAKGSEYPPCPEKMYSTPQCVSECQK-GYATKYEDDKIRASTSYNLYRSVTAIQKEIWM 254
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGP 314
GP AT +Y DF +Y GVYKHT+ L H+ +L+GWG E +GTPYWL N+W P
Sbjct: 255 RGPVEATMNVYTDFANYAGGVYKHTTGELLGG--HAIRLLGWGVEEDGTPYWLAANSWNP 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +ILRG C E ++AG P N
Sbjct: 313 SWGEKGFFRILRGSDHCGIESDVSAGLPVN 342
>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 508
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 102/299 (34%), Positives = 146/299 (48%), Gaps = 13/299 (4%)
Query: 36 WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
W +GR S++ + F ++ RP +D + +P FDAR+ WP
Sbjct: 42 WISGRRPKRFESDDLIHMFGAKRETREQKAQRPT----LRHDGFDNMRLPKNFDARKTWP 97
Query: 96 NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
+C +I + D +C + F AV A SDR CI S G N+ LS + SCCK C +
Sbjct: 98 HCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKDCGF---- 153
Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
C G W++ G VTGG D +GC+ C HH P C + P +
Sbjct: 154 GCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQG-HYPPCPRELYPTPE 212
Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
C +C P G+ +DK R ++Y + +E +I KEI+ GP A F +Y+DF Y SG
Sbjct: 213 CVQQCDTPDV--GYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSG 270
Query: 276 VYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
VY H A + H+ +++GWG PYWL+ N+W WG+ G +K LRG EC E
Sbjct: 271 VYFHALGAPMSG--HAVRILGWGELGNVPYWLIANSWNEDWGEEGYMKFLRGYNECGIE 327
>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
Length = 374
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 111/361 (30%), Positives = 174/361 (48%), Gaps = 45/361 (12%)
Query: 21 FSDA--YIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD-QSDRPLPGDRKTYD 77
FSD+ I+ +N + + WTAG +S++Y+ + L D + ++ P + +
Sbjct: 19 FSDSTKIINYVNSQKSLWTAGN---PKISKDYMLKTLTTDPETVGFRNLGPTFYSKNIFS 75
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
PE + + FDARE+WP C +I + D C + F+A + SDR CI S G N L
Sbjct: 76 PE-NLDDSNFFDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVL 134
Query: 138 STEYVASCCK---ICRYDDNK--------------------------------SCSHGSV 162
S + + SCC C D++ C+ G+V
Sbjct: 135 SAQELLSCCTGVFSCGEGDSEHWQFRNSKFRKPRCQKFNKEILEARRNLETREKCAGGNV 194
Query: 163 FRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTN 222
F+ W + K G TGG Y + GC+P +ISPC T P C N V C +C +
Sbjct: 195 FKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSCEKKCKS 254
Query: 223 PTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
Y +D+H + + + I+ +++ +GP +AT +YDDF Y +G+Y H +
Sbjct: 255 -GYPVELDKDRHYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGIYVHLTG 313
Query: 283 AKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
K + +L S +++GWG G PYWL+ N+WG WG+ GT ++LRG EC E +G P
Sbjct: 314 NK-QGHL-SVRILGWGMYEGVPYWLLANSWGKQWGENGTFRVLRGVNECGLEANCVSGMP 371
Query: 343 K 343
+
Sbjct: 372 R 372
>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
Length = 330
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 102/261 (39%), Positives = 137/261 (52%), Gaps = 16/261 (6%)
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
T+P FD+R W C +I + + C + F A SDR CI++KG Q +S + +
Sbjct: 85 TIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDL 144
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C G + + +G VTGGDY GC+P I+PC T
Sbjct: 145 LSCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------T 193
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
SC K P C C + Y + +DKH T Y V +I+ EI+ +GP A
Sbjct: 194 SGSCPESKTPA--CSLSCQS-GYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAA 250
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+DFY YKSGVYKHT+ L H+ K+IGWGTE+G+PYWLV N+WG WG+ G
Sbjct: 251 FTVYEDFYKYKSGVYKHTAGKALGG--HAIKIIGWGTESGSPYWLVANSWGTSWGESGFF 308
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KI RG +C E + AGK +
Sbjct: 309 KIFRGDDQCGIESAVVAGKAR 329
>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 347
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 114/330 (34%), Positives = 160/330 (48%), Gaps = 19/330 (5%)
Query: 21 FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFL--IADAKYFDQSDRPLPGDRKTYD 77
SD +D +N + + TW A ++ EE +R L + + + + RP
Sbjct: 26 LSDELVDYVNSQVDATWKAAKSERFKTLEE-IRSVLGTMREDQNVKEFRRPTISHE---- 80
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS-KGQQNRP 136
+ + +P FDARE WP C TI + D C + FAAV A SDR CI S + N
Sbjct: 81 -DITLELPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQ 139
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
LS + +CC C + W++ G VTGG+Y D C P PC H
Sbjct: 140 LSATDLLACCTTCGFGCVGG----WGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPPCRH 195
Query: 197 HGS-APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HG+ P C + +C + C Y + DK R + +Y + + I+KEI
Sbjct: 196 HGAKGSEYPPCPEKMYSTPQCVSECQK-GYATKYEDDKIRASTSYNLYRSVTTIQKEIWM 254
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGP 314
GP AT +Y DF +Y GVYKHT+ L H+ +L+GWG E +GTPYWL N+W P
Sbjct: 255 RGPVEATMNVYTDFANYAGGVYKHTTGELLGG--HAIRLLGWGVEEDGTPYWLAANSWNP 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +ILRG C E ++AG P N
Sbjct: 313 SWGEKGFFRILRGSDHCGIESDVSAGLPVN 342
>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 326
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 124/347 (35%), Positives = 170/347 (48%), Gaps = 34/347 (9%)
Query: 2 IHILVFLLGCTLVRGELYKFS-DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
I +LV ++ + E K S D ID+ + E NT AG N + +EE L
Sbjct: 4 ILLLVSIMLLSFCLTEQAKLSHDNTIDKSDVETNTLKAGENVGPHSAEEERLMLLGTRGV 63
Query: 61 YFDQSDRPLPGDRKTYDPEY--SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ L KT DP Y + FDAR++WP C TIG V + G +AA
Sbjct: 64 EAATKSKML---YKTRDPRYIIDNQIHKEFDARKRWPQCKTIGEVHNEGNELLSWAYAAT 120
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT--WNFLHKRGSVT 176
G F+DR CI + G N+ LSTE + SC I +D G V R W + G V+
Sbjct: 121 GVFADRMCIATNGNYNQLLSTEELISCSGIKERED------GYVNRVLVWEYFKTHGLVS 174
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDK 233
GG Y GCQPS + PT+ + + K+ K C C YG+ + D
Sbjct: 175 GGKYNTNEGCQPSKV---------PTVYNSQT-KIYKRTCVEYC----YGKDTINYNHDH 220
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
+ + Y++ + I+KE+ +GP + F L+DD + YKSGVY T +K + Y H K
Sbjct: 221 VKVSNHYFIRIKD--IQKEVQTYGPVSVFFDLHDDLFLYKSGVYAKTEKSKDKRY-HHAK 277
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
LIGWG ENG YWL++N+WG WG G KI RG EC+ E + AG
Sbjct: 278 LIGWGVENGVDYWLLVNSWGYEWGQNGLFKIKRGTDECSVESHVYAG 324
>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
Length = 375
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 113/325 (34%), Positives = 160/325 (49%), Gaps = 26/325 (8%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
A + IN +W A N ++SE+ ++ F + D ++ D + + +
Sbjct: 39 ALVAHINSMQTSWIAEHN---DISEDEMK-FKVMDQRFADPLEEEVQDEGLVRGEVVPEP 94
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FDAR+QWP+C ++ + + +C + F A SDR CI+S G Q +S E +
Sbjct: 95 LPDTFDARDQWPDCKSLKFIRNQASCGSCWAFGAAEVISDRVCIQSNGTQQPIISAEDIL 154
Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP- 201
SCC C K C G + G VTGGDY + GC P + PC
Sbjct: 155 SCCGSTC----GKGCQGGYTIEAMKYWMNSGVVTGGDY-NGAGCMPYSFPPCKKSPCVEF 209
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA---IKKEILAHGP 258
+ PSC K C + T Y DKH T Y + ++A I+ EI +GP
Sbjct: 210 STPSC------KTTCQEKYTTADYKN----DKHFATSAYKLSTTKNAVPTIQYEIYHNGP 259
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A++ +++DFY YKSGVY H S + H+ K+IGWGTENG YWLV N+WG +G+
Sbjct: 260 VEASYRVFEDFYQYKSGVYHHVSGNLVGG--HAVKIIGWGTENGVDYWLVANSWGTSFGE 317
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
+G KI RG EC E I AG K
Sbjct: 318 KGFFKIRRGTNECQIESNIVAGLAK 342
>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 98/262 (37%), Positives = 147/262 (56%), Gaps = 8/262 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S GQQ+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SC D C G + W++ KRG VTGG + TGCQP C H
Sbjct: 145 ALDLISC----CEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C C Y + QDKH +Y V NE AI+KEI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYKQDKHYGDESYNVISNEKAIQKEIMMYGP 258
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E G PYWL+ N+W WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316
Query: 319 RGTVKILRGKYECAFEYLIAAG 340
+G +++RG+ EC+ E + AG
Sbjct: 317 KGLFRMVRGRDECSIESHVVAG 338
>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
Length = 360
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 121/324 (37%), Positives = 162/324 (50%), Gaps = 35/324 (10%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
+ I+QIN + + WTAG N P + E L I + D + +P + +P+ +
Sbjct: 21 SLINQINSQQSAWTAGIN-PFDDIESRLGFLGI----HPDPNFKP-----EIKEPQATQN 70
Query: 84 V-PDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
V P+ FDARE WP C IG++ + G C++ FAA SDR CI + G+ LS E
Sbjct: 71 VIPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPED 130
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ CC C C G + WN+ G V+GGDY TGCQP S +++ P
Sbjct: 131 LIDCCHYC----GNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQP--YSELNYYRITP 184
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG-PTT 260
C+T C N Y + DKH Y++ NE AI+ EIL+ G P
Sbjct: 185 -------------PCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGGGPVV 231
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A F +Y DF Y+ GVY +TS A + K+IGWGTENG YWL N+WG WG G
Sbjct: 232 AAFDVYGDFKIYRDGVYIYTSGALFGR--TAVKIIGWGTENGWAYWLAANSWGKDWGALG 289
Query: 321 T-VKILRGKYECAFEYLIAAGKPK 343
KI RG EC FE I AG+ +
Sbjct: 290 GFFKIRRGTNECGFEESIIAGQVR 313
>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 158/319 (49%), Gaps = 17/319 (5%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
I ++N +TW AG N +++ + + G + + +P
Sbjct: 41 IQKVNSSNSTWKAGEN------TKWINSDIAGVKAHMGVKLGQESGIKLETVSAQANGLP 94
Query: 86 DRFDAREQWPN-CGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
+ FDAR QW + C ++ V D C + F A + SDR CI GQ R LST+ + +
Sbjct: 95 EEFDARVQWGDKCSSLWEVRDQSTCGSCWAFGAAESLSDRHCIH-LGQDIR-LSTQNLLT 152
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
CC C C G ++ G VTG YG+ + CQ T +PC+HH ++ P
Sbjct: 153 CCAAC----GDGCDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAPCAHHVTSDIYP 208
Query: 205 SCENQKVPKLKCHTRC-TNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C + +P C C +N T+ + +D HR + Y + +E AI EI +GP
Sbjct: 209 PCTGE-LPTPPCINSCDSNSTHTIPYSKDIHRGSKAYGIAKDEKAIMAEIYKNGPIEVAL 267
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+DF YK+GVY+H + +L H+ K++GWG ENGTPYW ++N+W WGD+GT K
Sbjct: 268 TVYEDFLTYKTGVYQHVTGDELGG--HAVKMVGWGVENGTPYWTIVNSWNESWGDKGTFK 325
Query: 324 ILRGKYECAFEYLIAAGKP 342
ILRGK EC E P
Sbjct: 326 ILRGKNECGIESSCVTALP 344
>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
Length = 341
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 117/326 (35%), Positives = 157/326 (48%), Gaps = 24/326 (7%)
Query: 23 DAYIDQINREANTWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP--- 78
++ + IN W AG N N++ +Y+R+ Q L G T D
Sbjct: 34 ESIANDINARNVGWKAGVNERFVNVTMDYIRK----------QMGTRLEGSPVTLDVKHV 83
Query: 79 EYSATVPDRFDAREQWPN-CGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
E A +P FD+R QW + C ++ V D C + F AV A +DR CI SKG Q +
Sbjct: 84 EVPADLPTSFDSRTQWGSMCPSVKEVRDQANCGSCWAFGAVEAMTDRTCIASKGAQTPHI 143
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S E + +CC D C+ G W + +G VTGG Y GCQP +++ C HH
Sbjct: 144 SAEDLLTCCTFTCGD---GCNGGYPAAAWEYWKNQGIVTGGQYDSNQGCQPYSLAKCEHH 200
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+ P P VP C C Y + DKH +Y V D I EI+ +G
Sbjct: 201 TTGPYKPC--GDIVPTPACKRSCRQ-GYNVTYPNDKHFGASSYGVR-GVDQIATEIMTNG 256
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P A F +Y DF YKSGVY+HTS L H+ K+IGWG ++GT YW+V N+W WG
Sbjct: 257 PVEAAFTVYSDFLSYKSGVYQHTSGQPLGG--HAIKIIGWGVQDGTDYWIVANSWNDSWG 314
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
+ G I +G EC E + AG PK
Sbjct: 315 NDGFFWIKKGTDECGIESQVVAGLPK 340
>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
Length = 339
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 112/346 (32%), Positives = 159/346 (45%), Gaps = 21/346 (6%)
Query: 4 ILVFLLGCTLVRGE----LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL--IA 57
I+ LL R E SD + IN +ANT + + +R+ L +
Sbjct: 8 IMYALLCAESFRAEYIPSFESLSDEIVHYINHKANTTWKAAKYQRFKTISDVRRVLGAVP 67
Query: 58 DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
D F R L + + +P+ FDARE+WP C +I + D C + F A
Sbjct: 68 DPNGFGLEKRCLLSTIREQE------LPESFDAREKWPYCSSIAEIRDQSNCGSCWAFGA 121
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
GA SDR CI S G+ +S E + CC C C G + W + + G VTG
Sbjct: 122 AGAISDRICIASGGKHQPRISPEDLVDCCADC----GMGCQGGYPAQAWEYWVRNGLVTG 177
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
Y C+P + PC HH P P + P+ C +C P Y + + DK
Sbjct: 178 DLYNTTDTCRPYSFPPCEHHVVGPRKPCTGDPTTPQ--CVKKC-QPEYPKTYENDKWYGL 234
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
Y + +++AI ++++ +GP F +Y DF Y SGVY+H + L H+ +L+GW
Sbjct: 235 KAYSIHSDQEAIMRDLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGG--HAVRLVGW 292
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G E+G YWL+ N+W WGD G KI RG EC E AG PK
Sbjct: 293 GVEDGADYWLIANSWNTDWGDGGYFKIRRGVNECGIESDANAGHPK 338
>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 96/258 (37%), Positives = 139/258 (53%), Gaps = 9/258 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAR +WP C ++ H+ D C + + A SDR CI S G++ +S +
Sbjct: 2 IPESFDARTKWPKCSSLKHIHDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61
Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C Y C+ G + +N+ K+G+VTGGDY +GC+P PC HHG
Sbjct: 62 SCCGNQCGY----GCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTY 117
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
C N+ KC +C + +D+ Y V ++E AI++EI+ +GP
Sbjct: 118 YGECPNEATTP-KCVRKCQKSYKKS-YKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGA 175
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+DF +YK G+YKHT+ H+ K+IGWG ENG PYWL+ N+W WG+ G
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGG--HAIKIIGWGKENGVPYWLIANSWHNDWGENGYF 233
Query: 323 KILRGKYECAFEYLIAAG 340
+ILRG C E + AG
Sbjct: 234 RILRGSNHCGIEENVVAG 251
>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/321 (36%), Positives = 155/321 (48%), Gaps = 22/321 (6%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
+D +N TWTAG N + LR + + + LP R P+ A +P
Sbjct: 164 VDFVNALGTTWTAGHN--KRFTYNTLRH--VKNLCGAKKGGPKLPVKRI---PKKMA-LP 215
Query: 86 DRFDARE--QWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
FD R+ +WP C ++ HV D G+C + F A A +DR CI S GQ N LS E +
Sbjct: 216 TSFDPRDGSKWPACKDSLNHVRDQGSCGSCWAFGAAEAMTDRICIASNGQNNFYLSAEDL 275
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C C G W++ G VTGGD+ GC P + C HH +
Sbjct: 276 TSCCDSC----GMGCEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACDHHVTGKY 331
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P + Q P C C N + DKH +Y V ++ +I EI +GP A+
Sbjct: 332 QPCGDIQPTPA--CANSCQN---NATWSSDKHFGASSYSVGTDQQSIMTEIYTNGPVEAS 386
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
+ +Y DF YKSGVY+H + L H+ K+IGWG + TPYW+V N+W WG+ G
Sbjct: 387 YDVYADFVSYKSGVYQHVTGDYLGG--HAVKIIGWGVDGSTPYWIVANSWNNDWGNNGFF 444
Query: 323 KILRGKYECAFEYLIAAGKPK 343
ILRG EC E I AG PK
Sbjct: 445 NILRGSDECGIEDGIVAGIPK 465
>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
Length = 344
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 151/290 (52%), Gaps = 15/290 (5%)
Query: 56 IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
+ D ++ + +P+ D + +P+ FDAR WPNC ++ H+ D C +
Sbjct: 67 LMDRRFIKHNRKPIVEDVN----DDGDDIPESFDARTHWPNCSSLTHIRDQADCGSCWAV 122
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
+ A SDR CI SKG + +S + SCC C C G V + F ++G+V
Sbjct: 123 STASALSDRICIASKGAKQVYVSATDILSCCHSC----GDGCDGGYVIDAFKFFAEQGAV 178
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
TGGDYG + C+P PC HHG+ C E+ P+ C +C Y + +D+
Sbjct: 179 TGGDYGAKDCCRPYPFHPCGHHGNETYYGECPEDGSTPE--CVRKCQE-GYETEYHEDRV 235
Query: 235 RTTLTYWVD-DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
R Y + + AI+KEI+ +GP A F ++DDF Y+ G+Y H + + H+ K
Sbjct: 236 RGEDAYRLPIGSVKAIQKEIMRNGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGG--HAVK 293
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+IGWGTE+G PYW++ N+W WG+ G +++RG +C E + AGK K
Sbjct: 294 IIGWGTEHGVPYWIIANSWHSDWGEDGYFRMVRGINDCGIETNVVAGKFK 343
>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 115/334 (34%), Positives = 160/334 (47%), Gaps = 34/334 (10%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD---RKTYDPEY 80
+ +D+IN + TWTA + ++ + + DAK + D RK Y E
Sbjct: 3 SLVDEINSKQTTWTA------STGQKRFKNLSLRDAKMLCGTRMRGSNDKVIRKGYAIEE 56
Query: 81 SATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P FDAR +PNC IGH+ D AC + F AF+DR C+KS G LS
Sbjct: 57 LQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSA 116
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR------TGCQPSTISP 193
+ +C + C G W+++H G TGGDY R GC P P
Sbjct: 117 GEMNACAP------SYGCDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGDGCWPYDFPP 170
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH----RTTLTYWVDDNEDAI 249
C+HH + P C C +C NP Y D+H + Y V++ ++AI
Sbjct: 171 CAHHINDTKYPKCPKGSYETPNCVEQCHNPKYSTSLKNDRHYMLESSPYQYSVNNAKNAI 230
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
+ + GP +A++ +Y+DF YKSGVYKHTS + L H+ K+IGWG ENG YWLV+
Sbjct: 231 RTD----GPVSASYLVYEDFLAYKSGVYKHTSGSYLGG--HAVKIIGWGEENGEAYWLVV 284
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
N+W WGD G KI G C + + G PK
Sbjct: 285 NSWNEDWGDHGLFKIALGN--CQIDDDLLGGTPK 316
>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 102/316 (32%), Positives = 147/316 (46%), Gaps = 22/316 (6%)
Query: 28 QINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDR 87
++N TW A P +YL DR LP +P+
Sbjct: 26 EVNAMKTTWIANEAIPTRDYTQYLGVLF---------GDRQLPSKTIVA----RGDLPES 72
Query: 88 FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
FD E+WP C ++ + D C + F A A +DR CI SKG+ LS + + +CC
Sbjct: 73 FDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSEQDLLTCCD 132
Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
C + C G + W + G TGG+YG + C + C HH P E
Sbjct: 133 SCGF----GCDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAYSFPKCEHHAEGKYPPCGE 188
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
+Q+ P+ C +C Y + +DKH Y+V DAIK E++ +GP +F +Y+
Sbjct: 189 SQETPE--CVKQCQE-GYPVEYEKDKHFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYE 245
Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
DF YKSG+Y+H + L H+ KL+GWG E+G YW + N+W WG+ G +I+ G
Sbjct: 246 DFLTYKSGIYQHVAGKYLGG--HAVKLVGWGVEDGIEYWKIANSWNEDWGENGYFRIVAG 303
Query: 328 KYECAFEYLIAAGKPK 343
K EC E G PK
Sbjct: 304 KGECGIEVGPIGGIPK 319
>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
Length = 342
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 97/262 (37%), Positives = 146/262 (55%), Gaps = 8/262 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S GQQ+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SC D C G + W++ KRG VTGG + TGCQP C H
Sbjct: 145 ALDLISC----CEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C C Y + QDKH Y V NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTC-QKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 258
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E G PYWL+ N+W WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316
Query: 319 RGTVKILRGKYECAFEYLIAAG 340
+G +++RG+ EC+ E + AG
Sbjct: 317 KGLFRMVRGRDECSIESHVVAG 338
>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 329
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 109/329 (33%), Positives = 164/329 (49%), Gaps = 34/329 (10%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP-- 78
SD I IN+ N + A+ S+ + + DA++ + P R+ P
Sbjct: 30 LSDEMISFINKHPNA-----GWKADKSDRF---HSVDDARFLLGGRKEDPNLRQKRRPTV 81
Query: 79 ---EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
+ + +P FD+R++WP C +I + D C + +AVGA SDR CI+S G+Q+
Sbjct: 82 DHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAISDRICIQSGGKQS- 140
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
Y S C G + +W++ RG VTGG + TGC+P C
Sbjct: 141 -----YCGS-----------GCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCD 184
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H +C ++ +C C Y + QDKH +Y V E I+K+I+
Sbjct: 185 HFVKG-KYRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 242
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
HGP A +Y+DF +YKSG+Y++T+ + H+ +LIGWG ENGT YWL NTW
Sbjct: 243 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNED 300
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WG++G +I+RG+ EC+ E IAAG K+
Sbjct: 301 WGEKGYFRIVRGRNECSIESEIAAGLIKS 329
>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
Length = 330
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 114/342 (33%), Positives = 165/342 (48%), Gaps = 22/342 (6%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
L ++ CT + EL SD YI+Q+N + W AGRNF + S +++ L
Sbjct: 8 LAAVVSCTFAQPELDFLSDEYIEQLNSKNLPWKAGRNFERDTSLYNIQRLLSVG------ 61
Query: 65 SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
+ P + + +P+ FDAR+QW C +I + D C + ++ SDR
Sbjct: 62 TINPPSEFETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDR 121
Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
CI+S + +S + CC+ C + + C G T+ G V+GG+Y
Sbjct: 122 ICIQSDQKNQLRISAADMIECCESCTFSVD-GCHGGIPSFTFTEWKDSGFVSGGEYNSTN 180
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
GC + C+ PSC+ P C C + + + +DKH Y +
Sbjct: 181 GCMSYPLPRCN--------PSCKTLYDAPT--CKKECDKGSPLK-YEEDKHYAKQAYRIM 229
Query: 244 DN-EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
E I+ EI+ +GP A+F +Y DF HY SGVYK +KL H+ ++IGWG ENG
Sbjct: 230 SKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGG-HAVRIIGWGIENG 288
Query: 303 T-PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
T PYWLV N+W WGD+G KI RGK EC E I AG P+
Sbjct: 289 TYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 330
>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 341
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 161/313 (51%), Gaps = 28/313 (8%)
Query: 46 LSEEYLRQFLIADAKYFDQSDRPLPGDRKTY----------------DPEYSATVPDRFD 89
LS E L +L + F+ + P PG ++ DPE +P+ +D
Sbjct: 32 LSGEPLVAYLRKNQNLFEVNSTPTPGFKQKIMDIKFRNQNPNLIVKDDPEPEDDIPEEYD 91
Query: 90 AREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK-I 148
R+ W NC + ++ D C + + A SDR CI +K ++ +S + +CC
Sbjct: 92 PRKIWSNCTSF-YIRDQANCGSCWAVSTAAAISDRICIATKARKQVNISATDLVTCCTPT 150
Query: 149 CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-E 207
C + C G + W + G V+GG+Y + C+P I PC HHG+ C E
Sbjct: 151 CGF----GCDGGWSIKAWEYFTYAGLVSGGEYRSKRCCRPYPIHPCGHHGNDTYYGECPE 206
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
P C +C P Y + + DK T + + + +AI+KE+L +GP TA+FA+Y+
Sbjct: 207 EASTPS--CKKKC-QPGYRKLYRMDKRYGTDAFQLPKSVEAIQKELLKNGPVTASFAVYE 263
Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
DF YKSG+Y+HT+ +L Y H+ K+IGWGTEN T YWL+ N+W WG+ G +I+RG
Sbjct: 264 DFSLYKSGIYRHTA-GELRGY-HAVKMIGWGTENRTDYWLIANSWHDDWGENGYFRIIRG 321
Query: 328 KYECAFEYLIAAG 340
+C E +AAG
Sbjct: 322 INDCGIEENVAAG 334
>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
Length = 356
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 167/349 (47%), Gaps = 35/349 (10%)
Query: 11 CTLVRGELYKFSDAYIDQINREANTWTA-------GRNFPANLSEEYLRQFLIADAKYFD 63
C++ R +++ A ++ IN+ ++W A + Y L D Y
Sbjct: 2 CSICRPKVHLTGKALVEHINKVQSSWVAEYTEISESEKKSKVMDSRYANPSLDEDDSYVL 61
Query: 64 QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
++ R LP ++P FDAR WP C +I V D C + F A SD
Sbjct: 62 RNQRILP------------SIPTTFDARTNWPKCNSIKMVRDQSNCGSCWAFGAAEVISD 109
Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-GD 182
R CI S G++ +S E + +CC + + + W G+VTGGDY GD
Sbjct: 110 RICIHSNGKEQPVISAEDILTCCGKSCGNGCQGGQGLEAMKFWT---TYGAVTGGDYKGD 166
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR--GFFQDKHR---TT 237
GC+P + +PCS+ + T PSC+++ + YG+ G ++H+ T
Sbjct: 167 --GCKPYSFAPCSNCVESKTTPSCQSKCQSTYTVTNYKGDKHYGKNEGKVTERHKHLECT 224
Query: 238 LTYWVDDNEDA---IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
Y +D + +A I+ EI +GP + +YDDFYHYKSGVY H + H+ K+
Sbjct: 225 SAYRLDTSSNAVPIIQNEIYQNGPVEVAYTVYDDFYHYKSGVYHHVTGKDTGG--HAVKI 282
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
IGWGTE G YWLV N+WG +GD+G KI RG EC E + AG K
Sbjct: 283 IGWGTEKGVDYWLVTNSWGTSFGDKGFFKIRRGTNECGIESNVVAGMAK 331
>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
Length = 341
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 110/319 (34%), Positives = 151/319 (47%), Gaps = 21/319 (6%)
Query: 28 QINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYS--ATVP 85
++N+ +WTAG N +F A + L G + + + + A +P
Sbjct: 40 EVNQAQTSWTAGVN----------SRFARATDDFIKSQMGVLEGGPQLPEKDIAVLADLP 89
Query: 86 DRFDAREQW-PNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
FD+REQW C + + D AC + F AV + +DR CI SKG +S + + +
Sbjct: 90 TAFDSREQWGSTCPSTKEIRDQAACGSCWAFGAVESMTDRICIASKGSLRPHISAQDLMT 149
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
CC + CS G W++ G VTGG+Y GCQP ++ C HH S P
Sbjct: 150 CC---LFTCGSGCSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPYSLPNCDHHVSG-QYP 205
Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
+C + P C C Y + DKH Y V D I EI+ +GP F
Sbjct: 206 ACSGEG-PTPACKKSC-EAGYNNTYSNDKHFGATAYSVAGEADKIATEIMTNGPVEGAFT 263
Query: 265 LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKI 324
+Y+D YKSGVY+HT+ L H+ K+IGWG E+G YW V N+W WGD G KI
Sbjct: 264 VYEDLLTYKSGVYQHTTGQVLGG--HAIKIIGWGVESGVDYWWVANSWNNDWGDNGFFKI 321
Query: 325 LRGKYECAFEYLIAAGKPK 343
+G EC E I AG PK
Sbjct: 322 KKGVDECGIESQIVAGMPK 340
>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
Length = 343
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 152/319 (47%), Gaps = 17/319 (5%)
Query: 23 DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
A++D IN + + A + A E + I D+KY + P + + Y
Sbjct: 37 QAFVDYINEHQSFYRAEYSPEA----EAFVKARIMDSKYLVE-----PKKEEVLEDVYGN 87
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
P FDAR WP C +IG + D +C + ++ A SD C++S +S +
Sbjct: 88 DPPASFDARTHWPECRSIGTIRDQSSCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDI 147
Query: 143 ASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
SCC I C Y C G + ++ + G VTGG Y + C+P PC HH + P
Sbjct: 148 LSCCGISCGY----GCQGGWPIEAYKWMQRDGVVTGGKYRQKKVCKPYAFYPCGHHQNDP 203
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
C P KC C Y + + +DKH T Y++ +NE I++EI +GP A
Sbjct: 204 YYGPCPGGLWPTPKCRKTCQR-KYNKSYQEDKHFATRAYYLPNNERNIRQEIYKNGPVVA 262
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
F +Y DF +YK G+Y H + H+ K++GWG EN T YWL+ N+W WG+ G
Sbjct: 263 AFRVYQDFSYYKKGIYVHKWGGQTG--AHAVKVVGWGRENATDYWLIANSWNTDWGESGY 320
Query: 322 VKILRGKYECAFEYLIAAG 340
+I+RG EC E + G
Sbjct: 321 FRIVRGTNECGIEAQMVGG 339
>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
Length = 340
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 126/351 (35%), Positives = 161/351 (45%), Gaps = 27/351 (7%)
Query: 4 ILVFLLGCT----------LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQ 53
+ +FLLG +V L SD +D IN TW AG N E R+
Sbjct: 5 VALFLLGVLASVRAEEGRLMVPAYLAPLSDKMVDYINFINTTWKAGHNEGHRDLETVRRK 64
Query: 54 FLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP- 112
+ + D LP + +P +FD+R+QW + P T A P
Sbjct: 65 LGV----HRDNHKYRLP---ELVHDTLEMDIPAQFDSRQQWQDWPHHPGDPGTKERADPV 117
Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
F AV + SDR CI S + L+ + V SCC C C+ G W++ +
Sbjct: 118 GHFGAVESMSDRHCIHSGAKNIVHLAADDVLSCCWGC----GSGCNGGFPAAAWSYWVDK 173
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
G VTGG+Y GC P + C HH + TL C Q P KC R Y F D
Sbjct: 174 GIVTGGNYDTDEGCMPYPVPSCDHHVNG-TLGPC-GQDPPTPKC-VRLCRKGYNVDFKDD 230
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
KH +Y V NE I+ EI+ +GP F +Y DF YKSGVYK S L H+
Sbjct: 231 KHYGKSSYSVPSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGG--HAI 288
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+++GWG EN PYWLV N+W WGD+G KILRG EC E I AG PK
Sbjct: 289 RILGWGVENDVPYWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAGIPK 339
>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
Length = 326
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 109/319 (34%), Positives = 158/319 (49%), Gaps = 25/319 (7%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA-TV 84
+D IN A+T+ N+ + + R ++ + P P + + + E+
Sbjct: 32 VDHINSAASTFQT-ENYAVTHEKMHTRSM-------HEKFNAPFPDEFRATEREFVLDAT 83
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
P FDAR +WP C ++ + + C + F+ SDR CI S G Q +S + +
Sbjct: 84 PLNFDARTRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLT 143
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
CC + + C G +R + + +RG VTGGDY TGC+P I PC+
Sbjct: 144 CCGM---SCGEGCDGGFPYRAFQWWARRGVVTGGDYLG-TGCKPYPIRPCNSD------- 192
Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
+C N + P C C P Y + DK+ Y V AI+ +I +GP A F
Sbjct: 193 NCVNLQTPP--CRLSC-QPGYRTTYTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFI 249
Query: 265 LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKI 324
+Y+DF YKSG+Y+H + H+ KLIGWGTE GTPYWL +N+WG WG+ GT +I
Sbjct: 250 VYEDFEKYKSGIYRHIAGRSKGG--HAVKLIGWGTERGTPYWLAVNSWGSQWGESGTFRI 307
Query: 325 LRGKYECAFEYLIAAGKPK 343
LRG EC E I AG P+
Sbjct: 308 LRGVDECGIESRIVAGLPR 326
>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 97/262 (37%), Positives = 145/262 (55%), Gaps = 8/262 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S GQQ+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ S C D C G + W++ KRG VTGG + TGCQP C H
Sbjct: 145 ALDLIS----CCEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C C Y + QDKH Y V NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 258
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E G PYWL+ N+W WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVAGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316
Query: 319 RGTVKILRGKYECAFEYLIAAG 340
G +++RG+ EC+ E + AG
Sbjct: 317 NGLFRMVRGRDECSIESHVVAG 338
>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 181 bits (458), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 95/258 (36%), Positives = 138/258 (53%), Gaps = 9/258 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAR +WP C ++ H+ D C + + A SDR CI S G++ +S +
Sbjct: 2 IPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61
Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C Y C+ G + +N+ K+G+VTGGDY +GC+P PC HHG
Sbjct: 62 SCCGNQCGY----GCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTY 117
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
C N+ KC +C + +D+ Y V ++E AI++EI+ +GP
Sbjct: 118 YGECPNEATTP-KCVRKCQKSYKKS-YKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGA 175
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+DF +YK G+YKHT+ H+ K+IGWG E G PYWL+ N+W WG+ G
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGG--HAIKIIGWGKEGGVPYWLIANSWHNDWGENGYF 233
Query: 323 KILRGKYECAFEYLIAAG 340
+ILRG C E + AG
Sbjct: 234 RILRGSNHCGIEENVVAG 251
>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 97/262 (37%), Positives = 145/262 (55%), Gaps = 8/262 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S GQQ+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ S C D C G + W++ KRG VTGG + TGCQP C H
Sbjct: 145 ALDLIS----CCEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C C Y + QDKH Y V NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 258
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E G PYWL+ N+W WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVAGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316
Query: 319 RGTVKILRGKYECAFEYLIAAG 340
G +++RG+ EC+ E + AG
Sbjct: 317 NGLFRMVRGRDECSIESHVVAG 338
>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
Length = 325
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 121/349 (34%), Positives = 163/349 (46%), Gaps = 41/349 (11%)
Query: 2 IHILVFLLGCTL-------VRGELYKFSDAYIDQINREANTWTAGRN--FPANLSEEYLR 52
+ + LL C L + Y F + I ++NRE W AGR F + +EEY+
Sbjct: 1 MKLTALLLVCALLSINAAHIESNYYPF-EKEIYEVNRENLGWVAGRQKRFEGH-TEEYIA 58
Query: 53 QFL-IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAA 111
+ + SD P+ D +PD FD+R QWP+C TIG + D C +
Sbjct: 59 GLCGVKGSIPLPLSDLPVLED-----------IPDMFDSRTQWPDCKTIGLIEDQSNCGS 107
Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHK 171
F A + SDR CI K + +S + CC+ C C G + WN+ +
Sbjct: 108 CWAFGATESMSDRYCIHMK--MHLLISAANLMECCRNC----GNGCEGGFLGAAWNYWKQ 161
Query: 172 RGSVTGGDYG----DRTGCQPSTISPCSHH--GSAPTLPSCENQKVPKLKCHTRCTNPTY 225
G VTGG Y + CQP + C HH GS P PS K+ K + Y
Sbjct: 162 EGLVTGGLYNPSATESDTCQPYPLPSCEHHINGSKPACPS----KIAKTPECVHTCHAGY 217
Query: 226 GRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL 285
+ QD H Y V I+ EI+ +GP A F +Y DF YKSGVYK S +L
Sbjct: 218 PTSYEQDLHYGESAYSVRRRVAEIQTEIMTNGPVEAAFTVYADFPAYKSGVYKRHSLRQL 277
Query: 286 ENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
H+ K+IGWG E+G PYWL+ N+W WGD G KI+RG+ EC E
Sbjct: 278 GG--HAVKMIGWGEEDGIPYWLIANSWNSDWGDHGYFKIVRGQDECGIE 324
>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 102/316 (32%), Positives = 151/316 (47%), Gaps = 22/316 (6%)
Query: 28 QINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDR 87
++N TW A P +YL + K + + + GD +P+
Sbjct: 26 EVNAMKTTWLANEAIPTRDYTQYLGA--LRGGKQLPEKNIAIRGD-----------LPES 72
Query: 88 FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
FD E+WP C ++ + D C + F A A +DR CI SKG+ LS + + +CC+
Sbjct: 73 FDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSDQDLLTCCE 132
Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
C + C+ G W++ H G TGG+YG + C C HH P E
Sbjct: 133 SCGF----GCNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAYEFPKCDHHVEGKYPPCGE 188
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
Q P+ C +C Y + +DKH Y V N +AIK E++ +GP F++Y+
Sbjct: 189 TQPTPE--CVEKCQE-GYPVEYKKDKHFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYE 245
Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
DF YKSG+Y+H + L H+ KL+GWG E+G YW + N+W WG+ G +I+ G
Sbjct: 246 DFMTYKSGIYQHVAGKYLGG--HAVKLVGWGVEDGVEYWKIANSWNEDWGENGYFRIIAG 303
Query: 328 KYECAFEYLIAAGKPK 343
K EC E AG P+
Sbjct: 304 KNECGIESDGVAGIPE 319
>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
Length = 342
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 98/265 (36%), Positives = 147/265 (55%), Gaps = 8/265 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S GQQ+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCCK C + W++ KRG VTGG + TGCQP C H
Sbjct: 145 ALDLISCCKDCGGGCKGGFPG----QAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C C Y + QDKH Y V NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 258
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E G PYWL+ N+W WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
+G +++RG+ EC+ E + AG K
Sbjct: 317 KGLFRMVRGRDECSIESHVVAGLIK 341
>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
Length = 342
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 93/275 (33%), Positives = 143/275 (52%), Gaps = 9/275 (3%)
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
DR + + + E +A +P+ FDAR QWP+C +I + D C + FA + SDR
Sbjct: 75 DRRIGKPQLQENEEDTAGIPESFDARTQWPHCPSISLIRDQADCGSCWAFAVGESISDRV 134
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
CI + + S E + +CC C + C G W + G VTGG YG +
Sbjct: 135 CIATDANKTAEFSVEDILTCCDECGF----GCDGGFPDAAWEYFVSTGVVTGGLYGTKNA 190
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
C+P ISPC +H + +C P C T C Y + DK R +Y + ++
Sbjct: 191 CRPYEISPCGNHPNETFYRNCTGVSTP--SCKTSC-QKGYPVSYKDDKTRGRKSYNLANS 247
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
AI+K+IL HGP ATF++Y+DF +YK G+Y++T H+ +++GWG EN Y
Sbjct: 248 VSAIQKDILKHGPLVATFSVYEDFMYYKKGIYRYTHGGYEGG--HAVRILGWGVENNVKY 305
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
W++ N+W WG+ G +++RG +C E ++AG
Sbjct: 306 WIIANSWNTDWGEDGFFRMVRGINDCGIEESVSAG 340
>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
Length = 247
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 7/253 (2%)
Query: 91 REQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICR 150
R QWP C TI + D +C + AA A SDR CI S GQ L+ SCC C
Sbjct: 1 RSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTYC- 59
Query: 151 YDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQK 210
+ C G + W++ + G VTGG + +RTGCQP + C H G + C +
Sbjct: 60 ---GQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYT 116
Query: 211 VPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFY 270
P C C Y + + QDK +Y V ++E I +EI+ +GP TFA++ DF
Sbjct: 117 YPTPPCARACQT-GYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFG 175
Query: 271 HYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYE 330
Y+SG+Y H + + H+ ++IGWG ENG YWL+ N+W WG+ G +++RG+ E
Sbjct: 176 VYRSGIYHHVAGKFIGR--HAVRMIGWGVENGVNYWLMANSWNEEWGENGYFRMVRGRNE 233
Query: 331 CAFEYLIAAGKPK 343
C E + AG P+
Sbjct: 234 CGIESEVVAGMPR 246
>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
Length = 252
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 96/234 (41%), Positives = 130/234 (55%), Gaps = 10/234 (4%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDARE WPNC TI V D G+C + F AV A SDR CI SKG +N S E +
Sbjct: 28 LPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLV 87
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C + C+ G W++ +G V+GG YG GC P I+PC HH +
Sbjct: 88 SCCWTCGF----GCNGGFPGAAWHYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRG 143
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P E K P KC +C + Y + QD HR Y + ++ D I++EI +GP F
Sbjct: 144 PCKEGGKTP--KCVKKCED-GYKVPYEQDLHRGKSAYSLSNDVDQIRQEIYTNGPVEGAF 200
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG-TPYWLVINTWGPHW 316
+Y+DF Y++GVYKH + L H+ +++GWG +NG PYWLV N+W W
Sbjct: 201 TVYEDFIAYRAGVYKHVAGKALGG--HAIRILGWGVQNGEIPYWLVANSWNTDW 252
>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
Length = 373
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 111/324 (34%), Positives = 152/324 (46%), Gaps = 24/324 (7%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
A +D IN +W A N +S+ ++ F + D ++ D G+
Sbjct: 37 ALVDHINTAQTSWLAEHNV---ISDSEMK-FKVMDERFADPLPEEESGEILVSGEIVPEP 92
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FDARE WP+C +I + + C + F A SDR CI+S G Q +S E +
Sbjct: 93 IPDTFDARENWPDCKSIKLIRNQATCGSCWAFGAAEVISDRICIQSNGTQQPIISVEDIL 152
Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C K C G F G+VTGGDY + GC P + +PC
Sbjct: 153 SCCGTTC----GKGCQGGYSIEAMRFWKSNGAVTGGDY-NGNGCMPYSFAPCQK------ 201
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA---IKKEILAHGPT 259
C P C T C + + DKH T Y + + I+ EI +GP
Sbjct: 202 -SPCVESTTPT--CKTTCQSSYTTANYTTDKHYGTSAYRLATTNNVVSTIQYEIYHNGPV 258
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
A++ +Y+DFY YKSGVY + S + H+ K+IGWGTEN YWLV N+WG +G+
Sbjct: 259 EASYKVYEDFYQYKSGVYHYVSGKLVGG--HAVKIIGWGTENDVDYWLVANSWGIKFGEG 316
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G KI RG EC E + AG K
Sbjct: 317 GFFKIRRGTNECQIESNVVAGVAK 340
>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
Length = 323
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 99/260 (38%), Positives = 135/260 (51%), Gaps = 18/260 (6%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FD+R +W NC +I + D C + F+ SDR CI +KG Q +S +
Sbjct: 81 IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
+CC D K FR WN RG VTGGD+ +GC+P +PC
Sbjct: 141 ACCGNSCGDGCKGGYPIQAFRWWN---SRGVVTGGDF-RGSGCRPYPFAPCI-------- 188
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
SC +K P C C Y + +DK Y V N AI+ EI+ +GP F
Sbjct: 189 -SCPEEKTPT--CSLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAF 244
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+D Y YKSGVY+HT+ L H+ K+IGWGT+NG PYWL+ N+WG +WG+ G +K
Sbjct: 245 TMYEDMYKYKSGVYRHTAGRLLGG--HAIKIIGWGTQNGIPYWLIANSWGANWGENGFLK 302
Query: 324 ILRGKYECAFEYLIAAGKPK 343
+ RG EC E + AG P+
Sbjct: 303 MRRGVNECGIERAVVAGMPR 322
>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
Length = 375
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 40/334 (11%)
Query: 16 GELYKFSDAYIDQINREANTWTAGRNFPAN-------LSEEYLRQFLIADAKYFDQSDRP 68
G + A+++ IN + TW AG N N LS+E ++ F + + ++P
Sbjct: 72 GNVLTSQAAFVEAINNRSTTWKAGVNPQRNDQYRTGVLSDESMK-FQLPLGFVLKKDEQP 130
Query: 69 LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
LP FDAR++W C ++ V + G C + + AAV +DR C+
Sbjct: 131 LPMS---------------FDARQKWSYCPSMNMVRNQGCCDSSYAVAAVSTMTDRWCVH 175
Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
S+G+ V SCC C + C G W++ + G +GG +G GCQ
Sbjct: 176 SEGKAQFNFGAYDVLSCCHRCGF----GCDGGVPSAVWHYWVENGITSGGAFGSHEGCQS 231
Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
C G + P C R P Y + +DKH + Y V +E+
Sbjct: 232 YPFDVCKKSGDSNDTPRC-----------LRFCQPGYNVTYPEDKHYGRVAYTVPKDEER 280
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLV 308
I E+ GP ATF +Y DF YKSGVY+HT ++ HS K++GWG EN YWL
Sbjct: 281 IMYEVFNFGPAQATFTMYTDFVQYKSGVYRHTFGVRVGT--HSVKVMGWGVENDVKYWLC 338
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
N+WG WGD G KI+RG+ +FE + AG P
Sbjct: 339 ANSWGAQWGDGGFFKIVRGEDHLSFETNVVAGLP 372
>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 398
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 114/333 (34%), Positives = 160/333 (48%), Gaps = 31/333 (9%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF------DQSDRPLPGDRKTYD 77
+ +D+IN + NTWTA + +E + + DAK D +D+ + K Y
Sbjct: 83 SLVDEINAKQNTWTA------SAEQEKFKTSSLRDAKMLCGTLTRDSNDKVV---EKVYA 133
Query: 78 PEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
E +P FDAR +P C IGHV D AC F AF+DR CIKS G +
Sbjct: 134 IEELKDLPTDFDARTAFPKCSKVIGHVRDQSACGDCWAFGVTEAFNDRLCIKSNGTFTKL 193
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT------GCQPST 190
LS + +C + + C G + W+++H G TGGDY R GC P
Sbjct: 194 LSAGEMNACAPSLK---DPGCRGGFPYSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPYD 250
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
PC+H P P+C L+C ++ + +F D++ + + D K
Sbjct: 251 FPPCAHFFKDPKYPACPKFARVNLRCVSKLRHMMVV--YFSDRYFMVESVPYHFSADDAK 308
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
I GP +ATF +Y+DF YKSGVYKHTS + L H+ K+IGWG + G YWLV+N
Sbjct: 309 NAIRTDGPVSATFYVYEDFLAYKSGVYKHTSGSLLG--AHAVKIIGWGEDGGEAYWLVVN 366
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+W WGD G KI G +C + + G PK
Sbjct: 367 SWNEGWGDHGLFKIALG--DCGIDNELLGGTPK 397
>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
Length = 331
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 125/354 (35%), Positives = 169/354 (47%), Gaps = 36/354 (10%)
Query: 1 MIHILVF--LLGCTLVRGELYKFS-DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
M +IL F ++ + E K S D I + + NT + NF N EE L+
Sbjct: 1 MANILFFTSIMLLSFYLTEQTKSSHDNMIANSDIKTNTLKSVENFGPNSGEEENIMMLLG 60
Query: 58 D--AKYFDQSDRPLPGDRKTYDPEYSATVPD--RFDAREQWPNCGTIGHVPDTGACAAPH 113
+ +S +P K +P Y + FDAR++WP C TIG V + G
Sbjct: 61 TRGVEAATKSKKPY----KIRNPRYVIDNQNHKEFDARKRWPQCKTIGEVYNEGNALLSW 116
Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVF--RTWNFLHK 171
+A G F+DR CI + G N+ LSTE + SC I K+ ++G V W +
Sbjct: 117 AYATTGVFADRMCIATNGSYNKHLSTEELISCSGI------KASANGWVRDGLAWEYFKT 170
Query: 172 RGSVTGGD-YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
G V+GG Y GCQPS I P + LP+ N++ C + YG
Sbjct: 171 HGLVSGGSIYNTNDGCQPSKIPPVCN------LPTKINKRT--------CVDYCYGNDTI 216
Query: 231 QDKH-RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
+ H + Y+ I+KE+ +GP TA LYDD + +KSGVY T NAK L
Sbjct: 217 KYNHDHVKVRYYYHVKPKDIQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVR-L 275
Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
KLIGWG ENG YWL++N+WG WG G +KI RGKY CA E + A PK
Sbjct: 276 QYVKLIGWGVENGVDYWLLVNSWGNEWGQNGLLKIKRGKYGCAVESFVYAAVPK 329
>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
Length = 379
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 111/305 (36%), Positives = 143/305 (46%), Gaps = 23/305 (7%)
Query: 51 LRQFLIADA----KYFDQSDRPLPGDRKTYDPEYSAT---VPDRFDAREQWPNCGTIGHV 103
R F+ A A +YF R +R++ +P FDAR +WPNC TIG +
Sbjct: 73 FRSFMGARAYDPWRYFMSVKRRQVNERRSLSSPSGFYSSSIPAEFDARLRWPNCPTIGEI 132
Query: 104 PDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVF 163
+ G+CA+ A SDR CI S + LS + SCCK+C K C G
Sbjct: 133 FEQGSCASCWAVAPTDVMSDRICIHSGSRHIVRLSAGNLLSCCKLC----GKGCKGGFPG 188
Query: 164 RTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPK-----LKCHT 218
W K G VTGG Y GCQ PC P K PK L+C
Sbjct: 189 GAWMHWSKHGIVTGGSYSSDYGCQKYQFFPCYQ----PRTKGSIKNKCPKTDNTLLECRE 244
Query: 219 RCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYK 278
C +Y + + QD + Y + ++ AI+ EI+ +GP A +Y+DF HYK GVY+
Sbjct: 245 TCRT-SYNKSYKQDLYYGESVYRIPNDARAIQLEIMENGPVQANLRIYEDFLHYKFGVYR 303
Query: 279 HTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIA 338
H LE H+ K+ GWGTE GTPYWL N W WG+ G KILRG E +
Sbjct: 304 HVHGQGLE--YHAVKIFGWGTEGGTPYWLAANPWSKRWGNGGFFKILRGSNHAEIEDHVM 361
Query: 339 AGKPK 343
AG PK
Sbjct: 362 AGIPK 366
>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 276
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 107/275 (38%), Positives = 141/275 (51%), Gaps = 27/275 (9%)
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
D + +P +FDAR++W C TIG V D G C + + AFSDR C+ + G N+
Sbjct: 18 DDDNYQEIPIKFDARKKWLRCKTIGEVRDQGHCGSDWAMSTSSAFSDRLCVATNGDFNQL 77
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
LS E + CC C CS G R W K G VTGG+Y GC+P + PC +
Sbjct: 78 LSAEEITFCCHTC----GDGCSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYRVPPCPN 133
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTT-----LTYWVDDNEDAI 249
+C Q + K + RCT YG F + HR T LTY I
Sbjct: 134 DDQGNN--TCSGQPMEK---NHRCTRMCYGDQDLDFDEDHRYTRDHYYLTY------RGI 182
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWL 307
+K+++ +GP A+F +YDDF YKSG+Y + NA +YL HS KLIGWG E G YWL
Sbjct: 183 QKDVINYGPIEASFDVYDDFPSYKSGIYVKSENA---SYLGGHSVKLIGWGEEYGVLYWL 239
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
++N+W WGD+G KI RG EC + G P
Sbjct: 240 MVNSWNADWGDKGLFKIRRGTNECGVDNSTTGGVP 274
>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
Length = 323
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 99/260 (38%), Positives = 135/260 (51%), Gaps = 18/260 (6%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FD+R +W NC +I + D C + F+ SDR CI +KG Q +S +
Sbjct: 81 IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
+CC D K FR WN RG VTGGD+ +GC+P +PC
Sbjct: 141 ACCGNSCGDGCKGRYPIQAFRWWN---SRGVVTGGDF-RGSGCRPYPFAPCI-------- 188
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
SC +K P C C Y + +DK Y V N AI+ EI+ +GP F
Sbjct: 189 -SCPEEKTPT--CSLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAF 244
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+D Y YKSGVY+HT+ L H+ K+IGWGT+NG PYWL+ N+WG +WG+ G +K
Sbjct: 245 TMYEDMYKYKSGVYRHTAGRLLGG--HAIKIIGWGTQNGIPYWLIANSWGANWGENGFLK 302
Query: 324 ILRGKYECAFEYLIAAGKPK 343
+ RG EC E + AG P+
Sbjct: 303 MRRGVNECGIERAVVAGMPR 322
>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
Length = 342
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 145/285 (50%), Gaps = 12/285 (4%)
Query: 56 IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
I D KY Q + + DP+ +P +D R+ W NC T ++ D C +
Sbjct: 63 IMDIKYKHQKLNLMVKE----DPDPEVDIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAV 117
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
+ A SDR CI SK ++ +S + +CC R C G W + G V
Sbjct: 118 STAAAISDRICIASKAEKQVNISATDIMTCC---RPQCGDGCEGGWPIEAWKYFIYDGVV 174
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
+GG+Y + C+P I PC HHG+ C P C +C P + + DK
Sbjct: 175 SGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGT-APTPPCKRKC-RPGVRKMYRIDKRY 232
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
Y V + AI+ EIL +GP A+FA+Y+DF HYKSG+YKHT+ +L Y H+ K+I
Sbjct: 233 GKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKMI 290
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
GWG EN T +WL+ N+W WG++G +I+RG +C E IAAG
Sbjct: 291 GWGNENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAG 335
>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 334
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 114/355 (32%), Positives = 175/355 (49%), Gaps = 42/355 (11%)
Query: 5 LVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIAD 58
++FL+ L+ L + D ID+ +T G N P ++ EE+L +++
Sbjct: 4 VLFLVSTMLLNSYLSEQATLFHDDNIIDKSVMGTDTLKVGENVGPNSVEEEHL---MLSG 60
Query: 59 AKYFDQSDRPL----PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
+ + + + +R+ + E + FDAR++WP+C TIG V + G
Sbjct: 61 TRGVEATSKSKMLHKTRNRRCFSVEIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSV--FRTWNFLHKR 172
+ G F+DR CI + G N+ LSTE + SC I K GSV + W +L
Sbjct: 121 YVPTGVFADRMCIATNGTYNQLLSTEELISCSGI------KEDEFGSVNDYYVWEYLKNH 174
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC---TNPTYGRGF 229
G V+GG Y GCQPS I P G+ PT + + C RC Y +
Sbjct: 175 GLVSGGKYNTNNGCQPSKIPPI---GNLPT-------GLYENTCEKRCYGNNTINYNQDH 224
Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD-DFYHYKSGVYKHTSNAKLENY 288
+ K+ + Y + I++E+ +GP + F ++D DF+ YKSGVY+ T+N++ +
Sbjct: 225 VKIKNHYDIEY------EDIQREVQNYGPVSMAFKVFDNDFFLYKSGVYEKTTNSEFIQW 278
Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
++ KLIGWG ENG YWL++N WG WG G KI RG EC E + AG+P+
Sbjct: 279 QYA-KLIGWGVENGVDYWLLVNFWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332
>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
Length = 339
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 92/258 (35%), Positives = 140/258 (54%), Gaps = 9/258 (3%)
Query: 85 PDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS-TEYV 142
P++FDAR+ WP C IGHV D C + +A SDR C++S G+ +S T+ +
Sbjct: 85 PEKFDARDAWPYCREIIGHVRDQSRCGSCWAVSAASVMSDRLCVQSNGKIKLHVSDTDIL 144
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
A C + C CS G F+ W ++ K G TGGDY + C+P PC +H +
Sbjct: 145 ACCGEFC----GDGCSGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPYAFHPCGNHENQVY 200
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
C P +C C Y + + +DK +YW+ ++E I+ +I+ +GP A
Sbjct: 201 YGVCPKGSWPTPRCEKFCQR-GYIKPYKKDKFYAKKSYWLPNDEKEIRLDIMKNGPVQAA 259
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+DF YK G+YKH ++ H+ K+IGWG +NGT YWL+ N+W WG+ G
Sbjct: 260 FDVYEDFKLYKRGIYKHKEG--IQTGGHAVKIIGWGKDNGTDYWLIANSWSKDWGESGFF 317
Query: 323 KILRGKYECAFEYLIAAG 340
+++RG+ +C E +I AG
Sbjct: 318 RMVRGENDCEIEDMITAG 335
>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
Precursor
gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
Length = 342
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 97/264 (36%), Positives = 138/264 (52%), Gaps = 8/264 (3%)
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
DP+ +P +D R+ W NC T ++ D C + + A SDR CI SK ++
Sbjct: 80 DPDPEVDIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAVSTAAAISDRICIASKAEKQVN 138
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
+S + +CC R C G W + G V+GG+Y + C+P I PC H
Sbjct: 139 ISATDIMTCC---RPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGH 195
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
HG+ C P C +C P + + DK Y V + AI+ EIL +
Sbjct: 196 HGNDTYYGECRGT-APTPPCKRKC-RPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKN 253
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
GP A+FA+Y+DF HYKSG+YKHT+ +L Y H+ K+IGWG EN T +WL+ N+W W
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKMIGWGNENNTDFWLIANSWHNDW 311
Query: 317 GDRGTVKILRGKYECAFEYLIAAG 340
G++G +I+RG +C E IAAG
Sbjct: 312 GEKGYFRIVRGSNDCGIEGTIAAG 335
>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 330
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 115/328 (35%), Positives = 157/328 (47%), Gaps = 28/328 (8%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
++ ID + E N+ AG N N +EE Q L+ + + + +
Sbjct: 24 LHNSIIDPSDMETNSLKAGENVLPNSAEEE-HQMLLETREVEAATKSKIMYKTRHPRSAI 82
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ + FDAR+ WP C TIG V D G +A G +DR CI + G N+ LSTE
Sbjct: 83 DNQIHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTE 142
Query: 141 YVASCCKICRYDDNKSCSHGSVF--RTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ C I K+ G+V W +L G V+GG Y GCQPS I P G
Sbjct: 143 ELIFCGGI------KTKQSGAVRGDDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPI---G 193
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRG---FFQDKHRTTLTYWVDDNEDAIKKEILA 255
+ PT + C RC YG ++ D + + Y + NED I+KE+
Sbjct: 194 NIPT-------HLYNHTCEERC----YGNNTIHYYHDHVKVSHYYNIKSNED-IQKEVQT 241
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP + F +YDDF+ YKSGVY T + L H KLIGWG ENG YWL++N+WG
Sbjct: 242 YGPVSVKFRVYDDFFLYKSGVYVKTEKS-LYVRRHFAKLIGWGVENGVDYWLLVNSWGNE 300
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WG G KI RG E E + AG+P+
Sbjct: 301 WGQNGLFKIKRGTNEVHVEDYVYAGEPE 328
>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
Length = 260
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 96/252 (38%), Positives = 137/252 (54%), Gaps = 11/252 (4%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
RKT D Y +P FDAR+ + +C IG V D G CA+ A FSDR CI S G
Sbjct: 15 RKTVDISYKIDIPREFDARQYFGSCADVIGDVKDQGNCASSWAVAVASTFSDRLCIASNG 74
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
Q LS + + SC ++ C GS F+ W +G VTGG++ GCQP I
Sbjct: 75 QFTDNLSAQNLLSCGD----EEKMGCDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPYKI 130
Query: 192 SPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAI 249
PC+H+G+ L +C + + ++ C +C N Y + D H+T++ Y N I
Sbjct: 131 RPCNHYGNG-NLKNCSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 189
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWLV 308
++EI+ +GP TA +Y++F YK G+YK T+ +L Y H KLIGWG + +GT YWL
Sbjct: 190 QQEIMTYGPVTAFMYVYENFMGYKEGIYKSTA-GELIGYHHV-KLIGWGVDGDGTEYWLA 247
Query: 309 INTWGPHWGDRG 320
+N+W +WG G
Sbjct: 248 MNSWNSNWGTNG 259
>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
Length = 342
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 98/259 (37%), Positives = 131/259 (50%), Gaps = 9/259 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAR WP+C +I + D +C + F AV A SDR CI SKG N+ LS +
Sbjct: 86 LPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLV 145
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C C G W+ G VTGG TGC+ C H G
Sbjct: 146 SCCTEC----GCGCRGGYSPIAWDLWKTHGIVTGGSKEKPTGCRSYPFPSCEHRGKG-QY 200
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P C +Q P +C RC T + +DK R ++Y V E A+ KEI+ GP A
Sbjct: 201 PPCPHQLYPTPECIKRCD--TKEIDYEKDKTRANISYNVYPAEQAVMKEIMLRGPVGAIL 258
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+D YKSGVY H L H +++GWG E+G PYWLV N+W WG++G ++
Sbjct: 259 HVYEDLLDYKSGVYFHVWGGHLGE--HGIRILGWGEEDGVPYWLVANSWNEDWGEKGYMR 316
Query: 324 ILRGKYECAFEYLIAAGKP 342
+LR + EC + AG P
Sbjct: 317 VLRWRNECGIVDQVTAGLP 335
>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 334
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 113/353 (32%), Positives = 177/353 (50%), Gaps = 38/353 (10%)
Query: 5 LVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIAD 58
++FL+ L+ L + D ID+ +T G N P ++ EE+L +++
Sbjct: 4 VLFLVSTMLLNSYLSEQATLFHDDNIIDKSVMGTDTLKVGENVGPNSVEEEHL---MLSG 60
Query: 59 AKYFDQSDRPL----PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
+ + + + +R+ + E + FDAR++WP+C TIG V + G
Sbjct: 61 TRGVEATSKSKMLHKTRNRRCFRVEIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+ G F+DR CI + G N+ LSTE + SC I + D+ S + V W +L G
Sbjct: 121 YVPTGVFADRMCIATNGTYNQLLSTEELISCSGI-KEDEFGSVNDDYV---WEYLKNHGL 176
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC---TNPTYGRGFFQ 231
V+GG Y GCQPS I P G+ PT + + C RC Y + +
Sbjct: 177 VSGGKYNTNNGCQPSKIPPI---GNLPT-------GLYENTCEKRCYGNNTINYNQDHVK 226
Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD-DFYHYKSGVYKHTSNAKLENYLH 290
K+ + Y + I++E+ +GP + F ++D DF+ YKSGVY+ T+N++ + +
Sbjct: 227 IKNHYDIEY------EDIQREVQNYGPVSMAFRVFDNDFFLYKSGVYEKTTNSEFIQWQY 280
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+ KLIGWG ENG YWL++N+WG WG G KI RG EC E + AG+P+
Sbjct: 281 A-KLIGWGVENGVDYWLLVNSWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332
>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 12/285 (4%)
Query: 56 IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
I D KY Q + + DP+ +P +D R+ W NC T ++ D C +
Sbjct: 63 IMDIKYNHQRLNLMVKE----DPDPEVDIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAV 117
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
+ A SDR CI SK ++ +S + +CC R C G W + G V
Sbjct: 118 STAAAISDRICIASKAEKQVNISATDIMTCC---RPQCGDGCEGGWPIEAWKYFIYDGVV 174
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
+GG+Y + C+P I PC HHG+ C P C C P + + DK
Sbjct: 175 SGGEYLTKGVCRPYPIHPCGHHGNDTYYGECRGT-APTPPCKKEC-RPGVRKVYRIDKRY 232
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
Y V + AI+ EIL +GP A+FA+Y+DF HYKSG+YKHT+ +L Y H+ K+I
Sbjct: 233 GKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKMI 290
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
GWG EN T +WL+ N+W WG++G +I+RG +C E IAAG
Sbjct: 291 GWGNENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAG 335
>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
Length = 339
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 155/322 (48%), Gaps = 18/322 (5%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
F+DA++ ++ A +W NF +N+ R K +S K YD Y
Sbjct: 34 FNDAFLRRVLARARSWKPDTNFRSNIHYHTFRSL-----KGIGESRTGFKVPIKHYDYVY 88
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FD+R++WPNC ++ + + G C + AA SDR CI + G +N ++ E
Sbjct: 89 DIDIPESFDSRDRWPNCDSLREIRNQGTCGSCWAVAAASVMSDRVCIHTNGTRNVAIAAE 148
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ CC C G+ F+ W G V+GG Y GC+P PC +
Sbjct: 149 DLMGCCADCGNGCEGGFLDGTSFQYWV---DAGLVSGGAYNSTEGCKPYPFKPCLY---- 201
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
C ++ PK K H C + R + +DK ++ Y V +E I+ EI+ +GP
Sbjct: 202 -PFTDCHREESPKCKHH--CQHGVDKR-YARDKVFGSVAYSVPRDERVIRYEIMTNGPVE 257
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y+D + YKSGVY+H + H+ ++IGWG E G PYWL+ N++G WGD G
Sbjct: 258 GGFDVYEDVFLYKSGVYRHVYGEHVGK--HAVRIIGWGREGGIPYWLISNSYGEDWGDHG 315
Query: 321 TVKILRGKYECAFEYLIAAGKP 342
KI+RG E + G P
Sbjct: 316 YFKIVRGINHLGIESKVITGLP 337
>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 244
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 98/245 (40%), Positives = 134/245 (54%), Gaps = 11/245 (4%)
Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
D AC + F V AF+ R CIKS G+ N+ LS + +CC I + + CS G+
Sbjct: 1 DQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAANMLACCNIGHFCLSFGCSGGNPIT 60
Query: 165 TWNFLHKRGSVTGGDYGDRT------GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHT 218
+W FLH G V+GG + GC P + C+HH C + C +
Sbjct: 61 SWTFLHTNGIVSGGGFVPEKNMKAADGCWPYSFPKCAHHQDGSDYKPCAKEIYDTPSCSS 120
Query: 219 RCTNPTYGRGFFQDKHRT-TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVY 277
C N YG F +D+H T +L + +IKKEI+ +GPT+A F++Y+DF YKSGVY
Sbjct: 121 SCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDFLSYKSGVY 180
Query: 278 KHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
KHTS L H+ ++IGWGTE G YWLV+N+W WGD GT KI++G +C + I
Sbjct: 181 KHTSGGFLGG--HAVEIIGWGTEKGVDYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDTI 236
Query: 338 AAGKP 342
AG P
Sbjct: 237 LAGTP 241
>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
Length = 246
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 94/230 (40%), Positives = 129/230 (56%), Gaps = 10/230 (4%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDARE WPNC TI V D G+C + F AV A SDR CI SKG +N S E +
Sbjct: 24 LPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAENLV 83
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C + C+ G W++ +G V+GG YG + GC P I+PC HH +
Sbjct: 84 SCCWTCGF----GCNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRG 139
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P E K P C +C + Y + QD HR Y + ++ D I++EI +GP F
Sbjct: 140 PCKEGGKTP--ACVKKCED-GYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAF 196
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG-TPYWLVINTW 312
+Y+DF Y++GVYKH + L H+ +++GWG +NG PYWLV N+W
Sbjct: 197 TVYEDFIAYRAGVYKHVAGKALGG--HAIRILGWGVQNGEIPYWLVANSW 244
>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
Length = 330
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 115/328 (35%), Positives = 156/328 (47%), Gaps = 28/328 (8%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
++ ID + E N+ AG N N +EE Q L+ + + + +
Sbjct: 24 LHNSIIDPSDMETNSLKAGENVLPNSAEEE-HQMLLETREVEAATKSKIMYKTRHPRSAI 82
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ + FDAR+ WP C TIG V D G +A G +DR CI + G N+ LSTE
Sbjct: 83 DNQIHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTE 142
Query: 141 YVASCCKICRYDDNKSCSHGSVF--RTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ C I K+ G+V W +L G V+GG Y GCQPS I P G
Sbjct: 143 ELIFCGGI------KTKQSGAVRGDDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPI---G 193
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRG---FFQDKHRTTLTYWVDDNEDAIKKEILA 255
+ PT + C RC YG ++ D + + Y + NED I+KE+
Sbjct: 194 NIPT-------HLYNHTCEERC----YGNNTIHYYHDHVKVSHYYNIKSNED-IQKEVQT 241
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP + F +YDDF+ YKSGVY T + L H KLIGWG ENG YWL++N WG
Sbjct: 242 YGPVSVKFRVYDDFFLYKSGVYVKTEKS-LYVRRHFAKLIGWGVENGVDYWLLVNFWGNE 300
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WG G KI RG E E + AG+P+
Sbjct: 301 WGQNGLFKIKRGTNEVHVEDYVYAGEPE 328
>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
Length = 249
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 131/239 (54%), Gaps = 10/239 (4%)
Query: 106 TGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT 165
+G+C A AAV A SDR CI SKG++ LS + + SCCK C + C G
Sbjct: 14 SGSCWA---VAAVEAMSDRICIMSKGKKQVTLSADDLLSCCKTCGF----GCFGGEPMAA 66
Query: 166 WNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTY 225
W + RG VTG +Y + +GC+P PC HH + C++ P KC +C + Y
Sbjct: 67 WKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKC-DKNY 125
Query: 226 GRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL 285
G+ + DK+ Y V+ N ++I+KEI+ GP A+F +Y DF +Y G+YKH + +
Sbjct: 126 GKSYKADKYYGQSVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMG 185
Query: 286 ENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
H+ K++GWG + G PYWL N+W WG+ G +ILRG EC E I AG PK
Sbjct: 186 GG--HAVKVLGWGIDQGVPYWLAANSWNTDWGEDGYFRILRGVNECGIESGIIAGIPKQ 242
>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
Length = 324
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 103/269 (38%), Positives = 136/269 (50%), Gaps = 16/269 (5%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FDARE+WP+C TI + + C + F A SDR CI+S G Q +S E +
Sbjct: 30 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 89
Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C Y C G F G+VTGGDYG GC P + +PC+ + T
Sbjct: 90 SCCGTTCGY----GCKGGYSIEALRFWASSGAVTGGDYGGH-GCMPYSFAPCTKNCPEST 144
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGF------FQDKHRTTLTYWVDDNEDA--IKKEIL 254
PSC+ K + YG FQ Y V + I+ EI
Sbjct: 145 TPSCKTTCQSSYKTEEYKKDKHYGELVWHSFNRFQRFLNRASAYKVTTTKSVTEIQTEIY 204
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP A++ +Y+DFYHYKSGVY +TS + H+ K+IGWG ENG YWL+ N+WG
Sbjct: 205 HYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGG--HAVKIIGWGVENGVDYWLIANSWGT 262
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+G++G KI RG EC E + AG K
Sbjct: 263 SFGEKGFFKIRRGTNECQIEGNVVAGIAK 291
>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
Length = 353
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 116/318 (36%), Positives = 151/318 (47%), Gaps = 32/318 (10%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
I+QIN + ++WTA N P + E L I F R +P
Sbjct: 24 INQINSQQSSWTARIN-PFDDIESRLGFLGIHPDPNFQLEVLEWEEPR--------TVIP 74
Query: 86 DRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
FDARE WP C IG++ + G C + FAA SDR C+ + G S E + +
Sbjct: 75 ATFDAREYWPQCKDVIGNIRNQGKCGSCWAFAAAEVMSDRLCVATNGSVKFEFSPEDLIN 134
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
CC+ C K C G + W + G V+GGDY GCQP + S + G +P
Sbjct: 135 CCETC----GKKCKGGYSYYAWKYYTSTGLVSGGDYNTSRGCQPYSKSN-FNDGVSP--- 186
Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG-PTTATF 263
+C C N Y + D+H TY++ N I++EIL G P A F
Sbjct: 187 ----------ECSKTCQNTKYPTSYLNDRHFGDGTYYILKNVTTIQQEILLRGGPVMAGF 236
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV- 322
+Y+DF Y+ GVY HTS A L + H+ K+IGWGTENG YWLV N+WG WG G V
Sbjct: 237 DVYEDFKLYREGVYVHTSGALLGS--HAVKIIGWGTENGWAYWLVANSWGKDWGALGGVF 294
Query: 323 KILRGKYECAFEYLIAAG 340
KI RG EC E I G
Sbjct: 295 KIRRGTNECKIEQSIITG 312
>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
Length = 342
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 111/353 (31%), Positives = 157/353 (44%), Gaps = 38/353 (10%)
Query: 1 MIHILVFLLGCTLVRGELY---------KFSDAYIDQINREANTWTAGRNF--PANLSEE 49
I I+ LG V G+ Y + + + QI TW AG N PA
Sbjct: 4 FILIVAAALGSPAVLGQYYNTFSYNGQYRSTGSIASQIRNLTRTWVAGNNTLPPA----A 59
Query: 50 YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
Y + L +D+ +P+ +P+ FDAR++W C ++ + + G C
Sbjct: 60 YFKGVL------YDRLGETRLAPAILVNPQ-DIQLPESFDARQKWSQCPSLNVIRNQGCC 112
Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
+ +A A +DR CIKSKG++ + +CC C C G + W F
Sbjct: 113 GSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACCHAC----GDGCKGGYLGPAWQFW 168
Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF 229
++G +GG Y R GC P I C G P KC RC +
Sbjct: 169 VEQGVSSGGPYNSRQGCHPYPIDVCDASGEEADTP----------KCSKRCQSGYNVTDV 218
Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
+QD+ + Y + ++E I +EI +GP A F Y D + YKSGVY+H
Sbjct: 219 WQDRRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGG-- 276
Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
H+ KL+GWG ENG YWLV N+WG WGD G KI+RG+ C E + AG P
Sbjct: 277 HAVKLMGWGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAGLP 329
>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
Length = 339
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 162/317 (51%), Gaps = 19/317 (5%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
A +D +N + + + P N E++++ I D KY ++ P RK + +
Sbjct: 36 ALVDYVNSHQSLFKTEYS-PTN--EQFVKA-RIMDIKYMTEASHKYP--RKGIN--LNVE 87
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+RFDARE+WP+C +IG + D AC + +A SDR CI++ G + LS+ +
Sbjct: 88 LPERFDAREKWPHCASIGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILSSADIL 147
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC-SHHGSAPT 202
+CC D C G + + +L G +GG+Y ++ C+P PC ++G P
Sbjct: 148 ACCG---EDCGSGCEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCDGNYGPCPK 204
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
+ + K K+ C R P F L DNE I++EI +GP A
Sbjct: 205 EGAFDTPKCRKI-CQFRYPVPYEEDKVFGKNSHILL----QDNEARIRQEIFINGPVGAN 259
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +++DF HYK G+YK T + +H+ KLIGWGTENGT YWLV N++ WG+ GT
Sbjct: 260 FYVFEDFIHYKEGIYKQTYGKWIG--VHAIKLIGWGTENGTDYWLVANSYNYDWGENGTF 317
Query: 323 KILRGKYECAFEYLIAA 339
+ILRG C E + A
Sbjct: 318 RILRGTNHCLIESQVIA 334
>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
Length = 342
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 111/353 (31%), Positives = 157/353 (44%), Gaps = 38/353 (10%)
Query: 1 MIHILVFLLGCTLVRGELY---------KFSDAYIDQINREANTWTAGRNF--PANLSEE 49
I I+ LG V G+ Y + + + QI TW AG N PA
Sbjct: 4 FILIVAAALGSPAVLGQYYNTFSYNGQYRSTGSIASQIRNLTRTWVAGNNTLPPA----A 59
Query: 50 YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
Y + L +D+ +P+ +P+ FDAR++W C ++ + + G C
Sbjct: 60 YFKGVL------YDRLGETRLAPAILVNPQ-DIQLPESFDARQKWSQCPSLNVIRNQGCC 112
Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
+ +A A +DR CIKSKG++ + +CC C C G + W F
Sbjct: 113 GSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACCHAC----GDGCKGGYLGPAWQFW 168
Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF 229
++G +GG Y R GC P I C G P KC RC +
Sbjct: 169 VEQGVSSGGPYNSRQGCHPYPIDVCDASGEEADTP----------KCSKRCQSGYNVTDV 218
Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
+QD+ + Y + ++E I +EI +GP A F Y D + YKSGVY+H
Sbjct: 219 WQDRRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGG-- 276
Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
H+ KL+GWG ENG YWLV N+WG WGD G KI+RG+ C E + AG P
Sbjct: 277 HAVKLMGWGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAGLP 329
>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 278
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 107/282 (37%), Positives = 141/282 (50%), Gaps = 25/282 (8%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
RK Y E +P FDAR +PNC IGH+ D AC + F AF+DR CIKS G
Sbjct: 10 RKGYAIEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSHG 69
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRT---G 185
LS + +C + C+ G W+++H +G TGGDY D T G
Sbjct: 70 TFTELLSAGEMNACAP------SHGCNGGFPNSAWSWVHDKGIATGGDYVAEDDMTKDDG 123
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH----RTTLTYW 241
C P PC+HH + P C C +C NP Y D+H + Y
Sbjct: 124 CWPYDFPPCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFMVESSPYQYS 183
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
V+D ++AI+ + GP +A+F +Y+DF YKSGVYKHTS L H+ K+IGWG E+
Sbjct: 184 VNDAKNAIRTD----GPVSASFTVYEDFLAYKSGVYKHTSGEYLGG--HAVKIIGWGEES 237
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G YWLV+N+W WGD G KI G C + + G PK
Sbjct: 238 GQAYWLVVNSWNEDWGDHGLFKIALGN--CGIDDYLLGGTPK 277
>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
Length = 369
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 121/333 (36%), Positives = 162/333 (48%), Gaps = 44/333 (13%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
+ I+QIN + + WTAG N P + E L I + D + +P + +P+ +
Sbjct: 21 SLINQINSQQSAWTAGIN-PFDDIESRLGFLGI----HPDPNFKP-----EIKEPQATQN 70
Query: 84 V-PDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
V P+ FDARE WP C IG++ + G C++ FAA SDR CI + G+ LS E
Sbjct: 71 VIPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPED 130
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ CC C C G + WN+ G V+GGDY TGCQP S +++ P
Sbjct: 131 LIDCCHYC----GNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQP--YSELNYYRITP 184
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG-PTT 260
C+T C N Y + DKH Y++ NE AI+ EIL+ G P
Sbjct: 185 -------------PCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGGGPVV 231
Query: 261 ATFALYDDFYHYK---------SGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINT 311
A F +Y DF Y+ GVY +TS A + K+IGWGTENG YWL N+
Sbjct: 232 AAFDVYGDFKIYRDGEQHDTILEGVYIYTSGALFGR--TAVKIIGWGTENGWAYWLAANS 289
Query: 312 WGPHWGDRGT-VKILRGKYECAFEYLIAAGKPK 343
WG WG G KI RG EC FE I AG+ +
Sbjct: 290 WGKDWGALGGFFKIRRGTNECGFEESIIAGQVR 322
>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
Length = 334
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 149/322 (46%), Gaps = 18/322 (5%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
F+D ++ ++ A TW NF +N+ R K +S + Y+ Y
Sbjct: 29 FNDDFLRRVLARARTWKPDTNFQSNVHFHAFRSL-----KGIGESRTGFKVPIRRYEYVY 83
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FDAR WPNC ++ + + G C + AA SDR CI S G N L+ E
Sbjct: 84 DVDIPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAE 143
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ CC C N G+ F+ W G V+GG Y GC+P PC +
Sbjct: 144 DLMGCCVDCGNGCNGGFLDGTSFQYWV---DAGLVSGGAYNSTDGCKPYPFKPCEY---- 196
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
C + PK H R R + +DK + Y V +E AI+ EI+ +GP
Sbjct: 197 -PFNDCHVEISPKCTHHCR---DGVDRHYSKDKLFGKVAYSVPRDERAIRYEIMTNGPVE 252
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A F +Y+D YKSGVY+H ++ H+ ++IGWG + G PYWL+ N++G WGD G
Sbjct: 253 AGFDVYEDVLLYKSGVYRHVYGEQIGK--HAVRIIGWGRDGGIPYWLIANSYGDDWGDHG 310
Query: 321 TVKILRGKYECAFEYLIAAGKP 342
K +RG E I G P
Sbjct: 311 YFKFVRGSNHLGIESKIITGLP 332
>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
Length = 332
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 114/323 (35%), Positives = 164/323 (50%), Gaps = 34/323 (10%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD--RKTYDPEYS 81
A +D +N + + ++SEE+++ + + KY P P D R T
Sbjct: 33 ALVDYVNSAQSLFITEH---VDVSEEFMKS-RVMNVKY----ASPPPSDEIRATEVNTVL 84
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
AT+P+ FDAR +WP C +I + + C + F A SDR CI +KG + +S
Sbjct: 85 ATIPETFDARTKWPKCKSIKLIRNQANCGSCWAFGAAEVISDRICIATKGARQPVISPMD 144
Query: 142 VASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-GDRTGCQPSTISPCSHHGS 199
+ CC + C Y C G + + G VTGGDY GD GC+P C+ G
Sbjct: 145 MVDCCGEYCGY----GCDGGYSIQALRWWVFDGVVTGGDYQGD--GCKPYQF--CNSAGC 196
Query: 200 APTL-PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P C L C ++ Y + +DK+ T Y+V +AI+ +I+ +GP
Sbjct: 197 PDAVTPEC------ALSCQSK-----YNTEYAKDKNFGTSAYYVGMTVNAIQTDIMTNGP 245
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A+F +Y+DFY YKSGVYK+ + L H+ K+IGWGTENGT YWL+ N+WG WG+
Sbjct: 246 VEASFKVYEDFYKYKSGVYKYIAGKMLGG--HAIKIIGWGTENGTAYWLIANSWGTKWGE 303
Query: 319 RGTVKILRGKYECAFEYLIAAGK 341
G KI RG EC E + AGK
Sbjct: 304 NGFFKIRRGVNECGIENNVVAGK 326
>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
Length = 248
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 94/233 (40%), Positives = 127/233 (54%), Gaps = 10/233 (4%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P+ FDARE+WPNC TI V D G+C + F AV A SDR CI S G +N S E
Sbjct: 23 STDLPETFDARERWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAE 82
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC C + C+ G WN+ +G V+GG YG GC P I+PC HH +
Sbjct: 83 NLVSCCWTCGF----GCNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNG 138
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P E K P C +C Y + QD H Y + ++ D I++EI +GP
Sbjct: 139 TRGPCKEGGKTP--TCVKKCEE-GYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVE 195
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG-TPYWLVINTW 312
F +Y+DF Y++GVYKH + L H+ +++GWG +NG PYWLV N+W
Sbjct: 196 GAFTVYEDFIAYRAGVYKHVAGKALGG--HAIRILGWGVQNGEIPYWLVANSW 246
>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
Length = 360
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 106/326 (32%), Positives = 160/326 (49%), Gaps = 23/326 (7%)
Query: 23 DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
+A+ + +N+ + +TA + P L+ +R + ++++ D + + K D ++S
Sbjct: 36 EAFAEFLNKRQSFFTA-KYTPNALNILKMR---VMESRFLDNEEGEM---LKEEDMDFSE 88
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P FDAR++WP C +IG + D C + ++ SDR C++S G LS +
Sbjct: 89 EIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGTIKVLLSDTDI 148
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
+CC C C G R W + G TGG YG + C+P PC +
Sbjct: 149 LACCPNC----GAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDE----S 200
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
C P KC C Y + + DK+ Y + NE IK EI+ +GP TA+
Sbjct: 201 YGKCPKDSFPTPKCRKICQY-KYSKKYADDKYYANSAYRIPQNETWIKLEIMRNGPVTAS 259
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE--NGT--PYWLVINTWGPHWGD 318
F +Y DF Y+ GVY + +L H+ K+IGWGTE NGT PYWL+ N+WG WG+
Sbjct: 260 FRIYPDFGFYEKGVYVTSGGRELGG--HAIKIIGWGTEKVNGTDLPYWLIANSWGTDWGE 317
Query: 319 -RGTVKILRGKYECAFEYLIAAGKPK 343
G +ILRG+ C E + AG K
Sbjct: 318 NNGYFRILRGQNHCQIEQKVIAGMIK 343
>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 455
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 114/320 (35%), Positives = 154/320 (48%), Gaps = 20/320 (6%)
Query: 24 AYIDQINREANTWTAGRNFP--ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE-- 79
+ +D+IN +WTA ++ P +S + L D + D G+ + P
Sbjct: 83 SMVDKINSMQQSWTASKDQPPFKGMSIKDLPAGCSNDTMFSSTLDEG--GENRLLGPTNP 140
Query: 80 YSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
T+P FDAR+++ +C IGHV + G C AAVG F+DR CIKS G+ LS
Sbjct: 141 VLTTLPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILS 200
Query: 139 TEYVASCCKICR-YDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------GDRTGCQPSTI 191
Y+ SCC + C GSV NF+ G VTGG+Y G+ GC P
Sbjct: 201 LGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEYKPPEELGNDDGCWPYPF 260
Query: 192 SPCSH-HGSAPTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
C+H G P C + + +P C T C N YG +D HR + + I
Sbjct: 261 PKCNHVPGLESKYPRCAQVRDLPA--CATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEKI 318
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
K+EI +GP A LY+DF YKSGVY H + L H+ KLIGWG E+G YWL +
Sbjct: 319 KQEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLA--AHTLKLIGWGVESGQEYWLAV 376
Query: 310 NTWGPHWGDRGTVKILRGKY 329
N W WGD G +K+ Y
Sbjct: 377 NAWNEEWGDHGMIKLASSVY 396
>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
Length = 358
Score = 174 bits (441), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 116/321 (36%), Positives = 160/321 (49%), Gaps = 22/321 (6%)
Query: 29 INREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRF 88
+N + W A E+ R + D K+ ++ D E +P+ F
Sbjct: 47 VNEQQQLWKA-ETSRMTFQEKMAR---VKDIKFIRSHEQSTENDNSQVFEE----IPNSF 98
Query: 89 DAREQWPNCGTIGHVPDTGAC-AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC- 146
DAR++WP+C IG V D C +A H+ AA A SDR CI S G N PLS + SCC
Sbjct: 99 DARQKWPSCSQIGAVRDQSDCGSAAHLVAAEIA-SDRTCIFSNGTFNWPLSAQDPLSCCV 157
Query: 147 ---KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH-HGSAPT 202
IC D C + G TGG+Y D+ GC+P TI PC + + T
Sbjct: 158 GLMSIC--GDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKKYPNGTT 215
Query: 203 LPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
C P C RCT N T+ + QDKH Y V I+ EI+ +GP A
Sbjct: 216 SVPCPGYHTPV--CEERCTSNITWPISYKQDKHFGKAHYNVGKKMTDIQTEIMRNGPVIA 273
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
+F +YDDF+ YKSG+Y HT+ + E + + K+IGWG +NG PYWL ++ WG +G+ G
Sbjct: 274 SFIIYDDFWDYKSGIYVHTAGDQ-EGGMDT-KIIGWGVDNGVPYWLCVHQWGTDFGENGF 331
Query: 322 VKILRGKYECAFEYLIAAGKP 342
V+ILRG E E+ + A +P
Sbjct: 332 VRILRGVNEVNIEHQVLAAQP 352
>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 271
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 95/231 (41%), Positives = 127/231 (54%), Gaps = 8/231 (3%)
Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
H F AV + SDR CI SK + + LS + SCC C + C G W++
Sbjct: 44 HAFGAVESMSDRICIHSKNKISVELSAINLLSCCTRCGF----GCRGGIPGMAWDYWKYE 99
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
G VTGG TGCQP C+HH S+ + P CE+ P +CH C + YG+ + +D
Sbjct: 100 GIVTGGSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQD-DYGKPYKKD 158
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
K +Y V E +I KEIL +GP F +Y+DF +YKSGVYKH + + L H+
Sbjct: 159 KFYGKSSYNVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGG--HAI 216
Query: 293 KLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
++IGWG +N PYWL N+W WGD+G KILRG EC E ++ AG P
Sbjct: 217 RIIGWGIQQNHIPYWLCANSWNNQWGDQGYFKILRGTNECGIESMVTAGLP 267
>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
Length = 386
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 93/259 (35%), Positives = 134/259 (51%), Gaps = 16/259 (6%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FDARE+WP C ++ + D G C + +A A +DR C++SKG++ + +
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C + C G++ W F ++G +GG R GC P I C G
Sbjct: 185 SCCHSC----GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGECRIPG----- 235
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
E++ PK C +C + +QD+H + Y + ++E I +EI +GP A F
Sbjct: 236 ---EDEDTPK--CSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAF 290
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
Y D + YKSG+Y+H H+ KL+GWG ENG YWLV N+WG WG+ G K
Sbjct: 291 HTYLDLHAYKSGIYRHVWGPLSGG--HAVKLLGWGVENGVKYWLVANSWGREWGENGFFK 348
Query: 324 ILRGKYECAFEYLIAAGKP 342
I+RG+ C E I AG P
Sbjct: 349 IVRGENHCGIEENIHAGLP 367
>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
Length = 721
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/352 (31%), Positives = 171/352 (48%), Gaps = 43/352 (12%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+ H+L + + G+ + ++ +N W A +SEE ++ F + D+K
Sbjct: 8 IAHLLQYTFSQQTLSGK------SLVNHVNTIQTLWKAEY---FEISEEEMK-FKVMDSK 57
Query: 61 YFDQSDRPLPGDRKTYDPEYS-----ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
+ P ++ + +P S + P FDAR+ WPNC +I + D C + F
Sbjct: 58 F------AFPEEQISSEPNNSLPGSLSRAPTSFDARDYWPNCKSIKMIRDQAYCGSCWAF 111
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
A SDR CI+S G +S E + +CC ++ C G V F +G V
Sbjct: 112 GAAEVISDRICIQSNGTDQPIISPEDILTCCT-----NSHGCQGGFVLEAMKFWKSKGVV 166
Query: 176 TGGDY-GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
TGGD+ GD GC P + CS +A T P C+N+ C + T Y +DK+
Sbjct: 167 TGGDFQGD--GCIPYSYGSCSDCHTAQTTPKCKNE------CQVKYTKNEYK----EDKY 214
Query: 235 RTTLTYWVDDNEDA--IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
+ Y + + I+ EIL +GP AT+ +Y+DFY+YKSGVY++ S + H+
Sbjct: 215 YGSSAYRLSTSNAVRTIQSEILRNGPVEATYQVYEDFYYYKSGVYEYISGRHMGG--HAV 272
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
K+IGWG E YWL+ N+WG +G+ G K+ RG EC E + AG K+
Sbjct: 273 KIIGWGVEENVNYWLIANSWGTGFGENGFFKMRRGNNECGIENYVVAGMAKS 324
>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 517
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/260 (35%), Positives = 131/260 (50%), Gaps = 27/260 (10%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FD+RE+WP C I + D C + +A +DR CI SKGQ+ +S E +
Sbjct: 280 LPKHFDSREKWPECEWIRFIRDQSNCGSCWAVSAASVMTDRHCIASKGQETPYISDEQIL 339
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
+C G + +N+ K G TGG YGD++ CQP +I+PCS +
Sbjct: 340 AC--------------GMIPSPFNYWKKMGIATGGPYGDKSCCQPYSIAPCSKCSYTAST 385
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
PSC K C P F+ +H Y V N+ I EI HGP A F
Sbjct: 386 PSC------KYDCQADYDIPISDDKFYASEH-----YHVSSNQYEIMNEIYTHGPVVAGF 434
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y+DF +Y SG+Y+ T+ + H+ ++IGWG ENG PYWL+ N+W +G++G +
Sbjct: 435 IVYEDFTYYISGIYQQTTYVAMGG--HAIRIIGWGEENGIPYWLIANSWNTTFGEKGFFR 492
Query: 324 ILRGKYECAFEYLIAAGKPK 343
I RG EC E + G PK
Sbjct: 493 IRRGTNECRIESEVYTGIPK 512
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 56/121 (46%), Gaps = 11/121 (9%)
Query: 157 CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKC 216
C G + + + + G VTGG YG++ C P +ISPC+ P C+
Sbjct: 70 CRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPCTMCRPYMLAPKCQ--------- 120
Query: 217 HTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV 276
R +Y +DK+ Y+V+ +E I +EI GP A F +Y DF +Y SG
Sbjct: 121 --RTCQASYNLSLKRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYISGQ 178
Query: 277 Y 277
+
Sbjct: 179 F 179
>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
Length = 260
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/253 (37%), Positives = 136/253 (53%), Gaps = 11/253 (4%)
Query: 72 DRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
+RKT D Y +P FDAR+ + +C IG V D G CA+ A F+DR CI S
Sbjct: 14 NRKTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASN 73
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
GQ LS + + SC + C GS F+ W +G VTGG++ GCQP
Sbjct: 74 GQFTDNLSAQNLMSCGD----GEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYK 129
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDA 248
PC H+G + L +C + + ++ C +C N Y + D H+T++ Y N
Sbjct: 130 NRPCDHYGDSR-LTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQ 188
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWL 307
I++EI+ +GP TA +Y++F YK G+YK T+ +L Y H KLIGWG + +GT YWL
Sbjct: 189 IQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTT-GELIGYHHV-KLIGWGVDGDGTEYWL 246
Query: 308 VINTWGPHWGDRG 320
+N+W +WG+ G
Sbjct: 247 AMNSWNSNWGNDG 259
>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
Length = 386
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/259 (35%), Positives = 134/259 (51%), Gaps = 16/259 (6%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FDARE+WP C ++ + D G C + +A A +DR C++SKG++ + +
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C + C G++ W F ++G +GG R GC P I C G
Sbjct: 185 SCCHSC----GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGECRIPG----- 235
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
E++ PK C +C + +QD+H + Y + ++E I +EI +GP A F
Sbjct: 236 ---EDEDTPK--CSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAF 290
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
Y D + YKSG+Y+H H+ KL+GWG ENG YWLV N+WG WG+ G K
Sbjct: 291 HTYLDLHAYKSGIYRHVWGPLSGG--HAVKLLGWGVENGVKYWLVANSWGREWGENGFFK 348
Query: 324 ILRGKYECAFEYLIAAGKP 342
++RG+ C E I AG P
Sbjct: 349 MVRGENHCGIEENIHAGLP 367
>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
Length = 276
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 103/261 (39%), Positives = 134/261 (51%), Gaps = 25/261 (9%)
Query: 87 RFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC 146
FDAR++WP C TIG V + G +A G F+DR CI + G N+ LSTE + SC
Sbjct: 35 EFDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISCS 94
Query: 147 KICRYDDNKSCSHGSVF--RTWNFLHKRGSVTGGD-YGDRTGCQPSTISPCSHHGSAPTL 203
I K+ ++G V W + G V+GG Y GCQPS I P + L
Sbjct: 95 GI------KASANGWVRDGLAWEYFKTHGLVSGGSIYNTNDGCQPSKIPPVCN------L 142
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKH-RTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P+ N++ C + YG + H + Y+ I+KE+ +GP TA
Sbjct: 143 PTKINKRT--------CVDYCYGNDTIKYNHDHVKVRYYYHVKPKDIQKEVQTYGPVTAA 194
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
LYDD + +KSGVY T NAK L KLIGWG ENG YWL++N+WG WG G +
Sbjct: 195 LNLYDDIFLHKSGVYTLTKNAKYVR-LQYVKLIGWGVENGVDYWLLVNSWGNEWGQNGLL 253
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KI RGKY CA E + A PK
Sbjct: 254 KIKRGKYGCAVESFVYAAVPK 274
>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
Length = 386
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/259 (35%), Positives = 134/259 (51%), Gaps = 16/259 (6%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FDARE+WP C ++ + D G C + +A A +DR C++SKG++ + +
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C + C G++ W F ++G +GG R GC P I C G
Sbjct: 185 SCCHSC----GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGECRIPG----- 235
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
E++ PK C +C + +QD+H + Y + ++E I +EI +GP A F
Sbjct: 236 ---EDEDTPK--CSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAF 290
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
Y D + YKSG+Y+H H+ KL+GWG ENG YWLV N+WG WG+ G K
Sbjct: 291 HTYLDLHAYKSGIYRHVWGPLSGG--HAVKLLGWGVENGVKYWLVANSWGREWGENGFFK 348
Query: 324 ILRGKYECAFEYLIAAGKP 342
++RG+ C E I AG P
Sbjct: 349 MVRGENHCGIEENIHAGLP 367
>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
Length = 386
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/259 (35%), Positives = 134/259 (51%), Gaps = 16/259 (6%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FDARE+WP C ++ + D G C + +A A +DR C++SKG++ + +
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C + C G++ W F ++G +GG R GC P I C G
Sbjct: 185 SCCHSC----GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGECRIPG----- 235
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
E++ PK C +C + +QD+H + Y + ++E I +EI +GP A F
Sbjct: 236 ---EDEDTPK--CSNKCRSGYNVTDVWQDRHIGRVAYSLPNDERKIMEEIFINGPVQAAF 290
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
Y D + YKSG+Y+H H+ KL+GWG ENG YWLV N+WG WG+ G K
Sbjct: 291 HTYLDLHAYKSGIYRHVWGPLSGG--HAVKLLGWGVENGVKYWLVANSWGREWGENGFFK 348
Query: 324 ILRGKYECAFEYLIAAGKP 342
++RG+ C E I AG P
Sbjct: 349 MVRGENHCGIEENIHAGLP 367
>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 103/314 (32%), Positives = 159/314 (50%), Gaps = 29/314 (9%)
Query: 46 LSEEYLRQFLIADAKYFDQSDRPLPGDRKTY----------------DPEYSATVPDRFD 89
L+ E L +L + F+ + P P + DPE + +P+ +D
Sbjct: 32 LTGEPLVAYLRKNQNLFEVNSEPTPNFEQKIMDIKFKNQKLNFVVKNDPEPNEDIPEEYD 91
Query: 90 AREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK-I 148
RE++ C T ++ D C + + A SDR CI + G++ +S+ + +CC
Sbjct: 92 PREKF-KCSTF-YIRDQANCGSCWAVSTAAAISDRICIATNGEKQVNISSTDILTCCNPQ 149
Query: 149 CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCEN 208
C + C G R W + G V+GG+Y + C+P I PC HHG+ C
Sbjct: 150 CGF----GCGGGWSIRAWEYFVYEGVVSGGEYLTKGVCRPYPIHPCGHHGNDTYYGECPR 205
Query: 209 QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDD 268
+ C +C P Y + F DK + + Y V+ E+AI++EIL HGP A+FA+Y+D
Sbjct: 206 EAATP-PCKKKC-QPGYKKIFRMDKRQGKVAYGVEPKEEAIQREILRHGPVVASFAVYED 263
Query: 269 FYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT--PYWLVINTWGPHWGDRGTVKILR 326
F YK+GVYKHT+ A L Y H+ K++GWG ++ T YWL+ N+W WG+ G + +R
Sbjct: 264 FSLYKTGVYKHTAGA-LRGY-HAVKMMGWGVDSKTKAKYWLIANSWHNDWGENGYFRFIR 321
Query: 327 GKYECAFEYLIAAG 340
G +C E +AAG
Sbjct: 322 GINDCEIEDTVAAG 335
>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 325
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 130/263 (49%), Gaps = 19/263 (7%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P DAR++WP C IG V D C + ++ +DR CI+S + LS E
Sbjct: 81 SVDLPFEMDARKRWPQCKYIGFVRDQANCGSCWAVSSASVMTDRICIESIAAKQPLLSEE 140
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCCKIC Y C G + + + RG TGG YG GC+P +I S +
Sbjct: 141 ELVSCCKICGY----GCDGGYPDKAFIYWATRGIPTGGPYGSTKGCKPYSIGSNSEDEAE 196
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
L C +C N Y QD+H YWV+ NE+ I +E+ +GP
Sbjct: 197 TPL------------CTRQCINE-YPYNLSQDRHFGEKPYWVNSNEEQIMQELYKNGPVV 243
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y+DF +Y GVY+H L H+ KLIGWG EN YWL+ N+W WG+ G
Sbjct: 244 VAFNVYEDFMYYIKGVYEHRFGKFLGG--HAVKLIGWGIENSKKYWLISNSWNTTWGENG 301
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
KI+RGK CA E + AG +
Sbjct: 302 FFKIIRGKNCCAIESYVVAGMAR 324
>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 246
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/229 (41%), Positives = 122/229 (53%), Gaps = 8/229 (3%)
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
F A A SDR CI S + + LS E + SCC+ C C+ G W+F K G
Sbjct: 26 FGASEAMSDRICIHSNAKISVELSAEDLLSCCESC----GMGCNGGYPSAAWDFWTKDGL 81
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
V+GG Y GC+P TI PC HH + + PSC + +C RC Y + QDKH
Sbjct: 82 VSGGLYDSHIGCRPYTIPPCEHHVNG-SRPSCSGEGGETPQCVYRC-EAGYTPSYKQDKH 139
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V +ED IK EI +GP F +Y+DF YK+GVY+H + + L H+ K+
Sbjct: 140 YGKTSYSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGG--HAIKI 197
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+GWG ENG PYWL N+W WG+ G KILRG C E I AG P
Sbjct: 198 LGWGEENGIPYWLCANSWNTDWGNNGFFKILRGSNHCGIESEIVAGIPN 246
>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
Length = 350
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 102/329 (31%), Positives = 151/329 (45%), Gaps = 30/329 (9%)
Query: 23 DAYIDQINR-------EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
+A ++ +N+ E ++ T+ + +SEEYL Q P +
Sbjct: 41 NALVEYVNKRQQFFQTEISSLTSSDHKARLMSEEYLTQ--------------PNLNRNEL 86
Query: 76 YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
+P+ FDARE+W C +I + D C + +A SDR CI S G+ N
Sbjct: 87 MTGLLDVEIPENFDAREKWSQCDSIRTIRDQSHCGSCWAVSAAETMSDRTCIHSDGKINV 146
Query: 136 PLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + SCC C + C G W + G TGG Y ++ C+P PC
Sbjct: 147 GLSATDILSCCGTTC----GRGCRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHPC 202
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + C + P +C C Y + DK Y + +NE AI++EI+
Sbjct: 203 GHHRNEIYYGECPKEIFPTPQCTQSC-QAGYASDYEDDKIYGKSAYALPNNEKAIQREIM 261
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWG 313
+GP A F +Y+DF Y+SG+Y HT+ + H+ KLIGWG ++G YWL N+W
Sbjct: 262 TNGPVQAAFMVYEDFSRYRSGIYVHTAGRREGG--HAVKLIGWGVDDDGNKYWLAANSWN 319
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G +I+RG C E + AG P
Sbjct: 320 SDWGENGYFRIVRGVDHCGIESAVVAGMP 348
>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
Length = 255
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 93/248 (37%), Positives = 133/248 (53%), Gaps = 11/248 (4%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
RK D Y +P FDAR+ + +C IG V D G CA+ A F+DR CI S G
Sbjct: 15 RKIVDNNYETVIPRTFDARQYFVSCSDVIGDVKDQGNCASSWAVAVASTFTDRLCIASNG 74
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
Q LS + + SC ++ C GS F+ W +G VTGG+Y GCQP
Sbjct: 75 QFTDNLSAQNLMSCGN----EEKMGCDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKN 130
Query: 192 SPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAI 249
PC H+G + +L +C + + ++ C +C N Y + D H+T++ Y N I
Sbjct: 131 RPCDHYGDS-SLTNCSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 189
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLV 308
++EI+ +GP TA +Y++F YK G+YK T+ +L Y H KLIGWG E+GT YWL
Sbjct: 190 QQEIMTYGPVTALMYVYENFMGYKKGIYKSTA-GELIGY-HHVKLIGWGVDEDGTEYWLA 247
Query: 309 INTWGPHW 316
+N+W +W
Sbjct: 248 MNSWNSNW 255
>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
Length = 332
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 115/341 (33%), Positives = 162/341 (47%), Gaps = 20/341 (5%)
Query: 2 IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
+ +V G +V + S+ I+ IN TW AGRNF S Q +
Sbjct: 8 VLFVVAAQGRLMVPSSVEPLSEEMINFINSINTTWKAGRNFDEKRSHSDCVQGGDGASVL 67
Query: 62 FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
S +Y+ + T P+ F RE W +C +I + D AC + FAA +
Sbjct: 68 TATSTS---SHFTSYEEDSRWTCPESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESI 124
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
SDR CI + G+ +S E + +CC C + + C SV L R V
Sbjct: 125 SDRICIHTNGKVQVNISAEDLLACCHTCGHGCDGRCHCSSV----AILQGRRLVPE-PVR 179
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
GCQP ++ PC +P+C + + P KC C Y + + +DKH Y
Sbjct: 180 TEDGCQPYSLPPC--------VPNCTHPE-PTPKCQHVCRK-GYEKSYEEDKHFAKNVYR 229
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
+ DAIK +I +GP + F +Y DF YKSGVY+ + +H+ K++GWGTE+
Sbjct: 230 LLKKCDAIKTDIYKNGPVESAFFVYADFPSYKSGVYQQHMIKFMG--VHAIKILGWGTED 287
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G PYWLV N+W WGD+G KILRGK EC E +I AG P
Sbjct: 288 GVPYWLVANSWNVGWGDKGYFKILRGKDECGIEEVIDAGIP 328
>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
Length = 396
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 122/358 (34%), Positives = 160/358 (44%), Gaps = 43/358 (12%)
Query: 1 MIHILVFLLGCTLVRG----ELYKFS----DAYIDQINREANTWTAGRNFPANLSEEYLR 52
MI IL F+ +L G E+ K S A +D +N ++W A EY
Sbjct: 1 MIRILNFVALASLSYGFVVQEVPKRSVLSGQALVDHVNAVQDSWKA----------EY-- 48
Query: 53 QFLIADAKYFDQSDRPLPGDRKTY---DPEYSATV--PDRFDAREQWPNCGTIGHVPDTG 107
+ AK D +P K+ D E+ + P FD+R QWPNC +I + D
Sbjct: 49 SSISMKAKTMDVRFAEVPESEKSEKSDDLEFETLIQLPTAFDSRVQWPNCNSIKLIRDQT 108
Query: 108 ACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTW 166
C + FAA SDR CI+S G Q +S E + SCC C N C G
Sbjct: 109 YCGSCWAFAAAEIISDRICIQSNGTQQPIISPEDILSCCGSSC----NNGCQGGYTIEAM 164
Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
+ G VTGGDY GC P + PCS PSC+ T C
Sbjct: 165 KYWMNSGVVTGGDY-QGAGCIPYSFRPCSTCKEPKDAPSCK----------TTCQASYKA 213
Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
+ ++ T+ V + I+ EI +GP + +YDDFYHYKSGVY H K
Sbjct: 214 KSAYRLPTTTSSNAIVANAVQMIQTEIYNNGPVEVAYQVYDDFYHYKSGVYYHVYGDKPS 273
Query: 287 NYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
H+ K+IGWGTE YWLV N+W +G+ G KI RG EC E + AG PK+
Sbjct: 274 G--HAVKIIGWGTEKKVDYWLVANSWSTTFGENGFFKIRRGTNECGIEENVVAGLPKS 329
>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 303
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 107/327 (32%), Positives = 155/327 (47%), Gaps = 60/327 (18%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
SD I IN N W A E+ R + DA++ + R P R+T P
Sbjct: 29 LSDDIISYINEHPNAGWRA---------EKSNRFHSLDDARFQLGARREEPDLRRTRRPT 79
Query: 79 ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+++ +P FD+R++WP C +I + D C + F AV A S+R CI+S G+QN
Sbjct: 80 VDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGSCCAFGAVEAMSERSCIQSGGKQN 139
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + G VTG + TGC+P C
Sbjct: 140 VELSA-----------------------------VDLEGIVTGSSKENNTGCEPYPFPKC 170
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H + P C ++ +C T C Y + QDKHR AI+KEI+
Sbjct: 171 EHF-TKGQYPPCGSKIYKTPRCKTTCQK-RYKTSYAQDKHR------------AIQKEIM 216
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP A+F +Y+DF +YKSG+YKH + L H+ ++IGWG EN TPYWL+ N+W
Sbjct: 217 KYGPVEASFTVYEDFLNYKSGIYKHITGETLGG--HAIRIIGWGVENKTPYWLIANSWNE 274
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGK 341
WG+ G +I+RG+ EC+ E + AG+
Sbjct: 275 DWGENGYFRIVRGRDECSIESEVTAGR 301
>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
Length = 341
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 151/326 (46%), Gaps = 29/326 (8%)
Query: 19 YKFSDAYIDQINREANTWTAGRN-FPANLSEEYLRQFLIADAKYFDQSDRPLP-GDRKTY 76
Y+ + + +++ A TWT G N P NL AK D LP G
Sbjct: 33 YEATISIAEKVRPLATTWTPGANPLPPNLYR--------TGAKREDLEKHRLPLGILVVK 84
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
D +P+RFDAR++WP C ++ + + G C + +A F+DR CI S+ +
Sbjct: 85 D---HIVLPERFDARDRWPECTSLKQIRNQGCCGSCWAISAAETFTDRWCIHSEDKDQFS 141
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
+ SCC C C G++ W F +RG +GG Y R GC P + C
Sbjct: 142 FGAYDLLSCCHSC----GDGCQGGNLGPAWQFWVQRGVSSGGPYNSRQGCHPYPVDVCHS 197
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
P KC +C + D+ + Y V +E+ IK+EI +
Sbjct: 198 ADEDADTP----------KCTRKCQSMYNVTNVSDDRRFGRVAYSVSQDEERIKEEIFRN 247
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
GP A+F +Y DF YK+GVY+H H+ K+IGWG ENGT YWL N+WG W
Sbjct: 248 GPVQASFDVYLDFKAYKTGVYRHVFGPMEGG--HAVKMIGWGVENGTKYWLCSNSWGEDW 305
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
G+RG KI+RG+ C E + AG P
Sbjct: 306 GERGFFKIVRGENHCGIESDVHAGLP 331
>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 414
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 114/345 (33%), Positives = 159/345 (46%), Gaps = 41/345 (11%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD---RKTYDPEY 80
+ +D+IN + NTWTA + ++ + + DAK + + D RK Y E
Sbjct: 85 SLVDEINSKQNTWTA------STGQKRFKNLSLRDAKMLCGTLKRGSNDKVIRKGYAIEE 138
Query: 81 SATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P FDAR +PNC I H+ D C + F AF+DR CIKS G LS
Sbjct: 139 LQDLPTDFDARTAFPNCSKVIRHIRDQSDCGSCWAFGVTEAFNDRLCIKSNGTFTELLSA 198
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRT---GCQPSTISP 193
+ +C + C G W+++H +G TGGDY D T GC P P
Sbjct: 199 GEMNACAP------SFGCDGGIPSLAWSWVHNKGIATGGDYLAEDDMTKDDGCWPYDFPP 252
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH----RTTLTYWVDDNEDAI 249
C+HH + P C C +C NP Y D+H Y V+D ++AI
Sbjct: 253 CAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEYSVNDAKNAI 312
Query: 250 KKE-----------ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+ + + +A+F +Y+DF Y+SGVYKHTS +L H+ K+IGWG
Sbjct: 313 RTDGPVGPIYFCDPSVNFDQVSASFIVYEDFLAYRSGVYKHTSGKELGG--HAVKIIGWG 370
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
E G YWLV+N+W WGD G KI G C + + G PK
Sbjct: 371 EETGQAYWLVVNSWNEDWGDNGLFKIALGN--CEIDDDLLGGTPK 413
>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
Length = 372
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 119/350 (34%), Positives = 160/350 (45%), Gaps = 55/350 (15%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS--DRPLPGDRKTYDPEYSAT 83
+D IN+ N+W A + + L +F + D K+ + S D PL T Y
Sbjct: 28 VDHINKIQNSWRAEYSPISELE----MKFKVMDLKFSEISPKDEPL-----TVQGVY--- 75
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
VP FDAR+ WPNC +I + + C A F A SDR CI+S G +S E +
Sbjct: 76 VPISFDARDHWPNCKSIKLIRNQAYCGACWAFGAAEIISDRICIQSGGAHQPIISVEDIL 135
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC + C G F G VTGGDY + TGCQP T PCS ++ +
Sbjct: 136 SCCG---SSCGEGCKGGYPLEGLKFWMNSGVVTGGDY-NGTGCQPYTFPPCSSCEASKST 191
Query: 204 PSCENQKVPKLKCHTRCTNPTY-----------------------------GRGFFQDKH 234
PSC+ KC T TY G+ ++
Sbjct: 192 PSCQK------KCQTGYLEATYKNDKRFENEEQDSSYMSENFYQVLIILKGGKSAYRLST 245
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
T+ D I+ EI +GP ++ +++DFY YKSGVY + S KL H+ K+
Sbjct: 246 TTSSNKISTDAIITIQTEIYNNGPVEVSYRVFEDFYQYKSGVYHYVS-GKLTG-AHAVKI 303
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
IGWGTEN YWLV N+WG +G++G KI RG EC E + AG KN
Sbjct: 304 IGWGTENKVDYWLVANSWGTDFGEKGFFKIRRGTNECGIEENVVAGLAKN 353
>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
Length = 313
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 96/285 (33%), Positives = 137/285 (48%), Gaps = 13/285 (4%)
Query: 36 WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
W +GR+ S+ + F ++ RP D +P FDAR +WP
Sbjct: 42 WISGRHSKGFESDHLIHTFGAKMETAEQKAQRPTVKHVGFDD----TRLPKNFDARSKWP 97
Query: 96 NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
+C ++ + D +C + F AV A SDR CI S G N+ LS + SCCK C +
Sbjct: 98 HCSSVSEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGSFNKSLSAVDLLSCCKDCGF---- 153
Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
C G W++ G VTGG D +GC+ C HH P C Q P +
Sbjct: 154 GCRGGYPAVAWDYWRTHGIVTGGSKEDPSGCRSYPFPKCDHHVQG-HYPPCPRQIYPTPE 212
Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
C C P G+ +DK R ++Y + +E +I KEI+ GP A F +Y+DF YKS
Sbjct: 213 CVQDCDTPEL--GYLEDKTRANISYNIYASEISIMKEIMLRGPVEAVFTVYEDFLQYKSR 270
Query: 276 VYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
VY H A + H+ +++GWG E PYWL+ N+W WG++G
Sbjct: 271 VYFHAWGAPMSG--HAIRILGWGEEGDVPYWLIANSWNEDWGEKG 313
>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 952
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 90/256 (35%), Positives = 131/256 (51%), Gaps = 9/256 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR +WP+C +I + D +C + F AV + SDR CI S G N+ LS +
Sbjct: 51 LPKSFDARTKWPHCPSISEIRDQSSCESFWAFGAVESMSDRLCIHSNGAFNKSLSATDLL 110
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC+ C C G W+F G VTGG + +GC+ C H
Sbjct: 111 SCCEDC----GLGCGAGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGHRRKG-RY 165
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P C P +C +C P + +DK R ++Y V ++ +I KEI+ +GP A+F
Sbjct: 166 PPCPRHIYPTPECIKQCDEPEVN--YEKDKTRANISYNVYPSDISIMKEIMLNGPVEASF 223
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y DF Y GVY H + H+ +++GWG ++G PYWL+ N+W WG++G V+
Sbjct: 224 GIYADFLEYNGGVYFHCWGGPISR--HAIRILGWGEDDGVPYWLIANSWNEDWGEKGYVR 281
Query: 324 ILRGKYECAFEYLIAA 339
LRG EC E + A
Sbjct: 282 FLRGHNECGIEEEVTA 297
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 102/314 (32%), Positives = 135/314 (42%), Gaps = 62/314 (19%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAR WP+C +I + D +C + F AV A SDR CI SKG N+ LS +
Sbjct: 639 LPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLV 698
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C C G W+F G VTGG TGC+ C H G
Sbjct: 699 SCCTEC----GCGCRGGYSPIAWDFWKTHGIVTGGSKEKPTGCRSYPFPSCEHRGKG-QY 753
Query: 204 PSCENQKVPKLKCHTRCTNPTYG------RGF-------FQDKH---------------- 234
P C +Q P +C RC RGF D+H
Sbjct: 754 PPCPHQLYPTPECIKRCDTKEIDYEKDKTRGFDSASSEQLADRHCFHTSNFGEASAQRTL 813
Query: 235 ------------------------RTT--LTYWVDDNEDAIKKEILAHGPTTATFALYDD 268
R+T ++Y V E A+ KEI+ GP A +Y+D
Sbjct: 814 HLTCLNFMHHSIDLLSSRLEKAVLRSTANISYNVYPAEQAVMKEIMLRGPVGAILHVYED 873
Query: 269 FYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
YKSGVY H L H +++GWG E+G PYWLV N+W WG++G +++LR +
Sbjct: 874 LLDYKSGVYFHVWGGHLGE--HGIRILGWGEEDGVPYWLVANSWNEDWGEKGYMRVLRWR 931
Query: 329 YECAFEYLIAAGKP 342
EC + AG P
Sbjct: 932 NECGIVDQVTAGLP 945
>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
Length = 512
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 107/284 (37%), Positives = 141/284 (49%), Gaps = 16/284 (5%)
Query: 67 RPLPGDRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRR 125
PLP K + D+FDARE +P C IGHV D G C + FA+ A +DR
Sbjct: 222 EPLP--VKVFAETQQVLETDKFDAREAFPQCAEVIGHVRDQGDCGSCWAFASTEALNDRF 279
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD-RT 184
CIKS G+ LS ++ SCC + + + CS G W + G VTGGDY + T
Sbjct: 280 CIKSGGRHREALSPQHTTSCCDLL-HCLSFGCSGGQPRMAWRWFSNDGVVTGGDYNELHT 338
Query: 185 G--CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG---RGFFQDKHRTTLT 239
G C P I C HH P P CE KC C Y + F D H T
Sbjct: 339 GKSCWPYEIPFCRHHSEGP-YPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSA 397
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V+ D IK+E++ +G T F +Y+DF YK GVY H + + H+ K+IG+G
Sbjct: 398 YSVE-GRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGG--HAVKVIGFGN 454
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
E+G YWL +N+W +WGD+GT KI G E + G+PK
Sbjct: 455 EDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPK 496
>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
Length = 512
Score = 170 bits (431), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 107/284 (37%), Positives = 141/284 (49%), Gaps = 16/284 (5%)
Query: 67 RPLPGDRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRR 125
PLP K + D+FDARE +P C IGHV D G C + FA+ A +DR
Sbjct: 222 EPLP--VKVFAETQQVLETDKFDAREAFPQCAEVIGHVRDQGDCGSCWAFASTEALNDRF 279
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD-RT 184
CIKS G+ LS ++ SCC + + + CS G W + G VTGGDY + T
Sbjct: 280 CIKSGGRHREALSPQHTTSCCDLL-HCLSFGCSGGQPRMAWRWFSNDGVVTGGDYNELHT 338
Query: 185 G--CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG---RGFFQDKHRTTLT 239
G C P I C HH P P CE KC C Y + F D H T
Sbjct: 339 GKSCWPYEIPFCRHHSEGP-YPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSA 397
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V+ D IK+E++ +G T F +Y+DF YK GVY H + + H+ K+IG+G
Sbjct: 398 YSVE-GRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGG--HAVKVIGFGN 454
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
E+G YWL +N+W +WGD+GT KI G E + G+PK
Sbjct: 455 EDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPK 496
>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
Length = 356
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 112/336 (33%), Positives = 157/336 (46%), Gaps = 20/336 (5%)
Query: 14 VRGELYKFSDAYIDQ-INREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD 72
+ E K S + + +N++ W A E+ R I K D+ D
Sbjct: 28 ISSEAIKLSGSDLTSYVNKKQKLWKA-ETSRMTFQEKMARAKSIKFIKSNDEVSEKTGND 86
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
D +P FD+R++WP+C IG V D C + AV SDR CI S G
Sbjct: 87 NVLVD------IPSSFDSRQKWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNGT 140
Query: 133 QNRPLSTEYVASCC----KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
N PLS + SCC IC D C + G TGG+Y D+ GC+P
Sbjct: 141 FNWPLSAQDPLSCCVGLMSIC--GDGWGCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKP 198
Query: 189 STISPCS-HHGSAPTLPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNE 246
+I PC + + T C P C CT N T+ + QDKH Y V
Sbjct: 199 YSIYPCDKKYANGTTSVPCPGYHTP--TCEEHCTSNITWPIAYKQDKHFGKAHYNVGKKM 256
Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
I+ EI+ +GP A+F +YDDF+ YK+G+Y HT+ + E + + K+IGWG +NG PYW
Sbjct: 257 TDIQIEIMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQ-EGGMDT-KIIGWGVDNGVPYW 314
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
L ++ WG +G+ G V+ LRG E E+ + A P
Sbjct: 315 LCVHQWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 350
>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
Length = 261
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/251 (37%), Positives = 129/251 (51%), Gaps = 9/251 (3%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
RKT D Y +P FDAR+ + +C IG V D G CA+ A F+DR CI S G
Sbjct: 17 RKTADINYKTDIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNG 76
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
+ LS + + SC D+ C GS ++ W F +G VTGG Y GCQP
Sbjct: 77 KFTDNLSAQNLMSCGD----DEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKN 132
Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAIK 250
PC H+G + ++ + C +C N Y + D ++T++ Y N I+
Sbjct: 133 RPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSWTNVKQIQ 192
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVI 309
+EI+ +GP TA +Y++F YK GVYK T+ +L Y H KLIGWG E G YWL +
Sbjct: 193 QEIMTYGPVTAFMYVYENFMGYKEGVYKSTA-GELIGYHHV-KLIGWGVDEAGIEYWLAM 250
Query: 310 NTWGPHWGDRG 320
N+W +WG G
Sbjct: 251 NSWNSNWGTNG 261
>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
Length = 355
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 102/270 (37%), Positives = 138/270 (51%), Gaps = 12/270 (4%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
E +P FD+R+QWP C IG V D C + AV SDR CI S G N PLS
Sbjct: 86 EVLINIPASFDSRQQWPECTQIGAVRDQSDCGSAAHLVAVEMASDRTCISSNGTFNWPLS 145
Query: 139 TEYVASCC----KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+ SCC IC D C + G TGG+Y D+ GC+P +I PC
Sbjct: 146 AQDPLSCCVGLMSIC--GDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYSIYPC 203
Query: 195 S-HHGSAPTLPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
++ + T C P C CT N T+ + QDKH Y V I+ E
Sbjct: 204 DKNYPNGTTSVPCPGYHTP--PCEDHCTSNITWPIAYKQDKHFGKAHYNVGKKMTDIQTE 261
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
I+ +GP A+F +Y+DF+ YKSG+Y HT+ + E + + K+IGWG +NG PYWL ++ W
Sbjct: 262 IMTNGPVIASFIIYEDFWDYKSGIYVHTAGDQ-EGGMDT-KIIGWGVDNGVPYWLCVHQW 319
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G +G+ G V+ILRG E E+ + A P
Sbjct: 320 GTDFGENGFVRILRGVNEVNIEHQVLAALP 349
>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
Length = 557
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 155/307 (50%), Gaps = 28/307 (9%)
Query: 60 KYFDQSDRPLPGDRKTYDPEYSAT---VPDRFDAREQWPNCGTI-GHVPDTGACAAPHIF 115
+Y + +PG R+ S++ +P FDARE +P C +I G V D C + F
Sbjct: 253 RYTKEIAPAVPGRRRLTPVAQSSSDEDIPANFDAREAFPECASIIGRVRDQSDCGSCWAF 312
Query: 116 AAVGAFSDRRCIKSKGQQNRP-------------LSTEYVASCCKICRYDDNKSCSHGSV 162
A+ AF+DRRCI G+++ LS E +CC + C+ G
Sbjct: 313 ASTEAFNDRRCIAGIGKEDAAGAEGEATADQLLVLSAEDTTACCHGFHCGLSMGCNGGQP 372
Query: 163 FRTWNFLHKRGSVTGGDYGD---RTGCQPSTISPCSHH--GSAPTLPSCENQKVPKLKCH 217
W + K G VTGGDY D T C+P PC+HH A P+C + + P +C
Sbjct: 373 GSAWKWFTKTGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEYPTPECL 432
Query: 218 TRCTNPTYGRGFF-QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV 276
+ C+ + G + +DK Y + E+ I+++++ +G TA F+++ DF Y GV
Sbjct: 433 SECSETNFSGGSYGEDKKMAREAYSLAGIEN-IQRDMMKYGSVTAAFSVFSDFLTYSGGV 491
Query: 277 YKHTSNAKLENYLHSGKLIGWGTE--NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
Y H S + + H+ K+IGWGT+ +G YWL+ N+W P WG+ G +ILRG EC E
Sbjct: 492 YTHESGSFMGG--HAVKMIGWGTDEVSGEDYWLIANSWNPSWGEGGLFRILRGVNECGIE 549
Query: 335 YLIAAGK 341
I AG+
Sbjct: 550 GQIVAGE 556
>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 341
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 101/312 (32%), Positives = 151/312 (48%), Gaps = 18/312 (5%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF-DQSDRPLPGDRKTYDPEYSA 82
A++D IN + + A + A E + I D+K+ +Q + D DP
Sbjct: 37 AFVDYINEHQSFYRAEYSPEA----EAFVKARIMDSKFLAEQKKEEVLADVYGDDP---- 88
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
PD FDAR QWP C +IG + D AC + ++ A SD C++S +S +
Sbjct: 89 --PDSFDARTQWPECRSIGTIRDQSACGSCWAVSSAEAMSDEICVQSNSTIKVMISDTDI 146
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC + D C G + ++ + G VTGG Y R C+P + PC H P
Sbjct: 147 LSCCGL---DCGYGCQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPYSFYPCGQHKDVPY 203
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
C P KC + + Y + + +DKH T +Y + +NE +I++EI +GP A
Sbjct: 204 YGPCPGGLWPTPKCR-KSSQRKYNKTYQEDKHFATRSYSLPNNERSIRQEIYKNGPVVAA 262
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+D Y G+Y H ++ H+ K+IGWG ENGT YWL+ N+W WG+ G
Sbjct: 263 FKVYED-YSSTGGIYVHKWG--IQTGAHADKVIGWGRENGTDYWLIANSWNTDWGEDGYY 319
Query: 323 KILRGKYECAFE 334
+I+R C E
Sbjct: 320 RIVRETDNCEIE 331
>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
Length = 342
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 150/320 (46%), Gaps = 16/320 (5%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
AY+D +N+ + + A + L E+Y + + +++ + ++ + D + +
Sbjct: 38 AYVDYVNQHQSFYKAEY---SPLVEQYAKA--VMRSEFMTKPNQ----NYVVKDVDLNIN 88
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDARE+WPNC +I + D C + +A SDR CI+S G S +
Sbjct: 89 LPETFDAREKWPNCTSIRTIRDQSNCGSCWAVSAASVMSDRLCIQSNGTIQSWASDTDIL 148
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C C G F + F G TGG + + C+P PC H +
Sbjct: 149 SCCWNC----GMGCDGGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYF 204
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C + P KC C Y + DK Y + +NE I +EI +GP +F
Sbjct: 205 GPCPKELWPTPKCRKMC-QLKYNVAYKDDKIYGNDAYSLPNNETRIMQEIFTNGPVVGSF 263
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+++ DF YK GVY SN +N H+ K+IGWG ++G YWL+ N+W WGD G V+
Sbjct: 264 SVFADFAIYKKGVY--VSNGIQQNGAHAVKIIGWGVQDGLKYWLIANSWNNDWGDEGYVR 321
Query: 324 ILRGKYECAFEYLIAAGKPK 343
LRG C E + G K
Sbjct: 322 FLRGDNHCGIESRVVTGTMK 341
>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
Length = 340
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 98/321 (30%), Positives = 152/321 (47%), Gaps = 22/321 (6%)
Query: 23 DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
A++D IN + A + N E + I D+K+ + P + +
Sbjct: 35 QAFVDYINEHQPFYRA--EYSPNA--EAFVKARIMDSKFLVE-----PKKEEVLTEVFGD 85
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
PD FDAR WP C +IG + D AC + ++ A SD+ C++S +S +
Sbjct: 86 DPPDSFDARAHWPECRSIGTIRDQSACGSCWAVSSAEAMSDQICVQSNRTTRVMISDTDI 145
Query: 143 ASCCKICRYDDNKSCSHGSV---FRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
SCC I SC +G + ++ + VTGG Y + C+P PC +H +
Sbjct: 146 LSCCGI-------SCGYGCEVLPIEAYRWMQRSVVVTGGKYRQKDVCKPYAFYPCGNHTN 198
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
C P KC C Y + + +DK+ T +Y++ NE +I++EI +GP
Sbjct: 199 ERYYGPCPRGLWPTPKCRKACQR-KYNKSYNEDKYFATRSYYLPSNERSIREEIYKNGPV 257
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
A F +Y DF +Y+ G+Y H + H+ K++GWG ENGT YWL+ N+W WG+
Sbjct: 258 VAAFKVYQDFSYYRGGIYVHKWGGQTG--AHAVKVVGWGRENGTDYWLIANSWNTDWGEN 315
Query: 320 GTVKILRGKYECAFEYLIAAG 340
G +I RG EC E + +G
Sbjct: 316 GYFRIARGSNECGIEGQMVSG 336
>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 355
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 96/266 (36%), Positives = 128/266 (48%), Gaps = 15/266 (5%)
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
P+ FDAR W NC +I H+ + G CAA + A +DR CI S+G S + + S
Sbjct: 97 PESFDARYHWFNCTSISHIWNQGNCAADWAISVTSAMNDRICIASQGNITALYSPQKLVS 156
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
CC+ C CS G W ++ K+G VTGGDYG GCQP + PC+ +A
Sbjct: 157 CCEDC----GNGCSGGYTAAAWRYILKKGIVTGGDYGSNEGCQPWLVQPCNASTTAADPS 212
Query: 205 S-------CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
S C KC C N + + D + + D + +K + HG
Sbjct: 213 SVLGPHGVCGGDPATTPKCDLSCYNARHEGKYLDDIIKAKKVFTFDGC--SARKNLRKHG 270
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P T +Y+DF YKSGVY H + L L S ++IGWG E G +WL+ N+WG WG
Sbjct: 271 PYVVTMRVYEDFLAYKSGVYHHVTGDYLG--LLSVRMIGWGLEGGQAFWLLANSWGTSWG 328
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
D+G KI R EC E AG P
Sbjct: 329 DKGFFKIRRFVNECWIENFRYAGVPN 354
>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 232
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 88/236 (37%), Positives = 123/236 (52%), Gaps = 7/236 (2%)
Query: 108 ACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWN 167
+C + AV A +DR CI SKG Q +S + + SCC C + C + W+
Sbjct: 4 SCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCDECGF----GCDGRDPYAAWS 59
Query: 168 FLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR 227
+ G VTG +Y ++GC+P PC HH C P C +C + Y
Sbjct: 60 YWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQD-GYSI 118
Query: 228 GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN 287
+ DKH Y V + +I+KEI+ +GP F +Y+DF HY SG+YKHT+ L
Sbjct: 119 SYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGG 178
Query: 288 YLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
H+ K++GWGTENGT YW+ N+W WG+ G +ILRG EC E + AG+PK
Sbjct: 179 --HAVKMLGWGTENGTDYWICANSWNSDWGENGFFRILRGVDECEIESGVVAGEPK 232
>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 277
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 100/266 (37%), Positives = 133/266 (50%), Gaps = 11/266 (4%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
E +P+ FDARE W +C +I + D C + F A A SDR CI +KG+ +S
Sbjct: 20 EIPEDLPESFDAREAWSHCDSIHLIRDQSTCGSCRAFGATEAMSDRICIHTKGRVQVNIS 79
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ + +CC C C G W++ G VTGG YG GCQP PC HH
Sbjct: 80 AQDLLTCCHQC----GMGCFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPPCEHHT 135
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P LP+C + K P KC C Y + + +DK+ Y + +E IK EI +GP
Sbjct: 136 KGP-LPNCTDTK-PTPKCLQVCRK-GYEKSYSEDKYFAKTVYSLHSDETQIKTEIYKNGP 192
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F++Y DF YKSGVY+ S E + + +GW + + WLV N+W WGD
Sbjct: 193 VEADFSVYTDFLAYKSGVYQRHS---YELWEARHQNLGWALKRRS-VWLVANSWNQDWGD 248
Query: 319 RGTVKILRGKYECAFEYLIAAGKPKN 344
+G KI RG EC E I AG PK
Sbjct: 249 KGYFKIRRGNNECGIENDINAGIPKE 274
>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
Length = 340
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 119/352 (33%), Positives = 160/352 (45%), Gaps = 35/352 (9%)
Query: 1 MIHILVFLLGCTLVRGELYKFSD------AYIDQINREA-NTWTAGRN---FPANLSEEY 50
++ + LL T V G K SD +++ +IN +A WTA + + S E
Sbjct: 11 LVAVFAVLLA-TTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLEE 69
Query: 51 LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
+R+ + D S +P + D E +P+ FDA E WP C TI + D C
Sbjct: 70 VRKLM----GVTDMSTEAVPPRNFSVD-EMQQDLPEFFDAAEHWPMCVTISEIRDQSNCG 124
Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
+ AAV A SDR C G +R +ST + SCC IC + C G W +
Sbjct: 125 SCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGF----GCYGGIPTMAWLWWV 179
Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
G T CQP PCSHHG++ P C N KC+T C
Sbjct: 180 WVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEMDL--- 229
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
K++ +Y V E + E++ +GP T +Y DF YKSGVYKH S L H
Sbjct: 230 -VKYKGGTSYSVK-GEKELMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGG--H 285
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ KL+GWGT+ G PYW + N+W WGD+G I RG EC E AG P
Sbjct: 286 AVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337
>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
Length = 245
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 94/244 (38%), Positives = 126/244 (51%), Gaps = 10/244 (4%)
Query: 101 GHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSH 159
G P + C F AV A SDR CI + + +S E + +CC +C C+
Sbjct: 3 GAGPLSIPCRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNG 58
Query: 160 GSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTR 219
G WNF ++G V+GG Y GC+P +I PC HH + P PK C
Sbjct: 59 GYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKI 116
Query: 220 CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH 279
C P Y + QDKH +Y V ++E I EI +GP F++Y DF YKSGVY+H
Sbjct: 117 C-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH 175
Query: 280 TSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
+ + H+ +++GWG ENGTPYWLV N+W WGD G KILRG+ C E + A
Sbjct: 176 VTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVA 233
Query: 340 GKPK 343
G P+
Sbjct: 234 GIPR 237
>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
Length = 340
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 119/352 (33%), Positives = 159/352 (45%), Gaps = 35/352 (9%)
Query: 1 MIHILVFLLGCTLVRGELYKFSD------AYIDQINREA-NTWTAGRN---FPANLSEEY 50
++ + LL T V G K SD +++ +IN +A WTA + S E
Sbjct: 11 LVAVFAVLLA-TTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVTGKSLEE 69
Query: 51 LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
+R+ + D S +P + D E +P+ FDA E WP C TI + D C
Sbjct: 70 VRKLM----GVTDMSTEAVPPRNFSVD-EMQQDLPEFFDAAEHWPMCVTISEIRDQSNCG 124
Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
+ AAV A SDR C G +R +ST + SCC IC + C G W +
Sbjct: 125 SCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGF----GCYGGIPTMAWLWWV 179
Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
G T CQP PCSHHG++ P C N KC+T C
Sbjct: 180 WVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEMDL--- 229
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
K++ +Y V E + E++ +GP T +Y DF YKSGVYKH S L H
Sbjct: 230 -VKYKGGTSYSVK-GEKELMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGG--H 285
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ KL+GWGT+ G PYW + N+W WGD+G I RG EC E AG P
Sbjct: 286 AVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337
>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 333
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 113/345 (32%), Positives = 160/345 (46%), Gaps = 23/345 (6%)
Query: 2 IHILVFLLGCTLVRGELYKFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAK 60
I+ L++L G L + + SD I IN + W A + + F +
Sbjct: 8 IYFLIYLNGYNLKQFNI--LSDELIQYINNYPSAGWKASKQNRFKSISDVYNTFGYYGIR 65
Query: 61 YFDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+F + G T E + +PD FD+REQW +C +I + D C + A+
Sbjct: 66 HFRK------GILSTISHEDENIQLPDYFDSREQWKDCPSINIIHDQSKCDSGWAVASAA 119
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
+ SDR CI++ G LS + SC K + C G +W++ K G VTG
Sbjct: 120 SISDRTCIQTNGTMKVQLSAIELISCSK-----NKLGCQIGFSEFSWDYWLKNGLVTG-- 172
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
D TGC P C H S+ + P C C C + Y + DKH +
Sbjct: 173 --DPTGCLPYPFPKCDHR-SSNSYPKCGYITYTAPPCTKTCRS-GYPIPYKADKHYGRVI 228
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + NE I+KEI+ +GP A ++ DF +YKSGVY+H + + +HS ++IGWG
Sbjct: 229 YSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVT--IHSVRIIGWGI 286
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
EN PYWL N+W WG G KILRG EC E + AGK N
Sbjct: 287 ENDIPYWLCANSWNEDWGLNGYFKILRGSNECEIESFVNAGKVDN 331
>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
gi|1586011|prf||2202319A cathepsin B-like Cys protease
Length = 340
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 160/351 (45%), Gaps = 33/351 (9%)
Query: 1 MIHILVFLLGCTLVR-----GELYKFSDAYIDQINREAN-TWTAGRN---FPANLSEEYL 51
++ + V LL T+ ++ +++ + N +A WTA + S E +
Sbjct: 11 LVAVFVVLLATTVSALYAKPSDIPLLGKSFVAETNSKAKGQWTASADNGHLVTGKSLEEV 70
Query: 52 RQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAA 111
R+ + + S +P R E +P+ FDA E+WP C TIG + D C +
Sbjct: 71 RKLMGVTS----MSTEAVP-PRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGS 125
Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHK 171
AAV A SDR C S G +R +ST + SCC IC + C G W +
Sbjct: 126 CWAIAAVEAMSDRYCTMS-GIPDRRISTTNLLSCCFICGF----GCYGGIPAMAWLWWVW 180
Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ 231
G T CQP PCSHHG++ P C N KC+T C N
Sbjct: 181 VGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKCNTTCDNVE----MEL 229
Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
K++ +Y + E + E++ +GP +Y DF YKSGVYKH S L H+
Sbjct: 230 VKYKGVSSYSIK-GERELDHELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGG--HA 286
Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
KL+GWG ++G PYW + N+W WGD+G I RG EC E AGKP
Sbjct: 287 VKLVGWGVKDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337
>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 340
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 160/351 (45%), Gaps = 33/351 (9%)
Query: 1 MIHILVFLLGCTLVR-----GELYKFSDAYIDQINREAN-TWTAGRN---FPANLSEEYL 51
++ + V LL T+ ++ +++ + N +A WTA + S E +
Sbjct: 11 LVAVFVVLLATTVSALYAKPSDIPLLGKSFVAETNSKAKGQWTASADNGHLVTGKSLEEV 70
Query: 52 RQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAA 111
R+ + + S +P R E +P+ FDA E+WP C TIG + D C +
Sbjct: 71 RKLMGVTS----MSTEAVP-PRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGS 125
Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHK 171
AAV A SDR C S G +R +ST + SCC IC + C G W +
Sbjct: 126 CWAIAAVEAMSDRYCTMS-GIPDRRISTTNLLSCCFICGF----GCYGGIPAMAWLWWVW 180
Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ 231
G T CQP PCSHHG++ P C N KC+T C N
Sbjct: 181 VGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKCNTTCDNVE----MEL 229
Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
K++ +Y + E + E++ +GP +Y DF YKSGVYKH S L H+
Sbjct: 230 VKYKGVSSYSIK-GERELMVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGG--HA 286
Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
KL+GWG ++G PYW + N+W WGD+G I RG EC E AGKP
Sbjct: 287 VKLVGWGVKDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337
>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
Length = 309
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 104/265 (39%), Positives = 133/265 (50%), Gaps = 12/265 (4%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +PD FD+R +WPNC TI + D G+C A FAA A SDR CI S ++ S
Sbjct: 48 SENLPDEFDSRVRWPNCPTIREIRDQGSCGACWAFAAAEAMSDRVCIHSSQTKHFHFSAL 107
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC C K C W+ K G V+GG YG + GCQP + PC HH +
Sbjct: 108 NLLSCCDSCE----KGCLGCDHHLAWDHWVKHGIVSGGSYGSKEGCQPYHLPPCEHHRAG 163
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV-DDNEDAIKKEILAHGPT 259
P + P R P Y + D H Y + NE I+ EI +GP
Sbjct: 164 PRRNCTKYGPTPSC---ARVCQPDYKISYEDDLHFGKQWYALAPHNEKIIRTEIFHNGPV 220
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE--NGTPYWLVINTWGPHWG 317
AT A Y+DFY Y+SG+Y H + + H+ K+IGWGT+ TPYWLV N++ WG
Sbjct: 221 EATMAAYEDFYTYESGIYHHIEGTFVCD--HAVKIIGWGTDKKTNTPYWLVANSFNTDWG 278
Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
+ G KI RG EC E I AG P
Sbjct: 279 EYGFFKIKRGVNECGIENKITAGIP 303
>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
Length = 348
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 153/307 (49%), Gaps = 21/307 (6%)
Query: 42 FPANLSEEYL---RQFLIADAKYFDQS---DRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
F A S E + RQFL+ ++ ++S + LP T + + +P+ FD+RE+W
Sbjct: 53 FKAKYSPEVVKKRRQFLL-KPQFIERSYNQENVLPIANITSNDD----IPESFDSREKWK 107
Query: 96 NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDN 154
+C ++ +PD C + +A SDR CI S+G++ LS + +CC K C Y
Sbjct: 108 DCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGY--- 164
Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL 214
C G R W + G VTGG Y ++ C+P C H +C +
Sbjct: 165 -GCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAH-KGKAFNNCPSHPYATP 222
Query: 215 KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
C C YG+ + DK + YW+ ++E I+ EI+ GP ATF +Y+DF HY+
Sbjct: 223 ACKPYCQY-GYGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDFEHYEG 281
Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG-DRGTVKILRGKYECAF 333
GVY HT+ A HS K+IGWG + G YWL+ N+W WG D G +++RG C
Sbjct: 282 GVYIHTAGAMEGG--HSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVVRGINNCDI 339
Query: 334 EYLIAAG 340
E + AG
Sbjct: 340 EGGVLAG 346
>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
Length = 358
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 153/320 (47%), Gaps = 18/320 (5%)
Query: 29 INREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRF 88
+N W A E+ R + D K+ + + GD + + + +P F
Sbjct: 45 VNNHQKLWKA-ETSRMTFQEKMAR---VKDIKFIKSHEDQMVGDSE--NNQVLLDIPTYF 98
Query: 89 DAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-- 146
D+R++WP C IG V D C + AV SDR CI S G N PLS + SCC
Sbjct: 99 DSRQKWPECTQIGAVRDQSDCGSAAHLVAVELASDRTCIFSNGTFNWPLSAQDPLSCCVG 158
Query: 147 --KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS-HHGSAPTL 203
IC D C + G TGG+Y D+ GC+P +I PC + + T
Sbjct: 159 LMSIC--GDGWGCDGSWPKDILKWWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYPNGTTS 216
Query: 204 PSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
C P C CT N T+ + QDKH Y V I+ EI+ +GP A+
Sbjct: 217 VPCPGYHTP--TCEEHCTSNITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIAS 274
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +YDDF+ YKSG+Y HT+ + E + + K+IGWG ++G PYWL ++ WG +G+ G V
Sbjct: 275 FVIYDDFWDYKSGIYVHTAGDQ-EGGMDT-KIIGWGVDSGVPYWLCVHQWGTDFGENGFV 332
Query: 323 KILRGKYECAFEYLIAAGKP 342
+ LRG E E+ + A P
Sbjct: 333 RFLRGVNEVNIEHQVLAALP 352
>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
Length = 345
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/352 (33%), Positives = 159/352 (45%), Gaps = 35/352 (9%)
Query: 1 MIHILVFLLGCTLVRGELYKFSD------AYIDQINREA-NTWTAGRN---FPANLSEEY 50
++ + LL T V G K SD +++ +IN +A WTA + + S E
Sbjct: 16 LVAVFAVLLA-TTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLEE 74
Query: 51 LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
+R+ + D S +P R E +P+ FDA E WP C TI + D C
Sbjct: 75 VRKLM----GVTDMSTEAVP-PRNFSVVEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCG 129
Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
+ AAV A SDR C G +R +ST + SCC IC + C G W +
Sbjct: 130 SCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGF----GCYGGIPTMAWLWWV 184
Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
G T CQP PCSHHG++ P C N KC+T C
Sbjct: 185 WVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEMDL--- 234
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
K++ +Y V E + E++ +GP T +Y DF YKSGVYKH S L H
Sbjct: 235 -VKYKGGTSYSVK-GEKELMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGG--H 290
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ KL+GWGT+ G PYW + N+W WGD+G I RG EC E AG P
Sbjct: 291 AVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 342
>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 152/307 (49%), Gaps = 21/307 (6%)
Query: 42 FPANLSEEYL---RQFLIADAKYFDQS---DRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
F A S E + RQFL+ ++ ++S + LP T + + +P+ FD+RE+W
Sbjct: 53 FKAKYSPEVVKKRRQFLL-KPQFIERSYNQENVLPVANITSNDD----IPESFDSREKWK 107
Query: 96 NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDN 154
+C ++ +PD C + +A SDR CI S+G++ LS + +CC K C Y
Sbjct: 108 DCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGY--- 164
Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL 214
C G R W + G VTGG Y ++ C+P C H +C +
Sbjct: 165 -GCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAH-KGKAFNNCPSHPYATP 222
Query: 215 KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
C C YG+ + DK + YW+ ++E I+ EI+ GP ATF +Y+DF HY
Sbjct: 223 ACKPYCQY-GYGKRYENDKIKAKTWYWLPNDERTIQLEIMKKGPVHATFNIYEDFEHYNG 281
Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG-DRGTVKILRGKYECAF 333
GVY HT+ A HS K+IGWG + G YWL+ N+W WG D G +++RG C
Sbjct: 282 GVYIHTAGAMEGG--HSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVVRGINNCDI 339
Query: 334 EYLIAAG 340
E + AG
Sbjct: 340 EGGVLAG 346
>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 157/345 (45%), Gaps = 45/345 (13%)
Query: 8 LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRN-FPANLSEEYLRQFLIADAKYFDQS 65
L+G L + I +N N WTAG N + AN + E + L
Sbjct: 28 LVGAAKAEHSLGIIQEDIIQTVNDHPNAGWTAGHNPYFANYTIEQFKHIL---------G 78
Query: 66 DRPLPGDRKTYDP----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
+P P P SA +P FDAR QW +C TIG++ D G C A FAAV +
Sbjct: 79 VKPTPPGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESL 138
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--G 178
DR CI + LS + +CC +C C+ G W + + G VT
Sbjct: 139 QDRFCIHL--NMSVSLSVNDLLACCGFLC----GSGCNGGYPISAWRYFRRSGVVTEECD 192
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y D+TGCQ H G P P+ KCH +C + + ++KH +
Sbjct: 193 PYFDQTGCQ--------HPGCEPAYPT--------PKCHRKCK--VENQVWKKNKHFSVN 234
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
Y V N I E+ +GP F +Y+DF HYKSGVYKH + + H+ KLIGWG
Sbjct: 235 AYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGG--HAVKLIGWG 292
Query: 299 TEN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
T + G YWL+ N W WGD G KI+RGK EC E + AG P
Sbjct: 293 TSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMP 337
>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
Length = 325
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 105/290 (36%), Positives = 139/290 (47%), Gaps = 17/290 (5%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S I IN EANT W AG + R +Q + G T +
Sbjct: 36 LSKELIHFINYEANTTWKAGPTRRFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLN-- 93
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P FDAR++W +C +I + D +C + F AV A SDR CI+SKG+ LS
Sbjct: 94 ---ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSA 150
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
E + SCC C C+ G W + +G VTG Y GCQP PC HH
Sbjct: 151 ENLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTL 206
Query: 200 APTLPSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P LP C+ + + P K R Y + DK + Y V N++AI KE++ HGP
Sbjct: 207 GP-LPVCDGDVETPPCK---RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGP 262
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLV 308
F +Y DF +YKSGVY+H S A L H+ +L+GWG EN PYWL+
Sbjct: 263 VEVDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLI 310
>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
Length = 205
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 107/188 (56%), Gaps = 3/188 (1%)
Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
SC G + W + K G VTGG Y + GC+P +I+PC + T P C P K
Sbjct: 13 SCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPK 72
Query: 216 CHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
C CT N TY G+ QDKH Y V + I+ EILAHGP F +Y+DFY Y +
Sbjct: 73 CVEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTT 132
Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
GVY HT+ L H+ K++GWG +NGTPYWLV N+W +WG++G +I+RG EC E
Sbjct: 133 GVYVHTAGKSLGG--HAVKILGWGVDNGTPYWLVANSWNVNWGEKGYFRIIRGLNECGIE 190
Query: 335 YLIAAGKP 342
+ AG P
Sbjct: 191 HSAVAGLP 198
>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 830
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 119/373 (31%), Positives = 169/373 (45%), Gaps = 76/373 (20%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF------DQSDRPLPGDRKTYD 77
+ +D+IN + NTWTA + ++ + + DAK +D+ + +K Y
Sbjct: 480 SLVDEINSKQNTWTA------STGQKRFKNLSLRDAKMLCGTLMRGSNDKAI---KKGYA 530
Query: 78 PEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
E +P FDAR +PNC IGH+ D AC + F AF+DR CIKS G
Sbjct: 531 IEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGTFTEL 590
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRT---GCQPST 190
LS + +C + C+ G W+++H +G TGGDY D T GC P
Sbjct: 591 LSAGEMNACAP------SHGCNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYD 644
Query: 191 ISPCSHHGSAPTLP-----SCENQKVPKL----------------KCHTRCTNPTYGRGF 229
PC+HH + P SC + P C +C NP Y
Sbjct: 645 FPPCAHHINDTKYPECPKVSCSGESPPATAETATVIAYQNSYETPNCAEQCHNPKYTTTL 704
Query: 230 FQDKH----RTTLTYWVDDNEDAIKKEILAHGPT---------------TATFALYDDFY 270
D+H + Y V+D ++AI+ + GP +A+F++Y+DF
Sbjct: 705 RDDRHFMLESSPYQYSVNDAKNAIRTD----GPVGPIYFCDPNVNFDQVSASFSVYEDFL 760
Query: 271 HYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYE 330
YKSGVYKHTS L H+ K+IGWG E+G YW+V+N+W WGD G KI G
Sbjct: 761 AYKSGVYKHTSGEYLGG--HAVKIIGWGEESGQAYWIVVNSWNEDWGDHGLFKIALGN-- 816
Query: 331 CAFEYLIAAGKPK 343
C + + G PK
Sbjct: 817 CGIDDNLLGGTPK 829
>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
Length = 325
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 134/268 (50%), Gaps = 14/268 (5%)
Query: 50 YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
Y + I D + ++ P+ GD + +P+ FDAR WPNC ++ H+ D C
Sbjct: 64 YDIEHRIMDLSFIGENREPIVGDEN----DEGDDIPESFDARTHWPNCSSLTHIRDQANC 119
Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
+ + A SDR CI + G + +S + +CC C Y C G W ++
Sbjct: 120 GSCWAVSTAAALSDRICISTNGTKQVNISATDILTCCYKCGY----GCQGGWPIEAWEYV 175
Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQ-KVPKLKCHTRCTNPTYGRG 228
+ G+VTGG ++ C+ PC HHG+ C + + P KC T CT P Y
Sbjct: 176 AREGAVTGGRLLAKSCCRSHPFPPCGHHGNETYYGECGGRARTP--KCRTSCT-PGYKNS 232
Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
+ DK R Y + ++ AI++EI+ +GP A F +Y DF +YK G+YKHT+ +
Sbjct: 233 YSDDKIRGKDAYELPNSVKAIQREIMKNGPVVAAFTVYADFSYYKKGIYKHTAGRARGS- 291
Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHW 316
H+ K+IGWG E PYW+V N+W W
Sbjct: 292 -HAVKVIGWGEEGDVPYWIVKNSWHNDW 318
>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 337
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 153/328 (46%), Gaps = 17/328 (5%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
L+K + N +TW AG NF +L + + D L TYD
Sbjct: 24 LFKVNQIIQLVNNIPKHTWKAGINFHPSLLTNVSHLMGVVPWNKLSEKDILL-----TYD 78
Query: 78 PEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
++P+ +D + W C ++ + D C + + AFSDR CI S N+
Sbjct: 79 VSIDLESLPESYDITQTWSECKSVVSIRDQSNCGSCWALSTASAFSDRLCITSNMGVNKV 138
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
LS EY+ SCC + H + W ++ K G TGG+YG GCQP +I PC
Sbjct: 139 LSGEYINSCCNGKCGNGCNG-GHPE--KAWKYIKKNGLCTGGEYGSNEGCQPYSIVPCPR 195
Query: 197 HGSAPTLPSCENQKVPKLKCHT-RCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
+ ++ S EN+ P+ C+ +CTN Y D + Y V + I E+
Sbjct: 196 NANSC---SKENEDTPQ--CYKDQCTNNNYETPLVSDLYYAYKVYSVKPKPEIIMSEVFK 250
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP A +YDDF YK G+Y++T+ + H+ K++GWG ++G YWL NTWG
Sbjct: 251 NGPVVAAMKVYDDFLCYKGGIYQYTTGGLKGD--HAVKIMGWGEDDGIDYWLCANTWGNS 308
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WG G KI RG+ EC E I G PK
Sbjct: 309 WGMGGMFKIRRGRNECGIENRITGGLPK 336
>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/343 (32%), Positives = 156/343 (45%), Gaps = 32/343 (9%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
LV L L+ + + ++D+IN+ W A N ++ A+A+
Sbjct: 14 LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEARRLT 66
Query: 64 ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
Q LP R T + + +P+ FD+ E+WPNC TI + D AC + +
Sbjct: 67 GARIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTAS 125
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C QQ R +S ++ SCCK C Y C G W + G +
Sbjct: 126 AISDRHCTVGGVQQLR-ISAAHLLSCCKDCGY----GCDGGYPDAAWRYYVSHGLAS--- 177
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ CQP C HHG P C KC+T CT+ + K+R +
Sbjct: 178 ----SYCQPYPFPHCDHHGGKGKKPPCSKYDFHTPKCNTTCTD----KAIPLIKYRGNHS 229
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V ED K+E+ +GP F +Y DF+ YK+GVY+H S L H+ +++GWG
Sbjct: 230 YEVHGEED-YKRELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGG--HAVRIVGWGK 286
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
NGTPYW + N+W WG G ILRGK EC E+ AG P
Sbjct: 287 LNGTPYWKIANSWDTDWGMNGHFLILRGKDECGIEHQGYAGSP 329
>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 382
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 108/332 (32%), Positives = 160/332 (48%), Gaps = 47/332 (14%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF------DQSDRPLPGDRKTYD 77
+ +D+IN + NTWTA + ++ + + DAK +D+ + +K Y
Sbjct: 85 SLVDEINSKQNTWTA------STGQKRFKNLSLRDAKMLCGTLMRGSNDKAV---KKGYA 135
Query: 78 PEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
E +P FDAR +PNC IGH+ D AC + F AF+DR CIKS G
Sbjct: 136 IEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGAFTEL 195
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
LS + +C C G + W+++H +G TG G +P +S
Sbjct: 196 LSAGEMNACTLFF------GCGGGDPYSAWSWVHDKGIATG------EGSRPKRVS---- 239
Query: 197 HGSAPTLPSCENQKV-PKLKCHTRCTNPTYGRGFFQDKHRTTLT----YWVDDNEDAIKK 251
+ +P Q + P C +C NP Y D+H + Y V+D ++AI+
Sbjct: 240 --ESEAIPVIAYQDIYPTPNCVEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRT 297
Query: 252 EILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINT 311
+ GP +A+F +Y+DF YKSGVYKHTS + L H+ K+IGWG ++G YWL +N+
Sbjct: 298 D----GPVSASFTVYEDFLAYKSGVYKHTSGSYLGG--HAVKIIGWGEKSGQAYWLAVNS 351
Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
W WGD+G KI G C + + G PK
Sbjct: 352 WNEDWGDKGLFKIALGN--CGIDDDLLGGTPK 381
>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 313
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 103/273 (37%), Positives = 137/273 (50%), Gaps = 24/273 (8%)
Query: 74 KTYDPEYSATVPD--RFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
KT +P+Y D FDAR++WP C TIG V + G A +AA G +DR CI + G
Sbjct: 60 KTRNPKYVIDNRDYKEFDARKRWPKCKTIGEVHNEGNFAFGWAYAAAGVLADRTCIATNG 119
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
N+ LSTE + SC I + N + + S+ W +L G V+GG Y GCQP
Sbjct: 120 GYNKLLSTEELISCSGI--KETNGNVNERSI---WEYLKSHGVVSGGKYNSNDGCQPFKF 174
Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH-RTTLTYWVDDNEDAIK 250
P A L + HT C + YG H + + I+
Sbjct: 175 PPI-----ANILTHLQ---------HT-CDDHCYGNTSINYNHDHVRVRNYYTIRTGYIQ 219
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
KE+ +GP F + DDF YKSGVY + NAK+ ++ KLIGWG ENG YWLVIN
Sbjct: 220 KEVQTYGPVAVQFKVCDDFLLYKSGVYVKSDNAKVIRTQYA-KLIGWGVENGVDYWLVIN 278
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+WG WG +G KI RG +C E ++ AG P+
Sbjct: 279 SWGHEWGQKGLFKIKRGTNQCGVESVVYAGVPE 311
>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 84/245 (34%), Positives = 134/245 (54%), Gaps = 9/245 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FD+RE W NC +I ++ D C + +A SDR C++SKG+ + +S +
Sbjct: 95 IPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP-T 202
+CC + + C+ G + W ++ + G VTGG Y ++ C+P + PC +HG +
Sbjct: 155 ACCG---RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWS 211
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P + + P C C YG+ + +DK Y +D++E AI++E++ +GP A
Sbjct: 212 CPRDHSFRTPA--CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAA 268
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F Y+DF Y G+Y HT + H+ K++GWG ENGT YW V N+W WG+ G
Sbjct: 269 FITYEDFSFYTKGIYVHTRGR--QRGAHAVKVVGWGVENGTKYWNVANSWSTDWGENGYF 326
Query: 323 KILRG 327
+ILRG
Sbjct: 327 RILRG 331
>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
Length = 340
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 159/352 (45%), Gaps = 35/352 (9%)
Query: 1 MIHILVFLLGCTLVRGELYKFSD------AYIDQINREA-NTWTAGRN---FPANLSEEY 50
++ + LL T V G K SD +++ +IN +A WTA + + S E
Sbjct: 11 LVAVFAVLLA-TTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLEE 69
Query: 51 LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
+R+ + D S +P + D E +P+ FDA E WP C TI + D C
Sbjct: 70 VRKLM----GVTDMSTEAVPPRNFSVD-EMQQDLPEFFDAAEHWPMCVTISEIRDQSNCG 124
Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
+ AAV A SDR C G +R +ST + SCC IC + C G W +
Sbjct: 125 SCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGF----GCYGGIPTMAWLWWV 179
Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
G T CQP PCSHHG++ P C N KC+T C
Sbjct: 180 WVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEMDL--- 229
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
K++ +Y V E + E++ +GP T +Y DF YKSG YKH S L H
Sbjct: 230 -VKYKGGTSYSVK-GEKELMIELMTNGPLEVTMQVYSDFVGYKSGGYKHVSGDLLGG--H 285
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ KL+GWGT+ G PYW + N+W WGD+G I RG EC E AG P
Sbjct: 286 AVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337
>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 217
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 94/222 (42%), Positives = 117/222 (52%), Gaps = 9/222 (4%)
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
SDR CI +KG+ +S E + +CC C C+ G W F G VTGG YG
Sbjct: 1 SDRICIHTKGKVQVNISAEDLLTCCDSC----GSGCNGGYPSAAWQFYKDEGIVTGGLYG 56
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
GCQP PC HH P LP+C K P +C C Y + + +DKH Y
Sbjct: 57 TEDGCQPYYFPPCEHHTVGP-LPNCTGIK-PTPECAKTCRE-GYEKSYTRDKHFGKKVYS 113
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
+ +E IK EI +GP A F +Y DF YKSGVY+ S L H+ +++GWGTE+
Sbjct: 114 ISSDETQIKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGG--HAIRILGWGTED 171
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G PYWLV N+W WGD+G KI RG EC E I AG PK
Sbjct: 172 GVPYWLVANSWNEDWGDKGYFKIRRGNDECGIENDINAGIPK 213
>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
Length = 332
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 84/245 (34%), Positives = 134/245 (54%), Gaps = 9/245 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FD+RE W NC +I ++ D C + +A SDR C++SKG+ + +S +
Sbjct: 95 IPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP-T 202
+CC + + C+ G + W ++ + G VTGG Y ++ C+P + PC +HG +
Sbjct: 155 ACCG---RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWS 211
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P + + P C C YG+ + +DK Y +D++E AI++E++ +GP A
Sbjct: 212 CPRDHSFRTPA--CKKYC-QYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAA 268
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F Y+DF Y G+Y HT + H+ K++GWG ENGT YW V N+W WG+ G
Sbjct: 269 FITYEDFSFYTKGIYVHTRGR--QRGAHAVKVVGWGVENGTKYWNVANSWSTDWGEDGYF 326
Query: 323 KILRG 327
+ILRG
Sbjct: 327 RILRG 331
>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 118/345 (34%), Positives = 156/345 (45%), Gaps = 45/345 (13%)
Query: 8 LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRN-FPANLSEEYLRQFLIADAKYFDQS 65
L+G L + I +N N WTAG N + AN + E + L
Sbjct: 28 LVGAAKAEHSLGIIQEDIIQTVNDHPNAGWTAGHNPYFANYTIEQFKHIL---------G 78
Query: 66 DRPLPGDRKTYDP----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
+P P P SA +P FDAR QW +C TIG++ D G C A FAAV +
Sbjct: 79 VKPTPPGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESL 138
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--G 178
DR CI + LS + +CC +C C+ G W + + G VT
Sbjct: 139 QDRFCIHL--NMSVSLSVNDLLACCGFLC----GSGCNGGYPISAWRYFRRSGVVTEECD 192
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y D+TGCQ H G P P+ KCH +C + + ++KH +
Sbjct: 193 PYFDQTGCQ--------HPGCEPAYPT--------PKCHRKCK--VENQVWKKNKHSSVN 234
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
Y V N I E+ +GP F +Y+DF HYKSGVYKH + + H+ KLIGWG
Sbjct: 235 AYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGG--HAVKLIGWG 292
Query: 299 TEN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
T + G YWL+ N W WG G KI+RGK EC E + AG P
Sbjct: 293 TSDAGEDYWLLANQWNRGWGGDGYFKIIRGKNECGIEEDVTAGMP 337
>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 308
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 103/275 (37%), Positives = 138/275 (50%), Gaps = 26/275 (9%)
Query: 74 KTYDPEYSATVPD--RFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
KT +P+Y D FDAR++WP C TIG V + G A +A G +DR CI + G
Sbjct: 53 KTRNPKYVIDNRDYKEFDARKRWPKCKTIGEVHNEGNFALGWAYAVAGVLADRTCIATNG 112
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
N+ LSTE + SC I + ++ S S+ W +L G V+GG Y GCQP
Sbjct: 113 GYNKLLSTEELISCSGI-KENNGSVPSERSI---WEYLKSHGVVSGGKYNSNDGCQPFKF 168
Query: 192 SPCSHHGSAPTLPSCENQKVPK-LKCHTRCTNPTYGRGF--FQDKHRTTLTYWVDDNEDA 248
P ++ +PK L HT C + YG + H Y+ D
Sbjct: 169 PPIAN--------------IPKHLHKHT-CDDHCYGNSTINYNHDHVRVRNYYTIRTRD- 212
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLV 308
I+KE+ +GP F + DDF+ YKSGVY + AK ++ KLIGWG ENG YWLV
Sbjct: 213 IQKEVQTYGPVVVRFMVCDDFFLYKSGVYAKSDKAKGIRTQYA-KLIGWGVENGVDYWLV 271
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
IN+WG WG +G KI G +C E + AG P+
Sbjct: 272 INSWGHEWGQKGLFKIKSGTNQCGVESFVYAGLPE 306
>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
Length = 225
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 125/221 (56%), Gaps = 9/221 (4%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FD+R QWP C TI + D G+C + F AV A SDR CI SKG+ N +S E +
Sbjct: 13 LPENFDSRTQWPKCPTIQEIRDQGSCGSCWAFGAVEAISDRVCIHSKGKVNVEISAEDLL 72
Query: 144 SCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC + C + C+ G WNF + G V+GG + GC+P TI PC HH + +
Sbjct: 73 SCCGMECGF----GCNGGYPSGAWNFWTETGLVSGGLFKSHIGCRPYTIPPCEHHVNG-S 127
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
PSC ++ KC +C Y +F+DKH + +Y V NE I+ EI +GP
Sbjct: 128 RPSCTGEEGDTPKCVMQC-EAGYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGA 186
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
F +Y+DF YKSGVYKH + + H+ +++GWG E+GT
Sbjct: 187 FTVYEDFLQYKSGVYKHVTGDAVGG--HAIRILGWGVESGT 225
>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
Length = 348
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 105/317 (33%), Positives = 151/317 (47%), Gaps = 42/317 (13%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
I +IN +W AG N ++ L D Y Q T + + ++P
Sbjct: 29 IQEINSRQTSWKAGTN-SLDIKSRLGFLGLHPDPDYKIQ----------TKHHKIAKSIP 77
Query: 86 DRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
+ FDARE+WP C IG + D G C + FA+ +DR CI +KG+ S E + +
Sbjct: 78 ESFDAREKWPECKDVIGKIRDQGTCGSCWAFASTEVMTDRLCIGTKGETKFVFSPENLLT 137
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
CC+ CR + C G + W++ G V+GGDY GCQP + + + ++
Sbjct: 138 CCEDCRLE----CVGGYTAKAWDYYINEGIVSGGDYNSSEGCQPYSKASFQYAVAS---- 189
Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
KC C N Y + DKH Y ++ N I+ EIL +GP ATF
Sbjct: 190 ----------KCVKACQNDKYDVKYDDDKHYGDSFYTLETNVTQIQTEILTNGPVMATFN 239
Query: 265 LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT-VK 323
+++D +YKSG+ L + ++ WGTE G PYWL+ N+WG WGD G +K
Sbjct: 240 VFEDIIYYKSGI-----------QLSNVSILRWGTEEGVPYWLIANSWGTWWGDLGGFIK 288
Query: 324 ILRGKYECAFEYLIAAG 340
I RG ECA E +AAG
Sbjct: 289 IKRGTNECAIEQEMAAG 305
>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
Length = 320
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 94/262 (35%), Positives = 133/262 (50%), Gaps = 23/262 (8%)
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL-STEY 141
++P+ FD+R++WPNC ++ + D G C + ++ + A +DR CI S GQ+ +T+Y
Sbjct: 80 SLPESFDSRQKWPNCPSLNQIRDQGCCGSCYVVSTAAAITDRYCIHSGGQKQFTFGATDY 139
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+A CC C C G V +TW + G + G Y GC S P
Sbjct: 140 LA-CCTDCF-----KCDGGYVGKTWQYWVDSGLTSEGPYKSGQGCN-----------SYP 182
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
C N +P +R Y + QD Y V NE+AI EI +GP
Sbjct: 183 FGSYCVNDPLPTC---SRTCQAGYPLTYSQDLKYGGSAYRVMWNENAIMTEIYQNGPVVV 239
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
F ++ DFY YKSGVY+H + A E + H+ ++IGWG ENG YWLV N+WG WGD+G
Sbjct: 240 QFEVFADFYQYKSGVYRHVTGAT-EGW-HAVRVIGWGVENGVKYWLVANSWGVRWGDKGF 297
Query: 322 VKILRGKYECAFEYLIAAGKPK 343
K +RG+ E + AG PK
Sbjct: 298 FKFVRGENHLGIEDFVYAGLPK 319
>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 223
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 89/229 (38%), Positives = 120/229 (52%), Gaps = 9/229 (3%)
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
F AV A SDR CI S G+ +S E + CC C CS G W + G
Sbjct: 2 FGAVEAMSDRVCIHSNGRVQVDISAEDLMDCCDKC----GSGCSGGVSAAAWQYWKDAGL 57
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
V+GG Y GC+P +++PC H S +LP C +P KC +C Y R + DK+
Sbjct: 58 VSGGLYNTTDGCKPYSLAPC-EHSSQGSLPECVG-TLPTPKCKRQCRE-GYERSYDDDKY 114
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
Y ++ +E I+ EI +GP A F Y DF YKSGVY+H S + H+ ++
Sbjct: 115 FAKNVYSINGSEKQIRTEIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGR--HAIRI 172
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+GWG+E+ PYWL+ N+W WGD G K+LRG EC E + AG PK
Sbjct: 173 LGWGSEDNNPYWLLANSWNEDWGDHGYFKMLRGVNECDIESFVNAGIPK 221
>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/343 (32%), Positives = 152/343 (44%), Gaps = 32/343 (9%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
LV L L+ + + ++D+IN+ W A N ++ A+A+
Sbjct: 14 LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEARRLT 66
Query: 64 ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
Q LP R T + + +P+ FD+ E+WPNC TI + D AC + +
Sbjct: 67 GARIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTAS 125
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C QQ R +S ++ SCCK C Y C G W + G +
Sbjct: 126 AISDRYCTVGGVQQLR-ISAAHLLSCCKDCGY----GCDGGYPGTAWEYYVSHGLAS--- 177
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ CQP C HHG P C KC+T CT+ ++ H L
Sbjct: 178 ----SYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAIPLIKYRGNHSYGL- 232
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
D ED K+E+ +GP F +Y DF YK+GVY+H S L H+ +++GWG
Sbjct: 233 ----DGEDDYKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGG--HAVRIVGWGK 286
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
NGTPYW + N+W WG G ILRGK EC E AG P
Sbjct: 287 LNGTPYWKIANSWDTDWGMNGHFLILRGKDECGIESEGYAGLP 329
>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 332
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 83/245 (33%), Positives = 134/245 (54%), Gaps = 9/245 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FD+RE W +C +I ++ D C + +A SDR C++SKG+ + +S +
Sbjct: 95 IPESFDSREVWKSCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP-T 202
+CC + + C+ G + W ++ + G VTGG Y ++ C+P + PC +HG +
Sbjct: 155 ACCG---SECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWS 211
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P + + P C C YG+ + +DK Y +D++E AI++E++ +GP A
Sbjct: 212 CPRDHSFRTPA--CKKYC-QYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAA 268
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F Y+DF Y G+Y HT + H+ K++GWG ENGT YW V N+W WG+ G
Sbjct: 269 FITYEDFSFYTKGIYVHTRGR--QRGAHAVKVVGWGVENGTKYWNVANSWSTDWGENGYF 326
Query: 323 KILRG 327
+ILRG
Sbjct: 327 RILRG 331
>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
Length = 216
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 86/223 (38%), Positives = 125/223 (56%), Gaps = 8/223 (3%)
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
+DR CI+S GQQ+ LS + SCC+ C C G + W++ +G VTGG
Sbjct: 1 MTDRICIQSGGQQSAELSALDLISCCEDC----GDGCQGGFPGQAWDYWVTQGIVTGGSK 56
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
+ TGCQP C HH + P+C + +C C Y + QDKH +Y
Sbjct: 57 ENHTGCQPYPFPKCEHH-TKGKYPACGTKIYKTPQCKQTC-QKGYKTPYEQDKHYGDESY 114
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
V NE AI+KEI+ +GP A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E
Sbjct: 115 NVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVE 172
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
TPYWL+ N+W WG++G +I+RG+ EC+ E + AG K
Sbjct: 173 KRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESHVVAGLIK 215
>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
Length = 237
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/219 (39%), Positives = 120/219 (54%), Gaps = 9/219 (4%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDARE WPNC TI V D G+C + F AV A SDR CI SKG +N S E +
Sbjct: 28 LPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLV 87
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C + C+ G WN+ +G V+GG YG GC P ++PC HH +
Sbjct: 88 SCCWTCGF----GCNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEVAPCEHHVNGTRG 143
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P E K P KC +C + Y + QD H Y + ++ D I++EI +GP F
Sbjct: 144 PCKEGGKTP--KCVKKCED-GYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTNGPVEGAF 200
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
+Y+DF Y++GVYKH + L H+ +++GWG +NG
Sbjct: 201 TVYEDFIAYRAGVYKHVAGKALGG--HAIRILGWGVQNG 237
>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/340 (31%), Positives = 155/340 (45%), Gaps = 25/340 (7%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPA-NLSEEYLRQFLIADAKYF 62
LV L L+ + + ++D+IN+ W A N N++ R+ A F
Sbjct: 14 LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLTGA----F 69
Query: 63 DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
+ LP R T + + +P+ FD+ E+WPNC TI + D AC + + A S
Sbjct: 70 RRKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAIS 128
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR C QQ R +S ++ SCC+ C C G+ W + G +
Sbjct: 129 DRYCTVGGVQQLR-ISAAHLMSCCEDC----GDGCKGGAPDSAWEYYVSHGLAS------ 177
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
+ CQP C HHG P C KC+T CT+ + K+R +Y +
Sbjct: 178 -SYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTD----KAIPLIKYRGNNSYML 232
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
+ ED K+E+ +GP F +Y DF YK+GVY+H S L H+ +++GWG NG
Sbjct: 233 LNGEDDYKRELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGG--HAVRIVGWGKLNG 290
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
TPYW + N+W WG G ILRG EC E AG P
Sbjct: 291 TPYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLP 330
>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
Length = 381
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 107/328 (32%), Positives = 154/328 (46%), Gaps = 25/328 (7%)
Query: 16 GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
G++ A++ IN W AG N L + Q+ Y + + LP
Sbjct: 75 GDVVNSQAAFVAAINNRTRGWKAGVN---PLRHD---QYRTGALLYEEAARAKLPQGIVL 128
Query: 76 YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
E P+ FDAR++W C ++G + + G CA+ + AAV +DR CI S+G+
Sbjct: 129 KLQE--EPFPESFDARQKWSFCPSVGTIRNQGCCASSYAVAAVATITDRWCIHSEGKSQF 186
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
V SCC C + C G W++ + G +GG Y GCQ C
Sbjct: 187 SFGAYDVLSCCHRCGF----GCDGGVPSAVWHYWVENGITSGGAYESHEGCQSYPFGVCK 242
Query: 196 -HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
AP + L C +C P Y + +DKH + Y V +ED I E+
Sbjct: 243 PQEIFAPHV---------DLICLRQC-QPGYNTTYLEDKHFGRVAYSVPRDEDRILYELF 292
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
GP A+F +Y DF YKSGVY+HT ++ + HS K++GWG ENGT +WL N+WG
Sbjct: 293 YFGPVQASFTVYTDFIQYKSGVYRHTYGVRVGD--HSVKIVGWGVENGTKFWLCANSWGA 350
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G KI+RG+ + E + AG P
Sbjct: 351 EWGENGFFKIIRGEDHLSVESNVVAGLP 378
>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
Length = 228
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 120/229 (52%), Gaps = 9/229 (3%)
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
F AV A SDR CI + G + +S + SCC C + C G W+F G
Sbjct: 2 FGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGYCGF----GCQGGFPPTAWDFWQTEGI 57
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG + TGC+ CSHHGS P C ++ C +C P + DK
Sbjct: 58 VTGGSKENPTGCRSYPFPRCSHHGSK-KYPPCSHRIYDTPNCVQKCDTPD--TDYATDKT 114
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
R +TY V ++AI KEI+ +GP A F +Y+DF YKSGVY H+ L H+ ++
Sbjct: 115 RANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGG--HAIRI 172
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+GWG ENG YWL+ N+W WG+ G K+LRGK EC E + AG P+
Sbjct: 173 LGWGEENGVAYWLIANSWNDGWGEDGYFKMLRGKNECGIEDEVTAGLPE 221
>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 340
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 111/347 (31%), Positives = 157/347 (45%), Gaps = 17/347 (4%)
Query: 1 MIHILVFLLGCTLVRGE---LYKFSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLI 56
+I +L + C L E + I+ +NR W AG N S++ + F
Sbjct: 6 VILLLNIICNCELNAVENEHIEPLFGKLIEYVNRNPKFGWKAGTNHRFRSSKDIEKMF-- 63
Query: 57 ADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
KY + + + + +P FDAR W NC TI + D C A A
Sbjct: 64 --RKYIEIENIQTKHIKTISHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIA 121
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
V + SDR CI+S G+ + LS SC + C HGS + G VT
Sbjct: 122 TVDSISDRICIRSNGRISVQLSARDAISC------GFSPGCFHGSEVEVLVYWITYGIVT 175
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
GG Y D++GCQP + CS+H + L C N +C C + Y + + DK
Sbjct: 176 GGSYEDQSGCQPYPLPKCSYHPESRFL-DCNNNTFEFPQCTNECQD-GYNKTYDDDKFYG 233
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
Y V ++ I+KEIL +GP A+ ++ DF YKSGVY T ++ ++ + ++IG
Sbjct: 234 ERIYNVYGTQEDIQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWI-TLRIIG 292
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WG E PYWL N+W WGD G VKI RG E + A PK
Sbjct: 293 WGYEGKIPYWLCANSWNEEWGDNGYVKIQRGVQAGYIESYVRAPIPK 339
>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
Length = 407
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/231 (37%), Positives = 123/231 (53%), Gaps = 10/231 (4%)
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
AAV A SDR CI SKG++ LS + + SCCK C + C G W + G V
Sbjct: 168 AAVEAMSDRICITSKGKKQVILSADDLLSCCKTCGF----GCFGGEPMAAWKYWVLSGIV 223
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
TG DY + +GC+P PC HH + C++ P KC +C + Y + + DK+
Sbjct: 224 TGSDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCDRQC-DKNYKKPYKADKYY 282
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
Y V+++ + I+KEI+ GP A+F +Y DF HY G+YKH + + H+ K++
Sbjct: 283 GEQAYNVENDVELIQKEIMTLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGG--HAVKIL 340
Query: 296 GWGTENGTPYWLVINTWGPHWGD---RGTVKILRGKYECAFEYLIAAGKPK 343
GWG + G YWL N+W WG+ G +ILRG EC E I AG P+
Sbjct: 341 GWGIDQGVSYWLAANSWNTDWGEDVFSGYFRILRGVDECGIESGIVAGIPR 391
>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 105/312 (33%), Positives = 156/312 (50%), Gaps = 31/312 (9%)
Query: 42 FPANLSEEYL---RQFLIADAKYFDQS---DRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
F A S E + RQFL+ ++ ++S + LP T + + +P+ FD+RE+W
Sbjct: 53 FKAKYSPEVVKKRRQFLL-KPQFIERSYNQENVLPIANITSNDD----IPESFDSREKWK 107
Query: 96 NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDN 154
+C ++ +PD C + +A SDR CI S+G++ LS + +CC K C Y
Sbjct: 108 DCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGY--- 164
Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC-SHHGSA----PTLPSCENQ 209
C G R W + G VTGG Y ++ C+P C +H G A P+ P
Sbjct: 165 -GCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPA 223
Query: 210 KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDF 269
+ P + YG+ + DK + YW+ ++E I+ EI+ GP ATF +Y+DF
Sbjct: 224 RKPYCQY-------GYGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDF 276
Query: 270 YHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG-DRGTVKILRGK 328
HY GVY HT+ A HS K+IGWG + G YWL+ N+W WG D G +++RG
Sbjct: 277 EHYNGGVYIHTAGAMEGG--HSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVVRGI 334
Query: 329 YECAFEYLIAAG 340
C E + AG
Sbjct: 335 NNCDIEGGVLAG 346
>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 109/340 (32%), Positives = 153/340 (45%), Gaps = 25/340 (7%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPA-NLSEEYLRQFLIADAKYF 62
LV L L+ + + ++D+IN+ W A N N++ R+ A F
Sbjct: 14 LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLTGA----F 69
Query: 63 DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
+ LP R T + + +P+ FD+ E+WPNC TI + D AC + + A S
Sbjct: 70 RRKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAIS 128
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR C QQ R +S ++ SCCK C C G W + G +
Sbjct: 129 DRHCTVGGVQQLR-ISAAHLLSCCKDC----GDGCDGGYPDSAWEYYVSHGLAS------ 177
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
+ CQP C HHG P C KC+T CT+ + K+R +Y +
Sbjct: 178 -SYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTD----KAIPLIKYRGNDSYVL 232
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
ED K+E+ +GP F +Y DF YK+GVY+H S L H+ +++GWG NG
Sbjct: 233 LHGEDDFKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGG--HAVRIVGWGKLNG 290
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
TPYW + N+W WG G ILRG EC E AG P
Sbjct: 291 TPYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLP 330
>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
Length = 278
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 95/288 (32%), Positives = 146/288 (50%), Gaps = 14/288 (4%)
Query: 21 FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
FSD I IN E+ +W A + N ++ + + + D++ + R+T
Sbjct: 3 FSDELIHYINEESGASWKAAPSTRFNNIDQVKQNLGVLEETPEDRNTQ-----RQTVRYS 57
Query: 80 YSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
S +P+ FDAR++WPNC +I + D +C++ ++ A +DR CI S GQ+ LS
Sbjct: 58 VSENDLPESFDARQKWPNCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLS 117
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCC C Y C+ G +W++ + G VTGG + TGC P CSH
Sbjct: 118 AIDIVSCCAYCGY----GCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGV 173
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P LP C P KC +C + Y + + QDK + +Y V + E I EI+ +GP
Sbjct: 174 VTPGLPPCPRDIYPTPKCEKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGP 232
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
F +++DF YKSG+Y +T+ + H+ ++IGWG ENG YW
Sbjct: 233 VDGIFYMFEDFLVYKSGIYHYTTGRLVGG--HAIRVIGWGVENGVNYW 278
>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
Length = 312
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 158/328 (48%), Gaps = 49/328 (14%)
Query: 28 QINREANT-----WTAGRN--FPANLSEEY-----LRQFLIADAKYFDQSDRPLPGDRKT 75
++ RE N+ W AG N F E++ RQ ++D Y D S P+
Sbjct: 20 KLVREVNSRNDVNWVAGINPHFADATIEDFRRLNGARQTPLSDRVYMDVSTVPV------ 73
Query: 76 YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
A +PD FD+R WPNC IG + D G C + ++ DR CIKS+G+Q
Sbjct: 74 ------ANLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTP 127
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
LS +++ SC C C+ G + + F+ G + G D C P + C
Sbjct: 128 ELSPQHLTSCTPGC-----SGCNGGWMSTAFGFMQSNG-ILGED------CIPYQMGKCK 175
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G + T P+ K K KC+ T T + +Y V NE I+KEI
Sbjct: 176 HPGCS-TWPT---PKCNKTKCYPNDTKST-------ELWHAASSYSVRSNEADIQKEIYE 224
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP TA+FA+Y+D Y+SGVY+H + E LH+ K++GWG +G YW ++N+W
Sbjct: 225 NGPVTASFAVYEDLSVYQSGVYQHVTGG-FEG-LHAIKVVGWGILDGVKYWTIVNSWAED 282
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WG G + I RG EC E + AG+PK
Sbjct: 283 WGFDGLLLIRRGVDECGIESDVVAGQPK 310
>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 551
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 103/321 (32%), Positives = 152/321 (47%), Gaps = 26/321 (8%)
Query: 28 QINREANTWTAGRN-FPANLSEEYLRQFL---IADAKYFDQSDRPLPGDRKTYDPEYSAT 83
Q N +W GRN + N S +++ L + ++++ P+P D + +
Sbjct: 232 QANGNTFSWKFGRNAYFKNKSIGEIKKLLGYRMLPKTVKERNEMPMPEDLLNLE---NFN 288
Query: 84 VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
P FD+R+ WP C I + D C + ++ SDR CI + GQ LS +
Sbjct: 289 YPVEFDSRKHWPQCEKVISFIKDQANCGSCWAVSSASVMSDRTCIATDGQFTTLLSDAEL 348
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C Y C+ G RT+ + G TGG YG C+P I PCS+
Sbjct: 349 LSCCTSCGY----GCNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPYPIPPCSN------ 398
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
C + PK C C + TY +D+H + Y E ++ K+I +GP A
Sbjct: 399 ---CSETRTPK--CSKSCIS-TYPLSLNEDRHYGSTYYQFWLGEKSMMKDISLYGPIVAG 452
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
++Y+DF HYK GVY S L H+ ++IGWG ++ PYWLV N+W +G+ G
Sbjct: 453 MSVYEDFLHYKEGVYTQESGIFLGG--HAVRIIGWGEQDNIPYWLVANSWNTTFGEDGLF 510
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KI RG EC E ++AG+ K
Sbjct: 511 KIRRGFDECGIESYVSAGRAK 531
>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
Length = 256
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 92/243 (37%), Positives = 129/243 (53%), Gaps = 21/243 (8%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P +FDAR++W C TIG V D G C + + AF+DR C+ + G N+ LS E +
Sbjct: 28 IPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSAFADRLCVATNGDFNQLLSAEEIT 87
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
CC C C+ G R W G VTGG+Y GC+P + PC +
Sbjct: 88 FCCHKC----GNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGKN- 142
Query: 204 PSCENQKV-PKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
+C Q + P KC +C YG F +D T Y++ I+K+++ +GP
Sbjct: 143 -TCSGQPMEPNHKCSKKC----YGDEDIDFNKDHRYTRDDYYL--TYRGIQKDVINYGPI 195
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWLVINTWGPHWG 317
A+F +YDDF +YKSG+Y + NA +YL HS KLIGWG E G YWL++N+W WG
Sbjct: 196 EASFDVYDDFPNYKSGIYVKSENA---SYLGGHSVKLIGWGEEYGVLYWLMVNSWNADWG 252
Query: 318 DRG 320
D+G
Sbjct: 253 DKG 255
>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/348 (32%), Positives = 166/348 (47%), Gaps = 35/348 (10%)
Query: 2 IHILVFLLGCT--LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
+ +LVF LG + L + + SD YI QIN + +TW AGRNF + E L + L +
Sbjct: 4 VLMLVFALGLSSALPSNKPHPLSDEYIAQINSKQSTWKAGRNFA--IDEYELFKSLASGV 61
Query: 60 KYFDQSDRPLPGDRKTYDP---EYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIF 115
K P KT E + +P+ FD+R WP C IG + D C + F
Sbjct: 62 KK--------PQGLKTAQKLVREITEEIPESFDSRTAWPECTQIIGMIRDQSRCGSCWAF 113
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
AAV A SDR CI S + +S++ + +C C+ G W+ G V
Sbjct: 114 AAVEAMSDRICIHSNATKKLLVSSQDLLTC------GTAGGCNGGWPAVAWSDW-TNGIV 166
Query: 176 TGGDYGD-RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
TGG YG GC+ + C H + C N V C +C P+ +++ +
Sbjct: 167 TGGLYGALEQGCKSYFLEGCDDHPN-----KCRNY-VSTPACVEQCDEPSL---YYKAQE 217
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
T + E+ I+ EI+ +GP AT +Y DF Y+SG+Y+ T++ H+ K+
Sbjct: 218 TYGQTPYEIQGEEQIQYEIMTNGPVEATMDVYVDFAQYQSGIYQLTTDEYEGG--HAVKI 275
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+GWG E+G YWLV N+W WG+ G +I+RG+ E E I A P
Sbjct: 276 LGWGVEDGVKYWLVANSWNERWGENGLFRIIRGRDEVGIESTIDAALP 323
>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 88/258 (34%), Positives = 131/258 (50%), Gaps = 9/258 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ +R +WP C ++ + D C + + A SDR CI S G++ +S +
Sbjct: 2 IPESPYSRTKWPKCSSLKPIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61
Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C Y C+ G + +N+ K+G+VTGGDY +GC+P PC HHG
Sbjct: 62 SCCGNQCGY----GCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTY 117
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
C N+ KC +C + +D+ Y + E A ++EI+ +GP
Sbjct: 118 YGECPNEATTP-KCVRKCQKSYKKS-YKKDRSIGKDAYEEPNAEKATQREIMKNGPVVGA 175
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y+DF +YK G+YKHT+ H+ K+IGWG E G PYWL+ N+W WG+ G
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGG--HAIKIIGWGKEGGVPYWLIANSWHNDWGENGYF 233
Query: 323 KILRGKYECAFEYLIAAG 340
+IL G C E + AG
Sbjct: 234 RILCGSNHCGIEENVVAG 251
>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 82/245 (33%), Positives = 133/245 (54%), Gaps = 9/245 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FD+R W NC +I ++ D C + +A SDR C++SKG+ + +S +
Sbjct: 95 IPESFDSRVVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP-T 202
+CC + + C+ G + W ++ + G VTGG Y ++ C+P + PC +HG +
Sbjct: 155 ACCG---RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWS 211
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P + + P C C YG+ + +DK Y +D++E AI++E++ +GP A
Sbjct: 212 CPRDHSFRTPA--CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAA 268
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
Y+DF Y+ G+Y HT + H+ K++GWG ENGT YW V N+W WG+ G
Sbjct: 269 SITYEDFSFYRRGIYVHTRGR--QRGAHAVKVVGWGVENGTKYWNVANSWSTDWGEDGYF 326
Query: 323 KILRG 327
+ILRG
Sbjct: 327 RILRG 331
>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
Length = 340
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 118/354 (33%), Positives = 160/354 (45%), Gaps = 39/354 (11%)
Query: 1 MIHILVFLLGCTLVRGELYKFSD------AYIDQINREAN-TWTAGRN---FPANLSEEY 50
++ + LL T V G K SD +++ ++N +A WTA N S
Sbjct: 11 LVAVFALLLA-TTVSGLYAKPSDFPLLGKSFVAEVNSKAKGQWTASANNGYLVTGKSLGE 69
Query: 51 LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
+R+ + D S +P R E +P+ FDA E WP C TI + D C
Sbjct: 70 VRKLM----GVTDMSTEAVP-PRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCG 124
Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
+ AAV A SDR C G +R +ST + SCC IC C G W +
Sbjct: 125 SCWAIAAVEAISDRYCTFG-GVPDRRMSTSNLLSCCFIC----GLGCHGGIPTVAWLWWV 179
Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
G T CQP PCSHHG++ P C + KC+T C
Sbjct: 180 WVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKCNTTCER----NEMD 228
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL- 289
K++ + +Y V E + E++ +GP T +Y DF YKSGVYKH L ++L
Sbjct: 229 LVKYKGSTSYSVK-GEKELMIELMTNGPLELTMQVYSDFVGYKSGVYKHV----LGDFLG 283
Query: 290 -HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
H+ KL+GWGT++G PYW V N+W WGD+G I RG EC E AG P
Sbjct: 284 GHAVKLVGWGTQDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIP 337
>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 103/344 (29%), Positives = 162/344 (47%), Gaps = 32/344 (9%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
+L+ + LV E + ++D +NR WTA + + ++ +++AK
Sbjct: 13 VLLAMNTSALVAREAPLLTKEFVDTVNRLSGGMWTA-------VYDGRMQNTTVSEAKRL 65
Query: 63 DQSDRP----LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+++ R LP T + E A +P+ FDA E+WPNC TI + D +C + AA
Sbjct: 66 NRATRKPVSVLPRVNFT-EEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAA 124
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
+ +DR C G + +S + +CC C Y C G W + G +G
Sbjct: 125 TSMTDRYCTI-HGVRGLRISAADLLACCGDCGY----GCLGGDPDMAWAYFSSEGIASGR 179
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
CQP CSH+ ++ T P C + C+ CT+ T + K+R
Sbjct: 180 -------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISK----KKYRGLK 228
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y + ED ++E+ GP A F ++ D + YK GVYKH A + H+ +++GWG
Sbjct: 229 SYSLSGEED-FRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIG--AHAVRIVGWG 285
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
++G PYW + N+W WGDRG +LRG EC E +AG P
Sbjct: 286 NQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVP 329
>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
Length = 216
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/223 (37%), Positives = 125/223 (56%), Gaps = 8/223 (3%)
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
+DR CI+S G Q+ LS + SCC+ C + C G W++ +G VTGG
Sbjct: 1 MTDRICIQSGGGQSAELSALDLISCCEDC----GQGCQGGFPGVAWDYWVTQGIVTGGSK 56
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
+ TGCQP C HH + P+C + +C +C Y + QDKH +Y
Sbjct: 57 ENHTGCQPYPFPKCEHH-TKGKYPACGTKIYKTPQCKQKCQK-GYKTPYKQDKHYGDESY 114
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
V NE AI+KEI+ +GP A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG +
Sbjct: 115 NVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVK 172
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
TPYWL+ N+W WG++G +I+RG+ EC+ E + AG K
Sbjct: 173 KRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGLIK 215
>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
Length = 356
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 110/334 (32%), Positives = 148/334 (44%), Gaps = 37/334 (11%)
Query: 15 RGELYKFSDAYIDQINR-EANTWTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
+ E D+ + Q+N E W A N F ++ R + + D P+
Sbjct: 34 KAESAILQDSIVKQVNENEKAGWKAALNPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILT 93
Query: 72 DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
K + +P FDAR WPNC TIG + D G C + F AV + SDR CI
Sbjct: 94 HPKLLE------LPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYG- 146
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPS 189
N LS + +CC D C G + W + ++G VT Y D G
Sbjct: 147 -LNISLSANDLLACCGFLCGD---GCDGGYPLQAWKYFVRKGVVTDECDPYFDNEG---- 198
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
CSH G P P+ KCH +C + + KH Y + + +I
Sbjct: 199 ----CSHPGCEPAYPT--------PKCHRKCVKQNL--LWSKSKHFGVNAYMISSDPHSI 244
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLV 308
E+ +GP +F +Y+DF HYKSGVYKH + + H+ KLIGWGT E+G YWL+
Sbjct: 245 MTELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGG--HAVKLIGWGTSEDGEDYWLL 302
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
N W WGD G KI RG EC E + AG P
Sbjct: 303 ANQWNRGWGDDGYFKIRRGTDECEIEDEVVAGLP 336
>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 103/344 (29%), Positives = 161/344 (46%), Gaps = 32/344 (9%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
+L+ + LV E + ++D +NR WTA + + ++ +++AK
Sbjct: 13 VLLAMNTSALVAREAPLLTKEFVDTVNRLSGGMWTA-------VYDGRMQNTTVSEAKRL 65
Query: 63 DQSDRP----LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+++ R LP T + E A +P+ FDA E+WPNC TI + D +C + AA
Sbjct: 66 NRATRKPVSVLPRVNFT-EEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAA 124
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
+ +DR C G + +S + +CC C Y C G W + G +G
Sbjct: 125 TSMTDRYCTI-HGVRGLRISAADLLACCGDCGY----GCLGGDPDMAWAYFSSEGIASGR 179
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
CQP CSH+ ++ T P C + C+ CT+ T + K+R
Sbjct: 180 -------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISK----KKYRGLK 228
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y ED ++E+ GP A F ++ D + YK GVYKH A + H+ +++GWG
Sbjct: 229 SYSFSGEED-FRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIG--AHAVRIVGWG 285
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
++G PYW + N+W WGDRG +LRG EC E +AG P
Sbjct: 286 NQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVP 329
>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
Length = 296
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 105/338 (31%), Positives = 146/338 (43%), Gaps = 60/338 (17%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ LG R + SD ++ +N++ TW AG NF N+ YL++ +
Sbjct: 11 LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P+ FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
CI + + +S E + +CC I D C+ G WNF ++G V+GG Y G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
C+P +I PC HH + P PK ++ P Y + QDKH +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPKC---SKICEPGYSPTYKQDKHYGYNSYSVSNS 235
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
E I EI +NGTPY
Sbjct: 236 EKDIMAEIY---------------------------------------------KNGTPY 250
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 251 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 288
>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
Length = 347
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 142/289 (49%), Gaps = 9/289 (3%)
Query: 56 IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
I D + ++ + + D + + ++P+ FDARE+WP C +IG + D A
Sbjct: 66 IMDLSFMVDAEVMMEEMDQQEDIDLAVSLPESFDAREKWPECPSIGLIRDQSAGGGCWAV 125
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGS 174
++ +DR CI+S G + +S + SCC + C C+ G + +N+ ++G
Sbjct: 126 SSAEVMTDRICIQSNGTKQVYVSETDILSCCGQRC----GSGCTSGVPRQAFNYAIRKGV 181
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
+GG YG + C+P PC +H P C + P C C + Y + D+
Sbjct: 182 CSGGPYGTKGVCKPYPFYPCGYHAHLPYYGPCPDGMWPTPTCEKACQS-DYTVPYNDDRI 240
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+ T V E+ IK+EI +GP AT+ +Y+DF +YK+G+Y + H+ K+
Sbjct: 241 FGSKTI-VLTGEEKIKREIFNNGPLVATYTVYEDFAYYKNGIY--MTGLGRATGAHAVKI 297
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
IGWG ENG YWL+ N+W WG+ G ++LRG C E G K
Sbjct: 298 IGWGEENGVKYWLIANSWNTDWGENGFFRMLRGTNLCDIELSATGGTFK 346
>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/343 (31%), Positives = 154/343 (44%), Gaps = 32/343 (9%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
LV L L+ + + ++D+IN+ W A N ++ A+A+
Sbjct: 14 LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEARRLT 66
Query: 64 ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
Q LP R T + + +P+ FD+ E+WPNC TI + D AC + +
Sbjct: 67 GARIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTAS 125
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C QQ R +S ++ SCC+ C Y C G +W + G +
Sbjct: 126 AISDRHCTVGGVQQLR-ISAAHLMSCCEDCGY----GCDGGYPGTSWEYYVSHGLAS--- 177
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ CQP C HHG P C KC+T CT+ + K+R +
Sbjct: 178 ----SYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTD----KAIPLIKYRGNHS 229
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V ED K+E+ +GP F +Y DF YK+GVY+H S L H+ +++GWG
Sbjct: 230 YEVH-GEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGG--HAVRIVGWGK 286
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
NGTPYW + N+W WG G + LRG EC E AG P
Sbjct: 287 LNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSP 329
>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
Length = 340
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 116/352 (32%), Positives = 157/352 (44%), Gaps = 35/352 (9%)
Query: 1 MIHILVFLLGCTLVRGELYKFSD------AYIDQINREAN-TWTAGRN---FPANLSEEY 50
++ + LL T V G K SD +++ ++N +A WTA + S
Sbjct: 11 LVAVFALLLA-TTVSGLYAKPSDFPLLGKSFVAEVNSKAKGQWTASADNGYLVTGKSLGE 69
Query: 51 LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
+R+ + D S +P R E +P+ FDA E WP C TI + D C
Sbjct: 70 VRKLM----GVTDMSTEAVP-PRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCG 124
Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
+ AAV A SDR C G +R +ST + SCC IC C G W +
Sbjct: 125 SCWAIAAVEAISDRYCTFG-GVPDRRMSTSNLLSCCFIC----GLGCHGGIPTVAWLWWV 179
Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
G T CQP PCSHHG++ P C + KC+T C
Sbjct: 180 WVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKCNTTCERSEMDL--- 229
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
K++ + +Y V E + E++ +GP T +Y DF YKSGVYKH L H
Sbjct: 230 -VKYKGSTSYSVK-GEKELMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGG--H 285
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ KL+GWGT++G PYW V N+W WGD+G I RG EC E AG P
Sbjct: 286 AVKLVGWGTQDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIP 337
>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
Length = 278
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 98/289 (33%), Positives = 142/289 (49%), Gaps = 16/289 (5%)
Query: 21 FSDAYIDQINREAN-TWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRP-LPGDRKTYD 77
FSD I +N E+ +W A R+ +N+ L +++ + RP + D D
Sbjct: 3 FSDELIRFVNEESGASWKAARSTRFSNVDHFKLDLGALSETPEERNALRPTIKHDISKND 62
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+P+ FDAR QWP C TI + D +C + AA A SDR CI S GQ L
Sbjct: 63 ------LPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRL 116
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
+ SCC C + C G + W++ + G VTGG + +RTGCQP + C H
Sbjct: 117 AAADPLSCCTYC----GQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHV 172
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
G + C + PK C C Y + + QDK +Y V ++E I +EI+ +G
Sbjct: 173 GDSRKYSRCPHYTYPKPPCARACQT-GYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNG 231
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
P TFA++ DF Y+SG+Y H + + H+ ++IGWG ENG YW
Sbjct: 232 PVEVTFAIFQDFGVYRSGIYHHVAGKFIGR--HAVRMIGWGVENGVNYW 278
>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
Length = 255
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 93/258 (36%), Positives = 121/258 (46%), Gaps = 25/258 (9%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FDAR WP C +I H+ D C + F AV A SDR CI S G LS E +
Sbjct: 15 IPDNFDARTNWPQCPSIAHIRDQSTCGSCWAFGAVEAMSDRLCIASNGTVKDELSAEDML 74
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC + C+ G W F G T Y P PC HH +
Sbjct: 75 SCCLV---QCGMGCNGGFPTGAWRFFKMHGLTTESKY-------PYVFPPCEHHINKTHY 124
Query: 204 PSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
C +Q PK C + + + K +++ I+ EI+ +GP A
Sbjct: 125 KPCGPSQPTPK------CVRASEKKPRYHGKSVYSVS------PAKIQAEIMTNGPVEAA 172
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y DF Y+SGVY+H S +L H+ K++GWG E G YWLV N+W WGD+GT
Sbjct: 173 FTVYQDFLAYQSGVYRHVSGPELGG--HAIKIMGWGVEAGNKYWLVANSWNEDWGDKGTF 230
Query: 323 KILRGKYECAFEYLIAAG 340
KI RG EC E + AG
Sbjct: 231 KIARGDDECGIESSVVAG 248
>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 107/344 (31%), Positives = 153/344 (44%), Gaps = 31/344 (9%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
LV L L+ + + ++D IN+ W A N ++ ++AK
Sbjct: 15 LVALGASALLAKDAPVLTKTFVDHINQLNGGMWKAVYN-------GKMQNITFSEAKRLT 67
Query: 64 ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
Q LP R T + + +P+ FDA E WP+C TI + D C A +
Sbjct: 68 GARIQKSSALPPARFT-EEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTAS 126
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C KG+Q R +S ++ SCCK C C G W + + G +
Sbjct: 127 AISDRYCTVGKGKQLR-ISAAHLLSCCKDC----GDGCKGGFPGFAWRYYVEYGITS--- 178
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ CQP C H G+ C KC+ CT+ + K+R T
Sbjct: 179 ----SSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTD----KAIPLIKYRGNAT 230
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + E+ K+E+ +GP A F +Y D + YKSGVY+H L + K++GWG
Sbjct: 231 YLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGG--TAVKVVGWGK 288
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
NGTPYW + N+W WG G + ILRG EC E+L AG P+
Sbjct: 289 LNGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPE 332
>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 105/345 (30%), Positives = 156/345 (45%), Gaps = 26/345 (7%)
Query: 1 MIHILVFLLGCTLVRG-ELYKFSDAYIDQINR-EANTWTAGRNFPA-NLSEEYLRQFLIA 57
++ + LG + +R + + ++D+IN+ W A N N++ R+ A
Sbjct: 9 LLSTALVALGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLTGA 68
Query: 58 DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
F + LP R T + + +P+ FD+ E+WPNC TI + D AC + +
Sbjct: 69 ----FRRKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVST 123
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
A SDR C QQ R +S ++ SCCK C C G W + G +
Sbjct: 124 ASAISDRHCTVGGVQQLR-ISAAHLLSCCKDC----GDGCDGGYPDAAWRYYVSHGLAS- 177
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
+ CQP C HHG P C KC+T CT+ + ++R
Sbjct: 178 ------SYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTD----KAIPLIEYRGN 227
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
+Y + ED K+E+ +GP F ++ DF YK+GVY+H S L H+ +++GW
Sbjct: 228 DSYVLLHGEDDFKRELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGG--HAVRIVGW 285
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G NGTPYW + N+W WG G LRG EC E+ AG P
Sbjct: 286 GKLNGTPYWKIANSWDTDWGMNGHFLFLRGNNECGIEFEGYAGLP 330
>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
Length = 334
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 101/325 (31%), Positives = 153/325 (47%), Gaps = 37/325 (11%)
Query: 24 AYIDQINREANTWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
A ++++N+ WTA + A L+ + +++ + A + D P+ R + E A
Sbjct: 37 AEVNKLNK--GIWTARYDTKMARLTRQGVKRLMGAKLR-----DAPVLPRRHFTEEELRA 89
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ FDA WP+C TI + D +C + AA A SDR C+ + G ++ +S +
Sbjct: 90 PLPESFDAATAWPDCPTIKRIADQSSCGSCWAVAAATAMSDRFCV-TGGVRDLGISAGDL 148
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C C G W + + G V+ DY CQP PC H G
Sbjct: 149 LSCCTSC----GDGCDGGYPDEAWLYFTESGLVS--DY-----CQPYPFPPCKHSGGRSK 197
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN-----EDAIKKEILAHG 257
PSC + KC+ CT DK + Y+ ++ E+ K+E+ G
Sbjct: 198 NPSCHDMHFHTPKCNATCT----------DKRIPVVRYFASESYSLQGEEDYKRELYLRG 247
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P F +Y+DF Y+SGVYKH S + H+ +++GWG NG PYW + N+W WG
Sbjct: 248 PFEVAFTVYEDFLAYESGVYKHVSGGPVGG--HAVRVVGWGERNGVPYWKIANSWNTDWG 305
Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
+ G + RGK EC E +AG P
Sbjct: 306 ENGYLYFYRGKDECGIESQGSAGTP 330
>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
Length = 217
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 110/221 (49%), Gaps = 9/221 (4%)
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
+DR C S G ++ S E + SCC IC C+ G W + G V+GG+Y
Sbjct: 1 TDRVCTYSNGTKHFHFSAEDLLSCCPICGL----GCNGGMPTLAWEYWKHMGLVSGGNYN 56
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
GC P I PC HH LP + K PK C C N Y + +DK Y
Sbjct: 57 SSQGCSPYVIPPCEHHVPGNRLPCNGDTKTPK--CSKTCEN-GYNVLYKKDKRYGKHVYA 113
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
V ED IK E+ +GP A F +Y D YKSGVYKH L H+ K+IGWG EN
Sbjct: 114 VRGGEDHIKAELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGG--HAIKIIGWGVEN 171
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G YWL+ N+W WG+ G KILRG+ C E I AG+P
Sbjct: 172 GNKYWLIANSWNTDWGNNGFFKILRGEDHCGIESSIVAGEP 212
>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 107/344 (31%), Positives = 153/344 (44%), Gaps = 31/344 (9%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
LV L L+ + + ++D IN+ W A N ++ ++AK
Sbjct: 15 LVALGASALLAKDAPVLTKTFVDHINQLNGGMWRAVYN-------GKMQNITFSEAKRLT 67
Query: 64 ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
Q LP R T + + +P+ FDA E WP+C TI + D C A +
Sbjct: 68 GARIQKSSALPPARFT-EEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTAS 126
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C KG+Q R +S ++ SCCK C C G W + + G +
Sbjct: 127 AISDRYCTVGKGKQLR-ISAAHLLSCCKDC----GDGCKGGFPGFAWRYYVEYGITS--- 178
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ CQP C H G+ C KC+ CT+ + K+R T
Sbjct: 179 ----SSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTD----KAIPLIKYRGNAT 230
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + E+ K+E+ +GP A F +Y D + YKSGVY+H L + K++GWG
Sbjct: 231 YLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGG--TAVKVVGWGK 288
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
NGTPYW + N+W WG G + ILRG EC E+L AG P+
Sbjct: 289 LNGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPE 332
>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 149/328 (45%), Gaps = 37/328 (11%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQF-LIADAKYFDQSDRPLPGDRKTYDP 78
D+ + ++N A W A F LS + QF + K + D L G P
Sbjct: 40 LQDSIVKRVNENAEAGWKAA--FNPQLSNFTVSQFKRLLGVKPAREGD--LEGIPVLTHP 95
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+P FDAR+ WP C TIG + D G C + F AV + SDR CI + LS
Sbjct: 96 RLK-ELPKEFDARKAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHY--NLSISLS 152
Query: 139 TEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
+ +CC +C C G W + + G VT Y D TGC S
Sbjct: 153 VNDLLACCSFLC----GSGCDGGYPIAAWRYFKRSGVVTEECDPYFDTTGC--------S 200
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G P P+ KCH +C + + KH Y V + +I E+
Sbjct: 201 HPGCEPLYPT--------PKCHRKCVKGNVL--WRKSKHYGVNAYRVSHDPQSIMAEVYK 250
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
+GP +F +Y+DF HYKSGVYKH + + H+ KLIGWGT E G YWL++N+W
Sbjct: 251 NGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGG--HAVKLIGWGTSEQGEDYWLIVNSWNR 308
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G KI RG EC E+ + AG P
Sbjct: 309 GWGEDGYFKIRRGTNECGIEHSVVAGLP 336
>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 109/345 (31%), Positives = 157/345 (45%), Gaps = 36/345 (10%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAK--- 60
LV L L+ + + ++D+IN+ W A + + ++ ++AK
Sbjct: 15 LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKA-------VYDGKMQNLTFSEAKRLT 67
Query: 61 -YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
F + LP R T + + +P+ FDA E WP+C TI + D AC A A
Sbjct: 68 GAFSRKTSSLPPVRFT-EEQLRTELPESFDAAEHWPHCPTIREIADQSACRASWAVATAS 126
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C KG+Q R +S + +CCK C C G W + G +
Sbjct: 127 AISDRYCTVGKGKQLR-ISAADLMACCKDC----GGGCEGGYPDAAWEYYVSHGITS--- 178
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ CQP C H G+ P C K +C+ CT+ + K+R +
Sbjct: 179 ----SQCQPYPFPRCEHRGAQGKKPPCSKYKFVTPQCNATCTD----KSVPLIKYRGNHS 230
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGW 297
Y V ED K+E+ +GP F ++ DF YKSGVY+H + N+L + +++GW
Sbjct: 231 YEVRGEED-YKRELYFNGPFVVRFQVHSDFLAYKSGVYQHVAG----NFLGGKAVRIVGW 285
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G NGTPYW V N+W WG G ILRG EC E+L AG P
Sbjct: 286 GKLNGTPYWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAGTP 330
>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 350
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 114/344 (33%), Positives = 151/344 (43%), Gaps = 43/344 (12%)
Query: 8 LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSD 66
L+G L + I+ IN N WTAG+N Y + IA K+
Sbjct: 25 LVGAASGDNSLGIIQNDIIETINNHPNAGWTAGQN-------SYFANYTIAQFKHILGVK 77
Query: 67 RPLPG-----DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
PG KTY S +P FDAR +W C TIG + D G C + F AV
Sbjct: 78 PTPPGLLRGVPTKTY--SRSTDLPKEFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECL 135
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
DR CI N LS + +CC D C G W +L + G VT
Sbjct: 136 QDRFCIHL--NMNISLSVNDLVACCGFMCGD---GCDGGYPISAWQYLVENGVVTDECDP 190
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y D+ GC+ H G P P+ +K K++ + + KH +
Sbjct: 191 YFDQVGCK--------HPGCEPAYPTPACEKKCKVQNQV----------WQEKKHFSINA 232
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V+ + I E+ +GP F +Y+DF HYKSGVY+H + + H+ KLIGWGT
Sbjct: 233 YRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYEHITGEMMGG--HAVKLIGWGT 290
Query: 300 E-NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+G YWL+ N W WGD G KI+RGK EC E + AG P
Sbjct: 291 SADGKDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMP 334
>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
Length = 422
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 159/356 (44%), Gaps = 67/356 (18%)
Query: 22 SDAYIDQINREAN-----TWTAGRN----------FPANLSEEYLRQFLIADAKYFDQSD 66
SD Y+ ++ R+ N TW A N F ++ + +++ K+F+
Sbjct: 64 SDEYLRKLVRQVNDSPETTWKAKFNKFGVKNRSYGFKYTRNQTAVEEYMEHIRKFFESD- 122
Query: 67 RPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRC 126
+ + D S+ +P FDAR++WPNC +I +VP+ G C + AA G SDR C
Sbjct: 123 -AMKRHLEELDNYKSSDLPKAFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRAC 181
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGC 186
I S G LS E + CC +C +C G + + +G VTGG R GC
Sbjct: 182 IHSNGTFKALLSEEDIIGCCSVC-----GNCYGGDPLKALTYWVNQGLVTGG----RDGC 232
Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW----- 241
+P + P P+ + K C RC N Y + + +DKH T Y
Sbjct: 233 RPYSFDLSC---GVPCSPATFFEAEEKRTCMRRCQNIYYQQRYEEDKHFATFAYSLYPRS 289
Query: 242 -----------------------------VDDNEDAIKKEILAHGPTTATFALYDDFYHY 272
V + + IKKEIL +GPTT F + ++F HY
Sbjct: 290 MTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHY 349
Query: 273 KSGVYKHTSNAKLEN---YLHSGKLIGWG-TENGTPYWLVINTWGPHWGDRGTVKI 324
SGV++ ++ Y H +LIGWG +E+GT YWL +N++G HWGD G KI
Sbjct: 350 SSGVFRPFPLDGFDDRIVYWHVVRLIGWGQSEDGTHYWLAVNSFGSHWGDNGLFKI 405
>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/343 (31%), Positives = 153/343 (44%), Gaps = 32/343 (9%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
LV L L+ + + ++D+IN+ W A N ++ A+A+
Sbjct: 14 LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEARRLT 66
Query: 64 ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
Q LP R T + + +P+ FD+ E+WPNC TI + D AC + +
Sbjct: 67 GARIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTAS 125
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C QQ R +S ++ SCC+ C C G +W + G +
Sbjct: 126 AISDRHCTVGGVQQLR-ISAAHLMSCCEDC----GDGCDGGYPGTSWEYYVSHGLAS--- 177
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ CQP C HHG P C KC+T CT+ + K+R +
Sbjct: 178 ----SYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTD----KAIPLIKYRGNHS 229
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V ED K+E+ +GP F +Y DF YK+GVY+H S L H+ +++GWG
Sbjct: 230 YEVH-GEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGG--HAVRIVGWGK 286
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
NGTPYW + N+W WG G + LRG EC E AG P
Sbjct: 287 LNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSP 329
>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
Length = 348
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 115/344 (33%), Positives = 150/344 (43%), Gaps = 43/344 (12%)
Query: 8 LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSD 66
L+G L I+ IN+ N WTAG N YL + I K+
Sbjct: 21 LVGAASGDHSLRIIQKDIIETINKHPNAGWTAGHN-------AYLANYTIEQFKHILGVK 73
Query: 67 RPLPG-----DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
PG KTY S +P +FDAR +W C TIG + D G C + F AV
Sbjct: 74 PTPPGLLAGVPTKTYSK--SEELPKQFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECL 131
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
DR CI N LS + +CC D C G + W + + G VT
Sbjct: 132 QDRFCIHQ--NINISLSANDLVACCGFMCGD---GCDGGYPIKAWQYFVQSGVVTEECDP 186
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y D+ GC+ P A P CE KC + + + + KH +
Sbjct: 187 YFDQVGCKHPGCEP------AYDTPKCEK------KCKVQ------NQVWEEKKHFSINA 228
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V+ + I E+ +GP F +Y+DF HYKSGVYKH + + H+ KLIGWGT
Sbjct: 229 YRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGGVMGG--HAVKLIGWGT 286
Query: 300 EN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ G YWL+ N W WGD G KI+RGK EC E + AG P
Sbjct: 287 SDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEEVVAGMP 330
>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 351
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 113/344 (32%), Positives = 149/344 (43%), Gaps = 43/344 (12%)
Query: 8 LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSD 66
L+G L + I+ IN+ N WTAG N Y + I K+
Sbjct: 24 LVGAARGDNSLRIIQNDIIETINKHPNAGWTAGHN-------PYFANYTITQFKHI-LGV 75
Query: 67 RPLP----GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
+P P T S +P FDAR QW C TIG + D G C + F AV
Sbjct: 76 KPTPPALLAGVPTKSYSRSMKLPTEFDARSQWSGCSTIGTILDQGHCGSCWAFGAVECLQ 135
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
DR CI N LS + +CC +C C+ G W + ++G VT
Sbjct: 136 DRFCIHL--NMNISLSVNDLLACCGFLC----GSGCNGGYPISAWRYFRRKGVVTDECDP 189
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y D+ GC+ H G P + PK + + N + + KH +
Sbjct: 190 YFDQVGCK--------HPGCEPAY------RTPKCEKKCKVQNEVWK----EQKHFSVDA 231
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V N I E+ +GP F +Y+DF HYKSGVYKH + + H+ KLIGWGT
Sbjct: 232 YRVHSNPHDIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGG--HAVKLIGWGT 289
Query: 300 EN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ G YWL+ N W WGD G KI+RGK EC E + AG P
Sbjct: 290 SDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMP 333
>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
Length = 280
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 91/242 (37%), Positives = 123/242 (50%), Gaps = 30/242 (12%)
Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHG-------S 161
C + F+ SDR CI +KG Q +S + +CC +SC G
Sbjct: 61 CGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACC-------GRSCGDGCEGGYPIQ 113
Query: 162 VFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT 221
FR WN RG VTGGD+ +GC+P +PC+ + C +K P C C
Sbjct: 114 AFRWWN---SRGVVTGGDF-RGSGCRPYPFAPCNSY-------KCPEEKTPT--CSLSC- 159
Query: 222 NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTS 281
Y + +DK Y V N AI+ EI+ +GP F +Y+D Y YKSGVY+HT+
Sbjct: 160 QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTA 219
Query: 282 NAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
L H+ K+IGWGT+NG PYWL+ N+WG WG+ G +K+ RG EC E + AG
Sbjct: 220 GRLLGG--HAIKIIGWGTQNGIPYWLIANSWGADWGENGFLKMRRGVNECGIESAVVAGM 277
Query: 342 PK 343
PK
Sbjct: 278 PK 279
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/65 (46%), Positives = 46/65 (70%), Gaps = 2/65 (3%)
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
+ +GP A+F +Y+DFY YK GVY++T+ + +H+ K++GWGTE+GT YWL+ N+WG
Sbjct: 1 MTNGPVEASFTVYEDFYIYKKGVYQYTAGQVVG--VHAIKIMGWGTEHGTDYWLIANSWG 58
Query: 314 PHWGD 318
G
Sbjct: 59 AQCGS 63
>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 148/336 (44%), Gaps = 41/336 (12%)
Query: 15 RGELYKFSDAYIDQINR-EANTWTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
+ E D+ + Q+N E W A N F ++ R + + D P+
Sbjct: 34 KAESAILQDSIVKQVNENEKAGWKAALNPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILT 93
Query: 72 DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
K + +P FDAR W NC TIG + D G C + F AV + SDR CI
Sbjct: 94 HPKLLE------LPQEFDARVAWSNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYG- 146
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPS 189
N LS + +CC D C G + W + ++G VT Y D GC
Sbjct: 147 -LNISLSANDLYACCGFLCGD---GCDGGYPLQAWKYFVRKGVVTDECDPYFDNEGC--- 199
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCT--NPTYGRGFFQDKHRTTLTYWVDDNED 247
SH G P P+ KCH +C N + R KH Y + +
Sbjct: 200 -----SHPGCEPAYPT--------PKCHRKCVKQNLLWSR----SKHFGVNAYMISSDPH 242
Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYW 306
+I E+ +GP +F +Y+DF HYKSGVYKH + + H+ KLIGWGT E+G YW
Sbjct: 243 SIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGG--HAVKLIGWGTSEDGEDYW 300
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
L+ N W WGD G KI RG EC E + AG P
Sbjct: 301 LLANQWNRGWGDDGYFKIRRGTNECEIEDEVVAGLP 336
>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/343 (31%), Positives = 153/343 (44%), Gaps = 31/343 (9%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
LV L L+ + + ++D+IN+ W A N ++ A+AK
Sbjct: 14 LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEAKRLT 66
Query: 64 ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
Q LP R T + + +P+ FDA E WP+C TI + D AC A +
Sbjct: 67 GAWIQKSSTLPPARFT-EEQLRTKLPETFDAAEHWPHCPTIREIADQSACRASWAVSTAS 125
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C G+Q R +S + SCCK C C G W + + G +
Sbjct: 126 AISDRYCTVGGGKQLR-ISAADLLSCCKQC----GDGCKGGFPGFAWLYYVEYGIAS--- 177
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+GCQP C H G+ C K KC+ CT+ + K+R T
Sbjct: 178 ----SGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTD----KSIPLVKYRGNAT 229
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + E+ K+E+ +GP A F +Y D + YKSGVY++ L + +++GWG
Sbjct: 230 YLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG--QAVRIVGWGK 287
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
NGTPYW V N+W WG G + ILRG EC E+L G P
Sbjct: 288 LNGTPYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFP 330
>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
Length = 358
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 148/328 (45%), Gaps = 37/328 (11%)
Query: 21 FSDAYIDQINREANT-WTAGRN-FPANLSEEYLRQFL-IADAKYFDQSDRPLPGDRKTYD 77
D I IN+ N WTA RN + AN + + L + + +D P+ KTY
Sbjct: 42 IQDDIIKAINKHPNAGWTAARNPYFANYTTAQFKHILGVKPTPHSVLNDVPV----KTY- 96
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
S +P FDAR W C TIG + D G C + F AV DR CI N L
Sbjct: 97 -PRSLMLPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQDRFCIHF--NMNISL 153
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
S + +CC D C G W + + G VT Y D+ GC+
Sbjct: 154 SVNDLVACCGFMCGD---GCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK-------- 202
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G P P+ V + KC + + + + KH + Y V+ + I E+
Sbjct: 203 HPGCEPAYPT----PVCEKKCKVQ------NQVWLEKKHFSVNAYRVNSDPHDIMAEVYQ 252
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGP 314
+GP F +Y+DF HYKSGVYKH + + H+ KLIGWGT + G YWL+ N W
Sbjct: 253 NGPVEVAFTVYEDFAHYKSGVYKHITGGMMGG--HAVKLIGWGTTDAGEDYWLLANQWNR 310
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
WGD G KI+RG EC E + AG P
Sbjct: 311 GWGDDGYFKIIRGTNECGIEEDVVAGMP 338
>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 107/344 (31%), Positives = 153/344 (44%), Gaps = 31/344 (9%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
LV L L+ + + ++D IN+ W A N ++ ++AK
Sbjct: 15 LVALGASALLAKDAPVLTKTFVDHINQLNGGMWRAVYN-------GKMQNITFSEAKRLT 67
Query: 64 ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
Q LP R T + + +P+ FDA E WP+C TI + D C A +
Sbjct: 68 GARIQKSSALPPARFT-EEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTAS 126
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C KG+Q R +S ++ SCCK C C G W + + G +
Sbjct: 127 AISDRYCTVGKGKQLR-ISAAHLLSCCKDC----GDGCKGGFPGFAWRYYVEYGITS--- 178
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ CQP C H G+ C KC+ CT+ + K+R T
Sbjct: 179 ----SSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTD----KSVPLIKYRGNAT 230
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + E+ K+E+ +GP A F +Y D + YKSGVY++ L + K++GWG
Sbjct: 231 YLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRNVDGDFLGG--TAVKVVGWGK 288
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
NGTPYW V N+W WG G + ILRG EC E+L AG P+
Sbjct: 289 LNGTPYWKVANSWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 332
>gi|17510377|ref|NP_490763.1| Protein Y65B4A.2 [Caenorhabditis elegans]
gi|373220066|emb|CCD71920.1| Protein Y65B4A.2 [Caenorhabditis elegans]
Length = 421
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 96/282 (34%), Positives = 132/282 (46%), Gaps = 50/282 (17%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S+ VP FDAR++WPNC +I +VP+ G C + AA G SDR CI S G LS E
Sbjct: 135 SSDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEE 194
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ CC +C +C G + + +G VTGG R GC+P +
Sbjct: 195 DIIGCCSVC-----GNCYGGDPLKALTYWVNQGLVTGG----RDGCRPYSFDLSC---GV 242
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW------------------- 241
P P+ + K C RC N Y + + +DKH T Y
Sbjct: 243 PCSPATFFEAEEKRTCMKRCQNIYYQQKYEEDKHFATFAYSMYPRSMTVSPDGKERVKVP 302
Query: 242 ---------------VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
V + D IKKEIL +GPTT F + ++F HY SGV++ +
Sbjct: 303 TIIGHFNDKKTEKLNVTEYRDIIKKEILLYGPTTMAFPVPEEFLHYSSGVFRPYPTDGFD 362
Query: 287 N---YLHSGKLIGWG-TENGTPYWLVINTWGPHWGDRGTVKI 324
+ Y H +LIGWG +++GT YWL +N++G HWGD G KI
Sbjct: 363 DRIVYWHVVRLIGWGESDDGTHYWLAVNSFGNHWGDNGLFKI 404
>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
Length = 317
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/332 (31%), Positives = 146/332 (43%), Gaps = 23/332 (6%)
Query: 13 LVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
LV + S A++D++NR W A + + LR+ + ++ +
Sbjct: 1 LVAEDAPVLSKAFVDRVNRLNRGIWKA--KYDGVMQNITLREAKRLNGVIKKNNNASILP 58
Query: 72 DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
R+ + E A +P FD+ E WPNC TI + D AC + AA A SDR C G
Sbjct: 59 KRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG-G 117
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
Q+ +S + +CC C C+ G R W + G V+ DY CQP
Sbjct: 118 VQDVHISAGDLLACCSDC----GDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPF 166
Query: 192 SPCSHHGSAPT-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
CSHH + P C KC+ C +PT + + T + ED
Sbjct: 167 PHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDPT-----IPVVNYRSWTSYALQGEDDYM 221
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
+E+ GP F +Y+DF Y SGVY H S L H+ +L+GWGT NG PYW + N
Sbjct: 222 RELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGG--HAVRLVGWGTSNGVPYWKIAN 279
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+W WG G I RG EC E +AG P
Sbjct: 280 SWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 311
>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
Length = 350
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 91/275 (33%), Positives = 136/275 (49%), Gaps = 14/275 (5%)
Query: 47 SEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDT 106
+EE + + D + ++ R L +K + + +P+ FD+R W NC +I +V D
Sbjct: 60 AEERMAHLMKTD---YIRNARKLYKVKKAEEQTTNEDIPESFDSRIVWKNCSSITYVRDQ 116
Query: 107 GACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRT 165
C + +A SDR C+++KG+ LS + SCC ++C C G
Sbjct: 117 SRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDILSCCGRMC----GDGCEGGYDHLA 172
Query: 166 WNFLHKRGSVTGGDYGDRTGCQPSTISPCS-HHGSAPTLPSCENQKVPKLKCHTRCTNPT 224
W ++ + G VTGG Y + C+P PC HHG P + P C C
Sbjct: 173 WEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTP--ACKPYC-QFG 229
Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
YG+ + +DK TY +D++E I++E++ +GP A F Y+DF YK G+Y H
Sbjct: 230 YGKRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAFITYEDFSPYKGGIYVHVKGR- 288
Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
E H+ KLIGWG ENGT YW V N+W WG +
Sbjct: 289 -ERGAHAVKLIGWGVENGTKYWTVANSWHDDWGGK 322
>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
Length = 279
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 94/260 (36%), Positives = 127/260 (48%), Gaps = 9/260 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR W NC TI + D C A A V + SDR CI+S G+ + LS
Sbjct: 28 IPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRISVQLSARDAI 87
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + C HGS + G VTGG Y D++GCQP + CS+H + L
Sbjct: 88 SC------GFSPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFL 141
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C N +C C + Y + + DK Y V ++ I+KEIL +GP A+
Sbjct: 142 -DCNNNTFEFPQCTNECQD-GYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIASI 199
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
++ DF YKSGVY T ++ ++ + ++IGWG E PYWL N+W WG G VK
Sbjct: 200 SVNTDFLVYKSGVYLPTPRSRNLGWI-TLRIIGWGYEGKIPYWLCANSWNEEWGANGYVK 258
Query: 324 ILRGKYECAFEYLIAAGKPK 343
I RG E + A PK
Sbjct: 259 IQRGVQAGYIESYVRAPIPK 278
>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 98/266 (36%), Positives = 129/266 (48%), Gaps = 30/266 (11%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDAR WP+C TIG + D G C + F AV + SDR CI N LS
Sbjct: 80 SMKLPIEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYG--MNLSLSVN 137
Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
+ +CC +C C GS W + + G VT Y D GC SH
Sbjct: 138 DLLACCGWMC----GAGCDGGSPIDAWRYFVQSGVVTEECDPYFDDIGC--------SHP 185
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
G P P+ KC +C + + + + KH + Y +D + +I E+ ++G
Sbjct: 186 GCEPGFPT--------PKCERKCADKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSSNG 235
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHW 316
P F +Y+DF HYKSGVYKH + + H+ KLIGWGT E+G YWL+ N W W
Sbjct: 236 PVEVAFTVYEDFAHYKSGVYKHITGDAMGG--HAVKLIGWGTSEDGEDYWLLANQWNRGW 293
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
GD G KI RG EC E + AG P
Sbjct: 294 GDDGYFKIKRGTNECGIEGAVVAGLP 319
>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
Length = 333
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 103/330 (31%), Positives = 153/330 (46%), Gaps = 25/330 (7%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
R L S ++ IN+ TW AG NF N+ Y+++ L G +
Sbjct: 19 RPHLKPLSSDMVNYINKLNTTWKAGHNF-NNVDYSYVQKLC----------GTMLKGPKL 67
Query: 75 TYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
+YS +P FD+REQWPNC T+ + D G+C + F A A SDR CI S G+
Sbjct: 68 PVLVQYSGDMKLPKNFDSREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRLCIHSNGK 127
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
+ +S+E + +CC C C+ G W+F G V+GG Y GC+P TI
Sbjct: 128 VSVEISSEDLLTCCDSC----GMGCNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIP 183
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
PC HH + T P C + +C +C + Y + DKH +Y V +E+ I+ E
Sbjct: 184 PCEHHVNG-TRPPCTGEGGDTPQCILQCES-GYTPSYKADKHYGKSSYSVPSDEEQIQSE 241
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
I +GP F +Y+DF YK+GVY+H + + + + + W E + ++
Sbjct: 242 IYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVGGH----AIKSWLGEEVCSLLALCHS- 296
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WGD ++ G C E I AG P
Sbjct: 297 DTDWGDMVSLSS-AGSDHCGIESEIVAGIP 325
>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
Length = 305
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 144/324 (44%), Gaps = 39/324 (12%)
Query: 26 IDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDR---KTYDPEYS 81
I +N N WTAG N YL + I K+ PG R +T S
Sbjct: 2 IQTVNNHPNAGWTAGHN-------PYLANYTIEQFKHMLGVKPTPPGLRAAVRTKTHSRS 54
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P FDAR +W C TIG + D G C + F AV DR CI N LS
Sbjct: 55 EQLPKVFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHH--NMNITLSAND 112
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGS 199
+ +CC D C G W + + G VT Y D+ GC+ H G
Sbjct: 113 LVACCGFMCGD---GCDGGYPISAWQYFVQNGVVTDECDPYFDQVGCK--------HPGC 161
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
P P+ V + KC + + + + KH + Y V+ + I E+ +GP
Sbjct: 162 EPAYPT----PVCEKKCKVQ------NQVWEEKKHFSINAYQVNSDPHDIMAEVYNNGPV 211
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGD 318
F +Y+DF HYKSGVYKH + + H+ KLIGWGT + G YWL+ N W WGD
Sbjct: 212 EVAFTVYEDFAHYKSGVYKHITGGVMGG--HAVKLIGWGTSDAGEDYWLLANQWNRGWGD 269
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
G KI+RGK EC E + AG P
Sbjct: 270 DGYFKIIRGKNECGIEEDVTAGMP 293
>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 110/331 (33%), Positives = 147/331 (44%), Gaps = 43/331 (12%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
D+ + ++N W A N + + +A KY +P P + P
Sbjct: 41 LQDSILKKVNGNPKAGWKATMN-------HHFSNYTVAQFKYL-LGVKPTPKEELRGIPV 92
Query: 80 YS----ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
S +P+ FDAR WP C TIG + D G C + F AV + SDR CI N
Sbjct: 93 ISHPKSLRLPEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYG--MNI 150
Query: 136 PLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTIS 192
LS + +CC +C C+ G W + G VT Y D GC
Sbjct: 151 SLSVNDLLACCGFLC----GSGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGC------ 200
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
SH G P P+ KC +C N + + + KH Y +D + ++I E
Sbjct: 201 --SHPGCEPGYPT--------PKCARKCVNKN--QLWKKSKHYGVKPYRIDSDPESIMAE 248
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINT 311
I +GP F +Y+DF HYKSGVYKH + + H+ KLIGWGT E+G YWL+ N
Sbjct: 249 IYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGG--HAVKLIGWGTSEDGEAYWLLANQ 306
Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
W WGD G KI RG EC E + AG P
Sbjct: 307 WNRGWGDDGYFKIRRGTNECGIEGDVVAGLP 337
>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
Length = 325
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 105/333 (31%), Positives = 145/333 (43%), Gaps = 23/333 (6%)
Query: 12 TLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
LV + S A++D++NR W A + + LR+ + ++ +
Sbjct: 1 ALVAEDAPVLSKAFVDRVNRLNRGIWKA--KYDGVMQNITLREAKRLNGVIKKNNNASIL 58
Query: 71 GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
R+ + E A +P FD+ E WPNC TI + D AC + AA A SDR C
Sbjct: 59 PKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG- 117
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
G Q+ +S + +CC C C+ G R W + G V+ DY CQP
Sbjct: 118 GVQDVHISAGDLLACCSDC----GDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYP 166
Query: 191 ISPCSHHGSAPT-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
CSHH + P C KC C +PT + + T + ED
Sbjct: 167 FPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPT-----IPVVNYRSWTSYALQGEDDY 221
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
+E+ GP F +Y+DF Y SGVY H S L H+ +L+GWGT NG PYW +
Sbjct: 222 MRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGG--HAVRLVGWGTSNGVPYWKIA 279
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
N+W WG G I RG EC E +AG P
Sbjct: 280 NSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 312
>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
putative [Trypanosoma brucei gambiense DAL972]
Length = 340
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 103/324 (31%), Positives = 143/324 (44%), Gaps = 23/324 (7%)
Query: 21 FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S A++D++NR W A + + LR+ + ++ + R+ + E
Sbjct: 32 LSKAFVDRVNRLNRGIWKA--KYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEE 89
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
A +P FD+ E WPNC TI + D AC + AA A SDR C G Q+ +S
Sbjct: 90 ARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG-GVQDVHISA 148
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ +CC C C+ G R W + G V+ DY CQP CSHH
Sbjct: 149 GDLLACCSDC----GDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSK 197
Query: 200 APT-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P C KC+ C +PT + + T + ED +E+ GP
Sbjct: 198 SKNGYPPCSQFNFDTPKCNYTCDDPT-----IPVVNYRSWTSYALQGEDDYMRELFFRGP 252
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +Y+DF Y SGVY H S L H+ +L+GWGT NG PYW + N+W WG
Sbjct: 253 FEVAFDVYEDFIAYNSGVYHHVSGQYLGG--HAVRLVGWGTSNGVPYWKIANSWNTEWGM 310
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
G I RG EC E +AG P
Sbjct: 311 DGYFLIRRGSSECGIEDGGSAGIP 334
>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 1308
Score = 154 bits (389), Expect = 6e-35, Method: Composition-based stats.
Identities = 93/267 (34%), Positives = 128/267 (47%), Gaps = 29/267 (10%)
Query: 66 DRPLPGDRKTYDP-EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
DRP +K Y ++ +P FDA +QWP C TIG + + C + F A+ + SDR
Sbjct: 55 DRP----KKIYKTLPHNVNLPTNFDAAQQWPQCPTIGAIQNQAECGSCWAFGAIESISDR 110
Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
CI ++ LS + + +C + + C G + + ++ K G VT +
Sbjct: 111 FCIHK--NESVQLSFQDLITC-----DNQDNGCEGGDPYTAYKYVQKNGVVT-------S 156
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
CQP TI C P C N V C +C N + F QD H Y V
Sbjct: 157 NCQPYTIPTCP-----PAQQPCMN-FVNTPPCSAKCANSSVN--FQQDLHHLKTVYAVKP 208
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
N AI+ EI+ +GP A F +Y+DF YKSGVY H S L H K++G+G NGTP
Sbjct: 209 NVAAIQNEIVTNGPVEACFEVYEDFLGYKSGVYTHKSGKDLGG--HCIKIVGFGVSNGTP 266
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYEC 331
YW+ N+W WG+ G I GK EC
Sbjct: 267 YWICNNSWTTSWGNNGIFWIEAGKNEC 293
>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
Length = 353
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 98/314 (31%), Positives = 147/314 (46%), Gaps = 25/314 (7%)
Query: 29 INREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRF 88
+ N+WTAG + L + + +S R PG + +P++F
Sbjct: 52 VRNRTNSWTAG------APRQPLSSYRVGVNMEELESKRLKPG---ILILKEDIDLPEQF 102
Query: 89 DAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKI 148
DAR++WP C ++ + + G C + +A AF+DR CI S + + SCC
Sbjct: 103 DARDKWPQCPSLREIRNQGCCGSCWAISAAEAFTDRWCIHSPEHTTFSFGSFDLISCCHS 162
Query: 149 CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCEN 208
C C G + W++ ++G +GG Y + GC C +P E+
Sbjct: 163 C----GDGCQGGVLGPAWDYWVQKGVSSGGPYNSKQGCHSYPFDTCH----SPD----ED 210
Query: 209 QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDD 268
PK C +C + + +D+ + Y V +E I +EI +GP A F +Y D
Sbjct: 211 DDAPK--CSRKCQSSYSVQDVSKDRRFGRVAYSVVADEHRIMEEIFVNGPVQAAFQVYLD 268
Query: 269 FYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
F YKSGVY+H + LE H+ K++GWG ENGT YWL N+WG WGD G KI+RG+
Sbjct: 269 FKTYKSGVYRHVT-GPLEGG-HAIKILGWGVENGTKYWLCSNSWGEDWGDHGFFKIVRGE 326
Query: 329 YECAFEYLIAAGKP 342
E + AG P
Sbjct: 327 NHLGIETDVHAGLP 340
>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
Free-electron Laser Pulse Data By Serial Femtosecond
X-ray Crystallography
gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 340
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 103/324 (31%), Positives = 143/324 (44%), Gaps = 23/324 (7%)
Query: 21 FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S A++D++NR W A + + LR+ + ++ + R+ + E
Sbjct: 32 LSKAFVDRVNRLNRGIWKA--KYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEE 89
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
A +P FD+ E WPNC TI + D AC + AA A SDR C G Q+ +S
Sbjct: 90 ARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG-GVQDVHISA 148
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ +CC C C+ G R W + G V+ DY CQP CSHH
Sbjct: 149 GDLLACCSDC----GDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSK 197
Query: 200 APT-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P C KC+ C +PT + + T + ED +E+ GP
Sbjct: 198 SKNGYPPCSQFNFDTPKCNYTCDDPT-----IPVVNYRSWTSYALQGEDDYMRELFFRGP 252
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F +Y+DF Y SGVY H S L H+ +L+GWGT NG PYW + N+W WG
Sbjct: 253 FEVAFDVYEDFIAYNSGVYHHVSGQYLGG--HAVRLVGWGTSNGVPYWKIANSWNTEWGM 310
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
G I RG EC E +AG P
Sbjct: 311 DGYFLIRRGSSECGIEDGGSAGIP 334
>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
Length = 225
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 84/220 (38%), Positives = 121/220 (55%), Gaps = 7/220 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+PD FD+R QWPNC TI + D G+C + F AV + SDR C+ S G+QN +S E +
Sbjct: 13 LPDNFDSRTQWPNCPTIREIRDQGSCGSCWAFGAVESMSDRVCVHSGGKQNVEVSAEDLL 72
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC ++ C+ G W + ++G V+GG YG GC+P TI PC HH + +
Sbjct: 73 SCCG---FECGMGCNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPPCEHHVNG-SR 128
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
PSC + KC +C + Y + +DK Y V + ++I +EI GP F
Sbjct: 129 PSCSGEGGDTPKCVQKC-DSGYTPAYEKDKIYGQSAYSVPSSPESIMEEIYKDGPVEGAF 187
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
+Y+DF YKSGVY+H + + H+ K++GWG EN T
Sbjct: 188 TVYEDFLLYKSGVYQHHTGEAVGG--HAIKILGWGIENNT 225
>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
marinkellei]
Length = 333
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 109/349 (31%), Positives = 159/349 (45%), Gaps = 33/349 (9%)
Query: 2 IHILVFLLGCTLVRG----ELYKFSDAYIDQINR-EANTWTAGR-NFPANLSEEYLRQFL 55
I + +FLL T V + +D +++ +N WTAGR + +L+ + L
Sbjct: 9 IALFLFLLYATAVHALHVDDAPILTDEFLEHVNSLNGGKWTAGRTSRTKHLTRREASRLL 68
Query: 56 IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
+ + P R+ + E + D+FDA E WPNC TI + D +C +
Sbjct: 69 ---GTFLGNTSILAP--RQFSEAELRVRLEDKFDAAEAWPNCPTITEIRDQSSCGSCWAV 123
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
AA A SDR C G ++ +S + SCC +C Y C+ G W F G V
Sbjct: 124 AAASAMSDRYCTLG-GVRDLRISAGDLMSCCDVCGY----GCNGGFPEVAWVFYVVHGLV 178
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKH 234
+ +Y CQP C+HH ++ L C + K PK C++ CT ++ H
Sbjct: 179 S--EY-----CQPYPFPSCAHHVNSSDLAPCSGDYKTPK--CNSTCTEKKIPLIRYRGNH 229
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
L+ E+ K+E+L +GP F +Y DF Y GVYKH + L H+ +L
Sbjct: 230 SYVLS-----GEEHFKRELLLNGPFEVAFEVYADFMAYTGGVYKHVAGDLLGG--HAVRL 282
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+GWG NG PYW + N+W WG G I RG EC E AG P+
Sbjct: 283 VGWGELNGEPYWKIANSWNHEWGMNGYFLIARGVNECGIESNGVAGTPR 331
>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 152/344 (44%), Gaps = 31/344 (9%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
LV L L+ + + ++D IN+ W A N ++ ++AK
Sbjct: 15 LVALGASALLAKDAPVLTKTFVDHINQLNGGMWKAVYN-------GKMQNITFSEAKRLT 67
Query: 64 ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
Q L R T + + +P+ FDA E WP+C TI + D C A +
Sbjct: 68 GARIQKSSGLQPARFT-EEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTAS 126
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C KG+Q R +S ++ SCCK C C G W + + G +
Sbjct: 127 AISDRYCTVGKGKQLR-ISAAHLLSCCKDC----GDGCKGGFPGFAWRYYVEYGITS--- 178
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ CQP C H G+ C KC+ CT+ + K+R T
Sbjct: 179 ----SSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTD----KAIPLIKYRGNAT 230
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + E+ K+E+ +GP A F +Y D + YKSGVY+H L + K++GWG
Sbjct: 231 YLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGG--TAVKVVGWGK 288
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
NGTPYW + N+W WG G + ILRG EC E+L AG P+
Sbjct: 289 LNGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPE 332
>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 92/282 (32%), Positives = 137/282 (48%), Gaps = 16/282 (5%)
Query: 63 DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
+Q+ P+ D D + A +P+ +D R W NC + + D C + + A S
Sbjct: 72 NQNLNPVVND----DNDTGADLPENYDPRIVWKNCSSFHTIRDQANCGSCWAVSTAAAIS 127
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
DR CI +KG++ S + +CC C C G W F G V+GG Y
Sbjct: 128 DRICIATKGKKQVYASDTDILTCCGARC----GLGCRGGWPIEAWKFFEYDGVVSGGPYL 183
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR---TTL 238
+ C P + PC HG+ +C P C +C P + RG ++ R
Sbjct: 184 GKGCCSPYPLHPCGRHGNDTFYGNCVGM-APTPPCKRKC-QPGF-RGMYRVDKRYGEPGR 240
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
TY + +E I+++I G A FA+Y+DF HY+SG+YKHT+ Y H+ K+IGWG
Sbjct: 241 TYTLPRSEVKIRRDIKERGSVVAVFAVYEDFSHYQSGIYKHTAGRFTGGY-HAVKMIGWG 299
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+NGT YWL+ N+W WG+ G +++RG C E + AG
Sbjct: 300 KDNGTDYWLIANSWHDDWGENGFFRMIRGINNCGIEEQVDAG 341
>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 111/327 (33%), Positives = 146/327 (44%), Gaps = 45/327 (13%)
Query: 26 IDQINREANT-WTAGRN-FPANLSEEYLRQFLIADAKYFDQSDRPLP-----GDRKTYDP 78
I +N N WTAG N + AN + E + L +P P G R P
Sbjct: 41 IQTVNNHPNAGWTAGHNPYLANYTIEQFKHML---------GVKPTPPGLLAGVRTKTHP 91
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
S +P FDAR +W C TIG + D G C + F AV DR CI N LS
Sbjct: 92 R-SEQLPKEFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHHN--MNISLS 148
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSH 196
+ +CC D C G W + + G VT Y D+ GC+ H
Sbjct: 149 ANDLVACCGFMCGD---GCDGGYPISAWQYFVQNGVVTEECDPYFDQVGCK--------H 197
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
G P P+ V + KC + + + + KH + Y V+ + I E+ +
Sbjct: 198 PGCEPAYPT----PVCEKKCKVQ------NQVWQEKKHFSIDAYQVNSDPHDIMAEVYKN 247
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPH 315
GP F +Y+DF HYKSGVYKH + + H+ KLIGWGT + G YWL+ N W
Sbjct: 248 GPVEVAFTVYEDFAHYKSGVYKHITGGVMGG--HAVKLIGWGTSDAGEDYWLLANQWNRG 305
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WGD G KI+RGK EC E + AG P
Sbjct: 306 WGDDGYFKIIRGKNECGIEEDVTAGMP 332
>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
Length = 350
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 106/327 (32%), Positives = 146/327 (44%), Gaps = 36/327 (11%)
Query: 21 FSDAYIDQINREANT-WTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
+ +++INR N W AG N F + ++ R + + P+ K +
Sbjct: 36 LKEPIVEEINRHPNAGWKAGMNSRFSNHTVGQFKRLLGVLPTPRNFLENVPVITYPKGMN 95
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+P +FDARE WP C ++ + D G C + F AV A SDR CI K N L
Sbjct: 96 ------LPKQFDAREAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHK--VNVTL 147
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
S + +CC D C G W + G VT Y D GCQ
Sbjct: 148 SENDLVACCGFMCGD---GCDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ-------- 196
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G P P+ +C +C + G K + Y + I E+
Sbjct: 197 HPGCEPLYPT--------PQCVKQCKDENQKWG--NSKRFSATAYRISSKPYDIMAEVYT 246
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP +F++Y+DF HYKSGVYK+T + H+ KL+GWGTE+GT YWLV N+W
Sbjct: 247 NGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGG--HAVKLVGWGTEDGTDYWLVANSWNTA 304
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G KI RG EC E + AG P
Sbjct: 305 WGEDGYFKIARGSNECGIEGDVVAGMP 331
>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
Length = 350
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 106/327 (32%), Positives = 146/327 (44%), Gaps = 36/327 (11%)
Query: 21 FSDAYIDQINREANT-WTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
+ +++INR N W AG N F + ++ R + + P+ K +
Sbjct: 36 LKEPIVEEINRHPNAGWKAGMNSRFSNHTVGQFKRLLGVLPTPRNFLENVPVITYPKGIN 95
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+P +FDARE WP C ++ + D G C + F AV A SDR CI K N L
Sbjct: 96 ------LPKQFDAREAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHK--VNVTL 147
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
S + +CC D C G W + G VT Y D GCQ
Sbjct: 148 SENDLVACCGFMCGD---GCDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ-------- 196
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G P P+ +C +C + G K + Y + I E+
Sbjct: 197 HPGCEPLYPT--------PQCVKQCKDENQKWG--NSKRFSATAYRISSKPYDIMAEVYT 246
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP +F++Y+DF HYKSGVYK+T + H+ KL+GWGTE+GT YWLV N+W
Sbjct: 247 NGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGG--HAVKLVGWGTEDGTDYWLVANSWNTA 304
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G KI RG EC E + AG P
Sbjct: 305 WGEDGYFKIARGSNECGIEGDVVAGMP 331
>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 108/345 (31%), Positives = 156/345 (45%), Gaps = 36/345 (10%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAK--- 60
LV L L+ + + ++D+IN+ W A + + ++ ++AK
Sbjct: 15 LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKA-------VYDGKMQNLTFSEAKRLT 67
Query: 61 -YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
F + LP R T + + +P+ FDA E WP+C TI + D AC A A
Sbjct: 68 GAFSRKTSTLPPARFT-EEQLRTDLPESFDAAEHWPHCPTIREIADQSACRASWAVATAS 126
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C KG+Q R +S + +CCK C C G W + G +
Sbjct: 127 AISDRYCTVGKGKQLR-ISAADLMACCKDC----GGGCEGGYPDAAWEYYVSHGIAS--- 178
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ CQP C H G+ C K +C+ CT+ T K+R +
Sbjct: 179 ----SQCQPYPFPRCEHRGAQGKKTPCSKYKFVTPQCNATCTDKT----IPLIKYRGNHS 230
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGW 297
Y V ED K+E+ +GP F ++ DF YK+GVY+H + N+L + +++GW
Sbjct: 231 YEVRGEED-YKRELYFNGPFVVRFQVHSDFLAYKNGVYQHVAG----NFLGGKAVRIVGW 285
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G NGTPYW V N+W WG G ILRG EC E+L AG P
Sbjct: 286 GKLNGTPYWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAGTP 330
>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
Length = 426
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 105/356 (29%), Positives = 159/356 (44%), Gaps = 67/356 (18%)
Query: 22 SDAYIDQINREAN-----TWTAGRN----------FPANLSEEYLRQFLIADAKYFDQSD 66
SD Y+ ++ R+ N TW A N F ++ + +++ K+F+
Sbjct: 68 SDEYLRKLVRQVNDSPETTWKAKFNKFGVKNRSYGFKYTRNQTAVEEYMEHIRKFFESD- 126
Query: 67 RPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRC 126
+ + + S+++P FDAR++WPNC +I +VP+ G C + AA G SDR C
Sbjct: 127 -AMKRHLEELENYKSSSLPKHFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRAC 185
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGC 186
I S G LS E + CC +C +C G + + +G VTGG R GC
Sbjct: 186 IHSNGTFKSLLSEEDIIGCCSVC-----GNCYGGDPLKALTYWVNQGLVTGG----RDGC 236
Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW----- 241
+P + P P+ + K C RC N Y + + +DKH T Y
Sbjct: 237 RPYSFDLSC---GVPCSPATFFEAEEKRTCMRRCQNIYYQQKYEEDKHFATFAYSLYPRS 293
Query: 242 -----------------------------VDDNEDAIKKEILAHGPTTATFALYDDFYHY 272
V + + IKKEIL +GPTT F + ++F HY
Sbjct: 294 MTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHY 353
Query: 273 KSGVYKHTSNAKLEN---YLHSGKLIGWG-TENGTPYWLVINTWGPHWGDRGTVKI 324
SGV++ ++ Y H +LIGWG +++G YWL +N++G HWGD G KI
Sbjct: 354 SSGVFRPFPLDGFDDRIVYWHVVRLIGWGESDDGQHYWLAVNSFGNHWGDNGIFKI 409
>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
Length = 276
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/274 (35%), Positives = 130/274 (47%), Gaps = 24/274 (8%)
Query: 81 SATVPDRFDAREQW----PNCGTIGHVP----DTGACAAPHIFAAVGAFSDRRCIKS--- 129
S +PD W +C +G VP G AP A G+ S + S
Sbjct: 8 SCLLPDLCGQGWGWRLFPASCAYLGSVPWRVWGLGGLLAPLAAAGGGSTSGLGHLGSTQW 67
Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
G+ LS ++ C SC+ G WNF ++G V+GG Y GC+P
Sbjct: 68 SGELVVLLSEVFITGCLF--------SCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY 119
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
+I PC HH + P PK C C P Y + QDKH +Y V ++E I
Sbjct: 120 SIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDI 176
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV
Sbjct: 177 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVA 234
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
N+W WGD G KILRG+ C E + AG P+
Sbjct: 235 NSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 268
>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
Length = 348
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 104/345 (30%), Positives = 153/345 (44%), Gaps = 40/345 (11%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY---- 76
+ A DQ+ R N +NL+ + L +L F D P K
Sbjct: 13 LTSAAEDQVARP-------NNVESNLTGDPLVVYLNTIQGLFHLKDSQSPDTEKKLMSAK 65
Query: 77 -----------DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
D + ++P FD R W C ++ + D C + +A SDR
Sbjct: 66 YKHTVDICGREDRSLALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRI 124
Query: 126 CIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
C++S +S + SCC + C Y C+ G W G+ TGG D+
Sbjct: 125 CVQSNCSIKACISDTDILSCCGLYCGY----GCNGGFPIEAWRHFTVAGNCTGGKTIDKY 180
Query: 185 GCQP-STISPCSHHGSAPTLPSCENQ--------KVPKLKCHTRCTNPTYGRGFFQDKHR 235
GC+P P H C N +C RC Y + + D++
Sbjct: 181 GCKPYKPTGPIGRHLKRNDYAPCPNDTYYGECVGMADTPRCKRRCL-LGYPKSYPSDRYY 239
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
Y V + AI++EI+ +GP A+FA+Y+DF HYKSG+YKHT+ +L Y H+ K+I
Sbjct: 240 GKSAYIVKQSVKAIQREIMKNGPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKII 297
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
GWG EN T +WL+ N+W WG++G +I+RGK EC E + AG
Sbjct: 298 GWGKENNTDFWLIANSWHQDWGEKGYFRIVRGKNECGIETDVVAG 342
>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
Length = 278
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 86/223 (38%), Positives = 112/223 (50%), Gaps = 7/223 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDARE+WP C +I +PD +C + A VGA SDR CI S G LS +
Sbjct: 63 LPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAIDLV 122
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SCC C C GS W++ + G VTGG + TGC P C H GS L
Sbjct: 123 SCCSYC----GNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQL 178
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C P C+ C Y + + +DK +Y VD +E I +EI+ +GP A F
Sbjct: 179 NPCPGYIYPTPSCYPYC-QAGYDKTYEEDKVYGKTSYNVDRHEYTIMQEIMKNGPVEAGF 237
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
+Y DF YKSG+Y H S H+ ++IGWG ENG YW
Sbjct: 238 IVYTDFAVYKSGIYHHVSGRYAGK--HAIRIIGWGVENGVNYW 278
>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
Length = 345
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 139/292 (47%), Gaps = 21/292 (7%)
Query: 53 QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
Q + D ++ +Q+ +P+ + D + +P+ FDAR W NC ++ H+ D C +
Sbjct: 67 QHKLMDLRFVNQNRKPVVENADDEDDD----IPESFDARTHWANCTSLRHIRDQANCGSC 122
Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
+ A SDR CI SKG+ +S+ + SCCK+C Y C G +++ ++
Sbjct: 123 WAVSTASALSDRICIASKGETQLHISSIDIVSCCKLCGY----GCDGGWPIEAFDYFSRQ 178
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPT----LPSCENQKVPKLKCHTRCTNPTYGRG 228
G+VTG + + GC+P P +G+ C++ K N T G
Sbjct: 179 GAVTG-ETTSKDGCRPYPFHPLWTYGNDTVGRRMSGRCKHSKTVGEGVKRVTRNHTRRTG 237
Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
+ R T D +GP A F +Y+DF +YK G+Y H A
Sbjct: 238 LTARRLRITEFCQSHSEGDH------GNGPVVAVFTVYEDFSYYKKGIYVHI--AGKARG 289
Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
H+ K+IGWG ENG PYWL+ N+W WG++G +I+RG EC E + AG
Sbjct: 290 AHAIKIIGWGVENGLPYWLIANSWHDDWGEQGLFRIVRGINECGIEQEVVAG 341
>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 113/344 (32%), Positives = 148/344 (43%), Gaps = 41/344 (11%)
Query: 8 LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSD 66
L+G L + I +N N WTAG N YL + I K+
Sbjct: 22 LVGAARGDHSLPIIQEDIIRTVNSHPNAGWTAGHN-------PYLANYTIEQFKHILGVK 74
Query: 67 RPLPG-----DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
PG KTY A +P FDAR +W C TIG + D G C A F AV
Sbjct: 75 PTPPGLLAGVPTKTYSRSEKAELPKEFDARSKWSGCSTIGKILDQGHCGACWAFGAVECL 134
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
DR CI N LS + +CC D C G W + + G VT
Sbjct: 135 QDRFCIHHS--VNVSLSVNDLVACCGFLCGD---GCDGGYPIFAWQYFVENGVVTDECDP 189
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ D+ GCQ H G P P+ V + KC + + + + KH +
Sbjct: 190 FFDQVGCQ--------HPGCEPAYPT----PVCEKKCKVQ------NQVWEEKKHFSIDA 231
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V+ + I E+ +GP +F +Y+DF HYKSGVYK + + H+ KLIGWGT
Sbjct: 232 YQVNSDPHDIMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQITGRMVGG--HAAKLIGWGT 289
Query: 300 EN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ G YWL+ N W WGD G KI+RG EC E + AG P
Sbjct: 290 SDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEGDVNAGMP 333
>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
Length = 272
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/261 (35%), Positives = 128/261 (49%), Gaps = 36/261 (13%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR +W C + D G C + FA+ SDR CI+++G N LS+E +
Sbjct: 43 IPKSFDARMEWSTCVRSHKIHDQGHCGSCWAFASTEVLSDRLCIQTRGSTNIILSSEDLL 102
Query: 144 SCCKICRYDDNKSCSHGS-VFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SC K R CS G + W ++ K+G V C+P T G+
Sbjct: 103 SCDKAGR-----GCSDGGRLSEAWRYMQKKGVVA-------NRCKPYT------SGATGF 144
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
+P +C ++CT + F + T++ E+ IK EI+ +GP A
Sbjct: 145 IP----------ECMSKCTGEGHAYQKFYGLYLYTVS-----GENQIKVEIMTNGPVEAA 189
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F +Y D HYKSGVY HTS KL H+ K++GWG E+ YWLV N+WGP WGD+G
Sbjct: 190 FTVYSDIVHYKSGVYHHTSGGKLGG--HAVKVLGWGVEDEEEYWLVANSWGPDWGDQGFF 247
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KI RG EC E + G +
Sbjct: 248 KIKRGSDECGIESRVLTGTAR 268
>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
Length = 356
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 99/266 (37%), Positives = 128/266 (48%), Gaps = 30/266 (11%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDAR W C TIG + D G C + F AV + SDR CI N LS
Sbjct: 97 SLKLPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHF--DMNVSLSVN 154
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHG 198
+ +CC + C+ G+ F W +L G VT Y D+ GC SH G
Sbjct: 155 DILACCGLLC---GAGCAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC--------SHPG 203
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-DKHRTTLTYWVDDNEDAIKKEILAHG 257
PT + PK C +C N G ++ KH + Y V+ + I E+ +G
Sbjct: 204 CEPTY------RTPK--CVKKCVN---GNQLWETSKHYSVKAYTVNSDPQDIMAEVYKNG 252
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHW 316
P F +Y+DF HYKSGVYKH + L H+ KL+GWGT + G YWL+ N W +W
Sbjct: 253 PVEVAFTVYEDFAHYKSGVYKHITGFALGG--HAVKLVGWGTSHEGEDYWLLANQWNTNW 310
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
GD G KI RG EC E + AG P
Sbjct: 311 GDDGYFKIKRGTNECGIENAVTAGLP 336
>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
Length = 351
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 99/266 (37%), Positives = 128/266 (48%), Gaps = 30/266 (11%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDAR W C TIG + D G C + F AV + SDR CI N LS
Sbjct: 92 SLKLPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHF--DMNVSLSVN 149
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHG 198
+ +CC + C+ G+ F W +L G VT Y D+ GC SH G
Sbjct: 150 DILACCGLLC---GAGCAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC--------SHPG 198
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-DKHRTTLTYWVDDNEDAIKKEILAHG 257
PT + PK C +C N G ++ KH + Y V+ + I E+ +G
Sbjct: 199 CEPTY------RTPK--CVKKCVN---GNQLWETSKHYSVKAYTVNSDPQDIMAEVYKNG 247
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHW 316
P F +Y+DF HYKSGVYKH + L H+ KL+GWGT + G YWL+ N W +W
Sbjct: 248 PVEVAFTVYEDFAHYKSGVYKHITGFALGG--HAVKLVGWGTSHEGEDYWLLANQWNTNW 305
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
GD G KI RG EC E + AG P
Sbjct: 306 GDDGYFKIKRGTNECGIENAVTAGLP 331
>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 298
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 96/260 (36%), Positives = 130/260 (50%), Gaps = 20/260 (7%)
Query: 83 TVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P FDAR+++ C IGHV D G C +DR CIKS G+ LS Y
Sbjct: 32 NLPPEFDARQKFNYCRDVIGHVRDQGRCGNCWAVCPTEVLNDRLCIKSSGKIQEILSAGY 91
Query: 142 VASCCKI---CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG------DRTGCQPSTIS 192
V SCC C + K C+ G + +FL G VTG D+ + GC P
Sbjct: 92 VTSCCNPAHGCLH--AKGCNGGRLVEAMSFLRDHGVVTGNDFKPQDQLREADGCWPYPFQ 149
Query: 193 PCSHHGSAPT-LPSCEN---QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
C+H + T P C++ Q VP C T CTN Y + +D HR V ++ +
Sbjct: 150 KCNHVPTEGTGYPKCKDVVQQPVPP--CRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQS 207
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLV 308
IK+EI +GP + F +Y DF +YKSGVY T+ K + LH K+IGWG ++ YWL
Sbjct: 208 IKQEIFDNGPVFSAFEMYKDFRYYKSGVYVPTT--KEVDCLHVIKIIGWGADSVREYWLA 265
Query: 309 INTWGPHWGDRGTVKILRGK 328
+N W WGD G +K+ GK
Sbjct: 266 MNAWNEEWGDHGLIKMAFGK 285
>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 345
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 102/326 (31%), Positives = 152/326 (46%), Gaps = 34/326 (10%)
Query: 24 AYIDQINRE-ANTWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYS 81
+ +D+IN TW AG N A + E+L++ A ++ + P + +
Sbjct: 42 SLVDKINAHPGATWKAGLNDRFAKHTVEHLKKMCGAKMTPANEVE---PSIERVTHKHKN 98
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P FDAR+ W +C TIG + D G C + F AV + +DR CI ++ LS
Sbjct: 99 LDLPTEFDARKHWSHCSTIGDILDQGHCGSCWAFGAVESLTDRFCIHL--NESVSLSEND 156
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGS 199
+ +CC ++ C G R W + + G VT Y D+ GC H G
Sbjct: 157 LLACCG---FECGDGCEGGYPIRAWQYFKRTGVVTSKCDPYFDQKGC--------GHPGC 205
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
PT + KC RC + + KH Y V + + E+ +GP
Sbjct: 206 YPTYDT--------PKCFKRCVDDEL---WVSSKHLGVSAYEVSMEPEELMAELFTNGPI 254
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGD 318
F +++DF HYK+GVYKH + H+ KL+GWGT ++G YW ++N+W +WG+
Sbjct: 255 EVAFDVFEDFAHYKTGVYKHLYGGYIGG--HAVKLVGWGTTDDGVDYWSMVNSWNTNWGE 312
Query: 319 RGTVKILRGKYECAFEYLIAAGKPKN 344
GT +ILRGK EC E AG P N
Sbjct: 313 DGTFRILRGKDECGIESNAVAGLPSN 338
>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
Length = 353
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 117/348 (33%), Positives = 152/348 (43%), Gaps = 50/348 (14%)
Query: 8 LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRN-FPANLSEEYLRQFLIADAKYFDQS 65
L G L I +N+ N WTAG N + AN + E + L
Sbjct: 25 LAGTAKAEHSLGIIQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHIL---------G 75
Query: 66 DRPLP-----GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
+P P G PE +P FDAR QW +C TIG++ D G C A FAAV A
Sbjct: 76 VKPTPPGLLAGVPIKIHPEMD--LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEA 133
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG-- 177
DR CI + LS + +CC +C C+ G W + + G VT
Sbjct: 134 LQDRFCIHL--NMSVSLSVNDLLACCGFLC----GSGCNGGYPISAWRYFRRSGVVTEEC 187
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
Y D+TGCQ H G P P+ KC +C + + ++KH +
Sbjct: 188 DPYFDQTGCQ--------HPGCEPAYPT--------PKCQRKCK--VENQAWKENKHFSV 229
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYD--DFYHYKSGVYKHTSNAKLENYLHSGKLI 295
Y V N I E+ +GP F DF HYKSGVYKH + + H+ KLI
Sbjct: 230 NAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGG--HAVKLI 287
Query: 296 GWGTEN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
GWGT + G YWL+ N W WGD G KI+RG+ EC E + AG P
Sbjct: 288 GWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGENECGIEGDVTAGMP 335
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 106/338 (31%), Positives = 144/338 (42%), Gaps = 83/338 (24%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R + SD ++ +N+ TW AG NF N+ YL++ +
Sbjct: 11 LLVLANARSRPSFHPVSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDR- 120
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
IC +H GS
Sbjct: 121 ----------------------IC-------------------IHVNGS----------- 128
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
+P PC+ G P KC C P Y + QDKH +Y V ++
Sbjct: 129 -RP----PCTGEGDTP-------------KCSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 169
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPY
Sbjct: 170 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 227
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 228 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 265
>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 362
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 97/270 (35%), Positives = 129/270 (47%), Gaps = 30/270 (11%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ S +P FDAR W C +IG + D G C + F AV + SDR CIK N LS
Sbjct: 101 DISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKY--NMNVSLS 158
Query: 139 TEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
+ +CC +C + C+ G W + G VT Y D TGC S
Sbjct: 159 VNDLLACCGFLC----GQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC--------S 206
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G P P+ KC +C + + + + KH Y V + D I E+
Sbjct: 207 HPGCEPAYPT--------PKCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEVYK 256
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
+GP F +Y+DF HYKSGVYKH + + H+ KLIGWGT ++G YWL+ N W
Sbjct: 257 NGPVEVAFTVYEDFAHYKSGVYKHITGTNIGG--HAVKLIGWGTSDDGEDYWLLANQWNR 314
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WGD G KI RG EC E+ + AG P +
Sbjct: 315 SWGDDGYFKIRRGTNECGIEHGVVAGLPSD 344
>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
Cathepsin B
Length = 205
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 111/208 (53%), Gaps = 10/208 (4%)
Query: 137 LSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
+S E + +CC +C C+ G WNF ++G V+GG Y GC+P +I PC
Sbjct: 5 VSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCE 60
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HH + P PK C C P Y + QDKH +Y V ++E I EI
Sbjct: 61 HHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYK 117
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W
Sbjct: 118 NGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTD 175
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E + AG P+
Sbjct: 176 WGDNGFFKILRGQDHCGIESEVVAGIPR 203
>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
Length = 350
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 145/327 (44%), Gaps = 36/327 (11%)
Query: 21 FSDAYIDQINREANT-WTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
+ +++INR W AG N F + ++ R + + P+ +TY
Sbjct: 36 LKEPIVEEINRHPKAGWKAGMNSRFSNHTVGQFKRLLGVLPTPRNLLENVPV----RTYP 91
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+P +FDAR+ WP C ++ + D G C + F AV A SDR CI K N L
Sbjct: 92 K--GLNLPKQFDARKAWPQCTSVRTILDQGHCGSCWAFGAVEALSDRFCIHYK--VNVTL 147
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
S + +CC R D C G W + G VT Y D GCQ
Sbjct: 148 SENDLVACCGF-RCGDG--CDGGYPLSAWQYFISTGVVTAECDPYFDEAGCQ-------- 196
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G P P+ +C +C + G K + Y + I E+
Sbjct: 197 HPGCEPLYPT--------PQCVKQCKDENQNWG--NSKRFSATAYRITSKPYDIMAEVYT 246
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
GP F +Y+DF HYKSGVYK+ + L H+ KLIGWGTENGT YWLV N+W
Sbjct: 247 KGPVEVDFLVYEDFAHYKSGVYKYITGDFLGG--HAVKLIGWGTENGTDYWLVANSWNTA 304
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G KI RG EC+ E + AG P
Sbjct: 305 WGEDGYFKIARGSNECSIEEDVVAGMP 331
>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
Length = 362
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 99/267 (37%), Positives = 127/267 (47%), Gaps = 28/267 (10%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDAR WP C +IG++ D G C + F AV + SDR CI+ N LS
Sbjct: 103 SLKLPKEFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIEFG--MNISLSVN 160
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHG 198
+ +CC R D C G W + G VT Y D TGC SH G
Sbjct: 161 DLLACCGF-RCGDG--CDGGYPIAAWQYFSYSGVVTEECDPYFDDTGC--------SHPG 209
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P P+ KC +C + + + Q KH + TY V N I E+ +GP
Sbjct: 210 CEPAYPT--------PKCMRKCVSGN--QLWSQSKHYSVSTYTVKSNPQDIMAEVYKNGP 259
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWG 317
+F +Y+DF HYKSGVYKH + + + H+ KLIGWG T+ G YWL+ N W WG
Sbjct: 260 VEVSFTVYEDFAHYKSGVYKHITGSNIGG--HAVKLIGWGTTDEGEDYWLLANQWNRSWG 317
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPKN 344
D G I RG EC E AG P +
Sbjct: 318 DDGYFMIRRGTNECGIEDEPVAGLPSS 344
>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
Length = 209
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 111/208 (53%), Gaps = 10/208 (4%)
Query: 137 LSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
+S E + +CC +C C+ G WNF ++G V+GG Y GC+P +I PC
Sbjct: 3 VSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCE 58
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HH + P PK C C P Y + QDKH +Y V ++E I EI
Sbjct: 59 HHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYK 115
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W
Sbjct: 116 NGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTD 173
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E + AG P+
Sbjct: 174 WGDNGFFKILRGQDHCGIESEVVAGIPR 201
>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 96/270 (35%), Positives = 129/270 (47%), Gaps = 30/270 (11%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ S +P FDAR W C ++G + D G C + F AV + SDR CIK N LS
Sbjct: 99 DISLKLPKEFDARTAWSQCTSVGRILDQGHCGSCWAFGAVESLSDRFCIKY--NMNISLS 156
Query: 139 TEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
+ +CC +C + C+ G W + G VT Y D TGC S
Sbjct: 157 VNDLLACCGFLC----GQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC--------S 204
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G P P+ KC +C + + + + KH Y V + D I E+
Sbjct: 205 HPGCEPAYPT--------PKCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEVYK 254
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
+GP F +Y+DF HYKSGVYKH + + H+ KLIGWGT ++G YWL+ N W
Sbjct: 255 NGPVEVAFTVYEDFAHYKSGVYKHITGTNIGG--HAVKLIGWGTSDDGEDYWLLANQWNR 312
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WGD G KI RG EC E+ + AG P +
Sbjct: 313 SWGDDGYFKIRRGTNECGIEHGVVAGLPSD 342
>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
Length = 333
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 102/325 (31%), Positives = 155/325 (47%), Gaps = 27/325 (8%)
Query: 21 FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
+D +++ +NR WTAGR + ++ R+ F ++ LP R+ + E
Sbjct: 32 LTDEFLELVNRLNGGKWTAGRT---SRTKYLTRRGASRLLGTFLRNTSILP-PRQFSEEE 87
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ DRFDA E WP C TI + D +C + AA A SDR C G ++ +S
Sbjct: 88 LRVPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLG-GVRDLRISA 146
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ SCC +C Y C+ G W + G V+ +Y CQP C+HH +
Sbjct: 147 GDLMSCCDVCGY----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVN 195
Query: 200 APTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ L C + P C++ CT+ + K+R +Y + E++ K+E+L +GP
Sbjct: 196 SSDLSPCSGEYDTPT--CNSTCTD----KKIPLIKYRGNTSY-ILSGEESFKRELLLNGP 248
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
+F++Y DF Y GVYKH + L H+ +++GWG NG PYW + N+W WG
Sbjct: 249 FEVSFSVYADFVAYTGGVYKHVTGVFLGG--HAVRIVGWGELNGEPYWKIANSWNHEWGM 306
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G I RG EC E AG P+
Sbjct: 307 NGYFLIARGVDECGIEGSGVAGIPR 331
>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 102/325 (31%), Positives = 155/325 (47%), Gaps = 27/325 (8%)
Query: 21 FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
+D +++ +NR WTAGR + ++ R+ F ++ LP R+ + E
Sbjct: 32 LTDEFLELVNRLNGGKWTAGRT---SRTKHLTRRGASRLLGTFLRNTSILP-PRQFSEEE 87
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ DRFDA E WP C TI + D +C + AA A SDR C G ++ +S
Sbjct: 88 LREPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDLRISA 146
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ SCC +C Y C+ G W + G V+ +Y CQP C+HH +
Sbjct: 147 GDLMSCCDVCGY----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVN 195
Query: 200 APTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ L C + P C++ CT+ + K+R +Y + E++ K+E+L +GP
Sbjct: 196 SSDLSPCSGEYDTPT--CNSTCTD----KKVPLIKYRGNTSYLLS-GEESFKRELLLNGP 248
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
+F++Y DF Y GVYKH + L H+ +++GWG NG PYW + N+W WG
Sbjct: 249 FEVSFSVYADFLAYTGGVYKHVAGTFLGG--HAVRIVGWGELNGEPYWKIANSWNREWGM 306
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G I RG EC E AG P+
Sbjct: 307 NGYFLIARGVDECGIEGSGVAGTPR 331
>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
Length = 343
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 95/310 (30%), Positives = 145/310 (46%), Gaps = 56/310 (18%)
Query: 72 DRKTYDPEYSATVP--DRFDAREQWPNCGTIGHVPDTGACAAPHI--------------- 114
+++ Y S ++P + FDARE+WP C IG + D C+ +
Sbjct: 46 NQQNYTDAKSESLPLEEHFDAREKWPECKYIGFIKDQSTCSCCWVSGDFLYHYDQWKIIL 105
Query: 115 -------------------FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
++ +DR CI KG+Q LS E + SCC C Y
Sbjct: 106 LFDFSSSSSHWLFISTFKAMSSASVMTDRTCIAYKGEQQPFLSDEELTSCCTSCGY---- 161
Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
C+ G + + ++ G TGG YG ++GC+P +I+P + +A P C+ LK
Sbjct: 162 GCNGGFPLLAFKYWNEIGVPTGGPYGSKSGCKPFSIAPPTSSSTAAQTPLCQ------LK 215
Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK---KEILAHGPTTATFALYDDFYHY 272
C + Y R +D++ Y + + +K +EI+ HGP A +++ F +Y
Sbjct: 216 CIS-----DYKRKLDKDRYYGESYYLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLYY 270
Query: 273 KSGVYK-HTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYEC 331
KSGVY + N LH+ KLIGWG + PYWLV+N+W +G++G KI RG EC
Sbjct: 271 KSGVYSANKRNDDPSLGLHAVKLIGWGEQKRIPYWLVVNSWNTTFGEQGLFKIRRGTNEC 330
Query: 332 AFEYL-IAAG 340
E L + AG
Sbjct: 331 GIENLHVTAG 340
>gi|308485822|ref|XP_003105109.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
gi|308257054|gb|EFP01007.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
Length = 410
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 105/356 (29%), Positives = 156/356 (43%), Gaps = 67/356 (18%)
Query: 22 SDAYIDQINREAN-----TWTAGRN----------FPANLSEEYLRQFLIADAKYFDQSD 66
+D Y+ ++ R+ N TW A N F ++ + +++ K+F+
Sbjct: 52 NDEYLRKLVRQVNDSPETTWKAKFNKFGVKNRSYGFKYTRNQTAVEEYMEHIRKFFESD- 110
Query: 67 RPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRC 126
+ + + S+ +P FDAR++WPNC +I +VP+ G C + AA G SDR C
Sbjct: 111 -AMKRHLEELENYKSSDLPKHFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRAC 169
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGC 186
I S G LS E + CC +C +C G + + +G VTGG R GC
Sbjct: 170 IHSNGTFKALLSEEDIIGCCSVC-----GNCYGGDPLKALTYWVNQGLVTGG----RDGC 220
Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW----- 241
+P + P P+ + K C RC N Y + + +DKH T Y
Sbjct: 221 RPYSFDLSC---GVPCSPATFFEAEEKRTCMRRCQNIYYQQKYEEDKHFATFAYSMYPRS 277
Query: 242 -----------------------------VDDNEDAIKKEILAHGPTTATFALYDDFYHY 272
V + + IKKEIL +GPTT F + ++F HY
Sbjct: 278 MTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHY 337
Query: 273 KSGVYKHTSNAKLEN---YLHSGKLIGWGTE-NGTPYWLVINTWGPHWGDRGTVKI 324
SGV++ ++ Y H +LIGWG +G YWL IN++G HWGD G KI
Sbjct: 338 SSGVFRPFPLDGFDDRIVYWHVVRLIGWGESGDGQHYWLAINSFGNHWGDNGLFKI 393
>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
Length = 293
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 97/270 (35%), Positives = 129/270 (47%), Gaps = 30/270 (11%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ S +P FDAR W C +IG + D G C + F AV + SDR CIK N LS
Sbjct: 32 DISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKY--NMNVSLS 89
Query: 139 TEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
+ +CC +C + C+ G W + G VT Y D TGC S
Sbjct: 90 VNDLLACCGFLC----GQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC--------S 137
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G P P+ KC +C + + + + KH Y V + D I E+
Sbjct: 138 HPGCEPAYPT--------PKCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEVYK 187
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
+GP F +Y+DF HYKSGVYKH + + H+ KLIGWGT ++G YWL+ N W
Sbjct: 188 NGPVEVAFTVYEDFAHYKSGVYKHITGTNIGG--HAVKLIGWGTSDDGEDYWLLANQWNR 245
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WGD G KI RG EC E+ + AG P +
Sbjct: 246 SWGDDGYFKIRRGTNECGIEHGVVAGLPSD 275
>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 146/327 (44%), Gaps = 31/327 (9%)
Query: 21 FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD----QSDRPLPGDRKT 75
+ ++D+IN+ W A N ++ ++AK Q R LP R T
Sbjct: 30 LTQTFVDRINQLNGGMWKAVYN-------GKMQNITFSEAKRLTGARIQKSRTLPPARFT 82
Query: 76 YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
+ + +P+ FDA E WP+C TI + D C A + A SDR C G+Q R
Sbjct: 83 -EEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGGGKQLR 141
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
+S + +CCK C C G W + + G + + CQP C
Sbjct: 142 -ISAADLMACCKQC----GDGCKGGFPGFAWLYYVEYGITS-------SQCQPYPFPHCE 189
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G+ C K KC+ CT+ + K+R TY + E+ K+E+
Sbjct: 190 HRGAQGNKTPCSKYKFDTPKCNATCTD----KSIPLVKYRGNATYLLLHGEEDYKRELYF 245
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP A F +Y D + YKSGVY++ L + +++GWG NGTPYW V N+W
Sbjct: 246 NGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG--QAVRIVGWGKLNGTPYWKVANSWDTD 303
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG G + ILRG EC E+L G P
Sbjct: 304 WGMNGYMLILRGNNECNIEHLGFTGFP 330
>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
Extends Along The Whole Active Site Cleft
Length = 205
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 78/178 (43%), Positives = 99/178 (55%), Gaps = 5/178 (2%)
Query: 165 TWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT 224
WNF K+G V+GG Y GC+P +I PC HH + P PK C+ C P
Sbjct: 31 AWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CNKTC-EPG 87
Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
Y + +DKH +Y V +NE I EI +GP F++Y DF YKSGVY+H S
Sbjct: 88 YSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEI 147
Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ H+ +++GWG ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 148 MGG--HAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 203
>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
Length = 347
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 105/320 (32%), Positives = 141/320 (44%), Gaps = 40/320 (12%)
Query: 30 NREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP----EYSATVP 85
N + WTA RN Y + IA K+ +P P + + P S +P
Sbjct: 43 NHPSAGWTASRN-------PYFSNYTIAQFKHI-LGVKPAPQNALSNVPVKTYSRSLELP 94
Query: 86 DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASC 145
FDAR W C TIG++ D G C + F AV DR CI + LS + +C
Sbjct: 95 KEFDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHL--NMSILLSVNDLLAC 152
Query: 146 CKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTL 203
C D C G W + + G VT Y D GC+ H G P
Sbjct: 153 CGFMCGD---GCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCK--------HPGCEPAY 201
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P+ KC +C + + + KH + Y ++ + I E+ +GP F
Sbjct: 202 PT--------PKCEKKCKEQN--QVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAF 251
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRGTV 322
+Y+DF HYKSGVYKH + + H+ KLIGWGT + G YWL+ N W WGD G
Sbjct: 252 TVYEDFAHYKSGVYKHITGGIMGG--HAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYF 309
Query: 323 KILRGKYECAFEYLIAAGKP 342
KI+RGK EC E + AG P
Sbjct: 310 KIIRGKNECGIEEGVVAGMP 329
>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 288
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/254 (35%), Positives = 120/254 (47%), Gaps = 24/254 (9%)
Query: 84 VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P FDAR+++ +C G IGHV D AC ++ G +DR CIKS G LS Y
Sbjct: 37 LPSNFDARQKFASCAGVIGHVRDQSACHNCWTVSSTGMLNDRVCIKSGGTFRDILSVGYF 96
Query: 143 ASCCKICR-YDDNKSCSHGSVFRTWNFLHKRGSVTG------GDYGDRTGCQPSTISPCS 195
SCC K C G++ NFL G VTG G GC P C
Sbjct: 97 TSCCNPANGCPKAKGCQGGNLLEGLNFLKNHGIVTGDEFKPAGQLSSADGCWPYPFPKCK 156
Query: 196 HHG-SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H G S+P C T+CTN Y QD HR + IK+EI
Sbjct: 157 HAGYSSPA-------------CQTKCTNKAYKTSLQQDLHRAKSFGRLPAIPQNIKQEIF 203
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP ++Y+D YK+GVY H + + +H+ K+IGWG E+G YWL +N+W
Sbjct: 204 TNGPVIGMLSIYEDIRVYKAGVYVHQTGS--FQGIHTLKIIGWGVESGQDYWLAVNSWNE 261
Query: 315 HWGDRGTVKILRGK 328
WGD G +K+ G+
Sbjct: 262 EWGDHGMIKLAVGR 275
>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 100/273 (36%), Positives = 131/273 (47%), Gaps = 30/273 (10%)
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
++DP S +P FDAR WP C +IG++ D G C + F AV + SDR CI+ N
Sbjct: 96 SHDP--SLKLPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFG--MN 151
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTIS 192
LS + +CC R D C G W + G VT Y D TGC
Sbjct: 152 ISLSVNDLLACCGF-RCGDG--CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC------ 202
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
SH G P P+ KC +C + + + + KH + TY V N I E
Sbjct: 203 --SHPGCEPAYPT--------PKCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAE 250
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINT 311
+ +GP +F +Y+DF HYKSGVYKH + + + H+ KLIGWGT + G YWL+ N
Sbjct: 251 VYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGG--HAVKLIGWGTSSEGEDYWLMANQ 308
Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
W WGD G I RG EC E AG P +
Sbjct: 309 WNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSS 341
>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 340
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 148/327 (45%), Gaps = 30/327 (9%)
Query: 21 FSDAYIDQINREAN-TWTAGRN---FPANLSEEYLRQFL-IADAKYFDQSDRPLPGDRKT 75
S+ ++ +IN +A WTA + + S+E LR+ + + + S R +
Sbjct: 36 LSNRFVAEINLKAKGQWTASADNGHLVSGKSDEELRKLMGVLNMSTAALSPRIFSAE--- 92
Query: 76 YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
E + +P FD+ ++WP C TI + D C + AAV A SDR C + G +
Sbjct: 93 ---ELAQELPTSFDSSDKWPKCRTISEIRDQSNCGSCWAIAAVEAMSDRYCTVA-GITDL 148
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
+ST ++ SCC +C C G W + G + CQP PC
Sbjct: 149 RVSTGHLLSCCFVC----GMGCQGGIPTMAWLWWVWVGLTS-------EVCQPYPFPPCG 197
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HH P+C + C++ C + KH+ +Y + E E++
Sbjct: 198 HHTDGGKYPACPSTIYDTPTCNSTCADSHTAL----TKHKGEKSYSLR-GEREYMIELMT 252
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP F +Y DF YKSGVY HT+ +L H+ KL+GWG +NGTPYW + N+W
Sbjct: 253 YGPFEVAFDVYADFVSYKSGVYSHTTGERLGG--HAVKLVGWGVQNGTPYWKIANSWNSD 310
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WGD G I RG EC E AG P
Sbjct: 311 WGDNGYFLIRRGTDECGIESTGVAGLP 337
>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
Length = 342
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 107/328 (32%), Positives = 150/328 (45%), Gaps = 44/328 (13%)
Query: 24 AYIDQINREANT-WTAGRNFPANLSEEYLRQF-----LIADAKYFDQSDRPLPGDRKTYD 77
+ +D +N + N W AG F +R F ++ + Q RPL +T D
Sbjct: 41 SIVDIVNNDPNAGWKAG--FNERFINHTVRDFKRLCGVLPKSSEEVQPLRPLRSHPRTLD 98
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
+P FDARE WP C +I ++ D G C + F AV A +DR CI + +N L
Sbjct: 99 ------LPKHFDAREAWPQCSSIKNILDQGHCGSCWAFGAVEALTDRFCILN--NENVSL 150
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
S + +CC C + C G + W + + G VT Y D GC+
Sbjct: 151 SENDLVACCSSCGF----GCDGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKH------- 199
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
P CE + + C +C + R KH T TY V+ + I+ EI
Sbjct: 200 --------PGCEPEYDTPV-CVKQCVDNEQWR---DSKHFTVQTYAVNSDIYDIQAEIYK 247
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGP 314
+GP ++ +Y+DF HYKSGVYKH L H+ K IGWG T++G YW+V N+W
Sbjct: 248 NGPVEVSYTVYEDFAHYKSGVYKHVFGEVLGG--HAVKFIGWGTTDDGKDYWIVANSWNR 305
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G +I RG EC E AG P
Sbjct: 306 SWGEDGFFQISRGSNECGIESEPVAGIP 333
>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 163/348 (46%), Gaps = 31/348 (8%)
Query: 2 IHILVFLL----GCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLI 56
I + +FLL G + + +D +++ +NR WTAGR + ++ R+
Sbjct: 9 IALFLFLLYATAGHSFHAEDAPILTDEFLEHVNRLNGGKWTAGRT---SRTKHLTRRGAS 65
Query: 57 ADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
F ++ LP R+ + E + DRFDA E WP C T+ + D +C + A
Sbjct: 66 RMLGTFLRNTSILP-PRQFSEEELRVPLQDRFDAGEAWPECPTVTEIRDQSSCGSCWAVA 124
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
A A SDR C G ++ +S + SCC +C + C+ G W + G V+
Sbjct: 125 AASAISDRYCTLG-GVRDLRISAGDLMSCCDVCGF----GCNGGYPEVAWEYYAVHGIVS 179
Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDKHR 235
+Y CQP C+HH ++ L C + P C++ CT+ + K+R
Sbjct: 180 --EY-----CQPYPFPSCAHHVNSSDLSPCSGEYDTPT--CNSTCTD----KKIPLIKYR 226
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
+Y V E+ K+E++ +GP +F++Y DF Y GVYKH + L H+ +++
Sbjct: 227 GNTSY-VLSGEEPFKRELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGG--HAVRIV 283
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
GWG NG PYW + N+W WG G I RG EC E AG P+
Sbjct: 284 GWGELNGEPYWKIANSWNREWGMNGYFLIARGVDECGIEGSGVAGTPR 331
>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 451
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 97/269 (36%), Positives = 132/269 (49%), Gaps = 32/269 (11%)
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
D + +P FDAR++W + I + D G CA+ F+ VG SDR I+S G+
Sbjct: 172 DIKMKKKIPKSFDARDKWGS--MITGILDQGNCASSWAFSTVGVASDRLAIQSSGETGMT 229
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
LS +++ SC + CS G + R W F+ KRG V+ Y +G Q C
Sbjct: 230 LSPQHLLSC----NTRGQRGCSGGHIDRAWWFMRKRGVVSNDCYPYTSGDQDKK-GVCMM 284
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
G P+ C T GR + H +T Y + NE I+ EI+ +
Sbjct: 285 PGKLPS------------DCPT-------GRERNNELHHSTPPYRIAANEREIQVEIMEN 325
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAK--LENY----LHSGKLIGWGTENGTPYWLVIN 310
GP A+F + +DF+ Y SGVY+HT A E Y HS KL+GWG ENG YWL N
Sbjct: 326 GPVQASFEVKEDFFMYGSGVYRHTPIASNDAEQYHASEWHSVKLLGWGVENGIKYWLGAN 385
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAA 339
+WG WG+ G KILRG+ EC E + A
Sbjct: 386 SWGTKWGEDGYFKILRGENECNIESYVVA 414
>gi|294891889|ref|XP_002773789.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878993|gb|EER05605.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 422
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 111/333 (33%), Positives = 152/333 (45%), Gaps = 35/333 (10%)
Query: 24 AYIDQINREANTWTAGRNFP--ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE-- 79
+ +D+IN +WTA ++ P +S + L D + D G+ + P
Sbjct: 83 SMVDKINSMQQSWTASKDQPPFKGMSIKDLPAGCSNDTMFSSTLDEG--GENRLLGPTNP 140
Query: 80 YSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
T+P FDAR+++ +C IGHV + G C AAVG F+DR CIKS G+ LS
Sbjct: 141 VLTTLPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILS 200
Query: 139 TEYVASCCKICR-YDDNKSCSHGSVFRTWNFLHKRGSVTG-----------GDY------ 180
Y+ SCC + C GSV NF+ G VTG G+Y
Sbjct: 201 LGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGRNFRFESFKLSGEYKPPEEL 260
Query: 181 GDRTGCQPSTISPCSH-HGSAPTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
G+ GC P C+H G P C + + +P C T C N YG +D HR
Sbjct: 261 GNDDGCWPYPFPKCNHVPGLESKYPRCAQVRDLPA--CATTCPNKAYGTSMQKDTHRAKS 318
Query: 239 TYWVDDNEDAIKKEILAHGP---TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
+ + IK+EI +GP A LY+DF + VY H + L H+ KLI
Sbjct: 319 WGRLPIGPEKIKQEIFDNGPLRXXAAMMTLYEDF-DLQVCVYVHKTGQMLA--AHTLKLI 375
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
GWG E+G YWL +N W WGD G +K+ GK
Sbjct: 376 GWGVESGQEYWLAVNAWNEEWGDHGMIKLAVGK 408
>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
Length = 331
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 107/331 (32%), Positives = 149/331 (45%), Gaps = 44/331 (13%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQF-----LIADAKYFDQSDRPLPGDRK 74
+ +D +N + N W AG F +R F ++ + Q RPL +
Sbjct: 27 LQKSIVDIVNNDPNAGWKAG--FNERFINHTVRDFKRLCGVLPKSSEEVQPLRPLRSHPR 84
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
T D +P FDARE WP C +I + D G C + F AV A +DR CI + +N
Sbjct: 85 TLD------LPKHFDAREAWPQCASIKTILDQGHCGSCWAFGAVEALTDRFCILN--NEN 136
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTIS 192
LS + +CC C + C G + W + + G VT Y D GC+
Sbjct: 137 VSLSENDLVACCSSCGF----GCEGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKH---- 188
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
P CE + + C +C + R KH T TY V+ + I+ E
Sbjct: 189 -----------PGCEPEYDTPV-CVKQCVDNEQWR---DSKHFTVQTYAVNSDIYDIQAE 233
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINT 311
I +GP ++ +Y+DF HYKSGVYKH L H+ K IGWG T++G YW+V N+
Sbjct: 234 IYKNGPVEVSYTVYEDFAHYKSGVYKHVFGQVLGG--HAVKFIGWGTTDDGKDYWIVANS 291
Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
W WG+ G +I RG EC E AG P
Sbjct: 292 WNRSWGEDGFFQISRGSNECGIESEPVAGIP 322
>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
Length = 347
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 104/320 (32%), Positives = 141/320 (44%), Gaps = 40/320 (12%)
Query: 30 NREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP----EYSATVP 85
N + WTA RN Y + IA K+ +P P + + P S +P
Sbjct: 43 NHPSAGWTASRN-------PYFSNYTIAQFKHI-LGVKPAPQNALSNVPVKTYSRSLELP 94
Query: 86 DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASC 145
FDAR W C TIG++ + G C + F AV DR CI + LS + +C
Sbjct: 95 KEFDARSAWSRCSTIGNILEQGHCGSCWAFGAVECLQDRFCIHL--NMSILLSVNDLLAC 152
Query: 146 CKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTL 203
C D C G W + + G VT Y D GC+ H G P
Sbjct: 153 CGFMCGD---GCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCK--------HPGCEPAY 201
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P+ KC +C + + + KH + Y ++ + I E+ +GP F
Sbjct: 202 PT--------PKCEKKCKEQN--QVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAF 251
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRGTV 322
+Y+DF HYKSGVYKH + + H+ KLIGWGT + G YWL+ N W WGD G
Sbjct: 252 TVYEDFAHYKSGVYKHITGGIMGG--HAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYF 309
Query: 323 KILRGKYECAFEYLIAAGKP 342
KI+RGK EC E + AG P
Sbjct: 310 KIIRGKNECGIEEGVVAGMP 329
>gi|161343829|tpg|DAA06095.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 280
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 81/227 (35%), Positives = 113/227 (49%), Gaps = 9/227 (3%)
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
Y+ +P FDAR++WPNC +IGH+ + G C + + + A +DR CI S +N +S
Sbjct: 59 YTNGLPINFDARKRWPNCPSIGHIYNQGNCRSSYAISVASAVTDRICIHSNETKNPIMSA 118
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SCC +C Y C GS F +W+F + G V+GGDY GCQP I PC
Sbjct: 119 QQIISCCYLCGY----GCDGGSQFESWDFYRRHGFVSGGDYNSNQGCQPYMIPPCKLINE 174
Query: 200 APTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
SC + C +C NP Y F D ++ + KEI +GP
Sbjct: 175 KSPRHSCTTYNREETPACEIKCNNPNYYSSFKTDIYKGK---YYQVYPFMAMKEIFDNGP 231
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG-KLIGWGTENGTP 304
T F +Y D YKSGVY++ + + G K+IGWG ENG P
Sbjct: 232 ITTQFYMYRDLIDYKSGVYQYDEGFYGDFFTVQGXKIIGWGEENGDP 278
>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
Length = 195
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 77/187 (41%), Positives = 102/187 (54%), Gaps = 5/187 (2%)
Query: 157 CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKC 216
C+ G WNF ++G V+GG Y GC+P +I PC HH + P PK C
Sbjct: 6 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--C 63
Query: 217 HTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV 276
C P Y + QDKH +Y V ++E I EI +GP F++Y DF YKSGV
Sbjct: 64 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAFSVYSDFLLYKSGV 122
Query: 277 YKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYL 336
Y+H + + H+ +++GWG ENGTPYWLV N+W WGD G KILRG+ C E
Sbjct: 123 YQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESE 180
Query: 337 IAAGKPK 343
+ AG P+
Sbjct: 181 VVAGIPR 187
>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/273 (36%), Positives = 131/273 (47%), Gaps = 30/273 (10%)
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
++DP S +P FDAR WP C +IG + D G C + F AV + SDR CI+ N
Sbjct: 96 SHDP--SLKLPKAFDARTAWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFG--MN 151
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTIS 192
LS + +CC R D C G W + G VT Y D TGC
Sbjct: 152 ISLSVNDLLACCGF-RCGD--GCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC------ 202
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
SH G P P+ +C +C + + + + KH + TY V+ + I E
Sbjct: 203 --SHPGCEPAYPT--------PRCLRKCVSDN--KLWSESKHYSVSTYTVNSSPQDIMAE 250
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINT 311
+ +GP +F +Y+DF HYKSGVYKH + + + H+ KLIGWGT N G YWL+ N
Sbjct: 251 VYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGG--HAVKLIGWGTSNEGEDYWLMANQ 308
Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
W WGD G I RG EC E AG P +
Sbjct: 309 WNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSS 341
>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 95/265 (35%), Positives = 131/265 (49%), Gaps = 35/265 (13%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR WP C +I + D G C + F AV + +DR CI N LS +
Sbjct: 96 LPKTFDARTAWPQCLSIADILDQGHCGSCWAFGAVESLTDRFCIHYG--TNVTLSVNDLL 153
Query: 144 SCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSA 200
+CC +C + C G W + + G VT Y D+TGC SH G
Sbjct: 154 ACCGFLC----GEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC--------SHPGCE 201
Query: 201 PT--LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P P+CE + V K + + KH + Y V+ ++ +I E+ +GP
Sbjct: 202 PAYPTPACEKKCVKK------------NLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGP 249
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWG 317
+F +Y+DF HYKSGVYKH + +++ H+ KLIGWGT E+G YWL+ N W WG
Sbjct: 250 AEVSFTVYEDFAHYKSGVYKHVTGSEMGG--HAVKLIGWGTSEDGEDYWLLANQWNRSWG 307
Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
D G KI+RG EC E + AG P
Sbjct: 308 DDGYFKIIRGTNECGIED-VTAGMP 331
>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
Length = 209
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 82/207 (39%), Positives = 107/207 (51%), Gaps = 10/207 (4%)
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
+S + +CC+ C C+ G W G VTGG Y + GCQP I+ C H
Sbjct: 12 VSANELLACCESC----GDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAACDH 67
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
H P + K P+ C +C Y F DKH +Y V D I +E++
Sbjct: 68 HVVGKLKPCKGDGKTPR--CEKKC-EAGYNVTFKDDKHYGQRSYSVSSVND-IMEELVTR 123
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
GP A F +Y DF Y SGVY+HT+ + L H+ K++G+G ENG YWLV N+W P W
Sbjct: 124 GPVEAAFTVYSDFLQYHSGVYRHTTGSALGG--HAVKILGYGVENGDKYWLVANSWNPDW 181
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPK 343
GD+G KILRG EC E I AG+PK
Sbjct: 182 GDQGFFKILRGVDECGIEGQIVAGEPK 208
>gi|157058743|gb|ABV03129.1| cathepsin B-2744 [Pterocomma populeum]
Length = 244
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 115/232 (49%), Gaps = 7/232 (3%)
Query: 72 DRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
DRKT D Y VP FDAR + +C IG V D G CA+ A F+DR CI +
Sbjct: 13 DRKTVDANYRTDVPKEFDARRHFVSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIATG 72
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
G+ LS + + SC ++ C GS F+ W F G VTGG++ GCQP
Sbjct: 73 GKFTDNLSAQNLMSCGDSEKF---VGCHGGSAFKAWEFTMGNGIVTGGNFNSNEGCQPYK 129
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAI 249
PC H+G + ++ C +C N Y + D H+T++ Y N I
Sbjct: 130 NRPCDHYGDSSMTNCSSFRRTQMSICREKCVNKNYKVKYEDDLHKTSVVYMTSWTNVTQI 189
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
++EI+ +GP TA +Y++F YK G+YK T L Y H KLIGWG ++
Sbjct: 190 QQEIMTYGPVTALMYVYENFMGYKEGIYKSTV-GDLVGYHHV-KLIGWGVDD 239
>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
Length = 330
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 75/233 (32%), Positives = 124/233 (53%), Gaps = 8/233 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FD+RE W NC +I ++ D + +A SDR C++SKG+ + +S +
Sbjct: 95 IPESFDSREVWKNCSSITYIRDQSNSGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
+CC + + C+ G + W ++ + G VTGG Y ++ C+P + PC G +
Sbjct: 155 ACCG---RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYHLHPCEITGKFWSC 211
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P + + P C C YG+ + +DK Y +D++E AI++E++ +GP A F
Sbjct: 212 PRDHSFRTPA--CKKYC-QYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAF 268
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
Y+DF Y+ G+Y H+ + H+ K++GWG ENGT YW V N+W W
Sbjct: 269 TTYEDFSFYRKGIYVHSYGR--QRGAHAVKVVGWGVENGTKYWNVANSWSTDW 319
>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
E=1.3e-79, N=1) [Arabidopsis thaliana]
gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 99/273 (36%), Positives = 130/273 (47%), Gaps = 30/273 (10%)
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
++DP S +P FDAR WP C +IG++ G C + F AV + SDR CI+ N
Sbjct: 96 SHDP--SLKLPKAFDARTAWPQCTSIGNILGLGHCGSCWAFGAVESLSDRFCIQFG--MN 151
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTIS 192
LS + +CC R D C G W + G VT Y D TGC
Sbjct: 152 ISLSVNDLLACCGF-RCGDG--CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC------ 202
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
SH G P P+ KC +C + + + + KH + TY V N I E
Sbjct: 203 --SHPGCEPAYPT--------PKCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAE 250
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINT 311
+ +GP +F +Y+DF HYKSGVYKH + + + H+ KLIGWGT + G YWL+ N
Sbjct: 251 VYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGG--HAVKLIGWGTSSEGEDYWLMANQ 308
Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
W WGD G I RG EC E AG P +
Sbjct: 309 WNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSS 341
>gi|157058733|gb|ABV03124.1| cathepsin B-16a [Acyrthosiphon pisum]
Length = 274
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 137/284 (48%), Gaps = 13/284 (4%)
Query: 2 IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIADAK 60
I +L +L + Y ++YI+ IN A TWTAG NF P+ +++++ +
Sbjct: 1 IILLSVVLFSVYQTEQAYFLEESYIEMINDVATTWTAGVNFDPSTPEKDFIKMLGSKGVE 60
Query: 61 YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
+ + + + +P FDAR +W +C TIG V D G C + A A
Sbjct: 61 AAKNASAHMFKTHDVANDNNNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSA 120
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
F+DR C+ + G N LS E + CC C + C+ G + W + G VTGG+Y
Sbjct: 121 FADRLCVATNGDFNELLSAEEITFCCHTCGF----GCNGGYPIKAWKYFSSHGIVTGGNY 176
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTL 238
GC+P + PC + SC + + K + RCT YG + + HR T
Sbjct: 177 KSGEGCEPYRVPPCPQDEEGKS--SCAGKPIEK---NHRCTRMCYGNQDLDYNEDHRFTR 231
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
Y+ +I+K+++ +GP A+F +YDDF YKSGVY+ T N
Sbjct: 232 DYYY-LTYGSIQKDVMNYGPIEASFDVYDDFPSYKSGVYQRTPN 274
>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 403
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 101/308 (32%), Positives = 137/308 (44%), Gaps = 34/308 (11%)
Query: 38 AGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNC 97
G N P + ++ + + +D P+ KTY S +P FDAR W C
Sbjct: 107 GGLNNPPVQTAQFKHILGVKPTPHSVLNDVPV----KTY--PRSLMLPKEFDARSAWSQC 160
Query: 98 GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSC 157
TIG + D G C + F AV DR CI N LS + +CC D C
Sbjct: 161 NTIGTILDQGHCGSCWAFGAVECLQDRFCIHF--NMNISLSVNDLVACCGFMCGD---GC 215
Query: 158 SHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
G W + + G VT Y D+ GC+ H G P P+ +K K
Sbjct: 216 DGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK--------HPGCEPAYPTPVCEK----K 263
Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
C + + + + KH + Y V+ + I E+ +GP F +Y+DF HYKSG
Sbjct: 264 CKVQ------NQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSG 317
Query: 276 VYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
VYKH + + H+ KLIGWGT + G YWL+ N W WGD G KI+RG EC E
Sbjct: 318 VYKHITGGMMGG--HAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIE 375
Query: 335 YLIAAGKP 342
+ AG P
Sbjct: 376 EDVVAGMP 383
>gi|157058735|gb|ABV03125.1| cathepsin B-16 [Aulacorthum solani]
Length = 246
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/256 (35%), Positives = 129/256 (50%), Gaps = 17/256 (6%)
Query: 19 YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
Y ++YID IN A TWTAG NF + EE+ + L +K + + + + KT D
Sbjct: 2 YFLEESYIDMINEVATTWTAGVNFDPSTPEEHFVKML--GSKGVESAKQASAHEFKTNDV 59
Query: 79 EYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
Y +P FDAR++W +C TIG V D G C + F AF+DR C+ + G N
Sbjct: 60 AYDNYYGYIPRTFDARKRWRHCKTIGEVRDQGNCGSCWAFGTSSAFADRLCVATDGDFNE 119
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
LS E +A CC C + C G + W + G VTGG+Y GC+P + PC
Sbjct: 120 LLSPEEIAFCCHTCGF----GCHGGYPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCQ 175
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTLTYWVDDNEDAIKKEI 253
HH SC ++ + K + RCT YG + D HR T Y+ +I+K++
Sbjct: 176 HHHQGNN--SCSDKPMEK---NHRCTRMCYGDQDLDYNDDHRFTRDYYY-LTYGSIQKDV 229
Query: 254 LAHGPTTATFALYDDF 269
+ +GP A+F +YDDF
Sbjct: 230 MNYGPIEASFDVYDDF 245
>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
Length = 294
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/205 (37%), Positives = 113/205 (55%), Gaps = 6/205 (2%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S GQQ+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCC+ C C G W++ KRG VTGG + TGCQP C HH
Sbjct: 145 ALDLISCCEDC----GDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C +C Y + QDKH +Y V NE AI+KEI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQKCQK-GYKTPYEQDKHYGEESYNVISNEKAIQKEIMMNGP 258
Query: 259 TTATFALYDDFYHYKSGVYKHTSNA 283
A F +Y+DF +YKSG+Y+H + +
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGS 283
>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
Length = 342
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 85/260 (32%), Positives = 132/260 (50%), Gaps = 8/260 (3%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ +P +FD+R++WP+C +I + D C + F AV A +DR CI+S G Q+ LS
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSAL 146
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SC D C G + W+ R S + TGCQP C H +
Sbjct: 147 DLISC----CEDCGGGCKGGFPGQAWDMGKTRDSHWRFRKKNHTGCQPYPFPKCEHL-TK 201
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P+C + +C C Y F QDK + V +NE +++I+ +GP
Sbjct: 202 GKYPACGTKIYKTPQCKQTCQK-GYKTPFEQDKPFGEGSSNVQNNEKVFQRDIMMYGPVE 260
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A F +Y+DF + KSG+ +H + + + H ++IGWG E G PYWL+ N+W WG+ G
Sbjct: 261 AAFDVYEDFLNSKSGISRHVTGSIVGG--HPIRIIGWGVEKGNPYWLIANSWNEDWGENG 318
Query: 321 TVKILRGKYECAFEYLIAAG 340
+++RG+ EC+ E + AG
Sbjct: 319 LFRMVRGRDECSIESHVVAG 338
>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 150/327 (45%), Gaps = 36/327 (11%)
Query: 22 SDAYIDQINRE-ANTWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
A +D++N TWTAG N A + E+L++ A +++ P
Sbjct: 42 QQALVDKVNAHPGATWTAGFNERFAKHTIEHLKKMCGA---ILTPANKLEPSIETISHKH 98
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P FDAR+QW +C TIG + G C + F AV + +DR CI ++ LS
Sbjct: 99 KKLYLPKEFDARKQWSHCPTIGDILGQGHCGSCWAFGAVESLTDRFCIHL--NESVSLSE 156
Query: 140 EYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSH 196
+ +CC C Y C G R W + G VT Y D+ GC +H
Sbjct: 157 NDLLACCGFECGY----GCEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGC--------AH 204
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
G PT + KC +C + + + Q KH Y + + + E+ +
Sbjct: 205 PGCYPTYET--------PKCEKQCVDDEF---WVQSKHLGVNAYEMSMEPEDLMAELYTN 253
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPH 315
GP F +Y+DF HYK+GVYKH + H+ KLIGWGT ++G YW ++N+W +
Sbjct: 254 GPVEVAFEVYEDFAHYKTGVYKHLFGGFMGG--HAVKLIGWGTTDDGVDYWTIVNSWNTN 311
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G +I+RG EC E AG P
Sbjct: 312 WGEDGLFRIVRGNDECGIESNAVAGLP 338
>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
Length = 313
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 89/260 (34%), Positives = 126/260 (48%), Gaps = 12/260 (4%)
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P FDAR+QWP C ++ + G C + + A +DR CI SKG++ +
Sbjct: 61 VLPKSFDARQQWPQCSSLNEIRTQGCCGSCAYVSGASAMTDRWCIHSKGKKQFTFGAFDL 120
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C G + W++ K+G +GG YG GC P + P S
Sbjct: 121 LSCCYECGGGCTGGGIPGPI---WSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPSEGD 177
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P N C TRC +D+ + Y + +E I ++I +GP A
Sbjct: 178 YPDEPN-------CSTRCNAGYNVTEDLRDRRFGRVAYSIPADERKIMEDIFVNGPVQAV 230
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F Y+D +Y GVY+H S +L+ H+ KLIGWG E+GT YWLV N+WG WGD G
Sbjct: 231 FQWYEDIVNYSGGVYRHQS-GRLKGG-HAVKLIGWGVEDGTKYWLVANSWGRVWGDDGFF 288
Query: 323 KILRGKYECAFEYLIAAGKP 342
K++RG+ C E + AG P
Sbjct: 289 KMVRGENHCGIEENVHAGLP 308
>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
Length = 343
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 98/271 (36%), Positives = 127/271 (46%), Gaps = 32/271 (11%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ S +P FDAR WP C +IG + D G C + F AV + SDR CI+ N LS
Sbjct: 99 DQSLKLPKSFDARTHWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFG--MNITLS 156
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSH 196
+ +CC R D C G W + G VT Y D+TGC SH
Sbjct: 157 VNDLLACCGF-RCGD--GCDGGYPISAWQYFSYSGVVTEECDPYFDQTGC--------SH 205
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTLTYWVDDNEDAIKKEIL 254
G P + +C GR + + KH + TY V+ N I EI
Sbjct: 206 PGCEPAYNT------------PQCLRKCVGRNQLWSESKHYSINTYVVESNPQDIMAEIY 253
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWG 313
+GP +F +Y+DF HYKSGVYKH + + + H+ KLIGWG T++G YWL+ N W
Sbjct: 254 KNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGG--HAVKLIGWGTTDDGEDYWLLANQWN 311
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WGD G I RG EC E AG P +
Sbjct: 312 RSWGDDGYFMIRRGTNECGIEDEPVAGLPSS 342
>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
Length = 288
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 86/264 (32%), Positives = 131/264 (49%), Gaps = 25/264 (9%)
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P FDAR++WP C ++ + G+C + + + +DR CI S G++ +
Sbjct: 46 ALPASFDARQKWPYCPSLNQIRSQGSCGSCYAVSTAAVITDRYCIHSGGERQFYFGSTGY 105
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C C G V +T+++ K G +GG Y GC+P P
Sbjct: 106 LSCCTDCY-----KCDGGYVHKTFDYWVKYGLTSGGPYHSGQGCKP-----------YPF 149
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY---WVDDNEDAIKKEILAHGPT 259
+ ++ + LKC +C Y + QD +Y W D+N A+K EI +GP
Sbjct: 150 GGATQDVNIV-LKCDRQC-QAGYPLTYSQDLKHGASSYILPWGDEN--AMKAEIYQNGPI 205
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
+F +Y DF+ Y+SGVY+H + A + H+ ++IGWG ENG YWL N+W WG+
Sbjct: 206 VTSFDVYGDFFQYRSGVYRHVTGAYKGS--HAVRVIGWGVENGVKYWLCANSWNERWGEN 263
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G KI+RG+ E + AG PK
Sbjct: 264 GFFKIVRGENHVGVEDISYAGLPK 287
>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 130/265 (49%), Gaps = 35/265 (13%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR WP C +I + D G C + F AV + +DR CI N LS +
Sbjct: 96 LPKTFDARTAWPQCLSIADILDQGHCGSCWAFGAVESLTDRFCIHYG--TNVTLSVNDLL 153
Query: 144 SCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSA 200
+CC +C + C G W + + G VT Y D+TGC SH G
Sbjct: 154 ACCGFLC----GEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC--------SHPGCE 201
Query: 201 PT--LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P P+CE + V K + + KH + Y V+ ++ +I E+ +GP
Sbjct: 202 PAYPTPACEKKCVKK------------NLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGP 249
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWG 317
+F +Y+DF HYKSGVYKH + +++ H+ KLIGWGT E+G YWL+ N W WG
Sbjct: 250 AEVSFTVYEDFAHYKSGVYKHVTGSEMGG--HAVKLIGWGTSEDGEDYWLLANQWNRSWG 307
Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
G KI+RG EC E + AG P
Sbjct: 308 GDGYFKIIRGTNECGIED-VTAGTP 331
>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 348
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/265 (36%), Positives = 125/265 (47%), Gaps = 28/265 (10%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDARE WP C +IG + D G C + F AV + SDR CI N LS
Sbjct: 98 SLKLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHF--DMNITLSVN 155
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHG 198
+ +CC D C G W + + G VT Y D TG CSH G
Sbjct: 156 DLLACCGFMCGD---GCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG--------CSHPG 204
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P P+ +C C + + + + KH Y V + + I E+ +GP
Sbjct: 205 CEPAYPT--------PRCVRHCVDKN--QIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGP 254
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWG 317
+F +Y+DF HYKSGVYKH + + H+ KLIGWG T++G YWL+ N W WG
Sbjct: 255 VEVSFTVYEDFAHYKSGVYKHITGDVMGG--HAVKLIGWGTTDDGEDYWLLANQWNRGWG 312
Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
D G KI RG EC E + AG P
Sbjct: 313 DDGYFKIRRGTNECGIEEDVVAGLP 337
>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 349
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/265 (36%), Positives = 125/265 (47%), Gaps = 28/265 (10%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDARE WP C +IG + D G C + F AV + SDR CI N LS
Sbjct: 99 SLKLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHF--DMNITLSVN 156
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHG 198
+ +CC D C G W + + G VT Y D TG CSH G
Sbjct: 157 DLLACCGFMCGD---GCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG--------CSHPG 205
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P P+ +C C + + + + KH Y V + + I E+ +GP
Sbjct: 206 CEPAYPT--------PRCVRHCVDKN--QIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGP 255
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWG 317
+F +Y+DF HYKSGVYKH + + H+ KLIGWG T++G YWL+ N W WG
Sbjct: 256 VEVSFTVYEDFAHYKSGVYKHITGDVMGG--HAVKLIGWGTTDDGEDYWLLANQWNRGWG 313
Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
D G KI RG EC E + AG P
Sbjct: 314 DDGYFKIRRGTNECGIEEDVVAGLP 338
>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 192
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 100/188 (53%), Gaps = 5/188 (2%)
Query: 157 CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKC 216
C+ G W F VTGG YG GCQP PC HH P LP+C K P +C
Sbjct: 7 CNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPPCEHHTVGP-LPNCTGIK-PTPEC 64
Query: 217 HTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV 276
C Y + + +DKH Y + +E IK EI +GP A F++Y DF YKSGV
Sbjct: 65 AKTCRE-GYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADFPSYKSGV 123
Query: 277 YKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYL 336
Y+ S L H+ +++GWGTE+G PYWLV N+W WGD+G KI RG EC E
Sbjct: 124 YQRHSEEMLGG--HAIRILGWGTEDGVPYWLVANSWNEDWGDKGYFKIRRGNDECGIEDD 181
Query: 337 IAAGKPKN 344
I AG PK
Sbjct: 182 INAGIPKE 189
>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 356
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 98/266 (36%), Positives = 125/266 (46%), Gaps = 30/266 (11%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDAR W C TIG + D G C + F AV + SDR CI N LS
Sbjct: 97 SLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHF--DVNISLSVN 154
Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
+ +CC +C C G W +L G VT Y D+ GC SH
Sbjct: 155 DLLACCGFLC----GSGCDGGYPLYAWQYLAHHGVVTEECDPYFDQIGC--------SHP 202
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
G P + PK C +C + + + + KH + Y V + I E+ +G
Sbjct: 203 GCEPAY------RTPK--CVKKCVSGN--QVWKKSKHYSVNAYRVSSDPHDIMTEVYKNG 252
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHW 316
P F +Y+DF HYKSGVYKH + +L H+ KLIGWGT E+G YWL+ N W W
Sbjct: 253 PVEVAFTVYEDFAHYKSGVYKHITGYELGG--HAVKLIGWGTTEDGEDYWLLANQWNREW 310
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
GD G KI RG EC E + AG P
Sbjct: 311 GDDGYFKIRRGTNECGIEEDVTAGLP 336
>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
Length = 352
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 92/271 (33%), Positives = 135/271 (49%), Gaps = 25/271 (9%)
Query: 71 GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
GD D + A VP F++ +QW NC I + + C + F AV + SDR CI K
Sbjct: 58 GDVPVVDYAFQA-VPANFNSAQQWSNCSYISAIQNQARCGSCWAFGAVESVSDRFCIH-K 115
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
G+ + LS + + +C + + C G + F+ K+G V+ C P T
Sbjct: 116 GE-DVLLSFQDLVTCDQ-----SDNGCQGGDAYTAMKFIQKKGIVS-------NDCLPYT 162
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
I C AP C N V +C +C+N +Y + QD H Y ++ +AI+
Sbjct: 163 IPTC-----APAQQPCLN-FVDTPQCVEKCSNASYT--YAQDLHFIDGVYSMNPTVNAIQ 214
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
+EI+ +GP A F +Y+DF YKSGVY+HT+ L H K+IGWGT+N YW+ N
Sbjct: 215 QEIMTNGPVEACFEVYEDFLGYKSGVYQHTTGKDLGG--HCVKMIGWGTQNNELYWICNN 272
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
+W +WG++G I G EC E + A K
Sbjct: 273 SWTTYWGNQGVFWIKAGVNECGIESDVVAAK 303
>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 306
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 145/338 (42%), Gaps = 43/338 (12%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFP--ANLSEEYLRQFLIADAKYF 62
++ LG V + IDQIN WTAG N P A + E ++ L
Sbjct: 6 VIAFLGLVAVASAEFILQQEMIDQINNANVGWTAGVN-PRFAGKTREDIKGLLGTKLLPK 64
Query: 63 DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
R P D +P FDAR QWP +I + D C + F A A S
Sbjct: 65 GTKLREFPVVDTIVD-----AIPTSFDARTQWP--ASIHPIRDQQQCGSCWAFGATEALS 117
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR I S N LS + + SC + C G W+++ G VT
Sbjct: 118 DRLAIASNNSINVVLSPQDLVSCDST-----DYGCDGGYPINAWHYMQSLGVVT------ 166
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
C P T S +G + T +K P C T+ ++ Y V
Sbjct: 167 -DTCYPYT----SGNGDSGTC-QITGKKTPA------CATATF--------YKAKTAYQV 206
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
+N AI+ EILA+GP A F++YDDF+ Y SGVY H S A + H+ K++GWG +
Sbjct: 207 ANNMAAIQSEILANGPVEAAFSVYDDFFSYTSGVYSHQSGAL--DGGHAVKIVGWGVDGT 264
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
TPYW+V N+WG WG G I RG EC E I AG
Sbjct: 265 TPYWIVANSWGTSWGQAGFFWIKRGNDECGIEDGIVAG 302
>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
Length = 206
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 81/214 (37%), Positives = 115/214 (53%), Gaps = 8/214 (3%)
Query: 90 AREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKIC 149
+REQWP+C TI + D G+C + F AV A SDR CI S+G+ N +S E + SCCK+
Sbjct: 1 SREQWPDCPTIKEIRDQGSCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKL- 59
Query: 150 RYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQ 209
+ C+ G W F G V+GG Y GC+P +ISPC HH + + P C +
Sbjct: 60 --ECGNGCNGGYPSGAWEFWTNDGLVSGGLYYSHIGCRPYSISPCEHHVNG-SRPKCSGE 116
Query: 210 KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDF 269
+ +C RC Y + +DKH +Y + + I EI +GP A ++ DF
Sbjct: 117 -IETPRCSRRC-EAGYSPKYSEDKHYGLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDF 174
Query: 270 YHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
YKSGVY+H + + H+ K++GWG ENGT
Sbjct: 175 LLYKSGVYQHKTGGSIGG--HAIKILGWGEENGT 206
>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 102/350 (29%), Positives = 153/350 (43%), Gaps = 36/350 (10%)
Query: 1 MIHILVFLLGCTLVRG-ELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIAD 58
++ + LG + +R + + ++D+IN+ W A N ++ ++
Sbjct: 9 LLSTALVTLGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFSE 61
Query: 59 AKYFD----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
AK Q LP R T + + +P+ FD+ E+WPNC TI + D AC A
Sbjct: 62 AKRLTGAWIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR--TWNFLHKR 172
+ A SDR C G+Q R S + F W + +
Sbjct: 121 VSTASAISDRYCTVGGGKQLR-------ISAAHLLSCCKQCGGGCKGGFPGFAWRYYVEY 173
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
G + + CQP C HHG+ C N K +C+T CT+ T
Sbjct: 174 GIAS-------SYCQPYPFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCTDKTIPL----I 222
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
K+R Y + E+ K+E+ +GP A +Y D + YKSGVY++ + + + +
Sbjct: 223 KYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMG--VTAV 280
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
K++GWG NGTPYW V NTW WG G + ILRG EC E+L AG P
Sbjct: 281 KVVGWGKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTP 330
>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
Length = 359
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 97/268 (36%), Positives = 124/268 (46%), Gaps = 34/268 (12%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDAR W C TIG + D G C + F AV + DR CI N LS
Sbjct: 100 SLKLPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHF--DMNISLSVN 157
Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
+ +CC +C C G+ W +L G VT Y D+ GC SH
Sbjct: 158 DLLACCGFLC----GAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC--------SHP 205
Query: 198 GSAPTLPSCENQKVPKLKCHTRCT--NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G P + PK C +C N + R KH + Y V + I E+
Sbjct: 206 GCEPAY------QTPK--CVRKCVKGNQIWKR----SKHYSVKAYRVKSDPQDIMAEVYK 253
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
+GP F +++DF HYKSGVYKH + + L H+ KLIGWGT + G YWL+ N W
Sbjct: 254 NGPVEVAFTVFEDFAHYKSGVYKHITGSALGG--HAVKLIGWGTSDEGEDYWLLANQWNT 311
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
+WGD G KI RG EC E + AG P
Sbjct: 312 NWGDDGYFKIKRGTNECGIEDDVTAGLP 339
>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 342
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 93/257 (36%), Positives = 129/257 (50%), Gaps = 18/257 (7%)
Query: 83 TVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
T+P F+A+ ++ +C IGH+ D C A+VG F+DR CI+S G+ LS Y
Sbjct: 38 TLPSNFNAQIKFASCADVIGHIRDQAECHNCWASASVGMFNDRVCIQSGGRITDILSLAY 97
Query: 142 VASCCK---ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------GDRTGCQPSTIS 192
+ SCC C D C GSV F+ G VTGG+Y G+ GC P
Sbjct: 98 LTSCCNHANGCPKSD--GCRRGSVAEGLIFMKNHGIVTGGEYKPPKKLGNDDGCWPYPFP 155
Query: 193 PCSH-HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKK 251
C+H G P C KV +L + C R D HR + + + IK+
Sbjct: 156 KCNHVPGMKVKYPRC-GSKVGRLAAPSHCDGLHCRRA--GDVHRAKSWGRLPISPEKIKQ 212
Query: 252 EILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINT 311
EI +GP A +++DF YKSGVY++ + A + H+ KLIGWG E G YWL +N+
Sbjct: 213 EIFDNGPVAAIMTIHEDFRLYKSGVYEYKTGAMVG--AHTLKLIGWGVEAGQEYWLAVNS 270
Query: 312 WGPHWGDRGTVKILRGK 328
W WGD+G +K+ GK
Sbjct: 271 WNEEWGDQGKIKLAVGK 287
>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 357
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 131/284 (46%), Gaps = 34/284 (11%)
Query: 67 RPLPGDRKTYDPEYS----ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
+P+P P S +P FDAR W C TIG + D G C + F AV + S
Sbjct: 80 KPMPKKELRSTPAISHPKTLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLS 139
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
DR CI N LS + +CC +C C G W +L G VT
Sbjct: 140 DRFCIHF--DVNISLSVNDLLACCGFLC----GSGCDGGYPLYAWRYLAHHGVVTEECDP 193
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y D+ GC SH G P + PK C +C + + + + KH +
Sbjct: 194 YFDQIGC--------SHPGCEPAY------RTPK--CVKKCVSGN--QVWKKSKHYSVSA 235
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y V+ + I E+ +GP F +Y+DF +YKSGVYKH + +L H+ KLIGWGT
Sbjct: 236 YRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGG--HAVKLIGWGT 293
Query: 300 -ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
++G YWL+ N W WGD G KI RG EC E + AG P
Sbjct: 294 TDDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLP 337
>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
Length = 357
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 97/268 (36%), Positives = 124/268 (46%), Gaps = 34/268 (12%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDAR W C TIG + D G C + F AV + DR CI N LS
Sbjct: 98 SLKLPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHF--DMNISLSVN 155
Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
+ +CC +C C G+ W +L G VT Y D+ GC SH
Sbjct: 156 DLLACCGFLC----GAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC--------SHP 203
Query: 198 GSAPTLPSCENQKVPKLKCHTRCT--NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G P + PK C +C N + R KH + Y V + I E+
Sbjct: 204 GCEPAY------QTPK--CVRKCVKGNQIWKR----SKHYSVKAYRVKSDPQDIMAEVYK 251
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
+GP F +++DF HYKSGVYKH + + L H+ KLIGWGT + G YWL+ N W
Sbjct: 252 NGPVEVAFTVFEDFAHYKSGVYKHITGSALGG--HAVKLIGWGTSDEGEDYWLLANQWNT 309
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
+WGD G KI RG EC E + AG P
Sbjct: 310 NWGDDGYFKIKRGTNECGIEDDVTAGLP 337
>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 210
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 81/221 (36%), Positives = 111/221 (50%), Gaps = 12/221 (5%)
Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
C + +A FSDR CI + G R LS E + +CC C C GS W F
Sbjct: 1 CGSCWAASAASVFSDRLCIATGGAVARNLSAEQLNTCCYRC----GNGCDGGSPEAAWYF 56
Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTR-CTNPTYGR 227
+ G VTGGDY GCQP +I P +C + + C R CTN Y +
Sbjct: 57 FMRHGIVTGGDYESGDGCQPYSIYP-----RGKGRNTCIDDDIDTPDCSIRTCTNSNYTK 111
Query: 228 GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN 287
G+ D H Y + +E+ I +I +GP A F +Y DF +YKSGVY +T ++E
Sbjct: 112 GYRADLHYVDTVYSLSRSEEDIMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYT-RGQIEG 170
Query: 288 YLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
H+ K++GWG ++ T YWL N+W WG+ G +ILRG
Sbjct: 171 -GHAIKILGWGVDDNTKYWLCANSWSRSWGENGLFRILRGN 210
>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 101/350 (28%), Positives = 153/350 (43%), Gaps = 36/350 (10%)
Query: 1 MIHILVFLLGCTLVRG-ELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIAD 58
++ + LG + +R + + ++D+IN+ W A N ++ ++
Sbjct: 9 LLSTALVALGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFSE 61
Query: 59 AKYFD----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
AK Q + LP R T + + +P+ FD+ E+WPNC TI + D AC A
Sbjct: 62 AKRLTGAWIQKNSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR--TWNFLHKR 172
+ A SDR C G+Q R S + F W + +
Sbjct: 121 VSTASAISDRYCTVGGGKQLR-------ISAAHLLSCCKQCGGGCKGGFPGFAWRYYVEY 173
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
G + + CQP C H G+ C N K +C+T CT+ T
Sbjct: 174 GIAS-------SYCQPYPFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTDKTIPL----I 222
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
K+R Y + E+ K+E+ +GP A +Y D + YKSGVY++ + + + +
Sbjct: 223 KYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMG--VTAV 280
Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
K++GWG NGTPYW V NTW WG G + ILRG EC E+L AG P
Sbjct: 281 KVVGWGKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTP 330
>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
Length = 359
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 97/268 (36%), Positives = 124/268 (46%), Gaps = 34/268 (12%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDAR W C TIG + D G C + F AV + DR C S N LS
Sbjct: 100 SLKLPKEFDARAAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFC--SHFDMNISLSVN 157
Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
+ +CC +C C G+ W +L G VT Y D+ GC SH
Sbjct: 158 DLLACCGFLC----GAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC--------SHP 205
Query: 198 GSAPTLPSCENQKVPKLKCHTRCT--NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G P + PK C +C N + R KH + Y V + I E+
Sbjct: 206 GCEPAY------QTPK--CVRKCVKGNQIWKR----SKHYSVKAYRVKSDPQDIMTEVYK 253
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
+GP F +++DF HYKSGVYKH + + L H+ KLIGWGT + G YWL+ N W
Sbjct: 254 NGPVEVAFTVFEDFAHYKSGVYKHITGSALGG--HAVKLIGWGTSDEGEDYWLLANQWNT 311
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
+WGD G KI RG EC E + AG P
Sbjct: 312 NWGDDGYFKIKRGTNECGIEDDVTAGLP 339
>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
Length = 194
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/198 (38%), Positives = 107/198 (54%), Gaps = 10/198 (5%)
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
++ A SDR CI S G + LS + + +CC C Y C G + W + G V
Sbjct: 6 SSAAAMSDRVCIASXGAKQVLLSDQDMLACCSWCGY----GCEGGWPMKAWQYFXLEGVV 61
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
TGG+Y + C+P PC HG P C ++ K PK C C Y + + +DKH
Sbjct: 62 TGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDSAKTPK--CQKTCQR-GYLKPYKEDKH 118
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
Y + +N AI+++I+ +GP A F +Y+DF HYKSG+YKHT+ H+ K+
Sbjct: 119 FGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGG--HAVKI 176
Query: 295 IGWGTENGTPYWLVINTW 312
IGWG E GTPYWL+ N+W
Sbjct: 177 IGWGKEXGTPYWLIANSW 194
>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 339
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 100/323 (30%), Positives = 153/323 (47%), Gaps = 36/323 (11%)
Query: 26 IDQINREAN-TWTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
+D++N TW AG N F + + E+L++ I AK ++ +R T+ +
Sbjct: 38 VDKVNAHPRATWKAGFNDRFEGH-TIEHLKK--ICGAKMTPANELEPSIERVTHKHK-KL 93
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P FDAR+ W +C TIG + D G C + F A + +DR CI ++ LS +
Sbjct: 94 VLPKEFDARKHWGHCSTIGAILDQGHCGSCWAFGAAESLTDRFCIHM--NESVSLSENDL 151
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSA 200
+CC ++ C G R W + + G VT Y D+ GC H G
Sbjct: 152 LACCG---FECGDGCDGGYPIRAWRYFKRTGVVTSKCDPYFDQIGC--------GHPGCY 200
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
PT + PK C C + + + KH + Y V + + E+ +GP
Sbjct: 201 PTY------RTPK--CVKHCVDDEL---WVKSKHLSVNAYEVSKEPEDLMAELYTNGPIE 249
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDR 319
+F +++DF HYK+GVYKH + H+ KLIGWGT ++G YW ++N+W +WG+
Sbjct: 250 VSFEVFEDFAHYKTGVYKHVYGRYIGG--HAVKLIGWGTTDDGVDYWTIVNSWNTNWGEH 307
Query: 320 GTVKILRGKYECAFEYLIAAGKP 342
G +I RG EC E AG P
Sbjct: 308 GLFRIARGGNECGIESYAVAGLP 330
>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
Length = 350
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 90/262 (34%), Positives = 127/262 (48%), Gaps = 29/262 (11%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P +FDAR+ WP+C + + D G C + FAAV A SDR CI Q N LS +
Sbjct: 95 LPSKFDARKAWPHCTSTRSILDQGHCGSCWAFAAVEALSDRFCIHF--QVNATLSENDLV 152
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAP 201
+CC + C+ G W + +RG VT Y D G C+H G P
Sbjct: 153 ACCG---FRCGSGCNGGFPLSAWRYFSRRGVVTDECDPYFDNDG--------CNHPGCEP 201
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
+ P+ +C C + + + KH + Y + + I E+ +GP
Sbjct: 202 SYPT--------PRCVKNCKD---NQRWSHSKHYSANAYRIKSDPYNIMAEVFNNGPVEV 250
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWGDRG 320
+F++Y+DF HY++GVYKH L H+ KLIGWG T++G YWL+ N+W WG+ G
Sbjct: 251 SFSVYEDFAHYETGVYKHVQGRYLGG--HAVKLIGWGTTDDGIDYWLIANSWNTAWGEGG 308
Query: 321 TVKILRGKYECAFEYLIAAGKP 342
KI RG EC E AG P
Sbjct: 309 YFKIARGVNECGIERDPVAGMP 330
>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
Schistosoma japonicum [Schistosoma japonicum]
Length = 312
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 89/266 (33%), Positives = 130/266 (48%), Gaps = 10/266 (3%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
SD I IN++ N W A R S + + + DQ P + +
Sbjct: 32 LSDELITFINKQPNIEWKADRTTRFT-SIHHAKSMMGVLLNRVDQHKLHHP---IIHHND 87
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +P FD+R+ W NC +I + D +C + F AV + SDR CI SKG+ + LS
Sbjct: 88 INIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSA 147
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ SCC C + C+ G W++ G VTGG TGCQP C HH +
Sbjct: 148 VNLLSCCSRCGF----GCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHST 203
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
+ SCE + +C+ C P Y + DK+ +Y+V +E +I KEIL +GP
Sbjct: 204 SINHSSCEVKYYSTPECYQTC-QPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPV 262
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKL 285
ATF +YDDF +YK+GVYK+ + + L
Sbjct: 263 EATFYVYDDFLNYKTGVYKYVTGSLL 288
>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
Length = 313
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 100/267 (37%), Positives = 128/267 (47%), Gaps = 28/267 (10%)
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
DP A P FD+R W NC TIG++ + C + F AV + DR CI KG +
Sbjct: 73 DPNIKA--PASFDSRTAWSNCTTIGYIENQARCGSCWAFGAVESAQDRICIH-KGLDVQL 129
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
+ V C D+ C G WNFL K+G VT C+P TI C
Sbjct: 130 SFLDLVT-----CDQSDD-GCEGGDDVSAWNFLKKQGVVT-------QECKPYTIPTC-- 174
Query: 197 HGSAPTLPSCENQKVPKLKCHTRC-TNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
P C N V C +C +N T + QDKH+ Y ++ E AI +EI
Sbjct: 175 ---PPAQQPCLN-FVNTPNCVKQCESNSTLI--YSQDKHKMAKIYSINSVE-AIMQEIST 227
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP A F++Y+DF YKSGVY+HT+ L H K+ G+GT NG YW V N+W
Sbjct: 228 NGPVEACFSVYEDFLGYKSGVYQHTTGKFLGG--HCVKIFGYGTLNGVNYWSVANSWTTS 285
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WGD G I RG EC E + AG P
Sbjct: 286 WGDNGIFLIKRGSDECGIEDEVVAGIP 312
>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
Length = 357
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 97/268 (36%), Positives = 131/268 (48%), Gaps = 32/268 (11%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ S +P FDAR W +C +I + G C + F AV + SDR CIK N LS
Sbjct: 98 DLSLKLPKEFDARTAWSHCTSIRRI--LGHCGSCWAFGAVESLSDRFCIKY--NLNVSLS 153
Query: 139 TEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
V +CC + C + C+ G W + G VT Y D TGC S
Sbjct: 154 ANDVIACCGLLCGF----GCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGC--------S 201
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G PT P+ + ++ KC +R N +G + KH Y ++ + I E+
Sbjct: 202 HPGCEPTYPTPKCER----KCVSR--NQLWG----ESKHYGVGAYRINPDPQDIMAEVYK 251
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
+GP F +Y+DF HYKSGVYK+ + K+ H+ KLIGWGT ++G YWL+ N W
Sbjct: 252 NGPVEVAFTVYEDFAHYKSGVYKYITGTKIGG--HAVKLIGWGTSDDGEDYWLLANQWNR 309
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
WGD G KI RG EC E + AG P
Sbjct: 310 SWGDDGYFKIRRGTNECGIEQSVVAGLP 337
>gi|170028894|ref|XP_001842329.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879379|gb|EDS42762.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 355
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 82/245 (33%), Positives = 111/245 (45%), Gaps = 27/245 (11%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR +WPNC +I +P+ G C + +DR CI+S G R S
Sbjct: 20 IPTSFDARTRWPNCPSIALIPNQGCCNSSAFQIPAAVITDRACIRSNGTSTRTYSAYDAL 79
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
+CC C + C+ G + WN+ G V+ C P ++SP + P L
Sbjct: 80 ACCTDCPFSQLFKCAGGDPLKVWNYWATTGLVS-------DSCMPFSLSPLCLGFNCPLL 132
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P Y D+ + V DAI+ EI+ +GP A+F
Sbjct: 133 -----------------CAPGYAGSIVGDRKKGLKVVTVAPYVDAIQSEIILNGPVEASF 175
Query: 264 ALYDDFYHYK-SGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
LY DF H K S VY S L S K+IGWG ENGT YWL+ +T+G WG++GT
Sbjct: 176 DLYLDFVHLKQSQVYNSRSGPNLGR--QSVKIIGWGVENGTEYWLITSTFGIGWGNQGTA 233
Query: 323 KILRG 327
LRG
Sbjct: 234 MFLRG 238
>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 228
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 85/217 (39%), Positives = 109/217 (50%), Gaps = 13/217 (5%)
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
F+DR CIKS G+ LS Y+ SCC + + C GSV NF+ G VTGG+
Sbjct: 2 FNDRVCIKSGGKTTDILSLGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGE 61
Query: 180 Y------GDRTGCQPSTISPCSH-HGSAPTLPSCENQK-VPKLKCHTRCTNPTYGRGFFQ 231
Y G+ GC P C+H G P C + +P C T C N YG +
Sbjct: 62 YKPPEKLGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRDLPA--CATTCPNKAYGTSMQK 119
Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
D HR + + IK+EI +GP A LY+DF +YKSGVY H + L H+
Sbjct: 120 DTHRAKSWGRLPIGPEKIKQEIFDNGPVAAMMTLYEDFRYYKSGVYVHKTGQLLA--AHT 177
Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
KLIGWG E+G YWL +N W WGD G +K+ GK
Sbjct: 178 LKLIGWGVESGQEYWLAMNAWNEEWGDHGMIKLAVGK 214
>gi|86279341|gb|ABC88766.1| putative cathepsin B-like like proteinase [Tenebrio molitor]
Length = 301
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 91/277 (32%), Positives = 133/277 (48%), Gaps = 16/277 (5%)
Query: 1 MIHILVFLLGCTLVRG--ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ +V L L G +L+ SD +I++IN + TW AGRNF N ++R+ L
Sbjct: 4 VLLCIVVLASVALSYGGVKLHPLSDEFINEINSKQTTWKAGRNFDVNTPISHVRRLLGVL 63
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI-GHVPDTGACAAPHIFAA 117
K + P+ KT+ A +P+ FDARE WP C +I G + D +C + F A
Sbjct: 64 PKKANAPKLPV----KTHAVNLDA-IPESFDAREAWPECTSIIGEIRDQASCGSCWAFGA 118
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
V A SDR CI S +S E + CC YD C+ G W++ G VTG
Sbjct: 119 VEAMSDRICIHSDASVKVRISAEDLNDCC----YDCGDGCNGGWPDLAWSYWSSTGIVTG 174
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
G YG GC+ +I PC HH P + Q+ P K + + T + D R +
Sbjct: 175 GLYGVDEGCKAYSIKPCDHHVDGNLGPCGDIQRTPACK---KSCDSTSDLEYKSDLRRGS 231
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
Y + +E I+ EI+ +GP A + +Y DF YK+
Sbjct: 232 -AYSIPKSESQIQTEIMTNGPVEADYDVYSDFLTYKA 267
>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
Length = 392
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 133/321 (41%), Gaps = 66/321 (20%)
Query: 16 GELYK-----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF--DQSDRP 68
ELY+ + +D+IN E N WTA + +E + DAK +
Sbjct: 65 AELYEDTRPAIMQSLVDEINSEQNLWTASTD------QERFYGHSLGDAKKLCGTLLEEA 118
Query: 69 LPGDRKTYDPEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCI 127
+ K Y P A +P+ FDAR+ + C IGHV
Sbjct: 119 EGLEEKVYPPGELADIPNSFDARDAFKECKDVIGHV------------------------ 154
Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ 187
CC C+ G W+FL+ G T G GC
Sbjct: 155 -----------------CCD--------GCTKGRPDAAWSFLNVYGIATEGSMSAADGCW 189
Query: 188 PSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT-LTYWVDDNE 246
P C HH C + C RC N YG +D+H T + +
Sbjct: 190 PYNFPKCGHHQQDSKYQPCPEKNYDTPPCLDRCPNKNYGTPLDKDRHFTAHFSPYQLKGT 249
Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
D IKKEI+ +GPT+A F++YDDF Y+SGVYKHTS + H ++IGWGT+ G YW
Sbjct: 250 DNIKKEIMTNGPTSAAFSMYDDFLSYESGVYKHTSGTLMGE--HGVEIIGWGTKQGVDYW 307
Query: 307 LVINTWGPHWGDRGTVKILRG 327
LV+N+W WG GT KI +G
Sbjct: 308 LVMNSWNEGWGVHGTFKIAQG 328
>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 354
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/187 (40%), Positives = 100/187 (53%), Gaps = 5/187 (2%)
Query: 157 CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKC 216
C+ G W + G VTGG + GCQP I C HH + P C+ + P +C
Sbjct: 173 CNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGP-CQGEG-PTPEC 230
Query: 217 HTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV 276
+C +Y + QDKH + +N +A + EI+ +GP A F +Y+DF YKSGV
Sbjct: 231 KHKC-EASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKSGV 289
Query: 277 YKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYL 336
Y+HT+ L H+ K++GWG E GT YWLV N+W WGD G KILRG EC E
Sbjct: 290 YQHTTGGVLGG--HAIKILGWGVEEGTKYWLVANSWNNEWGDNGFFKILRGSNECGIESD 347
Query: 337 IAAGKPK 343
I G PK
Sbjct: 348 INFGIPK 354
>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/329 (30%), Positives = 146/329 (44%), Gaps = 36/329 (10%)
Query: 21 FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAK----YFDQSDRPLPGDRKT 75
+ ++D+IN+ W A + + ++ ++AK F + LP R T
Sbjct: 31 LTQKFVDRINQLNGGMWKA-------VYDGKMQNLTFSEAKRLTGAFSRKTSTLPPVRFT 83
Query: 76 YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
+ + +P+ FDA E+WP+C TI +PD AC A A A SDR C G+Q R
Sbjct: 84 -EEQLRTELPESFDAAEKWPHCPTIREIPDQSACRASWAVATASAISDRYCTVGNGKQLR 142
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
+ + +A C + W + G + + CQP C
Sbjct: 143 ISAADLMACCTGCGGGCEGGYPDAA-----WEYYVSNGITS-------SQCQPYPFPRCE 190
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G+ P C C+ CT+ + K+R +Y V ED K+E+
Sbjct: 191 HRGAQGKKPPCSKYNFDTPTCNATCTD----KSVPLIKYRGNHSYEVRGEED-YKRELYF 245
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWLVINTWG 313
+GP F ++ DF YKSGVY+H + N+L + +++GWG NGTPYW V N+W
Sbjct: 246 NGPFVVRFQVHSDFLAYKSGVYQHVAG----NFLGGKAVRIVGWGKMNGTPYWKVANSWD 301
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG G ILRG EC E+L AG P
Sbjct: 302 TDWGMNGYFLILRGNNECNIEHLGFAGTP 330
>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 388
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 145/328 (44%), Gaps = 25/328 (7%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
D+ D +N+ TW A +E + + D K + P + P
Sbjct: 67 LMDSLADALNQGQKTWVASSK------QERFKGASVFDVKALCGTILNGP-SKLPKKPAS 119
Query: 81 SATV----PDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
+TV PDRFDARE + NC T IGHV + P + A + I +
Sbjct: 120 ESTVLSNLPDRFDAREHFKNCATVIGHV------SPPVVAAGLLRRLKHSAIVCASARVD 173
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
L T Y + K + V N + G T D +GC P CS
Sbjct: 174 SL-TWYHFLLATLRHVAQKKKVAFHLVAMAVNLIAHGGGSTFAPELD-SGCWPYNFPECS 231
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HH + C+ P C T C N + F D+H T + D D IK+EI+
Sbjct: 232 HHVDTKGMEPCKGNS-PSPVCSTTCRNHHFKPSFESDRHFTEDEGYSLDEVDEIKREIID 290
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
+GP A F +Y+DF +YKSGVYKH + ++L H+ K+IGWG + YWLV+N+W +
Sbjct: 291 NGPVAAAFTVYEDFPYYKSGVYKHVNGSELGG--HAVKIIGWGIDQNEQYWLVMNSWNVN 348
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD+G KI G EC + + AG PK
Sbjct: 349 WGDQGIFKIAIG--ECGIDSEVTAGIPK 374
>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
Precursor
gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
Length = 311
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 93/269 (34%), Positives = 129/269 (47%), Gaps = 27/269 (10%)
Query: 74 KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
K+YDP +P F+A+ WPNC TI + + C + F A + +DR CI + +
Sbjct: 70 KSYDP-LGVQIPTSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNN--E 126
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N LS + +C + + C G F WN+L K+G+V+ C P TI
Sbjct: 127 NVQLSFMDMVTC-----DETDNGCEGGDAFSAWNWLRKQGAVS-------EECLPYTIPT 174
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C P C N V C C + + + QDKH+ Y D +E AI +EI
Sbjct: 175 C-----PPAQQPCLN-FVNTPSCTKECQSNS-SLIYSQDKHKMAKIYSFDSDE-AIMQEI 226
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
+ +GP A F +++DF YKSGVY HT+ L H KL+G+GT NG Y+ N W
Sbjct: 227 VTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGG--HCVKLVGFGTLNGVDYYAANNQWT 284
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WGD GT I RG +C + AG P
Sbjct: 285 TSWGDNGTFLIKRG--DCGISDDVVAGLP 311
>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
Length = 357
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 95/266 (35%), Positives = 123/266 (46%), Gaps = 30/266 (11%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P FDAR W C TIG + D G C + F AV + SDR CI N LS
Sbjct: 98 SLKLPKSFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHL--DVNVSLSVN 155
Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
+ +CC +C C G W +L G VT Y D+ G CSH
Sbjct: 156 DLLACCGFLC----GSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIG--------CSHP 203
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
G P + P KC +C + + + K+ + Y V + I E+ +G
Sbjct: 204 GCEPAY------QTP--KCVRKCVKGN--QIWKKSKYFSVNAYSVKSDPYDIMAEVYKNG 253
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHW 316
P F +Y+DF HYKSGVYKH + ++L H+ KLIGWG T+ G YWL+ N W W
Sbjct: 254 PVEVAFTVYEDFAHYKSGVYKHITGSQLGG--HAVKLIGWGTTDEGEDYWLIANQWNRSW 311
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
GD G I RG EC E + AG P
Sbjct: 312 GDDGYFMIRRGTNECGIEEDVTAGLP 337
>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 152/349 (43%), Gaps = 34/349 (9%)
Query: 1 MIHILVFLLGCTLVRG-ELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIAD 58
++ + LG + +R + + ++D+IN+ W A N ++ ++
Sbjct: 9 LLSTALVTLGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFSE 61
Query: 59 AKYFD----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
AK Q + LP R T + + +P+ FD+ E+WPNC TI + D AC A
Sbjct: 62 AKRLTGAWIQKNSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+ A SDR C G+Q R S + F + +L+
Sbjct: 121 VSTASAISDRYCTVGGGKQLR-------ISAAHLLSCCKQCGGGCKGGFPGFAWLYYV-- 171
Query: 175 VTGGDYG-DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
+YG +GCQP C H G+ C K KC+ CT+ + K
Sbjct: 172 ----EYGIASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTD----KSIPLVK 223
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
+R TY + E+ K+E+ +GP A F +Y D + YKSGVY++ L + +
Sbjct: 224 YRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG--QAVR 281
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
++GWG NGTPYW V N+W WG G + IL G EC E+L G P
Sbjct: 282 IVGWGKLNGTPYWKVANSWDTDWGMNGYMLILGGNNECNIEHLGFTGFP 330
>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 145/346 (41%), Gaps = 35/346 (10%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
LV L L+ + + ++D+IN+ W A N ++ A+AK
Sbjct: 14 LVTLGVSALLVKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEAKRLT 66
Query: 64 ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
Q LP R T + + +P+ FD+ E+WPNC TI + D AC A +
Sbjct: 67 GAWIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTAS 125
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR--TWNFLHKRGSVTG 177
SDR C QQ R S + F W + + G +
Sbjct: 126 VISDRYCTVGGVQQLR-------ISAAHLLSCCKQCGGGCKGGFPGFAWRYYVEYGIAS- 177
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
+ CQP C H G+ C KC+ CT+ + K+R
Sbjct: 178 ------SYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTD----KSIPLVKYRGN 227
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
TY + E+ K+E+ +GP A F +Y D + YKSGVY+H L + K++GW
Sbjct: 228 ATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGG--TAVKVVGW 285
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G NGTPYW V NTW WG G + ILRG EC E+L AG P+
Sbjct: 286 GKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 331
>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
Length = 298
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 121/278 (43%), Gaps = 24/278 (8%)
Query: 71 GDRKTYDPEYSATVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
GD Y P A P+ FD+ +WP C IG + D C FA A SDR+CI +
Sbjct: 12 GDVVDYVPRGGA-APEAFDSAARWPECAKLIGDIRDQSNCGCCWAFAGAEAASDRQCIAT 70
Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG---- 185
G PLS + +C + C G + W ++ K G+VTGG Y + TG
Sbjct: 71 GGAVAVPLSAQ------DVCFNANVDGCDGGQIITPWTYVAKAGAVTGGQY-NGTGPFGA 123
Query: 186 --CQPSTISPCSHHGSAPTLP-------SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
C C HHG P C ++K P+ T F DKH
Sbjct: 124 GLCADWFAPHCHHHGPRGDDPYPAEGDAGCPSEKSPEGPKACDATAAAGHDAFAADKHTF 183
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
E AI I GP F +Y+DF +Y G+Y H + + H+ K +G
Sbjct: 184 AGDVQTASGEAAIMAMIAEGGPVETAFTVYEDFENYAGGIYHHVTGEEAGG--HAVKFVG 241
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
WG ENGT YW V N+W P+WG+ G +ILRG E E
Sbjct: 242 WGVENGTKYWKVANSWNPYWGEAGYFRILRGSNEGGIE 279
>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 156/352 (44%), Gaps = 53/352 (15%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDA------YIDQINREANTWTAGRNFPA--NLSEEYLR 52
M+ I FL+ LV G+ S +D+IN W A +P NL+ E +
Sbjct: 1 MLAIAAFLV--LLVSGDGIPISKEKVISRDLVDKINTLNVGWEATL-YPQFENLTFESAK 57
Query: 53 QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
L + + + S P + + +P+ FDAR+QWP G+I + + G C +
Sbjct: 58 SMLGSRGAWPEGSLPP------EIEVRVAENIPENFDARKQWP--GSIHPIRNQGQCGSC 109
Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
F A SDR I SK Q LS + + C DN CS G WN++ K
Sbjct: 110 WAFGASEVLSDRFAIASKNQIYVTLSAQQLVDCDL-----DNSGCSGGWPINAWNYMVKT 164
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTR-CT-NPTYGRGFF 230
G +T YG P Q +L +T C P F+
Sbjct: 165 GLLTEQCYG----------------------PYYAKQYTCRLTANTTDCPWQPGVKARFY 202
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
K L N +AI+ +I+ +GP A F ++ DFY Y+SG+Y H + +L H
Sbjct: 203 HAKSAYKLP---AKNVEAIQTDIMNNGPVEADFTIFQDFYAYRSGIYVHATGKQLGG--H 257
Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ K++GWGTE+ YWL N+WG +WG +G KI RG EC E +AAG P
Sbjct: 258 AIKILGWGTEDNVDYWLCANSWGANWGIQGYFKIRRGTDECGIEDGLAAGLP 309
>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 250
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 113/229 (49%), Gaps = 13/229 (5%)
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
A+ + SDR CI++ G LS + SC K + C G +W++ K G V
Sbjct: 33 ASAASISDRTCIQTNGTMKVQLSAIELISCSK-----NKLGCQIGFSEFSWDYWLKNGLV 87
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
TG D TGC P C H S+ + P C C C + Y + DKH
Sbjct: 88 TG----DPTGCLPYPFPKCDHR-SSNSYPKCGYITYTAPPCTKTCRS-GYPIPYKADKHY 141
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
+ Y + NE I+KEI+ +GP A ++ DF +YKSGVY+H + + +HS ++I
Sbjct: 142 GRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVT--IHSVRII 199
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
GWG EN PYWL N+W WG G KILRG EC E + AGK N
Sbjct: 200 GWGIENDIPYWLCANSWNEDWGLNGYFKILRGSNECEIESFVNAGKVDN 248
>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 174
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 69/177 (38%), Positives = 99/177 (55%), Gaps = 6/177 (3%)
Query: 165 TWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTRCTNP 223
W + G VTGG+Y + C+P PC HG P C + K PK C C
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDTAKTPK--CQKTCQR- 57
Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
Y + + +DKH Y + +N AI+++I+ +GP A F +Y+DF HYKSG+YKHT+
Sbjct: 58 GYLKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGR 117
Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
H+ K+IGWG E GTPYWL+ N+W WG++G +++RG C E ++ AG
Sbjct: 118 MTGG--HAVKIIGWGKEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAG 172
>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
Length = 215
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 85/224 (37%), Positives = 116/224 (51%), Gaps = 19/224 (8%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P +FDAR++W C TIG V D G CA+ + AF+DR C+ + G N+ LS E +
Sbjct: 6 IPRKFDARKKWLRCKTIGEVRDQGNCASGWALSTSSAFADRLCVATNGDFNQLLSAEEIT 65
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
CC C C G R W K G VTGG+Y GC+P + PC +
Sbjct: 66 FCCHTC----GNGCYGGYPIRAWKSFKKHGLVTGGNYKSGEGCEPYRVPPCPYDEYGNN- 120
Query: 204 PSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
+C Q ++ + RCT YG F QD T Y++ I+K+++ +GP
Sbjct: 121 -TCSGQ---PMESNHRCTRMCYGNQDLDFDQDHRYTRDHYYL--TYRGIQKDVINYGPIE 174
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENG 302
A+F +YDDF YKSG+Y + NA +YL HS KLIGWG E G
Sbjct: 175 ASFDVYDDFPSYKSGIYVKSENA---SYLGGHSVKLIGWGEEYG 215
>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
Length = 279
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 74/198 (37%), Positives = 107/198 (54%), Gaps = 6/198 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S GQQ+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCC+ C C G W++ KRG VTGG + TGCQP C HH
Sbjct: 145 ALDLISCCEDC----GDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C C Y + QDKH +Y V NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDESYNVISNEKAIQREIMMYGP 258
Query: 259 TTATFALYDDFYHYKSGV 276
A F +Y+DF +YKSG+
Sbjct: 259 VEAAFDVYEDFLNYKSGI 276
>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
Length = 358
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/265 (34%), Positives = 124/265 (46%), Gaps = 30/265 (11%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR WP C TIG + D G C + F AV + SDR CI N LS +
Sbjct: 101 LPKHFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHFG--MNISLSVNDLL 158
Query: 144 SCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSA 200
+CC +C C G W + G VT Y D TG CSH G
Sbjct: 159 ACCGFLC----GSGCDGGYPLYAWRYFIHHGVVTEECDPYFDATG--------CSHPGCE 206
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P P+ KC +CT+ + + + K Y + + I E+ +GP
Sbjct: 207 PGYPT--------PKCVRKCTDEN--QLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVE 256
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWGDR 319
F +Y+DF HY+SGVY++T+ + H+ KLIGWG T++G YW++ N W +WGD
Sbjct: 257 VAFTVYEDFAHYESGVYRYTTGDVMGG--HAVKLIGWGTTDDGEDYWILANQWNRNWGDD 314
Query: 320 GTVKILRGKYECAFEYLIAAGKPKN 344
G I RG EC E + AG P +
Sbjct: 315 GYFMIRRGVNECGIEEGVVAGLPSS 339
>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 101/346 (29%), Positives = 145/346 (41%), Gaps = 35/346 (10%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
LV L L+ + + ++D+IN+ W A N ++ A+AK
Sbjct: 14 LVTLGVSALLVKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEAKRLT 66
Query: 64 ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
Q LP R T + + +P+ FD+ E+WPNC TI + D AC A +
Sbjct: 67 GAWIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTAS 125
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR--TWNFLHKRGSVTG 177
SDR C QQ R S + F W + + G +
Sbjct: 126 VISDRYCTVGGVQQLR-------ISAAHLLSCCKQCGGGCKGGFPGFAWRYYVEYGIAS- 177
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
+ CQP C H G+ C KC+ CT+ + K+R
Sbjct: 178 ------SYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTD----KSIPLVKYRGN 227
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
TY + E+ K+E+ +GP A F +Y D + YKSGVY++ L + +++GW
Sbjct: 228 ATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGG--QAVRIVGW 285
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
G NGTPYW V NTW WG G + ILRG EC E+L AG P+
Sbjct: 286 GKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 331
>gi|324514184|gb|ADY45787.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 476
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 143/323 (44%), Gaps = 54/323 (16%)
Query: 41 NFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI 100
NFP + + +R++L +++F+ T P + ++P FDAR +W C ++
Sbjct: 152 NFPFDKNSTAIREYLNRLSEFFNSEKMKQHLRELTEFP--ADSLPSEFDARRKWSYCSSL 209
Query: 101 GHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHG 160
+VP+ G C A + AAVG SDR CI S G S E V CC +C +C G
Sbjct: 210 HNVPNQGGCGACYAVAAVGVASDRACIASNGTLQSMFSEEDVLGCCAVC-----GNCYGG 264
Query: 161 SVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS-PCSHHGSAPTLPSCENQKVPKLKCHTR 219
+ + G VTGG R GC+P ++ C S P E ++ KC+ +
Sbjct: 265 DPLKALVYWVDEGLVTGG----RDGCRPYSVDLSCGVPCSPAVYPLAEYRR----KCYRQ 316
Query: 220 CTNPTYGRGFFQDKHRTTLTY------------------------WVDDNED-------- 247
C + + + DKH ++ Y ++++ D
Sbjct: 317 CQDIYFQYNYESDKHYGSMAYSMFPRTMSLDNKGSERVKLPTVIGYLNETSDEPLTDKEI 376
Query: 248 --AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN---YLHSGKLIGWGTENG 302
I KE+ GP T F + ++F HY SGV+ A + Y H +LIGWG +G
Sbjct: 377 RQIIMKELYLWGPMTMAFPVTEEFLHYSSGVFSPFPAANFSDRIVYWHVARLIGWGKYDG 436
Query: 303 -TPYWLVINTWGPHWGDRGTVKI 324
YWL +N++G HWGD G +I
Sbjct: 437 DNHYWLAVNSFGRHWGDDGVFRI 459
>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
Length = 376
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/347 (30%), Positives = 141/347 (40%), Gaps = 58/347 (16%)
Query: 21 FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
++ I ++N + W A N L F + KY +P P P
Sbjct: 41 LQESIIKKVNENPDAGWEAAMN-------PQLSNFTVGQFKYL-LGAKPTPKKELMGVPM 92
Query: 80 YS----ATVPDRFDAREQWPNCGTIGHV-----------------PDTGACAAPHIFAAV 118
S +P FDAR WP+C TIG + G C + F AV
Sbjct: 93 ISHPKTLKLPKEFDARTAWPHCSTIGKILGQLLSFYNIFSIFFFLFLEGHCGSCWAFGAV 152
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG- 177
+ SDR CI N LS + +CC D C G W + G VT
Sbjct: 153 ESLSDRFCIHFG--MNISLSVNDLLACCGFLCGD---GCDGGYPMYAWRYFVHHGVVTEE 207
Query: 178 -GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
Y D GC SH G P P+ KC +C + + + Q KH +
Sbjct: 208 CDPYFDNIGC--------SHPGCEPGFPT--------PKCVRKCIDKN--QLWRQSKHYS 249
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
Y + + + E+ +GP +F +Y+DF HYKSGVYKH + + H+ KLIG
Sbjct: 250 VNAYRISSDPHDVMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGG--HAVKLIG 307
Query: 297 WGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WGT +NG YWL+ N W WGD G KI RG EC E AG P
Sbjct: 308 WGTSDNGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEDDAVAGLP 354
>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
Length = 354
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 100/319 (31%), Positives = 137/319 (42%), Gaps = 46/319 (14%)
Query: 26 IDQINREANTWTAGRNFP--ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
ID+IN +P ANLS R L + D P D E
Sbjct: 78 IDKINANETLGWKATEYPRFANLSISEARDSLFGLSLLSTDPDTP------RLDIEPRVD 131
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR QW C I V D C A F+A + R CI + G+ N LS EY
Sbjct: 132 LPMNFDARTQWRGC--IPAVRDQQTCGACWAFSATYVLAHRLCIATNGKTNVVLSPEYQV 189
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
C + NK+C G + W+FL + G+ T+ C + S
Sbjct: 190 QCDTM-----NKACQGGYLKYAWSFLERTGT---------------TVDSCIPYASGRAT 229
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
S C +C T ++ K+ ++ + IK I+++G + F
Sbjct: 230 FSSGT-------CPAKCKVSTQSMTMYKAKNSRYIS-----GVNNIKAAIMSYGSVQSGF 277
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y DF Y+SGVYKH S L H+ LIGWG E+GT YWL +N+WG +WG G K
Sbjct: 278 TIYRDFMSYRSGVYKHVSTTTLGG--HAVALIGWGVESGTNYWLAVNSWGSNWGMSGYFK 335
Query: 324 ILRGKYECAFEYLIAAGKP 342
I +G EC E + AG+P
Sbjct: 336 IAQG--ECGIENQVYAGEP 352
>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
Length = 327
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 95/274 (34%), Positives = 126/274 (45%), Gaps = 34/274 (12%)
Query: 67 RPLPGDRKTYDPEYS----ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
+P+P P S +P FDAR W C TIG + D G C + F AV + S
Sbjct: 80 KPMPKKELRSTPAISHPKTLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLS 139
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
DR CI N LS + +CC +C C G W +L G VT
Sbjct: 140 DRFCIHF--DVNISLSVNDLLACCGFLC----GSGCDGGYPLYAWRYLAHHGVVTEECDP 193
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
Y D+ GC SH G P + PK C +C + + + + KH +
Sbjct: 194 YFDQIGC--------SHPGCEPAY------RTPK--CVKKCVSGN--QVWKKSKHYSVSA 235
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG- 298
Y V+ + I E+ +GP F +Y+DF +YKSGVYKH + +L H+ KLIGWG
Sbjct: 236 YRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGG--HAVKLIGWGT 293
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECA 332
T++G YWL+ N W WGD G KI RG EC
Sbjct: 294 TDDGEDYWLLANQWNREWGDDGYFKIRRGTNECG 327
>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
Length = 181
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 101/174 (58%), Gaps = 4/174 (2%)
Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
++L KRG VTGG + TGCQP C H + P+C + +C +C Y
Sbjct: 8 DYLVKRGIVTGGSKENHTGCQPYPFPKCEHL-TKGKYPACGTKIYKTPQCKQKCQK-GYK 65
Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
+ QDK+ Y V N AI+KEI+ +GP A F +Y+DF +YKSG+Y+H + + +
Sbjct: 66 TPYEQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVG 125
Query: 287 NYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
H+ ++IGWG E TPYWL+ N+W WG++G +I+RG+ EC+ E + AG
Sbjct: 126 G--HAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAG 177
>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 97/270 (35%), Positives = 127/270 (47%), Gaps = 30/270 (11%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ S +P FDAR W C +I + D G C + F AV + SDR CIK N LS
Sbjct: 98 DLSLKLPKEFDARTAWSQCTSIPRILDQGHCGSCWAFGAVESLSDRFCIKY--NLNVSLS 155
Query: 139 T-EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
+ VA C +C N G+ W + G VT Y D TGC S
Sbjct: 156 ANDVVACCGLLCGLGCNGGFPMGA----WLYFKYHGVVTEECDPYFDNTGC--------S 203
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H G P P+ KC +C + G + KH Y ++ + I E+
Sbjct: 204 HPGCEPGYPT--------PKCVRKCVSENQLWG--ESKHYGVSAYRINHDPQDIMAEVYK 253
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
+GP F +Y+DF HYKSGVYKH + K+ H+ KLIGWGT ++G YWL+ N W
Sbjct: 254 NGPVEVAFTVYEDFAHYKSGVYKHITGTKIGG--HAVKLIGWGTSDDGEDYWLLANQWNR 311
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
WGD G KI RG EC E+ + AG P +
Sbjct: 312 SWGDDGYFKIRRGTNECGIEHGVVAGLPSD 341
>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 405
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 80/269 (29%), Positives = 129/269 (47%), Gaps = 19/269 (7%)
Query: 57 ADAKYFDQSDRPLPGD----RKTYDPEYSATVPDRFDAREQWPNCGTI-GHVPDTGACAA 111
A A +F + + PL R D + S +P+ FDA E+WP C + ++ D C +
Sbjct: 41 AGAYHFGRINDPLRKSTLKKRTEADYDLSEEIPESFDAAEKWPECAEVFNNIRDQSNCGS 100
Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHK 171
++ G SDR C+ + G+ +S ASC C+ G + +
Sbjct: 101 CWAVSSAGVMSDRICVATNGKVKVSISGIATASCV------GGDGCNGGLEEVAFEKFIE 154
Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK---CHTRCTNPTYGRG 228
G TG + GCQP C+HH ++ P C++ VP+ K C C Y R
Sbjct: 155 NGFPTGSEVDKHQGCQPYPFKHCAHHVNSTEYPPCDS--VPEYKADTCSHECQK-DYDRK 211
Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
+ +D + Y D E I++EI+ +GP +F +Y+ F +Y G+Y+ T +++ Y
Sbjct: 212 YEEDLYYGKEQYGFSD-EAPIQREIMTNGPVAVSFTVYESFLYYSGGIYRSTPGERIKGY 270
Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWG 317
H+ +++GWG ENGT YW + N+W WG
Sbjct: 271 -HAVRVVGWGVENGTKYWKIANSWNEQWG 298
>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
Length = 195
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 107/205 (52%), Gaps = 12/205 (5%)
Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWN 167
C + F AV A SDR CI + + +S E + +CC +C C+ G WN
Sbjct: 1 CGSCWAFGAVEAISDRICIHT--NVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWN 54
Query: 168 FLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR 227
F ++G V+GG Y GC+P +I PC HH + P PK ++ P Y
Sbjct: 55 FWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKC---SKICEPGYSP 111
Query: 228 GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN 287
+ QDKH +Y V ++E I EI +GP F++Y DF YKSGVY+H + +
Sbjct: 112 TYKQDKHYGYDSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG 171
Query: 288 YLHSGKLIGWGTENGTPYWLVINTW 312
H+ +++GWG ENGTPYWLV N+W
Sbjct: 172 --HAIRILGWGVENGTPYWLVANSW 194
>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
Length = 231
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 86/242 (35%), Positives = 121/242 (50%), Gaps = 28/242 (11%)
Query: 87 RFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC 146
FD+R++WPNC + + D G C + + FA+ SDR CI S G N LS + + +C
Sbjct: 5 EFDSRQKWPNC--VHPIRDQGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTCS 62
Query: 147 KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC 206
+ C+ G ++++HK G V+ C P + H P C
Sbjct: 63 WY-----SFGCNGGIPGLVFDYIHKDGLVS-------DACFPYLSYDGNTHVKCPDF--C 108
Query: 207 ENQKVPKLKCHTRCTNPTYGRG-FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFAL 265
N K K + Y G F +DK + L I+KEIL HGP A F +
Sbjct: 109 YNNKTKSFKSDKHFADKVYHVGEFLEDKAKRVL---------EIQKEILTHGPVNADFMV 159
Query: 266 YDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKIL 325
Y DF YKSGVY+H + + E +H+ K+IGWGTENG YWL+ N+WG +G +G KI+
Sbjct: 160 YSDFTVYKSGVYRHQTGS-FEG-IHAVKIIGWGTENGVDYWLIANSWGTTFGLQGFFKIV 217
Query: 326 RG 327
RG
Sbjct: 218 RG 219
>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 332
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 103/344 (29%), Positives = 151/344 (43%), Gaps = 48/344 (13%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAG-RNFPANLSEEYLRQFLIADA 59
M+ ++ +LL V S + +I WTAG + LSE+ LR
Sbjct: 27 MLSVITYLLAGLGVALSKPLLSRRELQEIRALQPPWTAGISDRLVGLSEDDLRAM----- 81
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
F + +P + E S +PD FD RE++P C I V D G C A F+A G
Sbjct: 82 --FPRHGQPTRPSAECPRAEPSGPIPDAFDLREEYPQC--ITPVYDQGYCGACWAFSATG 137
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
AF DRRC++ P S +Y SC D + C+ G+ F W FL + G+ T +
Sbjct: 138 AFGDRRCMQWLDPVGVPYSQQYTVSC-----DDLDLGCAGGTSFNVWTFLTEHGTTTL-E 191
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
T SPC P L C++ +L C +
Sbjct: 192 CVRYTDADKDLSSPC------PAL--CDDGSEIQLVKADGCLD----------------- 226
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
N AI + + GP A ++Y DF +Y+ GVYKH ++ + H+ ++IG+GT
Sbjct: 227 --YSGNVTAIMQTLANDGPVQAVMSVYRDFLYYRGGVYKHVYGIQISS--HAVEIIGYGT 282
Query: 300 ---ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
E PYW+V N+ GP+WG+ G I+RG EC E + +G
Sbjct: 283 TDDEERIPYWIVKNSLGPNWGEEGYFNIVRGSNECDIESAVYSG 326
>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 379
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 134/288 (46%), Gaps = 50/288 (17%)
Query: 79 EYSATVPDRFDAREQWPNCGTI-----GHVPDT---------------GACAAPHIFAAV 118
+ S +P FDAR W +C +I G++ + G C + F AV
Sbjct: 98 DLSLKLPKEFDARTAWSHCTSIRRILVGYILNNVLLWSTITLWFWFLLGHCGSCWAFGAV 157
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
+ SDR CIK N LS V +CC + C + C+ G W + G VT
Sbjct: 158 ESLSDRFCIKY--NLNVSLSANDVIACCGLLCGF----GCNGGFPMGAWLYFKYHGVVTQ 211
Query: 178 --GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
Y D TGC SH G PT P+ + ++ KC +R N +G + KH
Sbjct: 212 ECDPYFDNTGC--------SHPGCEPTYPTPKCER----KCVSR--NQLWG----ESKHY 253
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
Y ++ + I E+ +GP F +Y+DF HYKSGVYK+ + K+ H+ KLI
Sbjct: 254 GVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGG--HAVKLI 311
Query: 296 GWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
GWGT ++G YWL+ N W WGD G KI RG EC E + AG P
Sbjct: 312 GWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLP 359
>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
Length = 182
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 71/177 (40%), Positives = 95/177 (53%), Gaps = 6/177 (3%)
Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
N+ +G V+GG YG GC P I+PC HH + P E K P C +C Y
Sbjct: 10 NYCKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPT--CVKKCEE-GYK 66
Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
+ QD H Y + ++ D I++EI +GP F +Y+DF Y++GVYKH + L
Sbjct: 67 VPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALG 126
Query: 287 NYLHSGKLIGWGTENG-TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
H+ +++GWG +NG PYWLV N+W WG G KILRG EC E I AG P
Sbjct: 127 G--HAIRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 181
>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
Length = 168
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 94/172 (54%), Gaps = 4/172 (2%)
Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ 231
+ S +GG +G GC P I+PC HH + T P+C ++ KC C +Y + Q
Sbjct: 1 KASSSGGPFGSNQGCHPYKIAPCEHHVNG-TRPACNGEEGKTPKCIKHC-QASYTVAYEQ 58
Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
DK +Y V + I+KEI+ +GP F +Y+D YK GVY+H + L H+
Sbjct: 59 DKSYGAKSYSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYKDGVYQHVTGKMLGG--HA 116
Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+++GWG EN PYWL+ N+W WG+ G KILRG C E I+AG PK
Sbjct: 117 IRILGWGVENDVPYWLIANSWNTDWGNNGFFKILRGSDHCGIESQISAGIPK 168
>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
Length = 473
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 92/264 (34%), Positives = 122/264 (46%), Gaps = 32/264 (12%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAR +WP G I D G C A + SDR I SKG LS +++
Sbjct: 190 LPNSFDARNKWP--GWISGPADQGWCGASWAVSTASVASDRYAIMSKGLTKVDLSPQHLL 247
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC K + C G + R W F+ K G V DY C P T +P
Sbjct: 248 SCNK-----GQRGCQGGHLSRAWTFIRKFGLVD--DY-----CYPWTGTP---------- 285
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C+ K P + P+ G + +R Y + D +D I +EI+ GP AT
Sbjct: 286 TKCKIPKRPNFDALSSICPPSLGSNLRSELYRVGPAYKIQDEKD-IMEEIMQSGPVQATM 344
Query: 264 ALYDDFYHYKSGVY-KHTSNAKLENY-LHSGKLIGWGTEN---GTP--YWLVINTWGPHW 316
+Y DF+ YKSGVY K + + N+ HS K++GWG E G P YWL N+WG W
Sbjct: 345 KVYQDFFSYKSGVYTKSNTERESSNFGYHSVKILGWGEETNIYGQPIKYWLAANSWGQQW 404
Query: 317 GDRGTVKILRGKYECAFEYLIAAG 340
G+ G KI RG EC E + A
Sbjct: 405 GENGFFKIRRGTNECEIEEFVLAA 428
>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
Length = 197
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 100/199 (50%), Gaps = 6/199 (3%)
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
F AV A SDR CI SKG+ LS + SCC+ C + C+ G W F K G
Sbjct: 5 FGAVEAISDRICIASKGKTQVTLSAADLLSCCRSCGF----GCNGGDPLSAWKFWVKEGI 60
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTG ++ GC+P C HH + C++ P KC C R + +DK+
Sbjct: 61 VTGSNHSTNAGCKPYPFPACEHHSNKTHYDPCKHDLFPTPKCEKSCQATFGERTYKEDKY 120
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
Y V ++ +AI+KEI+ +GP F +Y+DF +Y G+Y H A H+ K+
Sbjct: 121 FGRSAYGVKNHMEAIQKEIITYGPVEVAFEVYEDFLNYAGGIYVHQGGALGGG--HAVKM 178
Query: 295 IGWGTENGTPYWLVINTWG 313
IGWG +NG PYW + T G
Sbjct: 179 IGWGIDNGVPYWXHLPTHG 197
>gi|328702238|ref|XP_001943280.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 328
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 81/248 (32%), Positives = 115/248 (46%), Gaps = 24/248 (9%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+ FDAR++WP C TIG + G A +AA G +DR CI + G N+ +STE +
Sbjct: 84 IHKEFDARKRWPQCKTIGEFRNEGNFALSWAYAAAGVLADRMCIATNGSYNQLISTEELI 143
Query: 144 SCCKICRYDDNKSCSHGSVF--RTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
SC + HG V W +L G V+GG Y GCQPS I P +
Sbjct: 144 SCSGVS------GGYHGIVSEREVWEYLKSHGLVSGGKYNTSDGCQPSKIPPIEEY---- 193
Query: 202 TLPSCENQKVPKLKCHTRC-TNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
E ++ C+ C N T + D H Y+ ED I++E+ +GP +
Sbjct: 194 ----MEYSEIKNYTCNDHCYGNKTIN---YNDDHVKVSNYYQVQYED-IQEEVQNYGPVS 245
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F + DD + + K + Y+ KLIGWG ENG YWL++++WG G G
Sbjct: 246 VEFYIRDDIFTPFLSINPRFQRRKYKGYV---KLIGWGVENGEDYWLLVDSWGYERGQNG 302
Query: 321 TVKILRGK 328
K+ R K
Sbjct: 303 VFKVERFK 310
>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
Length = 432
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 86/268 (32%), Positives = 120/268 (44%), Gaps = 36/268 (13%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P F+A E+W I VPD G C A + + SDR I+S+G++ LS +
Sbjct: 184 SNDLPRSFNAVEKWST--FISEVPDQGWCGASWVLSTTSVASDRFAIQSQGKEVVQLSAQ 241
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SC + + C G + W ++HK G + Y
Sbjct: 242 NILSCTR-----RQQGCDGGHLDAAWRYMHKNGVLDANCY-------------------- 276
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYG----RGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
P + + K++ H + YG G +D T + E I EI
Sbjct: 277 ---PYIQQRDTCKVQRHRGRSLKAYGCQPAHGVNRDNFYTVGPAYSLSREADIMAEIYHS 333
Query: 257 GPTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGP 314
GP AT +Y DF+ Y SGVY+HT+ N HS KL+GWG E NG YW+ N+WGP
Sbjct: 334 GPVQATMTVYRDFFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKYWIAANSWGP 393
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+RG +ILRG EC E + A P
Sbjct: 394 WWGERGYFRILRGSNECGIEEYVLASWP 421
>gi|161343873|tpg|DAA06117.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 254
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 75/203 (36%), Positives = 107/203 (52%), Gaps = 12/203 (5%)
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
++ +P FD+R++WPNC +IGH+ + G C + + AA A SDR CI S +N +S
Sbjct: 59 FTNGLPTNFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIHSNSTKNPIMSA 118
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SCC +C Y C GS+F +W+F + G V+GG+Y GCQP TI PC
Sbjct: 119 QQIISCCYLCGY----GCDGGSLFESWDFYRRHGFVSGGEYNSNQGCQPYTIPPCKLINE 174
Query: 200 APTLPSC---ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
P SC ++ P C +C NP Y F D +R + + KEI +
Sbjct: 175 KPPGHSCTTFNREETP--TCEKKCNNPNYYTSFRADIYRGK---YYKVSPYMAMKEIFDN 229
Query: 257 GPTTATFALYDDFYHYKSGVYKH 279
GP T F +Y D YKSGVY++
Sbjct: 230 GPITTQFYMYRDLVDYKSGVYQY 252
>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
Length = 194
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 76/202 (37%), Positives = 102/202 (50%), Gaps = 8/202 (3%)
Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
C + F AV A SDR CI + G+ N +S E + +CC I D C+ G WNF
Sbjct: 1 CGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNF 57
Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG 228
K+G V+GG Y GC P TI PC HH + P P +C+ C Y
Sbjct: 58 WTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPMHGEGDTP--RCNKSC-EAGYSPS 114
Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
+ +DKH +Y V ++ I EI +GP F ++ DF YKSGVYKH + +
Sbjct: 115 YKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG- 173
Query: 289 LHSGKLIGWGTENGTPYWLVIN 310
H+ +++GWG ENG PYWL N
Sbjct: 174 -HAIRILGWGVENGVPYWLAAN 194
>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
Length = 431
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 84/264 (31%), Positives = 120/264 (45%), Gaps = 35/264 (13%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P F+A E+WP+ I VPD G C + + + SDR I+SKG++ LS + +
Sbjct: 186 LPSSFNAVERWPS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVRLSAQNIL 243
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + + C G + W FLHK+G V Y
Sbjct: 244 SCTR-----RQQGCDGGHLDAAWRFLHKKGVVDDSCY----------------------- 275
Query: 204 PSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P + + K++ ++R R +D T + + E I EI GP
Sbjct: 276 PYTQQRDTCKIRHNSRSLKANGCRPSPNVDRDSFYTVGPAYTLNREGDIMAEIYHSGPVQ 335
Query: 261 ATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGD 318
AT +Y DF+ Y G+Y+ T+ N HS KL+GWG E NG YW+ N+WGP WG+
Sbjct: 336 ATMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGE 395
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
RG +ILRG EC E + A P
Sbjct: 396 RGYFRILRGSNECGIEEYVLASWP 419
>gi|157058753|gb|ABV03134.1| cathepsin B-84 [Acyrthosiphon pisum]
Length = 230
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 78/245 (31%), Positives = 126/245 (51%), Gaps = 19/245 (7%)
Query: 38 AGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNC 97
A +NFP N +E + + L+ + S P+ + + Y ++ VP+ FD+R +W C
Sbjct: 1 AKQNFPENTPKEQIVR-LLGSKRLLGVSKSPIKENDELYMD--NSEVPEFFDSRLEWDYC 57
Query: 98 GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSC 157
TIGHV + G C + GAF+DR C+ + G+ N +S E + CC C + C
Sbjct: 58 ETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELTFCCHTCGF----GC 113
Query: 158 SHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC----SHHGSAPTLPSCENQKVPK 213
+ G + W + + G VTGGDY GCQP + PC H S P+ N K K
Sbjct: 114 NGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSK 173
Query: 214 LKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYK 273
KC+ T + ++ ++T Y++ + ++K+ + +GP A+F +YDDF +Y+
Sbjct: 174 -KCYGDDT-----IDYKKNHYKTKDAYYLKNT--TMQKDTMVYGPIEASFDVYDDFMNYE 225
Query: 274 SGVYK 278
SGVY+
Sbjct: 226 SGVYQ 230
>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
Length = 350
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 122/261 (46%), Gaps = 41/261 (15%)
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
P +FDAREQWP C I + + C + F+A +DR CIKS G+ N LS +++ S
Sbjct: 126 PTQFDAREQWPQC--IRSIKNQKNCGSCWAFSASSVLADRFCIKSGGKVNVDLSPQFMVS 183
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
C N C+ G TW FL G+V+ C P S G+ P
Sbjct: 184 CS-----GQNNGCNGGFFDATWRFLVSVGTVS-------EACVPYV----SFGGAVPA-- 225
Query: 205 SCENQKVPKLKCHTR-CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C+ + C P F++ L +D I ++ A+GP
Sbjct: 226 -----------CNVKSCGVPGQKSPFYRAGSARKLEGMLD-----IMADLKANGPIQVAM 269
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT--PYWLVINTWGPHWGDRGT 321
+Y DFY YKSGVY H S + H+ K++GWG ++ + PYW+ N+WG WG +G
Sbjct: 270 GVYRDFYSYKSGVYHHVSGRYVGG--HAVKIVGWGYDSASKLPYWICANSWGEDWGIKGY 327
Query: 322 VKILRGKYECAFEYLIAAGKP 342
ILRG+ EC ++ +GKP
Sbjct: 328 FWILRGRGECGIGKMVWSGKP 348
>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Ornithorhynchus anatinus]
Length = 327
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 88/270 (32%), Positives = 127/270 (47%), Gaps = 26/270 (9%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ +P FDA ++WP G I D G CA F+ SDR I SKG LS +
Sbjct: 54 NVVLPRNFDAAQKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSKGHMTPSLSPQ 111
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SC + C+ G + R W+FL +RG V+ Y Q S PC +
Sbjct: 112 NLLSC----NTRHQQGCNGGRLDRAWSFLRRRGLVSDKCYP--LASQNSIAEPCRMY--- 162
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
+ P ++ C N + + D +++T Y + NE I KEI+ +GP
Sbjct: 163 -SRPMGRGKRQATGPC---PNNFHHSNDYSNDIYQSTPPYRLSSNEKDIMKEIMENGPVQ 218
Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE---NG--TPYWLVI 309
A +++DF+ YK G+Y+HT SN K + HS K+ GWG E NG +W
Sbjct: 219 ALMEVHEDFFLYKDGIYRHTPASNGKPPQFRRQGTHSVKITGWGEELQPNGRRVKFWRAA 278
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
N+WGP WG+ G+ +ILRG EC E +
Sbjct: 279 NSWGPTWGEGGSFRILRGCNECDIESFVVG 308
>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
Length = 326
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 87/263 (33%), Positives = 121/263 (46%), Gaps = 46/263 (17%)
Query: 83 TVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
++P+ FDARE+WP C IG + + G C + FA+ +DR CI SKG+ S E
Sbjct: 75 SIPESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTEVMTDRLCISSKGKIKFVFSPEN 134
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ +C D C G + W++ G +GGDY GCQP
Sbjct: 135 LLTC----CKDCGCGCKGGYIKNAWDYYINEGIASGGDYNSSEGCQP------------- 177
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ--DKHRTTLTYWVDDNEDAIKKEILAHGPT 259
Y FQ + Y ++ N I+ EIL +GP
Sbjct: 178 -----------------------YSESSFQYAEASECVKFYTLETNVAQIQMEILTNGPV 214
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
A + +++DF +KSGVY + S + HS K+IGWGTE G PYWL+ N+WG WG+
Sbjct: 215 MAYYNVFEDFACHKSGVYYYKSGKFVGR--HSVKVIGWGTEEGIPYWLIANSWGSEWGEL 272
Query: 320 GT-VKILRGKYECAFEYLIAAGK 341
G K+ RG EC E + AGK
Sbjct: 273 GGFFKMRRGTNECWIEQEMTAGK 295
>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Apis mellifera]
Length = 439
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 95/278 (34%), Positives = 127/278 (45%), Gaps = 45/278 (16%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
R+ YDPE ++P FDAR +W I V D G C A + SDR + SKG
Sbjct: 189 RRVYDPE---SLPREFDARTRWRR--QISGVDDQGWCGASWAISTAQVASDRFAVMSKGT 243
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPS 189
+ LS +++ SC K + C G + R W F+ K G V Y G C+
Sbjct: 244 DSVLLSAQHLLSCNK----KGQRGCDGGYLDRAWLFMRKFGLVDEQCYPWKGVYEQCKLQ 299
Query: 190 TISPCSHHG-SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
+ G AP P L+ P Y G NE
Sbjct: 300 KRTNLEAAGCRAPANP---------LRKELYKVGPAYRLG----------------NETD 334
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWG----TENGT 303
I +EIL GP AT +Y DF+ Y+SG+Y HT A+L E+ HS ++IGWG T++G
Sbjct: 335 IMREILTSGPVQATMKVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGWGEDISTDSGL 394
Query: 304 P--YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
P YWLV+N+WG WG+ G +I RG EC E + A
Sbjct: 395 PIKYWLVVNSWGQEWGENGLFRIRRGINECDIESFVVA 432
>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
Length = 198
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 107/203 (52%), Gaps = 14/203 (6%)
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+ A SDR CI SKG +S + + SCC C C G W + G
Sbjct: 5 VSTAAAMSDRICIASKGATQVLISAQDIVSCCTWC----GAGCEGGWPIEAWKYGVTEGV 60
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDK 233
VTGG++G + C+ I PC +HG+ P C + + P C RC P Y + DK
Sbjct: 61 VTGGNFGRKECCRSYEIHPCGYHGNEPFYGHCHSMARTPP--CKKRC-RPGYKNSYMMDK 117
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
T Y + ++ AI+++I+ +GP A F +Y+DF +YKSG+Y+HT+ H+ K
Sbjct: 118 RYGTSAYELPNSVXAIQRDIMENGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGG--HAVK 175
Query: 294 LIGWG---TENGT-PYWLVINTW 312
+IGWG TENGT PYW++ N+W
Sbjct: 176 VIGWGEEXTENGTIPYWIIANSW 198
>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
Length = 321
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 105/338 (31%), Positives = 153/338 (45%), Gaps = 56/338 (16%)
Query: 26 IDQINRE-ANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQ---SDRPLPGDRKTYDPE 79
I++IN + ++TW AG RN E R L+ AK Q S+ + + + +
Sbjct: 8 INEINSDPSSTWKAGVNRNLAGKTVAEMKR--LLGFAKKEGQVRYSEEQMTTIKHYNEAK 65
Query: 80 YSAT----------------VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
SA +P FD+R+QW C I + + C + F+A + SD
Sbjct: 66 ASAVKSVGVEEASKQFKTLGLPTNFDSRQQWGKC--IHPIRNQEQCGSCWAFSASESLSD 123
Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
R CI S G+ + LS + + SC Y+D C G++ W ++ +G V
Sbjct: 124 RFCIASNGKVDVILSPQDMVSC----DYND-MGCDGGNLDNAWWWMKNKGIVP------- 171
Query: 184 TGCQPSTISPCSHHGSAPTLPS-CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
C P S G+ P PS C +P + Y + F H + +W
Sbjct: 172 DSCMPYV----SGGGNVPACPSNCNGTNIP------ISSQLYYAKSF---SHISPWMFW- 217
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
+ I++EI +GP F++Y DF +YKSGVY H + + L H+ K+IGWG E G
Sbjct: 218 -ERVADIQQEIYTNGPVQGGFSVYQDFMNYKSGVYSHKTGSFLGG--HAIKIIGWGVEGG 274
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
YWLV N+W WG GT KILRG EC E + AG
Sbjct: 275 VDYWLVANSWSTDWGIDGTFKILRGHNECGIEDDVYAG 312
>gi|161343859|tpg|DAA06110.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 260
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 85/267 (31%), Positives = 125/267 (46%), Gaps = 15/267 (5%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA- 59
+ +L +L + Y +YID IN A+TW AG NF N S+E + + L +
Sbjct: 4 FLILLSIVLFSVYQTEQAYFLQKSYIDTINEVASTWKAGVNFDPNTSQEDIVKLLGSTGV 63
Query: 60 -KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
S D Y+ Y T P FDAR++W +C TIG V D G C + F
Sbjct: 64 ESAMKASANEFKMDDVAYNKLYGYT-PRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTS 122
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
AF+DR C+ + G N LS E + CC C + C+ G + W + G VTGG
Sbjct: 123 SAFADRLCVATDGDFNELLSAEEITFCCHTCGF----GCNGGDPIKAWKYFSTHGLVTGG 178
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRT 236
+Y GC+P + PC +C + P+ K H RCT YG +++ HR
Sbjct: 179 NYKSGEGCEPYRVPPCPRDDKGKN--TCAGK--PREKNH-RCTRMCYGNQDLDYREDHRY 233
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATF 263
T ++ +I+K+++ +GP ATF
Sbjct: 234 TRDFYY-LTYGSIQKDVMTYGPIEATF 259
>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
Length = 433
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 86/267 (32%), Positives = 124/267 (46%), Gaps = 35/267 (13%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+A +P F+A E+W + I VPD G C + + + SDR I+SKG++ LS +
Sbjct: 186 TAGLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQ 243
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SC + + C G + W +LHK+G V C P T
Sbjct: 244 NILSCTR-----RQQGCEGGHLDAAWRYLHKKGVVD-------ESCYPYT---------- 281
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+++ K++ ++R R +D T + + E I EI G
Sbjct: 282 ------QHRDTCKIRHNSRSLKANGCRPSANVDRDSFYTVGPAYTLNKESDIMAEIYHSG 335
Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
P AT +Y DF+ Y SGVY+ T+ N HS KL+GWG E NG YW+ N+WGP
Sbjct: 336 PVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPW 395
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+RG +ILRG EC E + A P
Sbjct: 396 WGERGYFRILRGSNECGIEDYVLASWP 422
>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
Length = 433
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 86/267 (32%), Positives = 124/267 (46%), Gaps = 35/267 (13%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+A +P F+A E+W + I VPD G C + + + SDR I+SKG++ LS +
Sbjct: 186 TAGLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQ 243
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SC + + C G + W +LHK+G V C P T
Sbjct: 244 NILSCTR-----RQQGCEGGHLDAAWRYLHKKGVVD-------ESCYPYT---------- 281
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+++ K++ ++R R +D T + + E I EI G
Sbjct: 282 ------QHRDTCKIRHNSRSLKANGCRPSANVDRDSFYTVGPAYTLNKESDIMAEIYHSG 335
Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
P AT +Y DF+ Y SGVY+ T+ N HS KL+GWG E NG YW+ N+WGP
Sbjct: 336 PVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPW 395
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+RG +ILRG EC E + A P
Sbjct: 396 WGERGYFRILRGSNECGIEDYVLASWP 422
>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 150/326 (46%), Gaps = 58/326 (17%)
Query: 21 FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
+++ ++ +N + ++TW A EY R+ + + LP + D E
Sbjct: 10 IAESIVETVNNDPSSTWVA---------IEYPREVITLAKMRAMLGEEVLPLE----DVE 56
Query: 80 YSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
Y VP+ FDAREQWP G I V D +C + AA A +R IK G+ L
Sbjct: 57 YVEPNNVPENFDAREQWP--GKIYPVRDQASCGSCWAHAASEAIGNRFSIKGCGKGM--L 112
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S + + SC K + C+ GS + +L G T C P S +
Sbjct: 113 SVQDLVSCDK-----GDSGCNGGSGPLSSKWLVSNGVTT-------EECLPYV----SGN 156
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
G P C +C+N G + K+ TY V + I++E++ +G
Sbjct: 157 GRVPA-------------CAAKCSN---GSQIIRYKYEKAETYTVQN----IQEELMKNG 196
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P F +Y DF +YKSGVY+H S + H+ LIGWG E+G PYWL+ N+WGP WG
Sbjct: 197 PVYFRFTVYSDFMNYKSGVYQHKSGYQEGG--HAVLLIGWGVEDGVPYWLLQNSWGPAWG 254
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
++G KI+RGK EC E AG K
Sbjct: 255 EKGHFKIIRGKNECGCEQGFYAGPVK 280
>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
Length = 495
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 105/341 (30%), Positives = 147/341 (43%), Gaps = 49/341 (14%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA-KYFDQSDRP--L 69
L+R E+ ID +N W A RN+ +L + D KY + +P +
Sbjct: 153 LIRKEV-------IDHVNSHNPGWQA-RNY------TFLWGMTLKDGIKYRLGTFKPQGM 198
Query: 70 PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
+ + + +PD FDARE+WP+ I V D G C A + F+ +DR I S
Sbjct: 199 IEEMSSLKVDADEVMPDEFDAREEWPS--FIHPVQDQGNCGASYAFSTSTVAADRLSIHS 256
Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
G+ LS +Y+ SC K C G V R W L + G+V+ C P
Sbjct: 257 GGELKDMLSAQYLISCTTD---HHQKGCEGGHVDRAWWQLRRVGTVS-------KDCYPY 306
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
T + G C K K + C G+G ++ + Y + E I
Sbjct: 307 TSGDTNDPGK------CLMSKYKLPKKNIECP---VGQGITSKLYQASPPYRIAAKEREI 357
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK-------LIGWGT--- 299
EI+ +GP A + DDFY Y+ GVYKH+ K NY H GK +IGWGT
Sbjct: 358 MNEIILNGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHLGKEAYHSVRIIGWGTDYT 417
Query: 300 -ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
++ YWL NTWG HWG+ G +I RG E E +
Sbjct: 418 GDDPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVVG 458
>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
Length = 196
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/198 (36%), Positives = 102/198 (51%), Gaps = 8/198 (4%)
Query: 116 AAVGAFSDRRCIKSKGQQNRPLS-TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+A SDR C+++ G++ LS T+ +A C C Y C+ G R W + G
Sbjct: 6 SAAETMSDRLCVQTNGRKKTLLSDTDILACCGDFCGY----GCNGGYSARAWLYARNSGV 61
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
+GG Y ++ C+P T PC +H + C C C YG+ + +DK
Sbjct: 62 CSGGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYC-QYGYGKRYEKDKI 120
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
Y V +E AI+ EI A GP A+FA Y+DF HYKSG+Y HT+ + H+ K+
Sbjct: 121 YAXDAYRVSSDEAAIRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGG--HAVKI 178
Query: 295 IGWGTENGTPYWLVINTW 312
IGWG ENGT W+V N+W
Sbjct: 179 IGWGVENGTKXWIVANSW 196
>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
terrestris]
Length = 445
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 124/281 (44%), Gaps = 49/281 (17%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
R+ YDPE ++P FDAR +WP I + D G C A +A SDR + SKG
Sbjct: 194 RRIYDPE---SLPREFDARIRWPR--EISDIDDQGWCGASWAISATRVASDRFALMSKGA 248
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPS 189
+ LS +++ SC ++CS G + R W ++ K G V Y G C+
Sbjct: 249 DSVLLSAQHLLSC----NNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGTNAQCKLR 304
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
+ G P V L+ P Y G NE I
Sbjct: 305 KRTDLKTAGCRP--------PVNPLRTELYKVGPAYRLG----------------NETDI 340
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY---LHSGKLIGWGTENGT--- 303
EIL GP AT +Y DF+ Y+SG+YKHT A E+Y HS ++IGWG +
Sbjct: 341 MYEILTSGPVQATMKVYQDFFSYESGIYKHT--ATTEHYAFGYHSVRIIGWGEDTSAHRH 398
Query: 304 -----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
YWLV+N+WG WG+ G +I RG EC E + A
Sbjct: 399 HNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439
>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
rotundata]
Length = 442
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 89/273 (32%), Positives = 123/273 (45%), Gaps = 38/273 (13%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
R+ YDPE ++P FD+R +WP I + D G C A ++ SDR I SKG
Sbjct: 192 RRIYDPE---SLPREFDSRTRWPR--DISKITDQGWCGASWAISSAQVASDRFAIMSKGT 246
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
LS +++ SC + CS G + R W F+ + G V Y + + +
Sbjct: 247 DAVELSAQHLLSC----NNRGQQGCSGGHLDRAWMFMRRFGLVDENCYPWKASTETCRLR 302
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
+ SA P + K P Y NE I +E
Sbjct: 303 KRTDLRSAGCAPPPNPLRTELYK-----VGPAYRLA----------------NETDIMQE 341
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTE-----NGTP-- 304
IL GP AT +Y DF+ Y+SGVYKH+ A+L E+ HS ++IGWG E TP
Sbjct: 342 ILTSGPVQATMRVYQDFFSYESGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSRNTPLK 401
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YWLV N+WG WG+ G +I +G EC E +
Sbjct: 402 YWLVANSWGQQWGENGLFRIQKGTNECEIESFV 434
>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
saltator]
Length = 443
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 94/276 (34%), Positives = 130/276 (47%), Gaps = 42/276 (15%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
R+ YDP+ +P FDAR +WP I + D G C A + SDR I SKG
Sbjct: 195 RRIYDPD---ALPREFDARTRWPR--DISGIHDQGWCGASWAVSTADVASDRFAIMSKGA 249
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPS 189
++ LS +++ SC + C G + R W F+ K G V Y G C+
Sbjct: 250 EDVELSAQHLLSC----NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWTGRNDQCRLR 305
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
S + G P+ Q++ K+ P Y G NE I
Sbjct: 306 KRSNLNVAGCRKP-PNPLRQELYKV-------GPAYRLG----------------NETDI 341
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTE---NGTP- 304
+EIL GP AT +Y DF+ YK+GVY+H+ +A+L ++ HS ++IGWG E G P
Sbjct: 342 MQEILTSGPVQATMRVYQDFFVYKNGVYRHSRSAELHDSGYHSMRIIGWGEEPSYRGPPL 401
Query: 305 -YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
YWLV N+WG HWG+ G +I RG EC E + A
Sbjct: 402 KYWLVANSWGRHWGENGLFRIQRGTNECEIESYVLA 437
>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 144/315 (45%), Gaps = 54/315 (17%)
Query: 21 FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
F+++ ++ +N TW A P ++ LR L A D ++ P Y P+
Sbjct: 10 FAESIVETVNNHPGATWVAVEYPPEVITTAKLRARLGA----IDLNEGP-----SNYVPD 60
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
S +PD FDAREQWP G I V + C + FA +R I G+ + +S
Sbjct: 61 TS--LPDNFDAREQWP--GKILPVRNQEQCGSCWAFAVAETTGNRLNILGCGRGD--MSP 114
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SC K+ + C+ GS +W ++ G T I S G
Sbjct: 115 QDLVSCDKV-----DHGCNGGSPLFSWEWVKHSGITT-----------EECIPYVSGGGR 158
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
P+ P +CTN G + K ++ D ++ E+ + GP
Sbjct: 159 VPSCPK-------------KCTN---GSAIVRTKAKSVGLV----KGDKMQNELYSRGPF 198
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
A F++Y+DF YKSGVY H + L H+ ++GWG E+GTPYWL+ N+WG WG++
Sbjct: 199 EAAFSVYEDFKSYKSGVYHHITGKMLGG--HAVMVVGWGVEDGTPYWLIQNSWGTTWGEQ 256
Query: 320 GTVKILRGKYECAFE 334
G KILRGK EC E
Sbjct: 257 GFFKILRGKNECGIE 271
>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
Length = 310
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 103/317 (32%), Positives = 134/317 (42%), Gaps = 38/317 (11%)
Query: 8 LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRN-FPANLSEEYLRQFLIADAKYFDQS 65
L G L I +N+ N WTAG N + AN + E + L
Sbjct: 25 LAGTAKAEHSLGIIQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHILGVKPT----P 80
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
L G PE +P FDAR QW +C TIG++ D G C A FAAV A DR
Sbjct: 81 PGLLAGVPIKIHPEMD--LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRF 138
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDR 183
CI + LS + +CC + C+ G W + + G VT Y D+
Sbjct: 139 CIHLN--MSVSLSVNDLLACCG---FLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQ 193
Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
TGCQ H G P P+ KC +C + + ++KH + Y V
Sbjct: 194 TGCQ--------HPGCEPAYPT--------PKCQRKCK--VENQAWKENKHFSVNAYRVH 235
Query: 244 DNEDAIKKEILAHGPTTATFALYD--DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
N I E+ +GP F DF HYKSGVYKH + + H+ KLIGWGT +
Sbjct: 236 SNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGG--HAVKLIGWGTSD 293
Query: 302 -GTPYWLVINTWGPHWG 317
G YWL+ N W WG
Sbjct: 294 AGEDYWLLANQWNRGWG 310
>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
Length = 432
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 120/261 (45%), Gaps = 30/261 (11%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P F+A ++W I VPD G C + + + SDR I+S+G++ LS + +
Sbjct: 189 LPASFNAVDKWSR--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQNIL 246
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + + C G + W +LHK+G + C P T S
Sbjct: 247 SCTR-----RQQGCEGGHLDAAWRYLHKKGVLD-------ESCYPYTQSR---------- 284
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
+C+ + LK H P G +D T + E IK EI GP AT
Sbjct: 285 GTCKVRHSGSLKAHGCRPAP----GVDRDSLYTVGPAYSLSREADIKAEIFHSGPVQATM 340
Query: 264 ALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGDRGT 321
+Y DF+ Y G+Y+ T+ N HS KL+GWG E NG YW+ N+WGP WG+RG
Sbjct: 341 RVYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGY 400
Query: 322 VKILRGKYECAFEYLIAAGKP 342
+ILRG EC E + A P
Sbjct: 401 FRILRGSNECGIEDYVLASWP 421
>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 196
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 77/202 (38%), Positives = 103/202 (50%), Gaps = 17/202 (8%)
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
CC C + C G R W G VTGGDY GC+P + PC +
Sbjct: 5 CCHTCGF----GCHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNN-- 58
Query: 205 SCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
+C + + K + RCT YG F + HR T Y+ +I+K+++ +GP A+
Sbjct: 59 TCAGKPMEK---NHRCTRICYGDQELDFDEDHRYTRDYYYL-TYGSIQKDVMTYGPIEAS 114
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y DF YKSG+Y+ T NA YL H+ KLIGWG + G PYWL++N+W WGD G
Sbjct: 115 FDVYSDFPSYKSGIYERTENA---TYLGGHAVKLIGWGEQYGIPYWLMVNSWNEDWGDNG 171
Query: 321 TVKILRGKYECAFEYLIAAGKP 342
KI RG EC + AG P
Sbjct: 172 LFKIRRGTNECGVDNSTTAGVP 193
>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 303
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 100/325 (30%), Positives = 150/325 (46%), Gaps = 49/325 (15%)
Query: 22 SDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S A + +I W AG + F N++E+ R LI LP T E
Sbjct: 17 SRAELRRIQALNPPWKAGMPKRF-ENITEDEFRGMLIR-PDILGAGSGSLPPSSVTEIQE 74
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +P +FD R+++P C T V D G+C F+A+G F DRRC+ ++ P S
Sbjct: 75 PADPIPSQFDFRDEYPQCVT--PVMDQGSCGGCWAFSAIGVFGDRRCVAGIDKEGVPYSQ 132
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG--DYGDRTGCQPSTI-SPCSH 196
+Y+ SC +N C G + TW+FL G+ T Y D P+ + SPC
Sbjct: 133 QYLISCST-----ENHGCDGGDFWPTWSFLTLTGATTAECVKYIDY----PNIVASPC-- 181
Query: 197 HGSAPTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
P + C++ ++ K H YG+ V N AI +
Sbjct: 182 ----PAV--CDDGSQIQLYKAHG------YGQ--------------VSKNVQAIMHMLAT 215
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
GP +Y D +Y+SGVYKHT + LH+ +++G+GT ++GT YW++ N+WG
Sbjct: 216 GGPVQTMIVVYSDLSYYESGVYKHT-YGTISLGLHALEMVGYGTTDDGTDYWIIRNSWGA 274
Query: 315 HWGDRGTVKILRGKYECAFEYLIAA 339
WG+ G +I+RG EC E I A
Sbjct: 275 DWGENGYFRIVRGVNECRIEDEIYA 299
>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
Length = 297
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 99/342 (28%), Positives = 144/342 (42%), Gaps = 52/342 (15%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
LV + + + ++ + I + + W S+ Q L Y
Sbjct: 4 LVIVGTIAAMVAATHPVNEEMVAHIKAKTSLWQPHETTTNPFSDLTKEQLLAKCGTYIVP 63
Query: 65 SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
S++ PG + PD FDAR+QW + I + D C A F A A SDR
Sbjct: 64 SNKQYPGSPLI-------STPDNFDARQQWGS--KIHAIRDQQQCGACWAFGATEALSDR 114
Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNK-SCSHGSVFRTWNFLHKRGSVTGG--DYG 181
I S G + S E + SC D N C+ G + W FL + G V Y
Sbjct: 115 FTIASNGSVDVVFSPEDLVSC------DTNDYGCNGGYMDMAWEFLDQHGVVADSCFPYS 168
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
+G P+ S C+ GSA SC + + + +
Sbjct: 169 AGSGFAPACASKCAD-GSAEKKYSCVHGSIRQSQ-------------------------- 201
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
+ IK EI+AHGP F +Y DF++Y+SGVY T++ H+ K++G+G EN
Sbjct: 202 ---GVEQIKSEIVAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGG--HAIKILGFGVEN 256
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
GTPYWL N+WGP WG +G KI +G EC E + + P+
Sbjct: 257 GTPYWLCANSWGPSWGMQGFFKIKQG--ECGIEDQVFSCDPQ 296
>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 131/308 (42%), Gaps = 47/308 (15%)
Query: 22 SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYS 81
S ++ I WT P +SE + A K + D + + +
Sbjct: 22 SQEMVNAIRSSNALWT-----PTEVSENKFANYTEAQIKGLLGTVLSHSSDIPAF-TQIN 75
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
A VPD FD+R QW C + + D C + FAA + SDR CI S+G+ N LS +
Sbjct: 76 AAVPDSFDSRTQWQGC--VHPIRDQAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQD 133
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC +N C G + W +L K+G + C+P S G+AP
Sbjct: 134 MVSC-----DTNNYGCDGGYLNLAWQYLEKKGVAS-------DSCEPYK----SASGTAP 177
Query: 202 TLPS-CEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
+ PS C N Q + K KC T G A K I GP
Sbjct: 178 SCPSKCANGQAIKKYKCQAGSTKQANGAA-------------------ATKSLIQQSGPV 218
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
F +Y DF++YKSG+Y H S H+ K++GWG + YW+V N+WG WG++
Sbjct: 219 ETGFTVYADFFNYKSGIYHHVSGGAEGG--HAVKILGWGKQGSENYWIVANSWGESWGEK 276
Query: 320 GTVKILRG 327
G I +G
Sbjct: 277 GFFNIRQG 284
>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 131/308 (42%), Gaps = 47/308 (15%)
Query: 22 SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYS 81
S ++ I WT P +SE + A K + D + + +
Sbjct: 22 SQEMVNAIRSSNALWT-----PTEVSENKFANYTEAQIKGLLGTVLSHSSDIPAF-TQIN 75
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
A VPD FD+R QW C + + D C + FAA + SDR CI S+G+ N LS +
Sbjct: 76 AAVPDSFDSRTQWQGC--VHPIRDQAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQD 133
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC +N C G + W +L K+G + C+P S G+AP
Sbjct: 134 MVSC-----DTNNYGCDGGYLNLAWQYLEKKGVAS-------DSCEPYK----SASGTAP 177
Query: 202 TLPS-CEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
+ PS C N Q + K KC T G A K I GP
Sbjct: 178 SCPSKCSNGQAIKKYKCKAGSTKQANGAA-------------------ATKSLIQQSGPV 218
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
F +Y DF++YKSG+Y H S H+ K++GWG + YW+V N+WG WG++
Sbjct: 219 ETGFTVYADFFNYKSGIYHHVSGGAEGG--HAVKILGWGKQGSENYWIVANSWGESWGEK 276
Query: 320 GTVKILRG 327
G I +G
Sbjct: 277 GFFNIRQG 284
>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
Length = 253
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 85/255 (33%), Positives = 127/255 (49%), Gaps = 19/255 (7%)
Query: 97 CGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKS 156
C ++ + D C + F + A +DR CI S G LS + V SC K+ +
Sbjct: 1 CPSLKEIRDQANCGSCWAFGSTEAMTDRMCIASNGTVTTHLSAQDVTSCDKL----GDMG 56
Query: 157 CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKC 216
C+ G +++ G V GG+YGD++GC + PC+HH ++ P+C ++ V KC
Sbjct: 57 CNGGIPSSVYSYWALSGIVDGGNYGDKSGCWSYQLEPCAHHVNSSKYPACPDE-VRAPKC 115
Query: 217 HTRCTNPTYGRGFFQDKHRTTLTYWVDDNED-----AIK--KEILAHGPTTATFALYDDF 269
+C + + + + K + Y V + AIK +I +GP T F + DF
Sbjct: 116 ARKCESED--KDWTKAKVKGEKGYSVCQQGELEGTCAIKMAADIYQNGPITGMFFVKQDF 173
Query: 270 YHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
YKSGVY+ L L H+ K++G+GTE+G YWLV N+W WGD G KI+RG
Sbjct: 174 LAYKSGVYEPK---LLSPPLGGHAIKIMGFGTEDGKDYWLVANSWNEDWGDDGYFKIIRG 230
Query: 328 KYECAFEYLIAAGKP 342
K C E + G P
Sbjct: 231 KNACQIEDPVINGGP 245
>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
harrisii]
Length = 467
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/333 (30%), Positives = 139/333 (41%), Gaps = 44/333 (13%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY----S 81
ID INR WTAG + + L +Y + RP + +
Sbjct: 146 IDAINRGNYGWTAGNH------SVFWGMTLDEGIRYRLGTVRPTSSVMNMNEIQMVMSPD 199
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
T+P F A +WP G I D G CA F+ SDR I S G + LS +
Sbjct: 200 ETLPSAFSASNKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMSPALSPQN 257
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY----GDRTGCQPSTISPCSHH 197
+ SC + C G + W FL +RG V+ Y GD G P+ +PC H
Sbjct: 258 LLSC----NTHNQHGCRGGRLDGAWWFLRRRGLVSNNCYPFSEGDHNGAAPA--APCMMH 311
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+ K + C N R ++ T Y + +E I KE++ +G
Sbjct: 312 S--------RHMGRGKRQATAHCPN---SRTHANHIYQATPPYRLSSHEKDIMKELMENG 360
Query: 258 PTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE-----NGTPYW 306
P A +++DF+ YKSG+YKHT S K E Y HS K+ GWG E YW
Sbjct: 361 PVQALLEVHEDFFLYKSGIYKHTPASLGKPERYRQHGTHSVKITGWGEEIQPDGQKVKYW 420
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
N+WGP WG+ G +I+RG EC E +
Sbjct: 421 TAANSWGPTWGENGYFRIVRGANECDIESFVVG 453
>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
impatiens]
Length = 445
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 143/334 (42%), Gaps = 61/334 (18%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP------LPGDRKTYDPE 79
ID+IN + +W A RN+ E+ + L K + P + ++ YDPE
Sbjct: 147 IDEINSQDLSWRA-RNY-----SEFWGRTLDEGVKLRLGTLNPSRSVYRMNSVQRIYDPE 200
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
++P FDAR +WP I + D G C A + SDR + SKG + LS
Sbjct: 201 ---SLPREFDARIRWPR--EISDIDDQGWCGASWAISTTRVASDRFALMSKGADSVLLSA 255
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSH 196
+++ SC ++CS G + R W ++ K G V Y G C+ +
Sbjct: 256 QHLLSC----NNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGTNVQCKLRKRTDLKT 311
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
G P V L+ P Y G NE I EIL
Sbjct: 312 AGCRP--------PVNPLRTELYKVGPAYRLG----------------NETDIMYEILTS 347
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENY---LHSGKLIGWGTENGT--------PY 305
GP AT +Y DF+ Y+SG+YKHT A E+Y HS ++IGWG + Y
Sbjct: 348 GPVQATMKVYQDFFSYESGIYKHT--ATTEHYAFGYHSVRIIGWGEDTSAHRYRNLPIKY 405
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
WLV+N+WG WG+ G +I RG EC E + A
Sbjct: 406 WLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439
>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
Length = 432
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 83/264 (31%), Positives = 120/264 (45%), Gaps = 28/264 (10%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S +P +F+A E+W + I VPD G C + + + SDR I+S+G++ LS +
Sbjct: 184 SDDLPRKFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSAQ 241
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SC + + C G + W +LHK+G + Y P + H
Sbjct: 242 NILSCTR-----RQQGCEGGHLDAAWRYLHKKGVLDEKCY------------PYTQHRD- 283
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
SC+ Q+ P YG +D T + E I EI GP
Sbjct: 284 ----SCKIQRHNSRSLKANGCQPAYGVN--RDSLYTVGPAYSLSREADIMAEIYHSGPVQ 337
Query: 261 ATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGD 318
AT +Y DF+ Y G+Y+ T+ N HS KL+GWG E +G YW+ N+WGP WG+
Sbjct: 338 ATMRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKYWIAANSWGPWWGE 397
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
G +ILRG EC E + A P
Sbjct: 398 HGYFRILRGSNECGIEEYVLASWP 421
>gi|157058737|gb|ABV03126.1| cathepsin B-16 [Myzus persicae]
Length = 238
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 82/244 (33%), Positives = 118/244 (48%), Gaps = 15/244 (6%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA--KYFDQSDRPLPGDRKTYDPEYS 81
+YID IN A+TW AG NF N S+E + + L + S D Y+ Y
Sbjct: 5 SYIDTINEVASTWKAGVNFDPNTSQEDIVKLLGSTGVESAMKASANEFKMDDVAYNKLYG 64
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
T P FDAR++W +C TIG V D G C + F AF+DR C+ + G N LS E
Sbjct: 65 YT-PRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEE 123
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ CC C + C+ G + W + G VTGG+Y GC+P + PC
Sbjct: 124 ITFCCHTCGF----GCNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGK 179
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
+C + P+ K H RCT YG +++ HR T ++ +I+K+++ +GP
Sbjct: 180 N--TCAGK--PREKNH-RCTRMCYGNQDLDYREDHRYTRDFYY-LTYGSIQKDVMTYGPI 233
Query: 260 TATF 263
ATF
Sbjct: 234 EATF 237
>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
Length = 476
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 153/352 (43%), Gaps = 64/352 (18%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRSEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSLM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
GC ++ S G C N V K +C+ P
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
Y V NE I KEI+ +GP A +++DF+HYK+G+Y+H ++N + E Y H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 520
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 125/269 (46%), Gaps = 28/269 (10%)
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P F+A ++W G I D G CA F+ SDR I S G LS + +
Sbjct: 253 VLPSYFNAADKWS--GMIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNL 310
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SC + C+ G + W FL +RG VT C P + +H +AP
Sbjct: 311 LSC----NTRHQQGCNGGRIDGAWWFLRRRGVVT-------DECYPFSNQETNHSPNAPA 359
Query: 203 -LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
+ + K + RC NP R + +++T Y + NE I KE++ +GP A
Sbjct: 360 CMMHSRSTGRGKRQAIARCPNP---RSHANEIYQSTPAYRLSSNEKEIMKELMENGPVQA 416
Query: 262 TFALYDDFYHYKSGVYKHTSNA--KLENY----LHSGKLIGWGTE-----NGTPYWLVIN 310
+++DF+ Y++G+Y+HT+ A K E Y HS K+ GWG E + YW+ N
Sbjct: 417 ILEVHEDFFMYRTGIYRHTAVAAGKPEQYRRHGTHSVKITGWGEEQMPDGSNQKYWIAAN 476
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAA 339
+WG WG+ G +I RG+ EC E +
Sbjct: 477 SWGKDWGEHGYFRITRGENECEIETFVVG 505
>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
Length = 430
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 121/261 (46%), Gaps = 30/261 (11%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ F+A ++W + I VPD G C A + + SDR I+SKG++N LS + +
Sbjct: 187 LPNSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + + C G + W +LHK+G V Y P + H
Sbjct: 245 SCTR-----RQQGCEGGHLDAAWRYLHKKGVVDENCY------------PYTQH-----R 282
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
+C+ + LK + C P +D T + + E I EI GP AT
Sbjct: 283 DTCKIRHSRSLKANG-CQKPV---NVDRDSLYTVGPAYSLNREADIMAEIFHSGPVQATM 338
Query: 264 ALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGDRGT 321
+ DF+ Y GVY+ T+ N K HS KL+GWG E NG YW+ N+WG WG+ G
Sbjct: 339 RVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGY 398
Query: 322 VKILRGKYECAFEYLIAAGKP 342
+ILRG EC E + A P
Sbjct: 399 FRILRGSNECGIEEYVLASWP 419
>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
Length = 356
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 142/321 (44%), Gaps = 26/321 (8%)
Query: 22 SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD---QSDRPLPGDRKTYDP 78
++ + ++N TWTA + ++ L+ + D Q + L G R
Sbjct: 42 AEDMVKKVNEAKTTWTAEELPRISSMSLNAKKGLMGLKAFHDGGFQKHKQLLGARPKSAS 101
Query: 79 EYSAT-VPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
+ AT +P FD+R+Q+ C IG + D C + ++ DR CI S G+Q
Sbjct: 102 KLDATKLPQHFDSRKQFTKCAKVIGTIQDQSNCGSCWAVSSASVIQDRICIASNGEQKVH 161
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
+S + + SC D ++ C+ G + + G VTG GC+P P H
Sbjct: 162 ISAQDILSCAT----DRSQGCNGGYPDEAFEHYAQSGVVTGSGNSANQGCKPYPFLP--H 215
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA-IKKEILA 255
+ P +C +C N Y + + QDKH Y V ++ I+ EI+
Sbjct: 216 TTVEYSTP----------ECSKKCENYQYKKAYKQDKHFGMSVYNVQFSDPVDIQYEIMN 265
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT--PYWLVINTWG 313
+GP A +Y DF YKSGVY+ L H+ +++GWG + T PYWLV N+W
Sbjct: 266 NGPVEANMIVYYDFMFYKSGVYQTVFPWPLGG--HAVRIVGWGVDGPTKVPYWLVANSWN 323
Query: 314 PHWGDRGTVKILRGKYECAFE 334
WG+ G +I RG E E
Sbjct: 324 TDWGEDGYFRIRRGTDESYIE 344
>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
Length = 425
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 93/274 (33%), Positives = 127/274 (46%), Gaps = 41/274 (14%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR I+SKG+ LS + +
Sbjct: 166 LPEFFVAYYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 222
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-----GDRTGCQPSTISPCSHH 197
SCC R+ CS GS+ R W +L KRG V+ Y + T + S
Sbjct: 223 ISCCAKNRH----GCSSGSIDRAWWYLRKRGLVSHACYPFLKDQNTTNNACAMASRSDGR 278
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
G C N + K +C+ P Y V NE I KEI+ +G
Sbjct: 279 GKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSNETEIMKEIIHNG 321
Query: 258 PTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----PYW 306
P A +++DF+HYKSG+Y+H ++N K E Y H+ KL GWGT G +W
Sbjct: 322 PVQAIMQVHEDFFHYKSGIYRHVTSTNEKSEKYQKLQTHAVKLTGWGTLRGAQGRKEKFW 381
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+V N+WG WG+ G +ILRG E E LI A
Sbjct: 382 IVANSWGNSWGENGYFRILRGVNESDIEKLIIAA 415
>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
Length = 576
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 123/272 (45%), Gaps = 39/272 (14%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
E S +P+ FDARE+WP+ I V D G CA+ F+ +DR I+S G+ PLS
Sbjct: 306 EMSNFLPESFDARERWPS--FIHPVRDQGDCASSWAFSTTAVSADRLAIQSGGKFYNPLS 363
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ + SC N +RG G Y DR C S G
Sbjct: 364 VQQLLSC---------------------NQARQRG--CNGGYLDRAWCVVSDECYTYTSG 400
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
C + L RC + + + + T Y + NE I EI+A+GP
Sbjct: 401 QTNQPGECHIPRTAYLDGEIRCPSGSADNRVY----KMTPPYRISTNEREIMTEIMANGP 456
Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSG----KLIGWGTENGT----PYWLV 308
ATF +++DF+ YKSGVY+H +N K Y SG +++GWG ++ T YWL
Sbjct: 457 VQATFLVHEDFFMYKSGVYQHLPYANDKGPAYARSGYHSVRILGWGVDHSTGVPIKYWLC 516
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
N+WG WG+ G +ILRG+ C E I
Sbjct: 517 ANSWGEEWGENGLFRILRGENHCDIESFIIGA 548
>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
latipes]
Length = 474
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 102/330 (30%), Positives = 152/330 (46%), Gaps = 42/330 (12%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP------LPGDRKTYDPE 79
I +NR W A AN S+ + L +Y + RP + + DP+
Sbjct: 145 IHAVNRGNYGWKA-----ANYSQ-FFGMSLDEGIRYRLGTQRPSRTVMNMNEIQMKMDPQ 198
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +P F++ E+WPN I D G CAA F+ SDR I+S G LS
Sbjct: 199 -NDHLPRYFNSSEKWPN--KIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSP 255
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ-PSTISPCSHHG 198
+ + SC + C+ G + W +L +RG VT Y + Q P+ + C
Sbjct: 256 QNLISC----DTRNQGGCAGGRIDGAWWYLRRRGVVTENCYPYQPPQQAPAEVGRCMMQS 311
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
A K + RC N TY + D +++T Y + NE I KEI+ +GP
Sbjct: 312 RAVGRG--------KRQATQRCPN-TYN--YHNDIYQSTPPYKLSSNEKEIMKEIMENGP 360
Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE---NGTP--YWL 307
A +++DF+ YK+G+YKHT S+ K Y HS ++ GWG + +GTP YW+
Sbjct: 361 VQAIMEVHEDFFVYKNGIYKHTDVSSTKPPQYRKHGTHSVRITGWGEDKDYDGTPRKYWI 420
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WG +WG+ G +I RG EC E +
Sbjct: 421 AANSWGKNWGENGFFRIARGANECEIEAFV 450
>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 200
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 80/225 (35%), Positives = 112/225 (49%), Gaps = 31/225 (13%)
Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
D AC + F AF+DR CIKS G LS + +C C G +
Sbjct: 1 DQSACGSCWAFGVTEAFNDRLCIKSDGAFTELLSAGEMNACTLF------FGCGGGDPYS 54
Query: 165 TWNFLHKRGSVTGGDY---GDRT---GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHT 218
W+++H +G TGGDY D T GC P PC+HH + P C PK+ C
Sbjct: 55 AWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPKC-----PKVSCSG 109
Query: 219 RCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYK 278
R F + + Y V+D ++AI+ + GP +A+F +Y+DF Y+SGVYK
Sbjct: 110 D------DRHFMLES--SPYHYSVNDAKNAIRTD----GPVSASFTVYEDFLAYRSGVYK 157
Query: 279 HTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
HTS + L H+ K+IGWG ++G YWL +N+W WGD G +
Sbjct: 158 HTSGSYLGG--HAVKIIGWGEKSGQAYWLAVNSWNEDWGDHGLFR 200
>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
Length = 476
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 153/352 (43%), Gaps = 64/352 (18%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSLM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
GC ++ S G C N V K +C+ P
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
Y V NE I KEI+ +GP A +++DF+HYK+G+Y+H ++N + E Y H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
Length = 199
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 69/196 (35%), Positives = 103/196 (52%), Gaps = 12/196 (6%)
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
++ A SDR CI ++G + +S + + SCC C Y C G R W + ++G
Sbjct: 5 VSSASAMSDRVCIATQGAKQVLISDQDIVSCCTWCGY----GCQGGWSIRAWYYFAEQGV 60
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
VTGG+Y + C+P I PC +H P C++ +C RC Y + + DKH
Sbjct: 61 VTGGNYNTKGSCRPYEIHPCGYHKDEPYYGECDDL-ADTPRCKRRC-QLGYPKSYPSDKH 118
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
Y + + ++I++EI+ +GP A F +Y+DF HYK G+YKHTS K H+ K+
Sbjct: 119 YGRTAYQLPMSVESIQREIMRNGPVVAGFTVYEDFAHYKGGIYKHTSGKKTGG--HAVKV 176
Query: 295 IGWGTEN----GTPYW 306
IGWG+E PYW
Sbjct: 177 IGWGSEQKGSEKIPYW 192
>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
garnettii]
Length = 464
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 130/276 (47%), Gaps = 45/276 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR I+SKG+ LS + +
Sbjct: 205 LPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 261
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
SCC R+ C+ GS+ R W +L KRG V+ Y +GC ++ S
Sbjct: 262 ISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQHATNSGCAMASRS--D 315
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + K +C+ P Y + NE I KEI+
Sbjct: 316 GRGKRHATKPCPN-NIEKSNRIYQCSPP----------------YRISSNETEIMKEIMQ 358
Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
+GP A +++DF+HYKSG+Y+H +++ + ENY H+ KL+GWGT G
Sbjct: 359 NGPVQAIMQVHEDFFHYKSGIYRHVASTHGESENYRKLRTHAVKLLGWGTLRGAQGRKEK 418
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 419 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 454
>gi|347546077|gb|AEP03186.1| cathepsin B [Diuraphis noxia]
Length = 239
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 83/248 (33%), Positives = 120/248 (48%), Gaps = 20/248 (8%)
Query: 38 AGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY--SATVPDRFDAREQWP 95
AG NF + +EE +++ L +K ++ KT D Y S +P FDAR++W
Sbjct: 1 AGVNFDPDTTEEVIKRLL--GSKGVQIPNKNNMHMYKTNDVAYISSGKIPKTFDARKKWV 58
Query: 96 NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
C TIG V D G C + + AF+DR CI + G N LS + + CC C +
Sbjct: 59 QCDTIGRVRDQGQCGSCWAVSTSSAFADRLCIATDGDFNELLSADEITFCCYTCGF---- 114
Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
C G + W + G VTGGD+ GC+P + P + S C K
Sbjct: 115 GCDGGYPIKAWKQFSRHGLVTGGDFDSGEGCEPYRVPPSGSNSSNSYNHFCRG------K 168
Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
C+ N +Y + HR T Y+ + +AI+K++L +GP A+F +YDDF YKSG
Sbjct: 169 CYGDNQNISY-----SEDHRYTRDYYY-LSYNAIQKDVLLYGPIEASFEVYDDFMIYKSG 222
Query: 276 VYKHTSNA 283
VY + NA
Sbjct: 223 VYVKSENA 230
>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
Length = 476
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 152/352 (43%), Gaps = 64/352 (18%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRSEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
GC ++ S G C N V K +C+ P
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
Y V NE I KEI+ +GP A + +DF+HYK+G+Y+H ++N + E Y H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 87/256 (33%), Positives = 119/256 (46%), Gaps = 38/256 (14%)
Query: 83 TVPDRFDAREQWPNC-GTIGHVPDTGAC-AAPHIFA-AVGAFSDRRCIKSKGQQNRPLST 139
++P+ FD+RE+WP C I + G+C A ++F + SDR CI S G+ N LS
Sbjct: 1 SLPESFDSREKWPTCIHPIRNQEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLSP 60
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SC N C G ++ W +L G VT C P + S +G
Sbjct: 61 QDLVSCNWY-----NAGCDGGILWAAWIYLKHTGIVT-------DQCLPYS----SGNGV 104
Query: 200 APTLPS-CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
AP+ P C P K++ Y V + I EI +GP
Sbjct: 105 APSCPKYCNGTSTP----------------IDSVKYKAKDWYEVGSIAEKIMNEIATNGP 148
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
+ F++Y DF YKSGVY H + + L H+ K++GWG EN YWLV N+WGP WG
Sbjct: 149 VQSGFSVYQDFMSYKSGVYTHQTGSFLGG--HAIKIVGWGVENNVKYWLVANSWGPDWGL 206
Query: 319 RGTVKILRGKYECAFE 334
G KI RG EC E
Sbjct: 207 NGLFKIKRGDNECGIE 222
>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 450
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 85/274 (31%), Positives = 122/274 (44%), Gaps = 48/274 (17%)
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
A +P+ FDARE WP G I V D G C + + SDR I+S G+ N LS ++
Sbjct: 195 ARLPETFDARENWP--GLIDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQH 252
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + CS G + R W L + G+V+ Y +G TI
Sbjct: 253 LLSC----NIRGQRGCSGGYLDRAWYHLRRAGAVSRACYPYHSGLDEDTI---------- 298
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYG------RGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
+ KL+C YG RG D + +T Y + E I EI
Sbjct: 299 ---------MQKLRCRV-----AYGSSQCPERGVTSDLYLSTPPYRIAAREVDIMTEIYQ 344
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENY-------LHSGKLIGWGTE-----NGT 303
+GP ATF + +DF+ Y GVY++ + HS K++GWG + N
Sbjct: 345 NGPVQATFNVKNDFFVYNRGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGIDRSDWYNPI 404
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YWL N+WG +WG++G +I+RG EC E +
Sbjct: 405 KYWLCTNSWGRNWGEQGMFRIVRGVNECEIESFV 438
>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
echinatior]
Length = 501
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 127/276 (46%), Gaps = 42/276 (15%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
R+ YDP+ +P FD+R +W I +V D G C A + +DR I SKG
Sbjct: 253 RRIYDPD---ALPREFDSRTRWSR--DISNVHDQGWCGASWAISTADVATDRFSIMSKGA 307
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPS 189
++ LS +++ S C + C G + R W F+ K G V Y G C+
Sbjct: 308 EDAELSAQHLLS----CNNRGQQGCRGGYLDRAWLFMRKFGLVDKDCYPWTGKNGQCKLR 363
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
+ G C P L+ P Y G NE I
Sbjct: 364 KRNNLQAAG-------CRKPPNP-LRTELYKVGPAYRLG----------------NETDI 399
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTE---NGTP- 304
+EIL GP AT +Y DF+ YK+G+Y+H+ +A+L ++ HS ++IGWG E G P
Sbjct: 400 MQEILTSGPVQATMRVYQDFFVYKNGIYRHSQSAELHDSGYHSVRIIGWGEERSYRGPPL 459
Query: 305 -YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
YWLV+N+WG +WG+ G KI RG EC E + A
Sbjct: 460 KYWLVVNSWGYNWGENGLFKIQRGTNECEIESYVLA 495
>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
Length = 442
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 84/266 (31%), Positives = 130/266 (48%), Gaps = 23/266 (8%)
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
T+P FD R +W + T+ V D G C A F+ +DR I+S+G + PLS + +
Sbjct: 184 TLPMSFDGRIEWRD--TLQDVRDQGWCGASWAFSTAAVAADRLAIQSRGHEVYPLSMQNL 241
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHGS 199
+C + C+ G + R WN++ + G V Y RTG P G+
Sbjct: 242 LAC----NNRGQQGCNGGHLDRAWNYMRRFGVVNEECYPYISGRTGQVEKCKVP--RRGN 295
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
T+ C+ + K P +G F R+ Y + ED I EIL HGP
Sbjct: 296 LATM-KCQLVNAAERKSDRSDKPPR--KGLF----RSPPAYRIAPFEDDIMNEILQHGPV 348
Query: 260 TATFALYDDFYHYKSGVYKHT-SNAKLENYLHSGKLIGWGTE----NGTPYWLVINTWGP 314
AT ++ DF+ Y+ GVY+++ +N++ + HS +++GWG + N T YWLV N+WG
Sbjct: 349 QATMRVHPDFFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVDSSKRNPTKYWLVANSWGR 408
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAG 340
WG+ G +I+RG+ E E + A
Sbjct: 409 LWGEDGYFRIVRGENESDIEKFVLAA 434
>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
boliviensis boliviensis]
Length = 476
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 107/352 (30%), Positives = 155/352 (44%), Gaps = 64/352 (18%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+GC ++ S G C N + K +C+ P
Sbjct: 314 NATNSGCAMASRS--DGRGKRHATKPCPNN-IEKSNRIYQCSPP---------------- 354
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENYL----HSGK 293
Y V +E I KEI+ +GP A +++DF+HYK+G+Y+H ++N + E +L H+ K
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFLKLQTHAVK 414
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 LTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
Length = 476
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 152/352 (43%), Gaps = 64/352 (18%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTASLPATTDLPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDH 313
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
GC ++ S G C N V K +C+ P
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
Y V NE I KEI+ +GP A + +DF+HYK+G+Y+H ++N + E Y H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|281204808|gb|EFA79003.1| hypothetical protein PPL_08471 [Polysphondylium pallidum PN500]
Length = 322
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 84/303 (27%), Positives = 132/303 (43%), Gaps = 48/303 (15%)
Query: 46 LSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPD 105
LS L FL+ Y S + + +Y + A +P FDAR QWPNC I V D
Sbjct: 4 LSIYILLAFLLVGTVY---SQQQCLDNVVSYTDQDRANIPASFDARTQWPNC--ISPVRD 58
Query: 106 TGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT 165
G+C++ + +DR CI S G + LS +Y+ C K C+ + C+ G F
Sbjct: 59 QGSCSSCWAMTSSSILADRLCIASGGAIKKLLSPQYMVDCAKNCKTNSQSDCNSGCKFGF 118
Query: 166 WNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT- 224
+ + + + + SC K C ++C + +
Sbjct: 119 LDISME------------------------YLSNGISAESCLPYKESDATCPSQCKDGSP 154
Query: 225 ----YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT 280
YG G + + +DA + EI+ +GP A F ++ Y+ SG+Y+ T
Sbjct: 155 IQLYYGSGCIS----------IGNLKDA-QLEIMKNGPILAVFQIFTSLYNIGSGLYRGT 203
Query: 281 SNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+ H+ ++IGWG ENGTPYWL +N+WG +G G K+ G+ FE + +
Sbjct: 204 GDPAEG---HAARVIGWGEENGTPYWLALNSWGTEFGMDGAFKVPMGENIAGFESQLLSV 260
Query: 341 KPK 343
KP
Sbjct: 261 KPN 263
>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 468
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 88/269 (32%), Positives = 120/269 (44%), Gaps = 32/269 (11%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 203 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPLLSPQN 259
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--GDRTGCQPSTISPCSHHGS 199
+ SC + + C G + W FL +RG V+ Y R + PC H
Sbjct: 260 LLSCDTL----HQQGCRGGHLDGAWWFLRRRGVVSDHCYPFSGREQAEAGPAPPCMMHSR 315
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
A K + RC N D ++ T Y + +E I KE++ +GP
Sbjct: 316 A--------MGRGKRQATRRCPNSHTDA---NDIYQVTPAYRLGSDEKEIMKELMENGPV 364
Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLV 308
A +++DF+ YK G+Y HT S A+ E Y HS K+ GWG E YW
Sbjct: 365 QALMEVHEDFFLYKGGIYSHTPLSMARPEQYRRHGTHSVKITGWGEETLPDGRTLKYWTA 424
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +ILRG EC E +
Sbjct: 425 ANSWGPSWGERGHFRILRGSNECDIESFV 453
>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
Length = 431
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 82/267 (30%), Positives = 120/267 (44%), Gaps = 41/267 (15%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P F+A ++W + I VPD G C A + + SDR I+SKG++ LS + +
Sbjct: 187 LPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNIL 244
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + + C G + W +LHK+G V Y
Sbjct: 245 SCTR-----RQQGCEGGHLDAAWRYLHKKGVVDESCY----------------------- 276
Query: 204 PSCENQKVPKLKCHTR------CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
P + + K++ ++R C P +D T + + E I EI G
Sbjct: 277 PYTQQRDTCKIRHNSRSLRANGCQTPY---NVDRDTFYTVGPAYSLNREADIMAEIFHSG 333
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLE-NYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
P AT + DF+ Y GVY+ T+ ++ HS KL+GWG E NG YW+ N+WGP
Sbjct: 334 PVQATMRVNRDFFAYAGGVYRQTAANRMAPTGFHSVKLVGWGEEHNGEKYWIAANSWGPW 393
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+RG +ILRG EC E + A P
Sbjct: 394 WGERGYFRILRGSNECGIEEYVLASWP 420
>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 300
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 83/260 (31%), Positives = 124/260 (47%), Gaps = 41/260 (15%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
VP+ FD RE++P+C I V D G C + F++V F DRRCI ++ S +YV
Sbjct: 75 VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCIAGLDKKPVKYSPQYVV 132
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC N +C+ G + W FL K G+ T C P + G+ PT
Sbjct: 133 SCDH-----GNMACNGGWLPNAWKFLTKTGTTT-------DECVPYQSGSTTLRGTCPTK 180
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED--AIKKEILAHGPTTA 261
+ + KV H TT T + D D A+ K + GP
Sbjct: 181 CADGSSKV----------------------HLTTATSYKDYGLDIPAMMKALSTTGPLQV 218
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRG 320
F +Y DF +Y+SGVY+HT H+ +++G+GT++ G YW++ N+WGP WG+ G
Sbjct: 219 AFLVYSDFMYYESGVYQHTYGYMEGG--HAVEMVGYGTDDDGVDYWIIRNSWGPDWGEDG 276
Query: 321 TVKILRGKYECAFEYLIAAG 340
+++RG +C+ E AG
Sbjct: 277 YFRMIRGINDCSIEEQAYAG 296
>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
rubripes]
Length = 477
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 90/273 (32%), Positives = 125/273 (45%), Gaps = 30/273 (10%)
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
DPE +P F++ E+WP G I D G CAA F+ SDR I+S G
Sbjct: 199 DPERDQ-LPLYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQ 255
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ-PSTISPCS 195
LS + + SC + C+ G + W FL +RG VT Y R Q P+ + C
Sbjct: 256 LSPQNLISC----DTRNQGGCTGGRIDGAWWFLRRRGVVTEDCYPYRPPQQTPAELGRCM 311
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
+ K + RC N + D +++T Y + NE I KEI
Sbjct: 312 MQSRSVGRG--------KRQATQRCPNTN---NYQNDIYQSTPPYRLSTNEKEIMKEIQD 360
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTENGT-----P 304
+GP A +++DF+ YKSG+YKHT S K Y HS K+ GWG E
Sbjct: 361 NGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVKITGWGEERNVDGAKRK 420
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW+ N+WG +WG+ G +I RG+ EC E +
Sbjct: 421 YWIAANSWGKNWGEEGYFRIARGENECEIEAFV 453
>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
Length = 476
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 152/352 (43%), Gaps = 64/352 (18%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFHLGTLPPSPM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLSKDQ 313
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
GC ++ S G C N V K +C+ P
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
Y V NE I KEI+ +GP A + +DF+HYK+G+Y+H ++N + E Y H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 LTGWGTLRGAQGQKEKFWVAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
Length = 431
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 121/267 (45%), Gaps = 41/267 (15%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P F+A ++W + I VPD G C A + + SDR I+SKG++N LS + +
Sbjct: 187 LPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + + C G + W +LHK+G V C P T
Sbjct: 245 SCTR-----RQQGCEGGHLDAAWRYLHKKGVVD-------ENCYPYT------------- 279
Query: 204 PSCENQKVPKLKCHTR------CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+++ K++ ++R C P +D T + + E I EI G
Sbjct: 280 ---QHRDTCKIRHNSRSLRANGCQKPV---NVDRDSLYTVGPAYSLNREADIMAEIFHSG 333
Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
P AT + DF+ Y GVY+ T+ N K HS KL+GWG E NG YW+ N+WG
Sbjct: 334 PVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSW 393
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G +ILRG EC E + A P
Sbjct: 394 WGEHGYFRILRGSNECGIEEYVLASWP 420
>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
Length = 226
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 72/190 (37%), Positives = 105/190 (55%), Gaps = 10/190 (5%)
Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
H +AVGA SDR CI+S G+Q+ LS + SCC+ C C G W++
Sbjct: 41 HAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCENC----GSGCDGGFPGPAWDYWVSH 96
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
G VTGG + TGCQP C HH S PSC ++ +C +C Y + D
Sbjct: 97 GIVTGGSKENHTGCQPYPFPKCEHH-SIGKYPSCGDKIYKTPQCKRKCQK-GYTTPYEHD 154
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHS 291
KH ++ V NE AI+KEI+ +GP A +++DF +YKSG+Y++T+ + + E+Y+
Sbjct: 155 KHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYV-- 212
Query: 292 GKLIGWGTEN 301
++IGWG EN
Sbjct: 213 -RIIGWGIEN 221
>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/302 (31%), Positives = 132/302 (43%), Gaps = 31/302 (10%)
Query: 46 LSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFD-AREQWPNCGTIGHVP 104
+S+ L ++ D+ ++ P G T +P++S F P G
Sbjct: 30 VSKLKLNSRILQDSIVQKVNENPNAGWEATMNPQFSNYSVGEFKYLLGVKPTPGKELRGV 89
Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVF 163
G C + F AV + SDR CI N LS + +CC +C C G
Sbjct: 90 PLGHCGSCWAFGAVESLSDRFCIHYG--MNLSLSVNDLLACCGWMC----GDGCDGGYPI 143
Query: 164 RTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT 221
W + + G VT Y D GC SH G P P+ KC +C
Sbjct: 144 DAWRYFVQSGVVTEECDPYFDDIGC--------SHPGCEPGFPT--------PKCERKCA 187
Query: 222 NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTS 281
+ + + + KH + Y +D + +I E+ +GP F +Y+DF HYKSGVYKH +
Sbjct: 188 DKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSMNGPVEVAFTVYEDFAHYKSGVYKHIT 245
Query: 282 NAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+ H+ KLIGWGT ++G YWL+ N W WGD G KI RG EC E + AG
Sbjct: 246 GDVMGG--HAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAG 303
Query: 341 KP 342
P
Sbjct: 304 LP 305
>gi|354472325|ref|XP_003498390.1| PREDICTED: tubulointerstitial nephritis antigen [Cricetulus
griseus]
gi|344245030|gb|EGW01134.1| Tubulointerstitial nephritis antigen-like [Cricetulus griseus]
Length = 465
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 87/268 (32%), Positives = 121/268 (45%), Gaps = 30/268 (11%)
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P F+A E+WPN I D G CA F+ SDR I S G LS + +
Sbjct: 201 VLPRAFEASEKWPN--LIQEPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPILSPQNL 258
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG--DRTGCQPSTISPCSHHGSA 200
SC + C G + W FL +RG V+ Y R + T S C H A
Sbjct: 259 LSC----DTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFVGREQNEAGTSSRCMMHSRA 314
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
K + +RC N G+ D ++ T Y + +E I KE++ +GP
Sbjct: 315 --------MGRGKRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQ 363
Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLVI 309
A +++DF+ Y+SG+Y HT S + E Y HS K+ GWG E YW
Sbjct: 364 ALMEVHEDFFLYQSGIYSHTPISQGRPEQYRRHGTHSVKITGWGEEKLPDGRTIKYWTAA 423
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 424 NSWGPWWGERGHFRIVRGTNECDIESFV 451
>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
pisum]
Length = 169
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/166 (42%), Positives = 93/166 (56%), Gaps = 11/166 (6%)
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTT 237
+G GC+P + PC + + SC Q + K + RCT YG + D HR T
Sbjct: 9 FGFAVGCEPYRVPPCPRNEDGTS--SCAGQPIEK---NHRCTRMCYGNQDLDYNDDHRFT 63
Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA-KLENYLHSGKLIG 296
Y+ +I+K+++ +GP A+F +YDDFY YKSGVY+ T NA KL H+ KLIG
Sbjct: 64 RDYYYL-TYGSIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGG--HAVKLIG 120
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG E G PYWL++N+W WGD G KI RG EC + AG P
Sbjct: 121 WGVEEGIPYWLMVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVP 166
>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 273
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 148/326 (45%), Gaps = 57/326 (17%)
Query: 21 FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
+++ +D +N + ++TW A EY R+ L AK + G +
Sbjct: 1 LAESVVDIVNNDPSSTWVA---------TEYPREILTP-AKMRAMISQIGNGFEGEWTFA 50
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK--SKGQQNRPL 137
+ P FD R++WP G V + G+C + AA R I+ SKG +
Sbjct: 51 ENENAPASFDCRQKWP--GKAEPVRNQGSCGSCWAHAASETMGFRMGIRRCSKGV----M 104
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S + + SC +N C+ G R WN++ K+G T I S
Sbjct: 105 SPQDLVSC-----ESNNMGCNGGYADRVWNWIQKKGITT-----------EQCIPYVSGS 148
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
G PT PS +C N + + R+ ++ W N + E+ +G
Sbjct: 149 GRVPTCPS-------------KCKNGS-------NIVRSFVSSWGSFNSKTVMDEVANNG 188
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
P A F +++DFY+Y+SGVY+H + + + + H L+GWGTENG PYWL+ N+WG WG
Sbjct: 189 PVYACFEVFEDFYNYRSGVYQHKT-GRSQGWHHV-MLMGWGTENGVPYWLLQNSWGSGWG 246
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
++G +I RG +C + + +G PK
Sbjct: 247 EKGFFRIRRGTNDCHIDEIFYSGLPK 272
>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
gallus]
Length = 464
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 98/329 (29%), Positives = 144/329 (43%), Gaps = 38/329 (11%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG----DRKTYDPEYS 81
ID +NR W A AN S+ + L +Y + RP P + + +
Sbjct: 146 IDAVNRGNYGWRA-----ANYSQ-FWGMTLEDGMRYRLGTFRPPPTVMNMNEMHMAMDSN 199
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P FDA +WP G I D G CA F+ SDR I S G LS +
Sbjct: 200 EVLPRHFDAATKWP--GMIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQN 257
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + + CS G + W +L +RG VT C P T S S + P
Sbjct: 258 LLSC----DTRNQRGCSGGRLDGAWWYLRRRGVVT-------DECYPFT-SQDSQPAAQP 305
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
+ + K + RC NP + D +++T Y + +E I KE++ +GP A
Sbjct: 306 CMMHSRSTGRGKRQATARCPNP---QTHANDIYQSTPAYRLAPSEKEIMKELMENGPVQA 362
Query: 262 TFALYDDFYHYKSGVYKHTSNAK------LENYLHSGKLIGWGTE-----NGTPYWLVIN 310
+++DF+ YKSG+Y+HT+ A+ ++ HS K+ GWG E YW N
Sbjct: 363 ILEVHEDFFLYKSGIYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQLPDGQVQKYWTAAN 422
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAA 339
+WG WG+ G +I RG EC E +
Sbjct: 423 SWGRAWGEDGHFRIARGVNECEVESFVVG 451
>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
Length = 432
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 86/264 (32%), Positives = 123/264 (46%), Gaps = 29/264 (10%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S+ +P +F+A E+W + I VPD G C + + + SDR I+S+G++ LS +
Sbjct: 184 SSGLPRKFNAVERWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQ 241
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SC + + C G + W +LHK+G V D T C P T
Sbjct: 242 NILSCTR-----RQQGCEGGHLDAAWRYLHKKGVV------DET-CYPYT---------- 279
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
SC+ + + C P YG +D T + E I EI GP
Sbjct: 280 QRRDSCKIRHNSRSLKANGC-RPAYG--VNRDSLYTVGPAYSLKGETDIMAEIYHSGPVQ 336
Query: 261 ATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGD 318
AT +Y DF+ Y GVY+ T+ N HS K++GWG E +G YW+ N+WGP WG+
Sbjct: 337 ATMRVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKYWIAANSWGPWWGE 396
Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
G +ILRG EC E + A P
Sbjct: 397 HGYFRILRGSNECGIEEYVLASWP 420
>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
domestica]
Length = 466
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 86/268 (32%), Positives = 121/268 (45%), Gaps = 26/268 (9%)
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
T+P F+A ++WP G I D G CA F+ SDR I S G LS + +
Sbjct: 200 TLPLAFNASDKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNL 257
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SC + K C G + W FL +RG V+ Y G + +T +AP
Sbjct: 258 LSC----DTHNQKGCRGGRLDGAWWFLRRRGLVSNHCYPFSAGNRDATAP------AAPC 307
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
+ + K + C N R ++ T Y + +E I KE++ +GP A
Sbjct: 308 MMHSRSMGRGKRQATAHCPN---SRAHANHIYQATPPYRLSSDEKDIMKELMENGPVQAL 364
Query: 263 FALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTENG-----TPYWLVINT 311
+++DF+ YKSG+YKHT S K Y HS K+ GWG E YW N+
Sbjct: 365 MEVHEDFFLYKSGIYKHTPASLGKPARYRQHGTHSVKITGWGEERQPDGQRLKYWTAANS 424
Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAA 339
WGP WG++G +ILRG EC E +
Sbjct: 425 WGPTWGEKGHFRILRGANECDIESFVVG 452
>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
cuniculus]
Length = 467
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 140/335 (41%), Gaps = 52/335 (15%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA--- 82
I+ IN+ W AG + + L +Y ++RP P + Y+
Sbjct: 147 INAINQGNYGWQAGNH------SAFWGMTLEEGIRYRLGTNRP-PSSVMNMNEIYTGLGS 199
Query: 83 --TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P F+A E+WPN + H P D G CA F+ SDR I S G LS
Sbjct: 200 GEVLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSP 256
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISP 193
+ + SC + C G + W FL +RG V+ G D G P P
Sbjct: 257 QNLLSC----DTHHQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHEQDEAGPAP----P 308
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C H A K + RC N D ++ T Y + NE I KE+
Sbjct: 309 CMMHSRA--------MGRGKRQATARCPNSHV---HANDIYQVTPAYRLGSNEKEIMKEL 357
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----G 302
L +GP A +++DF+ Y+ G+Y HT S + E Y HS K+ GWG E
Sbjct: 358 LENGPVQALMEVHEDFFLYQGGIYSHTPVSLERPERYRRHGTHSVKITGWGEETLPDGRT 417
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +ILRG EC E +
Sbjct: 418 LKYWTAANSWGPAWGERGHFRILRGTNECDIESFV 452
>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
niloticus]
Length = 499
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 100/330 (30%), Positives = 152/330 (46%), Gaps = 42/330 (12%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP------LPGDRKTYDPE 79
I +NR W A AN SE Y L +Y + RP + + DP+
Sbjct: 170 IQAVNRGNYGWKA-----ANYSELY-GMTLNEGIRYRLGTQRPSRTVMNMNEIQMNMDPQ 223
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +P F++ E+WP G I D G CAA F+ SDR I+S G LS
Sbjct: 224 -TDNLPPYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPRLSP 280
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ-PSTISPCSHHG 198
+ + SC + C+ G + W +L +RG VT Y + Q P+ + C
Sbjct: 281 QNLISC----DTRNQGGCAGGRIDGAWWYLRRRGVVTEDCYPYQPPHQTPAEVGRC---- 332
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ + K + RC N + + D +++T Y + NE I KEI+ +GP
Sbjct: 333 ----MMQSRSVGRGKRQATQRCPNT---QNYHNDIYQSTPPYRLSSNEKEIMKEIMDNGP 385
Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE---NGTP--YWL 307
A +++DF+ YK+G+YKHT S K Y HS ++ GWG + +GT YW+
Sbjct: 386 VQAIMEVHEDFFVYKTGIYKHTDVSFTKPPQYRKHGTHSVRITGWGEDRNVDGTSRKYWI 445
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WG +WG+ G +I+RG+ EC E +
Sbjct: 446 AANSWGKNWGENGYFRIVRGENECEIETFV 475
>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
gorilla]
Length = 476
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/352 (30%), Positives = 152/352 (43%), Gaps = 64/352 (18%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR +L I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRPQL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
GC ++ S G C N V K +C+ P
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
Y V NE I KEI+ +GP A + +DF+HYK+G+Y+H ++N + E Y H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
Length = 428
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 90/280 (32%), Positives = 125/280 (44%), Gaps = 54/280 (19%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G + LS +
Sbjct: 163 VLPRTFEASEKWPN---LIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQN 219
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + + C G + W FL +RG V+ Y P S HG
Sbjct: 220 LLSC----DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCY------------PFSGHGRDE 263
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-------------DKHRTTLTYWVDDNEDA 248
+P+ P H+R GRG Q D ++ T Y + NE
Sbjct: 264 AVPA------PPCMMHSR----AMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKE 313
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
I KE++ +GP A +++DF+ Y+SG+Y HT S + E Y HS K+ GWG E
Sbjct: 314 IMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETL 373
Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 374 PDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 413
>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
Length = 476
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 108/352 (30%), Positives = 151/352 (42%), Gaps = 64/352 (18%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
GC ++ S G C N V K +C+ P
Sbjct: 314 NATNNGCAMASRS--DGRGKRDATKPCPN-NVEKSNRIYQCSPP---------------- 354
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
Y V NE I KEI+ +GP A + +DF+HYK+G+Y+H ++N + E Y H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N WG WG+ G +ILRG E E L+ A
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANFWGKSWGENGYFRILRGVNESDIEKLVIAA 466
>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
Length = 476
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 131/295 (44%), Gaps = 45/295 (15%)
Query: 65 SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSD 123
S R L + T + +P+ F A +WP H P D CAA F+ +D
Sbjct: 198 SPRLLSMNEMTASLPATTDLPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAAD 254
Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--- 180
R I+SKG+ LS + + SCC R+ C+ GS+ R W FL KRG V+ Y
Sbjct: 255 RIAIQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLF 310
Query: 181 ----GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
GC ++ S G C N + K +C+ P
Sbjct: 311 KDQNATNDGCAMASRS--DGRGKRHATKPCPNN-IEKSNRIYQCSPP------------- 354
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LH 290
Y V NE I KEI+ +GP A +++DF+HYK+G+Y+H +N + Y H
Sbjct: 355 ---YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEEASKYRKFQTH 411
Query: 291 SGKLIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+ KL GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 412 AVKLTGWGTLKGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
Length = 311
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 103/351 (29%), Positives = 148/351 (42%), Gaps = 63/351 (17%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
+I + VF + C + D +I N +W AGRN +
Sbjct: 8 LIALTVFAV-CNALDLNKPVLDDKFIHNHNANGASWVAGRN-----------------PR 49
Query: 61 YFDQSDRPLPGDRKTYDP-----EYSAT---VPDRFDAREQWPNCGTIGHVPDTGACAAP 112
+ QS + G T P E S + VP+ FD+R WP C + V + G C +
Sbjct: 50 FEGQSIGDILGLLGTKKPRNTPEEVSVSKVAVPNSFDSRTNWPGC--VHAVLNQGQCGSC 107
Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
FAA + SDR CI S+G N LS + + SC + N+ C+ G W +L
Sbjct: 108 WAFAASESLSDRLCIASQGAINVTLSPQALVSC----DIEFNQGCNGGIPQMAWEYLELH 163
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
G T C P T S +G+AP C C++ + +Q
Sbjct: 164 GIPT-------DSCFPYT----SGNGTAP-------------DCQKECSDGSK----YQL 195
Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
T T + AI+ + A+GP T +Y DF Y SGVY T +KL H+
Sbjct: 196 YKGKTFTLKTCSSVAAIQANVFAYGPIEGTMDVYQDFMSYTSGVYVMTPGSKLLGG-HAI 254
Query: 293 KLIGWGTEN--GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
K++GWGT++ G YW+V N+WG WG G I RG C + +AG+
Sbjct: 255 KIVGWGTDSTSGLDYWIVQNSWGSDWGMNGFFWIQRGTNMCGIDRDASAGQ 305
>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 306
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 81/265 (30%), Positives = 124/265 (46%), Gaps = 40/265 (15%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
E S ++P FD RE++P C I V D G C + F+A AF DRRC++ P S
Sbjct: 73 EPSGSIPASFDFREEYPQC--ITPVYDQGHCGSCWAFSATSAFGDRRCMQGLDSAGVPYS 130
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+Y SC + + C+ G F W FL + G+ T C P T + +
Sbjct: 131 QQYTISCDYL-----DLGCAGGLSFSVWTFLTEHGTTT-------LECVPYTDA--NKDI 176
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
S+P +C + +L C + N AI + + GP
Sbjct: 177 SSPCPDACADGSEIRLVKADGCLD-------------------YSGNVTAIMQALANDGP 217
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT---ENGTPYWLVINTWGPH 315
A+ A+Y DF +Y+SGVY+H +++ + H+ ++IG+G E+ TPYW+V N+ G
Sbjct: 218 VQASMAVYRDFLYYRSGVYRHVYGSQISS--HAVEIIGYGAADDEDSTPYWIVKNSLGSG 275
Query: 316 WGDRGTVKILRGKYECAFEYLIAAG 340
WG+ G I+RG EC E + +G
Sbjct: 276 WGEEGYFNIVRGSNECDIESAVYSG 300
>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/326 (30%), Positives = 138/326 (42%), Gaps = 60/326 (18%)
Query: 22 SDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
+++ ++ IN + +TW A EY R +I AK+ L G Y
Sbjct: 11 AESIVETINNDPTSTWVAA---------EYPRS-VINVAKFRAMLGAEL-GPHMPYVQPL 59
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
S + P FDAREQWP G I V D +C + + A D + I G +S +
Sbjct: 60 SLSEPTEFDAREQWP--GKILPVRDQASCGSCWAHSVAEAMGDAQNIA--GCPRGAMSVQ 115
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SC K + +C+ G + + +L K G T + + S G
Sbjct: 116 DLVSCDK-----TDSACNGGDMKKAQEYLVKTGITT-----------EACVKYVSGSGRV 159
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P PS +C N + R L W I + ++ +GP +
Sbjct: 160 PACPS-------------KCDNGS-------QIIRYKLQSWKSVEPSEIMQALMEYGPLS 199
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK---LIGWGTENGTPYWLVINTWGPHWG 317
F +Y DF +Y+SGVY+H S Y G L GWG ENG PYWLV N+WGP WG
Sbjct: 200 CGFMVYSDFMNYRSGVYQHKSG-----YFEGGHAVLLCGWGVENGLPYWLVQNSWGPAWG 254
Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
++G KILRG C E + G PK
Sbjct: 255 EKGFFKILRGSNHCEIESYVTLGVPK 280
>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
jacchus]
Length = 476
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/352 (29%), Positives = 152/352 (43%), Gaps = 64/352 (18%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+GC ++ S G C N + K +C+ P
Sbjct: 314 NATNSGCAMASRS--DGRGKRHATKPCPNN-IEKSNRIYQCSPP---------------- 354
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGK 293
Y V +E I KEI+ +GP A +++DF+HYK+G+Y+H ++ E+ H+ K
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFQKLQTHAVK 414
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 LTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|161343845|tpg|DAA06103.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 261
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 125/269 (46%), Gaps = 19/269 (7%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ +L + + + Y +ID IN A TW AG NF + +E+ + L +K
Sbjct: 4 VLMLLSVIFVSFYLTEQAYFLQKDFIDNINERATTWKAGVNFDPDTPKEHFLKML--GSK 61
Query: 61 YFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
++ KT+D Y +P FDAR +W C TIG V D G C + A
Sbjct: 62 GVQIPNKHNIHMYKTHDAAYDNLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMAT 121
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
AF+DR C+ + N LS E + CC C + C+ G + W KRG VTG
Sbjct: 122 SSAFADRLCVATNADFNELLSAEEITFCCHSCGF----GCNGGYPIKAWERFKKRGLVTG 177
Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKH 234
GDY GC+P + PC + A +C + P+ H RCT YG F +D
Sbjct: 178 GDYQSGEGCEPYRVPPCPY--DAEGHNTCAGK--PRESNH-RCTRMCYGNQDLDFDEDHR 232
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATF 263
T +Y++ +I+K+++ +GP A+F
Sbjct: 233 YTRDSYYL--TYGSIQKDVMTYGPIEASF 259
>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
leucogenys]
Length = 476
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/352 (30%), Positives = 150/352 (42%), Gaps = 64/352 (18%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
I+SKG+ LS + + SCC + C+ GS+ R W +L KRG V+ Y
Sbjct: 258 IQSKGRYTANLSPQNLISCCS----KNRPGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
GC ++ S G C N V K +C+ P
Sbjct: 314 NATSNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGK 293
Y V +E I KEI+ +GP A + +DF+HYK+G+Y+H ++A E+ H+ K
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSANKESEKYRKLQTHAVK 414
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
floridanus]
Length = 443
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 90/273 (32%), Positives = 128/273 (46%), Gaps = 36/273 (13%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
R+ YDP+ +P F++R +WP I + D G C A + SDR I SKG
Sbjct: 195 RRIYDPD---ALPREFNSRTRWPR--DISDIHDQGWCGASWAVSTADVASDRFAIMSKGA 249
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
+ LS +++ SC + C G + R W F+ K G V C P T
Sbjct: 250 ETVELSAQHLLSC----NNRGQQGCKGGYLDRAWLFMRKFGLVD-------EECYPWTGR 298
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
C +K LK C NP + ++ Y + NE I +E
Sbjct: 299 N----------DQCRLRKRSNLK-TAGCQNPP--NSLRTELYKVGPAYRLG-NETDIMQE 344
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTE---NGTP--YW 306
IL GP AT +Y DF+ Y+SGVY+H+ +A+L ++ HS ++IGWG E G P YW
Sbjct: 345 ILTSGPVQATMRVYQDFFVYQSGVYRHSRSAELHDSGYHSVRIIGWGEEPSYRGPPLKYW 404
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
LV N+WG +WG+ G +I +G EC E + A
Sbjct: 405 LVANSWGHNWGENGLFRIQKGTNECEIESYVLA 437
>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Bos taurus]
gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
Length = 534
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 90/280 (32%), Positives = 125/280 (44%), Gaps = 54/280 (19%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G + LS +
Sbjct: 269 VLPRTFEASEKWPN---LIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQN 325
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + + C G + W FL +RG V+ Y P S HG
Sbjct: 326 LLSC----DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCY------------PFSGHGRDE 369
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-------------DKHRTTLTYWVDDNEDA 248
+P+ P H+R GRG Q D ++ T Y + NE
Sbjct: 370 AVPA------PPCMMHSR----AMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKE 419
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
I KE++ +GP A +++DF+ Y+SG+Y HT S + E Y HS K+ GWG E
Sbjct: 420 IMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETL 479
Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 480 PDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 519
>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
Length = 476
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 104/339 (30%), Positives = 147/339 (43%), Gaps = 57/339 (16%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP-----LPGDRKTYDPEY 80
I+Q+N+ WTA N S+ + + D F P L + T
Sbjct: 161 IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 81 SATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +P+ F A +WP H P D CAA F+ +DR I+SKG+ LS
Sbjct: 214 TTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTIS 192
+ + SCC R+ C+ GS+ R W +L KRG V+ Y GC ++ S
Sbjct: 271 QNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGCAMASRS 326
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
G C N V K +C+ P Y V NE I KE
Sbjct: 327 --DGRGKRHATKPCPNN-VEKSNRIYQCSPP----------------YRVSSNETEIMKE 367
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT--- 303
I+ +GP A + +DF+HYK+G+Y+H ++N + E Y H+ KL GWGT G
Sbjct: 368 IMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQ 427
Query: 304 --PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 428 KEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 81/252 (32%), Positives = 118/252 (46%), Gaps = 43/252 (17%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ +A +PD FD+R QW +C + + D C + FAAV + SDR CI S+G+ N LS
Sbjct: 73 QINAALPDSFDSRTQWKDC--VHPIRDQAKCGSCWAFAAVESLSDRFCIASQGKVNLVLS 130
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRT-WNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
+ + SC D + C G T W +L ++G G D C+P S +
Sbjct: 131 PQDMLSC------DASNFCCFGGYLDTAWQYLEQQG--VGSD-----SCEPYK----SGN 173
Query: 198 GSAPTLPS-CEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G P+ PS C N Q + K KC T G +A K I
Sbjct: 174 GDQPSCPSKCSNGQAIKKYKCKAGSTKQAKGA-------------------EATKSLIQQ 214
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
GP F +Y+DF +Y SG+Y H + + H+ K++GWG + YW+V N+WG
Sbjct: 215 SGPVETGFTIYEDFLNYNSGIYHHVTGGNMGG--HAVKILGWGKQGLENYWIVANSWGED 272
Query: 316 WGDRGTVKILRG 327
WG++G I +G
Sbjct: 273 WGEKGYFNIRQG 284
>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
Length = 362
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 90/280 (32%), Positives = 125/280 (44%), Gaps = 54/280 (19%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G + LS +
Sbjct: 97 VLPRTFEASEKWPN---LIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQN 153
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + + C G + W FL +RG V+ Y P S HG
Sbjct: 154 LLSC----DTHNQQGCHGGRLDGAWWFLRRRGVVSDHCY------------PFSGHGRDE 197
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-------------DKHRTTLTYWVDDNEDA 248
+P+ P H+R GRG Q D ++ T Y + NE
Sbjct: 198 AVPA------PPCMMHSR----AMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKE 247
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
I KE++ +GP A +++DF+ Y+SG+Y HT S + E Y HS K+ GWG E
Sbjct: 248 IMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETL 307
Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 308 PDGRTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 347
>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
Length = 346
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 126/280 (45%), Gaps = 54/280 (19%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 81 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 137
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC K + + C G + W FL +RG V+ Y P S G
Sbjct: 138 LLSCDK----RNQQGCQGGHLDSAWWFLRRRGVVSDHCY------------PFSGQGRTE 181
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-------------DKHRTTLTYWVDDNEDA 248
T P+ P+ H+R GRG Q D ++ T Y + +E
Sbjct: 182 TGPA------PRCMMHSR----AMGRGKRQATARCPNHQVHANDIYQVTPAYRLGSSEKE 231
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
I KE++ +GP A +++DF+ Y++G+Y HT S + E Y HS K+ GWG E+
Sbjct: 232 IMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESL 291
Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 292 PDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 331
>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
Length = 362
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 87/268 (32%), Positives = 124/268 (46%), Gaps = 30/268 (11%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 97 VLPRAFEASEKWPN---LIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + + C G + W FL +RG V+ D+ C P + + G AP
Sbjct: 154 LLSC----DTHNQQGCQGGRLDGAWWFLRRRGVVS--DH-----CYPFSGHERNEAGPAP 202
Query: 202 T-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
+ K + RC N D ++ T Y + NE I KE++ +GP
Sbjct: 203 RCMMHSRAMGRGKRQATARCPNSYV---HANDIYQVTPAYRLGSNEKDIMKELMENGPVQ 259
Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLVI 309
A +++DF+ Y+SG+Y HT S+ + E Y HS K+ GWG E YW
Sbjct: 260 ALMEVHEDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRMLKYWTAA 319
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 320 NSWGPGWGERGHFRIVRGANECDIESFV 347
>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
Length = 527
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/176 (38%), Positives = 94/176 (53%), Gaps = 17/176 (9%)
Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ 231
RG++T GD GC P PC+HH + P C C +C NP Y
Sbjct: 364 RGNLTKGD-----GCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKN 418
Query: 232 DKH----RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN 287
D+H + Y V++ ++AI+ + GP +A++ +Y+DF YKSGVYKHTS + L
Sbjct: 419 DRHYMLESSPYQYSVNNAKNAIRTD----GPISASYLVYEDFLAYKSGVYKHTSGSYLGG 474
Query: 288 YLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
H+ K+IGWG ENG YWLV+N+W WGD+G KI G C + + G PK
Sbjct: 475 --HAVKIIGWGEENGEAYWLVVNSWNEDWGDQGLFKIALGN--CEIDDDLLGGTPK 526
>gi|148704124|gb|EDL36071.1| cathepsin B, isoform CRA_b [Mus musculus]
Length = 237
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 80/210 (38%), Positives = 107/210 (50%), Gaps = 33/210 (15%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDR 73
+ SD I+ IN++ TW AGRNF N+ YL++ ++ K LPG
Sbjct: 22 SFHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-- 70
Query: 74 KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
S +P+ FDAREQW NC TIG + D G+C + F AV A SDR CI + G+
Sbjct: 71 -------SIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRV 123
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N +S E + +CC I D C+ G W+F K+G V+GG Y GC P TI P
Sbjct: 124 NVEVSAEDLLTCCGIQCGD---GCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPP 180
Query: 194 CSHH--GSAPTL------PSCENQKVPKLK 215
C HH GS P P C N+K+P ++
Sbjct: 181 CEHHVNGSRPPCTGEGDTPRC-NKKLPAIR 209
>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
Length = 431
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 81/267 (30%), Positives = 120/267 (44%), Gaps = 41/267 (15%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P F+A ++W + I VPD G C A + + SDR I+SKG++ LS + +
Sbjct: 187 LPRSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKETVQLSAQNIL 244
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + + C G + W +LHK+G V Y
Sbjct: 245 SCTR-----RQQGCDGGHLDAAWRYLHKKGVVDESCY----------------------- 276
Query: 204 PSCENQKVPKLKCHTR------CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
P +++ K++ ++R C P +D T + + E I EI G
Sbjct: 277 PYTQHRDTCKIRHNSRSLRANGCETPV---NVDRDTFYTVGPAYSLNREADIMAEIFNSG 333
Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
P AT + DF+ Y GVY+ T+ N + HS KL+GWG E NG YW+ N+WG
Sbjct: 334 PVQATMRVNRDFFSYSRGVYRQTAANREAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSW 393
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG++G +ILRG EC E + A P
Sbjct: 394 WGEKGYFRILRGSNECGIEEYVLASWP 420
>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
Length = 475
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 152/351 (43%), Gaps = 63/351 (17%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTXPLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DY 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ D
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
GC ++ S G C N + K +C+ P Y
Sbjct: 314 NANNGCAMASRS--DGRGKRHATKPCPN-NIEKSNRIYQCSPP----------------Y 354
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKL 294
V +E I KEI+ +GP A + +DF+HYK+G+Y+H ++N + E Y H+ KL
Sbjct: 355 RVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKL 414
Query: 295 IGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 TGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
Length = 475
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 152/351 (43%), Gaps = 63/351 (17%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTAPLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DY 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ D
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
GC ++ S G C N + K +C+ P Y
Sbjct: 314 NANNGCAMASRS--DGRGKRHATKPCPN-NIEKSNRIYQCSPP----------------Y 354
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKL 294
V +E I KEI+ +GP A + +DF+HYK+G+Y+H ++N + E Y H+ KL
Sbjct: 355 RVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKL 414
Query: 295 IGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 TGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
Length = 450
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 83/265 (31%), Positives = 122/265 (46%), Gaps = 34/265 (12%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDARE+WP I V D G CA+ + +DR I + G+ N PLS + +
Sbjct: 184 LPSSFDAREKWPL--YIHPVRDQGDCASSWSHSTTATSADRLSIITDGRVNIPLSAQQLL 241
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + C G + R W ++ K G V+ Y +G +T P +
Sbjct: 242 SC----NQHRQRGCEGGYLDRAWWYIRKLGVVSELCYPYESG---ATQQP-----GECRI 289
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P + + C + +P+ +R T Y V E I EI+ +GP ATF
Sbjct: 290 PKSAYRTGAHIDCPSGAADPSV--------YRMTPPYRVSSREQDIMTEIITNGPVQATF 341
Query: 264 ALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINTW 312
+Y+DF+ Y GVY+H K++ Y HS ++IGWG + T YWL N+W
Sbjct: 342 LVYEDFFMYSGGVYQHLDLHEHKEEERKVQGY-HSVRIIGWGEDYSTGPQVKYWLAANSW 400
Query: 313 GPHWGDRGTVKILRGKYECAFEYLI 337
G WG+ G +ILRG+ C E +
Sbjct: 401 GNEWGEDGLFRILRGENHCEIESFV 425
>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
Length = 484
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 83/267 (31%), Positives = 121/267 (45%), Gaps = 28/267 (10%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P F+A E+WP G + D G CA F+ SDR I+S G + LS + +
Sbjct: 221 LPSHFNAAEKWP--GLVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLL 278
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + C G V W +L +RG V+ C P T + H SAP +
Sbjct: 279 SC----DTRNQHGCRGGRVDGAWWYLRRRGVVS-------EPCYPFTSLNTNGH-SAPCM 326
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
+ K + C N Y + +++T Y + +E I KE+ +GP A
Sbjct: 327 MQSRSMGRGKRQATNNCPNQYYSS---NEIYQSTPAYRLASSEKDIMKELYENGPVQAIM 383
Query: 264 ALYDDFYHYKSGVYKHTSNAKLE------NYLHSGKLIGWGTENGT-----PYWLVINTW 312
+++DF+ YKSG+Y+ T + E + HS K+ GWG E G YWL N+W
Sbjct: 384 EVHEDFFMYKSGIYRRTPVTEREPEHHRRHGTHSVKITGWGEERGRDGQTHKYWLAANSW 443
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA 339
G WG+ G +I RG+ EC E I
Sbjct: 444 GRDWGEDGYFRIARGENECEIETFIVG 470
>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
Length = 443
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 91/276 (32%), Positives = 123/276 (44%), Gaps = 42/276 (15%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
++ YDP+ +P FD+R +W I + D G C A + SDR I SKG
Sbjct: 195 KRIYDPD---ALPREFDSRTRWSR--DISGIHDQGWCGASWAVSTADVASDRYSIMSKGA 249
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPS 189
+ LS + + SC + C G + R W F+ K G V Y G C+
Sbjct: 250 EAPELSAQQLLSC----NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWSGKNDQCKLR 305
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
S G C P L+ P Y G NE I
Sbjct: 306 KRSTLKAAG-------CRKPSHP-LRTELYKVGPAYRLG----------------NETDI 341
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTE---NGTP- 304
+EIL GP AT +Y DF+ YKSG+Y+H+ +A+L ++ HS ++IGWG E G P
Sbjct: 342 MQEILTSGPVQATMRVYQDFFIYKSGIYRHSRSAELHDSGYHSVRIIGWGEERSYRGPPL 401
Query: 305 -YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
YWLV N+WG +WGD G KI +G EC E + A
Sbjct: 402 KYWLVANSWGYNWGDNGLFKIQKGTNECEIESYVLA 437
>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
Length = 362
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 88/273 (32%), Positives = 119/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 97 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 154 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 205
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + RC N D ++ T Y + N+ I KE++
Sbjct: 206 MHSRA--------MGRGKRQATARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELME 254
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 255 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 314
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 315 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 347
>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
glaber]
Length = 467
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 119/269 (44%), Gaps = 32/269 (11%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A ++WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 VLPKAFEASKKWPN---MIHDPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPVLSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--GDRTGCQPSTISPCSHHGS 199
+ SC + C G + W FL +RG V+ Y + +PC H
Sbjct: 259 LLSC----DTHHQQGCQGGRLDGAWWFLRRRGVVSDHCYPFSGHEQAEAGPATPCMMHSR 314
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
A K + RC N + ++ T Y + +E I KE++ +GP
Sbjct: 315 A--------MGRGKRQATRRCPN---SHDDANEIYQVTPAYRLGSDEKEIMKELMENGPV 363
Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE-----NGTPYWLV 308
A +Y+DF+ YKSG+Y HT S + E Y HS K+ GWG E YW
Sbjct: 364 QALMEVYEDFFLYKSGIYSHTLVSMGRPEQYRRHGTHSVKITGWGEEMLPDGRTLKYWTA 423
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +ILRG EC E +
Sbjct: 424 ANSWGPSWGERGYFRILRGSNECDIESFV 452
>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
domestica]
Length = 468
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 88/275 (32%), Positives = 124/275 (45%), Gaps = 43/275 (15%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ F + +WP G D CAA F+ +DR I+SKG+ LS + +
Sbjct: 209 LPEFFISSYKWP--GWTHDPLDQKNCAASWAFSTASVAADRIAIQSKGRYTDNLSPQNLI 266
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG-------DRTGCQPSTISPCSH 196
SCC R+ C GS+ R W +L KRG V+ Y + GC ++ S
Sbjct: 267 SCCVKNRH----GCKGGSIDRAWWYLRKRGLVSHACYPLFKDQIFNNNGCDMASRS--DG 320
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
G C N + K +C+ P Y V NE I KEI+ +
Sbjct: 321 RGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSNETEIMKEIMQN 363
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGKLIGWGTENGT-----PY 305
GP A +++DF+HYKSG+Y+H +N K E+ H+ KL GWG G +
Sbjct: 364 GPVQAIMQVHEDFFHYKSGIYRHINNLKDESEKYRNLRTHAVKLTGWGVLRGAQGKKEKF 423
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 424 WIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 458
>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
Length = 179
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 94/188 (50%), Gaps = 9/188 (4%)
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
AV A SDR CI S G N+ LS + SCCK C Y C G W+F G V
Sbjct: 1 GAVEAMSDRLCIHSSGAFNKSLSAVDLLSCCKDCGY----GCDGGFPPMAWDFWKTHGIV 56
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
TGG + GC+P C HH S P C + P KC C P + +DK R
Sbjct: 57 TGGSKEEPAGCRPYPFPKCQHH-SQGHYPPCPRRIYPTPKCVKHCDTPKID--YQKDKTR 113
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
+Y V +E AI KEIL +GP ATF +++DF YKSG+Y H + H+ +++
Sbjct: 114 ANTSYNVHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGG--HAIRIL 171
Query: 296 GWGTENGT 303
GWG ENG
Sbjct: 172 GWGEENGV 179
>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
Length = 484
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 82/267 (30%), Positives = 119/267 (44%), Gaps = 41/267 (15%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P F+A ++W + I VPD G C A + + SDR I+SKG++ LS + +
Sbjct: 187 LPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNIL 244
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + + C G + W +LHK+G V Y
Sbjct: 245 SCTR-----RQQGCEGGHLDAAWRYLHKKGVVDENCY----------------------- 276
Query: 204 PSCENQKVPKLKCHTR------CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
P +++ K++ ++R C P +D T + + E I EI G
Sbjct: 277 PYTQHRDTCKIRHNSRSLRANGCQTPV---NVDRDTLYTVGPAYSLNREADIMAEIFHSG 333
Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
P AT + DF+ Y GVY+ T+ N K HS KL+GWG E NG YW+ N+WG
Sbjct: 334 PVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSW 393
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G +ILRG EC E + A P
Sbjct: 394 WGEHGYFRILRGSNECGIEEYVLASWP 420
>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
Length = 271
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 90/273 (32%), Positives = 130/273 (47%), Gaps = 30/273 (10%)
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
DPE +P F++ E+WP G I D G CAA F+ SDR I+S G
Sbjct: 2 DPERDQ-LPLYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQ 58
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ-PSTISPCS 195
LS + + SC + C+ G + W +L +RG VT Y R Q P+ +S C
Sbjct: 59 LSPQNLISC----DTRNQGGCAGGRLDGAWWYLRRRGVVTEDCYPYRPPQQTPAELSRC- 113
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
+ + K + RC N + D +++T Y + +E I KEI
Sbjct: 114 -------MMQSRSVGRGKRQATQRCPNTN---NYQNDIYQSTPPYRLSTSEKEIMKEIQD 163
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE---NGT--P 304
+GP A +++DF+ Y SG+YKHT S K +Y HS K+ GWG E +GT
Sbjct: 164 NGPVQAIMEVHEDFFMYNSGIYKHTDVSFTKPPHYRKHGTHSVKITGWGEERNFDGTTRK 223
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW+ N+WG +WG+ G +I RG+ EC E +
Sbjct: 224 YWIAANSWGKNWGENGYFRIARGENECEIEAFV 256
>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
Length = 466
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 126/280 (45%), Gaps = 54/280 (19%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 257
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC K + + C G + W FL +RG V+ Y P S G
Sbjct: 258 LLSCDK----RNQQGCQGGHLDSAWWFLRRRGVVSDHCY------------PFSGQGRTE 301
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-------------DKHRTTLTYWVDDNEDA 248
T P+ P+ H+R GRG Q D ++ T Y + +E
Sbjct: 302 TGPA------PRCMMHSR----AMGRGKRQATARCPNHQVHANDIYQVTPAYRLGSSEKE 351
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
I KE++ +GP A +++DF+ Y++G+Y HT S + E Y HS K+ GWG E+
Sbjct: 352 IMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESL 411
Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 412 PDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 451
>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 300
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 123/260 (47%), Gaps = 41/260 (15%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
VP+ FD RE++P+C I V D G C + F++V F DRRC+ ++ S +YV
Sbjct: 75 VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + +C+ G + W FL K G+ T C P + G+ PT
Sbjct: 133 SCDH-----GDMACNGGWLPNVWKFLTKTGTTT-------DECVPYKSGSTTLRGTCPTK 180
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED--AIKKEILAHGPTTA 261
+ + KV H T T + D D A+ K + GP
Sbjct: 181 CADGSSKV----------------------HLATATSYKDYGLDIPAMMKALSTSGPLQV 218
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRG 320
F +Y DF +Y+SGVY+HT H+ +++G+GT++ G YW++ N+WGP WG+ G
Sbjct: 219 AFLVYSDFMYYESGVYQHTYGYMEGG--HAVEMVGYGTDDDGVDYWIIRNSWGPDWGEDG 276
Query: 321 TVKILRGKYECAFEYLIAAG 340
+++RG +C+ E AG
Sbjct: 277 YFRMIRGINDCSIEEQAYAG 296
>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 98/187 (52%), Gaps = 9/187 (4%)
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
AV A +DR CI S + +S+ + SCC+ C + C G R W+F + G V
Sbjct: 1 GAVEAMTDRLCIHSNATIKKHISSTDLLSCCESCGF----GCHGGFPPRAWDFWMENGLV 56
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
TGG + +GC+ C+HHG P P C + P C+ C P + DK +
Sbjct: 57 TGGSKENPSGCRSYPFPKCNHHGKGPDAP-CPEKIFPTPACNKTCDTPEVN--YILDKTK 113
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
+Y V ++E AI KEI+ +GP A F +Y+DF HY+SGVY H+ + H+ +++
Sbjct: 114 AKSSYNVPNSEKAIMKEIMQNGPVEAAFEVYEDFLHYESGVYFHSFGRMIGG--HAIRML 171
Query: 296 GWGTENG 302
GWG ENG
Sbjct: 172 GWGEENG 178
>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
Length = 467
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 88/273 (32%), Positives = 119/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 310
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + RC N D ++ T Y + N+ I KE++
Sbjct: 311 MHSRA--------MGRGKRQATARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELME 359
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 419
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452
>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
Length = 387
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 119/265 (44%), Gaps = 35/265 (13%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P F++ ++W + I V D G C + + + SDR I+S+G++ LS + +
Sbjct: 142 LPRSFNSIDKWAS--YISDVLDQGWCGSSWVISTASVASDRFAIQSRGKEVIQLSPQNIL 199
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHGSA 200
SC + + C+ G + W +LHK+G V Y G R C
Sbjct: 200 SCTR-----RQQGCNGGHLDAAWRYLHKQGVVDESCYPYVGYRDAC-------------- 240
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
K+P R G +D+ T + +NE I EI GP
Sbjct: 241 ---------KIPHNSRSLRNNGCRSYSGVDRDELYTVGPAYSLNNETDIMAEIFMSGPVQ 291
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENY-LHSGKLIGWGTE-NGTPYWLVINTWGPHWGD 318
AT +Y DF+ Y G+Y+HT+ ++ HS KLIGWG E +G YW+ N+WG WG+
Sbjct: 292 ATLTVYRDFFSYSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKYWIATNSWGTWWGE 351
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G +ILRG EC E + A P
Sbjct: 352 HGNFRILRGSNECGIEEYVLAAWPN 376
>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
Length = 526
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 129/265 (48%), Gaps = 34/265 (12%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAR++W + I + D G C + + G SDR I S+G+ N LS++ +
Sbjct: 258 LPEHFDARDKWGH--LIHPIADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLL 315
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC K C G + R W ++ K G V GD+ C P +S S +
Sbjct: 316 SC----NQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYP-YVSGQSREPGHCLI 363
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P + L+C + + T + T Y V E+ I+ E++ +GP ATF
Sbjct: 364 PKRDYTNRQGLRCPSGSQDST--------AFKMTPPYKVSSREEDIQTELMTNGPVQATF 415
Query: 264 ALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINTW 312
+++DF+ Y GVY+H+ +++ E Y HS +++GWG ++ T YWL N+W
Sbjct: 416 VVHEDFFMYAGGVYQHSDLAAQKGASSVAEGY-HSVRVLGWGVDHSTGRPIKYWLCANSW 474
Query: 313 GPHWGDRGTVKILRGKYECAFEYLI 337
G WG+ G KILRG+ C E +
Sbjct: 475 GTQWGEDGYFKILRGENHCEIESFV 499
>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
Length = 234
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 81/239 (33%), Positives = 108/239 (45%), Gaps = 28/239 (11%)
Query: 107 GACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTW 166
G C + F AV DR CI N LS + +CC D C G W
Sbjct: 1 GHCGSCWAFGAVECLQDRFCIHF--NMNISLSVNDLVACCGFMCGD---GCDGGYPIMAW 55
Query: 167 NFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT 224
+ + G VT Y D+ GC+ H G P P+ V + KC +
Sbjct: 56 RYFVRNGVVTDECDPYFDQVGCK--------HPGCEPAYPT----PVCEKKCKVQ----- 98
Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
+ + + KH + Y V+ + I E+ +GP F +Y+DF HYKSGVYKH +
Sbjct: 99 -NQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGM 157
Query: 285 LENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ H+ KLIGWGT + G YWL+ N W WGD G KI+RG EC E + AG P
Sbjct: 158 MGG--HAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMP 214
>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 91/301 (30%), Positives = 125/301 (41%), Gaps = 66/301 (21%)
Query: 84 VPDRFDAREQWPNCGTIG------------------------------------HVPDTG 107
+P FDAR WP C TIG ++ D G
Sbjct: 99 LPKHFDARTAWPQCSTIGKILGRLLDSFSSYFDDFFCFGCTDALYFSYHLLVPFYIKDQG 158
Query: 108 ACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTW 166
C + F AV + SDR CI N LS + +CC +C C G W
Sbjct: 159 HCGSCWAFGAVESLSDRFCIHFG--MNISLSVNDLLACCGFLC----GSGCDGGYPLYAW 212
Query: 167 NFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT 224
+ G VT Y D TGC SH G P P+ KC +CT+
Sbjct: 213 RYFIHHGVVTEECDPYFDATGC--------SHPGCEPGYPT--------PKCVRKCTDEN 256
Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
+ + + K Y + + I E+ +GP F +Y+DF HY+SGVY++T+
Sbjct: 257 --QLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDV 314
Query: 285 LENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+ H+ KLIGWGT ++G YW++ N W +WGD G I RG EC E + AG P
Sbjct: 315 MGG--HAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLPS 372
Query: 344 N 344
+
Sbjct: 373 S 373
>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
Length = 429
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 93/330 (28%), Positives = 138/330 (41%), Gaps = 51/330 (15%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP-E 79
S++ ++ +NR ++W A N+P + L++ LI F PL + + P
Sbjct: 131 MSNSVVEGVNRGGSSWRA-YNYP-EFRNKKLKEGLIYKLGTF-----PLNAETRRMGPLR 183
Query: 80 YSATVP--DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
Y VP +FDAR +WP G I + D G C + + G SDR I+S G +N L
Sbjct: 184 YDKDVPYPTQFDARTRWP--GFISPIVDQGWCGSDWAVSLAGVASDRFAIQSNGAENMVL 241
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
S + + SC + C G + WNF G V
Sbjct: 242 SPQTLLSC----NVRAQQGCHGGHIDVAWNFARGHGLVD--------------------- 276
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-----DNEDAIKKE 252
C K +C R G R T Y + +E I +
Sbjct: 277 ------EKCFPYKASVTRCPFRPRGNLIQDGCMPLVKRRTSRYKLGPPAKLSHEKDIMYD 330
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
I+ GP A +Y DF+HY+ GVY+ + N +L+ + HS ++IGWG + G YW+V N
Sbjct: 331 IMESGPVQAVMTVYQDFFHYRDGVYRRSYHGNNELKGF-HSVRIIGWGEDRGDRYWVVAN 389
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+WG WG+ G +I RG E E + G
Sbjct: 390 SWGRQWGENGYFRIARGSNEADIESFVVTG 419
>gi|344287518|ref|XP_003415500.1| PREDICTED: tubulointerstitial nephritis antigen isoform 1
[Loxodonta africana]
Length = 468
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/274 (31%), Positives = 123/274 (44%), Gaps = 42/274 (15%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A ++WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 203 VLPMAFEASKKWPN---LIHEPLDQGDCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 259
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + + C G + W FL +RG V+ G D+ G + PC
Sbjct: 260 LLSC----DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHERDKAG----PVPPCM 311
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNP-TYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H A K + +RC N +G +Q T Y + NE I KE++
Sbjct: 312 MHSRA--------MGRGKRQATSRCPNSHVHGNDIYQ----VTPAYRLGTNEKEIMKELM 359
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GT 303
+GP A +++DF+ Y+ G+Y HT S + E Y HS K+ GWG E
Sbjct: 360 ENGPVQALMEVHEDFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEETLPDGRTL 419
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 420 KYWTAANSWGPAWGERGHFRIVRGANECDIESFV 453
>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
protease B1; Flags: Precursor
Length = 303
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 148/327 (45%), Gaps = 53/327 (16%)
Query: 22 SDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S A + +I W AG + F N++E+ R LI + +S LP T E
Sbjct: 17 SRAELRRIQALNPPWKAGMPKRF-ENVTEDEFRSMLIRPDRLRARSGS-LPPISITEVQE 74
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P +FD R+++P C + D G+C + F+A+G F DRRC ++ S
Sbjct: 75 LVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQ 132
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG-----DYGDRTGCQPSTISPC 194
+++ SC +N C G TW+FL G+ T DYG
Sbjct: 133 QHLISCSL-----ENFGCDGGDFQPTWSFLTFTGATTAECVKYVDYG------------- 174
Query: 195 SHHGSAPTLPSCENQKVPKL-KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
H ++P C++ +L K H YG+ V + AI +
Sbjct: 175 -HTVASPCPAVCDDGSPIQLYKAHG------YGQ--------------VSKSVPAIMGML 213
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
+A GP +Y D +Y+SGVYKHT + H+ +++G+GT ++GT YW++ N+W
Sbjct: 214 VAGGPLQTMIVVYADLSYYESGVYKHTYGT-INLGFHALEIVGYGTTDDGTDYWIIKNSW 272
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA 339
GP WG+ G +I+RG EC E I A
Sbjct: 273 GPDWGENGYFRIVRGVNECRIEDEIYA 299
>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
Length = 431
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 83/267 (31%), Positives = 120/267 (44%), Gaps = 41/267 (15%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P F+A ++W + I VPD G C A + + SDR I+SKG++ LS + +
Sbjct: 187 LPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNIL 244
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + + C G + W +LHK+G V C P T
Sbjct: 245 SCTR-----RQQGCEGGHLDAAWRYLHKKGVVD-------ENCYPYT------------- 279
Query: 204 PSCENQKVPKLKCHTR------CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
+++ K++ ++R C P +D T + + E I EI G
Sbjct: 280 ---QHRDTCKIRHNSRSLRANGCQTPV---NVDRDTLYTVGPAYSLNREADIMAEIFHSG 333
Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
P AT + DF+ Y GVY+ T+ N K HS KL+GWG E NG YW+ N+WG
Sbjct: 334 PVQATMRVNRDFFAYSGGVYRETAANRKALTGFHSVKLVGWGEEHNGEKYWIAANSWGSW 393
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G +ILRG EC E + A P
Sbjct: 394 WGEHGYFRILRGSNECGIEDYVLASWP 420
>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 322
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 119/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 57 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 113
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ +C + C G + W FL +RG V+ G D G P PC
Sbjct: 114 LLACDT----HHQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 165
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + RC N D ++ T Y + N+ I KE++
Sbjct: 166 MHSRA--------MGRGKRQATARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELME 214
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 215 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 274
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 275 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 307
>gi|344287520|ref|XP_003415501.1| PREDICTED: tubulointerstitial nephritis antigen isoform 2
[Loxodonta africana]
Length = 437
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/274 (31%), Positives = 123/274 (44%), Gaps = 42/274 (15%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A ++WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 172 VLPMAFEASKKWPN---LIHEPLDQGDCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 228
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + + C G + W FL +RG V+ G D+ G + PC
Sbjct: 229 LLSC----DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHERDKAG----PVPPCM 280
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNP-TYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H A K + +RC N +G +Q T Y + NE I KE++
Sbjct: 281 MHSRA--------MGRGKRQATSRCPNSHVHGNDIYQ----VTPAYRLGTNEKEIMKELM 328
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GT 303
+GP A +++DF+ Y+ G+Y HT S + E Y HS K+ GWG E
Sbjct: 329 ENGPVQALMEVHEDFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEETLPDGRTL 388
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 389 KYWTAANSWGPAWGERGHFRIVRGANECDIESFV 422
>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
Length = 415
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 121/269 (44%), Gaps = 32/269 (11%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 150 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 206
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--GDRTGCQPSTISPCSHHGS 199
+ SC + C G + W FL +RG V+ Y R + S C H
Sbjct: 207 LLSC----DTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASPTPRCMMHSR 262
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
A K + +RC N G+ D ++ T Y + +E I KE++ +GP
Sbjct: 263 A--------MGRGKRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPV 311
Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLV 308
A +++DF+ Y+ G+Y HT S + E Y HS K+ GWG E YW
Sbjct: 312 QALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTA 371
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 372 ANSWGPWWGERGHFRIVRGTNECDIETFV 400
>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 463
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 102/338 (30%), Positives = 147/338 (43%), Gaps = 56/338 (16%)
Query: 26 IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP-----LPGDRKTYDPEY 80
I+Q+N+ WTA N S+ + + D F P L + T
Sbjct: 149 IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPMLLSMNEMTAPLPA 201
Query: 81 SATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +P+ F A +WP H P D CAA F+ +DR I+SKG+ LS
Sbjct: 202 TTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 258
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DYGDRTGCQPSTISP 193
+ + SCC R+ C+ GS+ R W +L KRG V+ D GC ++ S
Sbjct: 259 QNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGCAMASRS- 313
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
G C N + K +C+ P Y V +E I KEI
Sbjct: 314 -DGRGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSSETEIMKEI 355
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT---- 303
+ +GP A + +DF+HYK+G+Y+H ++N + E Y H+ KL GWGT G
Sbjct: 356 MQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRK 415
Query: 304 -PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 416 EKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 453
>gi|161343837|tpg|DAA06099.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 255
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/259 (31%), Positives = 120/259 (46%), Gaps = 17/259 (6%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
L + V + Y +ID IN +A TW AG NF + +E+ + L +K
Sbjct: 8 LSVIFVSVYVTEQTYFLQKDFIDNINNQATTWKAGVNFDPDTPKEHFLKML--GSKGVQI 65
Query: 65 SDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
++ KT+D Y +P FDAR +W +C TIG V D G C + A AF
Sbjct: 66 PNKHNIHMYKTHDEAYDNLFGRIPKHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAF 125
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
+DR C+ + N LS E + CC C + C+ G + W KRG VTGGDY
Sbjct: 126 ADRLCVATNADFNELLSAEEITFCCHSCGF----GCNGGYPIKAWERFKKRGLVTGGDYQ 181
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTLT 239
GC+P + PC + A +C + P+ H RCT YG F + HR T
Sbjct: 182 SGEGCEPYRVPPCPY--DAEGHNTCAGK--PRESNH-RCTRMCYGNXDLDFDEDHRYTRD 236
Query: 240 YWVDDNEDAIKKEILAHGP 258
++ +I+K+++ +GP
Sbjct: 237 FYY-LTYGSIQKDVMTYGP 254
>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 303
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 147/327 (44%), Gaps = 53/327 (16%)
Query: 22 SDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S A + +I W AG + F N++E+ R LI + +S LP T E
Sbjct: 17 SRAELRRIQALNPPWKAGMPKRF-ENVTEDEFRSMLIRPDRLRARSGS-LPPISITEVQE 74
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P +FD R+++P C + D G+C F+A+G F DRRC ++ S
Sbjct: 75 LVDPIPPQFDFRDEYPQC--VKPALDQGSCGGCWAFSAIGVFGDRRCAMGIDKEAVSYSQ 132
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG-----DYGDRTGCQPSTISPC 194
+++ SC +N C G TW+FL G+ T DYG
Sbjct: 133 QHLISCSL-----ENFGCDGGDFQPTWSFLTFTGATTAECVKYVDYG------------- 174
Query: 195 SHHGSAPTLPSCENQKVPKL-KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
H ++P C++ +L K H YG+ V + AI +
Sbjct: 175 -HTVASPCPAVCDDGSPIQLYKAHG------YGQ--------------VSKSVPAIMGML 213
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
+A GP +Y D +Y+SGVYKHT + H+ +++G+GT ++GT YW++ N+W
Sbjct: 214 VAGGPLQTMIVVYADLSYYESGVYKHTYGT-INLGFHALEIVGYGTTDDGTDYWIIKNSW 272
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA 339
GP WG+ G +I+RG EC E I A
Sbjct: 273 GPDWGENGYFRIVRGVNECRIEDEIYA 299
>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
Length = 467
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/274 (31%), Positives = 120/274 (43%), Gaps = 44/274 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P F+A E+WPN + H P D G CA F+ SDR I S G LS + +
Sbjct: 203 LPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNL 259
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTI--SPC 194
SC K + + C G + W FL +RG V+ G + G +P + S
Sbjct: 260 LSCDK----HNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGQERNEAGPEPRCMMHSRA 315
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
G + C N V D ++ T Y + NE I KE++
Sbjct: 316 MGRGKRQAIARCPNHHV-----------------HANDIYQVTPAYRLGSNEKEIMKELM 358
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GT 303
+GP A +++DF+ Y+ G+Y HT S K E Y HS K+ GWG E
Sbjct: 359 ENGPVQALMEVHEDFFLYQGGIYSHTPVSLGKPERYRRHGTHSVKITGWGEETLPDGRTL 418
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 419 KYWTAANSWGPAWGERGHFRIVRGTNECDIESFV 452
>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Otolemur garnettii]
Length = 436
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 89/271 (32%), Positives = 122/271 (45%), Gaps = 36/271 (13%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 171 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 227
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + C G + W FL +RG V+ D+ C P + G AP
Sbjct: 228 LLSC----DTHHQQGCHGGRLDGAWWFLRRRGVVS--DH-----CYPFSGQERDKAGPAP 276
Query: 202 TLPSCENQKVP----KLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
C P K + RC N D ++ T Y + NE I KE++ +G
Sbjct: 277 L---CMMHSRPMGRGKRQATARCPNNQVQA---NDIYQVTPAYRLGSNEKEIMKELMENG 330
Query: 258 PTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYW 306
P A +++DF+ Y+SG+Y HT S + E Y HS K+ GWG E YW
Sbjct: 331 PVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLPDGRTLKYW 390
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 391 TAANSWGPAWGERGHFRIVRGANECDIESFV 421
>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Adrenocortical zonation factor 1; Short=AZ-1;
AltName: Full=Androgen-regulated gene 1 protein;
AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TARP; Flags: Precursor
gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
musculus]
gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
Length = 466
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 121/269 (44%), Gaps = 32/269 (11%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 257
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--GDRTGCQPSTISPCSHHGS 199
+ SC + C G + W FL +RG V+ Y R + S C H
Sbjct: 258 LLSC----DTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASPTPRCMMHSR 313
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
A K + +RC N G+ D ++ T Y + +E I KE++ +GP
Sbjct: 314 A--------MGRGKRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPV 362
Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLV 308
A +++DF+ Y+ G+Y HT S + E Y HS K+ GWG E YW
Sbjct: 363 QALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTA 422
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 423 ANSWGPWWGERGHFRIVRGTNECDIETFV 451
>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
Length = 474
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 89/277 (32%), Positives = 123/277 (44%), Gaps = 46/277 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR I+SKG+ LS + +
Sbjct: 214 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 270
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
SCC R+ C+ GS+ R W FL KRG V+ Y GC ++ S
Sbjct: 271 ISCCPKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKNQNATNHGCAMASRS--D 324
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + K +C+ P Y V NE I KEI+
Sbjct: 325 GRGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSNETEIMKEIMQ 367
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLEN-------YLHSGKLIGWGTENGT----- 303
+GP A +++DF+HYK+G+Y+H + E H+ KL GWGT G
Sbjct: 368 NGPVQAIMQVHEDFFHYKTGIYRHITKKANEESGKYRKLQTHAVKLTGWGTLKGAQGRKE 427
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 428 KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464
>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
norvegicus]
gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; Flags:
Precursor
gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
Length = 467
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/270 (31%), Positives = 119/270 (44%), Gaps = 33/270 (12%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 257
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHG 198
+ SC K C G + W FL +RG V+ Y G + S C H
Sbjct: 258 LLSC----DTHHQKGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNDEASPTPRCMMHS 313
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
A K + +RC N D ++ T Y + +E I KE++ +GP
Sbjct: 314 RA--------MGRGKRQATSRCPNSQVDS---NDIYQVTPVYRLASDEKEIMKELMENGP 362
Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWL 307
A +++DF+ Y+ G+Y HT S + E Y HS K+ GWG E YW
Sbjct: 363 VQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 422
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 423 AANSWGPWWGERGHFRIVRGINECDIETFV 452
>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
protease B3; Flags: Precursor
gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
Length = 299
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 123/265 (46%), Gaps = 37/265 (13%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ PD FD RE++P+C I V D G C + F++V + DRRC ++ S +
Sbjct: 71 ATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVKYSPQ 128
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
YV SC + + +C G + W FL K G+ T C P G+
Sbjct: 129 YVVSCDR-----GDMACDGGWLPSVWRFLTKTGTTT-------DECVPYQSGSTGARGTC 176
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
PT + +P L T+ + YG + AI K + GP
Sbjct: 177 PT-KCADGSDLPHLYKATKAVD--YGL-----------------DAPAIMKALATGGPLQ 216
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDR 319
F +Y DF +Y+SGVY+HT ++E H+ ++G+GT++ G YW++ N+WGP WG+
Sbjct: 217 TAFTVYSDFMYYESGVYQHT-YGRVEGG-HAVDMVGYGTDDDGVDYWIIKNSWGPDWGED 274
Query: 320 GTVKILRGKYECAFEYLIAAGKPKN 344
G +I+R EC E + G +N
Sbjct: 275 GYFRIIRMTNECGIEEQVIGGFFEN 299
>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
Length = 303
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 147/327 (44%), Gaps = 53/327 (16%)
Query: 22 SDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S A + +I W AG + F N++E+ R LI + +S LP T E
Sbjct: 17 SRAELRRIQALNPPWKAGMPKRF-ENVTEDEFRSMLIRPDRLRARSGS-LPPISITEVQE 74
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P +FD R+++P C + D G+C F+A+G F DRRC ++ S
Sbjct: 75 LVDPIPPQFDFRDEYPQC--VKPALDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQ 132
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG-----DYGDRTGCQPSTISPC 194
+++ SC +N C G TW+FL G+ T DYG
Sbjct: 133 QHLISCSL-----ENFGCDGGDFQPTWSFLTFTGATTAECVKYVDYG------------- 174
Query: 195 SHHGSAPTLPSCENQKVPKL-KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
H ++P C++ +L K H YG+ V + AI +
Sbjct: 175 -HTVASPCPAVCDDGSPIQLYKAHG------YGQ--------------VSKSVPAIMGML 213
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
+A GP +Y D +Y+SGVYKHT + H+ +++G+GT ++GT YW++ N+W
Sbjct: 214 VAGGPLQTMIVVYADLSYYESGVYKHTYGT-INLGFHALEIVGYGTTDDGTDYWIIKNSW 272
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA 339
GP WG+ G +I+RG EC E I A
Sbjct: 273 GPDWGENGYFRIVRGVNECRIEDEIYA 299
>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
Length = 134
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/128 (47%), Positives = 83/128 (64%), Gaps = 5/128 (3%)
Query: 216 CHTRCTNPTYGRGFFQDKHRT-TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
C + C N YG F +D+H T +L + +IKKEI+ +GPT+A F++Y+DF YKS
Sbjct: 8 CSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDFLSYKS 67
Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
GVYKHTS L H+ ++IGWGTE G YWLV+N+W WGD GT KI++G +C +
Sbjct: 68 GVYKHTSGGFLGG--HAVEIIGWGTEKGVDYWLVMNSWNEEWGDHGTFKIVQG--DCGID 123
Query: 335 YLIAAGKP 342
+I AG P
Sbjct: 124 DMILAGTP 131
>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 128/299 (42%), Gaps = 45/299 (15%)
Query: 45 NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP 104
N++ LR L + D P + + E +P FDAR QW C + +
Sbjct: 54 NMTISQLRDNLFGLSLMSSDEDTP-----RMANIETRVDIPMNFDARTQWKGC--VPAIR 106
Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
D C A F+A + R CI + GQ N LS EY C + NK+C G +
Sbjct: 107 DQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM-----NKACQGGYLKY 161
Query: 165 TWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQ-KVPKLKCHTRCTNP 223
+W FL + TG T P + G + +C Q K+ +
Sbjct: 162 SWTFL------------ENTGTPLDTCIPYASGGGTFSSGTCPTQCKIASMS-------- 201
Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
K++ T ++ + IK I+ +G A F +Y D YKSGVYKH +
Sbjct: 202 -------MSKYKAKNTVYISGINN-IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHLVST 253
Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
L H+ LIG+G E G+ YWL N+WGP+WG G KI +G E E + AG+P
Sbjct: 254 VLGG--HAVALIGFGVEGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAGEP 308
>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 275
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/324 (28%), Positives = 144/324 (44%), Gaps = 53/324 (16%)
Query: 21 FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
+++ +D +N + ++TW A EY R+ L AK + G +
Sbjct: 3 LAESVVDIVNNDPSSTWVA---------TEYPREILTL-AKMTAMISQIGNGFEGEWTFA 52
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ P FD R++WP G V + +C + AA R I+ G +S
Sbjct: 53 ENENAPASFDCRQKWP--GKAEPVRNQASCGSCWAHAASETMGFRMGIR--GCYKGVMSP 108
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SC +N C G R WN++ K+G T C P S G
Sbjct: 109 QDLVSC-----ESNNMGCEGGYADRVWNWIQKKGITT-------EQCLPYV----SGSGR 152
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
PT PS +C N + + R+ ++ W N + E+ +GP
Sbjct: 153 VPTCPS-------------KCKNGS-------NIVRSFVSSWGSFNSKTVMDEVANNGPV 192
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
A F +++DF +YKSG+Y+H + K + + H L+GWGTENG PYWL+ N+WG WG++
Sbjct: 193 YACFEVFEDFLNYKSGIYQHKT-GKSKGWHHV-MLMGWGTENGVPYWLLQNSWGSGWGEK 250
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G +I RG +C + + +G PK
Sbjct: 251 GFFRIRRGTNDCHIDEIFYSGLPK 274
>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Otolemur garnettii]
Length = 467
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/271 (32%), Positives = 122/271 (45%), Gaps = 36/271 (13%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + C G + W FL +RG V+ D+ C P + G AP
Sbjct: 259 LLSC----DTHHQQGCHGGRLDGAWWFLRRRGVVS--DH-----CYPFSGQERDKAGPAP 307
Query: 202 TLPSCENQKVP----KLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
C P K + RC N D ++ T Y + NE I KE++ +G
Sbjct: 308 L---CMMHSRPMGRGKRQATARCPNNQVQA---NDIYQVTPAYRLGSNEKEIMKELMENG 361
Query: 258 PTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYW 306
P A +++DF+ Y+SG+Y HT S + E Y HS K+ GWG E YW
Sbjct: 362 PVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLPDGRTLKYW 421
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 422 TAANSWGPAWGERGHFRIVRGANECDIESFV 452
>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
Length = 475
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/275 (32%), Positives = 126/275 (45%), Gaps = 44/275 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR I+SKG+ LS + +
Sbjct: 217 LPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 273
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DYGDRTGCQPSTISPCSH 196
SCC R+ C+ GS+ R W +L KRG V+ D GC ++ S
Sbjct: 274 ISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGCAMASRS--DG 327
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
G C N + K +C+ P Y V +E I KEI+ +
Sbjct: 328 RGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSSETEIMKEIMQN 370
Query: 257 GPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----PY 305
GP A + +DF+HYK+G+Y+H ++N + E Y H+ KL GWGT G +
Sbjct: 371 GPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKF 430
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 431 WIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
africana]
Length = 476
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 150/351 (42%), Gaps = 62/351 (17%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGR--NFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
LVR EL I+ +N+ WTA F EE L+ F + P+
Sbjct: 155 LVRPEL-------IEYVNKGDYGWTAKNYSQFWGMTLEEGLK-FRLGTL-----PPSPML 201
Query: 71 GDRKTYDPEYSAT--VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCI 127
P AT +P+ F A +WP H P D CAA F+ +DR I
Sbjct: 202 LSMNEVTPSLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAI 258
Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------- 180
+S G+ LS + + SCC R+ C+ GSV R W +L KRG V+ Y
Sbjct: 259 QSNGRYTANLSPQNLISCCTKNRH----GCNSGSVDRAWWYLRKRGLVSHACYPLFKDQN 314
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
+ GC ++ S G C N + K +C+ P Y
Sbjct: 315 ANNNGCAMASRS--DGRGKRHATKPCPNN-IEKSNVIYQCSPP----------------Y 355
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKL 294
V NE I KEI+ +GP A +++DF+HYK+G+Y+H ++ + E Y H+ KL
Sbjct: 356 RVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVIRTSEESEKYQKLRTHAVKL 415
Query: 295 IGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
GWG G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 416 TGWGMMKGAKGRKEKFWVAANSWGKSWGEDGYFRILRGVNESDIEKLIIAA 466
>gi|294890618|ref|XP_002773230.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878281|gb|EER05046.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 238
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/246 (32%), Positives = 117/246 (47%), Gaps = 21/246 (8%)
Query: 24 AYIDQINREANTWTA----GRNFPANL--SEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
+ +D++N + N WTA GR + ++L +++ FL + + K Y
Sbjct: 3 SLVDEVNSKQNLWTASTEQGRFYGSSLGDAKKLCGTFLNGTEEL----------EEKVYP 52
Query: 78 PEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
PE +PD FDAR+ + C IGHV D AC + F V AF+ R CIKS G+ N+
Sbjct: 53 PEELVDIPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNARVCIKSGGKLNQL 112
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG---GDYGDRTGCQPSTISP 193
LS + +CC I + + CS G+ +W FLH G V+G + GC P
Sbjct: 113 LSAADMLACCNIEHFCLSFGCSGGNPITSWTFLHTNGIVSGKLSKNMKAADGCWPYNFPK 172
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT-TLTYWVDDNEDAIKKE 252
C+HH C + C + C N YG F +D+H T +L + +IKKE
Sbjct: 173 CAHHQKESDYKPCAKELYDTPSCSSSCPNAKYGTAFDKDRHYTESLLPSRFGSTSSIKKE 232
Query: 253 ILAHGP 258
I+ +GP
Sbjct: 233 IMTNGP 238
>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
Length = 454
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/268 (32%), Positives = 121/268 (45%), Gaps = 30/268 (11%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 189 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 245
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + + C G + W FL +RG V+ D+ C P G AP
Sbjct: 246 LLSC----DTHNQRGCHGGRLDGAWWFLRRRGVVS--DH-----CYPFVGREQDEAGPAP 294
Query: 202 -TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
+ K + RC + D ++ T Y + NE I KE++ +GP
Sbjct: 295 RCMMHSRAMGRGKRQATARCPS---SHAHANDIYQVTPAYRLGSNEKEIMKELMENGPVQ 351
Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLVI 309
A +++DF+ Y+SG+Y HT S + E Y HS K+ GWG E YW
Sbjct: 352 ALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAA 411
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 412 NSWGPAWGERGHFRIVRGANECDIESFV 439
>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
Length = 470
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 83/265 (31%), Positives = 129/265 (48%), Gaps = 34/265 (12%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAR++W + I V D G C + + G SDR I S+G+ N LS++ +
Sbjct: 202 LPEHFDARDKWGH--LIHPVADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLL 259
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC K C G + R W ++ K G V GD+ C P +S S +
Sbjct: 260 SC----NQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYP-YVSGQSREPGHCLI 307
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P + L+C + + T + T Y V E+ I+ E++ +GP ATF
Sbjct: 308 PKRDYTNRQGLRCPSGDQDST--------AFKMTPPYKVSSREEDIQTELMTNGPVQATF 359
Query: 264 ALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINTW 312
+++DF+ Y GVY+H+ +++ E Y HS +++GWG ++ T YWL N+W
Sbjct: 360 VVHEDFFMYAGGVYQHSDLAAQKGASSVAEGY-HSVRVLGWGVDHSTGRPIKYWLCANSW 418
Query: 313 GPHWGDRGTVKILRGKYECAFEYLI 337
G WG+ G KILRG+ C E +
Sbjct: 419 GTQWGEDGYFKILRGENHCEIESFV 443
>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
gorilla gorilla]
Length = 462
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 197 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 253
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 254 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 305
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + C N D ++ T Y + N+ I KE++
Sbjct: 306 MHSQA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 354
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 355 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 414
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 415 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 447
>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
abelii]
Length = 362
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 88/279 (31%), Positives = 118/279 (42%), Gaps = 52/279 (18%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 97 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 154 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPTP----PCM 205
Query: 196 HH------GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
H G SC N V D ++ T Y + N+ I
Sbjct: 206 MHSRAMGRGKRQATASCPNSHVNN-----------------NDIYQVTPVYRLGSNDKEI 248
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-- 301
KE++ +GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 249 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 308
Query: 302 ---GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 309 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 347
>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
familiaris]
Length = 476
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 88/276 (31%), Positives = 123/276 (44%), Gaps = 45/276 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR I+S G+ LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNL 273
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
SCC R+ C+ GS+ R W FL KRG V+ Y GC ++ S
Sbjct: 274 ISCCAKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNYGCAMASRS--D 327
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + K +C+ P Y V NE I KEI+
Sbjct: 328 GRGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSNETEIMKEIMQ 370
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGKLIGWGTENGT-----P 304
+GP A +++DF+HYK+G+Y+H + E+ H+ KL GWGT G
Sbjct: 371 NGPVQAIMQVHEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVKLTGWGTLKGAQGQKEK 430
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 431 FWIAANSWGISWGENGYFRILRGVNESDIEKLIIAA 466
>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
sapiens]
Length = 362
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 97 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 154 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 205
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + C N D ++ T Y + N+ I KE++
Sbjct: 206 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 254
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 255 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 314
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 315 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 347
>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 298
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 79/258 (30%), Positives = 120/258 (46%), Gaps = 38/258 (14%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
VPD FD RE++P+C I V D G+C + F++V + DRRC ++ S +YV
Sbjct: 74 VPDSFDFREEYPHC--IPEVVDQGSCGSCWAFSSVASLGDRRCFAGLDKKAVTYSPQYVV 131
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + +C G + W FL K G+ T C P G+ PT
Sbjct: 132 SCDH-----GDMACDGGWLQSVWRFLTKTGTTT-------NECVPYQSGTTGARGTCPT- 178
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
+C + G K + + Y +D D I K ++ GP F
Sbjct: 179 ---------------KCAD---GGELSTVKAKKAVDYGLDC--DLIMKALVTGGPLQTAF 218
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDRGTV 322
+Y DF +Y+ GVY+H S ++E H+ +++G+GT E YW++ N+WGP WG+ G
Sbjct: 219 TVYSDFMYYEGGVYQHMS-GRVEGG-HAVEMVGYGTDEYDVDYWIIRNSWGPDWGEDGYF 276
Query: 323 KILRGKYECAFEYLIAAG 340
+I+R EC E + G
Sbjct: 277 RIIRMTNECGIEEQVMGG 294
>gi|308804940|ref|XP_003079782.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116058239|emb|CAL53428.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
Length = 498
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 88/262 (33%), Positives = 124/262 (47%), Gaps = 27/262 (10%)
Query: 83 TVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
++P FDAR+++P C IG V D G C + AA +DR CI S G++ LS ++
Sbjct: 256 SLPRHFDARDEYPKCARLIGTVRDQGKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQF 315
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
SC Y+ C G V T +G GG D+ C P PC H P
Sbjct: 316 ALSC-----YNSGAGCEGGDVVDTLTLALAKGVPHGGML-DKGACLPYQFEPCDHPCMIP 369
Query: 202 -TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA-IKKEILAHGPT 259
T P C C + + FQ + L Y ++ A I KEI G
Sbjct: 370 GTSPEA---------CPATCADGSK----FQLVYPKNLPYTCPPDDIACIAKEIKNRGSV 416
Query: 260 TATFA-LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWG 317
TF +++DFY +K GVYK T ++ E H+ KLIGWG T+ G YW+++N+W +WG
Sbjct: 417 AVTFGPVHEDFYGHKEGVYKVTESSGRELGNHATKLIGWGVTQEGDHYWIMVNSW-RNWG 475
Query: 318 DRGTVKILRGKYECAFEYLIAA 339
+ G K+ G E + E +AA
Sbjct: 476 ENGVGKVRMG--EMSIESGVAA 495
>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
Length = 475
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 87/274 (31%), Positives = 129/274 (47%), Gaps = 37/274 (13%)
Query: 82 ATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
A +P+ F A +WP H P D CAA F+ +DR I+SKG+ LS +
Sbjct: 214 ADLPEVFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 270
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC R+ C+ GS+ R W FL KRG V+ C P +++ S
Sbjct: 271 NLISCCAKNRH----GCNSGSIDRAWWFLRKRGLVS-------HACYPLFKEQSTNNNSC 319
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT---TLTYWVDDNEDAIKKEILAHG 257
+ + K C N F+ +R + Y + NE I +EI+ +G
Sbjct: 320 AMASRSDGRG--KRHATRPCPNS------FEKSNRIYQCSPPYRISSNETEIMREIIQNG 371
Query: 258 PTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----PYW 306
P A +++DF++YK+G+Y+H ++N + E Y H+ KL GWGT G +W
Sbjct: 372 PVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKLTGWGTLRGAQGKKEKFW 431
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+ N+WG WG+ G +ILRG E E LI A
Sbjct: 432 IAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
Length = 294
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 142/342 (41%), Gaps = 55/342 (16%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
LV + V + ++ + I + + W + Q L Y
Sbjct: 4 LVIIGTIVAVAVATHPINEEMVAHIKAKTSLWQPHETTTNPFNNMTKEQLLAKCGTYIVP 63
Query: 65 SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
+++ PG + TVP+ FDAR+QW + I + D C + F A AFSDR
Sbjct: 64 ANKEYPGSKIM-------TVPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDR 114
Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNK-SCSHGSVFRTWNFLHKRGSVTGG--DYG 181
I K + LS E + SC D N C+ G + W +L G+ T Y
Sbjct: 115 FAINGK---DVILSPEDLVSC------DTNDYGCNGGYMDVAWEYLADHGAATDSCFPYS 165
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
+G P+ C+ GSA + + KC + G
Sbjct: 166 AGSGFAPACSDKCAD-GSA----------MQRFKCAPNSVRQSKGVA------------- 201
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
I+ EI++HGP F +Y DF++Y+SGVY T+ H+ K++G+G EN
Sbjct: 202 ------QIQSEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGG--HAIKILGYGVEN 253
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
GTPYWL N+WGP WG G KI +G EC E + + P+
Sbjct: 254 GTPYWLCANSWGPAWGMSGFFKIKQG--ECGIEDQVFSCDPQ 293
>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
Length = 466
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 128/265 (48%), Gaps = 34/265 (12%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FD+R++W + I V D G C + + G SDR I S+G+ N LS++ +
Sbjct: 198 LPEHFDSRDKWGH--LINPVVDQGDCGSSWAVSTTGISSDRLAIISEGRINASLSSQQLL 255
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC K C G + R W ++ K G V GD+ C P +S S +
Sbjct: 256 SC----NQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYP-YVSGQSREPGHCLI 303
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P + L+C + + T + T Y V E+ I+ E++ +GP ATF
Sbjct: 304 PKRDYTDRRGLRCPSGSQDST--------AFKMTPPYKVSSREEDIQTELMTNGPVQATF 355
Query: 264 ALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINTW 312
+++DF+ Y GVY+H+ +++ E Y HS +++GWG ++ T YWL N+W
Sbjct: 356 VVHEDFFMYAGGVYQHSDLAAQKGASSVAEGY-HSVRVLGWGVDHSTGRPIKYWLCANSW 414
Query: 313 GPHWGDRGTVKILRGKYECAFEYLI 337
G WG+ G KILRG C E +
Sbjct: 415 GTQWGEDGYFKILRGDNHCEIESFV 439
>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
protease B2; Flags: Precursor
gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
Length = 300
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 123/260 (47%), Gaps = 41/260 (15%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
VP+ FD RE++P+C I V D G C + F++V F DRRC+ ++ S +YV
Sbjct: 75 VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + +C+ G + W FL K G+ T C P + G+ PT
Sbjct: 133 SCDH-----GDMACNGGWLPNVWKFLTKTGTTT-------DECVPYKSGSTTLRGTCPTK 180
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED--AIKKEILAHGPTTA 261
+ + KV H T T + D D A+ K + GP
Sbjct: 181 CADGSSKV----------------------HLATATSYKDYGLDIPAMMKALSTSGPLQV 218
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRG 320
F ++ DF +Y+SGVY+HT H+ +++G+GT++ G YW++ N+WGP WG+ G
Sbjct: 219 AFLVHSDFMYYESGVYQHTYGYMEGG--HAVEMVGYGTDDDGVDYWIIKNSWGPDWGEDG 276
Query: 321 TVKILRGKYECAFEYLIAAG 340
+++RG +C+ E AG
Sbjct: 277 YFRMIRGINDCSIEEQAYAG 296
>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 78/251 (31%), Positives = 115/251 (45%), Gaps = 41/251 (16%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ +A +PD FD+R QW +C + + D C + FAA + SDR CI S+G+ N LS
Sbjct: 73 QINAALPDSFDSRTQWKDC--VHPIRDQAQCGSCWAFAAAESLSDRFCIASQGKVNLVLS 130
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ + SC N C G + + W +L ++G + C+P S +G
Sbjct: 131 PQDMVSC-----DTSNFGCFGGYLDQAWQYLEQQGVSS-------DSCEPYK----SGNG 174
Query: 199 SAPTLPS-CEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
P+ P+ C N Q + K KC T G +A K I
Sbjct: 175 DQPSCPTKCSNGQAIKKYKCKAGSTKQAKGA-------------------EATKSLIQES 215
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
GP F +Y DFY+Y SGVY H + H+ K++GWG + YW+V N+WG W
Sbjct: 216 GPVETGFTVYQDFYNYNSGVYHHVTGDAEGG--HAVKILGWGKQGLENYWIVANSWGEDW 273
Query: 317 GDRGTVKILRG 327
G++G I +G
Sbjct: 274 GEKGYFNIRQG 284
>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
[Pongo abelii]
Length = 436
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 101/350 (28%), Positives = 138/350 (39%), Gaps = 65/350 (18%)
Query: 16 GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
G +Y Y D NR W AG + + L +Y + RP
Sbjct: 109 GRIYPILGTYWDNCNR---CWQAGNH------SAFWGMTLDEGIRYRLGTIRPSSSVMNM 159
Query: 76 YDP----EYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSK 130
++ +P F+A E+WPN + H P D G CA F+ SDR I S
Sbjct: 160 HEIYTVLNPGEVLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL 216
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRT 184
G LS + + SC + C G + W FL +RG V+ G D
Sbjct: 217 GHMTPVLSPQNLLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEA 272
Query: 185 GCQPSTISPCSHH------GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
G P PC H G SC N V D ++ T
Sbjct: 273 GPTP----PCMMHSRAMGRGKRQATASCPNSHVNN-----------------NDIYQVTP 311
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSG 292
Y + N+ I KE++ +GP A +++DF+ YK G+Y HT S + E Y HS
Sbjct: 312 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 371
Query: 293 KLIGWGTEN-----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
K+ GWG E YW N+WGP WG+RG +I+RG EC E +
Sbjct: 372 KITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 421
>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
Length = 462
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 121/267 (45%), Gaps = 37/267 (13%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDA WP G IG V D G C + + SDR I SKG++ L+ + +
Sbjct: 185 LPTHFDATNYWP--GFIGKVRDQGWCGSSWAVSTASVASDRFAILSKGRETVQLAPQQIV 242
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + ++ CS G + W++L K G+V Y + I P +A
Sbjct: 243 SCVR-----RSQGCSGGHLDTAWSYLRKVGTVNEECYPYISAHNVCKIRPSDTLITA--- 294
Query: 204 PSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
+CE KV + + P + +NE I EI HGP A
Sbjct: 295 -NCELPMKVDRTNMYK--MGPAFSL----------------NNETDIMLEIKKHGPVQAI 335
Query: 263 FALYDDFYHYKSGVYKHT---SNAKLENYLHSGKLIGWGTE----NGTPYWLVINTWGPH 315
++ DF+ YKSG+Y+H+ ++A HS +LIGWG E T YW+ +N+WG
Sbjct: 336 MRVHRDFFSYKSGIYRHSAASTSADQRAGYHSVRLIGWGEERHGYEVTKYWIAVNSWGTW 395
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G +ILRG EC E + A P
Sbjct: 396 WGENGRFRILRGSNECEIESYVLASLP 422
>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
[Nomascus leucogenys]
Length = 362
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 97 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 154 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 205
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + C N D ++ T Y + N+ + KE++
Sbjct: 206 MHSRA--------MGRGKRQATAHCPNSHVNN---NDIYQVTPVYRLGSNDKEVMKELME 254
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 255 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 314
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 315 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 347
>gi|308163309|gb|EFO65659.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 309
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 139/322 (43%), Gaps = 54/322 (16%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD---RKTYD 77
+ A + QI A TW AG E L+ +D K +D P R +
Sbjct: 16 LTQAELRQIQALAPTWKAG-------IPERLKSLTKSDFKRMLSADSPRTQPSMVRPIHV 68
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
PE PD FD RE++P C I V D G C++ +AV AFS RRC+ Q+
Sbjct: 69 PESEDPAPDHFDFREEYPQC--ITEVIDIGLCSSSWAHSAVDAFSHRRCLTGLDQEATRY 126
Query: 138 STEYVASCCKICRYDDNKSC----SHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
S +Y+ SC C + G + W+F+ G +
Sbjct: 127 SAQYILSCAS------TNGCFGFSTQGDI--AWDFIATTGV---------------PLES 163
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C + N+ C + C + ++ + D + V N + +K+ +
Sbjct: 164 CVKYTDY-------NETQSSWPCPSVCNDNSFLEIYKPDGYEG-----VGFNSERLKRAV 211
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
GP A FA+Y+DF +Y G+Y HT + +L S +++G+GT + G YW+V N W
Sbjct: 212 AFRGPMQAMFAVYEDFTYYLEGIYSHTYGNR-AGFL-SVEIVGYGTSDEGQDYWIVKNYW 269
Query: 313 GPHWGDRGTVKILRGKYECAFE 334
GP WG+ G +I+RG+ EC E
Sbjct: 270 GPDWGEDGYFRIVRGQDECQIE 291
>gi|161343827|tpg|DAA06094.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 207
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 97/202 (48%), Gaps = 15/202 (7%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ +L +L + + Y YI++IN +A TW AG NF +E++ + L +
Sbjct: 4 VLILLSVILFSVYMTEQAYFLEKDYINKINEQATTWKAGVNFDPKTPKEHILKLLGSKGV 63
Query: 61 YFDQSDRPLPGDRKTYDPE------YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
P + K Y E +P +FDAR++W NC TIG + D G C +
Sbjct: 64 QI-----PSKVNYKMYKSEDENYDNLLGRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWA 118
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
A AF+DR C+ S G N+ LS E + CC C + C+ G + W K G
Sbjct: 119 LATSSAFADRLCVASNGNFNQLLSAEELTFCCHKCGF----GCNGGYPIKAWERFMKHGL 174
Query: 175 VTGGDYGDRTGCQPSTISPCSH 196
VTGGDY R GC+P + PC +
Sbjct: 175 VTGGDYKSREGCEPYRVPPCPY 196
>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Equus caballus]
Length = 436
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 100/340 (29%), Positives = 142/340 (41%), Gaps = 43/340 (12%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
RG +Y Y D NR W AG + + L +Y + RP
Sbjct: 108 RGRVYPVLGTYWDNCNR---CWRAGNH------SAFWGMTLDEGIRYRLGTIRPSSSVTS 158
Query: 75 TYDPEY----SATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKS 129
+ +P F+A E+WPN + H P D G CA F+ SDR I S
Sbjct: 159 MNEIHTVLGPGEVLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHS 215
Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
G LS + + SC + + C G + W FL +RG V+ D+ C P
Sbjct: 216 LGHMTPVLSPQNLLSC----DTHNQQGCRGGHLDGAWWFLRRRGVVS--DH-----CYPF 264
Query: 190 TISPCSHHGSAP-TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
+ G AP + K + C N R D ++ T Y + +E
Sbjct: 265 SGRERDEAGPAPRCMMHSRAMGRGKRQATAHCPN---SRVHTNDIYQVTPAYRLGSSEKE 321
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
I KE++ +GP A +++DF+ Y+ GVY HT S+ + E Y HS K+ GWG E
Sbjct: 322 IMKELMENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETL 381
Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 382 PDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 421
>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Equus caballus]
Length = 480
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 92/292 (31%), Positives = 129/292 (44%), Gaps = 43/292 (14%)
Query: 68 PLPGDRKTYDPEYSAT--VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDR 124
P+ P AT +P+ F A +WP H P D CAA F+ +DR
Sbjct: 203 PMLLSMNEVTPSLPATTDLPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADR 259
Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---- 180
I+S G+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y
Sbjct: 260 IAIQSNGRFTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFK 315
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ T + S G C N + K +C+ P
Sbjct: 316 DQNATNNDCAMASRSDGRGKRHATKPCPNN-IEKSNRIYQCSPP---------------- 358
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
Y V NE I KEI+ +GP A ++DDF+HYK G+Y+H +++ + E Y H+ K
Sbjct: 359 YRVSSNETEIMKEIMQNGPVQAIMQVHDDFFHYKKGIYRHVTSTHEEPEKYRKLRTHAIK 418
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 419 LAGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 470
>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Pongo abelii]
Length = 467
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 88/279 (31%), Positives = 118/279 (42%), Gaps = 52/279 (18%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPTP----PCM 310
Query: 196 HH------GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
H G SC N V D ++ T Y + N+ I
Sbjct: 311 MHSRAMGRGKRQATASCPNSHVNN-----------------NDIYQVTPVYRLGSNDKEI 353
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-- 301
KE++ +GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 354 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 413
Query: 302 ---GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 414 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452
>gi|412985820|emb|CCO17020.1| cathepsin B-like cysteine proteinase [Bathycoccus prasinos]
Length = 541
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 79/256 (30%), Positives = 118/256 (46%), Gaps = 11/256 (4%)
Query: 79 EYSATVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
E + +P+ FDARE+WP C IG D G C + A SDR CI S G+ L
Sbjct: 271 EPPSDLPESFDAREKWPECSEFIGEAWDQGECGSCWAIAPTKVMSDRLCIASGGKVQERL 330
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
+ + SC ++ SC G + F + G +GG YGD GC PC H
Sbjct: 331 AASEILSCGQLVSEFSFGSCEGGMPDDAYEFAKEFGVASGGKYGDEKGCAAYPFPPCHHP 390
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
P+C K +C T +H L + D + D + +EI G
Sbjct: 391 CHVQPTPACP-LKSDTAQCQGDLDEHTRNEVA---QHIDKLIHCPDGDYDCMAREIYNSG 446
Query: 258 PTTA-TFALYDDFYHYKSGVYKHTSNAKLENYLHSG---KLIGWGTE-NGTPYWLVINTW 312
P ++ +YD+FY YK G Y+ +++++ H G ++IGW E +GT W +IN+W
Sbjct: 447 PVSSYAGTIYDEFYAYKDGAYRTSADSETRGRSHGGHVIEVIGWHKESDGTYSWKIINSW 506
Query: 313 GPHWGDRGTVKILRGK 328
+WG +G +I G+
Sbjct: 507 -LNWGKKGHGRIAVGE 521
>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
Length = 475
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 88/276 (31%), Positives = 127/276 (46%), Gaps = 41/276 (14%)
Query: 82 ATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
A +P+ F A +WP H P D CAA F+ +DR I+SKG+ LS +
Sbjct: 214 ADLPEIFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 270
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-----GDRTGCQPSTISPCS 195
+ SCC R+ C+ GS+ R W FL KRG V+ Y + T + S
Sbjct: 271 NLISCCAKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASRSD 326
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + + +C+ P Y V NE I +EI+
Sbjct: 327 GRGKRHATKPCPNSFEKSNRIY-QCSPP----------------YRVSSNETEIMREIIQ 369
Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
+GP A +++DF++YK+G+Y+H ++N + E Y H+ KL GWGT G
Sbjct: 370 NGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEK 429
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 430 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 298
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 118/265 (44%), Gaps = 38/265 (14%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ VPD FD RE++P+C I V D G C + F++V + DRRC+ ++ S +
Sbjct: 71 ATQVPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCVAGLDKKAVRYSPQ 128
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
YV SC + + +C G + W FL K G+ T C P G+
Sbjct: 129 YVVSCDR-----GDMACDGGWLPSVWRFLVKTGTTT-------DECVPYQSGSTGARGTC 176
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
PT KC P Y K + Y +D D I K + GP
Sbjct: 177 PT------------KCADGSELPIY-------KATKAVDYGLD--CDLIMKALATGGPLQ 215
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDR 319
F +Y DF +Y+ GVY+H H+ +++G+GT E YW++ N+WGP WG+
Sbjct: 216 TAFTVYSDFMYYQGGVYQHVYGRAEGG--HAVEMVGYGTDEYDVDYWIIRNSWGPDWGED 273
Query: 320 GTVKILRGKYECAFEYLIAAGKPKN 344
G +I+R EC E + G +N
Sbjct: 274 GYFRIIRMTNECGIEEQVIGGFFEN 298
>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like 1 [Pan troglodytes]
Length = 472
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 207 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 263
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 264 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 315
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + C N D ++ T Y + N+ I KE++
Sbjct: 316 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 364
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 365 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 424
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 425 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 457
>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
paniscus]
Length = 436
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 171 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 227
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 228 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 279
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + C N D ++ T Y + N+ I KE++
Sbjct: 280 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 328
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 329 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 388
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 389 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 421
>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
sapiens]
gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; AltName:
Full=Oxidized LDL-responsive gene 2 protein;
Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TIN Ag-related protein;
Short=TIN-Ag-RP; Flags: Precursor
gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
[Homo sapiens]
gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
Length = 467
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 310
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + C N D ++ T Y + N+ I KE++
Sbjct: 311 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 359
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 419
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452
>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
sapiens]
gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
Length = 436
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 100/344 (29%), Positives = 138/344 (40%), Gaps = 53/344 (15%)
Query: 16 GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
G +Y Y D NR W AG + + L +Y + RP
Sbjct: 109 GRIYPVLGTYWDNCNR---CWQAGNH------SAFWGMTLDEGIRYRLGTIRPSSSVMNM 159
Query: 76 YDP----EYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSK 130
++ +P F+A E+WPN + H P D G CA F+ SDR I S
Sbjct: 160 HEIYTVLNPGEVLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL 216
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRT 184
G LS + + SC + C G + W FL +RG V+ G D
Sbjct: 217 GHMTPVLSPQNLLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEA 272
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
G P PC H A K + C N D ++ T Y +
Sbjct: 273 GPAP----PCMMHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGS 317
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWG 298
N+ I KE++ +GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG
Sbjct: 318 NDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWG 377
Query: 299 TEN-----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
E YW N+WGP WG+RG +I+RG EC E +
Sbjct: 378 EETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 421
>gi|741376|prf||2007265A cathepsin B
Length = 153
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 81/150 (54%), Gaps = 5/150 (3%)
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C HH + P PK C C P Y + QDKH +Y V ++E I EI
Sbjct: 1 CEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 57
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
+GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W
Sbjct: 58 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWN 115
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E + AG P+
Sbjct: 116 TDWGDNGFFKILRGQDHCGIESEVVAGIPR 145
>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
paniscus]
Length = 467
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 310
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + C N D ++ T Y + N+ I KE++
Sbjct: 311 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 359
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 419
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452
>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
Length = 327
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 124/273 (45%), Gaps = 41/273 (15%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
R+ YDP ++P FD+ +WP G + + D G C + SDR I SKG+
Sbjct: 71 RRIYDPN---SLPREFDSEFKWP--GWMSEIQDQGWCGSSWAITTAAVASDRFAILSKGR 125
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
+ LS +++ SC + +SC+ G + R W+++ K G V +
Sbjct: 126 EKVTLSAQHLLSCDR----RGQQSCNGGYLDRAWSYIRKIGLVDEQCF------------ 169
Query: 193 PCSHHGSAPTLPSCENQKVPKLK--CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
P + E ++P+ C PT + K++ Y V NE I
Sbjct: 170 --------PYSATNEKCRIPRRGDLVTANCQLPT--NVDRRSKYKVAPAYRVG-NETDIM 218
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTENG----TP 304
EIL GP AT +Y DF+ YK G+Y+H+ S Y HS +++GWG E
Sbjct: 219 YEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGY-HSVRIVGWGEEYSPEGLKK 277
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW V N+WGP WG+ G +ILRG EC E +
Sbjct: 278 YWKVANSWGPEWGENGYFRILRGSNECEIESFV 310
>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
Length = 269
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 89/302 (29%), Positives = 137/302 (45%), Gaps = 50/302 (16%)
Query: 45 NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP 104
N++E+ R LI + +S LP T E +P +FD R+++P C +
Sbjct: 7 NVTEDEFRSMLIRPDRLRARSGS-LPPISITEVQELVDPIPPQFDFRDEYPQC--VKPAL 63
Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
D G+C F+A+G F DRRC ++ S +++ SC +N C G
Sbjct: 64 DQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSL-----ENFGCDGGDFQP 118
Query: 165 TWNFLHKRGSVTGG-----DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL-KCHT 218
TW+FL G+ T DYG H ++P C++ +L K H
Sbjct: 119 TWSFLTFTGATTAECVKYVDYG--------------HTVASPCPAVCDDGSPIQLYKAHG 164
Query: 219 RCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYK 278
YG+ V + AI ++A GP +Y D +Y+SGVYK
Sbjct: 165 ------YGQ--------------VSKSVPAIMGMLVAGGPLQTMIVVYADLSYYESGVYK 204
Query: 279 HTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
HT + H+ +++G+GT ++GT YW++ N+WGP WG+ G +I+RG EC E I
Sbjct: 205 HTYGT-INLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEI 263
Query: 338 AA 339
A
Sbjct: 264 YA 265
>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
Length = 475
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 88/276 (31%), Positives = 127/276 (46%), Gaps = 41/276 (14%)
Query: 82 ATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
A +P+ F A +WP H P D CAA F+ +DR I+SKG+ LS +
Sbjct: 214 ADLPEIFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 270
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-----GDRTGCQPSTISPCS 195
+ SCC R+ C+ GS+ R W FL KRG V+ Y + T + S
Sbjct: 271 NLISCCAKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASRSD 326
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + + +C+ P Y V NE I +EI+
Sbjct: 327 GRGKRHATKPCPNSFEKSNRIY-QCSPP----------------YRVSSNETEIMREIIQ 369
Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
+GP A +++DF++YK+G+Y+H ++N + E Y H+ KL GWGT G
Sbjct: 370 NGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEK 429
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 430 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 155/349 (44%), Gaps = 64/349 (18%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLI----A 57
IL LL + + S A + +I +W A + F N++E+ R LI
Sbjct: 2 ILALLLAVVCAKPLV---SRAELRRIQALNPSWVAAMPKRF-ENVTEDEFRGMLINPDRL 57
Query: 58 DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
A+ PL DP +P +FD R+++P+C + V D G+C F+A
Sbjct: 58 KARSGSMPSAPLKEINDPTDP-----LPAQFDFRDEYPHC--VSPVFDQGSCGGCWAFSA 110
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
+G F RRC + S +++ SC +N CS G F TW+FL + G+ T
Sbjct: 111 IGMFGSRRCAVGIDKAAVLYSQQHLISCST-----ENFGCSGGDFFPTWSFLTQTGATTA 165
Query: 178 G-----DYGDRTGCQPSTISPCSHHGSAPTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQ 231
DYG S + PT +C++ ++ K H YG+
Sbjct: 166 ECVKYVDYGS------------SVAAACPT--TCDDGSQIQFYKAHG------YGQ---- 201
Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
V + AI + +++ GP +Y D +Y GVY+HT + N LH+
Sbjct: 202 ----------VSKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHT-YGPISNGLHA 250
Query: 292 GKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
+++G+GT ++GT YW + N+WG WG+ G +I+RG EC E I A
Sbjct: 251 LEMVGYGTTDDGTDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYA 299
>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 298
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 81/265 (30%), Positives = 120/265 (45%), Gaps = 38/265 (14%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ PD FD RE++P+C I V D G C + F++V + DRRC ++ S +
Sbjct: 71 ATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVKYSPQ 128
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
YV SC + + +C G + W FL K G+ T C P G+
Sbjct: 129 YVVSCDR-----GDMACDGGWLPSVWRFLTKTGTTT-------DECVPYQSGSTGARGTC 176
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
PT KC P Y K + Y +D D I K + GP
Sbjct: 177 PT------------KCADGSDLPIY-------KATKAVDYGLD--CDLIMKALATGGPLQ 215
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDR 319
F +Y DF +Y+ GVY+HT ++E H+ +++G+GT E YW++ N+WGP WG+
Sbjct: 216 TAFTVYSDFMYYEGGVYQHT-YGRVEGG-HAVEMVGYGTDEYDVDYWIIRNSWGPDWGED 273
Query: 320 GTVKILRGKYECAFEYLIAAGKPKN 344
G +I+R EC E + G +N
Sbjct: 274 GYFRIIRMTNECGIEEQVIGGFFEN 298
>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
Length = 463
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 124/267 (46%), Gaps = 37/267 (13%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDA WP G IG V D G C + + SDR I SKG++ L+ + +
Sbjct: 186 LPTHFDATTYWP--GFIGEVKDQGWCGSSWALSTASVASDRFAILSKGREIVQLAPQQII 243
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT- 202
SC + ++ CS G + WN++ K G+V Y + I P +A
Sbjct: 244 SCVR-----RSQGCSGGHLDTAWNYVRKVGTVNDECYPYISAQNACKIRPSDTLITANCD 298
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
LP+ KV + + P + +NE I EI HGP A
Sbjct: 299 LPT----KVDRTNMYK--MGPAFSL----------------NNETDIMIEIKKHGPVQAI 336
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENY---LHSGKLIGWGTE-NG---TPYWLVINTWGPH 315
++ DF+ YKSG+Y+H++ + + HS +LIGWG E NG T YW+ +N+WG
Sbjct: 337 LRVHRDFFSYKSGIYRHSAASSAGDERAGYHSVRLIGWGEERNGYETTKYWVAVNSWGRW 396
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG+ G +I+RG+ EC E + A P
Sbjct: 397 WGENGRFRIVRGQNECEIESYVLASLP 423
>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
Length = 283
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 99/339 (29%), Positives = 152/339 (44%), Gaps = 63/339 (18%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
+++F L L+ GE ++ INR A TW+A EY R +I A+
Sbjct: 1 MIIFFL-VVLISGE------PLVNIINRNPAATWSA---------HEYSRD-IITRARLT 43
Query: 63 DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
+ + G + + E S VP+ FDAR++WPN I V D C + F+ +
Sbjct: 44 LLAPLAI-GPVEKFTIEDSFYVPESFDARDEWPN--AILPVRDQEKCGSCWAFSIAESLG 100
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK-SCSHGSVFRTWNFLHKRGSVTGGDYG 181
DR I G+ + LS + + SC D N C+ G +W ++ G T
Sbjct: 101 DRFGILGCGKGH--LSPQDLISC------DSNDLGCNGGYQENSWTWVLTTGITT----- 147
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
C P + + +PSC + RC N + R T+ +
Sbjct: 148 --ESCWP-------YRSGSGRIPSCPH----------RCVNGSV-------LQRNTINNY 181
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
+ ++ E+ +GP T+ +Y+DF++Y G+YKH S K+ H+ L+GWG E+
Sbjct: 182 RRLDSSELQDELYNNGPIQVTYVVYEDFFYYSKGIYKHLSGNKVGG--HAVVLMGWGIED 239
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
G YWLV N+WG WG++G +ILRG EC E AG
Sbjct: 240 GVKYWLVQNSWGYEWGEQGYFRILRGSNECGIESSAYAG 278
>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
scrofa]
Length = 368
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 85/276 (30%), Positives = 125/276 (45%), Gaps = 45/276 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR I+S+G+ LS + +
Sbjct: 109 LPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNL 165
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
SCC R+ C+ GS+ R W +L KRG V+ Y GC ++ S
Sbjct: 166 ISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRS--D 219
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + + +C+ P Y V NE I +EI+
Sbjct: 220 GRGKRHATKPCPNNFEKSNRIY-QCSPP----------------YRVSSNETEIMREIMQ 262
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGKLIGWGTENGT-----P 304
+GP A +++DF+HYK+G+Y+H ++ E+ H+ KL GWGT G
Sbjct: 263 NGPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGWGTLKGAQGRKEK 322
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 323 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 358
>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
Length = 196
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 67/191 (35%), Positives = 91/191 (47%), Gaps = 8/191 (4%)
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRG 173
F A A SDR CI S+G+ +S + V SCC K C C G W + K G
Sbjct: 5 FGAAEAMSDRICIASQGKTQVTISADDVLSCCGKKC----GNGCEGGYPIEAWKYWVKTG 60
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
TGG Y ++GC+P I PC HH + C + C +C Y + DK
Sbjct: 61 ICTGGSYESQSGCKPYPIPPCGHHKNQTYFGPCPTDEYDTPVCTNKCI-AAYKTPYSDDK 119
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
H T Y V I+KEI+ +GP A + +Y+DFY Y GVY HT A++ H+ +
Sbjct: 120 HYGTSAYNVAKTVAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVYTHTGGAEVGG--HAVR 177
Query: 294 LIGWGTENGTP 304
++GWG P
Sbjct: 178 ILGWGVRQQDP 188
>gi|332254560|ref|XP_003276397.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Nomascus leucogenys]
Length = 436
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 99/344 (28%), Positives = 138/344 (40%), Gaps = 53/344 (15%)
Query: 16 GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
G +Y Y D NR W AG + + L +Y + RP
Sbjct: 109 GRIYPVLGTYWDNCNR---CWQAGNH------SAFWGMTLDEGIRYRLGTMRPSSSVMNM 159
Query: 76 YDP----EYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSK 130
++ +P F+A E+WPN + H P D G CA F+ SDR I S
Sbjct: 160 HEIYTVLNPGEVLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL 216
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRT 184
G LS + + SC + C G + W FL +RG V+ G D
Sbjct: 217 GHMTPVLSPQNLLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEA 272
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
G P PC H A K + C N D ++ T Y +
Sbjct: 273 GPAP----PCMMHSRA--------MGRGKRQATAHCPNSHVNN---NDIYQVTPVYRLGS 317
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWG 298
N+ + KE++ +GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG
Sbjct: 318 NDKEVMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWG 377
Query: 299 TEN-----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
E YW N+WGP WG+RG +I+RG EC E +
Sbjct: 378 EETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 421
>gi|66911417|gb|AAH97299.1| Tubulointerstitial nephritis antigen-like 1 [Rattus norvegicus]
gi|149024087|gb|EDL80584.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024088|gb|EDL80585.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024089|gb|EDL80586.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
Length = 467
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 86/270 (31%), Positives = 118/270 (43%), Gaps = 33/270 (12%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 257
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHG 198
+ SC K C G + W FL RG V+ Y G + S C H
Sbjct: 258 LLSC----DTHHQKGCRGGRLDGAWWFLRCRGVVSDNCYPFSGREQNDEASPTPRCMMHS 313
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
A K + +RC N D ++ T Y + +E I KE++ +GP
Sbjct: 314 RA--------MGRGKRQATSRCPNSHVDS---NDIYQVTPVYRLASDEKEIMKELMENGP 362
Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWL 307
A +++DF+ Y+ G+Y HT S + E Y HS K+ GWG E YW
Sbjct: 363 VQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 422
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 423 AANSWGPWWGERGHFRIVRGTNECDIETFV 452
>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
[Equus caballus]
Length = 467
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 86/268 (32%), Positives = 122/268 (45%), Gaps = 30/268 (11%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + + C G + W FL +RG V+ D+ C P + G AP
Sbjct: 259 LLSC----DTHNQQGCRGGHLDGAWWFLRRRGVVS--DH-----CYPFSGRERDEAGPAP 307
Query: 202 -TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
+ K + C N R D ++ T Y + +E I KE++ +GP
Sbjct: 308 RCMMHSRAMGRGKRQATAHCPN---SRVHTNDIYQVTPAYRLGSSEKEIMKELMENGPVQ 364
Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLVI 309
A +++DF+ Y+ GVY HT S+ + E Y HS K+ GWG E YW
Sbjct: 365 ALMEVHEDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAA 424
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 425 NSWGPAWGERGHFRIVRGANECDIESFV 452
>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
Length = 475
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 88/274 (32%), Positives = 123/274 (44%), Gaps = 42/274 (15%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ F A +WP G D CAA F+ +DR I+S G+ LS + +
Sbjct: 217 LPEFFIASYKWP--GWTHDPLDQKNCAASWAFSTASVAADRIAIQSNGRYTVNLSPQNLI 274
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DYGDRTGCQPSTISPCSHH 197
SCC RY CS GS+ R W +L KRG V+ D GC ++ S
Sbjct: 275 SCCLKHRY----GCSGGSIDRAWWYLRKRGLVSHACYPLFKDQNSTNGCAMASRS--DGR 328
Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
G C N + K +C+ P Y V NE I KEI+ +G
Sbjct: 329 GKRHATTPCPNN-IEKSNRIYQCSPP----------------YRVSSNETQIMKEIMKNG 371
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNA--KLENY----LHSGKLIGWGTENGT-----PYW 306
P A +++DF++YK+G+Y+H ++ E Y H+ KL GWGT G +W
Sbjct: 372 PVQAIMQVHEDFFYYKTGIYRHVTSTIEDSEKYQKLRTHAVKLTGWGTLRGAKGRKEKFW 431
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+ N+WG WG+ G +ILRG E E LI A
Sbjct: 432 IAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|149030259|gb|EDL85315.1| rCG52258, isoform CRA_b [Rattus norvegicus]
Length = 210
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 71/197 (36%), Positives = 97/197 (49%), Gaps = 12/197 (6%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
+ SD I+ IN++ TW AGRNF N+ YL++ P +R +
Sbjct: 22 SFHPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLC---GTVLGGPKLP---ERVGF 74
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
+ + +P+ FDAREQW NC TI + D G+C + F AV A SDR CI + G+ N
Sbjct: 75 SEDIN--LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVE 132
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
+S E + +CC I D C+ G WNF ++G V+GG Y GC P TI PC H
Sbjct: 133 VSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 197 HGSAPTLPSCENQKVPK 213
H + P PK
Sbjct: 190 HVNGSRPPCTGEGDTPK 206
>gi|332254558|ref|XP_003276396.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Nomascus leucogenys]
Length = 467
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 310
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + C N D ++ T Y + N+ + KE++
Sbjct: 311 MHSRA--------MGRGKRQATAHCPNSHVNN---NDIYQVTPVYRLGSNDKEVMKELME 359
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 419
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452
>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
Length = 194
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 98/188 (52%), Gaps = 8/188 (4%)
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
++ A SDR CI SKG + +S + + SCC C Y C G + W F + G
Sbjct: 5 VSSAAAMSDRICIASKGVKQVLISAQDMVSCCSYCGY----GCDGGWPIKAWQFFAREGV 60
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDK 233
VTGG+YG + C+P I+PC HHG P C ++ + P+ C +C + Y + +DK
Sbjct: 61 VTGGNYGRQGCCRPYEITPCGHHGREPYYGECYDDAQTPR--CKRKCQS-GYKTTYKKDK 117
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
Y + ++ AI++EI+ HGP A + +Y+DF +Y G+YKHT+ + +
Sbjct: 118 RYGRKAYQLPNSVKAIQREIMMHGPVVAGYTVYEDFSYYTKGIYKHTAGRETGGHAVKNN 177
Query: 294 LIGWGTEN 301
+G G N
Sbjct: 178 WMGQGKGN 185
>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Nasonia vitripennis]
Length = 481
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 90/286 (31%), Positives = 125/286 (43%), Gaps = 45/286 (15%)
Query: 64 QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
QS R + + Y+P +P FD+R QW N I V D G C A + V SD
Sbjct: 217 QSTRQMLPVTRHYNPN---DLPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVASD 271
Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--- 180
R I SKG + LS +++ SC + C G + R W F+ K G V Y
Sbjct: 272 RFAIMSKGIEKVQLSGQHLISC----NNRGQRGCKGGYLDRAWLFMRKFGVVDEDCYPWL 327
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
G C+ S G C+ + L+ P Y G
Sbjct: 328 SGRSDKCRIPRRGKLSDAG-------CQRRNSYNLRNEMYKVGPAYRLG----------- 369
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTS--NAKLENYLHSGKLIGW 297
NE I +EIL GP AT ++ DF+HY+SG+Y H+ + + Y HS +++GW
Sbjct: 370 -----NETDIMQEILTSGPVQATMRVHRDFFHYESGIYVHSRPFDTRQSGY-HSVRIVGW 423
Query: 298 GTE----NGTP--YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
G E NG P +W V N+WG WG+ G +I+RG EC E +
Sbjct: 424 GEEPSPYNGKPIKFWRVANSWGRDWGEDGYFRIVRGNNECEIESFV 469
>gi|294952611|ref|XP_002787376.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239902348|gb|EER19172.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 203
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 95/190 (50%), Gaps = 24/190 (12%)
Query: 154 NKSCSHGSVFRTWNFLHKRGSVTGGDY------GDRTGCQPSTISPCSHHGSAPTLPSCE 207
+K C+ G+ +FL G VTG D+ + GC P C+H PT E
Sbjct: 9 SKGCNGGTFVEAMSFLEDYGVVTGNDFKPQGQLSEADGCWPYPFQKCNH---VPT----E 61
Query: 208 NQKVPKLK---------CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
N + PK K C T CTN Y + +D HR V ++ +IK+EI +GP
Sbjct: 62 NSEYPKCKDVAHQPLPPCRTTCTNKAYKKSLKKDVHRAKSWRKVFNDAQSIKQEIFDNGP 121
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
+ F +Y+DF +YKSGVY T+ L H K+IGWG ++ YWL +N+W WGD
Sbjct: 122 VFSAFKMYEDFRYYKSGVYVPTTKEVLS--FHLVKIIGWGADSVQEYWLAMNSWNEEWGD 179
Query: 319 RGTVKILRGK 328
G +K+ GK
Sbjct: 180 HGLIKMAFGK 189
>gi|255076333|ref|XP_002501841.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226517105|gb|ACO63099.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 359
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 86/257 (33%), Positives = 117/257 (45%), Gaps = 26/257 (10%)
Query: 84 VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P FDAR++WP C IG V D G C + A +DR CI S G + R LS +Y
Sbjct: 105 LPLNFDARQKWPQCRAIIGTVRDQGKCGSCWAVATAEVMNDRLCIASGGAEQRELSPQYP 164
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG-DRTGCQPSTISPCSHHGSAP 201
SC YD C G V + +G V GG +T C P PC H
Sbjct: 165 LSC-----YDGGSGCQGGDVAVAMHEATTKGMVFGGMLNRSKTACLPYEFEPCEH----- 214
Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL----TYWVDDNEDA-IKKEILAH 256
C+ Q V +C + T F+ + Y N+ A I +EI+ +
Sbjct: 215 ---PCQVQGVIPHECPAHVDDGTCLGNTFKLADQKVFPKSDVYTCPPNDWACIAQEIMTY 271
Query: 257 GPTTATFA-LYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGT--PYWLVINT 311
GP TF ++ DFY Y +GVY K E L H+ KLIGWG + T PYWL++N+
Sbjct: 272 GPVAVTFGTVHSDFYGYHAGVYTVREEDKNEEGLGMHATKLIGWGFDEATGHPYWLMMNS 331
Query: 312 WGPHWGDRGTVKILRGK 328
W +WG G ++ G+
Sbjct: 332 W-DNWGIHGLGRVGVGE 347
>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
Length = 476
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/276 (31%), Positives = 126/276 (45%), Gaps = 45/276 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR I+S+G+ LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
SCC R+ C+ GSV R W +L KRG V+ Y GC ++ S
Sbjct: 274 ISCCAKKRH----GCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRS--D 327
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + K +C+ P Y V NE I +EI+
Sbjct: 328 GRGKRHATTPCPN-SIEKSNRIYQCSPP----------------YRVSSNETEIMREIMQ 370
Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
+GP A +++DF++YK+G+Y+H ++N E Y H+ KL GWGT G
Sbjct: 371 NGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEK 430
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 431 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
Flags: Precursor
gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
Length = 452
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 129/266 (48%), Gaps = 36/266 (13%)
Query: 84 VPDRFDAREQWPNCGTIGH-VPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ FDAR++W G + H V D G C + + SDR I S+G+ N LS++ +
Sbjct: 184 LPEHFDARDKW---GPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQL 240
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SC K C G + R W ++ K G V GD+ C P +S S
Sbjct: 241 LSC----NQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYP-YVSGQSREPGHCL 288
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
+P + L+C + + T + T Y V E+ I+ E++ +GP AT
Sbjct: 289 IPKRDYTNRQGLRCPSGSQDST--------AFKMTPPYKVSSREEDIQTELMTNGPVQAT 340
Query: 263 FALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINT 311
F +++DF+ Y GVY+H+ +++ E Y HS +++GWG ++ T YWL N+
Sbjct: 341 FVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGY-HSVRVLGWGVDHSTGKPIKYWLCANS 399
Query: 312 WGPHWGDRGTVKILRGKYECAFEYLI 337
WG WG+ G K+LRG+ C E +
Sbjct: 400 WGTQWGEDGYFKVLRGENHCEIESFV 425
>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
Length = 362
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/298 (30%), Positives = 124/298 (41%), Gaps = 45/298 (15%)
Query: 45 NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP 104
N++ LR L + D P + + E +P FDAR QW C + +
Sbjct: 106 NMTISQLRDNLFGLSLMSSDEDTP-----RMANIETRIDIPMNFDARTQWKGC--VPAIR 158
Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
D C A F+A + R CI + GQ N LS EY C + NK+C G +
Sbjct: 159 DQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM-----NKACQGGYLKY 213
Query: 165 TWNFLHKRGSVTGGDYGDRTGCQP-STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNP 223
+W FL G+ C P ++ G+ PT + + K K N
Sbjct: 214 SWTFLENTGT-------PLDSCIPYASGRGTFSSGTCPTQCKIASMSMSKYKAK----NT 262
Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
Y G + IK I+ +G A F +Y D YKSGVYKH N
Sbjct: 263 VYISGI-----------------NNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHIENT 305
Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
L H+ LIG+G E G+ YWL N+WGP+WG G KI +G E E + AG+
Sbjct: 306 VLGG--HAVALIGFGVEGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAGE 359
>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 198
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 95/200 (47%), Gaps = 8/200 (4%)
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRG 173
+A SDR CI S + +S + + +CC +C C+ G W K+G
Sbjct: 5 VSAAETISDRICIASNAKTILSISADDINACCGMVC----GNGCNGGYPIEAWRHYVKKG 60
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
VTGG Y D+TGC+P PC HH + C + P + + +D
Sbjct: 61 YVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQNANALGKLDIALTYHKDL 120
Query: 234 HRTTLTYWVDDNEDA-IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
H T+ + E A I K I HG +++DF HY GVY HT+ A L H+
Sbjct: 121 HFRTILHTPASKEAAGIPKGIKTHGQLRGGITVFEDFEHYSGGVYVHTAGASLGG--HAV 178
Query: 293 KLIGWGTENGTPYWLVINTW 312
K++GWG +NGTPYWL+ N+W
Sbjct: 179 KMLGWGVDNGTPYWLIANSW 198
>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 145
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/150 (42%), Positives = 84/150 (56%), Gaps = 6/150 (4%)
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
PC H SA P C N+ +C +C NP YG + +D H+ T Y + KE
Sbjct: 1 PCQHTESAVENP-CSNKTFFTPECKVQCYNPDYGTRYVKDNHKGT-QYRIPGY--TAMKE 56
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
I +GP TA+F +Y DF +Y+SGVY S + + K++GWG ENGTPYWL N++
Sbjct: 57 IYENGPITASFYMYQDFVNYQSGVYAFNSGKYVTT--QAVKILGWGEENGTPYWLAANSF 114
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+WGD G VKILRG EC E + AG P
Sbjct: 115 NTYWGDNGFVKILRGANECYIEEFMYAGLP 144
>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
[Tribolium castaneum]
Length = 453
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 124/272 (45%), Gaps = 39/272 (14%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
R+ YDP ++P FD+ +WP G + + D G C + SDR I SKG+
Sbjct: 197 RRIYDPN---SLPREFDSEFKWP--GWMSEIQDQGWCGSSWAITTAAVASDRFAILSKGR 251
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
+ LS +++ SC + +SC+ G + R W+++ K G V +
Sbjct: 252 EKVTLSAQHLLSCDR----RGQQSCNGGYLDRAWSYIRKIGLVDEQCF------------ 295
Query: 193 PCSHHGSAPTLPSCENQKVPKLK--CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
P + E ++P+ C PT + K++ Y V NE I
Sbjct: 296 --------PYSATNEKCRIPRRGDLVTANCQLPTNVDR--RSKYKVAPAYRVG-NETDIM 344
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY-LHSGKLIGWGTENG----TPY 305
EIL GP AT +Y DF+ YK G+Y+H+ + + HS +++GWG E Y
Sbjct: 345 YEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSPEGLKKY 404
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
W V N+WGP WG+ G +ILRG EC E +
Sbjct: 405 WKVANSWGPEWGENGYFRILRGSNECEIESFV 436
>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
Length = 476
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/276 (31%), Positives = 126/276 (45%), Gaps = 45/276 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR I+S+G+ LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
SCC R+ C+ GSV R W +L KRG V+ Y GC ++ S
Sbjct: 274 ISCCAKKRH----GCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRS--D 327
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + K +C+ P Y V NE I +EI+
Sbjct: 328 GRGKRHATTPCPN-SIEKSNRIYQCSPP----------------YRVSSNETEIMREIMQ 370
Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
+GP A +++DF++YK+G+Y+H ++N E Y H+ KL GWGT G
Sbjct: 371 NGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAHGQKEK 430
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 431 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
(Silurana) tropicalis]
Length = 494
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 118/262 (45%), Gaps = 23/262 (8%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P F+A E+WP G + D G CA F+ SDR I+S G + LS + +
Sbjct: 236 LPSHFNAAEKWP--GLVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLL 293
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + C G V W +L +RG V+ C P T + H SAP +
Sbjct: 294 SC----DTRNQHGCRGGRVDGAWWYLRRRGVVS-------EPCYPFTSLNTNGH-SAPCM 341
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
+ K + C N Y + +++T Y + +E I KE+ +GP A
Sbjct: 342 MQSRSMGRGKRQATNNCPNQYYSS---NEIYQSTPAYRLASSEKDIMKELYENGPVQAIM 398
Query: 264 ALYDDFYHYKSGVYKHTSNAKLE------NYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
+++DF+ YKSG+Y+HT + E + HS K+ G YWL N+WG WG
Sbjct: 399 EVHEDFFMYKSGIYRHTPVTEREPEHHRRHGTHSVKITGGRDGQTHKYWLAANSWGRDWG 458
Query: 318 DRGTVKILRGKYECAFEYLIAA 339
+ G +I RG+ EC E I
Sbjct: 459 EDGYFRIARGENECEIETFIVG 480
>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
Length = 673
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 86/323 (26%), Positives = 137/323 (42%), Gaps = 36/323 (11%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
F+ ID +N++ + N+ + + + K ++S T D +
Sbjct: 26 FTKDMIDSLNQDPSVKWEAANYDQFAGKSFAELRKLLGGKRGEESSSE-EARYNTRDVKS 84
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ +PD FD+R +WP C I + + G C + FA G FSDR CI + N +S E
Sbjct: 85 TVAIPDTFDSRTKWPQC--IHGIRNQGQCGSCWAFATTGVFSDRLCITTNNVSNVVISPE 142
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
++ C K + +C G + +W F G C P T +
Sbjct: 143 FLIECDKT-----SFACQGGYGYYSWKFFMNTGI-------PLESCVPYTKDSLVYG--- 187
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
+C + CT+ G ++ Y++ + EI+ +GP
Sbjct: 188 ---------NTTNAQCRSTCTD-----GSPLKLYKAASAYYIYSPITNYQTEIMTNGPVE 233
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGDR 319
A F +Y DFY YKSG+Y+ T+ + H+ K++GW ++ NGTPYW+ N WG WG
Sbjct: 234 ADFDVYSDFYSYKSGIYQKTAGSTYVG-GHAVKVLGWASDSNGTPYWIAQNQWGTSWGMG 292
Query: 320 GTVKILRGK--YECAFEYLIAAG 340
G I RG C F+ + AG
Sbjct: 293 GYFYIYRGNSTLNCKFDNYMIAG 315
>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
Length = 283
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 140/324 (43%), Gaps = 54/324 (16%)
Query: 21 FSDAYIDQINREANTWTAGRNFPAN-LSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
+++ + INR N+ ++PA+ +S E LR L A++ RP K
Sbjct: 10 LAESIPETINRNPNSTWVAIDYPASVISHEKLRSKL--GARFTPHRVRPYRDSNK----- 62
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
VPD FDARE+WP+ I V D G C + F+ DR + G ++
Sbjct: 63 ----VPDTFDAREKWPD--AILPVRDQGECGSCWAFSIAETIGDR--LGVLGCSRGDIAP 114
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
E + SC +DD C G + W++ + G T C + +
Sbjct: 115 EDLVSCDI---FDDG--CDGGFIDMAWDWCQENGLTT---------------EECIPYKA 154
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
+PS C C + G + RT + + + D I+ EI +GP
Sbjct: 155 GEGVPS---------PCPETCED---GSAIY----RTPIESYRYIDADDIQGEIYEYGPV 198
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
+ F +Y DF YKSGVY H A H+ ++GWG E+ PYWLV N+WG WG+
Sbjct: 199 SMGFIVYSDFMSYKSGVYVH--QAGYIEGGHAVLIVGWGVEDEVPYWLVQNSWGTDWGEN 256
Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
G KILRG C E + AG P+
Sbjct: 257 GFFKILRGSDHCECESNVTAGYPE 280
>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
Length = 180
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/189 (34%), Positives = 92/189 (48%), Gaps = 9/189 (4%)
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
AV A SDR CI S G N+ LS + SCC+ C + C G W++ G V
Sbjct: 1 GAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCENCGF----GCRGGYPAVAWDYWKTHGIV 56
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
TGG D +GC+ C HH P C + P +C +C P G + +DK R
Sbjct: 57 TGGSKEDPSGCRSYPFPKCEHHVQG-HYPPCPRELYPTPECVQQCDTPDVG--YLEDKTR 113
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
++Y + +E +I KEI+ GP A F +Y+DF Y SGVY H A + H+ +++
Sbjct: 114 ANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSG--HAVRIL 171
Query: 296 GWGTENGTP 304
GWG P
Sbjct: 172 GWGELGNVP 180
>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
familiaris]
Length = 467
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/268 (31%), Positives = 120/268 (44%), Gaps = 30/268 (11%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 VLPTAFEAAEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + + C G + W FL +RG V+ D+ C P G AP
Sbjct: 259 LLSC----DTHNQQGCRGGRLDGAWWFLRRRGVVS--DH-----CYPFVGREQDEAGPAP 307
Query: 202 -TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
+ K + RC + D ++ T Y + NE I KE++ +GP
Sbjct: 308 RCMMHSRAMGRGKRQATARCPSSHV---HANDIYQVTPAYRLGTNEKEIMKELMENGPVQ 364
Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLVI 309
A +++DF+ Y+ G+Y HT S + E Y HS K+ GWG E YW
Sbjct: 365 ALMEVHEDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAA 424
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 425 NSWGPAWGERGHFRIVRGANECDIESFV 452
>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
Length = 236
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/265 (31%), Positives = 128/265 (48%), Gaps = 44/265 (16%)
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
A VP FD+R +WP+C + + + C + F+A SDR CI S G+ + LS +Y
Sbjct: 12 AAVP-AFDSRTKWPHC--VHPIRNQEQCGSCWAFSASEVLSDRFCIASGGKVDVVLSPQY 68
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
+ SC + C G + W FL G + D+ C P T S +G
Sbjct: 69 MVSC-----DSTDYGCDGGYLNNAWAFLAGTGIPS-----DK--CAPYT----SQNGDVA 112
Query: 202 TLPS-CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
PS C++ KL ++ K+ L ++ +I +++ +GP
Sbjct: 113 ACPSKCQDGSSVKL---------------YKAKNPQQL-----NDIPSIMEDMQQNGPVQ 152
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT--PYWLVINTWGPHWGD 318
A F++Y DF YKSGVY H S + L H+ K++GWG ++ T PYW++ N+WGP WG
Sbjct: 153 AAFSVYRDFMSYKSGVYHHVSGSLLGG--HAIKMVGWGVDSATNKPYWIIANSWGPSWGL 210
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G ILRG EC E + +G+ +
Sbjct: 211 NGFFWILRGSDECGIEDNVWSGQAQ 235
>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 73/225 (32%), Positives = 111/225 (49%), Gaps = 20/225 (8%)
Query: 88 FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
FDA E WP C TI + D +C + AA A SDR C G ++ +S + SCC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDLRISAGDLMSCCD 59
Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
+C Y C+ G W + G V+ +Y CQP C+HH ++ L C
Sbjct: 60 VCGY----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVNSSDLSPCS 108
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
+ C++ CT+ + K+R +Y + E++ K+E+L +GP +F++Y
Sbjct: 109 GE-YDTPTCNSTCTD----KKVPLIKYRGNTSYLLS-GEESFKRELLLNGPFEVSFSVYA 162
Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
DF Y GVYKH + L H+ +++GWG NG PYW + N+W
Sbjct: 163 DFLAYTGGVYKHVAGTFLGG--HAVRIVGWGELNGEPYWKIANSW 205
>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 73/225 (32%), Positives = 111/225 (49%), Gaps = 20/225 (8%)
Query: 88 FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
FDA E WP C TI + D +C + AA A SDR C G ++ +S + SCC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDLRISAGDLMSCCD 59
Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
+C Y C+ G W + G V+ +Y CQP C+HH ++ L C
Sbjct: 60 VCGY----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVNSSDLSPCS 108
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
+ C++ CT+ + K+R +Y + E++ K+E+L +GP +F++Y
Sbjct: 109 GE-YDTPTCNSTCTD----KKVPLIKYRGNTSYLLS-GEESFKRELLLNGPFEVSFSVYA 162
Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
DF Y GVYKH + L H+ +++GWG NG PYW + N+W
Sbjct: 163 DFLAYTGGVYKHVAGIFLGG--HAVRIVGWGELNGEPYWKIANSW 205
>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
Length = 339
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/267 (32%), Positives = 129/267 (48%), Gaps = 47/267 (17%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR++WP+ I + D G CA+ + +DR + ++G+QN LS +
Sbjct: 80 LPTSFDARQKWPD--FIHPIQDQGDCASSWAQSTAATSADRLALITEGRQNVALSAQQFL 137
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP--C----SHH 197
SC + K C G + R W ++ K G V+ Y +G +T P C S H
Sbjct: 138 SCNQ----HRQKGCEGGYLDRAWWYIRKFGVVSEECYPYISG---TTRKPEICYMQKSKH 190
Query: 198 GSAPTLPSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
+ PS N +V +RTT +Y V E I EIL +
Sbjct: 191 ANGRQCPSGHPNSRV----------------------YRTTPSYRVSSREQDIMSEILTN 228
Query: 257 GPTTATFALYDDFYHYKSGVYKH--TSNAKLENYLHSGKLIGWGTE--NGTP--YWLVIN 310
GP ATF ++ DF + +GVYKH T ++E Y HS +L+GWG + G P YW+ N
Sbjct: 229 GPVQATFRVHGDF--FIAGVYKHLPTVGEEIEGY-HSVRLLGWGEDYSTGIPVKYWIAAN 285
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLI 337
+WG +WG+ GT +ILRG+ C E +
Sbjct: 286 SWGTNWGENGTFRILRGENHCEIESFV 312
>gi|145509603|ref|XP_001440740.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407968|emb|CAK73343.1| unnamed protein product [Paramecium tetraurelia]
Length = 357
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 127/288 (44%), Gaps = 45/288 (15%)
Query: 49 EYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGA 108
++ + + +DAK+ + G + PE +P+ ++ RE P C + + G
Sbjct: 97 DFFKDWKFSDAKFIFNNHLTFKG-KIPQCPESGVIIPESYNFREVQPECAQ--PIYNQGN 153
Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKS--CSHGSVFRTW 166
C++ + AAV A SDR C G+ LS + SC DNK+ C GSV R
Sbjct: 154 CSSSYSIAAVSATSDRLCKVRNGEFQDQLSPQSPISC-------DNKNYRCGGGSVTRVL 206
Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
K+G VT T C P T + + +CE K+
Sbjct: 207 EVGKKQGFVT-------TSCLPYTGTEDAKDNCDALFTNCEKYKI--------------- 244
Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
QD Y V +E+ IK+EIL +GP A ++ DF YK G+Y+ +
Sbjct: 245 ----QD-------YCVISSEENIKREILNNGPVVAVIQVFKDFLVYKGGIYEVVEGSSKF 293
Query: 287 NYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
Y H+ K+IGWG ++G YW++ N+WG WG +G + G+ + E
Sbjct: 294 QYGHAVKVIGWGKQDGVNYWVIENSWGDSWGLKGLAYVAVGQNQLQLE 341
>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
Length = 330
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/332 (30%), Positives = 141/332 (42%), Gaps = 56/332 (16%)
Query: 26 IDQINREANT-WTAGR-NFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
I+QIN + ++ WTAG ++ + R ++ D S+ P+ K +
Sbjct: 38 IEQINSDKDSLWTAGETEIFKGMTMKEFRSSMLGLRLDRDYSEVPV----KVHSSTALKD 93
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ F+ E WPN + + D C + FAA SDR I S G N+ LS E +
Sbjct: 94 LPESFNCYENWPN--YMHPIRDQARCGSCWAFAASEVLSDRFAIASNGTVNKILSPEDLV 151
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC K + C G + + W++L G VT C P + G AP+
Sbjct: 152 SCDK-----GDMGCQGGYLDKAWDYLKTNGIVT-------ESCFPYA----AQKGVAPS- 194
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C C + G K++ + Y + ED I KEI +GP A F
Sbjct: 195 ------------CRISCVD-----GEPYKKYKASDYYQLTTEED-IMKEIYLNGPVEAGF 236
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-------NGTPYWLVINTWGPHW 316
+Y F YKSGVY H +E H+ K++GWG E T YW+ N+W W
Sbjct: 237 RVYTSFMSYKSGVYHHRILDIMEGG-HAIKIVGWGVEPPKRFWQKPTKYWICANSWTADW 295
Query: 317 GDRGTVKILRGK-----YECAFEYLIAAGKPK 343
G G KI RGK EC E + AG PK
Sbjct: 296 GMNGFFKIRRGKNRFGQSECGIEDQVFAGHPK 327
>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
Length = 207
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 75/225 (33%), Positives = 106/225 (47%), Gaps = 21/225 (9%)
Query: 88 FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
FDA E WPNC TI + D C + AA A SDR C + G ++ +S + SCC
Sbjct: 1 FDAGEAWPNCPTITEIRDQSGCGSCWAVAARSAMSDRYCTRG-GVRDLRISAGDLLSCCN 59
Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
C C+ G W + + G V+ CQP PC+HH ++ C
Sbjct: 60 AC----GLGCNGGDPDWAWLYYVETGIVS-------EFCQPYPFPPCAHHVNSTHYTPCS 108
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
+ C+ CTN K++ ++Y + ED K+E+ +GP F +Y+
Sbjct: 109 VEYDTPF-CNITCTNT-----IPPIKYKGRISYSLSGEED-YKRELFLYGPFEVAFTVYE 161
Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
DF Y GVYKH S L H+ +L+GWG NGTPYW + N+W
Sbjct: 162 DFVAYSDGVYKHFSGNALGG--HAVRLVGWGNLNGTPYWKIANSW 204
>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Saimiri boliviensis boliviensis]
Length = 436
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 120/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 171 ALPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 227
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D+ G P PC
Sbjct: 228 LLSC----NTHHQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDKAGPAP----PCM 279
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + C N G + ++ T Y + N+ I KE++
Sbjct: 280 MHSRA--------MGRGKRQATAHCPN---GHVNNNNIYQVTPAYRLGSNDTEIMKELME 328
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT + + E Y HS K+ GWG E
Sbjct: 329 NGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRPDGRKLK 388
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 389 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 421
>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Saimiri boliviensis boliviensis]
Length = 467
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 120/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 ALPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D+ G P PC
Sbjct: 259 LLSC----NTHHQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDKAGPAP----PCM 310
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + C N G + ++ T Y + N+ I KE++
Sbjct: 311 MHSRA--------MGRGKRQATAHCPN---GHVNNNNIYQVTPAYRLGSNDTEIMKELME 359
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT + + E Y HS K+ GWG E
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRPDGRKLK 419
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452
>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 154/350 (44%), Gaps = 64/350 (18%)
Query: 4 ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLI----A 57
IL LL + + S A + +I W A + F N++E+ R LI
Sbjct: 2 ILALLLAVVCAKPLV---SRAELRRIQALNPPWVAAMPKRF-ENVTEDEFRGMLINPDRL 57
Query: 58 DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
A+ PL DP +P +FD R+++P+C + V D G+C F+A
Sbjct: 58 KARSGSMPSAPLKEINDPTDP-----LPAQFDFRDEYPHC--VSPVFDQGSCGGCWAFSA 110
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
+G F RRC + S +++ SC +N CS G F TW+FL + G+ T
Sbjct: 111 IGMFGSRRCAVGIDKAAVLYSQQHLISCST-----ENFGCSGGDFFPTWSFLTQTGATTA 165
Query: 178 G-----DYGDRTGCQPSTISPCSHHGSAPTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQ 231
DYG S + PT +C++ ++ K H YG+
Sbjct: 166 ECVKYVDYGS------------SVAAACPT--TCDDGSQIQFYKAHG------YGQ---- 201
Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
+ + AI + +++ GP +Y D +Y GVY+HT + N LH+
Sbjct: 202 ----------LSKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHT-YGPISNGLHA 250
Query: 292 GKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+++G+GT ++GT YW + N+WG WG+ G +I+RG EC E I A
Sbjct: 251 LEMVGYGTTDDGTDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300
>gi|157058761|gb|ABV03138.1| cathepsin B-84 [Myzus persicae]
Length = 220
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 115/235 (48%), Gaps = 19/235 (8%)
Query: 35 TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQW 94
TW A +NFP N E + + L+ + + P+ + Y + VP+ FD+R +W
Sbjct: 1 TWKAKQNFPENTPREDIVR-LLGSKRLLGLNKSPIKENDILYVD--NGEVPEFFDSRLEW 57
Query: 95 PNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDN 154
NC TIG V + G C + GAF+DR CI + G+ N +S E + CC C +
Sbjct: 58 KNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATDGEFNELISAEELTFCCHTCGF--- 114
Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH----HGSAPTLPSCENQK 210
C+ G+ + W + + G VTGG+Y GCQPS + PC H S P+ N K
Sbjct: 115 -GCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPSRVPPCVRDDEGHNSCSGQPTERNHK 173
Query: 211 VPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFAL 265
K KC+ T + ++ ++T Y++ + ++K+ + +GP A+F +
Sbjct: 174 CSK-KCYGDET-----INYKKNHYKTKDAYYLSNT--TMQKDTMVYGPIEASFDV 220
>gi|157058775|gb|ABV03145.1| cathepsin B-16D [Myzus persicae]
Length = 236
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/240 (30%), Positives = 112/240 (46%), Gaps = 14/240 (5%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
L + + + Y +ID IN +A TW AG NF S+E++ + L ++
Sbjct: 3 LSVIFVSVYMTEQAYFLEKDFIDNINEQATTWKAGVNFDPKTSKEHIMKLL--GSRGVQI 60
Query: 65 SDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
++ K+ D +Y+ T +P FDAR +W +C TIG V D G C + A AF+D
Sbjct: 61 PNKNNMNLYKSEDADYNNTYIPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSAFAD 120
Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
R C+ + N LS E + CC C + C+ G + W K+G VTGGDY
Sbjct: 121 RLCVATNADFNELLSAEEITFCCHTCGF----GCNGGYPIKAWKRFSKKGLVTGGDYKSG 176
Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTLTYW 241
GC+P + PC + +C + ++ + RCT YG F + HR T Y+
Sbjct: 177 EGCEPYRVPPCPNDDQGNN--TCAGK---PMESNHRCTRMCYGDQDLDFDEDHRYTRDYY 231
>gi|145514872|ref|XP_001443341.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410719|emb|CAK75944.1| unnamed protein product [Paramecium tetraurelia]
Length = 358
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 80/286 (27%), Positives = 123/286 (43%), Gaps = 41/286 (14%)
Query: 49 EYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGA 108
++ + + +DAK+ + G + PE +P+ ++ RE P C + G
Sbjct: 97 DFFKDWKFSDAKFIFNNHLTFKGKIQQC-PESGVIIPESYNFREAQPECAQPIYF--QGN 153
Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
C++ + AAV A SDR C G+ LS + SC D N C GSV R
Sbjct: 154 CSSSYSIAAVSATSDRLCKSKNGEFQDQLSPQSPISC-----DDKNYKCGGGSVTRVLEV 208
Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG 228
K+G V+ T C P + + + + +CE K K H C
Sbjct: 209 GKKQGFVS-------TSCLPYSGTEDAKNNCDALFSNCE-----KYKIHDYC-------- 248
Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
V E+ IK+EIL +GP A ++ DF YK GVY+ + Y
Sbjct: 249 -------------VVSGEENIKREILNNGPIVAVIQVFKDFLVYKGGVYEVVEGSSKFQY 295
Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
H+ K+IGWG ++G YW++ N+WG WG +G + G+ + E
Sbjct: 296 GHAVKVIGWGKQDGVNYWVIENSWGDSWGLKGLAYVAVGQNQLQLE 341
>gi|157058755|gb|ABV03135.1| cathepsin B-84 [Aulacorthum solani]
Length = 218
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/233 (30%), Positives = 115/233 (49%), Gaps = 19/233 (8%)
Query: 38 AGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNC 97
A +NFP N +E + + L+ + P+ + + Y ++ VP+ FD+R +W C
Sbjct: 1 AKQNFPENTPKEQIVR-LLGSKRLLGVPKSPIKENDEFYMD--NSEVPEFFDSRLEWKYC 57
Query: 98 GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSC 157
TIGHV + G C + GAF+DR C+ + G+ N+ +S E V CC C + C
Sbjct: 58 KTIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEVNQLISAEEVTFCCHRCGF----GC 113
Query: 158 SHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH----HGSAPTLPSCENQKVPK 213
+ G+ R W + + G VTGGDY GCQP + PC H S P+ N K K
Sbjct: 114 NGGNPLRAWQYFKRHGVVTGGDYNTTDGCQPYRVPPCVKDDKGHNSCSGQPTERNHKCSK 173
Query: 214 LKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALY 266
KC+ T + D ++T Y++ + ++K+ + +GP A+F +Y
Sbjct: 174 -KCYGDDT-----VDYKSDHYKTKDAYYLSNT--TMQKDTMVYGPIEASFDVY 218
>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
Length = 121
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 56/119 (47%), Positives = 72/119 (60%), Gaps = 2/119 (1%)
Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
Y + DK + Y V N++AI KE++ HGP F +Y DF +YKSGVY+H S A
Sbjct: 2 YNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGAL 61
Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
L H+ +L+GWG EN PYWL+ N+W WGD G KI+RGK EC E + AG PK
Sbjct: 62 LGG--HAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPK 118
>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
griseus]
Length = 475
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 127/276 (46%), Gaps = 41/276 (14%)
Query: 82 ATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
A +P+ F + +WP H P D CAA F+ +DR I+S+G+ LS +
Sbjct: 214 ADLPEVFISSYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSRGRYTANLSPQ 270
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-----GDRTGCQPSTISPCS 195
+ SCC R+ C+ GS+ R W FL KRG V+ Y + T + S
Sbjct: 271 NLISCCAKKRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASRSD 326
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + + +C+ P Y V NE I +EI+
Sbjct: 327 GRGKRHATKPCPNSFEKSNRIY-QCSPP----------------YRVSSNETEIMREIIR 369
Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENYL----HSGKLIGWGTENGT-----P 304
+GP A +++DF++YK+G+Y+H ++N + E Y H+ KL GWGT G
Sbjct: 370 NGPVQAIMQVHEDFFYYKTGIYRHVISTNEESEKYRKLRSHAVKLTGWGTLRGAGGKKEK 429
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 430 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
Length = 812
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 95/324 (29%), Positives = 144/324 (44%), Gaps = 59/324 (18%)
Query: 23 DAYIDQINREANTWTAGRNFP-ANLSEEYLRQFLIAD-----AKYFDQSDRPLPGDRKTY 76
+ +++ +N+E +W AG N A ++ ++ L AD A+Y G+ ++
Sbjct: 280 EQHVNYLNQEEMSWKAGVNERFAGMTYADVKGLLGADTSPHIAEYL--------GETRSQ 331
Query: 77 DPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
D + T VP F+A QW G + + D C + F+A SDR I Q N+
Sbjct: 332 DFYDNITDVPSEFNAVTQWK--GLVQPIRDQQQCGSCWAFSAAEVLSDRNAI----QHNK 385
Query: 136 P---LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
LS E + SC ++ ++ C+ G++ W +L G VT C P T
Sbjct: 386 AEPVLSPEDLVSCDRV-----DQGCNGGNLGTAWTYLKNTGIVT-------DACFPYT-- 431
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
+ G AP KC T C + G K++ Y V+ E+ ++KE
Sbjct: 432 --AGGGDAP-------------KCETSCKD-----GSSWTKYKAASAYAVNGVEN-MQKE 470
Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
I+ HGP F +Y F YKSGVY + H+ K++GWGTE G YWLV N+W
Sbjct: 471 IMTHGPIQVAFNVYKSFMSYKSGVYAKKWYELMPEGGHAVKIVGWGTEGGKDYWLVANSW 530
Query: 313 GPHWGDRGTVKILRGKYECAFEYL 336
WGD G KI G + + +
Sbjct: 531 NTSWGDEGYFKIAVGAESISLDVV 554
>gi|145513975|ref|XP_001442898.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410259|emb|CAK75501.1| unnamed protein product [Paramecium tetraurelia]
Length = 358
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 80/286 (27%), Positives = 123/286 (43%), Gaps = 41/286 (14%)
Query: 49 EYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGA 108
++ + + +DAK+ + G + PE +P+ ++ RE P C + G
Sbjct: 97 DFFKDWKFSDAKFIFNNHLTFKGKIQQC-PESGVIIPESYNFREAQPECAQPIYF--QGN 153
Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
C++ + AAV A SDR C G+ LS + SC D N C GSV R
Sbjct: 154 CSSSYSIAAVSATSDRLCKSKNGEFQDQLSPQSPISC-----DDKNYKCGGGSVTRVLEV 208
Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG 228
K+G V+ T C P + + + + +CE K K H C
Sbjct: 209 GKKQGFVS-------TSCLPYSGTEDAKNNCDALFSNCE-----KYKIHDYC-------- 248
Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
V E+ IK+EIL +GP A ++ DF YK GVY+ + Y
Sbjct: 249 -------------VVSGEENIKREILNNGPIVAVIQVFKDFLVYKGGVYEVVEGSSKFQY 295
Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
H+ K+IGWG ++G YW++ N+WG WG +G + G+ + E
Sbjct: 296 GHAVKVIGWGKQDGVNYWVIENSWGDTWGLKGLAYVAVGQNQLQLE 341
>gi|146386348|gb|ABQ23962.1| cathepsin B [Oryctolagus cuniculus]
Length = 228
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 83/244 (34%), Positives = 118/244 (48%), Gaps = 19/244 (7%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
+ SD ++ IN++ TW AG NF N+ YL++ L G +
Sbjct: 2 FHPLSDELVNFINKQNTTWQAGHNF-FNVEVSYLKKLC----------GTFLGGPKLPRR 50
Query: 78 PEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
E++ + P+ FDAREQWPNC TI + D G+C + F AV A SDR CI + G N
Sbjct: 51 VEFADDIKLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNV 110
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
+S E + +CC D WNF K+G V+GG Y GC+P +I PC
Sbjct: 111 EVSAEDMLTCCGGQCGDGCNGGYPSGA---WNFWTKKGLVSGGLYDSHVGCKPYSIPPCE 167
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
HH + + P+C + +C C P Y + +DKH +Y V +E+ IK EI
Sbjct: 168 HHVNG-SRPACTGEG-DTPRCSKTC-EPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYK 224
Query: 256 HGPT 259
+GP
Sbjct: 225 NGPV 228
>gi|239799410|dbj|BAH70626.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 265
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 134/286 (46%), Gaps = 39/286 (13%)
Query: 5 LVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIAD 58
++FL+ L+ L + D ID+ +T G N P ++ EE+L +++
Sbjct: 4 VLFLVSTMLLNSYLSEQATLFHDDNIIDKSVMGTDTLKVGENVGPNSVEEEHL---MLSG 60
Query: 59 AKYFDQSDRPL----PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
+ + + + +R+ + E + FDAR++WP+C TIG VP+ G
Sbjct: 61 TRGVEATSKSKMLHKTRNRRCFRVEIDHQIDQEFDARKRWPHCKTIGEVPNDGNSLLSWA 120
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSV--FRTWNFLHKR 172
+ G F+DR CI + G N+ LSTE + SC I K GSV + W +L
Sbjct: 121 YVPTGVFADRMCIATNGTYNQLLSTEELISCSGI------KEDEFGSVNDYYVWEYLKNH 174
Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--F 230
G V+GG Y GCQPS I P G+ PT S EN C RC YG +
Sbjct: 175 GLVSGGKYNTNNGCQPSKIPPI---GNLPT-GSYEN------TCEKRC----YGNNTINY 220
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD-DFYHYKSG 275
H ++ + ED I++E+ +GP + F ++D DF+ YKSG
Sbjct: 221 NQDHVKIKNHYDIEYED-IQREVQNYGPVSMAFRVFDNDFFLYKSG 265
>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
Length = 474
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 127/286 (44%), Gaps = 49/286 (17%)
Query: 79 EYSATVPDRFDARE--------QWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
E AT+P+ D E W + IG + CAA F+ +DR I+S
Sbjct: 204 EMRATLPETTDLPEFFIAFLQMAWMDSWAIG----SKNCAASWAFSTASVAADRIAIQSN 259
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-----GDRTG 185
G+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y + +
Sbjct: 260 GRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNISN 315
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
+ S G C N + K +C+ P Y V N
Sbjct: 316 NTCAMTSKADGRGKRHATRPCPN-NIEKSNRIYQCSPP----------------YRVSSN 358
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGT 299
E I KEI+ +GP A +++DF+HYK+G+Y+H ++N + E Y H+ KL GWGT
Sbjct: 359 ETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEESEKYRKLQTHAVKLTGWGT 418
Query: 300 ENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 419 LKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464
>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 475
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 89/275 (32%), Positives = 122/275 (44%), Gaps = 44/275 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR I+S G+ LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSSGRYTANLSPQNL 273
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DYGDRTGCQPSTISPCSH 196
SCC R+ C GSV R W +L KRG V+ D GC ++ S
Sbjct: 274 ISCCARKRH----GCGGGSVDRAWWYLRKRGLVSHACYPLFKDQNATNGCAMASRS--DG 327
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
G C N H +N Y + + Y V NE I KEI+ +
Sbjct: 328 RGKRHATTPCPN--------HIEKSNRIY---------QCSPPYRVSSNETQIMKEIMQN 370
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAK--LENY----LHSGKLIGWGTENGT-----PY 305
GP A +++DF+ YK+G+Y+H ++ E Y H+ KL GWGT G +
Sbjct: 371 GPVQAIMKVHEDFFSYKTGIYRHVTSTSEDSEKYQKLRTHAVKLTGWGTLKGARGKKEKF 430
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
W+ N+WG WG+ G KILRG E E LI A
Sbjct: 431 WIAANSWGKSWGENGYFKILRGVNESDIEKLIIAA 465
>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
jacchus]
Length = 467
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 84/269 (31%), Positives = 119/269 (44%), Gaps = 32/269 (11%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG--DRTGCQPSTISPCSHHGS 199
+ SC + C G + W FL +RG V+ Y R + + PC H
Sbjct: 259 LLSC----NTHHQQGCRGGHLDGAWWFLRRRGVVSDHCYPFLGRERDKAGPVPPCMMHSR 314
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
A K + C N G + ++ T Y + N+ I KE++ +GP
Sbjct: 315 A--------TGRGKRQATAHCPN---GHVNNNNIYQVTPAYRLGSNDTEIMKELMENGPV 363
Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLV 308
A +++DF+ YK G+Y HT + + E Y HS K+ GWG E YW
Sbjct: 364 QALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETWPDGRKLKYWTA 423
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 424 ANSWGPAWGERGHFRIVRGVNECDIESFV 452
>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
Length = 476
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 87/276 (31%), Positives = 125/276 (45%), Gaps = 45/276 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR I+S+G+ LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
SCC R + C+ SV R W +L KRG V+ Y GC ++ S
Sbjct: 274 ISCCAKKR----RGCNSESVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRS--D 327
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + K +C+ P Y V NE I +EI+
Sbjct: 328 GRGKRHATTPCPN-SIEKSNRIYQCSPP----------------YRVSSNETEIMREIMQ 370
Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
+GP A +++DF++YK+G+Y+H ++N E Y H+ KL GWGT G
Sbjct: 371 NGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEK 430
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 431 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 73/225 (32%), Positives = 110/225 (48%), Gaps = 20/225 (8%)
Query: 88 FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
FDA E WP C TI + D +C + AA A SDR C G ++ +S + SCC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLG-GVRDLRISAGDLMSCCD 59
Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
+C Y C+ G W + G V+ +Y CQP C+HH ++ L C
Sbjct: 60 VCGY----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVNSSDLSPCS 108
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
+ C++ CT+ + K+R T + E++ K+E+L +GP +F++Y
Sbjct: 109 GE-YDTPTCNSTCTD----KKIPLIKYRGN-TSCILSGEESFKRELLLNGPFEVSFSVYA 162
Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
DF Y GVYKH + L H+ +++GWG NG PYW + N+W
Sbjct: 163 DFVAYTGGVYKHVTGVFLGG--HAVRIVGWGELNGEPYWKIANSW 205
>gi|48762481|dbj|BAD23810.1| cathepsin B-S [Tuberaphis taiwana]
Length = 182
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 107/194 (55%), Gaps = 12/194 (6%)
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
+ GAF+DR C+ + G+ N+ LS E +A C D K C G + W + +G
Sbjct: 1 STTGAFADRLCVSTGGKFNQLLSPEELA----FCCKDCGKGCGGGYPIKAWKYFRTQGVT 56
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
TGGDY + GC P + PC + T C Q P + H +C YG+ Q++++
Sbjct: 57 TGGDYDTKEGCMPYKVPPCYNKQGKNT---CGGQ--PMERNH-QCPKTCYGKTTVQNRYK 110
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
T Y V ++ I++++ +GP A+F +YDDF YKSG+Y+ T AK + HS K+I
Sbjct: 111 TKSEY-VMNSIKTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYQG-GHSIKII 168
Query: 296 GWGTENGTPYWLVI 309
GWG +NGTPYWL +
Sbjct: 169 GWGQQNGTPYWLAV 182
>gi|157058773|gb|ABV03144.1| cathepsin B-16D [Sitobion avenae]
Length = 215
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 73/214 (34%), Positives = 100/214 (46%), Gaps = 14/214 (6%)
Query: 16 GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
G Y +I+ IN +A TW AG NF N +E+ + L +K +R KT
Sbjct: 1 GTAYFLQKDFIENINEQATTWKAGVNFNPNTPKEHFLKML--GSKGVQIPNRNNIHLYKT 58
Query: 76 YDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
D Y +P FDAR +W +C TIG V D G C + A AF+DR C+ + G
Sbjct: 59 DDAAYDNLFGRIPRHFDARRKWRHCQTIGEVRDQGNCGSCWAVATSSAFADRLCVATDGD 118
Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
N+ LS E + CC C + C+ G + W K G VTGGDY GC+P +
Sbjct: 119 FNQLLSAEEITFCCHTCGF----GCNGGYPIKAWERFKKHGLVTGGDYKSEEGCEPYRVP 174
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
PC + S +C + + K + RCT YG
Sbjct: 175 PCPYDESGNN--TCAGKPMEK---NHRCTRMCYG 203
>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
Length = 202
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 67/203 (33%), Positives = 98/203 (48%), Gaps = 12/203 (5%)
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGS 174
+A +DR C++SKG+ R +S + SCC + C Y C G+ R W + + G
Sbjct: 6 SAASVMTDRLCVQSKGRIKRFISDTDILSCCGRFCGY----GCRGGANIRAWKHVMRNGV 61
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
TGG G + GC+P PC H C + +C C + +D++
Sbjct: 62 CTGGPCGYKYGCRPYAFHPCGVHKDQVYYGECPRKSYDTPECRKICQRGCIQLQYGKDRY 121
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
Y+V ++ AI +EI+ GP + Y DF YK GVY+HT+ + HS K+
Sbjct: 122 YAASAYFVKNDTKAIMREIMRGGPVHGAYDTYTDFRLYKGGVYEHTAGERTGG--HSIKI 179
Query: 295 IGWGT---ENGT--PYWLVINTW 312
+GWG NGT PYWLV N+W
Sbjct: 180 MGWGNYKHPNGTVIPYWLVANSW 202
>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
Length = 541
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 94/329 (28%), Positives = 143/329 (43%), Gaps = 39/329 (11%)
Query: 26 IDQINREANTWTAGR-NFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATV 84
I+ IN WTA F L++ ++ + A+ D+ + + S+ +
Sbjct: 233 IEAINEGDFGWTASNFTFLWGLTQLEGYKYKLGTARVPDE----VRNMNAMHPLSVSSNL 288
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
P FD+R +WP G++ D F+ SDR I+SK LS +++ S
Sbjct: 289 PKTFDSRTKWP--GSLSLPRDQENEGTSWAFSTTSVLSDRLAIQSKNFTVVELSPQHLVS 346
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
C + ++ + RTW +L K+G V+ Y + I C +
Sbjct: 347 C-----FSSHEGRGE-RLDRTWWYLRKKGVVSTVCYPESRSKSTQGIGSCGLVAHSSGAH 400
Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
C N V +N Y +T+ Y V NE+ I KEI +GP A
Sbjct: 401 ICPNGNVIS-------SNEIY---------KTSPVYRVSSNEENIMKEIFENGPVQAVMR 444
Query: 265 LYDDFYHYKSGVYKHTS--NAKLE----NYLHSGKLIGWGTE----NGTPYWLVINTWGP 314
+ DF+ YKSGVY T+ N +E N HS K+IGWG + N YW+V N+WG
Sbjct: 445 VQPDFFVYKSGVYSSTAIDNIVVEQVKDNTYHSVKIIGWGEKKSKTNSGKYWIVQNSWGA 504
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+WG+ G +I +G EC E +I A P+
Sbjct: 505 NWGEGGYFRIRKGVNECGIEEMILAAWPQ 533
>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 71/225 (31%), Positives = 110/225 (48%), Gaps = 20/225 (8%)
Query: 88 FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
FDA E WP C T+ + D +C + AA A SDR C G ++ +S + SCC
Sbjct: 1 FDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDLRISAGDLMSCCD 59
Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
+C + C+ G W + G V+ +Y CQP C+HH ++ L C
Sbjct: 60 VCGF----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVNSSDLSPCS 108
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
+ C++ CT+ + K+R +Y V E+ K+E++ +GP +F++Y
Sbjct: 109 GE-YDTPTCNSTCTD----KKIPLIKYRGNTSY-VLSGEEPFKRELILNGPFEVSFSVYA 162
Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
DF Y GVYKH + L H+ +++GWG NG PYW + N+W
Sbjct: 163 DFVAYTGGVYKHVAGIFLGG--HAVRIVGWGELNGEPYWKIANSW 205
>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
Length = 193
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 67/169 (39%), Positives = 94/169 (55%), Gaps = 6/169 (3%)
Query: 168 FLHKRGSVTGGDYGDRTGCQPSTISPCSH-HGSAPTLPSCENQKVPKLKCHTRCT-NPTY 225
+ G TGG+Y D+ GC+P TI PC + + T C P C RCT N T+
Sbjct: 29 WWQTHGLCTGGNYDDQFGCKPYTIYPCDKTYPNGTTSVPCPGYHTPV--CEERCTSNITW 86
Query: 226 GRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL 285
+ Q KH Y V I+ EI+ +GP A+F +YDDF+ YKSG+Y HT+ +
Sbjct: 87 PISYKQVKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQ- 145
Query: 286 ENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
E + + K+IGWG +NG PYWL ++ WG +G+ G ++ILRG E E
Sbjct: 146 EGGMDT-KIIGWGVDNGVPYWLCVHQWGTDFGENGFMRILRGVNEVHIE 193
>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 476
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/274 (31%), Positives = 123/274 (44%), Gaps = 42/274 (15%)
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
P+ F A +WP G I D CAA F+ +DR I SKG+ LS +++ S
Sbjct: 225 PEFFVAWHEWP--GWIHDPLDQRNCAASWAFSTASVAADRIAIHSKGRFTDNLSPQHLIS 282
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG------DRTGCQPSTISPCSHHG 198
C +Y C GS+ W++L K G V+ Y +T C+ S++ G
Sbjct: 283 CDTRNQY----GCKGGSITGAWSYLKKYGLVSHACYPLFWNNLHQTSCEMSSVF--DAEG 336
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ C N+ P +N Y G L Y + + I KEI +GP
Sbjct: 337 KRQAIQPCPNRWEP--------SNHIYQCG---------LPYRISSQDADIMKEIKENGP 379
Query: 259 TTATFALYDDFYHYKSGVYKHT------SNAKLENYLHSGKLIGWGTENGTP-----YWL 307
A +YDDF+ YKSG+YKH + + + HS K++GWGT +W+
Sbjct: 380 VQAVMQVYDDFFLYKSGIYKHIWSLEGKTQNRHQKKPHSIKIVGWGTLRDAEGQRQKFWI 439
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
N+WG WG+ G +ILRG+ EC E + A K
Sbjct: 440 AANSWGNSWGENGYFRILRGQNECDIEKTVIASK 473
>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 483
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/266 (32%), Positives = 121/266 (45%), Gaps = 33/266 (12%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P+ FDAR +W G + V D G CA F+ SDR I+S+G LS + +
Sbjct: 200 LPEEFDARIRWS--GLVHGVRDQGDCANSWAFSTAAVASDRLSIQSRGVDKVELSPQDLM 257
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC R C G R W FL G V+ C P H SA
Sbjct: 258 SCLNGGR---RVVCQGGHPDRGWRFLLNYGGVS-------EECYPYE----GVHSSANAT 303
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
++ P RC PT G + KH +T Y V NE+ I +EI A+GP A
Sbjct: 304 CRIPRRRDPIED--ARC--PT---GRTEQKHFSTPPYRVPANEEDIMQEIYANGPVQALI 356
Query: 264 ALYDDFYHYKSGVYKHTSNAK------LENYLHSGKLIGWGTENG----TPYWLVINTWG 313
+ +DF+ Y+SGVY+HT A+ + HS +++GWG + YWL N+WG
Sbjct: 357 LVKEDFFLYRSGVYRHTRIAESLRPQYSRSGWHSVRILGWGVDRSQYRPIKYWLCANSWG 416
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAA 339
WG+ G +I+RG+ E E + A
Sbjct: 417 HGWGENGYFRIVRGEDESQIESFVLA 442
>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
Length = 467
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/259 (31%), Positives = 117/259 (45%), Gaps = 25/259 (9%)
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
P+ F A WP+ I D C A F+ +DR I S GQ LS + + S
Sbjct: 223 PEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLIS 280
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
C + + C+ GS+ W +L G V+ C PS HH +P+
Sbjct: 281 C----DTGNQRGCNGGSIDGAWRYLTTHGVVS-------YACYPSFWK---HHLDSPSEN 326
Query: 205 SC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C + + K + C N +R Y V E I +EI+A GP A
Sbjct: 327 QCYVSSEYGKNHTNGPCPNALEDSNRL---YRCGSHYRVSSKETDIMEEIMAKGPVQAIM 383
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG-----TPYWLVINTWGPHWGD 318
+Y+DF+ YK G+Y+H+ A + HS KL+GWG+ G +W+ N+WG +WG+
Sbjct: 384 KVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGE 443
Query: 319 RGTVKILRGKYECAFEYLI 337
G +ILRG+ EC E LI
Sbjct: 444 NGYFRILRGQNECDIEKLI 462
>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 157
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 62/163 (38%), Positives = 86/163 (52%), Gaps = 12/163 (7%)
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH----RTTLTY 240
GC P PC+HH + P C P C +C NP Y D+H + Y
Sbjct: 2 GCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPYHY 61
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
V+D ++AI+ + GP +A+F +Y+DF Y+SGVYKHTS + L H+ K+IGWG +
Sbjct: 62 SVNDAKNAIRTD----GPVSASFTVYEDFLAYRSGVYKHTSGSYLGG--HAVKIIGWGEK 115
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+G YWL +N+W WGD G KI G C + + G PK
Sbjct: 116 SGQAYWLAVNSWNEDWGDHGLFKIALG--NCGIDDDLLGGTPK 156
>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/258 (31%), Positives = 113/258 (43%), Gaps = 38/258 (14%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR QW C + + D C A F+A + R CI + GQ N LS EY
Sbjct: 3 IPMNFDARTQWRGC--VPAIRDQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQV 60
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
C + NK+C G + +W FL G+ + C + S
Sbjct: 61 QCDTM-----NKACQGGYLKYSWTFLENTGT---------------PLDTCIPYASGRGT 100
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
S C T+C + ++ K+ +T + IK I+ +G A F
Sbjct: 101 FSSGT-------CPTQCKIASMSMSKYKAKNTRYIT-----GINNIKTAIMTYGSVQAGF 148
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+Y D YKSGVYKH + L H+ LIG+G E G+ YWL N+WGP+WG G K
Sbjct: 149 TVYRDLTGYKSGVYKHVVSTVLGG--HAVALIGFGVEGGSNYWLAANSWGPNWGMSGYFK 206
Query: 324 ILRGKYECAFEYLIAAGK 341
I +G E E + AG+
Sbjct: 207 IAQG--EGGIENQVYAGE 222
>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
guttata]
Length = 469
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 89/267 (33%), Positives = 119/267 (44%), Gaps = 29/267 (10%)
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
P F A +WP I D C A F+ +DR I SKGQ LS + + S
Sbjct: 223 PAIFSAIYEWPE--WIHDPLDQRNCGASWAFSTASVAADRIAIHSKGQITDNLSAQNLIS 280
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
C + C+ GS+ W +L G V+ C PS + H G P
Sbjct: 281 C----DTRNQHGCNGGSIDGAWRYLKTHGVVS-------YACYPSFWN--KHLG-----P 322
Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDK--HRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
S ENQ + TN F + +R Y V E I KEI GP A
Sbjct: 323 SAENQCYVSNEYGKNHTNGPCPNAFEKSNRLYRCASHYRVSSKETDIMKEIKDRGPVQAI 382
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT---ENG--TPYWLVINTWGPHWG 317
+Y+DF+ YK G+Y+H+ A + HS KL+GWG +NG +W+ N+WG WG
Sbjct: 383 MKVYEDFFLYKEGIYQHSQKAGSKWKTHSVKLLGWGALPDKNGQKQKFWIAANSWGKSWG 442
Query: 318 DRGTVKILRGKYECAFEYLIAA--GKP 342
+ G +ILRG+ EC E LI A G+P
Sbjct: 443 ENGYFRILRGQNECDIEKLILATLGQP 469
>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 123/299 (41%), Gaps = 47/299 (15%)
Query: 45 NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP 104
N++ LR L + D P + E +P FDAR QW C + +
Sbjct: 54 NMTISQLRDNLFGLSLMSSDEDTP-----RMASIETRVDIPMNFDARTQWKGC--VPAIR 106
Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
D C A F+A + R CI + G+ N LS EY C + NK+C G +
Sbjct: 107 DQQTCGACWAFSANYVLAHRLCIATNGKTNVVLSPEYQVQCDTM-----NKACQGGYLKY 161
Query: 165 TWNFLHKRGSV--TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTN 222
+W FL G+ T Y G S G+ PT + + K K N
Sbjct: 162 SWTFLENTGTPLDTCIPYASGRGTFSS--------GTCPTQCKIASMSMSKYKAK----N 209
Query: 223 PTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
Y G + IK I+ +G A F +Y D YKSGVYKH +
Sbjct: 210 TVYISGI-----------------NNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVS 252
Query: 283 AKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
L H+ LIG+G E G+ YWL N+WGP+WG G KI +G E E + AG+
Sbjct: 253 TVLGG--HAVALIGFGVEGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAGE 307
>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
Length = 471
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 124/269 (46%), Gaps = 35/269 (13%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P F+A ++WP G I D G C A F+ SDR I+S G LS + +
Sbjct: 200 LPSYFNAVDKWP--GKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNLI 257
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC R+ D C+ G + W F+ +RG VT C P SP SA +
Sbjct: 258 SC--DTRHQD--GCAGGRIDGAWWFMRRRGVVT-------QDCYP--FSPPEQ--SAVEV 302
Query: 204 PSCENQKVP----KLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
C Q K + C N + D +++T Y + NE+ I KEI+ +GP
Sbjct: 303 ARCMMQSRAVGRGKRQATAHCPN---SHSYHNDIYQSTPPYRLSTNENEIMKEIMDNGPV 359
Query: 260 TATFALYDDFYHYKSGVYKHTS------NAKLENYLHSGKLIGWGTENG-----TPYWLV 308
A +++DF+ YKSG+++HT + ++ HS ++ GWG E YW+
Sbjct: 360 QAIMEVHEDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRITGWGEERDYSGRTRKYWIG 419
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WG +WG+ G +I RG EC E +
Sbjct: 420 ANSWGKNWGEDGYFRIARGVNECDIETFV 448
>gi|157058757|gb|ABV03136.1| cathepsin B-84 [Pterocomma populeum]
Length = 218
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 68/229 (29%), Positives = 113/229 (49%), Gaps = 16/229 (6%)
Query: 40 RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGT 99
+NFP N+ +E + + L+ + P+ + +Y + +P FDAR +W C T
Sbjct: 3 QNFPENMLKEQMVR-LLGSKRLTGVPKTPVKENDISYVED--GGIPKAFDARLEWKYCKT 59
Query: 100 IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSH 159
IG V D G C + GAF+DR CI +KG N +S E + CC +C C+
Sbjct: 60 IGQVRDQGNCGSCWAHGTSGAFADRLCIATKGDFNELISAEELTFCCHLC----GIGCNG 115
Query: 160 GSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTR 219
G+ R W + + G VTGG+Y GCQP + PC++ SC Q+ + + +
Sbjct: 116 GNPLRAWQYFKRHGVVTGGNYNTTNGCQPYRVPPCTNGDKGHY--SCSGQQKER---NHK 170
Query: 220 CTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFAL 265
C YG + +D ++T Y++ N ++K+++ +GP A+F +
Sbjct: 171 CLKTCYGDKTVDYKRDHYKTKDAYYL-SNTTTMQKDVILYGPIEASFDV 218
>gi|157058731|gb|ABV03123.1| cathepsin B-16D1 [Acyrthosiphon pisum]
Length = 243
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/238 (32%), Positives = 108/238 (45%), Gaps = 16/238 (6%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
L + V + Y +ID IN +A TW AG NF + +E+ + L +K
Sbjct: 6 LSVIFVSVYVTEQTYFLQKDFIDNINNQATTWKAGVNFDPDTPKEHFLKML--GSKGVQI 63
Query: 65 SDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
++ KT+D Y +P FDAR +W +C TIG V D G C + A AF
Sbjct: 64 PNKHNIHMYKTHDAAYDNLFGRIPRHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAF 123
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
+DR C+ + N LS E + CC C + C+ G + W KRG VTGGDY
Sbjct: 124 ADRLCVATNADFNELLSAEEITFCCYSCGF----GCNGGYPIKAWERFKKRGLVTGGDYQ 179
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTT 237
GC+P + PC + A +C + P+ H RCT YG F + HR T
Sbjct: 180 SGEGCEPYRVPPCPY--DAEGHNTCAGK--PRESNH-RCTRMCYGNQDLDFDEDHRYT 232
>gi|308159555|gb|EFO62082.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 305
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 72/251 (28%), Positives = 112/251 (44%), Gaps = 37/251 (14%)
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
PDR D R+ P C D C+ + FA +GA S RRCI Q LS +++ S
Sbjct: 82 PDRLDYRQTHPEC--FFEPEDQKECSCCYAFATIGALSTRRCIAKLDSQAVSLSVQHMVS 139
Query: 145 CCKICRYDDNKS-CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
C D+ ++ C G +W FL G V ++ C P T + G P +
Sbjct: 140 C------DNGEAGCLGGEFESSWAFLETEGVV-------KSDCLPYTSGETGNSGECPMM 186
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C++ + + H + + + +N + I +LA GP F
Sbjct: 187 --CQDGTLVEDAFHYKAASAS-----------------PLNNYNEIMVSLLADGPVQTGF 227
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
+++DF +Y G+Y + L H+ ++G+G+ N YW+V N+WGP WG+ G +
Sbjct: 228 YVHEDFLYYVGGIYHKVYGSSLGG--HAVLIVGYGSMNDHDYWIVRNSWGPDWGENGYFR 285
Query: 324 ILRGKYECAFE 334
ILRG EC E
Sbjct: 286 ILRGTNECGIE 296
>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
Length = 313
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 124/287 (43%), Gaps = 43/287 (14%)
Query: 50 YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGA- 108
+L I Y + S R G T+D ++ +P FD+R++W +C V D G
Sbjct: 3 FLITLFILLISYTELS-RAQCGASPTFD---ASNLPASFDSRQKWSDC--FSPVRDQGQK 56
Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
C++ A G +DR C+ S G+ + LS + + C + + N C G + +
Sbjct: 57 CSSCWAMTATGVLADRLCVASGGKVKKVLSPQELIDCDR----NGNLGCGGGRLDTPLAY 112
Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGR 227
G VT CE+ K + C C + T
Sbjct: 113 FRDNGVVT---------------------------EKCESYKATQASSCSNTCDDGTSFS 145
Query: 228 GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN 287
K+ + Y + E A K +I +GP A F LY D Y+YKSGVY + +A +
Sbjct: 146 N--TTKYHSKDCYRLSSIEQA-KADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSATYKE 202
Query: 288 YLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
H+G++IGWG E+G YWL N+WG WG +G KI G E FE
Sbjct: 203 -THAGRVIGWGVEDGVQYWLAANSWGTGWGQQGLFKIRSGTNEVGFE 248
>gi|161343825|tpg|DAA06093.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 199
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 101/198 (51%), Gaps = 7/198 (3%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIADA 59
++ +L +L + + Y YI++IN +A+TWTAG NF P+ E+ L+
Sbjct: 4 VLILLSVILFSVYMTEQAYFLEKDYINKINEKASTWTAGFNFDPSTPKEDILKLLGSKGV 63
Query: 60 KYFDQSDRPL-PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ + + + + + YD + +P +FDAR++W +C TIG V D G C + +
Sbjct: 64 QTPSKINLKMYKSEDENYDNLF-GRIPKKFDARKKWRHCTTIGKVRDQGNCGSCWALSTS 122
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
AF+DR C+ + G N+ LS E + CC C Y C+ G + W K G VTGG
Sbjct: 123 SAFADRLCVATNGDFNQLLSAEELTFCCHKCGY----GCNGGYPIKAWERFKKHGLVTGG 178
Query: 179 DYGDRTGCQPSTISPCSH 196
+Y GC+P + PC +
Sbjct: 179 EYKSGEGCEPYRVPPCPY 196
>gi|145356617|ref|XP_001422524.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582767|gb|ABP00841.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 245
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 83/257 (32%), Positives = 116/257 (45%), Gaps = 24/257 (9%)
Query: 83 TVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
T+P FD RE+WP C + D G C + A +DR CI + G LS
Sbjct: 1 TLPKDFDVREKWPKCAALVSEALDQGECGSCWAVAPAKVMADRLCIATNGAVASHLSAMQ 60
Query: 142 VASCCKI--CRYDDNK----SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
+ SC K+ +D SC G + G V+GG +GD C P +PC
Sbjct: 61 LLSCGKLENGTFDAGSTYSGSCDGGFPNEAYEKARTSGIVSGGLFGDDKTCMPYAFAPCQ 120
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H P P+ Q C T C N Q T+L ++ + + E+
Sbjct: 121 H----PCNPNHVAQ------CPTTCRNKNVNLS-SQRYEVTSLVTCGTNDFNCMALELFY 169
Query: 256 HGPTTATFA-LYDDFYHYKSGVYKHTSNAKLENYLHSG---KLIGWG-TENGTPYWLVIN 310
HGP ++ ++D+FY YKSGVY + + H G ++IGWG TE+GT YW V N
Sbjct: 170 HGPVSSYVGDVFDEFYKYKSGVYSLSKDVAARGENHGGHVMEVIGWGTTESGTRYWKVYN 229
Query: 311 TWGPHWGDRGTVKILRG 327
+W +WGD+G KI G
Sbjct: 230 SW-LNWGDQGYGKIAVG 245
>gi|157058759|gb|ABV03137.1| cathepsin B-84 [Rhopalosiphum padi]
Length = 219
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 69/235 (29%), Positives = 112/235 (47%), Gaps = 21/235 (8%)
Query: 36 WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYS--ATVPDRFDAREQ 93
W A +NFP +++E + + L + + L K YD +Y+ VPD FDAR +
Sbjct: 1 WKAKQNFPEYMTKEQIVRLLGSKS-----VKGALKSPIKEYDSKYTNDVEVPDFFDARIE 55
Query: 94 WPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDD 153
W C TIG V + G C + GAF+DR C+ + G N +S E + CC C +
Sbjct: 56 WKYCKTIGEVRNQGNCGSCWAHGTTGAFADRLCVATNGDFNELISAEELTFCCHTCGF-- 113
Query: 154 NKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPK 213
C+ G+ R W + + G VTGG+Y GCQP + PC SC Q+ +
Sbjct: 114 --GCNGGNPIRAWLYFKRHGVVTGGNYNTTDGCQPYKVPPCIRDEEGHN--SCSGQRTER 169
Query: 214 LKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFAL 265
+ RC+ YG + ++T Y++ +N ++ + + +GP ++F +
Sbjct: 170 ---NHRCSKSCYGNTTSDYKNGHYKTKDAYYLTNN--TMQIDTMIYGPIESSFDV 219
>gi|159117627|ref|XP_001709033.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157437148|gb|EDO81359.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 308
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 94/338 (27%), Positives = 145/338 (42%), Gaps = 66/338 (19%)
Query: 21 FSDAYIDQINREANTWTAGRNFPA---NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
+ + QI A W AG P NL++ ++ L A + S R
Sbjct: 16 LTQVELRQIQALAPAWKAG--IPERLKNLTKNDFKKMLSAGSPRTQSSIV-----RPVRV 68
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
PE VPD FD RE++P C I V D G C++ ++AV AFS RRC+ Q+
Sbjct: 69 PENEDPVPDHFDFREEYPQC--ITEVIDIGLCSSSWAYSAVDAFSHRRCLTGLDQEATRY 126
Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRT--WNFLHKRG-----SVTGGDYGDRTGCQPST 190
S +Y+ SC C S + W+F+ G V DY D+T +P
Sbjct: 127 SAQYILSC------SSTNGCFGFSTRESIAWDFIATTGIPLESCVKYTDY-DQTQSRP-- 177
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
C + C + ++ + D + V N + +K
Sbjct: 178 -------------------------CPSTCDDDSFLEVYKPDGYEG-----VGLNCERLK 207
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVI 309
+ + GP A F +Y+DF +Y G+Y +T ++ S +++G+GT + G YW+V
Sbjct: 208 RAVALRGPMQAMFTVYEDFTYYLEGIYSYTYGNRVG--FLSVEIVGYGTSDEGQDYWIVK 265
Query: 310 NTWGPHWGDRGTVKILRGKYECAFE-----YLIAAGKP 342
N WGP WG+ G +I+RG+ EC E +I+ KP
Sbjct: 266 NYWGPGWGEDGYFRIVRGQNECQIENSAYGAIISPNKP 303
>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
Length = 220
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/106 (48%), Positives = 66/106 (62%), Gaps = 2/106 (1%)
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
T Y+V AI+ EI+ +GP F +Y+D Y YKSGVY+HT+ L H+ K+IG
Sbjct: 112 TSAYYVGMTVSAIQTEIMTNGPVVGVFTMYEDMYKYKSGVYRHTAGRLLGG--HAIKIIG 169
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WGT+NG PYWL+ N+WG WG+ G KI RG EC E + AGK
Sbjct: 170 WGTQNGIPYWLIANSWGTKWGENGFFKIRRGVNECGIENNVVAGKA 215
>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
Length = 349
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 78/267 (29%), Positives = 125/267 (46%), Gaps = 46/267 (17%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + T+P+ FD+R++WPNC I + D C + FA+ SDR CI S+GQ N LS
Sbjct: 120 DLNETIPESFDSRDKWPNC--IHGIRDQQLCGSCWAFASSAFLSDRFCIHSEGQINEDLS 177
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP--STISPCSH 196
+ + SC +N CS G + + +FL G V+ C+P + + C
Sbjct: 178 PQDLVSCSY-----ENFGCSGGQLTESVDFLIYEGIVS-------EKCKPYMNQDTYCKF 225
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
C+N K P Y + F + K L+ + + I+ E++ +
Sbjct: 226 --------KCQNDKQP------------YTKYFCEQKSMLILS-----DIEEIQLELMTN 260
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPH 315
GP ++Y+D +YK GVY++T+ ++ H+ K+IGWG TE G +W N WG
Sbjct: 261 GPMMVGLSVYEDLMNYKEGVYEYTTGNQVGG--HAIKIIGWGHTEKGELFWKCQNQWGKD 318
Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
WG G + I G E + ++ P
Sbjct: 319 WGMGGYINIKAG--ELGMDTMVLGCMP 343
>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
Length = 122
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 55/118 (46%), Positives = 70/118 (59%), Gaps = 2/118 (1%)
Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
Y + +DKH +Y V +NE I EI +GP F++Y DF YKSGVY+H S
Sbjct: 2 YSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEI 61
Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+ H+ +++GWG ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 62 MGG--HAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 117
>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
Length = 325
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 87/297 (29%), Positives = 124/297 (41%), Gaps = 43/297 (14%)
Query: 45 NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP 104
N++ LR L + D P + + E +P FDAR QW C + +
Sbjct: 69 NMTISQLRDNLFGLSLMSTDEDTP-----RMENIETRMDIPMNFDARTQWRGC--VPAIR 121
Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
D C A F+A + R CI + GQ N LS EY C + NK+C G +
Sbjct: 122 DQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM-----NKACQGGYLKY 176
Query: 165 TWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT 224
+W FL G+ + C + S S C T+C +
Sbjct: 177 SWTFLENTGT---------------PLDTCIPYASGRGTFSSGT-------CPTQCKIAS 214
Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
++ K+ +T + IK I+ +G A F +Y D YKSGVYKH +
Sbjct: 215 MSMSKYKAKNTRYIT-----GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTV 269
Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
L H+ LIG+G E G+ YWL N+WG +WG G KI +G E E + AG+
Sbjct: 270 LGG--HAVALIGFGVEGGSNYWLAANSWGANWGMSGYFKIAQG--EGGIENQVYAGE 322
>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
Length = 463
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 115/271 (42%), Gaps = 45/271 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P RFDA E W G + D G C + F+ SDR I SKG++ L+ + +
Sbjct: 187 LPTRFDASEHWT--GLVAEARDQGWCGSSWAFSTATMASDRFAILSKGREMVQLAPQQML 244
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
+C + + CS G + W +L + G V C + +A +
Sbjct: 245 ACVR-----RQQGCSGGHLDTAWQYLRRTGVVN---------------EECYPYIAAQNV 284
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-----DNEDAIKKEILAHGP 258
N C P K TL Y + +NE I EI G
Sbjct: 285 CKISNDDT---LITANCELPV--------KVNRTLMYKMGPAFSLNNETDIMAEIKDRGT 333
Query: 259 TTATFALYDDFYHYKSGVYKHTSNA---KLENYLHSGKLIGWGTE----NGTPYWLVINT 311
A +Y DF+ Y+SG+Y+H++ A + + HS +LIGWG E + YW+ IN+
Sbjct: 334 VQAIMRVYRDFFSYRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVKYWIAINS 393
Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WG WG+ G +ILRG EC E + A P
Sbjct: 394 WGQWWGENGRFRILRGSNECDIESYVLASNP 424
>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Ailuropoda melanoleuca]
Length = 472
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 120/276 (43%), Gaps = 49/276 (17%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR G+ LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADR----IXGRYTANLSPQNL 269
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
SCC R+ C+ GS+ R W FL KRG V+ Y GC ++ S
Sbjct: 270 ISCCAKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNYGCAMASRS--D 323
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + K +C+ P Y V NE I KEI+
Sbjct: 324 GRGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSNETEIMKEIMQ 366
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGKLIGWGTENGT-----P 304
+GP A +++DF+HYK+G+Y+H + E+ H+ KL GWGT G
Sbjct: 367 NGPVQAIMQVHEDFFHYKTGIYRHVTRTNEESSKYRKLQTHAIKLTGWGTLKGARGQKEK 426
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 427 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 462
>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
Length = 470
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/332 (27%), Positives = 145/332 (43%), Gaps = 47/332 (14%)
Query: 19 YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLI--ADAKYFDQSDRPLPGDRKTY 76
+K + +I+QIN ++W AG +P E++ R LI A + RP P
Sbjct: 173 FKTNLDFIEQINSAQSSWQAGV-YPE--YEKFTRNDLIRRAGGRKSRLPHRPRPAPVSEE 229
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
+A +P+ FD R+ + + D G C + + FA++G R + + Q
Sbjct: 230 TRLAAAQLPESFDWRKVM-GLNFVSPIRDQGQCGSCYAFASMGMLEARLRVLTNNTQQFV 288
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
LS + + SC K ++ C G + + G Y + G P
Sbjct: 289 LSPQEIVSCGKY-----SQGCEGGFPY-----------LIAGKYAEDFGVVLEECYPYEG 332
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
S SC++ +RC GRG+ + +R ++ NE+ ++ E++ +
Sbjct: 333 KDS-----SCKDT--------SRC-----GRGYATN-YRYVGGFYGGCNEELMQLELVKN 373
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGT--ENGTPYWLVIN 310
GP F +Y DF HYK GVY+HT + E H+ L+G+G E G +W V N
Sbjct: 374 GPMAVAFEVYSDFMHYKGGVYEHTGLSDPFNPFEITNHAVLLVGYGRDPETGAKFWTVKN 433
Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+WG WG+ G +I RG ECA E + A P
Sbjct: 434 SWGEKWGEEGFFRIRRGTDECAIESIAVAADP 465
>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
Length = 208
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 96/209 (45%), Gaps = 26/209 (12%)
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPC 194
LS + +CC D C G W + + G VT Y D GC+
Sbjct: 5 LSVNDLLACCGFMCGD---GCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCK------- 54
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
H G P P+ KC +C + + + KH + Y ++ + I E+
Sbjct: 55 -HPGCEPAYPT--------PKCEKKCKEQN--QVWQEKKHFSIDAYRINSDPHDIMAEVY 103
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWG 313
+GP F +Y+DF HYKSGVYKH + + H+ KLIGWGT + G YWL+ N W
Sbjct: 104 KNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGG--HAVKLIGWGTSDAGEDYWLLANQWN 161
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WGD G KI+RGK EC E + AG P
Sbjct: 162 RGWGDDGYFKIIRGKNECGIEEGVVAGMP 190
>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 96
Score = 109 bits (273), Expect = 2e-21, Method: Composition-based stats.
Identities = 47/94 (50%), Positives = 69/94 (73%), Gaps = 2/94 (2%)
Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWL 307
AI+KEI+ +GP A F +Y+DF +YKSG+YKH + KL ++ H+ ++IGWG EN TPYWL
Sbjct: 2 AIQKEIMKYGPVEANFIVYEDFLNYKSGIYKHIT-GKLFSW-HAIRIIGWGEENNTPYWL 59
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
+ N+W WG+ G +ILRG++EC+ E + AG+
Sbjct: 60 IPNSWNEDWGENGNFRILRGRHECSIESEVTAGR 93
>gi|395528577|ref|XP_003766405.1| PREDICTED: dipeptidyl peptidase 1-like [Sarcophilus harrisii]
Length = 568
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 141/339 (41%), Gaps = 59/339 (17%)
Query: 17 ELYKFSDAYIDQINREANTWTAGRNFPANLSEEY----LRQFLIADAKYFDQSDRPLPGD 72
E +K++ ++D IN N+W A + EEY L Q + Y RP
Sbjct: 272 EHFKYNYDFVDAINAAQNSWIA------TVYEEYEKLSLDQMIKRRGGYSYPYPRPKSAP 325
Query: 73 RKTYDPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKS 129
+ ++T+P +D W N + +V + C + + FA++G R IK+
Sbjct: 326 LTHEILQKTSTLPKSWD----WRNVNGVNYVSPVRNQANCGSCYAFASLGMLESRIRIKT 381
Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
Q LS + + SC + ++ C G + + GG Y G
Sbjct: 382 NNSQVPVLSPQEIVSCSEY-----SQGCEGGFPY-----------LIGGKYAQDFGLVEE 425
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
P + S T C ++ ++ ++ NE +
Sbjct: 426 ECFPYQAYDSPCTPKKCSR--------------------YYTSEYHYVGGFYGGCNEALM 465
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHT----SNAKLENYLHSGKLIGWGTEN--GT 303
K E++ +GP T F +YDDF HY++G+Y HT + E H+ L+G+GT+ G
Sbjct: 466 KHELIQNGPLTVAFEVYDDFIHYRTGIYHHTGLRDNFNPFELTNHAVLLVGYGTDEKTGE 525
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
YW+V N+WG WG+ G +ILRG ECA E + A P
Sbjct: 526 DYWIVKNSWGTSWGENGYFRILRGTDECAIESIAVAATP 564
>gi|197129222|gb|ACH45720.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 236
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 70/229 (30%), Positives = 105/229 (45%), Gaps = 18/229 (7%)
Query: 2 IHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
+ +L L+ R Y SD ++ IN+ TW AG NF N Y+++
Sbjct: 5 VSLLCVLVALANARSIPYFPPLSDDLVNHINKLNTTWKAGHNF-HNADMSYVKKLC---G 60
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ P + D +PD FD+R QWPNC TI + D G+C + F AV
Sbjct: 61 TFLGGPKLP-----ERVDFAADVELPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVE 115
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C+ + + + +S E + SCC ++ C+ G W + +RG V+GG
Sbjct: 116 AISDRICVHTNAKVSVEVSAEDLLSCCG---FECGMGCNGGYPSGAWRYWTERGLVSGGL 172
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT---NPTY 225
Y GC+P +I PC HH + T P C + +C C +P+Y
Sbjct: 173 YDSHVGCRPYSIPPCEHHVNG-TRPPCTGEGGSTPRCSRHCEPGYSPSY 220
>gi|38048307|gb|AAR10056.1| similar to Drosophila melanogaster CG10992, partial [Drosophila
yakuba]
Length = 174
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/167 (37%), Positives = 92/167 (55%), Gaps = 10/167 (5%)
Query: 12 TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSD-RP 68
TL GE SD +I+ + +A TWT GRNF A+++E ++R+ + DA F +D R
Sbjct: 15 TLSAGEPSLLSDEFIELVRSKAKTWTVGRNFDASVTEGHIRRLMGVHPDAHKFALADKRE 74
Query: 69 LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
+ GD + +P+ FD+R+QWPNC TIG + D G+C + F AV A SDR CI
Sbjct: 75 VLGDLYMNSVD---EIPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 131
Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
S G+ N S + + SCC C + C+ G W++ ++G V
Sbjct: 132 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIV 174
>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 360
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/337 (27%), Positives = 140/337 (41%), Gaps = 51/337 (15%)
Query: 4 ILVFLLGCTLVRGELYK---FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADA 59
I + ++G +L+ G + S A + I + TW P + L +
Sbjct: 61 IEIKMIGASLLLGAVLAAPAVSHADLHTIKALDGLTWVP--ELPKRFMGKSLDEVKAMFG 118
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
D S RP R++ P A P+ +D R+++P+C I V D G C + F++V
Sbjct: 119 PLVDTS-RPAITMRRSTTPPVGA--PESYDFRDEYPHC--ITEVVDQGNCGSCWAFSSVQ 173
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
F+D RC S +YV C + + C+ G +NFLH G+V
Sbjct: 174 TFADHRCRSGLDATGVSYSVQYVLDCDR-----KDHGCNGGEPVNAFNFLHNTGTVLASC 228
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
G G + V K C +C + +
Sbjct: 229 VGYTAG----------------------DDAVVKF-CPQKCDDGSAVENVVATS------ 259
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG- 298
+ +LAHGP ATF + DF +YKSGVY+H L H+ ++IG+G
Sbjct: 260 ---GSKSGSAIDVLLAHGPVVATFNVAQDFMYYKSGVYQHRWGLWLGG--HAVEIIGYGV 314
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEY 335
T++G YW V N+WGP WG+ G +I+RG EC E+
Sbjct: 315 TDSGLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEH 351
>gi|412992960|emb|CCO16493.1| cysteine proteinase, putative [Bathycoccus prasinos]
Length = 396
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 121/273 (44%), Gaps = 29/273 (10%)
Query: 73 RKTYDPEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
+ ++DPE S +P +FDAR++W C G IG V D G C + AA +DR CI
Sbjct: 136 KASFDPE-SLGLPRQFDARKEWAECKGLIGTVRDQGKCGSCWAVAATEVMNDRVCIAHG- 193
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD-RTGCQPST 190
+ LS +Y SC Y C G+V T ++G TGG +GD + C P
Sbjct: 194 -KTEELSPQYALSC-----YSAGAGCEGGNVIDTLQEAIEKGVPTGGMFGDSSSACLPYE 247
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
C H C+ +C T C + T + + + I
Sbjct: 248 FEACDH--------PCQVPGTIAEECPTTCADGTPISETEMMRPTSEPYECPPGDWKCIT 299
Query: 251 KEILAHGPTTATFA-LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE--------N 301
+E+ +G TF + DDFY +K GVY+ K LH+ K+IGWG E
Sbjct: 300 QELHKYGSMAVTFGPVCDDFYGHKHGVYEQPEGGKPLG-LHATKIIGWGFEGDDEETGKG 358
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
G PYW++IN+W +WG+ G +I G+ E
Sbjct: 359 GKPYWIMINSW-QNWGEHGVGRIGIGEMSIESE 390
>gi|403359042|gb|EJY79178.1| Cysteine protease [Oxytricha trifallax]
Length = 366
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 123/264 (46%), Gaps = 44/264 (16%)
Query: 65 SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
SD G K+ D E +P+++D RE +P+C + V + G C++ +I AA+ +DR
Sbjct: 88 SDTQNIGPCKSKDDE-ETIIPEKYDWREVYPDC--VQPVVNQGNCSSSYITAALSTVADR 144
Query: 125 RCIKSKGQQNRP--LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
C +K +P LS + + C K + C G V RT+N +G
Sbjct: 145 ICQTTK----KPIQLSAQELLDCDK-----SSYQCDGGYVSRTFN------------WGK 183
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
R G P P + + CE+ + +C R N Y + Y +
Sbjct: 184 RKGFIPEQCYPYTG-----VVGECEDDHLETNEC--RVNNMFY----------RVIDYCL 226
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-N 301
+E +KKEIL +GP A +Y DF YK GVY T +A N H K++GW + +
Sbjct: 227 ASDELGLKKEILKNGPVVAQMVIYTDFLTYKEGVYHRTEDAFKFNGQHVVKIVGWDRQGD 286
Query: 302 GTPYWLVINTWGPHWGDRGTVKIL 325
G +W+V N+WG WG+ G VKIL
Sbjct: 287 GNDFWIVENSWGSDWGEDGYVKIL 310
>gi|221219800|gb|ACM08561.1| Cathepsin B precursor [Salmo salar]
gi|221222296|gb|ACM09809.1| Cathepsin B precursor [Salmo salar]
Length = 205
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 97/208 (46%), Gaps = 18/208 (8%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
LV L + L S +D IN+ TW AG NF N+ Y+++ K
Sbjct: 9 LVSGLSVSWAWPRLPPLSHQMVDYINKANTTWKAGPNF-HNVDYSYVKRLCGTLLK---- 63
Query: 65 SDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
G + +Y+ V PD FD R+QWPNC T+ + D G+C + F A A S
Sbjct: 64 ------GPKLPTMVQYAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR CI S + + +S+E + SCC C C+ G W+F G VTGG Y
Sbjct: 118 DRVCIHSNAKVSVEISSEDLLSCCDSC----GMGCNGGYPSAAWDFWTTEGLVTGGLYDS 173
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQK 210
GC+P +I PC HH + T P C ++
Sbjct: 174 HVGCRPYSIPPCEHHVNG-TRPPCTGEE 200
>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
Length = 130
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/119 (44%), Positives = 69/119 (57%), Gaps = 2/119 (1%)
Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
Y + +DKH +Y V D+E I EI +GP F ++ DF YKSGVYKH +
Sbjct: 6 YSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDV 65
Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+ H+ +++GWG ENG PYWLV N+W WGD G KILRG+ C E I AG P+
Sbjct: 66 MGG--HAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPR 122
>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
Length = 323
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 92/328 (28%), Positives = 145/328 (44%), Gaps = 49/328 (14%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL--IADAKYFDQSDRPLPGDRKTYDP 78
+D +I N + W A RN A + Q + + K + + P K D
Sbjct: 39 LNDKFIQNHNSKNAPWVAKRN--ARFEGHTIGQVMAMMGTKKVINNNAAP---SIKIVD- 92
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
A++P FDAREQWP C + V + C + F++ A SDR CI SKGQ N LS
Sbjct: 93 ---ASIPSTFDAREQWPGC--VHAVLNQEQCGSCWAFSSSEALSDRLCIASKGQVNVTLS 147
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ + +C I N+ C+ G W ++ +G T C P T + +G
Sbjct: 148 PQALVACDDI----GNQGCNGGVPQLAWEYMEWKGLPT-------FECYPYT----AGNG 192
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ T C +C + + +++ K + T ++ I+ EI+ +GP
Sbjct: 193 TDGT-------------CQRQCADGS-AMTYYRAKPFSMTTC---NSVACIQNEIITYGP 235
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP--YWLVINTWGPHW 316
T +Y DF Y SGVY + A+L H+ +++GWGT+ + YW+V N+W W
Sbjct: 236 VVGTMMVYQDFMSYSSGVYVYDGTAELLGG-HAIEIVGWGTDATSKLDYWIVKNSWSAAW 294
Query: 317 GDR-GTVKILRGKYECAFEYLIAAGKPK 343
G G I RG C ++ +A + K
Sbjct: 295 GGLDGYFWIQRGTNMCGIDHDASASQAK 322
>gi|221221056|gb|ACM09189.1| Cathepsin B precursor [Salmo salar]
gi|221222300|gb|ACM09811.1| Cathepsin B precursor [Salmo salar]
Length = 207
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 97/208 (46%), Gaps = 18/208 (8%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
LV L + L S +D IN+ TW AG NF N+ Y+++ K
Sbjct: 9 LVSGLSVSWAWPRLPPLSHQMVDYINKANTTWKAGPNF-HNVDYSYVKRLCGTLLK---- 63
Query: 65 SDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
G + +Y+ V PD FD R+QWPNC T+ + D G+C + F A A S
Sbjct: 64 ------GPKLPTMVQYAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117
Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
DR CI S + + +S+E + SCC C C+ G W+F G VTGG Y
Sbjct: 118 DRVCIHSNAKVSVEISSEDLLSCCDSC----GMGCNGGYPSAAWDFWTTEGLVTGGLYDS 173
Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQK 210
GC+P +I PC HH + T P C ++
Sbjct: 174 HVGCRPYSIPPCEHHVNG-TRPPCTGEE 200
>gi|239793652|dbj|BAH72931.1| ACYPI000018 [Acyrthosiphon pisum]
Length = 239
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 91/197 (46%), Gaps = 9/197 (4%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ +L + + + Y +ID IN A TW AG NF + +E+ + L +K
Sbjct: 4 VLMLLSVIFVSFYLTEQAYFLQKDFIDNINERATTWKAGVNFDPDTPKEHFLKML--GSK 61
Query: 61 YFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
++ KT+D Y +P FDAR +W C TIG V D G C + A
Sbjct: 62 GVQIPNKHNIHMYKTHDAAYDNLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMAT 121
Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
AF+DR C+ + N LS E + CC C + C+ G + W KRG VTG
Sbjct: 122 SSAFADRLCVATNTDFNELLSAEEITFCCHSCGF----GCNGGYPIKAWERFKKRGLVTG 177
Query: 178 GDYGDRTGCQPSTISPC 194
GDY GC+P + PC
Sbjct: 178 GDYQSGEGCEPYRVPPC 194
>gi|239793607|dbj|BAH72912.1| ACYPI000019 [Acyrthosiphon pisum]
Length = 188
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/160 (38%), Positives = 81/160 (50%), Gaps = 5/160 (3%)
Query: 185 GCQPSTISPCSHHGSAPTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
GCQP TI PC P SC + C +C NP Y F D ++ +
Sbjct: 31 GCQPYTIPPCKLMNEKPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYKGK---YYK 87
Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY-LHSGKLIGWGTENG 302
+ K+I +GP T F +Y D YKSGVY++ + + + +HS K+ GWG ENG
Sbjct: 88 LSPYMAMKDIFDNGPITTQFYMYRDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWGEENG 147
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
PYWLV N++G WG GT KI RG C F+ + AG P
Sbjct: 148 VPYWLVANSFGTDWGYNGTFKISRGNDGCFFQEKMYAGLP 187
>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
Length = 469
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 120/272 (44%), Gaps = 39/272 (14%)
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
P F A WP I D C A F+ +DR I S+GQ LS + + S
Sbjct: 223 PVFFAATYAWPE--WIHDPLDQRNCGASWAFSTASVAADRIAIHSEGQITDNLSVQNLIS 280
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD--RTGCQPSTISPC---SHHGS 199
C + C+ G++ W +L G V+ Y + +PS + C S +G
Sbjct: 281 C----DTRNQHGCNGGNIDSAWRYLKTHGVVSYACYPSFWKKHLEPSGENHCYVSSEYGK 336
Query: 200 APTLPSCEN--QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
T C N +K +L +R Y V E I KEI+ G
Sbjct: 337 NYTNGPCPNALEKSNRL-------------------YRCASHYRVSSKETNIMKEIMDKG 377
Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT---ENG--TPYWLVINTW 312
P A +Y+DF+ YK G+Y+H+ A + HS KL+GWG +NG +W+ N+W
Sbjct: 378 PVQAIMKVYEDFFLYKEGIYRHSQKAGSKWKTHSVKLLGWGALADKNGQKQKFWIAANSW 437
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA--GKP 342
G WG+ G +ILRG+ EC E LI A G+P
Sbjct: 438 GKSWGENGYFRILRGQNECDIEKLILATSGQP 469
>gi|157058771|gb|ABV03143.1| cathepsin B-16D [Aulacorthum solani]
Length = 201
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 61/185 (32%), Positives = 86/185 (46%), Gaps = 7/185 (3%)
Query: 14 VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP--LPG 71
V + Y +I+ IN +A TW AG NF N +E+ + L + +
Sbjct: 1 VTEQAYFLQRDFIENINEQATTWKAGVNFDPNTPKEHFLKLLGSKGVQIPNLNNINLYKT 60
Query: 72 DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
D YD + +P FDAR +W +C TIG V D G C + A AF+DR C+ + G
Sbjct: 61 DDAAYDNLF-GLIPRHFDARRKWRHCQTIGKVRDQGNCGSCWAMATSSAFADRLCVATNG 119
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
N LS E + CC C + C G + W +K G VTGG+Y GC+P +
Sbjct: 120 DFNELLSAEEITFCCHTCGF----GCHGGYPIKAWKRFNKHGLVTGGNYNSGEGCEPYRV 175
Query: 192 SPCSH 196
PC +
Sbjct: 176 PPCPY 180
>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
Length = 466
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 93/333 (27%), Positives = 145/333 (43%), Gaps = 56/333 (16%)
Query: 25 YIDQINREANTWTAGRNFPA----NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
YI+QIN + WTA +P L+E +R K F + DR + + +
Sbjct: 173 YINQINSAQSLWTA-TEYPEYEDFTLAELNMRSGRPTVPKSFAGPRLRMKRDRLSRNSDE 231
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
P +FD R N + V + GAC + + F+++ + R + SK R +S +
Sbjct: 232 FIYFPKQFDWRNV-SNVNYVSPVRNQGACGSCYAFSSMAMYEARLRVLSKNSVKRVMSPQ 290
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
V SC + + C+ G + + G YG+ G + P ++G
Sbjct: 291 DVVSCSEYA-----QGCAGGFPY-----------LIAGKYGEDFGLVEESCFP--YNGKD 332
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD-----NEDAIKKEILA 255
E K K KC +H TT Y+V NE + +E++
Sbjct: 333 ------EPCKETKSKCR---------------RHSTTNYYYVGGFYGACNEYLMMRELVK 371
Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLEN----YLHSGKLIGWGTE--NGTPYWLVI 309
+GP + +F +Y DF HYK G+Y+HT N H+ L+G+GT+ +G YW+V
Sbjct: 372 NGPISISFEVYGDFKHYKGGIYQHTGLGDSYNPWQITNHAVLLVGYGTDQKSGKDYWIVK 431
Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
N+WG WG+ G +ILRG EC+ E A P
Sbjct: 432 NSWGTKWGENGFFRILRGVDECSIENEAVAVTP 464
>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
gallopavo]
Length = 467
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 114/259 (44%), Gaps = 25/259 (9%)
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
P+ F A WP+ I D C A F+ +DR I S GQ LS + + S
Sbjct: 223 PEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRIAIHSDGQITDNLSVQNLIS 280
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
C + C G++ W +L G V+ C PS H +P+
Sbjct: 281 C----DTKNQHGCGGGNIEGAWRYLKTHGVVS-------YACYPSFWK---HSLDSPSEN 326
Query: 205 SC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
C + + K + C N +R Y + E I +EI+A GP A
Sbjct: 327 HCYVSSEYGKNHTNGPCPNALEDSNRL---YRCASHYRISSKETDIMEEIMAKGPVQAIM 383
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG-----TPYWLVINTWGPHWGD 318
+Y+DF+ YK G+Y+H+ A + HS KL+GWG+ G +W+ N+WG +WG+
Sbjct: 384 KVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGE 443
Query: 319 RGTVKILRGKYECAFEYLI 337
G +ILRG+ EC E LI
Sbjct: 444 NGYFRILRGQNECDIEKLI 462
>gi|159115721|ref|XP_001708083.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157436192|gb|EDO80409.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 305
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 74/254 (29%), Positives = 111/254 (43%), Gaps = 37/254 (14%)
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
A PDR D R+ P C D C+ + FA +GA S RRCI Q LS ++
Sbjct: 79 AGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATLGALSTRRCIAKLDPQAVSLSVQH 136
Query: 142 VASCCKICRYDDNKS-CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SC D ++ C G +W FL G+V ++ C P T G
Sbjct: 137 MVSC------DSGEAGCQGGEFESSWAFLETEGAV-------KSDCLPYTSGETGKSGEC 183
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
PT +C++ + H + + + R + N + I +LA GP
Sbjct: 184 PT--TCQDGTPVESAFHYKAASAS----------RLS-------NYNEIMVSLLADGPVQ 224
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +++DF +Y G+Y L H+ ++G+G+ N YW+V N+WG WG+ G
Sbjct: 225 TGFYVHEDFLYYVGGIYHKVYGTSLGG--HAVLIVGYGSMNNHDYWIVRNSWGSDWGENG 282
Query: 321 TVKILRGKYECAFE 334
+ILRG EC E
Sbjct: 283 YFRILRGTNECGIE 296
>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 363
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 121/271 (44%), Gaps = 44/271 (16%)
Query: 65 SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
+ RP + + P A P+ +D RE++P+C I V D G+C + F+++ F+D
Sbjct: 126 TSRPTITMKHSTKPPVGA--PESYDFREEYPHC--ITEVVDQGSCGSCWAFSSIQTFADH 181
Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
RC S +YV C + + C+ G +NFLH G+V
Sbjct: 182 RCRSGLDATGVSYSVQYVLDCDR-----KDHGCNGGEPVNAFNFLHNTGTV--------- 227
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
++ C + + + V K C +C + + +
Sbjct: 228 ------LTSCVEYTAG-------DDAVVKF-CPQKCDDGSAVENIVATSGAKS------- 266
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGT 303
+ +LAHGP ATF + DF +YKSGVY+H L H+ +++G+G T++G
Sbjct: 267 --GSAIDVLLAHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGG--HAVEIVGYGVTDSGL 322
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
YW V N+WGP WG+ G +I+RG EC E
Sbjct: 323 DYWTVRNSWGPDWGEDGYFRIVRGGDECGIE 353
>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 86/315 (27%), Positives = 138/315 (43%), Gaps = 54/315 (17%)
Query: 21 FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
+++ ++ +N + ++TW A +PA++ +FL Y + + ++D +
Sbjct: 10 LAESIVETVNNDPSSTWVA-VEYPASVITR--AKFLARLGTYVTKYEE------TSFDLD 60
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +P+ FD+REQWP G I V D +C + F+ DR IK G +S
Sbjct: 61 NA--LPENFDSREQWP--GKILPVRDQASCGSCWAFSVAETMGDRLSIK--GCDFGDMSP 114
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SC + C+ G + W + G T C P S G
Sbjct: 115 QDLVSCDTT-----DMGCNGGYMDHAWAWTKSHGITT-------EKCMPYQ----SGSGR 158
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
P P+ +C N G ++K + + N + +E+ +GP
Sbjct: 159 VPACPA-------------KCVN---GSAIVRNKSVS----YKKLNAQQMMEELYENGPI 198
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
+ F +Y DF +YKSGVY H + H+ +GWG E+ TPYWL N+WGP WG++
Sbjct: 199 SVAFTVYYDFMNYKSGVYVHKTGGIAGG--HAVLCVGWGVEDNTPYWLCQNSWGPAWGEK 256
Query: 320 GTVKILRGKYECAFE 334
G KILRG C E
Sbjct: 257 GHFKILRGSNHCGIE 271
>gi|161343853|tpg|DAA06107.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 217
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 87/157 (55%), Gaps = 5/157 (3%)
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+++ +P FD+R++WPNC +IGH+ + G C + + AA A SDR CI+S G +N +S
Sbjct: 57 FTSGLPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSA 116
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SCC +C + C GS+F +W++ + G V+GGDY GCQP TI PC
Sbjct: 117 QQIISCCYLCGH----GCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNE 172
Query: 200 APTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQDKHR 235
P SC + C +C NP Y F D ++
Sbjct: 173 KPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYK 209
>gi|253747738|gb|EET02294.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 305
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 71/254 (27%), Positives = 109/254 (42%), Gaps = 37/254 (14%)
Query: 82 ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
A PDR D R+ P C D C+ + FA +GA S RRCI PLS ++
Sbjct: 79 ADSPDRLDYRQTHPEC--FFEPEDQSDCSCCYAFATLGALSTRRCIAKLDASVVPLSAQH 136
Query: 142 VASCCKICRYDDNKSCSHGSVFRT-WNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SC D ++ G F T W FL G++ C P G
Sbjct: 137 MVSC------DHGEAGCQGGGFNTSWAFLETEGAIM-------RDCLPYVSGETGLSGEC 183
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
PT +C++ + H + + ++ + N + I +L GP
Sbjct: 184 PT--TCQDGTLLNDTIHYKAVSASHLK-----------------NYNEIMTSLLNEGPVQ 224
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +++DF +Y G+Y T + + H+ ++G+G+ N YW+V N+WG WG+ G
Sbjct: 225 TGFYVHEDFLYYVGGIYHKTYGSSIGG--HAVLIVGYGSMNNHDYWIVRNSWGSDWGENG 282
Query: 321 TVKILRGKYECAFE 334
+ILRG EC E
Sbjct: 283 YFRILRGTNECGIE 296
>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 282
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 87/316 (27%), Positives = 143/316 (45%), Gaps = 56/316 (17%)
Query: 21 FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG-DRKTYDP 78
+++ ++ +N + ++TW A +PA++ I AK+ + + + +TY+
Sbjct: 10 LAESIVETVNNDPSSTWVA-VEYPASV---------ITRAKFLARLGTHVEEYEERTYES 59
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P+ FDAREQWP I V D +C + F+ DR I G+ + +S
Sbjct: 60 DNA--LPENFDAREQWPE--QILPVRDQASCGSCWAFSVAETMGDRLSIIGCGRGH--MS 113
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ + SC + C+ G + + W + G VT + C P S G
Sbjct: 114 PQDLVSC-----DTTDMGCNGGYMDKAWAWTKSHG-VTNEE------CMPYQ----SGGG 157
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
P P+ +C N G + K ++ + +++E+ +GP
Sbjct: 158 RVPACPA-------------KCVN---GSTIVRTKSQSFTHF----TASQMQQELYENGP 197
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
+ F +Y DF +YKSGVY H + H+ IGWG E+ TPYWL N+WGP WG+
Sbjct: 198 LSVAFTVYYDFMNYKSGVYVHKTGGVAGG--HAVLCIGWGVEDNTPYWLCQNSWGPAWGE 255
Query: 319 RGTVKILRGKYECAFE 334
+G KILRG C E
Sbjct: 256 KGHFKILRGSNHCGIE 271
>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
[Acyrthosiphon pisum]
Length = 129
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 77/132 (58%), Gaps = 16/132 (12%)
Query: 219 RCTNPTYGRGF--FQDKHRTT-----LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYH 271
RCT YG + D HR T LTY +I+K++L +GP A+F +YDDF
Sbjct: 3 RCTRMCYGNQDLDYDDDHRFTRDFYYLTY------GSIQKDVLNYGPIEASFDVYDDFPS 56
Query: 272 YKSGVYKHTSNA-KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYE 330
YKSGVY+ T NA KL H+ KLIGWG E GTPYWL++N+W WGD G KI RG E
Sbjct: 57 YKSGVYQRTPNATKLGG--HAVKLIGWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGTDE 114
Query: 331 CAFEYLIAAGKP 342
C + AG P
Sbjct: 115 CRIDSATTAGVP 126
>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
Length = 260
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 56/115 (48%), Positives = 70/115 (60%), Gaps = 3/115 (2%)
Query: 231 QDKHRTTLTYWVDDN-EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
+DKH Y + E I+ EI+ +GP A+F +Y DF HY SGVYK +KL
Sbjct: 147 EDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGG- 205
Query: 290 HSGKLIGWGTENGT-PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
H+ ++IGWG ENGT PYWLV N+W WGD+G KI RGK EC E I AG P+
Sbjct: 206 HAVRIIGWGIENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 260
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 6/107 (5%)
Query: 5 LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
L ++ CT + EL SD YI+Q+N + W AGRNF + S +++ L
Sbjct: 8 LAAVVSCTFAQPELDFLSDEYIEQLNSKNLPWKAGRNFERDTSLYNIQRLLSVG------ 61
Query: 65 SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAA 111
+ P + + +P+ FDAR+QW C +I + D C +
Sbjct: 62 TINPPSEFETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGS 108
>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
Length = 404
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 81/323 (25%), Positives = 136/323 (42%), Gaps = 53/323 (16%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
S+ ++ +N++ TW A +P +E+ L+ LI F PL +Y +
Sbjct: 131 MSEDLVNDVNQQGTTWRA-TTYP-EFNEKKLKDGLIYKLGTF-----PLNVTVISYSKD- 182
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
PD FDAR +W G I + D C + + DR I+S G +N +S++
Sbjct: 183 -GQYPDEFDARREWY--GYISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQ 239
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SC + C+ G++ ++F+ G V+
Sbjct: 240 TLLSC----HLKGQRGCNGGNLDIAFDFVKTHGLVS------------------------ 271
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
Q P T+C R ++R + + + ED I +I+ GP
Sbjct: 272 -------EQCFPYEGAVTQCRIGNDCR-----RYRVGVPFSISKEED-IMYDIMTSGPAL 318
Query: 261 ATFALYDDFYHYKSGVYKHTSNA-KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
+Y DF+HY+ G+Y+HT + +L LHS +++GWG + YW+V N+WG WG++
Sbjct: 319 GIMTVYQDFFHYREGIYRHTRHGDQLMRGLHSVRIVGWGEDAEDKYWIVANSWGTSWGEK 378
Query: 320 GTVKILRGKYECAFEYLIAAGKP 342
G +I RG E + P
Sbjct: 379 GYFRIARGHSGTGIESSVLTVLP 401
>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
Length = 171
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 92/179 (51%), Gaps = 9/179 (5%)
Query: 107 GACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTW 166
G+C A F A A SDR CI S G+ + +S+E + +CC C C+ G W
Sbjct: 1 GSCWA---FGAAEAISDRLCIHSNGKVSVEISSEDLLACCDSC----GMGCNGGYPSAAW 53
Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
+F G V+GG Y GC+P TI PC HH + T P C + +C +C + Y
Sbjct: 54 DFWTDVGLVSGGLYDSHVGCRPYTIPPCEHHVNG-TRPPCTGEGGDTPQCILQCES-GYT 111
Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL 285
+ DKH +Y V +E+ I+ EI +GP F +Y+DF YK+GVY+H + + +
Sbjct: 112 PSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAV 170
>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 88/188 (46%), Gaps = 9/188 (4%)
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
AV A +DR CI S + +S + SCC+ C + C G R W+F + G V
Sbjct: 1 GAVEAMTDRLCIHSNATIKKHISATDLLSCCESCGF----GCHGGFPPRAWDFWMENGLV 56
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
TGG + +GC+ CSHHG P C C C P + DK
Sbjct: 57 TGGSKENPSGCRSYPFPRCSHHGKG-KYPPCPKTIFDTPNCVDHCDKPDID--YAADKTH 113
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
+Y V NE I KEI+ +GP A F +Y+DF YKSG+Y H+ L H+ +++
Sbjct: 114 AKSSYNVQSNERVIMKEIMRNGPVEAAFMVYEDFIEYKSGIYFHSHGKLLGG--HAIRML 171
Query: 296 GWGTENGT 303
GWG E G
Sbjct: 172 GWGEEKGV 179
>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 85/315 (26%), Positives = 138/315 (43%), Gaps = 54/315 (17%)
Query: 21 FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
+++ ++ +N + ++TW A +PA++ +FL Y + + ++D +
Sbjct: 10 LAESIVETVNNDPSSTWVA-VEYPASVITR--AKFLARLGTYVTKYEE------TSFDLD 60
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+ +P+ FD+REQWP G I V D +C + F+ DR IK G ++
Sbjct: 61 NA--LPENFDSREQWP--GKILPVRDQASCGSCWAFSVAETMGDRLSIK--GCDYGDMAP 114
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
+ + SC + C+ G + W + G T C P S G
Sbjct: 115 QDLVSCDTT-----DMGCNGGYMDHAWAWTKSHGVTT-------EKCMPYQ----SGSGR 158
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
P P+ +C N G ++K + + N + +E+ +GP
Sbjct: 159 VPACPA-------------KCVN---GSAIVRNKSVS----YKKLNAQQMMEELYENGPI 198
Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
+ F +Y DF +YKSGVY H + H+ +GWG E+ TPYWL N+WGP WG++
Sbjct: 199 SVAFTVYYDFMNYKSGVYVHKTGGIAGG--HAVLCVGWGVEDNTPYWLCQNSWGPAWGEK 256
Query: 320 GTVKILRGKYECAFE 334
G KILRG C E
Sbjct: 257 GHFKILRGSNHCGIE 271
>gi|350540002|ref|NP_001232104.1| putative cathepsin B variant 2 precursor [Taeniopygia guttata]
gi|197129221|gb|ACH45719.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 261
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 97/207 (46%), Gaps = 15/207 (7%)
Query: 2 IHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
+ +L L+ R Y SD ++ IN+ TW AG NF N Y+++
Sbjct: 5 VSLLCVLVALANARSIPYFPPLSDDLVNHINKLNTTWKAGHNF-HNADMSYVKKLC---G 60
Query: 60 KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
+ P + D +PD FD+R QWPNC TI + D G+C + F AV
Sbjct: 61 TFLGGPKLP-----ERVDFAADVELPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVE 115
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
A SDR C+ + + + +S E + SCC ++ C+ G W + +RG V+GG
Sbjct: 116 AISDRICVHTNAKVSVEVSAEDLLSCCG---FECGMGCNGGYPSGAWRYWTERGLVSGGL 172
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSC 206
Y GC+P +I PC HH + T P C
Sbjct: 173 YDSHVGCRPYSIPPCEHHVNG-TRPPC 198
>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
Length = 349
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 147/333 (44%), Gaps = 64/333 (19%)
Query: 23 DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
+A+I IN+ A TW AG++ ++ ++ A+ P P R +Y + S
Sbjct: 50 EAFIQLINKYAKTWQAGKS-------KFFEGKRLSHARRLIGLGLPTPEQRASYPKKNSL 102
Query: 83 -----------------TVPDRFDAR--EQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
+PD ++A + C + + + C + F+ +D
Sbjct: 103 MMGEEANSLEKYLVKMDALPDSYNAANDSNYYMCQQLHRIRNQEQCGSCWAFSISEMVAD 162
Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
R CI ++G+ N +S +++ SC DN C+ G + F+ G V+
Sbjct: 163 RFCIGTRGKINTIMSPQWMVSC----DTADN-GCNGGEFPTAFQFVETTGLVS------- 210
Query: 184 TGCQPSTISPCSHHGSAPTLP-SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
GC P S +G P P SC N + ++ T+ + R F + ++
Sbjct: 211 DGCVPYQ----SGNGFVPPCPNSCANGEDINVRYRTKNS-----RNFDVNDMKS------ 255
Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TEN 301
++ ILA+GP + F +Y DFY+Y+SG YKH + + H+ K++GWG T++
Sbjct: 256 ------VQASILANGPVISGFKVYRDFYNYRSG-YKHVAGGLVGG--HAIKVVGWGVTQS 306
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
PYW+V N+W WG G ILRG EC+ E
Sbjct: 307 NVPYWIVANSWSDEWGMNGYFWILRGTNECSIE 339
>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 105
Score = 106 bits (265), Expect = 1e-20, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 69/101 (68%), Gaps = 2/101 (1%)
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
Y + ++ AI+K+I+ +GP AT+ +Y+DF HY+SG+YKH + K LH+ K+IGWG
Sbjct: 4 YQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRK--TGLHAVKVIGWGE 61
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
E GTPYW+V N+W WG+ G ++ RG +C FE +AAG
Sbjct: 62 EKGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 102
>gi|48762483|dbj|BAD23811.1| cathepsin B-S [Tuberaphis takenouchii]
Length = 155
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 55/158 (34%), Positives = 85/158 (53%), Gaps = 8/158 (5%)
Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL 214
K C G + W + +G TGGDY + GC P I PC T C + + +
Sbjct: 3 KGCEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNT---CAGKPLER- 58
Query: 215 KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
+ +C YG Q +++ Y V ++ + ++++++ +GP A+F L+DD YKS
Sbjct: 59 --NHQCPKTCYGSTTVQKRYKVKNEY-VLNSPNTMEQDLIKYGPIEASFNLFDDLSAYKS 115
Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
G+Y+ T AK + HS K+IGWG ENG PYWL +N+W
Sbjct: 116 GIYQKTPKAKFLS-GHSIKIIGWGKENGVPYWLAVNSW 152
>gi|255087666|ref|XP_002505756.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
gi|226521026|gb|ACO67014.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
Length = 273
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 75/256 (29%), Positives = 113/256 (44%), Gaps = 25/256 (9%)
Query: 84 VPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ FDAR +WP C IG D G C + A SDR CI+S G+ + LS +
Sbjct: 18 LPESFDARTKWPTCAHLIGVARDQGNCGSCWAMAPAEVMSDRACIQSGGEIDAELSPFQL 77
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
+C + + C G + F G VTGG + D+ C P +PC H
Sbjct: 78 LACAQ-----GSFGCEGGESADAYEFAKSNGVVTGGGFDDQNTCAPYPFAPCHHPCEVFP 132
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
P+C P G+ F+ K + + + EI +GP ++
Sbjct: 133 TPAC-----PATCVGGSNDGVQNGKASFKVKAIVDCPSF---DYGCVANEIYHNGPVSSY 184
Query: 263 FA-LYDDFYHYKSGVYKHTSNAKLENYLHSG---KLIGWGT------ENGTPYWLVINTW 312
+Y++FY YKSGV++ + + H G K+IGWG E YW+V+N+W
Sbjct: 185 AGDIYEEFYAYKSGVFRESPSVAQRGANHGGHVVKVIGWGKADPAKGEGEGYYWIVVNSW 244
Query: 313 GPHWGDRGTVKILRGK 328
+WGD G +I G+
Sbjct: 245 -LNWGDDGVGRIAVGE 259
>gi|294891885|ref|XP_002773787.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878991|gb|EER05603.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 234
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/176 (35%), Positives = 84/176 (47%), Gaps = 11/176 (6%)
Query: 161 SVFRTWNFLHKRGSVTGG-----DYGDRTGCQPSTISPCSH-HGSAPTLPSC-ENQKVPK 213
S F NF + ++G + G+ GC P C+H G P C + + +P
Sbjct: 12 SAFNRRNFRFESFKLSGEYKPPEELGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRDLPA 71
Query: 214 LKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYK 273
C T C N YG +D HR + + IK+EI +GP A LY+DF YK
Sbjct: 72 --CATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEIFDNGPVAAMMTLYEDFRFYK 129
Query: 274 SGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKY 329
SGVY H + L H+ KLIGWG E+G YWL +N W WGD G +K+ Y
Sbjct: 130 SGVYVHKTGQMLA--AHTLKLIGWGVESGQEYWLAVNAWNEEWGDHGMIKLASSVY 183
>gi|147902366|ref|NP_001080511.1| cathepsin C precursor [Xenopus laevis]
gi|33417162|gb|AAH56109.1| Ctsc protein [Xenopus laevis]
Length = 458
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/338 (25%), Positives = 144/338 (42%), Gaps = 49/338 (14%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGR--NFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
++ +Y ++ ++ QIN +WTA + E+ +R+ A + RP P
Sbjct: 158 MLTSRVYNYNHDFVKQINTVQKSWTASVYPEYEGMSIEDLVRR---AGGRNSRIPVRPRP 214
Query: 71 GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
T D +Y +P+ +D R + V + G+C + + FA++G R I+S+
Sbjct: 215 APMPT-DQKYQG-LPNEWDWR-NIAGFNFVSPVRNQGSCGSCYAFASMGMLESRIQIQSQ 271
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
Q LS + V SC ++ C G + + G Y + G
Sbjct: 272 LSQKPILSPQQVVSCSNY-----SQGCDGGFPY-----------LIAGKYLNDFGI---- 311
Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
E P + + CT + ++ ++ ++ NE +K
Sbjct: 312 ---------------VEESDFPYIGSDSPCTLKDSYQRYYTAEYHYVGGFYGGCNEAYMK 356
Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL----HSGKLIGWGT--ENGTP 304
E++ GP + F +YDDF HY+SGVY HT N H+ L+G+GT + G
Sbjct: 357 LELVLGGPLSVAFEVYDDFIHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEK 416
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
YW+V N+WG WG++G +I RG ECA E + + P
Sbjct: 417 YWIVKNSWGESWGEKGFFRIRRGSDECAIESIAVSANP 454
>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 288
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 129/317 (40%), Gaps = 62/317 (19%)
Query: 36 WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDR--KTYDPEYSATVPDRFDAREQ 93
W AG N E + DA + L D P+ + ++P ++ E+
Sbjct: 25 WVAGEN-------ERFKGMTFKDASVISGNAHKLRPDTIPLARPPKINISIPMSYNFTER 77
Query: 94 WPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL--STEYVASCCKICRY 151
+P C V D G C + FA +FS R C K N+P+ S ++ +C +
Sbjct: 78 FPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRK----YNKPVLFSQSHLVACDR---- 127
Query: 152 DDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKV 211
N C G W ++ RG CQP + +
Sbjct: 128 -RNSGCGGGIEVNAWRYIDLRGL-------PLDSCQPY------------------DGNI 161
Query: 212 PKLKCHTRCTN--PTYGRGFFQDKHRTTLTYWVDDNEDAIKKE---ILAHGPTTATFALY 266
K C +CTN TY F + YW +I++ I+ GP T + +Y
Sbjct: 162 TKYNCSKKCTNESETYEAQFTE--------YWSVARYASIEEMQIGIMTEGPVTTSLKVY 213
Query: 267 DDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILR 326
D +YKSG+Y HT L + H+ ++IGWGT+NG YW++ N+W WG G I R
Sbjct: 214 SDLMYYKSGIYTHTKGEFLGH--HAVEIIGWGTKNGIDYWIISNSWNTTWGMNGLFLIKR 271
Query: 327 GKYECAFEYLIAAGKPK 343
G EC E + AGK K
Sbjct: 272 GVNECHIEDYVCAGKVK 288
>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
Length = 573
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/266 (27%), Positives = 112/266 (42%), Gaps = 35/266 (13%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDA + WP +G D G C + + SDR I SKG++ L+ + +
Sbjct: 296 LPSHFDAADHWPR--LVGEARDQGWCGSSWALSTTTMASDRFAILSKGREQVQLAPQQLL 353
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
+C + ++CS G + W +L + G V Y I + G
Sbjct: 354 ACVR-----RQQACSGGHLDTAWQYLRRVGVVNDECYPYIAAKNQCKI----NDGDTLVS 404
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
+CE R P Y +NE I EI G A
Sbjct: 405 ANCELPANVNRTAMYR-MGPAYSL----------------NNETDIMTEIKERGTVQAIL 447
Query: 264 ALYDDFYHYKSGVYKHTSNA---KLENYLHSGKLIGWGTE----NGTPYWLVINTWGPHW 316
+Y DF+ Y++G+Y+H++ A + + HS +LIGWG E + YW+ +N+WG W
Sbjct: 448 RVYRDFFSYQNGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDMVKYWIAVNSWGTWW 507
Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
G+ G +ILRG EC E + A P
Sbjct: 508 GENGRFRILRGTNECEIESYVLASNP 533
>gi|189308076|gb|ACD86922.1| cysteine protease [Caenorhabditis brenneri]
Length = 228
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 73/234 (31%), Positives = 104/234 (44%), Gaps = 10/234 (4%)
Query: 1 MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
++ + L LV + K +A + +N + + W A P +++ E +++ L+
Sbjct: 4 VVFASLVALATGLVIPIVPKTPEAITEYVNSKQSLWKA--EIPKHITIEQVKKRLMRTEF 61
Query: 61 YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
S P T+P FDAR QWP+C +I ++ D C + FAA A
Sbjct: 62 VAPHS----PDAEFVKHDIQEDTIPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEA 117
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
SDR CI S G N LS E V SCC C Y C G W +L K G TGG Y
Sbjct: 118 ASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCEGGYPINAWKYLVKSGFCTGGSY 173
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
+ GC+P +++PC T P+C C +CTN Y + DKH
Sbjct: 174 EAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPACVNKCTNSNYNVAYKDDKH 227
>gi|308811264|ref|XP_003082940.1| cysteine proteinase (ISS) [Ostreococcus tauri]
gi|116054818|emb|CAL56895.1| cysteine proteinase (ISS) [Ostreococcus tauri]
Length = 362
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 126/281 (44%), Gaps = 43/281 (15%)
Query: 74 KTYDP-----EYSATVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCI 127
KT+DP +PD FD RE+WP C + D GAC + A A +DR CI
Sbjct: 73 KTWDPTKIKLHAGGRLPDTFDVREKWPKCAALVSEAVDQGACGSCWAVAPAKAMTDRLCI 132
Query: 128 KSKGQQNRPLSTEYVASCCKICR----YDDNKSCSHGSVF-----RTWNFLHKRGSVTGG 178
+ G N +S + SC YD+N + G + H+ G V+GG
Sbjct: 133 ATNGAVNTHVSAIQLLSCNSHSNSAYTYDENLAGGSGGCMGGYPTEAYETAHRVGVVSGG 192
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL--KCHTRCTNPTYGRGFFQDKHRT 236
GD+ C P +PC H P P+ N P+ + T+ N T R
Sbjct: 193 LNGDQDTCMPYPFAPCHH----PCEPN-HNAVCPRTCQRSATQTANTT----------RY 237
Query: 237 TLTYWVD---DNEDAIKKEILAHGPTTATFA--LYDDFYHYKSGVYKHTSNAKLENYLHS 291
+ + V ++ D + EI GP T TF +YD+FY Y+ GVYK + + H
Sbjct: 238 AVGHLVQCGLNDYDCMASEIFERGPVT-TFVGDVYDEFYQYERGVYKLSKDPAARGKNHG 296
Query: 292 G---KLIGWG-TENGTPYWLVINTWGPHWGDRGTVKILRGK 328
G ++IGWG + G YW V N+W +WG+RG +I G+
Sbjct: 297 GHVMEVIGWGKSAEGVRYWKVYNSW-LNWGERGYGEIAVGE 336
>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 296
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 75/272 (27%), Positives = 121/272 (44%), Gaps = 44/272 (16%)
Query: 64 QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
+ RP R + P A P+ +D R+++P+C I V D G+C + F+++ F+D
Sbjct: 58 NTSRPAITRRHSTKPPVGA--PESYDFRDEYPHC--ITEVVDQGSCGSCWAFSSIQTFAD 113
Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
RC S +YV C + + C+ G + ++FLH G+V
Sbjct: 114 HRCRSGLDATGVSYSVQYVLDCDR-----KDHGCNGGEPTKAFDFLHSTGTV-------- 160
Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
++ C + + V K C C + + F +
Sbjct: 161 -------LTSCVDYTAGA-------DNVVKF-CPKTCDDGSAVENVFAASGSKS------ 199
Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENG 302
+ +L+HGP ATF + DF +YKSGVY+H L H+ +++G+G T++G
Sbjct: 200 ---GSAIDVLLSHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGG--HAVEVVGYGVTDSG 254
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
YW V N+WGP WG+ G +I+RG EC E
Sbjct: 255 LDYWTVRNSWGPDWGEDGYFRIVRGSDECGIE 286
>gi|443687066|gb|ELT90166.1| hypothetical protein CAPTEDRAFT_138389 [Capitella teleta]
Length = 446
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 88/346 (25%), Positives = 142/346 (41%), Gaps = 58/346 (16%)
Query: 8 LLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLI----ADAKYFD 63
L+ + ++ +YK + YI Q+N ++TW A + EY LI +
Sbjct: 146 LIDESQMKSSVYKPNPDYIRQLNEASSTWKA------TIYAEYEGMHLIDLHRRNGGSRS 199
Query: 64 QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGA 120
+ P G K + +P+ +D W N + V + G C + + F+++
Sbjct: 200 RVSSPGRGLLKEETKMAAVNLPESWD----WRNVDGVDFVSPVRNQGGCGSCYAFSSMAM 255
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
R + S Q S + + CC+ ++ C G + + GG Y
Sbjct: 256 NEARIRVMSNNTQMPVFSPQDIVDCCQY-----SQGCDGGFPY-----------LVGGKY 299
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
+ G + P E++K C R + ++R Y
Sbjct: 300 AEDFGLVDESCDPYVG----------EDRKCKSTSCSRR----------YATRYRYVGGY 339
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL----HSGKLIG 296
+ NE +K L GP + +F +YDDF HYKSGVY+H+ N H+ L+G
Sbjct: 340 YGACNEQEMKLA-LQRGPLSVSFMVYDDFMHYKSGVYRHSGLTDKYNPFEITNHAVLLVG 398
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+G + GT YW+V N+WG WG+ G +ILRG ECA E + P
Sbjct: 399 YGADEGTKYWIVKNSWGKGWGEEGYFRILRGADECAIESIAVETFP 444
>gi|161343881|tpg|DAA06121.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 182
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 59/148 (39%), Positives = 77/148 (52%), Gaps = 11/148 (7%)
Query: 53 QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAA 111
Q LI Y D L +RKT+D Y +P FDAR+ + NC IG V D G CA+
Sbjct: 39 QKLIQKTNY----DSWLKKNRKTFDINYKTDIPKEFDARQYFFNCANVIGDVKDQGNCAS 94
Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKS-CSHGSVFRTWNFLH 170
A F+DR CI + G + LS + + SC DD KS C+ GS F+ W F+
Sbjct: 95 SWAVAVASTFTDRLCIATNGTFTQNLSAQNLMSCG-----DDEKSGCNGGSAFKAWEFIT 149
Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+G VTGG++ GCQP PC H+G
Sbjct: 150 GKGIVTGGNFDSNEGCQPYKNRPCDHYG 177
>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 463
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 81/258 (31%), Positives = 114/258 (44%), Gaps = 28/258 (10%)
Query: 88 FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
+DARE W N I D G C A V +DR I SK + LS +++ SC
Sbjct: 199 YDAREVWGN--YISSPIDQGWCGASWAITTVQVTTDRFGIMSKRAISDVLSPQHLLSCNN 256
Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
+ + + C G + R WN++ K G +T Y P S +P +
Sbjct: 257 L----NQQGCQGGHLTRAWNWIRKFGLITEECY------------PWQGRMSTCAVPKKK 300
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
+ + + R N R HR Y V E+ I EIL GP A +
Sbjct: 301 KETMAQCPSRVRSNND---RTTKTRLHRVGPVYRVA-TEEGIMHEILTSGPVQAVMKVSR 356
Query: 268 DFYHYKSGVYKHTSNAK-LENYLHSGKLIGWGTE----NGTPYWLVINTWGPHWGDRGTV 322
DF+ YKSGVYK ++ A HS +++GWG E YW+ N+WG WG+ G
Sbjct: 357 DFFMYKSGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYWIASNSWGSWWGENGYF 416
Query: 323 KILRGKYECAFE-YLIAA 339
+IL+G EC E ++IAA
Sbjct: 417 RILKGVDECEIEDFVIAA 434
>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
Length = 350
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 97/344 (28%), Positives = 142/344 (41%), Gaps = 33/344 (9%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R L+ S ++ IN+ AG NF + YLR+ + +S
Sbjct: 24 LLVLASAGSRTYLHPLSKXLVNYINKPNTMQQAGHNF-HKMXISYLRR---PCGTFPGRS 79
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P R + + + +P+ FD EQWP+ + D G+ A+ A SD
Sbjct: 80 KLP---QRVKFAXDIN--LPESFDPXEQWPD-XPXREIRDQGSYGFCWALGALEAISDWI 133
Query: 126 CIK-----SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
CI ++G + +S E +C +C C+ G WNF +G V+GG Y
Sbjct: 134 CIHPNVGGAQGGNHVEVSAEDKLTC--LC----GDGCNGGXPNEGWNFWTGKGLVSGGLY 187
Query: 181 GDRTGCQ--PSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
GC+ PS + PC HH P PK C C G+ + DKH
Sbjct: 188 DSHVGCRLFPSLL-PCKHHIHG--XPYVXTGDSPK--CSMTCEP---GQTYKXDKHYGCS 239
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y + D+ I I + F++Y DF YK Y+ + H+ ++G
Sbjct: 240 SYSISDSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEMXGG--HAICILGCK 297
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
EN T YWLV N W WGD G KILRG+ E + A P
Sbjct: 298 VENSTSYWLVANXWNRDWGDNGFFKILRGQDHYGIESEVVAEIP 341
>gi|67867504|gb|AAH98085.1| Unknown (protein for MGC:107782) [Xenopus (Silurana) tropicalis]
Length = 458
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 143/341 (41%), Gaps = 55/341 (16%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGR--NFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
++ LY ++ ++ QIN +WTA + E+ +R+ A + RP P
Sbjct: 158 MLTSRLYNYNHDFVKQINEVQKSWTATAYPEYEGMTIEDLIRR---AGGRNSRIPMRPRP 214
Query: 71 GDRKTYDPEYSATVPDRFDAREQWPNCGT---IGHVPDTGACAAPHIFAAVGAFSDRRCI 127
T D +Y +P +D W N + V + +C + + F+++G R I
Sbjct: 215 APLPT-DEKYQG-LPTEWD----WRNIAGYNFVTPVRNQASCGSCYAFSSMGMLESRIQI 268
Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ 187
+S+ Q LS + V SC ++ C G + + G Y G
Sbjct: 269 RSQLSQKPILSPQQVVSCSNY-----SQGCEGGFPY-----------LIAGKYVSDYGI- 311
Query: 188 PSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED 247
E +P + CT + ++ ++ ++ NE
Sbjct: 312 ------------------VEESDLPYTGSDSPCTLKDSQQKYYTAEYHYVGGFYGGCNEA 353
Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL----HSGKLIGWGT--EN 301
+K E++ GP + F +YDDF HY+SGVY HT N H+ L+G+GT +
Sbjct: 354 YMKLELVLGGPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQT 413
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G YW+V N+WG WG++G +I RG ECA E + + +P
Sbjct: 414 GEKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAVSAEP 454
>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
thaliana]
Length = 183
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/180 (37%), Positives = 87/180 (48%), Gaps = 23/180 (12%)
Query: 166 WNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNP 223
W + G VT Y D TGC SH G PT P+ KC +C +
Sbjct: 4 WLYFKYHGVVTQECDPYFDNTGC--------SHPGCEPTYPT--------PKCERKCVSR 47
Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
G + KH Y ++ + I E+ +GP F +Y+DF HYKSGVYK+ +
Sbjct: 48 NQLWG--ESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGT 105
Query: 284 KLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
K+ H+ KLIGWGT ++G YWL+ N W WGD G KI RG EC E + AG P
Sbjct: 106 KIGG--HAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLP 163
>gi|48762489|dbj|BAD23814.1| cathepsin B-N [Tuberaphis takenouchii]
Length = 163
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 89/169 (52%), Gaps = 18/169 (10%)
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
AF+DR CI + G+ N LS E +A CC C + C G + W + K G VTGGD
Sbjct: 5 AFADRLCIATDGEFNELLSAEELAFCCHKCGF----GCHGGYPIKAWEWFKKHGLVTGGD 60
Query: 180 YGDRTGCQPSTISPC--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKH 234
Y GCQP + PC +G+ +C + K + RCT YG F +D H
Sbjct: 61 YDSGEGCQPYRVPPCPLDEYGNN----TCRGKPAEK---NHRCTRMCYGNQELDFKEDHH 113
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
T Y++ I+K+++A+GP A+F +YDDF +YKSGVY T NA
Sbjct: 114 WTRDAYYL--TYTTIQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENA 160
>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
Length = 197
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 96/202 (47%), Gaps = 13/202 (6%)
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRG 173
++ A SD C++S +S + SCC I C Y C G + ++ +
Sbjct: 5 VSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGY----GCQGGWSIEAYKWMQRER 60
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLP---SCENQKVPKLKCHTRCTNPTYGRGFF 230
+ DR C+P + P G+ P P C P KC C Y + +
Sbjct: 61 CCYRWENTDRRVCKP--VRPSIRVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYY-KSYQ 117
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
+DKH T Y++ +NE +I++EI +GP A F +Y DF +YK G+Y H + H
Sbjct: 118 EDKHFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTG--AH 175
Query: 291 SGKLIGWGTENGTPYWLVINTW 312
+ K++GWG EN T YWL+ N+W
Sbjct: 176 AVKVVGWGRENATDYWLIANSW 197
>gi|428168267|gb|EKX37214.1| hypothetical protein GUITHDRAFT_78289 [Guillardia theta CCMP2712]
Length = 224
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/256 (30%), Positives = 111/256 (43%), Gaps = 41/256 (16%)
Query: 86 DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASC 145
D +DA E++ +C D +C + + FAA +S R C ++ GQ N LS + + SC
Sbjct: 4 DEYDASERFSSCKAF-TPKDQKSCGSCYAFAAAAVYSARLCAQTGGQFNIDLSPQQIVSC 62
Query: 146 CKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPS 205
+ N CS G+ T+ ++ G V G C P G A
Sbjct: 63 ------NSNDGCSGGNAIDTFEQMYTSGRVPGW-------CMPYLAKDVGGGGPA----- 104
Query: 206 CENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
C C+ P Y + + + DN I+ EIL++GP A F
Sbjct: 105 ----------CSDVCSLGPDY-------SVKASSLGVIQDNVRQIQSEILSNGPVFAAFW 147
Query: 265 LYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGT--ENGTPYWLVINTWGPHWGDRG 320
+Y DF Y GVY + A + H+ ++GWGT E G YWL+ N+W WGD+G
Sbjct: 148 VYSDFMAYTGGVYSASKEALAQGKTGGHAVMMVGWGTDKETGQDYWLLQNSWSEKWGDKG 207
Query: 321 TVKILRGKYECAFEYL 336
KI RG EC E L
Sbjct: 208 RFKIKRGVDECGIESL 223
>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
Length = 289
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/236 (32%), Positives = 109/236 (46%), Gaps = 40/236 (16%)
Query: 88 FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
FDAR +W C + + D C + F+A SDR CI S G + LS EY+ C
Sbjct: 87 FDARTKWGKC--VHPIRDQQQCGSCWAFSASEVLSDRFCIASNGSVDVVLSPEYMLQCDS 144
Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
+ C G + W FL G + D+ C P T S +G +
Sbjct: 145 T-----DYGCDGGYLNNAWAFLAGTGIPS-----DK--CDPYT----SGNGDVGS----- 183
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
C T CT+ + + + +DD I+K+I A+GP A F++Y
Sbjct: 184 --------CPTSCTDGSAIKLYKAKSSSVAQLSSIDD----IQKDIQANGPVQAAFSVYQ 231
Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENG--TPYWLVINTWGPHWGDRG 320
DF+ YKSGVY+H S + H+ K++GWG T +G TPYW+V N+W +WG G
Sbjct: 232 DFFSYKSGVYRHVSGSLAGG--HAIKIVGWGVTSDGKDTPYWIVANSWNTNWGQEG 285
>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 156
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 71/130 (54%), Gaps = 2/130 (1%)
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+A P C ++ + + C T C N +Y QD HR + + IK+EI +G
Sbjct: 16 AASQYPKCPSEALSQPACQTECINESYKTSLQQDLHRAKSWGRLPTSPQKIKQEIFDNGT 75
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
++Y+DF YKSGVY HT+ + +HS K+IGWG E+G YWL +N+W WGD
Sbjct: 76 VLGVISMYEDFRLYKSGVYVHTTGGLVG--VHSLKIIGWGVESGQDYWLAVNSWNEEWGD 133
Query: 319 RGTVKILRGK 328
G +K+ G+
Sbjct: 134 HGMIKLAVGE 143
>gi|294876288|ref|XP_002767632.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239869318|gb|EER00350.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 97
Score = 102 bits (254), Expect = 3e-19, Method: Composition-based stats.
Identities = 47/96 (48%), Positives = 64/96 (66%), Gaps = 4/96 (4%)
Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
D IKKEI+ +GPT+AT ++Y+DF Y+SGVYKHTS + +HS ++IGWG E G YW
Sbjct: 3 DNIKKEIMTNGPTSATLSMYNDFLSYESGVYKHTSGTFMG--VHSVEIIGWGIEKGVDYW 60
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
LV+N+W WGD GT KI +G +C ++ P
Sbjct: 61 LVMNSWNEDWGDNGTFKIAQG--DCGINDMVLGAPP 94
>gi|294931810|ref|XP_002780018.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239889821|gb|EER11813.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 131
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/98 (52%), Positives = 68/98 (69%), Gaps = 3/98 (3%)
Query: 231 QDKHRTTLTY-WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
+D+H T ++ + D IKKEI+ +GPT+A+F+ Y+DF YKSGVYKHTS L +
Sbjct: 12 RDRHFTARALPYLFEGTDNIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGD-- 69
Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
HS ++IGWGTE G YWLV+N+W WGD GT KI +G
Sbjct: 70 HSVEIIGWGTEKGVDYWLVMNSWNEGWGDHGTFKIAQG 107
>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 309
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 89/322 (27%), Positives = 132/322 (40%), Gaps = 56/322 (17%)
Query: 22 SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---LPGDRKTYDP 78
+ A + QI W AG P E L+ D K + P +P +
Sbjct: 17 TQAKLRQIQALGPIWKAG--IP-----ERLKNLTETDFKRLVSAKDPRGQIPTLHLIHTY 69
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
E +PD FD RE++P C I V D G C++ + V AF RRC+ Q+ S
Sbjct: 70 ESEDPIPDHFDFREEYPQC--ITEVIDMGTCSSSWAHSPVEAFGHRRCMNGVDQEATRYS 127
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG-----SVTGGDYGDRTGCQPSTISP 193
+Y+ SC + G +W+F+ G V DY D+T S
Sbjct: 128 AQYILSCATT----NGCLAFPGQGVVSWDFIATTGIPLESCVKYTDY-DKTESSYPCPSL 182
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C+ + S S + G GF N + +++ I
Sbjct: 183 CNDNSSLVLYKS----------------DGYEGVGF---------------NPEKLRRAI 211
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
GP A F +Y+DF +Y G+Y H YL S +++G+GT + G YW+V N W
Sbjct: 212 ALRGPMQAMFTVYEDFAYYLEGIYSHVYGGT-AGYL-SVEIVGYGTSDEGQDYWIVKNYW 269
Query: 313 GPHWGDRGTVKILRGKYECAFE 334
G +WG+ G +I+RG+ EC E
Sbjct: 270 GSNWGEDGYFRIVRGQNECQIE 291
>gi|48762487|dbj|BAD23813.1| cathepsin B-N [Tuberaphis taiwana]
Length = 163
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 86/169 (50%), Gaps = 18/169 (10%)
Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
AF+DR CI + G+ N LS E +A CC C + CS G R W K G VTGG+
Sbjct: 5 AFADRLCIATDGEFNELLSAEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGN 60
Query: 180 YGDRTGCQPSTISPC--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKH 234
Y GCQP + PC +G+ +C + K + RCT YG F +D H
Sbjct: 61 YDSGEGCQPYRVPPCPLDEYGNN----TCRGKPAEK---NHRCTRMCYGNQDLDFKEDHH 113
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
T Y++ I+ +ILA+GP A+F +YDDF YKSGVY NA
Sbjct: 114 YTRDAYYL--TYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENA 160
>gi|395815757|ref|XP_003781389.1| PREDICTED: dipeptidyl peptidase 1 [Otolemur garnettii]
Length = 575
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 134/340 (39%), Gaps = 58/340 (17%)
Query: 16 GELYKFSDAYIDQINREANTWTAGRNFPANLSEEY----LRQFLIADAKYFDQSDRPLPG 71
G LYK++ ++ IN +WTA + EY LR+ + + + RP P
Sbjct: 277 GRLYKYNHNFVKAINAMQKSWTA------TVYMEYETLTLREMIRRSGGHGQRVPRPKPV 330
Query: 72 DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIK 128
+ +P +D W N + +V + +C + + FA+VG R I
Sbjct: 331 ALTAEIQKKILHLPASWD----WRNVHGVNYVSPVRNQESCGSCYSFASVGMLEARIRIL 386
Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
+ Q LS + V SC + + C G + + G + G
Sbjct: 387 TNNTQTPILSPQEVVSCSQYA-----QGCEGGFPY-----------LVAGKHAQDFGL-- 428
Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
E P CT R ++ ++ ++ NE
Sbjct: 429 -----------------VEEACFPYTGTDAPCTMKEGCRRYYSSEYHYVGGFYGGCNEAL 471
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTEN--G 302
+K E++ HGP F +YDDF HY G+Y HT E H+ L+G+GT++ G
Sbjct: 472 MKLELVHHGPMAVAFEVYDDFLHYHRGIYHHTGLTDPFNPFELTNHAVLLVGYGTDSATG 531
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
YW+V N+WG WG+ G +I RG ECA E + A P
Sbjct: 532 IQYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATP 571
>gi|405963121|gb|EKC28721.1| Tubulointerstitial nephritis antigen-like protein [Crassostrea
gigas]
Length = 464
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 80/261 (30%), Positives = 116/261 (44%), Gaps = 37/261 (14%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAR W + I V D CA+ F+ V +DR I+S+G LS +++
Sbjct: 193 LPIHFDARINWTS--WIHPVRDQKNCASSWAFSTVDVAADRLAIESEGLLTNQLSPQHLV 250
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + C GS + W F+ +RG +T C P T S T
Sbjct: 251 SCNT---GRGQRGCRGGSTEKAWWFVKRRGIIT-------EECYPYTASDGECLDGETTC 300
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P+ N K+ + T Y V +E+ IK EI +GP ATF
Sbjct: 301 PN-ANSSTAKIVLYV------------------TPPYRVRQDEEDIKAEIYRNGPVQATF 341
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-----TENGTPYWLVINTWGPHWGD 318
+ DF+ Y+SGVY+HT A L S ++IGWG YW+ +N+WG WG+
Sbjct: 342 RVSSDFFMYRSGVYRHT-GADLGESRLSVRIIGWGEKTNKKGKKRKYWICLNSWGTKWGE 400
Query: 319 RGTVKILRGKYECAFEYLIAA 339
+G +I+RG+ E + A
Sbjct: 401 KGAFRIVRGENHLGIEENVLA 421
>gi|294952605|ref|XP_002787373.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239902345|gb|EER19169.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 185
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 51/121 (42%), Positives = 71/121 (58%), Gaps = 4/121 (3%)
Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
Q VP C T CTN Y + +D HR V ++ +IK+EI +GP ++F +Y+
Sbjct: 55 QQPVPP--CRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEIFDNGPVLSSFKMYE 112
Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
DF +YKSGVY T+ K + HS K+IGWG +G YWL +N+W WGD G +K+ G
Sbjct: 113 DFRYYKSGVYVPTT--KESSTSHSIKIIGWGGASGREYWLAVNSWNEEWGDHGLIKMAFG 170
Query: 328 K 328
K
Sbjct: 171 K 171
>gi|12330244|gb|AAG52659.1| cysteine proteinase [Metagonimus yokogawai]
Length = 183
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 98/190 (51%), Gaps = 10/190 (5%)
Query: 117 AVGAFSDRRCIKS-KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
AV + SDR CI S + + N LS + SCC C + C G + W++ G V
Sbjct: 1 AVTSMSDRVCIHSNQNKTNVQLSARDLLSCCTSCGF----GCVGGWIGDAWDYWRDNGIV 56
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKV-PKLKCHTRCTNPTYGRGFFQDKH 234
TGGDY D++ C P P H S T Q + P C ++C G + +DK
Sbjct: 57 TGGDYQDKSTCLPYPFPPSHHLVSKGTPFEIYPQTLYPTPPCVSKCQEGYPGE-YEKDKI 115
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y +D N I+KEIL +GP A +Y DF +YK+GVY+HT+ L H+ +L
Sbjct: 116 FALSSYKIDRNATEIQKEILINGPVEAGMNVYADFPNYKTGVYQHTTGEILGG--HAIRL 173
Query: 295 IGWG-TENGT 303
+GWG T++GT
Sbjct: 174 LGWGKTKDGT 183
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.137 0.455
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,192,513,516
Number of Sequences: 23463169
Number of extensions: 283793301
Number of successful extensions: 505548
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4652
Number of HSP's successfully gapped in prelim test: 1385
Number of HSP's that attempted gapping in prelim test: 493140
Number of HSP's gapped (non-prelim): 8303
length of query: 344
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 201
effective length of database: 9,003,962,200
effective search space: 1809796402200
effective search space used: 1809796402200
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)