BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018877
(349 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
Length = 339
Score = 288 bits (736), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 155/351 (44%), Positives = 211/351 (60%), Gaps = 36/351 (10%)
Query: 10 WMW---CCLQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 66
W+W CCL + ++ + H L D ++ VN+ W+A N F N V K
Sbjct: 3 WLWASLCCLLALGD---ARSRPSFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYLK 56
Query: 67 HLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 123
L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCWA
Sbjct: 57 RLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWA 110
Query: 124 FGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 181
FGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 111 FGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSG 170
Query: 182 E-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 227
C PY C H P C TPKC + C + ++ KHY ++
Sbjct: 171 GLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 228 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 287
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 288 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 338
+G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 289 NGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
Length = 339
Score = 286 bits (731), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 154/345 (44%), Positives = 208/345 (60%), Gaps = 33/345 (9%)
Query: 13 CCLQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV- 71
CCL A+ ++ + H L D ++ VN+ W+A N F N V K L G
Sbjct: 9 CCLLALAD---ARSRPSFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDVSYLKKLCGTF 62
Query: 72 --KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 129
P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCWAFGAVEA
Sbjct: 63 LGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEA 116
Query: 130 LSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE----- 182
+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 117 ISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESH 176
Query: 183 --CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 233
C PY C H P C TPKC + C + ++ KHY ++Y +++
Sbjct: 177 VGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235
Query: 234 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 293
DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW
Sbjct: 236 ERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294
Query: 294 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 338
++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
Length = 339
Score = 284 bits (726), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 153/345 (44%), Positives = 207/345 (60%), Gaps = 33/345 (9%)
Query: 13 CCLQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV- 71
CCL A ++ + H L D ++ VN+ W+A N F N + K L G
Sbjct: 9 CCLLVLAN---ARSRPSFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGTF 62
Query: 72 --KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 129
P P ++ + LKLP SFDAR WPQC TI I DQG CGSCWAFGAVEA
Sbjct: 63 LGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEA 116
Query: 130 LSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE----- 182
+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 117 ISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESH 176
Query: 183 --CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 233
C PY C H P C TPKC + C + ++ KHY ++Y +++
Sbjct: 177 VGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235
Query: 234 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 293
+DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294
Query: 294 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 338
++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
Length = 339
Score = 276 bits (705), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 148/325 (45%), Positives = 195/325 (60%), Gaps = 30/325 (9%)
Query: 27 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 86
K SH L D +I +N+ W+A RN F N + K L G +LG P
Sbjct: 20 KPSSHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGT-------VLGGPNLP 69
Query: 87 H----DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 140
+ + LP+SFDAR W C TI++I DQG CGSCWAFGAVEA+SDR CIH +
Sbjct: 70 ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129
Query: 141 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 188
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 189 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 247
S P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249
Query: 248 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 307
E +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 308
Query: 308 YFKIKRGSNECGIEEDVVAGLPSSK 332
+FKI RG N CGIE ++VAG+P ++
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRTQ 333
>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
Length = 335
Score = 266 bits (681), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 148/347 (42%), Positives = 198/347 (57%), Gaps = 35/347 (10%)
Query: 11 MWCCLQTFAEGVV---SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 67
MW L T + V+ ++ L L D ++ +N+ W A N F N + K
Sbjct: 1 MWRLLATLSCLVLLTSARESLHFQPLSDELVNFINKQ-NTTWTAGHN--FYNVDLSYVKK 57
Query: 68 LLGVKPTPKGLLLGVPVKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWA 123
L G LG P + LPKSFDAR WP C TI I DQG CGSCWA
Sbjct: 58 LCGT-------FLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWA 110
Query: 124 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 181
FGAVEA+SDR CI +N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+
Sbjct: 111 FGAVEAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSG 170
Query: 182 E-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 227
C PY C H P C TPKC + C ++ KH+ S+
Sbjct: 171 GLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSS 229
Query: 228 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 287
Y I+ + ++IMAEIYKNGPVE +FTVY DF YKSGVY+H+TGD+MGGHA++++GWG +
Sbjct: 230 YSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-E 288
Query: 288 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 334
+G YW++ N WN WG +G+FKI RG + CGIE ++VAG+P + +
Sbjct: 289 NGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPCTPHF 335
>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
Length = 339
Score = 266 bits (679), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 191/320 (59%), Gaps = 30/320 (9%)
Query: 31 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 86
H L D +I +N+ W+A RN F N + K L LG P + G
Sbjct: 24 HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75
Query: 87 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 144
+ + LP++FDAR W C TI +I DQG CGSCWAFGAVEA+SDR CIH +N+ +
Sbjct: 76 --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 145 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 192
S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNG 193
Query: 193 SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 251
S P C TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKNGPVE +F
Sbjct: 194 SRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAF 253
Query: 252 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 311
TV+ DF YKSGVYKH GD+MGGHA++++GWG ++G YW+ AN WN WG +G+FKI
Sbjct: 254 TVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKI 312
Query: 312 KRGSNECGIEEDVVAGLPSS 331
RG N CGIE ++VAG+P +
Sbjct: 313 LRGENHCGIESEIVAGIPRT 332
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
SV=1
Length = 340
Score = 261 bits (666), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 142/330 (43%), Positives = 195/330 (59%), Gaps = 20/330 (6%)
Query: 15 LQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 74
L TF E +S L D II +NE+P AGW+A ++ +F + + + + +
Sbjct: 11 LITFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLDDARIQ-MGARREE 69
Query: 75 PKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDR 133
P P H D ++++P +FD+R WP C +I+ I DQ CGSCW+FGAVEA+SDR
Sbjct: 70 PDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDR 129
Query: 134 FCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CD 184
CI G N+ LS DLL CC CG GC+GG AW Y+V G+VT C+
Sbjct: 130 SCIQSGGKQNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCE 188
Query: 185 PY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDI 237
PY T +P C Y TP+C + C +K + + KH S+Y + +D + I
Sbjct: 189 PYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDEKAI 248
Query: 238 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 297
EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG ++ YW++AN
Sbjct: 249 QKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-ENKTPYWLIAN 307
Query: 298 QWNRSWGADGYFKIKRGSNECGIEEDVVAG 327
WN WG +GYF+I RG +EC IE +V+AG
Sbjct: 308 SWNEDWGENGYFRIVRGRDECSIESEVIAG 337
>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
Length = 340
Score = 257 bits (657), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 187/319 (58%), Gaps = 35/319 (10%)
Query: 33 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 88
L ++ +N+ G +A N F N + K L G LG P
Sbjct: 26 LSSDLVNHINKLNTTG-RAGHN--FHNTDMSYVKKLCGT-------FLGGPKAPERVDFA 75
Query: 89 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 147
+ + LP +FD R WP C TIS I DQG CGSCWAFGAVEA+SDR C+H +S+ V+
Sbjct: 76 EDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSA 135
Query: 148 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCE----- 198
DLL+CCGF CG GC+GGYP AWRY+ G+V+ Y GC + P CE
Sbjct: 136 EDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGL--YDSHVGCRAYTIPPCEHHVNG 193
Query: 199 -------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 250
TP+C R C + ++ KHY I++Y + ++IMAEIYKNGPVE +
Sbjct: 194 SRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGA 253
Query: 251 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 310
F VYEDF YKSGVY+H++G+ +GGHA++++GWG ++G YW+ AN WN WG G+FK
Sbjct: 254 FIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGITGFFK 312
Query: 311 IKRGSNECGIEEDVVAGLP 329
I RG + CGIE ++VAG+P
Sbjct: 313 ILRGEDHCGIESEIVAGVP 331
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
GN=CATB PE=2 SV=1
Length = 342
Score = 256 bits (655), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 138/333 (41%), Positives = 195/333 (58%), Gaps = 23/333 (6%)
Query: 15 LQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKP 73
L TF E V ++ L D +I +NE+P AGWKA ++ +F +++ + L+G +
Sbjct: 11 LFTFLEAHVTTRNNQRIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMGARK 68
Query: 74 TPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 131
+ V HD ++++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++
Sbjct: 69 EDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMT 128
Query: 132 DRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EE 182
DR CI G + LS DL++CC CGDGC GG+P AW Y+V G+VT
Sbjct: 129 DRICIQSGGGQSAELSALDLISCCK-DCGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTG 187
Query: 183 CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE 235
C PY T +P C Y TP+C + C K + + KHY +Y + ++ +
Sbjct: 188 CQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNEK 247
Query: 236 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 295
I +I GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + YW++
Sbjct: 248 VIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLI 306
Query: 296 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 328
AN WN WG G F++ RG +EC IE DVVAGL
Sbjct: 307 ANSWNEDWGEKGLFRMVRGRDECSIESDVVAGL 339
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
GN=cpr-6 PE=1 SV=1
Length = 379
Score = 255 bits (652), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 153/362 (42%), Positives = 200/362 (55%), Gaps = 51/362 (14%)
Query: 12 WCCLQTFAEGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKAARNPQFSNYTVGQF 65
+C E V+ K + +DS + D +I VNEN W A + +FS+
Sbjct: 15 YCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTAKKQRRFSS------ 67
Query: 66 KHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSAWPQCSTISRIL 113
+ G K L+GV KT D L +P+SFD+R WP+C +I I
Sbjct: 68 --VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDNWPKCDSIKVIR 125
Query: 114 DQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 171
DQ CGSCWAFGAVEA+SDR CI H + ++LS +DLL+CC CG GC+GG P++AWR
Sbjct: 126 DQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-CGFGCNGGDPLAAWR 184
Query: 172 YFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-- 213
Y+V G+VT Y + GC P CE YPTPKC +KCV
Sbjct: 185 YWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYT 242
Query: 214 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 273
++ + K + SAY + D E I E+ +GP+E++F VYEDF +Y GVY H G +
Sbjct: 243 DKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLG 302
Query: 274 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 333
GGHAVKLIGWG DDG YW +AN WN WG DG+F+I RG +ECGIE VV G+P +
Sbjct: 303 GGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNS 361
Query: 334 LV 335
L
Sbjct: 362 LT 363
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
GN=cpr-5 PE=2 SV=1
Length = 344
Score = 251 bits (642), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 129/258 (50%), Positives = 159/258 (61%), Gaps = 22/258 (8%)
Query: 93 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 150
+P FDAR WP C +I+ I DQ CGSCWAF A EA+SDR CI + +N LS DLL
Sbjct: 82 IPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141
Query: 151 ACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHP 195
+CC F CG+GC+GGYPI AW+++V HG+VT C PY + G P
Sbjct: 142 SCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWP 201
Query: 196 GC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 251
C E PTPKCV C KN + KH+ +AY + E I EI NGP+EV+F
Sbjct: 202 ACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAF 261
Query: 252 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 311
TVYEDF Y +GVY H G +GGHAVK++GWG D+G YW++AN WN +WG GYF+I
Sbjct: 262 TVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNVAWGEKGYFRI 320
Query: 312 KRGSNECGIEEDVVAGLP 329
RG NECGIE VAG+P
Sbjct: 321 IRGLNECGIEHSAVAGIP 338
>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
Length = 311
Score = 238 bits (606), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 134/290 (46%), Positives = 172/290 (59%), Gaps = 26/290 (8%)
Query: 49 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCS 107
W + QF N VGQ LLG K +P L +K++D +++P SF+A++ WP C+
Sbjct: 39 WVEEQTDQFDNIKVGQ---LLGFKRSPNRPKL--QIKSYDPLGVQIPTSFNAQTNWPNCT 93
Query: 108 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPI 167
TIS+I +Q CGSCWAFGA E+ +DR CIH N+ LS D++ C +GC+GG
Sbjct: 94 TISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDE--TDNGCEGGDAF 151
Query: 168 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRN 219
SAW + G V+EEC PY + P C PA TP C ++C + L +
Sbjct: 152 SAWNWLRKQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQ 205
Query: 220 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 279
KH Y +SD E IM EI NGPVE FTV+EDF YKSGVY H TG +GGH VK
Sbjct: 206 DKHKMAKIYSFDSD-EAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVK 264
Query: 280 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 329
L+G+GT +G DY+ NQW SWG +G F IKRG +CGI +DVVAGLP
Sbjct: 265 LVGFGTL-NGVDYYAANNQWTTSWGDNGTFLIKRG--DCGISDDVVAGLP 311
>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
GN=cpr-4 PE=2 SV=1
Length = 335
Score = 236 bits (603), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 186/315 (59%), Gaps = 25/315 (7%)
Query: 34 QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-KSLK 92
Q++I + VN ++ WKA P+ + T+ Q K L V V HD
Sbjct: 25 QEAITEYVNSK-QSLWKA-EIPK--DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINEDT 80
Query: 93 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 150
+P +FDAR+ WP C +I+ I DQ CGSCWAF A EA SDRFCI + +N LS D+L
Sbjct: 81 IPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140
Query: 151 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTG-CSHPGC 197
+CC CG GC+GGYPI+AW+Y V G T C PY ++ G + P C
Sbjct: 141 SCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSC 199
Query: 198 -EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 254
+ Y TP CV KC KN + KH+ +AY + I AEI +GPVE +FTVY
Sbjct: 200 PDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVY 259
Query: 255 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 314
EDF YK+GVY H TG +GGHA++++GWGT D+G YW++AN WN +WG +GYF+I RG
Sbjct: 260 EDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVNWGENGYFRIIRG 318
Query: 315 SNECGIEEDVVAGLP 329
+NECGIE VV G+P
Sbjct: 319 TNECGIEHAVVGGVP 333
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
PE=1 SV=2
Length = 329
Score = 236 bits (601), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 119/244 (48%), Positives = 153/244 (62%), Gaps = 12/244 (4%)
Query: 93 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 150
+P +FD+R+ W +C +I I DQ CGSCWAFGA E +SDR CI +S +DLL
Sbjct: 85 VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144
Query: 151 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 204
+CCG CG+GC+GGYPI A R++ GVVT C PY + C+ C P TP
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTP 202
Query: 205 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 263
C C + + KH+ +SAY + + I AEIY NGPVE +F+VYEDF YKSG
Sbjct: 203 SCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSG 262
Query: 264 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 323
VYKH G +GGHA+K+IGWGT + G YW++AN W +WG G+FKI RG ++CGIE
Sbjct: 263 VYKHTAGKYLGGHAIKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESA 321
Query: 324 VVAG 327
VVAG
Sbjct: 322 VVAG 325
>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
Length = 335
Score = 235 bits (599), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 201/341 (58%), Gaps = 33/341 (9%)
Query: 11 MWCCLQTFAEGVV---SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 67
MW L T + +V ++ L L D ++ VN+ WKA N F N + K
Sbjct: 1 MWRLLATLSCLLVLTSARSSLYFPPLSDELVNFVNKQ-NTTWKAGHN--FYNVDLSYVKK 57
Query: 68 LLGVKPTPKGLLLGVPVKTHDK---SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 124
L G +L G + D + LP+SFDAR WP C TI I DQG CGSCWAF
Sbjct: 58 LCGA------ILGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAF 111
Query: 125 GAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 182
GAVEA+SDR CIH +N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+
Sbjct: 112 GAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGG 171
Query: 183 -------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAY 228
C PY C H P C TPKC + C + ++ KH+ S+Y
Sbjct: 172 LYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSY 230
Query: 229 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 288
+ ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+H++G++MGGHA++++GWG ++
Sbjct: 231 SVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-EN 289
Query: 289 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 329
G YW++ N WN WG +G+FKI RG + CGIE ++VAG+P
Sbjct: 290 GTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
GN=cpr-3 PE=2 SV=1
Length = 370
Score = 232 bits (591), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 125/266 (46%), Positives = 160/266 (60%), Gaps = 22/266 (8%)
Query: 93 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 150
LP +FDAR WP C+TI I +Q CGSCWAFGA E +SDR CI +SV D+L
Sbjct: 92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151
Query: 151 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 204
+CCG CG GC GGY I A R++ G VT C PY S C P TP
Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTP 208
Query: 205 KCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHY 260
C C K + ++ KHY SAY++ + +I EIY GPVE S+ VYEDF HY
Sbjct: 209 SCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHY 268
Query: 261 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 320
KSGVY + +G ++GGHAVK+IGWG ++G DYW++AN W S+G G+FKI+RG+NEC I
Sbjct: 269 KSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQI 327
Query: 321 EEDVVAGLPSSKNLVKEITSADMFED 346
E +VVAG + K T ++ +ED
Sbjct: 328 EGNVVAG------IAKLGTHSETYED 347
>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
GN=AC-2 PE=2 SV=1
Length = 342
Score = 231 bits (588), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 128/324 (39%), Positives = 182/324 (56%), Gaps = 36/324 (11%)
Query: 28 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 87
L +++ + + EVN +P P F + ++ +K + L L V +
Sbjct: 38 LVAYLRRSQNLFEVNSDP--------TPDFE-------QKIMSIKYKHQKLNLMVK-EDP 81
Query: 88 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 145
D + +P S+D R W C+T I DQ +CGSCWA A+SDR CI +++S
Sbjct: 82 DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140
Query: 146 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 196
D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C H G
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199
Query: 197 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 250
C PTP C RKC +++R K Y AY + + I +EI KNGPV S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVAS 259
Query: 251 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 310
F VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W++AN W+ WG GYF+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWGEKGYFR 318
Query: 311 IKRGSNECGIEEDVVAGLPSSKNL 334
I RGSN+CGIE + AG+ +++L
Sbjct: 319 IVRGSNDCGIEGTIAAGIVDTESL 342
>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
GN=AC-1 PE=2 SV=1
Length = 342
Score = 229 bits (583), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 115/264 (43%), Positives = 159/264 (60%), Gaps = 20/264 (7%)
Query: 88 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 145
D + +P S+D R W C+T I DQ +CGSCWA A+SDR CI +++S
Sbjct: 82 DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140
Query: 146 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 196
D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C H G
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199
Query: 197 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 250
C PTP C RKC +++R K Y AY + + I +EI +NGPV S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVAS 259
Query: 251 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 310
F VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W++AN W+ WG GYF+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWGEKGYFR 318
Query: 311 IKRGSNECGIEEDVVAGLPSSKNL 334
I RG+N+CGIE + AG+ +++L
Sbjct: 319 IIRGTNDCGIEGTIAAGIVDTESL 342
>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
GN=CP-1 PE=3 SV=3
Length = 341
Score = 211 bits (538), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 111/251 (44%), Positives = 152/251 (60%), Gaps = 19/251 (7%)
Query: 93 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 150
+P+S+D R W CS++ I DQ +CGSCWA + A+SDR CI + +S D++
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 151 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCEPAY-- 201
+CC + CGDGC+GG+PISA+R+ GVVT C PY + C H G E Y
Sbjct: 151 SCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGE 208
Query: 202 -----PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 256
TP+C R+C+ S Y AY++ + + I +I KNGPV ++TVYED
Sbjct: 209 CVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYED 268
Query: 257 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 316
FAHY+SG+YKH G G HAVK+IGWG + G YWI+AN W+ WG +G+F++ RGSN
Sbjct: 269 FAHYRSGIYKHKAGRKTGLHAVKVIGWG-EEKGTPYWIVANSWHDDWGENGFFRMHRGSN 327
Query: 317 ECGIEEDVVAG 327
+CG EE + AG
Sbjct: 328 DCGFEERMAAG 338
>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
Length = 299
Score = 186 bits (471), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 115/289 (39%), Positives = 151/289 (52%), Gaps = 24/289 (8%)
Query: 44 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 103
NP+ WKA +F T + LL K VP T + + P SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQAPDSFDFREEY 84
Query: 104 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 159
P C I ++DQG CGSCWAF +V ++ DR C G++ + S +++C GD
Sbjct: 85 PHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDM 138
Query: 160 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 219
CDGG+ S WR+ G T+EC PY G A T C KC + L
Sbjct: 139 ACDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHL 189
Query: 220 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 279
K Y + D IM + GP++ +FTVY DF +Y+SGVY+H G V GGHAV
Sbjct: 190 YKATKAVDYGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVD 247
Query: 280 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 328
++G+GT DDG DYWI+ N W WG DGYF+I R +NECGIEE V+ G
Sbjct: 248 MVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 296
>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
Length = 300
Score = 172 bits (435), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 149/289 (51%), Gaps = 23/289 (7%)
Query: 44 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 103
NP+ WKA +F T + LL K P T +P+SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGAAPRGTFTDKDDVPESFDFREEY 85
Query: 104 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 159
P C I ++DQG CGSCWAF +V DR C+ G++ + S +++C GD
Sbjct: 86 PHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSCDH---GDM 139
Query: 160 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 219
C+GG+ + W++ G T+EC PY + C PT KC +
Sbjct: 140 ACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHL 190
Query: 220 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 279
+ S Y + D +M + +GP++V+F V+ DF +Y+SGVY+H G + GGHAV+
Sbjct: 191 ATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVE 248
Query: 280 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 328
++G+GT DDG DYWI+ N W WG DGYF++ RG N+C IEE AG
Sbjct: 249 MVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297
>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
SV=1
Length = 476
Score = 166 bits (419), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 116/337 (34%), Positives = 164/337 (48%), Gaps = 43/337 (12%)
Query: 32 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 88
++Q +I+ VN+ GW A QF T+ + FK+ LG P P LLL + T
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLT 212
Query: 89 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 146
K+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 KTTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270
Query: 147 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 200
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329
Query: 201 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 257
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384
Query: 258 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 305
+YK+G+Y+HIT HAVKL GWGT E +WI AN W +SWG
Sbjct: 385 FNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGE 444
Query: 306 DGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 342
+GYF+I RG NE IE+ ++A ++TSAD
Sbjct: 445 NGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474
>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
Length = 303
Score = 164 bits (416), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 104/287 (36%), Positives = 148/287 (51%), Gaps = 27/287 (9%)
Query: 49 WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 103
WKA +F N T +F+ +L ++P G L + + + + +P FD R +
Sbjct: 31 WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89
Query: 104 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 160
PQC + LDQG CGSCWAF A+ DR C G++ +S S L++C L G
Sbjct: 90 PQC--VKPALDQGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144
Query: 161 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 220
CDGG W + G T EC Y D G A P P QL++
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197
Query: 221 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 279
+ +S S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALE 252
Query: 280 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 326
++G+GT+DDG DYWI+ N W WG +GYF+I RG NEC IE+++ A
Sbjct: 253 IVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299
>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
SV=3
Length = 476
Score = 164 bits (414), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 107/321 (33%), Positives = 156/321 (48%), Gaps = 34/321 (10%)
Query: 32 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 89
+++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVRSELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 90 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 147
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 148 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 200
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330
Query: 201 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 258
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V EDF
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385
Query: 259 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 306
HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +SWG +
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445
Query: 307 GYFKIKRGSNECGIEEDVVAG 327
GYF+I RG NE IE+ ++A
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466
>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
SV=1
Length = 435
Score = 149 bits (375), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 106/346 (30%), Positives = 169/346 (48%), Gaps = 45/346 (13%)
Query: 12 WCCLQTFAEGVVS-KLKLDS-HI--LQDS-----------IIKEVNENPKAGWKAARNPQ 56
W C G S K K+++ HI LQ++ +K +N K+ W A R +
Sbjct: 109 WACFTGTKMGTTSEKAKVNTKHIERLQENNSNRLYKYNYEFVKAINTIQKS-WTATRYIE 167
Query: 57 FSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQG 116
+ T+ +G + P+ + + H++ +LP S+D R+ + +S + +Q
Sbjct: 168 YETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPTSWDWRNV-RGTNFVSPVRNQA 226
Query: 117 HCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYF 173
CGSC+AF + L R I + LS ++++C + GC+GG+P + A +Y
Sbjct: 227 SCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQY--AQGCEGGFPYLIAGKYA 284
Query: 174 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 233
G+V E C PY G P C+P C R + +S++Y + + +
Sbjct: 285 QDFGLVEEACFPY---AGSDSP-CKPN----DCFR--------YYSSEYYYVGGFYGACN 328
Query: 234 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-S 286
+ E+ ++GP+ V+F VY+DF HY+ G+Y H + HAV L+G+GT S
Sbjct: 329 EALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDS 388
Query: 287 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 332
G DYWI+ N W WG DGYF+I+RG++EC IE VA P K
Sbjct: 389 ASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAATPIPK 434
>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
GN=Tinagl1 PE=1 SV=1
Length = 466
Score = 145 bits (367), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 109/351 (31%), Positives = 160/351 (45%), Gaps = 39/351 (11%)
Query: 5 IIRSNWMWCCLQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 64
+ + W C T EG + + ++ +IK +N GW+A + F T+ +
Sbjct: 113 VFGTYWDNCNRCTCHEGGHWECDQEPCLVDPDMIKAINRG-NYGWQAGNHSAFWGMTLDE 171
Query: 65 -FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 122
++ LG ++P+ + + + LP +F+A WP + I LDQG+C W
Sbjct: 172 GIRYRLGTIRPSSTVMNMNEIYTVLGQGEVLPTAFEASEKWP--NLIHEPLDQGNCAGSW 229
Query: 123 AFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 180
AF SDR IH M LS +LL+C GC GG AW + GVV+
Sbjct: 230 AFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVS 288
Query: 181 EECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISA 227
+ C P+ A PTP+C+ R+ + Q+ N + A
Sbjct: 289 DNCYPFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPA 342
Query: 228 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVK 279
YR+ SD ++IM E+ +NGPV+ V+EDF Y+ G+Y H G H+VK
Sbjct: 343 YRLGSDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVK 402
Query: 280 LIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 326
+ GWG T DG YW AN W WG G+F+I RG+NEC IE V+
Sbjct: 403 ITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 453
>sp|Q06544|CYSP3_OSTOS Cathepsin B-like cysteine proteinase 3 (Fragment) OS=Ostertagia
ostertagi GN=CP-3 PE=3 SV=1
Length = 174
Score = 145 bits (365), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 79/175 (45%), Positives = 104/175 (59%), Gaps = 17/175 (9%)
Query: 169 AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 214
AW+YF GVVT C PY + C G EP Y TPKC + C +
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59
Query: 215 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 273
+ ++ KH+ SAYR+ ++ + I +I KNGPV F VYEDFAHYKSG+YKH G +
Sbjct: 60 LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119
Query: 274 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 328
GGHAVK+IGWG + G YW++AN W+ WG G++++ RG N C IEE V AG+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173
>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
GN=Tinagl1 PE=2 SV=1
Length = 467
Score = 143 bits (360), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 103/324 (31%), Positives = 152/324 (46%), Gaps = 38/324 (11%)
Query: 32 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 89
++ ++IK +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 140 LVDPAMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQ 198
Query: 90 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 147
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256
Query: 148 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 207
+LL+C GC GG AW + GVV++ C P+ + A PTP+C+
Sbjct: 257 NLLSCDTHH-QKGCRGGRLDGAWWFLRRRGVVSDNCYPF-----SGREQNDEASPTPRCM 310
Query: 208 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 254
R+ + +Q+ N + YR+ SD ++IM E+ +NGPV+ V+
Sbjct: 311 MHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQALMEVH 370
Query: 255 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 302
EDF Y+ G+Y H G H+VK+ GWG T DG YW AN W
Sbjct: 371 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPW 430
Query: 303 WGADGYFKIKRGSNECGIEEDVVA 326
WG G+F+I RG NEC IE V+
Sbjct: 431 WGERGHFRIVRGINECDIETFVLG 454
>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
GN=TINAGL1 PE=1 SV=1
Length = 467
Score = 142 bits (359), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 106/345 (30%), Positives = 161/345 (46%), Gaps = 27/345 (7%)
Query: 5 IIRSNWMWCCLQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 64
++ + W C T E + + ++ +IK +N+ GW+A + F T+ +
Sbjct: 114 VLGTYWDNCNRCTCQENRQWQCDQEPCLVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDE 172
Query: 65 -FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 122
++ LG ++P+ + + + LP +F+A WP + I LDQG+C W
Sbjct: 173 GIRYRLGTIRPSSSVMNMHEIYTVLNPGEVLPTAFEASEKWP--NLIHEPLDQGNCAGSW 230
Query: 123 AFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 180
AF SDR IH M LS +LL+C GC GG AW + GVV+
Sbjct: 231 AFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVS 289
Query: 181 EECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSD 233
+ C P+ D G + P + + R+ N N+ Y ++ YR+ S+
Sbjct: 290 DHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSN 349
Query: 234 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG- 284
++IM E+ +NGPV+ V+EDF YK G+Y H + G H+VK+ GWG
Sbjct: 350 DKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGE 409
Query: 285 -TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 326
T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 410 ETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454
>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
Length = 463
Score = 142 bits (359), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/317 (31%), Positives = 157/317 (49%), Gaps = 47/317 (14%)
Query: 36 SIIKEVNENPKAGWKAARNPQFSNYTVGQF--------KHLLGVKPTPKGLLLGVPVKTH 87
+ +K +N K+ W A ++ T+G + + KPTP + +
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPTP------LTAEIQ 225
Query: 88 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LS 145
K L LP S+D R+ + +S + +Q CGSC++F +V L R I + + LS
Sbjct: 226 QKILHLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILS 284
Query: 146 VNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 204
++++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 285 SQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--------- 330
Query: 205 KCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 262
C K +R +S+++ + + + + E+ +GP+ V+F VY+DF HY++
Sbjct: 331 -----CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQN 385
Query: 263 GVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGS 315
G+Y H + HAV L+G+GT S G DYWI+ N W SWG DGYF+I+RG+
Sbjct: 386 GIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGT 445
Query: 316 NECGIEEDVVAGLPSSK 332
+EC IE VA P K
Sbjct: 446 DECAIESIAVAATPIPK 462
>sp|Q5RB02|CATC_PONAB Dipeptidyl peptidase 1 OS=Pongo abelii GN=CTSC PE=2 SV=1
Length = 463
Score = 139 bits (350), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)
Query: 36 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 91
+ +K +N K+ W A ++ T+G G P PK L ++ K L
Sbjct: 173 NFVKAINAIQKS-WTATTYKEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKVL 229
Query: 92 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 149
LP S+D R+ + +S + +Q CGSC++F ++ L R I + + LS ++
Sbjct: 230 HLPTSWDWRNI-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEV 288
Query: 150 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 208
++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330
Query: 209 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 266
C K +R +S+++ + + + + E+ +GP+ V+F VY+DF HYK G+Y
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389
Query: 267 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 319
H + HAV L+G+GT S G DYWI+ N W WG DGYF+I+RG++EC
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECA 449
Query: 320 IEEDVVAGLPSSK 332
IE VA P K
Sbjct: 450 IESIAVAATPIPK 462
>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
elegans GN=F26E4.3 PE=1 SV=3
Length = 452
Score = 137 bits (346), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 90/252 (35%), Positives = 125/252 (49%), Gaps = 18/252 (7%)
Query: 89 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 146
K +LP+ FDAR W I + DQG CGS W+ SDR I +N +LS
Sbjct: 180 KPRELPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSS 237
Query: 147 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 206
LL+C GC+GGY AW Y GVV + C PY S PG
Sbjct: 238 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 295
Query: 207 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 265
R+ ++ ++S + ++ Y+++S EDI E+ NGPV+ +F V+EDF Y GVY
Sbjct: 296 NRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 355
Query: 266 KH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRG 314
+H + G H+V+++GWG ++ YW+ AN W WG DGYFK+ RG
Sbjct: 356 QHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRG 415
Query: 315 SNECGIEEDVVA 326
N C IE V+
Sbjct: 416 ENHCEIESFVIG 427
>sp|P53634|CATC_HUMAN Dipeptidyl peptidase 1 OS=Homo sapiens GN=CTSC PE=1 SV=2
Length = 463
Score = 137 bits (345), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)
Query: 36 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 91
+ +K +N K+ W A ++ T+G G P PK L ++ K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229
Query: 92 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 149
LP S+D R+ + +S + +Q CGSC++F ++ L R I + + LS ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288
Query: 150 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 208
++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330
Query: 209 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 266
C K +R +S+++ + + + + E+ +GP+ V+F VY+DF HYK G+Y
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389
Query: 267 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 319
H + HAV L+G+GT S G DYWI+ N W WG +GYF+I+RG++EC
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 449
Query: 320 IEEDVVAGLPSSK 332
IE VA P K
Sbjct: 450 IESIAVAATPIPK 462
>sp|Q3ZCJ8|CATC_BOVIN Dipeptidyl peptidase 1 OS=Bos taurus GN=CTSC PE=2 SV=1
Length = 463
Score = 135 bits (339), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 154/316 (48%), Gaps = 47/316 (14%)
Query: 37 IIKEVNENPKAGWKAARNPQFSNYTV--------GQFKHLLGVKPTPKGLLLGVPVKTHD 88
+K +N K+ W AA ++ T+ G + + KP P + +
Sbjct: 174 FVKAINAIQKS-WTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPAP------ITAEIQK 226
Query: 89 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 146
K L LP S+D R+ + ++ + +QG CGSC++F ++ + R I + LS
Sbjct: 227 KILHLPTSWDWRNV-HGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSP 285
Query: 147 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 205
++++C + GC+GG+P + A +Y G+V E+C PY TG P
Sbjct: 286 QEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP---------- 330
Query: 206 CVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 263
C K +R +S+++ + + + + E+ GP+ V+F VY+DF HY+ G
Sbjct: 331 ----CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKG 386
Query: 264 VYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSN 316
VY H + HAV L+G+GT + G DYWI+ N W SWG +GYF+I+RG++
Sbjct: 387 VYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTD 446
Query: 317 ECGIEEDVVAGLPSSK 332
EC IE +A P K
Sbjct: 447 ECAIESIALAATPIPK 462
>sp|P80067|CATC_RAT Dipeptidyl peptidase 1 OS=Rattus norvegicus GN=Ctsc PE=1 SV=3
Length = 462
Score = 132 bits (332), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 157/323 (48%), Gaps = 44/323 (13%)
Query: 27 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV-------GQFKHLLGVKPTPKGLL 79
+L SH + +K +N K+ W A ++ ++ G +L KP P
Sbjct: 166 RLYSH--NHNFVKAINSVQKS-WTATTYEEYEKLSIRDLIRRSGHSGRILRPKPAP---- 218
Query: 80 LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 139
+ + + L LP+S+D R+ + +S + +Q CGSC++F ++ L R I
Sbjct: 219 --ITDEIQQQILSLPESWDWRNV-RGINFVSPVRNQESCGSCYSFASLGMLEARIRILTN 275
Query: 140 MNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPG 196
+ + LS ++++C + GCDGG+P + A +Y GVV E C PY +
Sbjct: 276 NSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA----- 328
Query: 197 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 256
P P C+R + +S++Y + + + + E+ K+GP+ V+F V++D
Sbjct: 329 --PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDD 378
Query: 257 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYF 309
F HY SG+Y H + HAV L+G+G G DYWI+ N W WG GYF
Sbjct: 379 FLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYF 438
Query: 310 KIKRGSNECGIEEDVVAGLPSSK 332
+I+RG++EC IE +A +P K
Sbjct: 439 RIRRGTDECAIESIAMAAIPIPK 461
>sp|P97821|CATC_MOUSE Dipeptidyl peptidase 1 OS=Mus musculus GN=Ctsc PE=2 SV=1
Length = 462
Score = 127 bits (320), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 92/314 (29%), Positives = 152/314 (48%), Gaps = 42/314 (13%)
Query: 36 SIIKEVNENPKAGWKAARNPQFSNYTV-------GQFKHLLGVKPTPKGLLLGVPVKTHD 88
+ +K +N K+ W A ++ ++ G + + KP P + +
Sbjct: 173 NFVKAINTVQKS-WTATAYKEYEKMSLRDLIRRSGHSQRIPRPKPAP------MTDEIQQ 225
Query: 89 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 146
+ L LP+S+D R+ + +S + +Q CGSC++F ++ L R I + + LS
Sbjct: 226 QILNLPESWDWRNV-QGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSP 284
Query: 147 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 205
++++C + GCDGG+P + A +Y GVV E C PY P P
Sbjct: 285 QEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS-------PCKPREN 335
Query: 206 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 265
C+R + +S +Y + + + + E+ K+GP+ V+F V++DF HY SG+Y
Sbjct: 336 CLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY 387
Query: 266 KH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNEC 318
H + HAV L+G+G G +YWI+ N W +WG GYF+I+RG++EC
Sbjct: 388 HHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDEC 447
Query: 319 GIEEDVVAGLPSSK 332
IE VA +P K
Sbjct: 448 AIESIAVAAIPIPK 461
>sp|Q26563|CATC_SCHMA Cathepsin C OS=Schistosoma mansoni PE=2 SV=1
Length = 454
Score = 118 bits (296), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 97/309 (31%), Positives = 143/309 (46%), Gaps = 38/309 (12%)
Query: 33 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL- 91
+ S + ++N + K+ W+ P+ S YT+ + ++ G + + KT K L
Sbjct: 154 INPSFVGKINAHQKS-WRGEIYPELSKYTIDELRNRAGGVKSMVTRPSVLNRKTPSKELI 212
Query: 92 ----KLPKSFDARSAWPQC-STISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSL 144
LP FD S S ++ I +QG CGSC+A + AL R + +F L
Sbjct: 213 SLTGNLPLEFDWTSPPDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQPIL 272
Query: 145 SVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 203
S ++ C + +GC+GG+P + A +Y G+ + PY TG T
Sbjct: 273 SPQTVVDCSPY--SEGCNGGFPFLIAGKYGEDFGLPQKIVIPY---TGED---------T 318
Query: 204 PKCVRKCVKKNQLWRNSKHYS-ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 262
KC V KN + YS I Y ++ + + E+ NGP V F VYEDF YK
Sbjct: 319 GKCT---VSKNCTRYYTTDYSYIGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQFYKE 375
Query: 263 GVYKHITGDV---------MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIK 312
G+Y H T + HAV L+G+G GE YW + N W WG GYF+I
Sbjct: 376 GIYHHTTVQTDHYNFNPFELTNHAVLLVGYGVDKLSGEPYWKVKNSWGVEWGEQGYFRIL 435
Query: 313 RGSNECGIE 321
RG++ECG+E
Sbjct: 436 RGTDECGVE 444
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 116 bits (290), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 88/278 (31%), Positives = 125/278 (44%), Gaps = 47/278 (16%)
Query: 56 QFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 114
+FS+ + +F+ LG T L G + + LP++ D W + +S + +
Sbjct: 108 RFSDMSWEEFQATRLGAAQTCSATLAGNHLMR--DAAALPETKD----WREDGIVSPVKN 161
Query: 115 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 174
Q HCGSCW F AL + G N+SLS L+ C G GC+GG P A+ Y
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221
Query: 175 HHGVV-TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 233
++G + TEE PY G H E N+ + + I +
Sbjct: 222 YNGGIDTEESYPYKGVNGVCHYKAE--------------------NAAVQVLDSVNITLN 261
Query: 234 PEDIMAEIYKNG-----PVEVSFTVYEDFAHYKSGVYKHITGDVMG------GHAVKLIG 282
ED + KN PV V+F V + F YKSGVY T D G HAV +G
Sbjct: 262 AEDEL----KNAVGLVRPVSVAFQVIDGFRQYKSGVY---TSDHCGTTPDDVNHAVLAVG 314
Query: 283 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 320
+G ++G YW++ N W WG +GYFK++ G N C I
Sbjct: 315 YGV-ENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAI 351
>sp|P22497|CYSP_THEPA Cysteine proteinase OS=Theileria parva GN=TP03_0285 PE=3 SV=2
Length = 440
Score = 115 bits (289), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 132/287 (45%), Gaps = 49/287 (17%)
Query: 56 QFSNYTVGQFKHLLGVKPTPKG-------LLLGVPVKTHDKSLK----------LPKSFD 98
+FS+ T +F L V PK LL + KT+ K+LK L K
Sbjct: 171 RFSDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDEDVDLAKLTG 230
Query: 99 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCG 158
W + S+++ + DQ +CG CWAF V ++ + HF + LSV +LL C F
Sbjct: 231 ENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCDSF--S 288
Query: 159 DGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLW 217
+GC GG SA+ Y +G+V+ + P+ D + CS P
Sbjct: 289 NGCQGGLLESAYEYVRKYGLVSAKDLPFVDKARRCSVP---------------------- 326
Query: 218 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 277
+K S+ +Y + E +M + P V +V + A YKSGV+ G + HA
Sbjct: 327 -KAKKVSVPSYHVFKGKE-VMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECGKSL-NHA 383
Query: 278 VKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKR---GSNECGI 320
V L+G G + + YW++ N W WG +GY +++R G+++CG+
Sbjct: 384 VVLVGEGYDEVTKKRYWVVQNSWGTDWGENGYMRLERTNMGTDKCGV 430
>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
PE=2 SV=1
Length = 358
Score = 115 bits (287), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 94/307 (30%), Positives = 138/307 (44%), Gaps = 40/307 (13%)
Query: 23 VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLL 80
V ++KL + ++++ + N K +K + N QF++ T +F ++ LG L
Sbjct: 73 VEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLN-QFADLTWQEFQRYKLGAAQNCSATLK 131
Query: 81 GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 140
G T +P + D W + +S + +QGHCGSCW F AL + FG
Sbjct: 132 GSHKITE---ATVPDTKD----WREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGK 184
Query: 141 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCE- 198
+SLS L+ C G GC GG P A+ Y ++G + TEE PY G GC+
Sbjct: 185 GISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG----GCKF 240
Query: 199 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 258
A VR V + +++ R PV V+F V +F
Sbjct: 241 SAKNIGVQVRDSVNITLGAEDELKHAVGLVR----------------PVSVAFEVVHEFR 284
Query: 259 HYKSGVYKHIT-----GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 313
YK GV+ T DV HAV +G+G DD YW++ N W WG +GYFK++
Sbjct: 285 FYKKGVFTSNTCGNTPMDV--NHAVLAVGYGVEDD-VPYWLIKNSWGGEWGDNGYFKMEM 341
Query: 314 GSNECGI 320
G N CG+
Sbjct: 342 GKNMCGV 348
>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
Length = 333
Score = 114 bits (284), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 81/235 (34%), Positives = 110/235 (46%), Gaps = 35/235 (14%)
Query: 94 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 153
P S D R + + +S + +QG CGSCW F AL I G LSL+ L+ C
Sbjct: 115 PSSMDWRK---KGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCA 171
Query: 154 GFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPYF--DSTGCSHPGCEPAYPTPKCVRKC 210
GC GG P A+ Y +++ G++ E+ PY DS+ +P A+ V+
Sbjct: 172 QAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYIGKDSSCRFNPQKAVAF-----VKNV 226
Query: 211 VKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYK--- 266
V I + E M E + PV +F V EDF YKSGVY
Sbjct: 227 V-----------------NITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKS 269
Query: 267 -HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 320
H T D + HAV +G+G +G YWI+ N W WG +GYF I+RG N CG+
Sbjct: 270 CHKTPDKV-NHAVLAVGYG-EQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGL 322
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 113 bits (283), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 94/310 (30%), Positives = 134/310 (43%), Gaps = 47/310 (15%)
Query: 16 QTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTP 75
Q FAEG VS KL + D + E + NYT+ K L +
Sbjct: 95 QRFAEGKVS-FKLAVNKYADLLHHEFRQLMNG----------FNYTL--HKQLRAADESF 141
Query: 76 KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 135
KG+ P + LPKS D W ++ + DQGHCGSCWAF + AL +
Sbjct: 142 KGVTFISPAH-----VTLPKSVD----WRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHF 192
Query: 136 IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHP 195
G+ +SLS +L+ C +GC+GG +A+RY +G +
Sbjct: 193 RKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGID--------------- 237
Query: 196 GCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTV 253
E +YP C K + + ++ I E MAE + GPV V+
Sbjct: 238 -TEKSYPYEAIDDSCHFNKGTVGATDRGFT----DIPQGDEKKMAEAVATVGPVSVAIDA 292
Query: 254 -YEDFAHYKSGVYKHITGDVMG-GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 311
+E F Y GVY D H V ++G+GT + GEDYW++ N W +WG G+ K+
Sbjct: 293 SHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKM 352
Query: 312 KRG-SNECGI 320
R N+CGI
Sbjct: 353 LRNKENQCGI 362
>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
Length = 333
Score = 113 bits (283), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 106/233 (45%), Gaps = 31/233 (13%)
Query: 94 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 153
P S D R + + +S + +QG CGSCW F AL I G ++L+ L+ C
Sbjct: 115 PSSMDWRK---KGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCA 171
Query: 154 GFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 212
GC GG P A+ Y +++ G++ E+ PY G E A K V
Sbjct: 172 QNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQCKFNPEKAVAFVKNV----- 226
Query: 213 KNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYK----H 267
I + E M E + PV +F V EDF YKSGVY H
Sbjct: 227 ---------------VNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCH 271
Query: 268 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 320
T D + HAV +G+G +G YWI+ N W +WG +GYF I+RG N CG+
Sbjct: 272 KTPDKV-NHAVLAVGYG-EQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGL 322
>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
Length = 358
Score = 110 bits (276), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/304 (31%), Positives = 139/304 (45%), Gaps = 34/304 (11%)
Query: 23 VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLL 80
V ++KL I ++++ + N K +K N QF++ T +F+ LG L
Sbjct: 73 VEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVN-QFADLTWQEFQRTKLGAAQNCSATLK 131
Query: 81 GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 140
G T LP++ D W + +S + DQG CGSCW F AL + FG
Sbjct: 132 GSHKVTE---AALPETKD----WREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGK 184
Query: 141 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCEP 199
+SLS L+ C G GC+GG P A+ Y +G + TE+ PY TG
Sbjct: 185 GISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPY---TGKDE----- 236
Query: 200 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 259
T K + V L NS + ++ A +++ + PV ++F V F
Sbjct: 237 ---TCKFSAENVGVQVL--NSVNITLGA------EDELKHAVGLVRPVSIAFEVIHSFRL 285
Query: 260 YKSGVY--KHITGDVMG-GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 316
YKSGVY H M HAV +G+G +DG YW++ N W WG GYFK++ G N
Sbjct: 286 YKSGVYTDSHCGSTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDKGYFKMEMGKN 344
Query: 317 ECGI 320
CGI
Sbjct: 345 MCGI 348
>sp|Q24940|CATLL_FASHE Cathepsin L-like proteinase OS=Fasciola hepatica GN=Cat-1 PE=1 SV=1
Length = 326
Score = 110 bits (274), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 126/279 (45%), Gaps = 34/279 (12%)
Query: 56 QFSNYTVGQFK--HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 113
QF++ T +FK +L + L GVP + +++++ P D W + ++ +
Sbjct: 71 QFTDMTFEEFKAKYLTEMSRASDILSHGVPYEANNRAV--PDKID----WRESGYVTEVK 124
Query: 114 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 173
DQG+CGSCWAF + ++ + ++S S L+ C G +GC GG +A++Y
Sbjct: 125 DQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL 184
Query: 174 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 233
G+ TE PY G +C QL Y ++S
Sbjct: 185 KQFGLETESSYPYTAVEG-----------------QCRYNKQL---GVAKVTGYYTVHSG 224
Query: 234 PE-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GDVMGGHAVKLIGWGTSDDGED 291
E ++ + P V+ V DF Y+SG+Y+ T + HAV +G+GT G D
Sbjct: 225 SEVELKNLVGARRPAAVAVDVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGT-QGGTD 283
Query: 292 YWILANQWNRSWGADGYFKIKRG-SNECGIEEDVVAGLP 329
YWI+ N W WG GY ++ R N CGI +A LP
Sbjct: 284 YWIVKNSWGTYWGERGYIRMARNRGNMCGIAS--LASLP 320
>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
Length = 360
Score = 109 bits (273), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 79/244 (32%), Positives = 112/244 (45%), Gaps = 36/244 (14%)
Query: 85 KTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSL 144
+ ++ LP++ D W + +S + +QGHCGSCW F AL + G +SL
Sbjct: 135 RMRAAAVALPETKD----WREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISL 190
Query: 145 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCEPAYPT 203
S L+ C GC+GG P A+ Y ++G + TEE PY G
Sbjct: 191 SEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGI----------- 239
Query: 204 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAHYKS 262
C KN+ N + + I ED + + + PV V+F V F YKS
Sbjct: 240 ------CKFKNE---NVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKS 290
Query: 263 GVYKHITGDVMG------GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 316
GVY T D G HAV +G+G +DG YW++ N W WG +GYFK++ G N
Sbjct: 291 GVY---TSDHCGTTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDEGYFKMEMGKN 346
Query: 317 ECGI 320
CG+
Sbjct: 347 MCGV 350
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 108 bits (270), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 96/337 (28%), Positives = 152/337 (45%), Gaps = 42/337 (12%)
Query: 19 AEGVVSKLKLDSHILQDSI--IKEVNENPKAGWKAARNPQFSNYTVGQFKHL-LGVKPTP 75
+ G++++ +I +D++ I NEN K F+N T +++ L LG + P
Sbjct: 18 SNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEP 77
Query: 76 KGLLLGVPVKTHDKSLKLPKSFDARSA-----WPQCSTISRILDQGHCGSCWAFGAVEAL 130
+ K + ++K + + W Q ++ I DQG CGSCWAF A+
Sbjct: 78 VRRI----TKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAV 133
Query: 131 SDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD-PYFDS 189
I G +SLS +L+ C GC+GG A+++ + +G + E D PY +
Sbjct: 134 EGINKIVTGELVSLSEQELVDCDKSY-NQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGT 192
Query: 190 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR-INSDPEDIMAEIYKNGPVE 248
G KC N L +NS+ +I Y + S E + PV
Sbjct: 193 NG-------------KC-------NSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVS 232
Query: 249 VSFTV-YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 307
V+ F HY+SG++ G M HAV +G+G S++G DYWI+ N W WG DG
Sbjct: 233 VAIDAGGRAFQHYQSGIFTGKCGTNM-DHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDG 290
Query: 308 YFKIKRG----SNECGIEEDVVAGLPSSKNLVKEITS 340
Y +++R S +CGI + + S N V+ +S
Sbjct: 291 YIRMERNVASKSGKCGIAIEASYPVKYSPNPVRGTSS 327
>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus GN=VCATH
PE=3 SV=1
Length = 337
Score = 108 bits (269), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 75/249 (30%), Positives = 111/249 (44%), Gaps = 34/249 (13%)
Query: 80 LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 139
L P HD+ LP++FD W + ++ + DQG CGSCWA AV L + I
Sbjct: 116 LDAPPDVHDE---LPQNFD----WRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHN 168
Query: 140 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD-PYFDSTG-CSHPGC 197
++LS L+ C CDGG +A+ ++ G + EE D PY + G C
Sbjct: 169 YLINLSEQQLIDCDS--ANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTKGVCKIDNK 226
Query: 198 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 257
+ A C R I + E++ E+ GP+ ++
Sbjct: 227 KFALSVSSCKR--------------------YIFQNEENLKKELITMGPIAMAIDA-ASI 265
Query: 258 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 317
+ Y G+ H ++ HAV L+G+GT + G YW L N W WG DGYF++KR N
Sbjct: 266 STYSKGII-HFCENLGLNHAVLLVGYGT-EGGVSYWTLKNSWGSDWGEDGYFRVKRNINA 323
Query: 318 CGIEEDVVA 326
CG+ + A
Sbjct: 324 CGLNNQLAA 332
>sp|Q9PYY5|CATV_GVXN Viral cathepsin OS=Xestia c-nigrum granulosis virus GN=VCATH PE=3
SV=1
Length = 346
Score = 107 bits (267), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/288 (30%), Positives = 133/288 (46%), Gaps = 35/288 (12%)
Query: 33 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 92
L+DS + E+N + N T + + G K K V + D S K
Sbjct: 80 LEDSAMFEINSRADI----SSNELLQKLTGLKLSLMRGEK---KNSFCTPTVISGDSSGK 132
Query: 93 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC 152
+P SFD W ++++ + Q CGSCWAF AV + + I ++L LS L+ C
Sbjct: 133 VPDSFD----WRDRNSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVSLDLSEQQLVDC 188
Query: 153 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 212
+GC+GG + +W + G++ + G S+ E YP C
Sbjct: 189 DK--VNNGCNGG--LMSWAF---EGIIR--------AGGISY---EAPYPYTGVDGVCKN 230
Query: 213 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 272
+ + S Y AY + S+ + + +++ GPV V+ V D +YKSGV KH + D
Sbjct: 231 TTRYVQLSGCY---AYDLRSEKK-LRQVLHEKGPVSVAIDVV-DLTNYKSGVAKHCSVDH 285
Query: 273 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 320
H V L+G+G +D + YW L N W WG G+F+IKR N CGI
Sbjct: 286 GLNHGVLLVGYGQENDVK-YWTLKNSWGSDWGEQGFFRIKRDVNSCGI 332
>sp|Q9R1T3|CATZ_RAT Cathepsin Z OS=Rattus norvegicus GN=Ctsz PE=1 SV=2
Length = 306
Score = 105 bits (263), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 81/280 (28%), Positives = 120/280 (42%), Gaps = 39/280 (13%)
Query: 68 LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ---GHCGSCWAF 124
LLG + P+ P LPK++D R+ + S +Q +CGSCWA
Sbjct: 46 LLGRRTYPRPHEYLSPAD-------LPKNWDWRNV-NGVNYASVTRNQHIPQYCGSCWAH 97
Query: 125 GAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 181
G+ AL+DR I + LSV +++ C C+GG + W Y HG+ E
Sbjct: 98 GSTSALADRINIKRKGAWPSTLLSVQNVIDCGN---AGSCEGGNDLPVWEYAHKHGIPDE 154
Query: 182 ECDPYFDSTGCSHPGCEPAYPTPKCV--RKC--VKKNQLWRNSKHYSISAYRINSDPEDI 237
C+ Y C+ C ++C ++ LWR + S+S E +
Sbjct: 155 TCNNY----QAKDQECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGR------EKM 204
Query: 238 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 297
MAEIY NGP+ E ++Y G+Y + H + + GWG S+DG +YWI+ N
Sbjct: 205 MAEIYANGPISCGIMATERMSNYTGGIYTEYQNQAIINHIISVAGWGVSNDGIEYWIVRN 264
Query: 298 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 337
W WG G+ +I + G SS NL E
Sbjct: 265 SWGEPWGERGWMRI--------VTSTYKGGTGSSYNLAIE 296
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.137 0.454
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 147,454,722
Number of Sequences: 539616
Number of extensions: 6727757
Number of successful extensions: 13000
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 210
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 12326
Number of HSP's gapped (non-prelim): 280
length of query: 349
length of database: 191,569,459
effective HSP length: 118
effective length of query: 231
effective length of database: 127,894,771
effective search space: 29543692101
effective search space used: 29543692101
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (28.5 bits)