BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022267
(300 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
Length = 339
Score = 234 bits (596), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 176/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL LG S+ H L D ++ VN+ W+A N F N V
Sbjct: 10 CLLALGDARSRP-----------SFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
Length = 339
Score = 232 bits (592), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 168/282 (59%), Gaps = 28/282 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 92
H L D ++ VN+ W+A N F N V K L G P P ++
Sbjct: 24 HPLSDELVNYVNKR-NTTWQAGHN--FYNVDVSYLKKLCGTFLGGPKPPQRVM------F 74
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
+ LKLP+SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CIH ++S+ V+
Sbjct: 75 TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134
Query: 153 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCS 198
DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY S
Sbjct: 135 AEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGS 194
Query: 199 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P C TPKC + C + ++ KHY ++Y +++ DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFS 254
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 255 VYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 295
>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
Length = 339
Score = 231 bits (590), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 131/307 (42%), Positives = 175/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H L D ++ VN+ W+A N F N +
Sbjct: 10 CLLVLANARSRP-----------SFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
Length = 339
Score = 227 bits (579), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 127/287 (44%), Positives = 166/287 (57%), Gaps = 30/287 (10%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 91
K SH L D +I +N+ W+A RN F N + K L G +LG P
Sbjct: 20 KPSSHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGT-------VLGGPNLP 69
Query: 92 H----DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 145
+ + LP+SFDAR W C TI++I DQG CGSCWAFGAVEA+SDR CIH +
Sbjct: 70 ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 193
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 194 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
S P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249
Query: 253 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW+
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWL 295
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
SV=1
Length = 340
Score = 219 bits (558), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 127/310 (40%), Positives = 176/310 (56%), Gaps = 23/310 (7%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
LT+ L I +I TF E +S L D II +NE+P AGW+A ++ +F +
Sbjct: 1 MLTSILCIASLI---TFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLD 57
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
+ + + + P P H D ++++P +FD+R WP C +I+ I DQ CGS
Sbjct: 58 DARIQ-MGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGS 116
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
CW+FGAVEA+SDR CI G N+ LS DLL CC CG GC+GG AW Y+V G+
Sbjct: 117 CWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGI 175
Query: 184 VTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 229
VT C+PY T +P C Y TP+C + C +K + + KH
Sbjct: 176 VTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRG 235
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
S+Y + +D + I EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG
Sbjct: 236 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWG 295
Query: 290 TSDDGEDYWV 299
++ YW+
Sbjct: 296 V-ENKTPYWL 304
>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
Length = 335
Score = 219 bits (557), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 124/291 (42%), Positives = 163/291 (56%), Gaps = 32/291 (10%)
Query: 29 SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 88
++ L L D ++ +N+ W A N F N + K L G LG P
Sbjct: 17 ARESLHFQPLSDELVNFINKQ-NTTWTAGHN--FYNVDLSYVKKLCGT-------FLGGP 66
Query: 89 VKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
+ LPKSFDAR WP C TI I DQG CGSCWAFGAVEA+SDR CI
Sbjct: 67 KLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSN 126
Query: 145 --MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 195
+N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY
Sbjct: 127 GRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIP 185
Query: 196 GCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
C H P C TPKC + C ++ KH+ S+Y I+ + ++IMAEIYK
Sbjct: 186 PCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYK 245
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPVE +FTVY DF YKSGVY+H+TGD+MGGHA++++GWG ++G YW+
Sbjct: 246 NGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPYWL 295
>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
Length = 339
Score = 218 bits (556), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 121/285 (42%), Positives = 164/285 (57%), Gaps = 34/285 (11%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 91
H L D +I +N+ W+A RN F N + K L LG P + G
Sbjct: 24 HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ + LP++FDAR W C TI +I DQG CGSCWAFGAVEA+SDR CIH +N+ +
Sbjct: 76 --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------ 197
S DLL CCG CGDGC+GGYP AW ++ G+V+ Y GC
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGV--YNSHVGCLPYTIPPCEHHV 191
Query: 198 --SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
S P C TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKNGPVE
Sbjct: 192 NGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEG 251
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+FTV+ DF YKSGVYKH GD+MGGHA++++GWG ++G YW+
Sbjct: 252 AFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWL 295
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
GN=CATB PE=2 SV=1
Length = 342
Score = 213 bits (543), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 118/302 (39%), Positives = 173/302 (57%), Gaps = 23/302 (7%)
Query: 17 VISSQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 75
++S TF E V ++ L D +I +NE+P AGWKA ++ +F +++ + L+G
Sbjct: 8 IVSLFTFLEAHVTTRNNQRIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMG 65
Query: 76 VKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
+ + V HD ++++P FD+R WP C +IS+I DQ CGSCWAFGAVE
Sbjct: 66 ARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVE 125
Query: 134 ALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------ 185
A++DR CI G + LS DL++CC CGDGC GG+P AW Y+V G+VT
Sbjct: 126 AMTDRICIQSGGGQSAELSALDLISCCK-DCGDGCQGGFPGVAWDYWVKRGIVTGGSKEN 184
Query: 186 -EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINS 237
C PY T +P C Y TP+C + C K + + KHY +Y + +
Sbjct: 185 HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQN 244
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
+ + I +I GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + Y
Sbjct: 245 NEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPY 303
Query: 298 WV 299
W+
Sbjct: 304 WL 305
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
GN=cpr-6 PE=1 SV=1
Length = 379
Score = 212 bits (539), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 132/332 (39%), Positives = 177/332 (53%), Gaps = 51/332 (15%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKAARN 59
L +C+++ + E V+ K + +DS + D +I VNEN W A +
Sbjct: 4 LLFLSCIVVAAYCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTAKKQ 62
Query: 60 PQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSA 107
+FS+ + G K L+GV KT D L +P+SFD+R
Sbjct: 63 RRFSS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDN 114
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 165
WP+C +I I DQ CGSCWAFGAVEA+SDR CI H + ++LS +DLL+CC CG G
Sbjct: 115 WPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCK-SCGFG 173
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTP 209
C+GG P++AWRY+V G+VT Y + GC P CE YPTP
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTP 231
Query: 210 KCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
KC +KCV ++ + K + SAY + D E I E+ +GP+E++F VYEDF +Y
Sbjct: 232 KCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDG 291
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVY H G + GGHAVKLIGWG DDG YW
Sbjct: 292 GVYVHTGGKLGGGHAVKLIGWGI-DDGIPYWT 322
>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
Length = 340
Score = 212 bits (539), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 120/284 (42%), Positives = 162/284 (57%), Gaps = 35/284 (12%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
L ++ +N+ G +A N F N + K L G LG P
Sbjct: 26 LSSDLVNHINKLNTTG-RAGHN--FHNTDMSYVKKLCGT-------FLGGPKAPERVDFA 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
+ + LP +FD R WP C TIS I DQG CGSCWAFGAVEA+SDR C+H +S+ V+
Sbjct: 76 EDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSA 135
Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCE----- 203
DLL+CCGF CG GC+GGYP AWRY+ G+V+ Y GC + P CE
Sbjct: 136 EDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGL--YDSHVGCRAYTIPPCEHHVNG 193
Query: 204 -------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
TP+C R C + ++ KHY I++Y + ++IMAEIYKNGPVE +
Sbjct: 194 SRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGA 253
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VYEDF YKSGVY+H++G+ +GGHA++++GWG ++G YW+
Sbjct: 254 FIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWL 296
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
GN=cpr-5 PE=2 SV=1
Length = 344
Score = 203 bits (517), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/226 (47%), Positives = 134/226 (59%), Gaps = 22/226 (9%)
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
S +P FDAR WP C +I+ I DQ CGSCWAF A EA+SDR CI + +N LS
Sbjct: 79 SDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSE 138
Query: 153 DLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGC 197
DLL+CC F CG+GC+GGYPI AW+++V HG+VT C PY + G
Sbjct: 139 DLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGV 198
Query: 198 SHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
P C E PTPKCV C KN + KH+ +AY + E I EI NGP+E
Sbjct: 199 KWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIE 258
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V+FTVYEDF Y +GVY H G +GGHAVK++GWG D+G YW+
Sbjct: 259 VAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWL 303
>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
Length = 311
Score = 196 bits (499), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 112/254 (44%), Positives = 147/254 (57%), Gaps = 24/254 (9%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCS 112
W + QF N VGQ LLG K +P L +K++D +++P SF+A++ WP C+
Sbjct: 39 WVEEQTDQFDNIKVGQ---LLGFKRSPNRPKL--QIKSYDPLGVQIPTSFNAQTNWPNCT 93
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPI 172
TIS+I +Q CGSCWAFGA E+ +DR CIH N+ LS D++ C +GC+GG
Sbjct: 94 TISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDE--TDNGCEGGDAF 151
Query: 173 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRN 224
SAW + G V+EEC PY + P C PA TP C ++C + L +
Sbjct: 152 SAWNWLRKQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQ 205
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
KH Y +SD E IM EI NGPVE FTV+EDF YKSGVY H TG +GGH VK
Sbjct: 206 DKHKMAKIYSFDSD-EAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVK 264
Query: 285 LIGWGTSDDGEDYW 298
L+G+GT +G DY+
Sbjct: 265 LVGFGTL-NGVDYY 277
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
PE=1 SV=2
Length = 329
Score = 194 bits (492), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 100/211 (47%), Positives = 129/211 (61%), Gaps = 12/211 (5%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
+P +FD+R+ W +C +I I DQ CGSCWAFGA E +SDR CI +S +DLL
Sbjct: 85 VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG+GC+GGYPI A R++ GVVT C PY + C+ C P TP
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTP 202
Query: 210 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C C + + KH+ +SAY + + I AEIY NGPVE +F+VYEDF YKSG
Sbjct: 203 SCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSG 262
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYKH G +GGHA+K+IGWGT + G YW+
Sbjct: 263 VYKHTAGKYLGGHAIKIIGWGT-ESGSPYWL 292
>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
Length = 335
Score = 188 bits (478), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 173/312 (55%), Gaps = 39/312 (12%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L +CLL+L S + L D ++ VN+ WKA N F N
Sbjct: 5 LATLSCLLVLTSARSSLYFP-----------PLSDELVNFVNKQ-NTTWKAGHN--FYNV 50
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDK---SLKLPKSFDARSAWPQCSTISRILDQGH 122
+ K L G +L G + D + LP+SFDAR WP C TI I DQG
Sbjct: 51 DLSYVKKLCGA------ILGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGS 104
Query: 123 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDGC+GG+P AW ++
Sbjct: 105 CGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTK 164
Query: 181 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 227
G+V+ C PY S P C TPKC + C + ++ KH
Sbjct: 165 KGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKH 224
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+H++G++MGGHA++++G
Sbjct: 225 FGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILG 284
Query: 288 WGTSDDGEDYWV 299
WG ++G YW+
Sbjct: 285 WGV-ENGTPYWL 295
>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
GN=cpr-3 PE=2 SV=1
Length = 370
Score = 187 bits (475), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 101/214 (47%), Positives = 126/214 (58%), Gaps = 16/214 (7%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
LP +FDAR WP C+TI I +Q CGSCWAFGA E +SDR CI +SV D+L
Sbjct: 92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG GC GGY I A R++ G VT C PY S C P TP
Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTP 208
Query: 210 KCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHY 265
C C K + ++ KHY SAY++ + +I EIY GPVE S+ VYEDF HY
Sbjct: 209 SCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHY 268
Query: 266 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
KSGVY + +G ++GGHAVK+IGWG ++G DYW+
Sbjct: 269 KSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWL 301
>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
GN=cpr-4 PE=2 SV=1
Length = 335
Score = 187 bits (474), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 120/280 (42%), Positives = 158/280 (56%), Gaps = 25/280 (8%)
Query: 39 QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-KSLK 97
Q++I + VN ++ WKA P+ + T+ Q K L V V HD
Sbjct: 25 QEAITEYVNSK-QSLWKA-EIPK--DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINEDT 80
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P +FDAR+ WP C +I+ I DQ CGSCWAF A EA SDRFCI + +N LS D+L
Sbjct: 81 IPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTG-CSHPGC 202
+CC CG GC+GGYPI+AW+Y V G T C PY ++ G + P C
Sbjct: 141 SCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSC 199
Query: 203 -EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
+ Y TP CV KC KN + KH+ +AY + I AEI +GPVE +FTVY
Sbjct: 200 PDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVY 259
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EDF YK+GVY H TG +GGHA++++GWGT D+G YW+
Sbjct: 260 EDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGTPYWL 298
>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
GN=AC-2 PE=2 SV=1
Length = 342
Score = 185 bits (470), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 108/284 (38%), Positives = 153/284 (53%), Gaps = 36/284 (12%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
L +++ + + EVN +P P F + ++ +K + L L V +
Sbjct: 38 LVAYLRRSQNLFEVNSDP--------TPDFE-------QKIMSIKYKHQKLNLMVK-EDP 81
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D + +P S+D R W C+T I DQ +CGSCWA A+SDR CI +++S
Sbjct: 82 DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C H G
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199
Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C PTP C RKC +++R K Y AY + + I +EI KNGPV S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVAS 259
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWL 302
>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
GN=AC-1 PE=2 SV=1
Length = 342
Score = 184 bits (467), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 96/224 (42%), Positives = 130/224 (58%), Gaps = 20/224 (8%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D + +P S+D R W C+T I DQ +CGSCWA A+SDR CI +++S
Sbjct: 82 DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C H G
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199
Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C PTP C RKC +++R K Y AY + + I +EI +NGPV S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVAS 259
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWL 302
>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
GN=CP-1 PE=3 SV=3
Length = 341
Score = 169 bits (429), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/218 (42%), Positives = 127/218 (58%), Gaps = 19/218 (8%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P+S+D R W CS++ I DQ +CGSCWA + A+SDR CI + +S D++
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCEPAY-- 206
+CC + CGDGC+GG+PISA+R+ GVVT C PY + C H G E Y
Sbjct: 151 SCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGE 208
Query: 207 -----PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
TP+C R+C+ S Y AY++ + + I +I KNGPV ++TVYED
Sbjct: 209 CVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYED 268
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
FAHY+SG+YKH G G HAVK+IGWG + G YW+
Sbjct: 269 FAHYRSGIYKHKAGRKTGLHAVKVIGWG-EEKGTPYWI 305
>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
Length = 299
Score = 140 bits (354), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 95/255 (37%), Positives = 128/255 (50%), Gaps = 24/255 (9%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K VP T + + P SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQAPDSFDFREEY 84
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V ++ DR C G++ + S +++C GD
Sbjct: 85 PHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDM 138
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
CDGG+ S WR+ G T+EC PY G A T C KC + L
Sbjct: 139 ACDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHL 189
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
K Y + D IM + GP++ +FTVY DF +Y+SGVY+H G V GGHAV
Sbjct: 190 YKATKAVDYGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVD 247
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT DDG DYW+
Sbjct: 248 MVGYGTDDDGVDYWI 262
>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
Length = 300
Score = 131 bits (330), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 86/255 (33%), Positives = 128/255 (50%), Gaps = 23/255 (9%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K P T +P+SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGAAPRGTFTDKDDVPESFDFREEY 85
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V DR C+ G++ + S +++C GD
Sbjct: 86 PHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSCDH---GDM 139
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
C+GG+ + W++ G T+EC PY + C PT KC +
Sbjct: 140 ACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHL 190
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
+ S Y + D +M + +GP++V+F V+ DF +Y+SGVY+H G + GGHAV+
Sbjct: 191 ATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVE 248
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT DDG DYW+
Sbjct: 249 MVGYGTDDDGVDYWI 263
>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
SV=1
Length = 476
Score = 127 bits (319), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 135/289 (46%), Gaps = 36/289 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
++Q +I+ VN+ GW A QF T+ + FK+ LG P P LLL + T
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLT 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
K+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 KTTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384
Query: 263 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
+YK+G+Y+HIT HAVKL GWGT E +W+
Sbjct: 385 FNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWI 433
>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
SV=3
Length = 476
Score = 125 bits (315), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 133/288 (46%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVRSELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V EDF
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385
Query: 264 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWI 433
>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
Length = 303
Score = 125 bits (314), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 87/255 (34%), Positives = 126/255 (49%), Gaps = 27/255 (10%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 108
WKA +F N T +F+ +L ++P G L + + + + +P FD R +
Sbjct: 31 WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 165
PQC + LDQG CGSCWAF A+ DR C G++ +S S L++C L G
Sbjct: 90 PQC--VKPALDQGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 225
CDGG W + G T EC Y D G A P P QL++
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 284
+ +S S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALE 252
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT+DDG DYW+
Sbjct: 253 IVGYGTTDDGTDYWI 267
>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
GN=Tinagl1 PE=1 SV=1
Length = 466
Score = 113 bits (283), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 131/292 (44%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 140 LVDPDMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQ 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ A PTP+C+
Sbjct: 257 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQ------NEASPTPRCM 309
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + Q+ N + AYR+ SD ++IM E+ +NGPV+ V+
Sbjct: 310 MHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVH 369
Query: 260 EDFAHYKSGVYKHI--------TGDVMGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+ G+Y H G H+VK+ GWG T DG YW
Sbjct: 370 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 421
>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
GN=Tinagl1 PE=2 SV=1
Length = 467
Score = 112 bits (280), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 133/292 (45%), Gaps = 38/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ ++IK +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 140 LVDPAMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQ 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ + A PTP+C+
Sbjct: 257 NLLSCDTHH-QKGCRGGRLDGAWWFLRRRGVVSDNCYPF-----SGREQNDEASPTPRCM 310
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + +Q+ N + YR+ SD ++IM E+ +NGPV+ V+
Sbjct: 311 MHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQALMEVH 370
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+ G+Y H G H+VK+ GWG T DG YW
Sbjct: 371 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 422
>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
GN=TINAGL1 PE=1 SV=1
Length = 467
Score = 110 bits (274), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 132/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316
Query: 209 PKCVRKCVK--KNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF Y
Sbjct: 317 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 376
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
K G+Y H + G H+VK+ GWG T DG YW
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>sp|Q06544|CYSP3_OSTOS Cathepsin B-like cysteine proteinase 3 (Fragment) OS=Ostertagia
ostertagi GN=CP-3 PE=3 SV=1
Length = 174
Score = 108 bits (271), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 63/141 (44%), Positives = 81/141 (57%), Gaps = 17/141 (12%)
Query: 174 AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 219
AW+YF GVVT C PY + C G EP Y TPKC + C +
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59
Query: 220 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 278
+ ++ KH+ SAYR+ ++ + I +I KNGPV F VYEDFAHYKSG+YKH G +
Sbjct: 60 LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119
Query: 279 GGHAVKLIGWGTSDDGEDYWV 299
GGHAVK+IGWG + G YW+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWL 139
>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
SV=1
Length = 435
Score = 104 bits (259), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 76/269 (28%), Positives = 130/269 (48%), Gaps = 30/269 (11%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 100
+K +N K+ W A R ++ T+ +G + P+ + + H++ +LP
Sbjct: 148 EFVKAINTIQKS-WTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPT 206
Query: 101 SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACC 158
S+D R+ + +S + +Q CGSC+AF + L R I + LS ++++C
Sbjct: 207 SWDWRNV-RGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCS 265
Query: 159 GFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
+ GC+GG+P + A +Y G+V E C PY G P C+P C R
Sbjct: 266 QY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR---- 311
Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH----- 272
+ +S++Y + + + + E+ ++GP+ V+F VY+DF HY+ G+Y H
Sbjct: 312 ----YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRD 367
Query: 273 -ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
+ HAV L+G+GT S G DYW+
Sbjct: 368 PFNPFELTNHAVLLVGYGTDSASGMDYWI 396
>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
elegans GN=F26E4.3 PE=1 SV=3
Length = 452
Score = 103 bits (257), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 74/221 (33%), Positives = 107/221 (48%), Gaps = 18/221 (8%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 151
K +LP+ FDAR W I + DQG CGS W+ SDR I +N +LS
Sbjct: 180 KPRELPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSS 237
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
LL+C GC+GGY AW Y GVV + C PY S PG
Sbjct: 238 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 295
Query: 212 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
R+ ++ ++S + ++ Y+++S EDI E+ NGPV+ +F V+EDF Y GVY
Sbjct: 296 NRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 355
Query: 271 KH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWVC 300
+H + G H+V+++GWG ++ YW+C
Sbjct: 356 QHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLC 396
>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
Length = 463
Score = 96.3 bits (238), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/279 (27%), Positives = 132/279 (47%), Gaps = 47/279 (16%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQF--------KHLLGVKPTPKGLLLGVPVKTH 92
+ +K +N K+ W A ++ T+G + + KPTP + +
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPTP------LTAEIQ 225
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LS 150
K L LP S+D R+ + +S + +Q CGSC++F +V L R I + + LS
Sbjct: 226 QKILHLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILS 284
Query: 151 VNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 209
++++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 285 SQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--------- 330
Query: 210 KCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
C K +R +S+++ + + + + E+ +GP+ V+F VY+DF HY++
Sbjct: 331 -----CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQN 385
Query: 268 GVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
G+Y H + HAV L+G+GT S G DYW+
Sbjct: 386 GIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWI 424
>sp|Q5RB02|CATC_PONAB Dipeptidyl peptidase 1 OS=Pongo abelii GN=CTSC PE=2 SV=1
Length = 463
Score = 94.7 bits (234), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 129/275 (46%), Gaps = 39/275 (14%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 96
+ +K +N K+ W A ++ T+G G P PK L ++ K L
Sbjct: 173 NFVKAINAIQKS-WTATTYKEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKVL 229
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP S+D R+ + +S + +Q CGSC++F ++ L R I + + LS ++
Sbjct: 230 HLPTSWDWRNI-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEV 288
Query: 155 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330
Query: 214 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
C K +R +S+++ + + + + E+ +GP+ V+F VY+DF HYK G+Y
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389
Query: 272 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
H + HAV L+G+GT S G DYW+
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWI 424
>sp|P53634|CATC_HUMAN Dipeptidyl peptidase 1 OS=Homo sapiens GN=CTSC PE=1 SV=2
Length = 463
Score = 94.4 bits (233), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 129/275 (46%), Gaps = 39/275 (14%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 96
+ +K +N K+ W A ++ T+G G P PK L ++ K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP S+D R+ + +S + +Q CGSC++F ++ L R I + + LS ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288
Query: 155 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330
Query: 214 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
C K +R +S+++ + + + + E+ +GP+ V+F VY+DF HYK G+Y
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389
Query: 272 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
H + HAV L+G+GT S G DYW+
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWI 424
>sp|Q3ZCJ8|CATC_BOVIN Dipeptidyl peptidase 1 OS=Bos taurus GN=CTSC PE=2 SV=1
Length = 463
Score = 92.0 bits (227), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 76/278 (27%), Positives = 129/278 (46%), Gaps = 47/278 (16%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTV--------GQFKHLLGVKPTPKGLLLGVPVKTHD 93
+K +N K+ W AA ++ T+ G + + KP P + +
Sbjct: 174 FVKAINAIQKS-WTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPAP------ITAEIQK 226
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 151
K L LP S+D R+ + ++ + +QG CGSC++F ++ + R I + LS
Sbjct: 227 KILHLPTSWDWRNV-HGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSP 285
Query: 152 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 210
++++C + GC+GG+P + A +Y G+V E+C PY TG P
Sbjct: 286 QEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP---------- 330
Query: 211 CVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C K +R +S+++ + + + + E+ GP+ V+F VY+DF HY+ G
Sbjct: 331 ----CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKG 386
Query: 269 VYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
VY H + HAV L+G+GT + G DYW+
Sbjct: 387 VYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWI 424
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 90.5 bits (223), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 119/285 (41%), Gaps = 46/285 (16%)
Query: 20 SQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 79
+Q FAEG VS KL + D + E + NYT+ K L +
Sbjct: 94 NQRFAEGKVS-FKLAVNKYADLLHHEFRQLMNG----------FNYTL--HKQLRAADES 140
Query: 80 PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139
KG+ P + LPKS D W ++ + DQGHCGSCWAF + AL +
Sbjct: 141 FKGVTFISPA-----HVTLPKSVD----WRTKGAVTAVKDQGHCGSCWAFSSTGALEGQH 191
Query: 140 CIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 199
G+ +SLS +L+ C +GC+GG +A+RY +G +
Sbjct: 192 FRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGID-------------- 237
Query: 200 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFT 257
E +YP C K + + ++ I E MAE + GPV V+
Sbjct: 238 --TEKSYPYEAIDDSCHFNKGTVGATDRGFT----DIPQGDEKKMAEAVATVGPVSVAID 291
Query: 258 V-YEDFAHYKSGVYKHITGDVMG-GHAVKLIGWGTSDDGEDYWVC 300
+E F Y GVY D H V ++G+GT + GEDYW+
Sbjct: 292 ASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLV 336
>sp|Q9R1T3|CATZ_RAT Cathepsin Z OS=Rattus norvegicus GN=Ctsz PE=1 SV=2
Length = 306
Score = 90.1 bits (222), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 97/212 (45%), Gaps = 24/212 (11%)
Query: 98 LPKSFDARSAWPQCSTISRILDQ---GHCGSCWAFGAVEALSDRFCIHFGM---NLSLSV 151
LPK++D R+ + S +Q +CGSCWA G+ AL+DR I + LSV
Sbjct: 64 LPKNWDWRNV-NGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPSTLLSV 122
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
+++ C C+GG + W Y HG+ E C+ Y C+ C
Sbjct: 123 QNVIDCGN---AGSCEGGNDLPVWEYAHKHGIPDETCNNY----QAKDQECDKFNQCGTC 175
Query: 212 V--RKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
++C ++ LWR + S+S E +MAEIY NGP+ E ++Y
Sbjct: 176 TEFKECHTIQNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATERMSNYTG 229
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
G+Y + H + + GWG S+DG +YW+
Sbjct: 230 GIYTEYQNQAIINHIISVAGWGVSNDGIEYWI 261
>sp|P80067|CATC_RAT Dipeptidyl peptidase 1 OS=Rattus norvegicus GN=Ctsc PE=1 SV=3
Length = 462
Score = 90.1 bits (222), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/285 (27%), Positives = 133/285 (46%), Gaps = 44/285 (15%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV-------GQFKHLLGVKPTPKGLL 84
+L SH + +K +N K+ W A ++ ++ G +L KP P
Sbjct: 166 RLYSH--NHNFVKAINSVQKS-WTATTYEEYEKLSIRDLIRRSGHSGRILRPKPAP---- 218
Query: 85 LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
+ + + L LP+S+D R+ + +S + +Q CGSC++F ++ L R I
Sbjct: 219 --ITDEIQQQILSLPESWDWRNV-RGINFVSPVRNQESCGSCYSFASLGMLEARIRILTN 275
Query: 145 MNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPG 201
+ + LS ++++C + GCDGG+P + A +Y GVV E C PY +
Sbjct: 276 NSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA----- 328
Query: 202 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
P P C+R + +S++Y + + + + E+ K+GP+ V+F V++D
Sbjct: 329 --PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDD 378
Query: 262 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWV 299
F HY SG+Y H + HAV L+G+G G DYW+
Sbjct: 379 FLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWI 423
>sp|P22497|CYSP_THEPA Cysteine proteinase OS=Theileria parva GN=TP03_0285 PE=3 SV=2
Length = 440
Score = 90.1 bits (222), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/259 (28%), Positives = 113/259 (43%), Gaps = 46/259 (17%)
Query: 61 QFSNYTVGQFKHLLGVKPTPKG-------LLLGVPVKTHDKSLK----------LPKSFD 103
+FS+ T +F L V PK LL + KT+ K+LK L K
Sbjct: 171 RFSDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDEDVDLAKLTG 230
Query: 104 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCG 163
W + S+++ + DQ +CG CWAF V ++ + HF + LSV +LL C F
Sbjct: 231 ENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCDSF--S 288
Query: 164 DGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLW 222
+GC GG SA+ Y +G+V+ + P+ D + CS P
Sbjct: 289 NGCQGGLLESAYEYVRKYGLVSAKDLPFVDKARRCSVP---------------------- 326
Query: 223 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 282
+K S+ +Y + E +M + P V +V + A YKSGV+ G + HA
Sbjct: 327 -KAKKVSVPSYHVFKGKE-VMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECGKSL-NHA 383
Query: 283 VKLIGWGTSD-DGEDYWVC 300
V L+G G + + YWV
Sbjct: 384 VVLVGEGYDEVTKKRYWVV 402
>sp|Q9WUU7|CATZ_MOUSE Cathepsin Z OS=Mus musculus GN=Ctsz PE=2 SV=1
Length = 306
Score = 88.6 bits (218), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 98/212 (46%), Gaps = 24/212 (11%)
Query: 98 LPKSFDARSAWPQCSTISRILDQ---GHCGSCWAFGAVEALSDRFCIHFGM---NLSLSV 151
LPK++D R+ + S +Q +CGSCWA G+ A++DR I ++ LSV
Sbjct: 64 LPKNWDWRNV-NGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSILLSV 122
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
+++ C C+GG + W Y HG+ E C+ Y C+ C
Sbjct: 123 QNVIDCGN---AGSCEGGNDLPVWEYAHKHGIPDETCNNY----QAKDQDCDKFNQCGTC 175
Query: 212 V--RKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
++C ++ LWR + S+S E +MAEIY NGP+ E ++Y
Sbjct: 176 TEFKECHTIQNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATEMMSNYTG 229
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
G+Y + H + + GWG S+DG +YW+
Sbjct: 230 GIYAEHQDQAVINHIISVAGWGVSNDGIEYWI 261
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 87.4 bits (215), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/252 (30%), Positives = 109/252 (43%), Gaps = 47/252 (18%)
Query: 61 QFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 119
+FS+ + +F+ LG T L G + + LP++ D W + +S + +
Sbjct: 108 RFSDMSWEEFQATRLGAAQTCSATLAGNHLMR--DAAALPETKD----WREDGIVSPVKN 161
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
Q HCGSCW F AL + G N+SLS L+ C G GC+GG P A+ Y
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221
Query: 180 HHGVV-TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
++G + TEE PY G H E N+ + + I +
Sbjct: 222 YNGGIDTEESYPYKGVNGVCHYKAE--------------------NAAVQVLDSVNITLN 261
Query: 239 PEDIMAEIYKNG-----PVEVSFTVYEDFAHYKSGVYKHITGDVMG------GHAVKLIG 287
ED + KN PV V+F V + F YKSGVY T D G HAV +G
Sbjct: 262 AEDEL----KNAVGLVRPVSVAFQVIDGFRQYKSGVY---TSDHCGTTPDDVNHAVLAVG 314
Query: 288 WGTSDDGEDYWV 299
+G ++G YW+
Sbjct: 315 YGV-ENGVPYWL 325
>sp|Q24940|CATLL_FASHE Cathepsin L-like proteinase OS=Fasciola hepatica GN=Cat-1 PE=1 SV=1
Length = 326
Score = 87.0 bits (214), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 108/244 (44%), Gaps = 31/244 (12%)
Query: 61 QFSNYTVGQFK--HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 118
QF++ T +FK +L + L GVP + +++++ P D W + ++ +
Sbjct: 71 QFTDMTFEEFKAKYLTEMSRASDILSHGVPYEANNRAV--PDKID----WRESGYVTEVK 124
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
DQG+CGSCWAF + ++ + ++S S L+ C G +GC GG +A++Y
Sbjct: 125 DQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL 184
Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
G+ TE PY G +C QL Y ++S
Sbjct: 185 KQFGLETESSYPYTAVEG-----------------QCRYNKQL---GVAKVTGYYTVHSG 224
Query: 239 PE-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GDVMGGHAVKLIGWGTSDDGED 296
E ++ + P V+ V DF Y+SG+Y+ T + HAV +G+GT G D
Sbjct: 225 SEVELKNLVGARRPAAVAVDVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGT-QGGTD 283
Query: 297 YWVC 300
YW+
Sbjct: 284 YWIV 287
>sp|Q9UBR2|CATZ_HUMAN Cathepsin Z OS=Homo sapiens GN=CTSZ PE=1 SV=1
Length = 303
Score = 85.9 bits (211), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 91/211 (43%), Gaps = 23/211 (10%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGH----CGSCWAFGAVEALSDRFCIHFGM---NLSLS 150
LPKS+D R+ + I H CGSCWA + A++DR I + LS
Sbjct: 62 LPKSWDWRNV--DGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLS 119
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPAYPT 208
V +++ C C+GG +S W Y HG+ E C+ Y D C
Sbjct: 120 VQNVIDCGN---AGSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEF 176
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
+C ++ LWR + S+S E +MAEIY NGP+ E A+Y G
Sbjct: 177 KEC--HAIRNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATERLANYTGG 228
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+Y H V + GWG S DG +YW+
Sbjct: 229 IYAEYQDTTYINHVVSVAGWGIS-DGTEYWI 258
>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
PE=2 SV=1
Length = 358
Score = 85.1 bits (209), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 121/281 (43%), Gaps = 40/281 (14%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLL 85
V ++KL + ++++ + N K +K + N QF++ T +F ++ LG L
Sbjct: 73 VEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLN-QFADLTWQEFQRYKLGAAQNCSATLK 131
Query: 86 GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 145
G T +P + D W + +S + +QGHCGSCW F AL + FG
Sbjct: 132 GSHKITE---ATVPDTKD----WREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGK 184
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPYFDSTGCSHPGCE- 203
+SLS L+ C G GC GG P A+ Y ++ G+ TEE PY G GC+
Sbjct: 185 GISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG----GCKF 240
Query: 204 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
A VR V + +++ R PV V+F V +F
Sbjct: 241 SAKNIGVQVRDSVNITLGAEDELKHAVGLVR----------------PVSVAFEVVHEFR 284
Query: 264 HYKSGVYKHIT-----GDVMGGHAVKLIGWGTSDDGEDYWV 299
YK GV+ T DV HAV +G+G DD YW+
Sbjct: 285 FYKKGVFTSNTCGNTPMDV--NHAVLAVGYGVEDD-VPYWL 322
>sp|P05689|CATZ_BOVIN Cathepsin Z OS=Bos taurus GN=CTSZ PE=2 SV=2
Length = 304
Score = 84.7 bits (208), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 96/214 (44%), Gaps = 29/214 (13%)
Query: 98 LPKSFDARSAWPQCSTISRILDQ---GHCGSCWAFGAVEALSDRFCIHFGM---NLSLSV 151
LPKS+D R+ + S +Q +CGSCWA G+ A++DR I + LSV
Sbjct: 63 LPKSWDWRNV-NGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSV 121
Query: 152 NDLLACCGFLCGDG--CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 209
++ C GD C+GG + W Y HG+ E C+ Y C+
Sbjct: 122 QHVIDC-----GDAGSCEGGNDLPVWEYAHRHGIPDETCNNYQ----AKDQECDKFNQCG 172
Query: 210 KCV--RKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
C ++C +K LW+ + S+S E +MAEIY NGP+ E ++Y
Sbjct: 173 TCTEFKECHVIKNYTLWKVGDYGSLSGR------EKMMAEIYTNGPISCGIMATEKMSNY 226
Query: 266 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
G+Y H V + GWG S DG +YW+
Sbjct: 227 TGGIYSEYNDQAFINHIVSVAGWGVS-DGMEYWI 259
>sp|Q26563|CATC_SCHMA Cathepsin C OS=Schistosoma mansoni PE=2 SV=1
Length = 454
Score = 84.3 bits (207), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 125/282 (44%), Gaps = 40/282 (14%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL- 96
+ S + ++N + K+ W+ P+ S YT+ + ++ G + + KT K L
Sbjct: 154 INPSFVGKINAHQKS-WRGEIYPELSKYTIDELRNRAGGVKSMVTRPSVLNRKTPSKELI 212
Query: 97 ----KLPKSFDARSAWPQCST--ISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLS 148
LP FD S P S ++ I +QG CGSC+A + AL R + +F
Sbjct: 213 SLTGNLPLEFDWTSP-PDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQPI 271
Query: 149 LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 207
LS ++ C + +GC+GG+P + A +Y G+ + PY TG
Sbjct: 272 LSPQTVVDCSPY--SEGCNGGFPFLIAGKYGEDFGLPQKIVIPY---TGED--------- 317
Query: 208 TPKCVRKCVKKNQLWRNSKHYS-ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
T KC V KN + YS I Y ++ + + E+ NGP V F VYEDF YK
Sbjct: 318 TGKCT---VSKNCTRYYTTDYSYIGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQFYK 374
Query: 267 SGVYKHITGDV---------MGGHAVKLIGWGTSD-DGEDYW 298
G+Y H T + HAV L+G+G GE YW
Sbjct: 375 EGIYHHTTVQTDHYNFNPFELTNHAVLLVGYGVDKLSGEPYW 416
>sp|P97821|CATC_MOUSE Dipeptidyl peptidase 1 OS=Mus musculus GN=Ctsc PE=2 SV=1
Length = 462
Score = 84.0 bits (206), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 127/276 (46%), Gaps = 42/276 (15%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTV-------GQFKHLLGVKPTPKGLLLGVPVKTHD 93
+ +K +N K+ W A ++ ++ G + + KP P + +
Sbjct: 173 NFVKAINTVQKS-WTATAYKEYEKMSLRDLIRRSGHSQRIPRPKPAP------MTDEIQQ 225
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 151
+ L LP+S+D R+ + +S + +Q CGSC++F ++ L R I + + LS
Sbjct: 226 QILNLPESWDWRNV-QGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSP 284
Query: 152 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 210
++++C + GCDGG+P + A +Y GVV E C PY P P
Sbjct: 285 QEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS-------PCKPREN 335
Query: 211 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
C+R + +S +Y + + + + E+ K+GP+ V+F V++DF HY SG+Y
Sbjct: 336 CLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY 387
Query: 271 KH------ITGDVMGGHAVKLIGWGTSD-DGEDYWV 299
H + HAV L+G+G G +YW+
Sbjct: 388 HHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWI 423
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 84.0 bits (206), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/292 (27%), Positives = 129/292 (44%), Gaps = 38/292 (13%)
Query: 20 SQTFAEGVVSKLKLDSHILQDSI--IKEVNENPKAGWKAARNPQFSNYTVGQFKHL-LGV 76
S + + G++++ +I +D++ I NEN K F+N T +++ L LG
Sbjct: 14 SNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGA 73
Query: 77 KPTPKGLLLGVPVKTHDKSLKLPKSFDARSA-----WPQCSTISRILDQGHCGSCWAFGA 131
+ P + K + ++K + + W Q ++ I DQG CGSCWAF
Sbjct: 74 RTEPVRRI----TKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFST 129
Query: 132 VEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD-P 190
A+ I G +SLS +L+ C GC+GG A+++ + +G + E D P
Sbjct: 130 AAAVEGINKIVTGELVSLSEQELVDCDKSY-NQGCNGGLMDYAFQFIMKNGGLNTEKDYP 188
Query: 191 YFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR-INSDPEDIMAEIYKN 249
Y + G KC N L +NS+ +I Y + S E +
Sbjct: 189 YHGTNG-------------KC-------NSLLKNSRVVTIDGYEDVPSKDETALKRAVSY 228
Query: 250 GPVEVSFTV-YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
PV V+ F HY+SG++ G M HAV +G+G S++G DYW+
Sbjct: 229 QPVSVAIDAGGRAFQHYQSGIFTGKCGTNM-DHAVVAVGYG-SENGVDYWIV 278
>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
Length = 356
Score = 83.2 bits (204), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 119/283 (42%), Gaps = 44/283 (15%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQF-KHLLGVK----PTPK 81
V ++K I D++ + N K +K N +F++ T +F KH LG T K
Sbjct: 71 VEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGIN-EFTDLTWDEFRKHKLGASQNCSATTK 129
Query: 82 GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
G L ++ LP++ D W + +S + QG CGSCW F AL +
Sbjct: 130 GNL-------KLTNVVLPETKD----WRKDGIVSPVKAQGKCGSCWTFSTTGALEAAYAQ 178
Query: 142 HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF-VHHGVVTEECDPYFDSTG-CSH 199
FG +SLS L+ C G GC+GG P A+ Y + G+ TEE PY G C
Sbjct: 179 AFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNGICKF 238
Query: 200 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
V + + Y+++ R PV V+F V
Sbjct: 239 SQANIGVKVISSVNITLGAEYELK----YAVALVR----------------PVSVAFEVV 278
Query: 260 EDFAHYKSGVYKHIT-GD--VMGGHAVKLIGWGTSDDGEDYWV 299
+ F YKSGVY GD + HAV +G+G ++G YW+
Sbjct: 279 KGFKQYKSGVYASTECGDTPMDVNHAVLAVGYGV-ENGTPYWL 320
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 82.4 bits (202), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 115/254 (45%), Gaps = 30/254 (11%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKH-LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSA 107
+ + G++ N +F++ T +F+ LG K + G + HD +LP+S D
Sbjct: 93 DERGGFRLGMN-RFADLTNEEFRATFLGAKVAERSRAAGERYR-HDGVEELPESVD---- 146
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 167
W + ++ + +QG CGSCWAF AV + + G ++LS +L+ C GC+
Sbjct: 147 WREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCN 206
Query: 168 GGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 227
GG A+ + + +G + E D YP KC + N+K
Sbjct: 207 GGLMDDAFDFIIKNGGIDTEDD----------------YPYKAVDGKCDINRE---NAKV 247
Query: 228 YSISAYR-INSDPEDIMAEIYKNGPVEVSFTV-YEDFAHYKSGVYKHITGDVMGGHAVKL 285
SI + + + E + + + PV V+ +F Y SGV+ G + H V
Sbjct: 248 VSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSL-DHGVVA 306
Query: 286 IGWGTSDDGEDYWV 299
+G+GT D+G+DYW+
Sbjct: 307 VGYGT-DNGKDYWI 319
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 82.0 bits (201), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/266 (28%), Positives = 121/266 (45%), Gaps = 24/266 (9%)
Query: 37 ILQDSI--IKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGVKPTPKGLLLGVPVKTHD 93
I +D++ I+E N +P ++ N +FS+ T +F+ LG K K L +
Sbjct: 64 IFKDNLKRIEEHNSDPNRSYERGLN-KFSDLTADEFQASYLGGKMEKKSLSDVAERYQYK 122
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND 153
+ LP D R + + + R+ QG CGSCWAF A A+ I G +SLS +
Sbjct: 123 EGDVLPDEVDWRE---RGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQE 179
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
L+ C GC GG + A+ + +G + D + TG C K +
Sbjct: 180 LIDCDRGNDNFGCAGGGAVWAFEFIKENGGIV--SDEVYGYTGEDTAAC-------KAIE 230
Query: 214 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 273
+K ++ + H + +N + A Y+ P+ V + + + YKSGVYK
Sbjct: 231 --MKTTRVVTINGHEVVP---VNDEMSLKKAVAYQ--PISVMISA-ANMSDYKSGVYKGA 282
Query: 274 TGDVMGGHAVKLIGWGTSDDGEDYWV 299
++ G H V ++G+GTS D DYW+
Sbjct: 283 CSNLWGDHNVLIVGYGTSSDEGDYWL 308
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.321 0.138 0.455
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 126,105,014
Number of Sequences: 539616
Number of extensions: 5588416
Number of successful extensions: 10821
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 204
Number of HSP's successfully gapped in prelim test: 28
Number of HSP's that attempted gapping in prelim test: 10338
Number of HSP's gapped (non-prelim): 261
length of query: 300
length of database: 191,569,459
effective HSP length: 117
effective length of query: 183
effective length of database: 128,434,387
effective search space: 23503492821
effective search space used: 23503492821
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (28.1 bits)