BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023657
(279 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
Length = 339
Score = 184 bits (467), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 111/267 (41%), Positives = 143/267 (53%), Gaps = 38/267 (14%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL LG S+ H L D ++ VN+ W+A N F N V
Sbjct: 10 CLLALGDARSRP-----------SFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVS 169
Query: 186 E-------ECDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVY 259
Y +++ +DIMAEIYKNGPVE +F+VY
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVY 256
>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
Length = 339
Score = 182 bits (463), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 105/242 (43%), Positives = 135/242 (55%), Gaps = 27/242 (11%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 92
H L D ++ VN+ W+A N F N V K L G P P ++
Sbjct: 24 HPLSDELVNYVNKR-NTTWQAGHN--FYNVDVSYLKKLCGTFLGGPKPPQRVM------F 74
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
+ LKLP+SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CIH ++S+ V+
Sbjct: 75 TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134
Query: 153 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----DSTGCS 198
DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY S
Sbjct: 135 AEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGS 194
Query: 199 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P C TPKC + C + ++ KHY ++Y +++ DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFS 254
Query: 258 VY 259
VY
Sbjct: 255 VY 256
>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
Length = 339
Score = 181 bits (459), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 104/242 (42%), Positives = 135/242 (55%), Gaps = 27/242 (11%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 92
H L D ++ VN+ W+A N F N + K L G P P ++
Sbjct: 24 HPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGTFLGGPKPPQRVM------F 74
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
+ LKLP SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CIH ++S+ V+
Sbjct: 75 TEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134
Query: 153 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----DSTGCS 198
DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY S
Sbjct: 135 AEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGS 194
Query: 199 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFS 254
Query: 258 VY 259
VY
Sbjct: 255 VY 256
>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
Length = 339
Score = 177 bits (449), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 103/247 (41%), Positives = 135/247 (54%), Gaps = 29/247 (11%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 91
K SH L D +I +N+ W+A RN F N + K L G +LG P
Sbjct: 20 KPSSHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGT-------VLGGPNLP 69
Query: 92 H----DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 145
+ + LP+SFDAR W C TI++I DQG CGSCWAFGAVEA+SDR CIH +
Sbjct: 70 ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----D 193
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 194 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
S P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249
Query: 253 EVSFTVY 259
E +FTV+
Sbjct: 250 EGAFTVF 256
>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
Length = 339
Score = 170 bits (431), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 98/245 (40%), Positives = 133/245 (54%), Gaps = 33/245 (13%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 91
H L D +I +N+ W+A RN F N + K L LG P + G
Sbjct: 24 HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ + LP++FDAR W C TI +I DQG CGSCWAFGAVEA+SDR CIH +N+ +
Sbjct: 76 --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------ 197
S DLL CCG CGDGC+GGYP AW ++ G+V+ Y GC
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGV--YNSHVGCLPYTIPPCEHHV 191
Query: 198 --SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
S P C TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKNGPVE
Sbjct: 192 NGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEG 251
Query: 255 SFTVY 259
+FTV+
Sbjct: 252 AFTVF 256
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
GN=cpr-6 PE=1 SV=1
Length = 379
Score = 169 bits (427), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 108/293 (36%), Positives = 151/293 (51%), Gaps = 50/293 (17%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKAARN 59
L +C+++ + E V+ K + +DS + D +I VNEN W A +
Sbjct: 4 LLFLSCIVVAAYCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTAKKQ 62
Query: 60 PQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSA 107
+FS+ + G K L+GV KT D L +P+SFD+R
Sbjct: 63 RRFSS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDN 114
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 165
WP+C +I I DQ CGSCWAFGAVEA+SDR CI H + ++LS +DLL+CC CG G
Sbjct: 115 WPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCK-SCGFG 173
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTP 209
C+GG P++AWRY+V G+VT Y + GC P CE YPTP
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTP 231
Query: 210 KCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
KC +KCV ++ + K + SAY + D E I E+ +GP+E++F VYE
Sbjct: 232 KCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYE 284
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
SV=1
Length = 340
Score = 168 bits (425), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 105/271 (38%), Positives = 144/271 (53%), Gaps = 22/271 (8%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
LT+ L I +I TF E +S L D II +NE+P AGW+A ++ +F +
Sbjct: 1 MLTSILCIASLI---TFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLD 57
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
+ + + + P P H D ++++P +FD+R WP C +I+ I DQ CGS
Sbjct: 58 DARIQ-MGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGS 116
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
CW+FGAVEA+SDR CI G N+ LS DLL CC CG GC+GG AW Y+V G+
Sbjct: 117 CWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCE-SCGLGCEGGILGPAWDYWVKEGI 175
Query: 184 VTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 229
VT C+PY T +P C Y TP+C + C +K + + KH
Sbjct: 176 VTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRG 235
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
S+Y + +D + I EI K GPVE SFTVYE
Sbjct: 236 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYE 266
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
GN=CATB PE=2 SV=1
Length = 342
Score = 166 bits (421), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 99/263 (37%), Positives = 142/263 (53%), Gaps = 22/263 (8%)
Query: 17 VISSQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 75
++S TF E V ++ L D +I +NE+P AGWKA ++ +F +++ + L+G
Sbjct: 8 IVSLFTFLEAHVTTRNNQRIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMG 65
Query: 76 VKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
+ + V HD ++++P FD+R WP C +IS+I DQ CGSCWAFGAVE
Sbjct: 66 ARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVE 125
Query: 134 ALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------ 185
A++DR CI G S LS DL++CC CGDGC GG+P AW Y+V G+VT
Sbjct: 126 AMTDRICIQSGGGQSAELSALDLISCCK-DCGDGCQGGFPGVAWDYWVKRGIVTGGSKEN 184
Query: 186 -EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINS 237
C PY T +P C Y TP+C + C K + + KHY +Y + +
Sbjct: 185 HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQN 244
Query: 238 DPEDIMAEIYKNGPVEVSFTVYE 260
+ + I +I GPVE +F VYE
Sbjct: 245 NEKVIQRDIMMYGPVEAAFDVYE 267
>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
Length = 340
Score = 166 bits (420), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 100/245 (40%), Positives = 130/245 (53%), Gaps = 34/245 (13%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
L ++ +N+ G +A N F N + K L G LG P
Sbjct: 26 LSSDLVNHINKLNTTG-RAGHN--FHNTDMSYVKKLCGT-------FLGGPKAPERVDFA 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
+ + LP +FD R WP C TIS I DQG CGSCWAFGAVEA+SDR C+H +S+ V+
Sbjct: 76 EDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSA 135
Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCE----- 203
DLL+CCGF CG GC+GGYP AWRY+ G+V+ Y GC + P CE
Sbjct: 136 EDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGL--YDSHVGCRAYTIPPCEHHVNG 193
Query: 204 -------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
TP+C R C + ++ KHY I++Y + ++IMAEIYKNGPVE +
Sbjct: 194 SRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGA 253
Query: 256 FTVYE 260
F VYE
Sbjct: 254 FIVYE 258
>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
Length = 335
Score = 165 bits (418), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 100/250 (40%), Positives = 129/250 (51%), Gaps = 29/250 (11%)
Query: 29 SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 88
++ L L D ++ +N+ W A N F N + K L G LG P
Sbjct: 17 ARESLHFQPLSDELVNFINKQ-NTTWTAGHN--FYNVDLSYVKKLCGT-------FLGGP 66
Query: 89 VKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
+ LPKSFDAR WP C TI I DQG CGSCWAFGAVEA+SDR CI
Sbjct: 67 KLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSN 126
Query: 145 --MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF--- 192
+N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY
Sbjct: 127 GRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPP 186
Query: 193 --DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 249
S P C TPKC + C ++ KH+ S+Y I+ + ++IMAEIYKN
Sbjct: 187 CEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKN 246
Query: 250 GPVEVSFTVY 259
GPVE +FTVY
Sbjct: 247 GPVEGAFTVY 256
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
GN=cpr-5 PE=2 SV=1
Length = 344
Score = 160 bits (404), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/187 (46%), Positives = 107/187 (57%), Gaps = 21/187 (11%)
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
S +P FDAR WP C +I+ I DQ CGSCWAF A EA+SDR CI + +N LS
Sbjct: 79 SDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSE 138
Query: 153 DLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDS------TGC 197
DLL+CC F CG+GC+GGYPI AW+++V HG+VT C PY + G
Sbjct: 139 DLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGV 198
Query: 198 SHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
P C E PTPKCV C KN + KH+ +AY + E I EI NGP+E
Sbjct: 199 KWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIE 258
Query: 254 VSFTVYE 260
V+FTVYE
Sbjct: 259 VAFTVYE 265
>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
Length = 311
Score = 154 bits (388), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 89/216 (41%), Positives = 119/216 (55%), Gaps = 23/216 (10%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCS 112
W + QF N VGQ LLG K +P L +K++D +++P SF+A++ WP C+
Sbjct: 39 WVEEQTDQFDNIKVGQ---LLGFKRSPNRPKL--QIKSYDPLGVQIPTSFNAQTNWPNCT 93
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPI 172
TIS+I +Q CGSCWAFGA E+ +DR CIH N+ LS D++ C +GC+GG
Sbjct: 94 TISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTC--DETDNGCEGGDAF 151
Query: 173 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRN 224
SAW + G V+EEC PY + P C PA TP C ++C + L +
Sbjct: 152 SAWNWLRKQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQ 205
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
KH Y +SD E IM EI NGPVE FTV+E
Sbjct: 206 DKHKMAKIYSFDSD-EAIMQEIVTNGPVEACFTVFE 240
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
PE=1 SV=2
Length = 329
Score = 143 bits (360), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/172 (44%), Positives = 100/172 (58%), Gaps = 11/172 (6%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
+P +FD+R+ W +C +I I DQ CGSCWAFGA E +SDR CI +S +DLL
Sbjct: 85 VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG+GC+GGYPI A R++ GVVT C PY + C+ C P TP
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTP 202
Query: 210 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C C + + KH+ +SAY + + I AEIY NGPVE +F+VYE
Sbjct: 203 SCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYE 254
>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
Length = 335
Score = 138 bits (347), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 99/240 (41%), Positives = 131/240 (54%), Gaps = 27/240 (11%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--- 94
L D ++ VN+ WKA N F N + K L G +L G + D
Sbjct: 26 LSDELVNFVNKQ-NTTWKAGHN--FYNVDLSYVKKLCGA------ILGGPKLPQRDAFAA 76
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+SFDAR WP C TI I DQG CGSCWAFGAVEA+SDR CIH +N+ +S
Sbjct: 77 DVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAE 136
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----DSTGCSHP 200
D+L CCG CGDGC+GG+P AW ++ G+V+ C PY S P
Sbjct: 137 DMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRP 196
Query: 201 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNGPVE +F+VY
Sbjct: 197 PCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVY 256
>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
GN=cpr-4 PE=2 SV=1
Length = 335
Score = 137 bits (346), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 98/241 (40%), Positives = 128/241 (53%), Gaps = 24/241 (9%)
Query: 39 QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-KSLK 97
Q++I + VN ++ WKA P+ + T+ Q K L V V HD
Sbjct: 25 QEAITEYVNSK-QSLWKA-EIPK--DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINEDT 80
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P +FDAR+ WP C +I+ I DQ CGSCWAF A EA SDRFCI + +N LS D+L
Sbjct: 81 IPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----DSTG-CSHPGC 202
+CC CG GC+GGYPI+AW+Y V G T C PY ++ G + P C
Sbjct: 141 SCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSC 199
Query: 203 -EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
+ Y TP CV KC KN + KH+ +AY + I AEI +GPVE +FTVY
Sbjct: 200 PDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVY 259
Query: 260 E 260
E
Sbjct: 260 E 260
>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
GN=AC-2 PE=2 SV=1
Length = 342
Score = 136 bits (342), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/185 (40%), Positives = 99/185 (53%), Gaps = 19/185 (10%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D + +P S+D R W C+T I DQ +CGSCWA A+SDR CI +++S
Sbjct: 82 DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C H G
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199
Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C PTP C RKC +++R K Y AY + + I +EI KNGPV S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVAS 259
Query: 256 FTVYE 260
F VYE
Sbjct: 260 FAVYE 264
>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
GN=cpr-3 PE=2 SV=1
Length = 370
Score = 135 bits (341), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/175 (44%), Positives = 94/175 (53%), Gaps = 15/175 (8%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
LP +FDAR WP C+TI I +Q CGSCWAFGA E +SDR CI +SV D+L
Sbjct: 92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG GC GGY I A R++ G VT C PY S C P TP
Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTP 208
Query: 210 KCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYE 260
C C K + ++ KHY SAY++ + +I EIY GPVE S+ VYE
Sbjct: 209 SCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYE 263
>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
GN=AC-1 PE=2 SV=1
Length = 342
Score = 135 bits (340), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/185 (40%), Positives = 99/185 (53%), Gaps = 19/185 (10%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D + +P S+D R W C+T I DQ +CGSCWA A+SDR CI +++S
Sbjct: 82 DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C H G
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199
Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C PTP C RKC +++R K Y AY + + I +EI +NGPV S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVAS 259
Query: 256 FTVYE 260
F VYE
Sbjct: 260 FAVYE 264
>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
GN=CP-1 PE=3 SV=3
Length = 341
Score = 123 bits (308), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/179 (39%), Positives = 99/179 (55%), Gaps = 18/179 (10%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P+S+D R W CS++ I DQ +CGSCWA + A+SDR CI + +S D++
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY-- 206
+CC + CGDGC+GG+PISA+R+ GVVT C PY + C H G E Y
Sbjct: 151 SCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGE 208
Query: 207 -----PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
TP+C R+C+ S Y AY++ + + I +I KNGPV ++TVYE
Sbjct: 209 CVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYE 267
>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
SV=1
Length = 476
Score = 93.2 bits (230), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/238 (31%), Positives = 109/238 (45%), Gaps = 24/238 (10%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
++Q +I+ VN+ GW A QF T+ + FK+ LG P P LLL + T
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLT 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
K+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 KTTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+E
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHE 382
>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
GN=Tinagl1 PE=1 SV=1
Length = 466
Score = 92.0 bits (227), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 74/255 (29%), Positives = 115/255 (45%), Gaps = 30/255 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 140 LVDPDMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQ 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH-FG-MNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH G M LS
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ A PTP+C+
Sbjct: 257 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQ------NEASPTPRCM 309
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + Q+ N + AYR+ SD ++IM E+ +NGPV+ +
Sbjct: 310 MHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQA---LM 366
Query: 260 EVKQTLTLYSSTDFS 274
EV + LY +S
Sbjct: 367 EVHEDFFLYQRGIYS 381
>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
GN=Tinagl1 PE=2 SV=1
Length = 467
Score = 90.9 bits (224), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 117/255 (45%), Gaps = 29/255 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ ++IK +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 140 LVDPAMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQ 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH-FG-MNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH G M LS
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ + A PTP+C+
Sbjct: 257 NLLSCDTHH-QKGCRGGRLDGAWWFLRRRGVVSDNCYPF-----SGREQNDEASPTPRCM 310
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + +Q+ N + YR+ SD ++IM E+ +NGPV+ +
Sbjct: 311 MHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQA---LM 367
Query: 260 EVKQTLTLYSSTDFS 274
EV + LY +S
Sbjct: 368 EVHEDFFLYQRGIYS 382
>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
SV=3
Length = 476
Score = 89.0 bits (219), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 106/237 (44%), Gaps = 22/237 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVRSELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V E
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVRE 382
>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
GN=TINAGL1 PE=1 SV=1
Length = 467
Score = 86.3 bits (212), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 115/249 (46%), Gaps = 18/249 (7%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH-FG-MNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH G M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316
Query: 209 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEVKQTL 265
+ R+ N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ + EV +
Sbjct: 317 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQA---LMEVHEDF 373
Query: 266 TLYSSTDFS 274
LY +S
Sbjct: 374 FLYKGGIYS 382
>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
Length = 299
Score = 86.3 bits (212), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/215 (33%), Positives = 97/215 (45%), Gaps = 24/215 (11%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K VP T + + P SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQAPDSFDFREEY 84
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V ++ DR C G++ + S +++C GD
Sbjct: 85 PHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSC---DRGDM 138
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
CDGG+ S WR+ G T+EC PY G A T C KC + L
Sbjct: 139 ACDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHL 189
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
K Y + D IM + GP++ +FTVY
Sbjct: 190 YKATKAVDYGL--DAPAIMKALATGGPLQTAFTVY 222
>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
elegans GN=F26E4.3 PE=1 SV=3
Length = 452
Score = 80.1 bits (196), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 82/170 (48%), Gaps = 7/170 (4%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 151
K +LP+ FDAR W I + DQG CGS W+ SDR I +N +LS
Sbjct: 180 KPRELPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSS 237
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
LL+C GC+GGY AW Y GVV + C PY S PG
Sbjct: 238 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 295
Query: 212 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYE 260
R+ ++ ++S + ++ Y+++S EDI E+ NGPV+ +F V+E
Sbjct: 296 NRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHE 345
>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
Length = 300
Score = 78.2 bits (191), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 63/215 (29%), Positives = 96/215 (44%), Gaps = 23/215 (10%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K P T +P+SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGAAPRGTFTDKDDVPESFDFREEY 85
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V DR C+ G++ + S +++C GD
Sbjct: 86 PHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSC---DHGDM 139
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
C+GG+ + W++ G T+EC PY + C PT KC +
Sbjct: 140 ACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHL 190
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
+ S Y + D +M + +GP++V+F V+
Sbjct: 191 ATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVH 223
>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
Length = 303
Score = 77.0 bits (188), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 69/232 (29%), Positives = 100/232 (43%), Gaps = 29/232 (12%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 108
WKA +F N T +F+ +L ++P G L + + + + +P FD R +
Sbjct: 31 WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 165
PQC + LDQG CGSCWAF A+ DR C G++ +S S L++C L G
Sbjct: 90 PQC--VKPALDQGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 225
CDGG W + G T EC Y D G A P P QL++
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEVKQTLTLYSSTDFSASF 277
+ +S S P IM + GP++ VY L+ Y S + ++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVY---ADLSYYESGVYKHTY 241
>sp|P22497|CYSP_THEPA Cysteine proteinase OS=Theileria parva GN=TP03_0285 PE=3 SV=2
Length = 440
Score = 74.7 bits (182), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 80/168 (47%), Gaps = 24/168 (14%)
Query: 61 QFSNYTVGQFKHLLGVKPTPKG-------LLLGVPVKTHDKSLK----------LPKSFD 103
+FS+ T +F L V PK LL + KT+ K+LK L K
Sbjct: 171 RFSDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDEDVDLAKLTG 230
Query: 104 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCG 163
W + S+++ + DQ +CG CWAF V ++ + HF + LSV +LL C F
Sbjct: 231 ENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCDSF--S 288
Query: 164 DGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCE----PAY 206
+GC GG SA+ Y +G+V+ + P+ D + CS P + P+Y
Sbjct: 289 NGCQGGLLESAYEYVRKYGLVSAKDLPFVDKARRCSVPKAKKVSVPSY 336
>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
SV=1
Length = 435
Score = 72.8 bits (177), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/223 (25%), Positives = 105/223 (47%), Gaps = 23/223 (10%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 100
+K +N K+ W A R ++ T+ +G + P+ + + H++ +LP
Sbjct: 148 EFVKAINTIQKS-WTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPT 206
Query: 101 SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACC 158
S+D R+ + +S + +Q CGSC+AF + L R I + LS ++++C
Sbjct: 207 SWDWRNV-RGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCS 265
Query: 159 GFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
+ GC+GG+P + A +Y G+V E C PY G P C+P C R
Sbjct: 266 QY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR---- 311
Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
+ +S++Y + + + + E+ ++GP+ V+F VY+
Sbjct: 312 ----YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYD 350
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 71.6 bits (174), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 69/147 (46%), Gaps = 8/147 (5%)
Query: 61 QFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 119
+FS+ + +F+ LG T L G + + LP++ D W + +S + +
Sbjct: 108 RFSDMSWEEFQATRLGAAQTCSATLAGNHLMR--DAAALPETKD----WREDGIVSPVKN 161
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
Q HCGSCW F AL + G N+SLS L+ C G GC+GG P A+ Y
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221
Query: 180 HHGVV-TEECDPYFDSTGCSHPGCEPA 205
++G + TEE PY G H E A
Sbjct: 222 YNGGIDTEESYPYKGVNGVCHYKAENA 248
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 70.9 bits (172), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 80/173 (46%), Gaps = 23/173 (13%)
Query: 20 SQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 79
+Q FAEG VS KL + D + E + NYT+ K L +
Sbjct: 94 NQRFAEGKVS-FKLAVNKYADLLHHEFRQLMNG----------FNYTL--HKQLRAADES 140
Query: 80 PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139
KG+ P + LPKS D W ++ + DQGHCGSCWAF + AL +
Sbjct: 141 FKGVTFISPA-----HVTLPKSVD----WRTKGAVTAVKDQGHCGSCWAFSSTGALEGQH 191
Query: 140 CIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPY 191
G+ +SLS +L+ C +GC+GG +A+RY + G+ TE+ PY
Sbjct: 192 FRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 244
>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
PE=2 SV=1
Length = 358
Score = 70.5 bits (171), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 81/172 (47%), Gaps = 11/172 (6%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLL 85
V ++KL + ++++ + N K +K + N QF++ T +F ++ LG L
Sbjct: 73 VEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLN-QFADLTWQEFQRYKLGAAQNCSATLK 131
Query: 86 GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 145
G T +P + D W + +S + +QGHCGSCW F AL + FG
Sbjct: 132 GSHKITE---ATVPDTKD----WREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGK 184
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTG 196
+SLS L+ C G GC GG P A+ Y ++G + TEE PY G
Sbjct: 185 GISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 236
>sp|Q24940|CATLL_FASHE Cathepsin L-like proteinase OS=Fasciola hepatica GN=Cat-1 PE=1 SV=1
Length = 326
Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 67/133 (50%), Gaps = 8/133 (6%)
Query: 61 QFSNYTVGQFK--HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 118
QF++ T +FK +L + L GVP + +++++ P D W + ++ +
Sbjct: 71 QFTDMTFEEFKAKYLTEMSRASDILSHGVPYEANNRAV--PDKID----WRESGYVTEVK 124
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
DQG+CGSCWAF + ++ + ++S S L+ C G +GC GG +A++Y
Sbjct: 125 DQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL 184
Query: 179 VHHGVVTEECDPY 191
G+ TE PY
Sbjct: 185 KQFGLETESSYPY 197
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 69.3 bits (168), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 75/145 (51%), Gaps = 7/145 (4%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSA 107
+ + G++ N +F++ T G+F+ LG P +G +G + HD LP S D R
Sbjct: 107 DERGGFRLGMN-RFADLTNGEFRATYLGTTPAGRGRRVGEAYR-HDGVEALPDSVDWRD- 163
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 167
+ + ++ + +QG CGSCWAF AV A+ I G +SLS +L+ C GC+
Sbjct: 164 --KGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCN 221
Query: 168 GGYPISAWRYFVHHGVV-TEECDPY 191
GG A+ + +G + TEE PY
Sbjct: 222 GGIMDDAFAFIARNGGLDTEEDYPY 246
>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
Length = 323
Score = 68.9 bits (167), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 58/112 (51%), Gaps = 7/112 (6%)
Query: 86 GVPVKTHDKSLKLPKS-----FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 140
G+ + H S+ LPKS A W ++ + DQG CGSCWAF AV AL
Sbjct: 86 GMTRRRHPLSV-LPKSAPTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSAVAALEGAHF 144
Query: 141 IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV-HHGVVTEECDPY 191
+ G +SLS +L+ C GC+GG+P A++Y + + G+ TE PY
Sbjct: 145 LKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSYPY 196
>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
Length = 356
Score = 68.9 bits (167), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 80/176 (45%), Gaps = 19/176 (10%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQF-KHLLGVK----PTPK 81
V ++K I D++ + N K +K N +F++ T +F KH LG T K
Sbjct: 71 VEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGIN-EFTDLTWDEFRKHKLGASQNCSATTK 129
Query: 82 GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
G L ++ LP++ D W + +S + QG CGSCW F AL +
Sbjct: 130 GNL-------KLTNVVLPETKD----WRKDGIVSPVKAQGKCGSCWTFSTTGALEAAYAQ 178
Query: 142 HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF-VHHGVVTEECDPYFDSTG 196
FG +SLS L+ C G GC+GG P A+ Y + G+ TEE PY G
Sbjct: 179 AFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNG 234
>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
Length = 358
Score = 68.6 bits (166), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 78/167 (46%), Gaps = 11/167 (6%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLL 85
V ++KL I ++++ + N K +K N QF++ T +F+ LG L
Sbjct: 73 VEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVN-QFADLTWQEFQRTKLGAAQNCSATLK 131
Query: 86 GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 145
G T LP++ D W + +S + DQG CGSCW F AL + FG
Sbjct: 132 GSHKVTE---AALPETKD----WREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGK 184
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPY 191
+SLS L+ C G GC+GG P A+ Y +G + TE+ PY
Sbjct: 185 GISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPY 231
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 67.8 bits (164), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 55/97 (56%), Gaps = 5/97 (5%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLL 155
+ +PKS D W + ++ + DQGHCGSCWAF + AL + G+ +SLS +L+
Sbjct: 120 VTVPKSVD----WREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLV 175
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPY 191
C +GC+GG +A+RY + G+ TE+ PY
Sbjct: 176 DCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 212
>sp|Q5E998|CATL2_BOVIN Cathepsin L2 OS=Bos taurus GN=CTSL2 PE=2 SV=1
Length = 334
Score = 67.0 bits (162), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 17/165 (10%)
Query: 51 KAGWKAARNPQFSNYTVGQFKHLLG---VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSA 107
K G++ A N F + T +F+ ++ + KG L P+ + +PKS D
Sbjct: 70 KHGFRMAMNA-FGDMTNEEFRQVMNGFQNQKHKKGKLFHEPL-----LVDVPKSVD---- 119
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 167
W + ++ + +QG CGSCWAF A AL + G +SLS +L+ C GC+
Sbjct: 120 WTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCN 179
Query: 168 GGYPISAWRYFVHHGVV-TEECDPYF--DSTGCSH-PGCEPAYPT 208
GG +A++Y +G + +EE PY D+ C++ P C A T
Sbjct: 180 GGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDT 224
>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
Length = 334
Score = 66.6 bits (161), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 17/165 (10%)
Query: 51 KAGWKAARNPQFSNYTVGQFKHLLG---VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSA 107
K G++ A N F + T +F+ ++ + KG L P+ + +PKS D
Sbjct: 70 KHGFRMAMNA-FGDMTNEEFRQVMNGFQNQKHKKGKLFHEPL-----LVDVPKSVD---- 119
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 167
W + ++ + +QG CGSCWAF A AL + G +SLS +L+ C GC+
Sbjct: 120 WTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCN 179
Query: 168 GGYPISAWRYFVHH-GVVTEECDPYF--DSTGCSH-PGCEPAYPT 208
GG +A++Y + G+ +EE PY D+ C++ P C A T
Sbjct: 180 GGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDT 224
>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
Length = 333
Score = 66.6 bits (161), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 17/165 (10%)
Query: 51 KAGWKAARNPQFSNYTVGQFKHLLG---VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSA 107
K G+ A N F + T +F+ ++ + KG + P+ ++PKS D
Sbjct: 70 KHGFTMAMNA-FGDMTNEEFRQVMNGFQNQKHKKGKMFQEPLFA-----EIPKSVD---- 119
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 167
W + ++ + +QG CGSCWAF A AL + G +SLS +L+ C +GC+
Sbjct: 120 WREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCN 179
Query: 168 GGYPISAWRYFVHHGVV-TEECDPYF--DSTGCSH-PGCEPAYPT 208
GG +A+RY +G + +EE PY D+ C++ P C A T
Sbjct: 180 GGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDT 224
>sp|Q90686|CATK_CHICK Cathepsin K OS=Gallus gallus GN=CTSK PE=2 SV=1
Length = 334
Score = 65.9 bits (159), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 61/117 (52%), Gaps = 12/117 (10%)
Query: 77 KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 136
+P P G L VP D S + P + D W + ++ + DQG CGSCWAF +V AL
Sbjct: 104 RPRPNGTLY-VP----DWSSRAPAAVD----WRRKGYVTPVKDQGQCGSCWAFSSVGALE 154
Query: 137 DRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF-VHHGVVTEECDPYF 192
+ G LSLS +L+ C +GC GGY +A+ Y ++ G+ +E+ PY
Sbjct: 155 GQLKRRTGKLLSLSPQNLVYCVS--NNNGCGGGYMTNAFEYVRLNRGIDSEDAYPYI 209
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 64.7 bits (156), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 62/117 (52%), Gaps = 8/117 (6%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLL 155
L++PKS D W + ++ + +QG CGSCWAF A AL + G +SLS +L+
Sbjct: 112 LEVPKSVD----WREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 167
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYF--DSTGCSH-PGCEPAYPT 208
C GC+GG +A++Y +G + TEE PY ++ C++ P C A T
Sbjct: 168 DCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDT 224
>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
SV=1
Length = 368
Score = 64.7 bits (156), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 94/210 (44%), Gaps = 39/210 (18%)
Query: 61 QFSNYTVGQF-KHLLGVK---PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISR 116
QFS+ T +F K LGV+ PK + T + LP+ FD W ++
Sbjct: 98 QFSDLTRSEFRKKHLGVRSGFKLPKDANKAPILPTEN----LPEDFD----WRDHGAVTP 149
Query: 117 ILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC-------GFLCGDGCDGG 169
+ +QG CGSCW+F A AL + G +SLS L+ C C GC+GG
Sbjct: 150 VKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGG 209
Query: 170 YPISAWRYFVHHGVVTEECD-PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 228
SA+ Y + G + +E D PY TG C+ + K+++ + ++
Sbjct: 210 LMNSAFEYTLKTGGLMKEEDYPY---TGKDGKTCK------------LDKSKIVASVSNF 254
Query: 229 SISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
S+ I+ D E I A + KNGP+ V+
Sbjct: 255 SV----ISIDEEQIAANLVKNGPLAVAINA 280
>sp|Q54TR1|CFAD_DICDI Counting factor associated protein D OS=Dictyostelium discoideum
GN=cfaD PE=1 SV=1
Length = 531
Score = 64.3 bits (155), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 80/179 (44%), Gaps = 25/179 (13%)
Query: 49 NPKAGWK--AARNPQFSNYTVG----------QFKHLLGVKP-TPKGLLLGVPVKTHDKS 95
N KA K A N + S+Y +G +F L VKP + + G D+S
Sbjct: 248 NFKAARKIIATHNAKESSYKLGMNHYADLSNKEFNTL--VKPKVARPSVTGADSVHDDES 305
Query: 96 LK-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL 154
L+ +P + D W + ++ + DQG CGSCW FG+ +L C+ G +SLS L
Sbjct: 306 LRSIPSTVD----WRNQNCVTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQL 361
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHG-VVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+ C GC GG+ SA++Y + G + TE PY G C TP V
Sbjct: 362 VDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYPYLMQNGL----CRDRTVTPSGV 416
>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
Length = 463
Score = 64.3 bits (155), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 60/229 (26%), Positives = 105/229 (45%), Gaps = 32/229 (13%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 96
+ +K +N K+ W A ++ T+G G P PK L ++ K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPTPLTAEIQ--QKIL 229
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP S+D R+ + +S + +Q CGSC++F +V L R I + + LS ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSSQEV 288
Query: 155 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 289 VSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP------------- 330
Query: 214 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C K +R +S+++ + + + + E+ +GP+ V+F VY+
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYD 378
>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1
Length = 363
Score = 64.3 bits (155), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 89/210 (42%), Gaps = 40/210 (19%)
Query: 61 QFSNYTVGQFK-HLLGVKPTPKGLLLGVPVKTHDKSL----KLPKSFDARSAWPQCSTIS 115
+FS+ T +F+ LG+K L +P + LP+ FD W + ++
Sbjct: 95 KFSDLTASEFRRQFLGLKKR-----LRLPAHAQKAPILPTTNLPEDFD----WREKGAVT 145
Query: 116 RILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC-------CGFLCGDGCDG 168
+ DQG CGSCWAF AL + G +SLS L+ C C GC+G
Sbjct: 146 PVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNG 205
Query: 169 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 228
G +A+ Y + G V +E D + S C+ K+++ + ++
Sbjct: 206 GLMNNAFEYLLESGGVVQEKDYAYTGRDGS---CK------------FDKSKVVASVSNF 250
Query: 229 SISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
S+ + D + I A + KNGP+ V+
Sbjct: 251 SV----VTLDEDQIAANLVKNGPLAVAINA 276
>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
PE=2 SV=2
Length = 362
Score = 64.3 bits (155), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC 157
LP++ D W + +S + DQGHCGSCW F +L + G +SLS L+ C
Sbjct: 145 LPETKD----WREDGIVSPVKDQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDC 200
Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSH 199
GC GG P A+ Y ++G + TEE PY G H
Sbjct: 201 ATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTGVNGICH 243
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 63.9 bits (154), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 57/101 (56%), Gaps = 6/101 (5%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
D + KLP S D W + ++ + QG CGSCWAF AV AL + + G +SLS
Sbjct: 110 DPNQKLPDSMD----WREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQ 165
Query: 153 DLLACCGFLCGD-GCDGGYPISAWRYFV-HHGVVTEECDPY 191
+L+ C G+ GC+GG+ A++Y + ++G+ +E PY
Sbjct: 166 NLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPY 206
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.321 0.136 0.442
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 111,633,974
Number of Sequences: 539616
Number of extensions: 4842062
Number of successful extensions: 9538
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 195
Number of HSP's successfully gapped in prelim test: 33
Number of HSP's that attempted gapping in prelim test: 9142
Number of HSP's gapped (non-prelim): 239
length of query: 279
length of database: 191,569,459
effective HSP length: 116
effective length of query: 163
effective length of database: 128,974,003
effective search space: 21022762489
effective search space used: 21022762489
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 60 (27.7 bits)