BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018568
(354 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
Length = 339
Score = 285 bits (728), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 156/352 (44%), Positives = 208/352 (59%), Gaps = 41/352 (11%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL LG S+ H L D ++ VN+ W+A N F N V
Sbjct: 10 CLLALGDARSRP-----------SFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVS 169
Query: 186 EE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSIS 231
C PY C H P C TPKC + C + ++ KHY +
Sbjct: 170 GGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYN 228
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
+Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG
Sbjct: 229 SYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV- 287
Query: 292 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 343
++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 288 ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
Length = 339
Score = 283 bits (725), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 150/327 (45%), Positives = 200/327 (61%), Gaps = 30/327 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 92
H L D ++ VN+ W+A N F N V K L G P P ++
Sbjct: 24 HPLSDELVNYVNKR-NTTWQAGHN--FYNVDVSYLKKLCGTFLGGPKPPQRVM------F 74
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
+ LKLP+SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CIH ++S+ V+
Sbjct: 75 TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134
Query: 153 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH---- 199
DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY C H
Sbjct: 135 AEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNG 193
Query: 200 --PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
P C TPKC + C + ++ KHY ++Y +++ DIMAEIYKNGPVE +F
Sbjct: 194 SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAF 253
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 316
+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW++AN WN WG +G+FKI
Sbjct: 254 SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKI 312
Query: 317 KRGSNECGIEEDVVAGLPSSKNLVKEI 343
RG + CGIE +VVAG+P + ++I
Sbjct: 313 LRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
Length = 339
Score = 282 bits (722), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 154/352 (43%), Positives = 207/352 (58%), Gaps = 41/352 (11%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H L D ++ VN+ W+A N F N +
Sbjct: 10 CLLVLANARSRP-----------SFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 186 EE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSIS 231
C PY C H P C TPKC + C + ++ KHY +
Sbjct: 170 GGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYN 228
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
+Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG
Sbjct: 229 SYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV- 287
Query: 292 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 343
++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 288 ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
Length = 339
Score = 277 bits (708), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 148/325 (45%), Positives = 195/325 (60%), Gaps = 30/325 (9%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 91
K SH L D +I +N+ W+A RN F N + K L G +LG P
Sbjct: 20 KPSSHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGT-------VLGGPNLP 69
Query: 92 H----DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 145
+ + LP+SFDAR W C TI++I DQG CGSCWAFGAVEA+SDR CIH +
Sbjct: 70 ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 193
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 194 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
S P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249
Query: 253 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 312
E +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 308
Query: 313 YFKIKRGSNECGIEEDVVAGLPSSK 337
+FKI RG N CGIE ++VAG+P ++
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRTQ 333
>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
Length = 339
Score = 266 bits (681), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 191/320 (59%), Gaps = 30/320 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 91
H L D +I +N+ W+A RN F N + K L LG P + G
Sbjct: 24 HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ + LP++FDAR W C TI +I DQG CGSCWAFGAVEA+SDR CIH +N+ +
Sbjct: 76 --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 197
S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNG 193
Query: 198 SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
S P C TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKNGPVE +F
Sbjct: 194 SRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAF 253
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 316
TV+ DF YKSGVYKH GD+MGGHA++++GWG ++G YW+ AN WN WG +G+FKI
Sbjct: 254 TVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKI 312
Query: 317 KRGSNECGIEEDVVAGLPSS 336
RG N CGIE ++VAG+P +
Sbjct: 313 LRGENHCGIESEIVAGIPRT 332
>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
Length = 335
Score = 266 bits (680), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 143/331 (43%), Positives = 191/331 (57%), Gaps = 32/331 (9%)
Query: 29 SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 88
++ L L D ++ +N+ W A N F N + K L G LG P
Sbjct: 17 ARESLHFQPLSDELVNFINKQ-NTTWTAGHN--FYNVDLSYVKKLCGT-------FLGGP 66
Query: 89 VKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
+ LPKSFDAR WP C TI I DQG CGSCWAFGAVEA+SDR CI
Sbjct: 67 KLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSN 126
Query: 145 --MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 195
+N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY
Sbjct: 127 GRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIP 185
Query: 196 GCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
C H P C TPKC + C ++ KH+ S+Y I+ + ++IMAEIYK
Sbjct: 186 PCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYK 245
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 308
NGPVE +FTVY DF YKSGVY+H+TGD+MGGHA++++GWG ++G YW++ N WN W
Sbjct: 246 NGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDW 304
Query: 309 GADGYFKIKRGSNECGIEEDVVAGLPSSKNL 339
G +G+FKI RG + CGIE ++VAG+P + +
Sbjct: 305 GDNGFFKILRGQDHCGIESEIVAGIPCTPHF 335
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
SV=1
Length = 340
Score = 265 bits (677), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 201/343 (58%), Gaps = 23/343 (6%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
LT+ L I +I TF E +S L D II +NE+P AGW+A ++ +F +
Sbjct: 1 MLTSILCIASLI---TFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLD 57
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
+ + + + P P H D ++++P +FD+R WP C +I+ I DQ CGS
Sbjct: 58 DARIQ-MGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGS 116
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
CW+FGAVEA+SDR CI G N+ LS DLL CC CG GC+GG AW Y+V G+
Sbjct: 117 CWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGI 175
Query: 184 VTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 229
VT C+PY T +P C Y TP+C + C +K + + KH
Sbjct: 176 VTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRG 235
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
S+Y + +D + I EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG
Sbjct: 236 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWG 295
Query: 290 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 332
++ YW++AN WN WG +GYF+I RG +EC IE +V+AG
Sbjct: 296 V-ENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVIAG 337
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
GN=cpr-6 PE=1 SV=1
Length = 379
Score = 261 bits (666), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 154/373 (41%), Positives = 205/373 (54%), Gaps = 51/373 (13%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKAARN 59
L +C+++ + E V+ K + +DS + D +I VNEN W A +
Sbjct: 4 LLFLSCIVVAAYCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTAKKQ 62
Query: 60 PQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSA 107
+FS+ + G K L+GV KT D L +P+SFD+R
Sbjct: 63 RRFSS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDN 114
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 165
WP+C +I I DQ CGSCWAFGAVEA+SDR CI H + ++LS +DLL+CC CG G
Sbjct: 115 WPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCK-SCGFG 173
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTP 209
C+GG P++AWRY+V G+VT Y + GC P CE YPTP
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTP 231
Query: 210 KCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
KC +KCV ++ + K + SAY + D E I E+ +GP+E++F VYEDF +Y
Sbjct: 232 KCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDG 291
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 327
GVY H G + GGHAVKLIGWG DDG YW +AN WN WG DG+F+I RG +ECGIE
Sbjct: 292 GVYVHTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIES 350
Query: 328 DVVAGLPSSKNLV 340
VV G+P +L
Sbjct: 351 GVVGGIPKLNSLT 363
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
GN=CATB PE=2 SV=1
Length = 342
Score = 258 bits (660), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 138/336 (41%), Positives = 197/336 (58%), Gaps = 23/336 (6%)
Query: 17 VISSQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 75
++S TF E V ++ L D +I +NE+P AGWKA ++ +F +++ + L+G
Sbjct: 8 IVSLFTFLEAHVTTRNNQRIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMG 65
Query: 76 VKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
+ + V HD ++++P FD+R WP C +IS+I DQ CGSCWAFGAVE
Sbjct: 66 ARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVE 125
Query: 134 ALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------ 185
A++DR CI G + LS DL++CC CGDGC GG+P AW Y+V G+VT
Sbjct: 126 AMTDRICIQSGGGQSAELSALDLISCCK-DCGDGCQGGFPGVAWDYWVKRGIVTGGSKEN 184
Query: 186 -EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINS 237
C PY T +P C Y TP+C + C K + + KHY +Y + +
Sbjct: 185 HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQN 244
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
+ + I +I GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + Y
Sbjct: 245 NEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPY 303
Query: 298 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 333
W++AN WN WG G F++ RG +EC IE DVVAGL
Sbjct: 304 WLIANSWNEDWGEKGLFRMVRGRDECSIESDVVAGL 339
>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
Length = 340
Score = 258 bits (659), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 187/319 (58%), Gaps = 35/319 (10%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
L ++ +N+ G +A N F N + K L G LG P
Sbjct: 26 LSSDLVNHINKLNTTG-RAGHN--FHNTDMSYVKKLCGT-------FLGGPKAPERVDFA 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
+ + LP +FD R WP C TIS I DQG CGSCWAFGAVEA+SDR C+H +S+ V+
Sbjct: 76 EDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSA 135
Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCE----- 203
DLL+CCGF CG GC+GGYP AWRY+ G+V+ Y GC + P CE
Sbjct: 136 EDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGL--YDSHVGCRAYTIPPCEHHVNG 193
Query: 204 -------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
TP+C R C + ++ KHY I++Y + ++IMAEIYKNGPVE +
Sbjct: 194 SRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGA 253
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 315
F VYEDF YKSGVY+H++G+ +GGHA++++GWG ++G YW+ AN WN WG G+FK
Sbjct: 254 FIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGITGFFK 312
Query: 316 IKRGSNECGIEEDVVAGLP 334
I RG + CGIE ++VAG+P
Sbjct: 313 ILRGEDHCGIESEIVAGVP 331
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
GN=cpr-5 PE=2 SV=1
Length = 344
Score = 253 bits (646), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 129/258 (50%), Positives = 159/258 (61%), Gaps = 22/258 (8%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P FDAR WP C +I+ I DQ CGSCWAF A EA+SDR CI + +N LS DLL
Sbjct: 82 IPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141
Query: 156 ACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHP 200
+CC F CG+GC+GGYPI AW+++V HG+VT C PY + G P
Sbjct: 142 SCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWP 201
Query: 201 GC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
C E PTPKCV C KN + KH+ +AY + E I EI NGP+EV+F
Sbjct: 202 ACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAF 261
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 316
TVYEDF Y +GVY H G +GGHAVK++GWG D+G YW++AN WN +WG GYF+I
Sbjct: 262 TVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNVAWGEKGYFRI 320
Query: 317 KRGSNECGIEEDVVAGLP 334
RG NECGIE VAG+P
Sbjct: 321 IRGLNECGIEHSAVAGIP 338
>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
Length = 311
Score = 239 bits (609), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 134/290 (46%), Positives = 172/290 (59%), Gaps = 26/290 (8%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCS 112
W + QF N VGQ LLG K +P L +K++D +++P SF+A++ WP C+
Sbjct: 39 WVEEQTDQFDNIKVGQ---LLGFKRSPNRPKL--QIKSYDPLGVQIPTSFNAQTNWPNCT 93
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPI 172
TIS+I +Q CGSCWAFGA E+ +DR CIH N+ LS D++ C +GC+GG
Sbjct: 94 TISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDE--TDNGCEGGDAF 151
Query: 173 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRN 224
SAW + G V+EEC PY + P C PA TP C ++C + L +
Sbjct: 152 SAWNWLRKQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQ 205
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
KH Y +SD E IM EI NGPVE FTV+EDF YKSGVY H TG +GGH VK
Sbjct: 206 DKHKMAKIYSFDSD-EAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVK 264
Query: 285 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 334
L+G+GT +G DY+ NQW SWG +G F IKRG +CGI +DVVAGLP
Sbjct: 265 LVGFGTL-NGVDYYAANNQWTTSWGDNGTFLIKRG--DCGISDDVVAGLP 311
>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
GN=cpr-4 PE=2 SV=1
Length = 335
Score = 238 bits (606), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 186/315 (59%), Gaps = 25/315 (7%)
Query: 39 QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-KSLK 97
Q++I + VN ++ WKA P+ + T+ Q K L V V HD
Sbjct: 25 QEAITEYVNSK-QSLWKA-EIPK--DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINEDT 80
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P +FDAR+ WP C +I+ I DQ CGSCWAF A EA SDRFCI + +N LS D+L
Sbjct: 81 IPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTG-CSHPGC 202
+CC CG GC+GGYPI+AW+Y V G T C PY ++ G + P C
Sbjct: 141 SCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSC 199
Query: 203 -EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
+ Y TP CV KC KN + KH+ +AY + I AEI +GPVE +FTVY
Sbjct: 200 PDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVY 259
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 319
EDF YK+GVY H TG +GGHA++++GWGT D+G YW++AN WN +WG +GYF+I RG
Sbjct: 260 EDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVNWGENGYFRIIRG 318
Query: 320 SNECGIEEDVVAGLP 334
+NECGIE VV G+P
Sbjct: 319 TNECGIEHAVVGGVP 333
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
PE=1 SV=2
Length = 329
Score = 237 bits (604), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 119/244 (48%), Positives = 153/244 (62%), Gaps = 12/244 (4%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
+P +FD+R+ W +C +I I DQ CGSCWAFGA E +SDR CI +S +DLL
Sbjct: 85 VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG+GC+GGYPI A R++ GVVT C PY + C+ C P TP
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTP 202
Query: 210 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C C + + KH+ +SAY + + I AEIY NGPVE +F+VYEDF YKSG
Sbjct: 203 SCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSG 262
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 328
VYKH G +GGHA+K+IGWGT + G YW++AN W +WG G+FKI RG ++CGIE
Sbjct: 263 VYKHTAGKYLGGHAIKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESA 321
Query: 329 VVAG 332
VVAG
Sbjct: 322 VVAG 325
>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
Length = 335
Score = 235 bits (600), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 146/348 (41%), Positives = 200/348 (57%), Gaps = 41/348 (11%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L +CLL+L S + L D ++ VN+ WKA N F N
Sbjct: 5 LATLSCLLVLTSARSSLYFP-----------PLSDELVNFVNKQ-NTTWKAGHN--FYNV 50
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDK---SLKLPKSFDARSAWPQCSTISRILDQGH 122
+ K L G +L G + D + LP+SFDAR WP C TI I DQG
Sbjct: 51 DLSYVKKLCGA------ILGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGS 104
Query: 123 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDGC+GG+P AW ++
Sbjct: 105 CGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTK 164
Query: 181 HGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSK 226
G+V+ C PY C H P C TPKC + C + ++ K
Sbjct: 165 KGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDK 223
Query: 227 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 286
H+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+H++G++MGGHA++++
Sbjct: 224 HFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRIL 283
Query: 287 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 334
GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VAG+P
Sbjct: 284 GWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
GN=cpr-3 PE=2 SV=1
Length = 370
Score = 233 bits (593), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 125/266 (46%), Positives = 160/266 (60%), Gaps = 22/266 (8%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
LP +FDAR WP C+TI I +Q CGSCWAFGA E +SDR CI +SV D+L
Sbjct: 92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG GC GGY I A R++ G VT C PY S C P TP
Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTP 208
Query: 210 KCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHY 265
C C K + ++ KHY SAY++ + +I EIY GPVE S+ VYEDF HY
Sbjct: 209 SCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHY 268
Query: 266 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 325
KSGVY + +G ++GGHAVK+IGWG ++G DYW++AN W S+G G+FKI+RG+NEC I
Sbjct: 269 KSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQI 327
Query: 326 EEDVVAGLPSSKNLVKEITSADMFED 351
E +VVAG + K T ++ +ED
Sbjct: 328 EGNVVAG------IAKLGTHSETYED 347
>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
GN=AC-2 PE=2 SV=1
Length = 342
Score = 232 bits (591), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 128/324 (39%), Positives = 182/324 (56%), Gaps = 36/324 (11%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
L +++ + + EVN +P P F + ++ +K + L L V +
Sbjct: 38 LVAYLRRSQNLFEVNSDP--------TPDFE-------QKIMSIKYKHQKLNLMVK-EDP 81
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D + +P S+D R W C+T I DQ +CGSCWA A+SDR CI +++S
Sbjct: 82 DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C H G
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199
Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C PTP C RKC +++R K Y AY + + I +EI KNGPV S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVAS 259
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 315
F VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W++AN W+ WG GYF+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWGEKGYFR 318
Query: 316 IKRGSNECGIEEDVVAGLPSSKNL 339
I RGSN+CGIE + AG+ +++L
Sbjct: 319 IVRGSNDCGIEGTIAAGIVDTESL 342
>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
GN=AC-1 PE=2 SV=1
Length = 342
Score = 230 bits (586), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 115/264 (43%), Positives = 159/264 (60%), Gaps = 20/264 (7%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D + +P S+D R W C+T I DQ +CGSCWA A+SDR CI +++S
Sbjct: 82 DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C H G
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199
Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C PTP C RKC +++R K Y AY + + I +EI +NGPV S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVAS 259
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 315
F VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W++AN W+ WG GYF+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWGEKGYFR 318
Query: 316 IKRGSNECGIEEDVVAGLPSSKNL 339
I RG+N+CGIE + AG+ +++L
Sbjct: 319 IIRGTNDCGIEGTIAAGIVDTESL 342
>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
GN=CP-1 PE=3 SV=3
Length = 341
Score = 213 bits (541), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 111/251 (44%), Positives = 152/251 (60%), Gaps = 19/251 (7%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P+S+D R W CS++ I DQ +CGSCWA + A+SDR CI + +S D++
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCEPAY-- 206
+CC + CGDGC+GG+PISA+R+ GVVT C PY + C H G E Y
Sbjct: 151 SCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGE 208
Query: 207 -----PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
TP+C R+C+ S Y AY++ + + I +I KNGPV ++TVYED
Sbjct: 209 CVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYED 268
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 321
FAHY+SG+YKH G G HAVK+IGWG + G YWI+AN W+ WG +G+F++ RGSN
Sbjct: 269 FAHYRSGIYKHKAGRKTGLHAVKVIGWG-EEKGTPYWIVANSWHDDWGENGFFRMHRGSN 327
Query: 322 ECGIEEDVVAG 332
+CG EE + AG
Sbjct: 328 DCGFEERMAAG 338
>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
Length = 299
Score = 186 bits (471), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 115/289 (39%), Positives = 151/289 (52%), Gaps = 24/289 (8%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K VP T + + P SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQAPDSFDFREEY 84
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V ++ DR C G++ + S +++C GD
Sbjct: 85 PHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDM 138
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
CDGG+ S WR+ G T+EC PY G A T C KC + L
Sbjct: 139 ACDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHL 189
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
K Y + D IM + GP++ +FTVY DF +Y+SGVY+H G V GGHAV
Sbjct: 190 YKATKAVDYGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVD 247
Query: 285 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 333
++G+GT DDG DYWI+ N W WG DGYF+I R +NECGIEE V+ G
Sbjct: 248 MVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 296
>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
Length = 300
Score = 172 bits (436), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 149/289 (51%), Gaps = 23/289 (7%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K P T +P+SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGAAPRGTFTDKDDVPESFDFREEY 85
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V DR C+ G++ + S +++C GD
Sbjct: 86 PHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSCDH---GDM 139
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
C+GG+ + W++ G T+EC PY + C PT KC +
Sbjct: 140 ACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHL 190
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
+ S Y + D +M + +GP++V+F V+ DF +Y+SGVY+H G + GGHAV+
Sbjct: 191 ATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVE 248
Query: 285 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 333
++G+GT DDG DYWI+ N W WG DGYF++ RG N+C IEE AG
Sbjct: 249 MVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297
>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
SV=1
Length = 476
Score = 166 bits (419), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/337 (34%), Positives = 164/337 (48%), Gaps = 43/337 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
++Q +I+ VN+ GW A QF T+ + FK+ LG P P LLL + T
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLT 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
K+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 KTTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384
Query: 263 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 310
+YK+G+Y+HIT HAVKL GWGT E +WI AN W +SWG
Sbjct: 385 FNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGE 444
Query: 311 DGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 347
+GYF+I RG NE IE+ ++A ++TSAD
Sbjct: 445 NGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474
>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
Length = 303
Score = 165 bits (417), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 104/287 (36%), Positives = 148/287 (51%), Gaps = 27/287 (9%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 108
WKA +F N T +F+ +L ++P G L + + + + +P FD R +
Sbjct: 31 WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 165
PQC + LDQG CGSCWAF A+ DR C G++ +S S L++C L G
Sbjct: 90 PQC--VKPALDQGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 225
CDGG W + G T EC Y D G A P P QL++
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 284
+ +S S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALE 252
Query: 285 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 331
++G+GT+DDG DYWI+ N W WG +GYF+I RG NEC IE+++ A
Sbjct: 253 IVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299
>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
SV=3
Length = 476
Score = 164 bits (414), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 107/321 (33%), Positives = 156/321 (48%), Gaps = 34/321 (10%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVRSELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V EDF
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385
Query: 264 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 311
HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +SWG +
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445
Query: 312 GYFKIKRGSNECGIEEDVVAG 332
GYF+I RG NE IE+ ++A
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466
>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
SV=1
Length = 435
Score = 149 bits (375), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 96/306 (31%), Positives = 154/306 (50%), Gaps = 30/306 (9%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKS 101
+K +N K+ W A R ++ T+ +G + P+ + + H++ +LP S
Sbjct: 149 FVKAINTIQKS-WTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPTS 207
Query: 102 FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCG 159
+D R+ + +S + +Q CGSC+AF + L R I + LS ++++C
Sbjct: 208 WDWRNV-RGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQ 266
Query: 160 FLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 218
+ GC+GG+P + A +Y G+V E C PY G P C+P C R
Sbjct: 267 Y--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR----- 311
Query: 219 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ 272
+ +S++Y + + + + E+ ++GP+ V+F VY+DF HY+ G+Y H
Sbjct: 312 ---YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDP 368
Query: 273 ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 331
+ HAV L+G+GT S G DYWI+ N W WG DGYF+I+RG++EC IE VA
Sbjct: 369 FNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVA 428
Query: 332 GLPSSK 337
P K
Sbjct: 429 ATPIPK 434
>sp|Q06544|CYSP3_OSTOS Cathepsin B-like cysteine proteinase 3 (Fragment) OS=Ostertagia
ostertagi GN=CP-3 PE=3 SV=1
Length = 174
Score = 145 bits (367), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 79/175 (45%), Positives = 104/175 (59%), Gaps = 17/175 (9%)
Query: 174 AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 219
AW+YF GVVT C PY + C G EP Y TPKC + C +
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59
Query: 220 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 278
+ ++ KH+ SAYR+ ++ + I +I KNGPV F VYEDFAHYKSG+YKH G +
Sbjct: 60 LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119
Query: 279 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 333
GGHAVK+IGWG + G YW++AN W+ WG G++++ RG N C IEE V AG+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173
>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
GN=Tinagl1 PE=1 SV=1
Length = 466
Score = 144 bits (364), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 104/324 (32%), Positives = 151/324 (46%), Gaps = 39/324 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 140 LVDPDMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQ 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ A PTP+C+
Sbjct: 257 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQ------NEASPTPRCM 309
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + Q+ N + AYR+ SD ++IM E+ +NGPV+ V+
Sbjct: 310 MHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVH 369
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 307
EDF Y+ G+Y H G H+VK+ GWG T DG YW AN W
Sbjct: 370 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPW 429
Query: 308 WGADGYFKIKRGSNECGIEEDVVA 331
WG G+F+I RG+NEC IE V+
Sbjct: 430 WGERGHFRIVRGTNECDIETFVLG 453
>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
Length = 463
Score = 142 bits (359), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/317 (31%), Positives = 157/317 (49%), Gaps = 47/317 (14%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQF--------KHLLGVKPTPKGLLLGVPVKTH 92
+ +K +N K+ W A ++ T+G + + KPTP + +
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPTP------LTAEIQ 225
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LS 150
K L LP S+D R+ + +S + +Q CGSC++F +V L R I + + LS
Sbjct: 226 QKILHLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILS 284
Query: 151 VNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 209
++++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 285 SQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--------- 330
Query: 210 KCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
C K +R +S+++ + + + + E+ +GP+ V+F VY+DF HY++
Sbjct: 331 -----CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQN 385
Query: 268 GVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGS 320
G+Y H + HAV L+G+GT S G DYWI+ N W SWG DGYF+I+RG+
Sbjct: 386 GIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGT 445
Query: 321 NECGIEEDVVAGLPSSK 337
+EC IE VA P K
Sbjct: 446 DECAIESIAVAATPIPK 462
>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
GN=Tinagl1 PE=2 SV=1
Length = 467
Score = 142 bits (359), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 103/324 (31%), Positives = 152/324 (46%), Gaps = 38/324 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ ++IK +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 140 LVDPAMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQ 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ + A PTP+C+
Sbjct: 257 NLLSCDTHH-QKGCRGGRLDGAWWFLRRRGVVSDNCYPF-----SGREQNDEASPTPRCM 310
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + +Q+ N + YR+ SD ++IM E+ +NGPV+ V+
Sbjct: 311 MHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQALMEVH 370
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 307
EDF Y+ G+Y H G H+VK+ GWG T DG YW AN W
Sbjct: 371 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPW 430
Query: 308 WGADGYFKIKRGSNECGIEEDVVA 331
WG G+F+I RG NEC IE V+
Sbjct: 431 WGERGHFRIVRGINECDIETFVLG 454
>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
GN=TINAGL1 PE=1 SV=1
Length = 467
Score = 142 bits (357), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 102/318 (32%), Positives = 152/318 (47%), Gaps = 27/318 (8%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316
Query: 209 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF Y
Sbjct: 317 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 376
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 313
K G+Y H + G H+VK+ GWG T DG YW AN W +WG G+
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGH 436
Query: 314 FKIKRGSNECGIEEDVVA 331
F+I RG NEC IE V+
Sbjct: 437 FRIVRGVNECDIESFVLG 454
>sp|Q5RB02|CATC_PONAB Dipeptidyl peptidase 1 OS=Pongo abelii GN=CTSC PE=2 SV=1
Length = 463
Score = 139 bits (351), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 96
+ +K +N K+ W A ++ T+G G P PK L ++ K L
Sbjct: 173 NFVKAINAIQKS-WTATTYKEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKVL 229
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP S+D R+ + +S + +Q CGSC++F ++ L R I + + LS ++
Sbjct: 230 HLPTSWDWRNI-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEV 288
Query: 155 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330
Query: 214 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
C K +R +S+++ + + + + E+ +GP+ V+F VY+DF HYK G+Y
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389
Query: 272 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 324
H + HAV L+G+GT S G DYWI+ N W WG DGYF+I+RG++EC
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECA 449
Query: 325 IEEDVVAGLPSSK 337
IE VA P K
Sbjct: 450 IESIAVAATPIPK 462
>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
elegans GN=F26E4.3 PE=1 SV=3
Length = 452
Score = 137 bits (346), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 90/252 (35%), Positives = 125/252 (49%), Gaps = 18/252 (7%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 151
K +LP+ FDAR W I + DQG CGS W+ SDR I +N +LS
Sbjct: 180 KPRELPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSS 237
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
LL+C GC+GGY AW Y GVV + C PY S PG
Sbjct: 238 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 295
Query: 212 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
R+ ++ ++S + ++ Y+++S EDI E+ NGPV+ +F V+EDF Y GVY
Sbjct: 296 NRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 355
Query: 271 KH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRG 319
+H + G H+V+++GWG ++ YW+ AN W WG DGYFK+ RG
Sbjct: 356 QHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRG 415
Query: 320 SNECGIEEDVVA 331
N C IE V+
Sbjct: 416 ENHCEIESFVIG 427
>sp|P53634|CATC_HUMAN Dipeptidyl peptidase 1 OS=Homo sapiens GN=CTSC PE=1 SV=2
Length = 463
Score = 137 bits (346), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 96
+ +K +N K+ W A ++ T+G G P PK L ++ K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP S+D R+ + +S + +Q CGSC++F ++ L R I + + LS ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288
Query: 155 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330
Query: 214 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
C K +R +S+++ + + + + E+ +GP+ V+F VY+DF HYK G+Y
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389
Query: 272 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 324
H + HAV L+G+GT S G DYWI+ N W WG +GYF+I+RG++EC
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 449
Query: 325 IEEDVVAGLPSSK 337
IE VA P K
Sbjct: 450 IESIAVAATPIPK 462
>sp|Q3ZCJ8|CATC_BOVIN Dipeptidyl peptidase 1 OS=Bos taurus GN=CTSC PE=2 SV=1
Length = 463
Score = 135 bits (340), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 154/316 (48%), Gaps = 47/316 (14%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTV--------GQFKHLLGVKPTPKGLLLGVPVKTHD 93
+K +N K+ W AA ++ T+ G + + KP P + +
Sbjct: 174 FVKAINAIQKS-WTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPAP------ITAEIQK 226
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 151
K L LP S+D R+ + ++ + +QG CGSC++F ++ + R I + LS
Sbjct: 227 KILHLPTSWDWRNV-HGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSP 285
Query: 152 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 210
++++C + GC+GG+P + A +Y G+V E+C PY TG P
Sbjct: 286 QEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP---------- 330
Query: 211 CVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C K +R +S+++ + + + + E+ GP+ V+F VY+DF HY+ G
Sbjct: 331 ----CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKG 386
Query: 269 VYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSN 321
VY H + HAV L+G+GT + G DYWI+ N W SWG +GYF+I+RG++
Sbjct: 387 VYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTD 446
Query: 322 ECGIEEDVVAGLPSSK 337
EC IE +A P K
Sbjct: 447 ECAIESIALAATPIPK 462
>sp|P80067|CATC_RAT Dipeptidyl peptidase 1 OS=Rattus norvegicus GN=Ctsc PE=1 SV=3
Length = 462
Score = 132 bits (333), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 157/323 (48%), Gaps = 44/323 (13%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV-------GQFKHLLGVKPTPKGLL 84
+L SH + +K +N K+ W A ++ ++ G +L KP P
Sbjct: 166 RLYSH--NHNFVKAINSVQKS-WTATTYEEYEKLSIRDLIRRSGHSGRILRPKPAP---- 218
Query: 85 LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
+ + + L LP+S+D R+ + +S + +Q CGSC++F ++ L R I
Sbjct: 219 --ITDEIQQQILSLPESWDWRNV-RGINFVSPVRNQESCGSCYSFASLGMLEARIRILTN 275
Query: 145 MNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPG 201
+ + LS ++++C + GCDGG+P + A +Y GVV E C PY +
Sbjct: 276 NSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA----- 328
Query: 202 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
P P C+R + +S++Y + + + + E+ K+GP+ V+F V++D
Sbjct: 329 --PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDD 378
Query: 262 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYF 314
F HY SG+Y H + HAV L+G+G G DYWI+ N W WG GYF
Sbjct: 379 FLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYF 438
Query: 315 KIKRGSNECGIEEDVVAGLPSSK 337
+I+RG++EC IE +A +P K
Sbjct: 439 RIRRGTDECAIESIAMAAIPIPK 461
>sp|P97821|CATC_MOUSE Dipeptidyl peptidase 1 OS=Mus musculus GN=Ctsc PE=2 SV=1
Length = 462
Score = 128 bits (321), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 92/314 (29%), Positives = 152/314 (48%), Gaps = 42/314 (13%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTV-------GQFKHLLGVKPTPKGLLLGVPVKTHD 93
+ +K +N K+ W A ++ ++ G + + KP P + +
Sbjct: 173 NFVKAINTVQKS-WTATAYKEYEKMSLRDLIRRSGHSQRIPRPKPAP------MTDEIQQ 225
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 151
+ L LP+S+D R+ + +S + +Q CGSC++F ++ L R I + + LS
Sbjct: 226 QILNLPESWDWRNV-QGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSP 284
Query: 152 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 210
++++C + GCDGG+P + A +Y GVV E C PY P P
Sbjct: 285 QEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS-------PCKPREN 335
Query: 211 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
C+R + +S +Y + + + + E+ K+GP+ V+F V++DF HY SG+Y
Sbjct: 336 CLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY 387
Query: 271 KH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNEC 323
H + HAV L+G+G G +YWI+ N W +WG GYF+I+RG++EC
Sbjct: 388 HHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDEC 447
Query: 324 GIEEDVVAGLPSSK 337
IE VA +P K
Sbjct: 448 AIESIAVAAIPIPK 461
>sp|Q26563|CATC_SCHMA Cathepsin C OS=Schistosoma mansoni PE=2 SV=1
Length = 454
Score = 118 bits (296), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 98/310 (31%), Positives = 144/310 (46%), Gaps = 40/310 (12%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL- 96
+ S + ++N + K+ W+ P+ S YT+ + ++ G + + KT K L
Sbjct: 154 INPSFVGKINAHQKS-WRGEIYPELSKYTIDELRNRAGGVKSMVTRPSVLNRKTPSKELI 212
Query: 97 ----KLPKSFDARSAWPQCST--ISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLS 148
LP FD S P S ++ I +QG CGSC+A + AL R + +F
Sbjct: 213 SLTGNLPLEFDWTSP-PDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQPI 271
Query: 149 LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 207
LS ++ C + +GC+GG+P + A +Y G+ + PY TG
Sbjct: 272 LSPQTVVDCSPY--SEGCNGGFPFLIAGKYGEDFGLPQKIVIPY---TGED--------- 317
Query: 208 TPKCVRKCVKKNQLWRNSKHYS-ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
T KC V KN + YS I Y ++ + + E+ NGP V F VYEDF YK
Sbjct: 318 TGKCT---VSKNCTRYYTTDYSYIGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQFYK 374
Query: 267 SGVYKHITGDV---------MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 316
G+Y H T + HAV L+G+G GE YW + N W WG GYF+I
Sbjct: 375 EGIYHHTTVQTDHYNFNPFELTNHAVLLVGYGVDKLSGEPYWKVKNSWGVEWGEQGYFRI 434
Query: 317 KRGSNECGIE 326
RG++ECG+E
Sbjct: 435 LRGTDECGVE 444
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 116 bits (291), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/278 (31%), Positives = 125/278 (44%), Gaps = 47/278 (16%)
Query: 61 QFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 119
+FS+ + +F+ LG T L G + + LP++ D W + +S + +
Sbjct: 108 RFSDMSWEEFQATRLGAAQTCSATLAGNHLMR--DAAALPETKD----WREDGIVSPVKN 161
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
Q HCGSCW F AL + G N+SLS L+ C G GC+GG P A+ Y
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221
Query: 180 HHGVV-TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
++G + TEE PY G H E N+ + + I +
Sbjct: 222 YNGGIDTEESYPYKGVNGVCHYKAE--------------------NAAVQVLDSVNITLN 261
Query: 239 PEDIMAEIYKNG-----PVEVSFTVYEDFAHYKSGVYKHITGDVMG------GHAVKLIG 287
ED + KN PV V+F V + F YKSGVY T D G HAV +G
Sbjct: 262 AEDEL----KNAVGLVRPVSVAFQVIDGFRQYKSGVY---TSDHCGTTPDDVNHAVLAVG 314
Query: 288 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 325
+G ++G YW++ N W WG +GYFK++ G N C I
Sbjct: 315 YGV-ENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAI 351
>sp|P22497|CYSP_THEPA Cysteine proteinase OS=Theileria parva GN=TP03_0285 PE=3 SV=2
Length = 440
Score = 116 bits (290), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 132/287 (45%), Gaps = 49/287 (17%)
Query: 61 QFSNYTVGQFKHLLGVKPTPKG-------LLLGVPVKTHDKSLK----------LPKSFD 103
+FS+ T +F L V PK LL + KT+ K+LK L K
Sbjct: 171 RFSDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDEDVDLAKLTG 230
Query: 104 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCG 163
W + S+++ + DQ +CG CWAF V ++ + HF + LSV +LL C F
Sbjct: 231 ENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCDSF--S 288
Query: 164 DGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLW 222
+GC GG SA+ Y +G+V+ + P+ D + CS P
Sbjct: 289 NGCQGGLLESAYEYVRKYGLVSAKDLPFVDKARRCSVP---------------------- 326
Query: 223 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 282
+K S+ +Y + E +M + P V +V + A YKSGV+ G + HA
Sbjct: 327 -KAKKVSVPSYHVFKGKE-VMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECGKSL-NHA 383
Query: 283 VKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKR---GSNECGI 325
V L+G G + + YW++ N W WG +GY +++R G+++CG+
Sbjct: 384 VVLVGEGYDEVTKKRYWVVQNSWGTDWGENGYMRLERTNMGTDKCGV 430
>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
PE=2 SV=1
Length = 358
Score = 115 bits (289), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 94/307 (30%), Positives = 138/307 (44%), Gaps = 40/307 (13%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLL 85
V ++KL + ++++ + N K +K + N QF++ T +F ++ LG L
Sbjct: 73 VEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLN-QFADLTWQEFQRYKLGAAQNCSATLK 131
Query: 86 GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 145
G T +P + D W + +S + +QGHCGSCW F AL + FG
Sbjct: 132 GSHKITE---ATVPDTKD----WREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGK 184
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCE- 203
+SLS L+ C G GC GG P A+ Y ++G + TEE PY G GC+
Sbjct: 185 GISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG----GCKF 240
Query: 204 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
A VR V + +++ R PV V+F V +F
Sbjct: 241 SAKNIGVQVRDSVNITLGAEDELKHAVGLVR----------------PVSVAFEVVHEFR 284
Query: 264 HYKSGVYKHIT-----GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 318
YK GV+ T DV HAV +G+G DD YW++ N W WG +GYFK++
Sbjct: 285 FYKKGVFTSNTCGNTPMDV--NHAVLAVGYGVEDD-VPYWLIKNSWGGEWGDNGYFKMEM 341
Query: 319 GSNECGI 325
G N CG+
Sbjct: 342 GKNMCGV 348
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 114 bits (284), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 135/311 (43%), Gaps = 47/311 (15%)
Query: 20 SQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 79
+Q FAEG VS KL + D + E + NYT+ K L +
Sbjct: 94 NQRFAEGKVS-FKLAVNKYADLLHHEFRQLMNG----------FNYTL--HKQLRAADES 140
Query: 80 PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139
KG+ P + LPKS D W ++ + DQGHCGSCWAF + AL +
Sbjct: 141 FKGVTFISPAH-----VTLPKSVD----WRTKGAVTAVKDQGHCGSCWAFSSTGALEGQH 191
Query: 140 CIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 199
G+ +SLS +L+ C +GC+GG +A+RY +G +
Sbjct: 192 FRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGID-------------- 237
Query: 200 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFT 257
E +YP C K + + ++ I E MAE + GPV V+
Sbjct: 238 --TEKSYPYEAIDDSCHFNKGTVGATDRGFT----DIPQGDEKKMAEAVATVGPVSVAID 291
Query: 258 V-YEDFAHYKSGVYKHITGDVMG-GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 315
+E F Y GVY D H V ++G+GT + GEDYW++ N W +WG G+ K
Sbjct: 292 ASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIK 351
Query: 316 IKRG-SNECGI 325
+ R N+CGI
Sbjct: 352 MLRNKENQCGI 362
>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
Length = 333
Score = 113 bits (283), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 81/235 (34%), Positives = 110/235 (46%), Gaps = 35/235 (14%)
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 158
P S D R + + +S + +QG CGSCW F AL I G LSL+ L+ C
Sbjct: 115 PSSMDWRK---KGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCA 171
Query: 159 GFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPYF--DSTGCSHPGCEPAYPTPKCVRKC 215
GC GG P A+ Y +++ G++ E+ PY DS+ +P A+ V+
Sbjct: 172 QAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYIGKDSSCRFNPQKAVAF-----VKNV 226
Query: 216 VKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYK--- 271
V I + E M E + PV +F V EDF YKSGVY
Sbjct: 227 V-----------------NITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKS 269
Query: 272 -HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 325
H T D + HAV +G+G +G YWI+ N W WG +GYF I+RG N CG+
Sbjct: 270 CHKTPDKV-NHAVLAVGYG-EQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGL 322
>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
Length = 333
Score = 113 bits (283), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 106/233 (45%), Gaps = 31/233 (13%)
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 158
P S D R + + +S + +QG CGSCW F AL I G ++L+ L+ C
Sbjct: 115 PSSMDWRK---KGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCA 171
Query: 159 GFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
GC GG P A+ Y +++ G++ E+ PY G E A K V
Sbjct: 172 QNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQCKFNPEKAVAFVKNV----- 226
Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYK----H 272
I + E M E + PV +F V EDF YKSGVY H
Sbjct: 227 ---------------VNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCH 271
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 325
T D + HAV +G+G +G YWI+ N W +WG +GYF I+RG N CG+
Sbjct: 272 KTPDKV-NHAVLAVGYG-EQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGL 322
>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
Length = 358
Score = 111 bits (277), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 97/304 (31%), Positives = 139/304 (45%), Gaps = 34/304 (11%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLL 85
V ++KL I ++++ + N K +K N QF++ T +F+ LG L
Sbjct: 73 VEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVN-QFADLTWQEFQRTKLGAAQNCSATLK 131
Query: 86 GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 145
G T LP++ D W + +S + DQG CGSCW F AL + FG
Sbjct: 132 GSHKVTE---AALPETKD----WREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGK 184
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCEP 204
+SLS L+ C G GC+GG P A+ Y +G + TE+ PY TG
Sbjct: 185 GISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPY---TGKDE----- 236
Query: 205 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 264
T K + V L NS + ++ A +++ + PV ++F V F
Sbjct: 237 ---TCKFSAENVGVQVL--NSVNITLGA------EDELKHAVGLVRPVSIAFEVIHSFRL 285
Query: 265 YKSGVY--KHITGDVMG-GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 321
YKSGVY H M HAV +G+G +DG YW++ N W WG GYFK++ G N
Sbjct: 286 YKSGVYTDSHCGSTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDKGYFKMEMGKN 344
Query: 322 ECGI 325
CGI
Sbjct: 345 MCGI 348
>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
Length = 360
Score = 110 bits (274), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/244 (32%), Positives = 112/244 (45%), Gaps = 36/244 (14%)
Query: 90 KTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSL 149
+ ++ LP++ D W + +S + +QGHCGSCW F AL + G +SL
Sbjct: 135 RMRAAAVALPETKD----WREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISL 190
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCEPAYPT 208
S L+ C GC+GG P A+ Y ++G + TEE PY G
Sbjct: 191 SEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGI----------- 239
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAHYKS 267
C KN+ N + + I ED + + + PV V+F V F YKS
Sbjct: 240 ------CKFKNE---NVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKS 290
Query: 268 GVYKHITGDVMG------GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 321
GVY T D G HAV +G+G +DG YW++ N W WG +GYFK++ G N
Sbjct: 291 GVY---TSDHCGTTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDEGYFKMEMGKN 346
Query: 322 ECGI 325
CG+
Sbjct: 347 MCGV 350
>sp|Q24940|CATLL_FASHE Cathepsin L-like proteinase OS=Fasciola hepatica GN=Cat-1 PE=1 SV=1
Length = 326
Score = 109 bits (273), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 126/279 (45%), Gaps = 34/279 (12%)
Query: 61 QFSNYTVGQFK--HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 118
QF++ T +FK +L + L GVP + +++++ P D W + ++ +
Sbjct: 71 QFTDMTFEEFKAKYLTEMSRASDILSHGVPYEANNRAV--PDKID----WRESGYVTEVK 124
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
DQG+CGSCWAF + ++ + ++S S L+ C G +GC GG +A++Y
Sbjct: 125 DQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL 184
Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
G+ TE PY G +C QL Y ++S
Sbjct: 185 KQFGLETESSYPYTAVEG-----------------QCRYNKQL---GVAKVTGYYTVHSG 224
Query: 239 PE-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GDVMGGHAVKLIGWGTSDDGED 296
E ++ + P V+ V DF Y+SG+Y+ T + HAV +G+GT G D
Sbjct: 225 SEVELKNLVGARRPAAVAVDVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGT-QGGTD 283
Query: 297 YWILANQWNRSWGADGYFKIKRGS-NECGIEEDVVAGLP 334
YWI+ N W WG GY ++ R N CGI +A LP
Sbjct: 284 YWIVKNSWGTYWGERGYIRMARNRGNMCGIAS--LASLP 320
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 109 bits (273), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 97/341 (28%), Positives = 154/341 (45%), Gaps = 42/341 (12%)
Query: 20 SQTFAEGVVSKLKLDSHILQDSI--IKEVNENPKAGWKAARNPQFSNYTVGQFKHL-LGV 76
S + + G++++ +I +D++ I NEN K F+N T +++ L LG
Sbjct: 14 SNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGA 73
Query: 77 KPTPKGLLLGVPVKTHDKSLKLPKSFDARSA-----WPQCSTISRILDQGHCGSCWAFGA 131
+ P + K + ++K + + W Q ++ I DQG CGSCWAF
Sbjct: 74 RTEPVRRI----TKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFST 129
Query: 132 VEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD-P 190
A+ I G +SLS +L+ C GC+GG A+++ + +G + E D P
Sbjct: 130 AAAVEGINKIVTGELVSLSEQELVDCDKSY-NQGCNGGLMDYAFQFIMKNGGLNTEKDYP 188
Query: 191 YFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR-INSDPEDIMAEIYKN 249
Y + G KC N L +NS+ +I Y + S E +
Sbjct: 189 YHGTNG-------------KC-------NSLLKNSRVVTIDGYEDVPSKDETALKRAVSY 228
Query: 250 GPVEVSFTV-YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 308
PV V+ F HY+SG++ G M HAV +G+G S++G DYWI+ N W W
Sbjct: 229 QPVSVAIDAGGRAFQHYQSGIFTGKCGTNM-DHAVVAVGYG-SENGVDYWIVRNSWGTRW 286
Query: 309 GADGYFKIKRG----SNECGIEEDVVAGLPSSKNLVKEITS 345
G DGY +++R S +CGI + + S N V+ +S
Sbjct: 287 GEDGYIRMERNVASKSGKCGIAIEASYPVKYSPNPVRGTSS 327
>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus GN=VCATH
PE=3 SV=1
Length = 337
Score = 108 bits (270), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 75/249 (30%), Positives = 111/249 (44%), Gaps = 34/249 (13%)
Query: 85 LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
L P HD+ LP++FD W + ++ + DQG CGSCWA AV L + I
Sbjct: 116 LDAPPDVHDE---LPQNFD----WRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHN 168
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD-PYFDSTG-CSHPGC 202
++LS L+ C CDGG +A+ ++ G + EE D PY + G C
Sbjct: 169 YLINLSEQQLIDCDS--ANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTKGVCKIDNK 226
Query: 203 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ A C R I + E++ E+ GP+ ++
Sbjct: 227 KFALSVSSCKR--------------------YIFQNEENLKKELITMGPIAMAIDA-ASI 265
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 322
+ Y G+ H ++ HAV L+G+GT + G YW L N W WG DGYF++KR N
Sbjct: 266 STYSKGII-HFCENLGLNHAVLLVGYGT-EGGVSYWTLKNSWGSDWGEDGYFRVKRNINA 323
Query: 323 CGIEEDVVA 331
CG+ + A
Sbjct: 324 CGLNNQLAA 332
>sp|Q9PYY5|CATV_GVXN Viral cathepsin OS=Xestia c-nigrum granulosis virus GN=VCATH PE=3
SV=1
Length = 346
Score = 107 bits (266), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/237 (32%), Positives = 117/237 (49%), Gaps = 28/237 (11%)
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 148
V + D S K+P SFD W ++++ + Q CGSCWAF AV + + I ++L
Sbjct: 124 VISGDSSGKVPDSFD----WRDRNSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVSLD 179
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 208
LS L+ C +GC+GG + +W + G++ + G S+ E YP
Sbjct: 180 LSEQQLVDCDK--VNNGCNGG--LMSWAF---EGIIR--------AGGISY---EAPYPY 221
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C + + S Y AY + S+ + + +++ GPV V+ V D +YKSG
Sbjct: 222 TGVDGVCKNTTRYVQLSGCY---AYDLRSEKK-LRQVLHEKGPVSVAIDVV-DLTNYKSG 276
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 325
V KH + D H V L+G+G +D + YW L N W WG G+F+IKR N CGI
Sbjct: 277 VAKHCSVDHGLNHGVLLVGYGQENDVK-YWTLKNSWGSDWGEQGFFRIKRDVNSCGI 332
>sp|Q9R1T3|CATZ_RAT Cathepsin Z OS=Rattus norvegicus GN=Ctsz PE=1 SV=2
Length = 306
Score = 106 bits (265), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 113/255 (44%), Gaps = 32/255 (12%)
Query: 98 LPKSFDARSAWPQCSTISRILDQ---GHCGSCWAFGAVEALSDRFCIHFG---MNLSLSV 151
LPK++D R+ + S +Q +CGSCWA G+ AL+DR I + LSV
Sbjct: 64 LPKNWDWRNV-NGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPSTLLSV 122
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
+++ C C+GG + W Y HG+ E C+ Y C+ C
Sbjct: 123 QNVIDCGN---AGSCEGGNDLPVWEYAHKHGIPDETCNNY----QAKDQECDKFNQCGTC 175
Query: 212 V--RKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
++C ++ LWR + S+S E +MAEIY NGP+ E ++Y
Sbjct: 176 TEFKECHTIQNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATERMSNYTG 229
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 327
G+Y + H + + GWG S+DG +YWI+ N W WG G+ +I +
Sbjct: 230 GIYTEYQNQAIINHIISVAGWGVSNDGIEYWIVRNSWGEPWGERGWMRI--------VTS 281
Query: 328 DVVAGLPSSKNLVKE 342
G SS NL E
Sbjct: 282 TYKGGTGSSYNLAIE 296
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.136 0.442
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 147,960,381
Number of Sequences: 539616
Number of extensions: 6749217
Number of successful extensions: 13302
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 211
Number of HSP's successfully gapped in prelim test: 22
Number of HSP's that attempted gapping in prelim test: 12626
Number of HSP's gapped (non-prelim): 293
length of query: 354
length of database: 191,569,459
effective HSP length: 118
effective length of query: 236
effective length of database: 127,894,771
effective search space: 30183165956
effective search space used: 30183165956
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (28.5 bits)