BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy15353
(344 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
GN=cpr-5 PE=2 SV=1
Length = 344
Score = 227 bits (579), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 125/321 (38%), Positives = 169/321 (52%), Gaps = 15/321 (4%)
Query: 24 AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP-GDRKTYDPEYSA 82
A ID +N WTAG + +E + + L+ D KY +P D E S
Sbjct: 31 ALIDYVNSAQKLWTAGHQV---IPKEKITKKLM-DVKYL------VPHKDEDIVATEVSD 80
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+PD FDAR+QWPNC +I ++ D C + FAA A SDR CI S G N LS+E +
Sbjct: 81 AIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDL 140
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC + C G + W + K G VTGG Y + GC+P +I+PC +
Sbjct: 141 LSCCT-GMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVK 199
Query: 203 LPSCENQKVPKLKCHTRCTNP-TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
P+C P KC CT+ Y + QDKH + Y V + I+ EIL +GP
Sbjct: 200 WPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEV 259
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
F +Y+DFY Y +GVY HT+ A L H+ K++GWG +NGTPYWLV N+W WG++G
Sbjct: 260 AFTVYEDFYQYTTGVYVHTAGASLGG--HAVKILGWGVDNGTPYWLVANSWNVAWGEKGY 317
Query: 322 VKILRGKYECAFEYLIAAGKP 342
+I+RG EC E+ AG P
Sbjct: 318 FRIIRGLNECGIEHSAVAGIP 338
>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
GN=cpr-4 PE=2 SV=1
Length = 335
Score = 221 bits (563), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 128/343 (37%), Positives = 180/343 (52%), Gaps = 16/343 (4%)
Query: 4 ILVFLLGCT--LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
IL L+ T LV + K +A + +N + + W A P +++ E +++ L+
Sbjct: 5 ILAALVAVTAGLVIPLVPKTQEAITEYVNSKQSLWKA--EIPKDITIEQVKKRLMRT--- 59
Query: 62 FDQSDRPLPGDRKTYDPEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
+ P D + + + T+P FDAR QWPNC +I ++ D C + FAA A
Sbjct: 60 --EFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEA 117
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
SDR CI S G N LS E V SCC C Y C G W +L K G TGG Y
Sbjct: 118 ASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCEGGYPINAWKYLVKSGFCTGGSY 173
Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
+ GC+P +++PC T PSC + C +CTN Y + DKH + Y
Sbjct: 174 EAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAY 233
Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
V I+ EI+AHGP A F +Y+DFY YK+GVY HT+ +L H+ +++GWGT+
Sbjct: 234 AVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGG--HAIRILGWGTD 291
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
NGTPYWLV N+W +WG+ G +I+RG EC E+ + G PK
Sbjct: 292 NGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334
>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
Length = 335
Score = 221 bits (562), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 130/344 (37%), Positives = 176/344 (51%), Gaps = 19/344 (5%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L L+ T R L+ SD ++ IN++ TWTAG NF N+ Y+++
Sbjct: 4 LLATLSCLVLLTSARESLHFQPLSDELVNFINKQNTTWTAGHNF-YNVDLSYVKKLC--- 59
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
+ P R + + +P FDAREQWPNC TI + D G+C + F AV
Sbjct: 60 GTFLGGPKLP---QRAAFAAD--MILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114
Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
A SDR CI+S G+ N +S E + +CC + C+ G WNF K+G V+GG
Sbjct: 115 EAISDRICIRSNGRVNVEVSAEDMLTCCGD---ECGDGCNGGFPSGAWNFWTKKGLVSGG 171
Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
Y GC+P +I PC HH + P PK C C P Y + +DKH
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYTPSYKEDKHFGCS 228
Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
+Y + NE I EI +GP F +Y DF YKSGVY+H + + H+ +++GWG
Sbjct: 229 SYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGG--HAIRILGWG 286
Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 287 VENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 330
>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
Length = 339
Score = 220 bits (561), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 126/325 (38%), Positives = 170/325 (52%), Gaps = 17/325 (5%)
Query: 19 YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
+ SD I+ IN++ TW AGRNF N+ YL++ + P +R +
Sbjct: 24 HPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLC---GTVLGGPNLP---ERVGFSE 76
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P+ FDAREQW NC TI + D G+C + F AV A SDR CI + G+ N +S
Sbjct: 77 DIN--LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVS 134
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
E + +CC I D C+ G WNF ++G V+GG Y GC P TI PC HH
Sbjct: 135 AEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHV 191
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P PK C+ C Y + +DKH +Y V D+E I EI +GP
Sbjct: 192 NGSRPPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGP 248
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
F ++ DF YKSGVYKH + + H+ +++GWG ENG PYWLV N+W WGD
Sbjct: 249 VEGAFTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNVDWGD 306
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
G KILRG+ C E I AG P+
Sbjct: 307 NGFFKILRGENHCGIESEIVAGIPR 331
>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
Length = 339
Score = 219 bits (558), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 125/338 (36%), Positives = 173/338 (51%), Gaps = 17/338 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ LG R + SD ++ +N++ TW AG NF N+ YL++ +
Sbjct: 11 LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P+ FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
CI + + +S E + +CC I D C+ G WNF ++G V+GG Y G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178
Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
C+P +I PC HH + P PK C C P Y + QDKH +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
Length = 339
Score = 215 bits (547), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)
Query: 6 VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
+ +L R + SD ++ +N+ TW AG NF N+ YL++ +
Sbjct: 11 LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
P P R + + +P FDAREQWP C TI + D G+C + F AV A SDR
Sbjct: 66 --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121
Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
CI + + +S E + +CC +C C+ G WNF ++G V+GG Y
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177
Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
GC+P +I PC HH + P PK C C P Y + QDKH +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
+E I EI +GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
YWLV N+W WGD G KILRG+ C E + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
Length = 339
Score = 215 bits (547), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 122/330 (36%), Positives = 169/330 (51%), Gaps = 19/330 (5%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
R + SD ++ +N+ TW AG NF N+ YL++ + P P R
Sbjct: 20 RPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDVSYLKKLC---GTFLGG---PKPPQRV 72
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P+ FDAREQWP C TI + D G+C + F AV A SDR CI + +
Sbjct: 73 MFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVS 130
Query: 135 RPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
+S E + +CC +C C+ G WNF ++G V+GG Y GC+P +I P
Sbjct: 131 VEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPP 186
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C HH + P PK C C P Y + QDKH +Y V ++E I EI
Sbjct: 187 CEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSERDIMAEI 243
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
+GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W
Sbjct: 244 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWN 301
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E + AG P+
Sbjct: 302 TDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
Length = 335
Score = 215 bits (547), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 132/348 (37%), Positives = 175/348 (50%), Gaps = 27/348 (7%)
Query: 1 MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
++ L LL T R LY SD ++ +N++ TW AG NF N+ Y+++ A
Sbjct: 4 LLATLSCLLVLTSARSSLYFPPLSDELVNFVNKQNTTWKAGHNF-YNVDLSYVKKLCGAI 62
Query: 59 AKYFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
L G + ++A V P+ FDAREQWPNC TI + D G+C + F
Sbjct: 63 ----------LGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFG 112
Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT--WNFLHKRGS 174
AV A SDR CI S G+ N +S E + + F + WNF K+G
Sbjct: 113 AVEAISDRICIHSNGRVNVEVSAEDM-----LTCCGGECGDGCNGGFPSGAWNFWTKKGL 167
Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
V+GG Y GC+P +I PC HH + P PK C C P Y + +DKH
Sbjct: 168 VSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKTC-EPGYSPSYKEDKH 224
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+Y V +NE I EI +GP F++Y DF YKSGVY+H S + H+ ++
Sbjct: 225 FGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGG--HAIRI 282
Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
+GWG ENGTPYWLV N+W WGD G KILRG+ C E I AG P
Sbjct: 283 LGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
Length = 339
Score = 214 bits (546), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 126/329 (38%), Positives = 171/329 (51%), Gaps = 23/329 (6%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
+ SD I+ IN++ TW AGRNF N+ YL++ ++ K LPG R
Sbjct: 23 FHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-RV 72
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ + +P+ FDAREQW NC TIG + D G+C + F AV A SDR CI + G+ N
Sbjct: 73 AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CC I D C+ G W+F K+G V+GG Y GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPC 187
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + P P +C+ C Y + +DKH +Y V ++ I EI
Sbjct: 188 EHHVNGSRPPCTGEGDTP--RCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F ++ DF YKSGVYKH + + H+ +++GWG ENG PYWL N+W
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILGWGVENGVPYWLAANSWNL 302
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
SV=1
Length = 340
Score = 207 bits (528), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 120/348 (34%), Positives = 180/348 (51%), Gaps = 24/348 (6%)
Query: 1 MIHILVFLLGCTLVRGELYK-FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIAD 58
+ ++ FL V+ E ++ SD I IN N W A E+ R + D
Sbjct: 8 IASLITFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRA---------EKSNRFHSLDD 58
Query: 59 AKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
A+ + R P R+ P +++ +P FD+R++WP C +I + D C +
Sbjct: 59 ARIQMGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCW 118
Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
F AV A SDR CI+S G+QN LS + +CC+ C C G + W++ K G
Sbjct: 119 SFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCESC----GLGCEGGILGPAWDYWVKEG 174
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
VT + TGC+P C HH + P C ++ +C C Y + QDK
Sbjct: 175 IVTASSKENHTGCEPYPFPKCEHH-TKGKYPPCGSKIYNTPRCKQTCQR-KYKTPYTQDK 232
Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
HR +Y V ++E AI+KEI+ +GP A+F +Y+DF +YKSG+YKH + L H+ +
Sbjct: 233 HRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGG--HAIR 290
Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
+IGWG EN TPYWL+ N+W WG+ G +I+RG+ EC+ E + AG+
Sbjct: 291 IIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVIAGR 338
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
GN=cpr-6 PE=1 SV=1
Length = 379
Score = 206 bits (524), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 115/323 (35%), Positives = 160/323 (49%), Gaps = 10/323 (3%)
Query: 23 DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
D ID +N N WTA R F + E ++ + + S + KT D +
Sbjct: 44 DDLIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 103
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P+ FD+R+ WP C +I + D +C + F AV A SDR CI S G+ LS +
Sbjct: 104 D--IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSAD 161
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCCK C + C+ G W + K G VTG +Y GC+P PC HH
Sbjct: 162 DLLSCCKSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKK 217
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
C + P KC +C + + + +DK Y V D+ +AI+KE++ HGP
Sbjct: 218 THFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLE 277
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y+DF +Y GVY HT KL H+ KLIGWG ++G PYW V N+W WG+ G
Sbjct: 278 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIDDGIPYWTVANSWNTDWGEDG 335
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
+ILRG EC E + G PK
Sbjct: 336 FFRILRGVDECGIESGVVGGIPK 358
>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
Length = 340
Score = 200 bits (508), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 114/323 (35%), Positives = 161/323 (49%), Gaps = 16/323 (4%)
Query: 21 FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
S ++ IN+ T AG NF N Y+++ + P + D
Sbjct: 26 LSSDLVNHINKLNTTGRAGHNF-HNTDMSYVKKLC---GTFLGGPKAP-----ERVDFAE 76
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+PD FD R+QWPNC TI + D G+C + F AV A SDR C+ + + + +S E
Sbjct: 77 DMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAE 136
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC ++ C+ G W + +RG V+GG Y GC+ TI PC HH +
Sbjct: 137 DLLSCCG---FECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHVNG 193
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
+ P C + +C C P Y + +DKH +Y V +E I EI +GP
Sbjct: 194 -SRPPCTGEGGETPRCSRHC-EPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVE 251
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
F +Y+DF YKSGVY+H S ++ H+ +++GWG ENGTPYWL N+W WG G
Sbjct: 252 GAFIVYEDFLMYKSGVYQHVSGEQVGG--HAIRILGWGVENGTPYWLAANSWNTDWGITG 309
Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
KILRG+ C E I AG P+
Sbjct: 310 FFKILRGEDHCGIESEIVAGVPR 332
>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
GN=CP-1 PE=3 SV=3
Length = 341
Score = 197 bits (502), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 107/291 (36%), Positives = 155/291 (53%), Gaps = 10/291 (3%)
Query: 50 YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
Y +Q L+ D KY DQ++ P E + +P+ +D R QW NC ++ H+PD C
Sbjct: 58 YFKQRLM-DLKYIDQNNIPDEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIPDQANC 116
Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
+ ++ A SDR CI SKG + +S + V SCC C C G + F
Sbjct: 117 GSCWAVSSAAAMSDRICIASKGAKQVLISAQDVVSCCTWC----GDGCEGGWPISAFRFH 172
Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF 229
G VTGGDY + C+P I PC HHG+ C +C RC Y + +
Sbjct: 173 ADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGECVGM-ADTPRCKRRCL-LGYPKSY 230
Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
D++ Y + ++ AI+K+I+ +GP AT+ +Y+DF HY+SG+YKH + K L
Sbjct: 231 PSDRYYKK-AYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTG--L 287
Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
H+ K+IGWG E GTPYW+V N+W WG+ G ++ RG +C FE +AAG
Sbjct: 288 HAVKVIGWGEEKGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 338
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
GN=CATB PE=2 SV=1
Length = 342
Score = 196 bits (497), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 98/265 (36%), Positives = 149/265 (56%), Gaps = 8/265 (3%)
Query: 79 EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
+ + +P +FD+R++WP+C +I + D C + F AV A +DR CI+S G Q+ LS
Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELS 144
Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
+ SCCK C C G W++ KRG VTGG + TGCQP C HH
Sbjct: 145 ALDLISCCKDC----GDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH- 199
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
+ P+C + +C C Y + QDKH +Y V +NE I+++I+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGP 258
Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
A F +Y+DF +YKSG+Y+H + + + H+ ++IGWG E TPYWL+ N+W WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKRTPYWLIANSWNEDWGE 316
Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
+G +++RG+ EC+ E + AG K
Sbjct: 317 KGLFRMVRGRDECSIESDVVAGLIK 341
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
PE=1 SV=2
Length = 329
Score = 195 bits (495), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 117/321 (36%), Positives = 163/321 (50%), Gaps = 23/321 (7%)
Query: 23 DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
A +D +N + + + ++EE ++ F + D KY + R T A
Sbjct: 31 QALVDYVNSAQSLF---KTEHVEITEEEMK-FKLMDGKYAAAHSDEI---RATEQEVVLA 83
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+VP FD+R QW C +I + D C + F A SDR CI++KG Q +S + +
Sbjct: 84 SVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDL 143
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SCC C G + + +G VTGGDY GC+P I+PC T
Sbjct: 144 LSCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------T 192
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
+C K P C C + Y + +DKH Y V N +I+ EI A+GP A
Sbjct: 193 SGNCPESKTPS--CSMSCQS-GYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAA 249
Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
F++Y+DFY YKSGVYKHT+ L H+ K+IGWGTE+G+PYWLV N+WG +WG+ G
Sbjct: 250 FSVYEDFYKYKSGVYKHTAGKYLGG--HAIKIIGWGTESGSPYWLVANSWGVNWGESGFF 307
Query: 323 KILRGKYECAFEYLIAAGKPK 343
KI RG +C E + AGK K
Sbjct: 308 KIYRGDDQCGIESAVVAGKAK 328
>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
GN=cpr-3 PE=2 SV=1
Length = 370
Score = 194 bits (492), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 119/352 (33%), Positives = 171/352 (48%), Gaps = 33/352 (9%)
Query: 4 ILVFLLGCT-LVRGELYKFS------DAYIDQINREANTWTAGRNFPANLSEEYLRQFLI 56
+ +FL GC+ V E+ + +D +N +W A N + E+ +F +
Sbjct: 7 LALFLAGCSAFVLDEIRGINIGQSPQKVLVDHVNTVQTSWVAEHNEIS----EFEMKFKV 62
Query: 57 ADAKYFD--QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
D K+ + + D + + +PD FDARE+WP+C TI + + C +
Sbjct: 63 MDVKFAEPLEKDSDVASELFVRGEIVPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWA 122
Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRG 173
F A SDR CI+S G Q +S E + SCC C Y C G F G
Sbjct: 123 FGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGY----GCKGGYSIEALRFWASSG 178
Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
+VTGGDYG GC P + +PC+ + T PSC+ T C + + +DK
Sbjct: 179 AVTGGDYGGH-GCMPYSFAPCTKNCPESTTPSCK----------TTCQSSYKTEEYKKDK 227
Query: 234 HRTTLTYWVDDNEDA--IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
H Y V + I+ EI +GP A++ +Y+DFYHYKSGVY +TS + H+
Sbjct: 228 HYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGG--HA 285
Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
K+IGWG ENG YWL+ N+WG +G++G KI RG EC E + AG K
Sbjct: 286 VKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAK 337
>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
GN=AC-1 PE=2 SV=1
Length = 342
Score = 178 bits (452), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 145/285 (50%), Gaps = 12/285 (4%)
Query: 56 IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
I D KY Q + + DP+ +P +D R+ W NC T ++ D C +
Sbjct: 63 IMDIKYKHQKLNLMVKE----DPDPEVDIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAV 117
Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
+ A SDR CI SK ++ +S + +CC R C G W + G V
Sbjct: 118 STAAAISDRICIASKAEKQVNISATDIMTCC---RPQCGDGCEGGWPIEAWKYFIYDGVV 174
Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
+GG+Y + C+P I PC HHG+ C P C +C P + + DK
Sbjct: 175 SGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGT-APTPPCKRKC-RPGVRKMYRIDKRY 232
Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
Y V + AI+ EIL +GP A+FA+Y+DF HYKSG+YKHT+ +L Y H+ K+I
Sbjct: 233 GKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKMI 290
Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
GWG EN T +WL+ N+W WG++G +I+RG +C E IAAG
Sbjct: 291 GWGNENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAG 335
>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
GN=AC-2 PE=2 SV=1
Length = 342
Score = 178 bits (451), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 97/264 (36%), Positives = 138/264 (52%), Gaps = 8/264 (3%)
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
DP+ +P +D R+ W NC T ++ D C + + A SDR CI SK ++
Sbjct: 80 DPDPEVDIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAVSTAAAISDRICIASKAEKQVN 138
Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
+S + +CC R C G W + G V+GG+Y + C+P I PC H
Sbjct: 139 ISATDIMTCC---RPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGH 195
Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
HG+ C P C +C P + + DK Y V + AI+ EIL +
Sbjct: 196 HGNDTYYGECRGT-APTPPCKRKC-RPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKN 253
Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
GP A+FA+Y+DF HYKSG+YKHT+ +L Y H+ K+IGWG EN T +WL+ N+W W
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKMIGWGNENNTDFWLIANSWHNDW 311
Query: 317 GDRGTVKILRGKYECAFEYLIAAG 340
G++G +I+RG +C E IAAG
Sbjct: 312 GEKGYFRIVRGSNDCGIEGTIAAG 335
>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
Length = 311
Score = 139 bits (351), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 93/269 (34%), Positives = 129/269 (47%), Gaps = 27/269 (10%)
Query: 74 KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
K+YDP +P F+A+ WPNC TI + + C + F A + +DR CI + +
Sbjct: 70 KSYDP-LGVQIPTSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNN--E 126
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
N LS + +C + + C G F WN+L K+G+V+ C P TI
Sbjct: 127 NVQLSFMDMVTC-----DETDNGCEGGDAFSAWNWLRKQGAVS-------EECLPYTIPT 174
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
C P C N V C C + + + QDKH+ Y D +E AI +EI
Sbjct: 175 C-----PPAQQPCLN-FVNTPSCTKECQSNS-SLIYSQDKHKMAKIYSFDSDE-AIMQEI 226
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
+ +GP A F +++DF YKSGVY HT+ L H KL+G+GT NG Y+ N W
Sbjct: 227 VTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGG--HCVKLVGFGTLNGVDYYAANNQWT 284
Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
WGD GT I RG +C + AG P
Sbjct: 285 TSWGDNGTFLIKRG--DCGISDDVVAGLP 311
>sp|Q06544|CYSP3_OSTOS Cathepsin B-like cysteine proteinase 3 (Fragment) OS=Ostertagia
ostertagi GN=CP-3 PE=3 SV=1
Length = 174
Score = 138 bits (347), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 69/177 (38%), Positives = 99/177 (55%), Gaps = 6/177 (3%)
Query: 165 TWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTRCTNP 223
W + G VTGG+Y + C+P PC HG P C + K PK C C
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDTAKTPK--CQKTCQR- 57
Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
Y + + +DKH Y + +N AI+++I+ +GP A F +Y+DF HYKSG+YKHT+
Sbjct: 58 GYLKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGR 117
Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
H+ K+IGWG E GTPYWL+ N+W WG++G +++RG C E ++ AG
Sbjct: 118 MTGG--HAVKIIGWGKEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAG 172
>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
SV=3
Length = 476
Score = 125 bits (314), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 152/352 (43%), Gaps = 64/352 (18%)
Query: 13 LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
LVR EL I+Q+N+ WTA N S+ + + D F P
Sbjct: 155 LVRSEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200
Query: 69 -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
L + T + +P+ F A +WP H P D CAA F+ +DR
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257
Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
I+SKG+ LS + + SCC R+ C+ GS+ R W +L KRG V+ Y
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313
Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
GC ++ S G C N V K +C+ P
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
Y V NE I KEI+ +GP A + +DF+HYK+G+Y+H ++N + E Y H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414
Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
L GWGT G +W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
Length = 303
Score = 120 bits (301), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 148/327 (45%), Gaps = 53/327 (16%)
Query: 22 SDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
S A + +I W AG + F N++E+ R LI + +S LP T E
Sbjct: 17 SRAELRRIQALNPPWKAGMPKRF-ENVTEDEFRSMLIRPDRLRARSGS-LPPISITEVQE 74
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
+P +FD R+++P C + D G+C + F+A+G F DRRC ++ S
Sbjct: 75 LVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQ 132
Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG-----DYGDRTGCQPSTISPC 194
+++ SC +N C G TW+FL G+ T DYG
Sbjct: 133 QHLISCSL-----ENFGCDGGDFQPTWSFLTFTGATTAECVKYVDYG------------- 174
Query: 195 SHHGSAPTLPSCENQKVPKL-KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
H ++P C++ +L K H YG+ V + AI +
Sbjct: 175 -HTVASPCPAVCDDGSPIQLYKAHG------YGQ--------------VSKSVPAIMGML 213
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
+A GP +Y D +Y+SGVYKHT + H+ +++G+GT ++GT YW++ N+W
Sbjct: 214 VAGGPLQTMIVVYADLSYYESGVYKHTYGT-INLGFHALEIVGYGTTDDGTDYWIIKNSW 272
Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA 339
GP WG+ G +I+RG EC E I A
Sbjct: 273 GPDWGENGYFRIVRGVNECRIEDEIYA 299
>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
GN=Tinagl1 PE=1 SV=1
Length = 466
Score = 119 bits (299), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 121/269 (44%), Gaps = 32/269 (11%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 257
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--GDRTGCQPSTISPCSHHGS 199
+ SC + C G + W FL +RG V+ Y R + S C H
Sbjct: 258 LLSC----DTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASPTPRCMMHSR 313
Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
A K + +RC N G+ D ++ T Y + +E I KE++ +GP
Sbjct: 314 A--------MGRGKRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPV 362
Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLV 308
A +++DF+ Y+ G+Y HT S + E Y HS K+ GWG E YW
Sbjct: 363 QALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTA 422
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 423 ANSWGPWWGERGHFRIVRGTNECDIETFV 451
>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
GN=Tinagl1 PE=2 SV=1
Length = 467
Score = 119 bits (299), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/270 (31%), Positives = 119/270 (44%), Gaps = 33/270 (12%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 257
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHG 198
+ SC K C G + W FL +RG V+ Y G + S C H
Sbjct: 258 LLSC----DTHHQKGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNDEASPTPRCMMHS 313
Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
A K + +RC N D ++ T Y + +E I KE++ +GP
Sbjct: 314 RA--------MGRGKRQATSRCPNSQVDS---NDIYQVTPVYRLASDEKEIMKELMENGP 362
Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWL 307
A +++DF+ Y+ G+Y HT S + E Y HS K+ GWG E YW
Sbjct: 363 VQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 422
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLI 337
N+WGP WG+RG +I+RG EC E +
Sbjct: 423 AANSWGPWWGERGHFRIVRGINECDIETFV 452
>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
Length = 299
Score = 119 bits (299), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 123/265 (46%), Gaps = 37/265 (13%)
Query: 81 SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+ PD FD RE++P+C I V D G C + F++V + DRRC ++ S +
Sbjct: 71 ATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVKYSPQ 128
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
YV SC + + +C G + W FL K G+ T C P G+
Sbjct: 129 YVVSCDR-----GDMACDGGWLPSVWRFLTKTGTTT-------DECVPYQSGSTGARGTC 176
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
PT + +P L T+ + YG + AI K + GP
Sbjct: 177 PT-KCADGSDLPHLYKATKAVD--YGL-----------------DAPAIMKALATGGPLQ 216
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDR 319
F +Y DF +Y+SGVY+HT ++E H+ ++G+GT++ G YW++ N+WGP WG+
Sbjct: 217 TAFTVYSDFMYYESGVYQHT-YGRVEGG-HAVDMVGYGTDDDGVDYWIIKNSWGPDWGED 274
Query: 320 GTVKILRGKYECAFEYLIAAGKPKN 344
G +I+R EC E + G +N
Sbjct: 275 GYFRIIRMTNECGIEEQVIGGFFEN 299
>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
Length = 300
Score = 118 bits (296), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 123/260 (47%), Gaps = 41/260 (15%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
VP+ FD RE++P+C I V D G C + F++V F DRRC+ ++ S +YV
Sbjct: 75 VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
SC + +C+ G + W FL K G+ T C P + G+ PT
Sbjct: 133 SCDH-----GDMACNGGWLPNVWKFLTKTGTTT-------DECVPYKSGSTTLRGTCPTK 180
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED--AIKKEILAHGPTTA 261
+ + KV H T T + D D A+ K + GP
Sbjct: 181 CADGSSKV----------------------HLATATSYKDYGLDIPAMMKALSTSGPLQV 218
Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRG 320
F ++ DF +Y+SGVY+HT H+ +++G+GT++ G YW++ N+WGP WG+ G
Sbjct: 219 AFLVHSDFMYYESGVYQHTYGYMEGG--HAVEMVGYGTDDDGVDYWIIKNSWGPDWGEDG 276
Query: 321 TVKILRGKYECAFEYLIAAG 340
+++RG +C+ E AG
Sbjct: 277 YFRMIRGINDCSIEEQAYAG 296
>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
GN=TINAGL1 PE=1 SV=1
Length = 467
Score = 118 bits (295), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)
Query: 83 TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
+P F+A E+WPN + H P D G CA F+ SDR I S G LS +
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258
Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
+ SC + C G + W FL +RG V+ G D G P PC
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 310
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
H A K + C N D ++ T Y + N+ I KE++
Sbjct: 311 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 359
Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
+GP A +++DF+ YK G+Y HT S + E Y HS K+ GWG E
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 419
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
YW N+WGP WG+RG +I+RG EC E +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452
>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
elegans GN=F26E4.3 PE=1 SV=3
Length = 452
Score = 117 bits (292), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 129/266 (48%), Gaps = 36/266 (13%)
Query: 84 VPDRFDAREQWPNCGTIGH-VPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ FDAR++W G + H V D G C + + SDR I S+G+ N LS++ +
Sbjct: 184 LPEHFDARDKW---GPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQL 240
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
SC K C G + R W ++ K G V GD+ C P +S S
Sbjct: 241 LSC----NQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYP-YVSGQSREPGHCL 288
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
+P + L+C + + T + T Y V E+ I+ E++ +GP AT
Sbjct: 289 IPKRDYTNRQGLRCPSGSQDST--------AFKMTPPYKVSSREEDIQTELMTNGPVQAT 340
Query: 263 FALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINT 311
F +++DF+ Y GVY+H+ +++ E Y HS +++GWG ++ T YWL N+
Sbjct: 341 FVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGY-HSVRVLGWGVDHSTGKPIKYWLCANS 399
Query: 312 WGPHWGDRGTVKILRGKYECAFEYLI 337
WG WG+ G K+LRG+ C E +
Sbjct: 400 WGTQWGEDGYFKVLRGENHCEIESFV 425
>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
SV=1
Length = 476
Score = 117 bits (292), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/276 (31%), Positives = 126/276 (45%), Gaps = 45/276 (16%)
Query: 84 VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
+P+ F A +WP H P D CAA F+ +DR I+S+G+ LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
SCC R+ C+ GSV R W +L KRG V+ Y GC ++ S
Sbjct: 274 ISCCAKKRH----GCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRS--D 327
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
G C N + K +C+ P Y V NE I +EI+
Sbjct: 328 GRGKRHATTPCPN-SIEKSNRIYQCSPP----------------YRVSSNETEIMREIMQ 370
Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
+GP A +++DF++YK+G+Y+H ++N E Y H+ KL GWGT G
Sbjct: 371 NGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEK 430
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
+W+ N+WG WG+ G +ILRG E E LI A
Sbjct: 431 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
Length = 463
Score = 96.3 bits (238), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/341 (25%), Positives = 133/341 (39%), Gaps = 64/341 (18%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
LYK+ ++ IN +WTA L + + + RP P
Sbjct: 167 LYKYDHNFVKAINAIQKSWTATTYM--EYETLTLGDMIKRSGGHSRKIPRPKPTPLTAEI 224
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ +P +D W N I V + +C + + FA+VG R I + Q
Sbjct: 225 QQKILHLPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQT 280
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG-----CQP- 188
LS++ V SC + + C G + +T G Y G C P
Sbjct: 281 PILSSQEVVSCSQYA-----QGCEGGFPY-----------LTAGKYAQDFGLVEEACFPY 324
Query: 189 -STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED 247
T SPC K C ++ ++ ++ NE
Sbjct: 325 TGTDSPC--------------------KMKEDCFR------YYSSEYHYVGGFYGGCNEA 358
Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTEN-- 301
+K E++ HGP F +YDDF HY++G+Y HT E H+ L+G+GT++
Sbjct: 359 LMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSAS 418
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G YW+V N+WG WG+ G +I RG ECA E + A P
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATP 459
>sp|Q5RB02|CATC_PONAB Dipeptidyl peptidase 1 OS=Pongo abelii GN=CTSC PE=2 SV=1
Length = 463
Score = 94.7 bits (234), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/341 (26%), Positives = 133/341 (39%), Gaps = 64/341 (18%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEY----LRQFLIADAKYFDQSDRPLPGDR 73
LYK+ ++ IN +WTA +EY L + + + RP P
Sbjct: 167 LYKYDHNFVKAINAIQKSWTA------TTYKEYETLTLGDMIRRSGGHSRKIPRPKPAPL 220
Query: 74 KTYDPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSK 130
+ +P +D W N I V + +C + + FA++G R I +
Sbjct: 221 TAEIQQKVLHLPTSWD----WRNIHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTS 276
Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG-DRTGCQP- 188
Q LS + V SC + + C G F + D+G C P
Sbjct: 277 NSQTPILSPQEVVSCSQYA-----QGCEGG-------FPYLIAGKYAQDFGLVEEACFPY 324
Query: 189 -STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED 247
T SPC K C ++ ++ ++ NE
Sbjct: 325 TGTDSPC--------------------KMKEDCFR------YYSSEYHYVGGFYGGCNEA 358
Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--N 301
+K E++ HGP F +YDDF HYK G+Y HT E H+ L+G+GT+ +
Sbjct: 359 LMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSAS 418
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
G YW+V N+WG WG+ G +I RG ECA E + A P
Sbjct: 419 GMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATP 459
>sp|P53634|CATC_HUMAN Dipeptidyl peptidase 1 OS=Homo sapiens GN=CTSC PE=1 SV=2
Length = 463
Score = 94.4 bits (233), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 130/337 (38%), Gaps = 56/337 (16%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
LYK+ ++ IN +WTA L + + + RP P
Sbjct: 167 LYKYDHNFVKAINAIQKSWTATTYM--EYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEI 224
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ +P +D W N I V + +C + + FA++G R I + Q
Sbjct: 225 QQKILHLPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQT 280
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG-DRTGCQP--STI 191
LS + V SC + + C G F + D+G C P T
Sbjct: 281 PILSPQEVVSCSQYA-----QGCEGG-------FPYLIAGKYAQDFGLVEEACFPYTGTD 328
Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKK 251
SPC K C ++ ++ ++ NE +K
Sbjct: 329 SPC--------------------KMKEDCFR------YYSSEYHYVGGFYGGCNEALMKL 362
Query: 252 EILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--NGTPY 305
E++ HGP F +YDDF HYK G+Y HT E H+ L+G+GT+ +G Y
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDY 422
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
W+V N+WG WG+ G +I RG ECA E + A P
Sbjct: 423 WIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATP 459
>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
SV=1
Length = 435
Score = 93.6 bits (231), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/334 (26%), Positives = 132/334 (39%), Gaps = 53/334 (15%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
LYK++ ++ IN +WTA R LR + + RP P
Sbjct: 142 LYKYNYEFVKAINTIQKSWTATRYI--EYETLTLRDMMTRVGG--RKIPRPKPTPLTAEI 197
Query: 78 PEYSATVPDRFDAREQWPNC-GT--IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
E + +P +D W N GT + V + +C + + FA+ R I + Q
Sbjct: 198 HEEISRLPTSWD----WRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQT 253
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
LS + + SC + + C G + + G Y G P
Sbjct: 254 PILSPQEIVSCSQYA-----QGCEGGFPY-----------LIAGKYAQDFGLVEEACFPY 297
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
+ S P P+ C ++ Y G F NE +K E++
Sbjct: 298 AGSDS-PCKPN---------DCFRYYSSEYYYVGGFYGAC----------NEALMKLELV 337
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--NGTPYWLV 308
HGP F +YDDF+HY+ G+Y HT E H+ L+G+GT+ +G YW+V
Sbjct: 338 RHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIV 397
Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
N+WG WG+ G +I RG ECA E + A P
Sbjct: 398 KNSWGSRWGEDGYFRIRRGTDECAIESIAVAATP 431
>sp|Q3ZCJ8|CATC_BOVIN Dipeptidyl peptidase 1 OS=Bos taurus GN=CTSC PE=2 SV=1
Length = 463
Score = 92.8 bits (229), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 133/335 (39%), Gaps = 52/335 (15%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEY-LRQFLIADAKYFDQSDRPLPGDRKTY 76
LY+++ ++ IN +WTA P E L++ + + + RP P
Sbjct: 167 LYRYNHDFVKAINAIQKSWTAA---PYMEYETLTLKEMIRRGGGHSRRIPRPKPAPITAE 223
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
+ +P +D W N I V + G+C + + FA++G R I + Q
Sbjct: 224 IQKKILHLPTSWD----WRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQ 279
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
LS + V SC + + C G + + G Y G P
Sbjct: 280 TPILSPQEVVSCSQYA-----QGCEGGFPY-----------LIAGKYAQDFGLVEEDCFP 323
Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
+ S L K C ++ ++ ++ NE +K E+
Sbjct: 324 YTGTDSPCRL---------KEGCFR----------YYSSEYHYVGGFYGGCNEALMKLEL 364
Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--NGTPYWL 307
+ GP F +YDDF HY+ GVY HT E H+ L+G+GT+ +G YW+
Sbjct: 365 VHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWI 424
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
V N+WG WG+ G +I RG ECA E + A P
Sbjct: 425 VKNSWGTSWGENGYFRIRRGTDECAIESIALAATP 459
>sp|P80067|CATC_RAT Dipeptidyl peptidase 1 OS=Rattus norvegicus GN=Ctsc PE=1 SV=3
Length = 462
Score = 92.4 bits (228), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 88/340 (25%), Positives = 137/340 (40%), Gaps = 63/340 (18%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLR---QFLIADAKYFDQSDRPLPGDRK 74
LY + ++ IN +WTA EEY + + LI + + + RP P
Sbjct: 167 LYSHNHNFVKAINSVQKSWTA------TTYEEYEKLSIRDLIRRSGHSGRILRPKPAPIT 220
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSKG 131
+ ++P+ +D W N I V + +C + + FA++G R I +
Sbjct: 221 DEIQQQILSLPESWD----WRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNN 276
Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD-RTGCQP-- 188
Q LS + V SC + C G F + D+G C P
Sbjct: 277 SQTPILSPQEVVSCSPYA-----QGCDGG-------FPYLIAGKYAQDFGVVEENCFPYT 324
Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
+T +PC PK C ++ ++ ++ NE
Sbjct: 325 ATDAPCK----------------PKENC----------LRYYSSEYYYVGGFYGGCNEAL 358
Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--NG 302
+K E++ HGP F ++DDF HY SG+Y HT + E H+ L+G+G + G
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTG 418
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
YW+V N+WG WG+ G +I RG ECA E + A P
Sbjct: 419 LDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIP 458
>sp|P97821|CATC_MOUSE Dipeptidyl peptidase 1 OS=Mus musculus GN=Ctsc PE=2 SV=1
Length = 462
Score = 92.0 bits (227), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 134/337 (39%), Gaps = 57/337 (16%)
Query: 18 LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
LY + ++ IN +WTA E+ + LI + + + RP P
Sbjct: 167 LYTHNHNFVKAINTVQKSWTAT---AYKEYEKMSLRDLIRRSGHSQRIPRPKPAPMTDEI 223
Query: 78 PEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
+ +P+ +D W N + +V + +C + + FA++G R I + Q
Sbjct: 224 QQQILNLPESWD----WRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQT 279
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD-RTGCQPSTI-- 191
LS + V SC + C G F + D+G C P T
Sbjct: 280 PILSPQEVVSCSPYA-----QGCDGG-------FPYLIAGKYAQDFGVVEESCFPYTAKD 327
Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKK 251
SPC P+ C ++ + ++ NE +K
Sbjct: 328 SPCK----------------PRENC----------LRYYSSDYYYVGGFYGGCNEALMKL 361
Query: 252 EILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--NGTPY 305
E++ HGP F ++DDF HY SG+Y HT + E H+ L+G+G + G Y
Sbjct: 362 ELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEY 421
Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
W++ N+WG +WG+ G +I RG ECA E + A P
Sbjct: 422 WIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIP 458
>sp|Q26563|CATC_SCHMA Cathepsin C OS=Schistosoma mansoni PE=2 SV=1
Length = 454
Score = 85.5 bits (210), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 136/344 (39%), Gaps = 53/344 (15%)
Query: 8 LLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDR 67
L G LY + +++ +IN +W G +P LS+ + + R
Sbjct: 141 LFGSKSFGRTLYHINPSFVGKINAHQKSW-RGEIYP-ELSKYTIDELRNRAGGVKSMVTR 198
Query: 68 PLPGDRKTYDPEY---SATVPDRFDAREQWPNCGT---IGHVPDTGACAAPHIFAAVGAF 121
P +RKT E + +P FD P G+ + + + G C + + + A
Sbjct: 199 PSVLNRKTPSKELISLTGNLPLEFDWTS--PPDGSRSPVTPIRNQGICGSCYASPSAAAL 256
Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
R + S + LS + V C ++ C+ G F + G YG
Sbjct: 257 EARIRLVSNFSEQPILSPQTVVDCSPY-----SEGCNGGFPF-----------LIAGKYG 300
Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
+ G + P + + K CT ++ + Y+
Sbjct: 301 EDFGLPQKIVIPYT------------GEDTGKCTVSKNCTR------YYTTDYSYIGGYY 342
Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK-------LENYLHSGKL 294
NE ++ E++++GP F +Y+DF YK G+Y HT+ E H+ L
Sbjct: 343 GATNEKLMQLELISNGPFPVGFEVYEDFQFYKEGIYHHTTVQTDHYNFNPFELTNHAVLL 402
Query: 295 IGWGTE--NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYL 336
+G+G + +G PYW V N+WG WG++G +ILRG EC E L
Sbjct: 403 VGYGVDKLSGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVESL 446
>sp|Q3T0I2|CATH_BOVIN Pro-cathepsin H OS=Bos taurus GN=CTSH PE=2 SV=1
Length = 335
Score = 78.6 bits (192), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 60/112 (53%), Gaps = 6/112 (5%)
Query: 223 PTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
P+ F +D TL ++E+A+ + + H P + F + DF Y+ G+Y TS
Sbjct: 218 PSKAIAFVKDVANITL-----NDEEAMVEAVALHNPVSFAFEVTADFMMYRKGIYSSTSC 272
Query: 283 AKLENYL-HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAF 333
K + + H+ +G+G E G PYW+V N+WGP+WG +G I RGK C
Sbjct: 273 HKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIERGKNMCGL 324
>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
Length = 340
Score = 77.0 bits (188), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 107/245 (43%), Gaps = 44/245 (17%)
Query: 83 TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
T+PD D RE+ G + V G+C A F+AVGA + +K K + LS + +
Sbjct: 122 TLPDTVDWREK----GCVTEVKYQGSCGACWAFSAVGALEGQ--LKLKTGKLISLSAQNL 175
Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
C +Y NK C G + + ++ G + D + +T C H+ S
Sbjct: 176 VDCSNEEKYG-NKGCGGGYMTEAFQYIIDNGGIEA----DASYPYKATDEKC-HYNSKNR 229
Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
+C +R +G +EDA+K+ + GP +
Sbjct: 230 AATC-----------SRYIQLPFG------------------DEDALKEAVATKGPVSVG 260
Query: 263 F-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
A + F+ YKSGVY S N H ++G+GT +G YWLV N+WG ++GD+G
Sbjct: 261 IDASHSSFFFYKSGVYDDPSCTG--NVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGY 318
Query: 322 VKILR 326
+++ R
Sbjct: 319 IRMAR 323
>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
Length = 329
Score = 77.0 bits (188), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/254 (24%), Positives = 103/254 (40%), Gaps = 47/254 (18%)
Query: 76 YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
Y PE+ VPD D R++ G + V + G C + F++ GA + +K K +
Sbjct: 107 YTPEWEGRVPDSIDYRKK----GYVTPVKNQGQCGSCWAFSSAGALEGQ--LKKKTGKLL 160
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
LS + + C +N C G + + ++ + G + D G S C
Sbjct: 161 ALSPQNLVDCV-----SENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDES----CM 211
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
++ +A KC P NE A+K+ +
Sbjct: 212 YNATAKAA-----------KCRGYREIPV-------------------GNEKALKRAVAR 241
Query: 256 HGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
GP + + A F Y GVY + N +N H+ ++G+GT+ G YW++ N+WG
Sbjct: 242 VGPVSVSIDASLTSFQFYSRGVY-YDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGE 300
Query: 315 HWGDRGTVKILRGK 328
WG++G V + R K
Sbjct: 301 SWGNKGYVLLARNK 314
>sp|O46427|CATH_PIG Pro-cathepsin H OS=Sus scrofa GN=CTSH PE=1 SV=1
Length = 335
Score = 76.6 bits (187), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 59/112 (52%), Gaps = 6/112 (5%)
Query: 223 PTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
P F +D T+ ++E+A+ + + + P + F + +DF Y+ G+Y TS
Sbjct: 218 PDKAIAFVKDVANITM-----NDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSC 272
Query: 283 AKLENYL-HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAF 333
K + + H+ +G+G ENG PYW+V N+WGP WG G I RGK C
Sbjct: 273 HKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGL 324
>sp|P09668|CATH_HUMAN Pro-cathepsin H OS=Homo sapiens GN=CTSH PE=1 SV=4
Length = 335
Score = 75.1 bits (183), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 59/112 (52%), Gaps = 6/112 (5%)
Query: 223 PTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
P GF +D T+ +E+A+ + + + P + F + DF Y++G+Y TS
Sbjct: 218 PGKAIGFVKDVANITIY-----DEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSC 272
Query: 283 AKLENYL-HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAF 333
K + + H+ +G+G +NG PYW+V N+WGP WG G I RGK C
Sbjct: 273 HKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGL 324
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 72.8 bits (177), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 47/89 (52%), Gaps = 1/89 (1%)
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT-SNAKLENYLHSGKLIGWGTENGTP 304
ED +K + P + F + D F YKSGVY ++ H+ +G+G ENG P
Sbjct: 263 EDELKNAVGLVRPVSVAFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVP 322
Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAF 333
YWL+ N+WG WGD G K+ GK CA
Sbjct: 323 YWLIKNSWGADWGDNGYFKMEMGKNMCAI 351
>sp|P55097|CATK_MOUSE Cathepsin K OS=Mus musculus GN=Ctsk PE=2 SV=2
Length = 329
Score = 72.8 bits (177), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/254 (23%), Positives = 103/254 (40%), Gaps = 47/254 (18%)
Query: 76 YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
Y PE+ VPD D R++ G + V + G C + F++ GA + +K K +
Sbjct: 107 YTPEWEGRVPDSIDYRKK----GYVTPVKNQGQCGSCWAFSSAGALEGQ--LKKKTGKLL 160
Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
LS + + C +N C G + + ++ + G + D G S C
Sbjct: 161 ALSPQNLVDCV-----TENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDES----CM 211
Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
++ +A KC P NE A+K+ +
Sbjct: 212 YNATAKAA-----------KCRGYREIPV-------------------GNEKALKRAVAR 241
Query: 256 HGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
GP + + A F Y GVY + N +N H+ ++G+GT+ G+ +W++ N+WG
Sbjct: 242 VGPISVSIDASLASFQFYSRGVY-YDENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGE 300
Query: 315 HWGDRGTVKILRGK 328
WG++G + R K
Sbjct: 301 SWGNKGYALLARNK 314
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 72.0 bits (175), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 130/315 (41%), Gaps = 56/315 (17%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNFPANLS-EEYLRQFLIADAKYFDQSDRPLPGDR 73
R E++K + ++D+ N + ++ G A+L+ +EY ++L A + G+R
Sbjct: 72 RFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEK--------KGER 123
Query: 74 KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
+T Y A V D W G + V D G C + F+ +GA I + G
Sbjct: 124 RT-SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVT-GDL 181
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV-TGGDYGDRTGCQPSTIS 192
E V C N+ C+ G + + F+ K G + T DY + T
Sbjct: 182 ITLSEQELVD-----CDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKG--VDGTCD 234
Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
+ T+ S E+ PTY +E+++KK
Sbjct: 235 QIRKNAKVVTIDSYEDV-------------PTY-------------------SEESLKKA 262
Query: 253 ILAHGPTT-ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINT 311
+ AH P + A A F Y SG++ + +L+ H +G+GTENG YW+V N+
Sbjct: 263 V-AHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLD---HGVVAVGYGTENGKDYWIVRNS 318
Query: 312 WGPHWGDRGTVKILR 326
WG WG+ G +++ R
Sbjct: 319 WGKSWGESGYLRMAR 333
>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
Length = 333
Score = 71.6 bits (174), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 58/113 (51%), Gaps = 6/113 (5%)
Query: 222 NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTS 281
NP F ++ TL ++E A+ + + + P + F + +DF YKSGVY S
Sbjct: 215 NPQKAVAFVKNVVNITL-----NDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKS 269
Query: 282 NAKLENYL-HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAF 333
K + + H+ +G+G +NG YW+V N+WG WG+ G I RGK C
Sbjct: 270 CHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGL 322
>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
Length = 333
Score = 71.6 bits (174), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 59/251 (23%), Positives = 100/251 (39%), Gaps = 45/251 (17%)
Query: 85 PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
P D R++ + V + GAC + F+ GA I S + L+ + +
Sbjct: 115 PSSMDWRKK---GNVVSPVKNQGACGSCWTFSTTGALESAVAIASG--KMMTLAEQQLVD 169
Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
C + +N C G + + ++ + G D
Sbjct: 170 CA---QNFNNHGCQGGLPSQAFEYILYNKGIMGED------------------------- 201
Query: 205 SCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P + + +C NP F ++ TL ++E A+ + + + P + F
Sbjct: 202 -----SYPYIGKNGQCKFNPEKAVAFVKNVVNITL-----NDEAAMVEAVALYNPVSFAF 251
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYL-HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
+ +DF YKSGVY S K + + H+ +G+G +NG YW+V N+WG +WG+ G
Sbjct: 252 EVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNGYF 311
Query: 323 KILRGKYECAF 333
I RGK C
Sbjct: 312 LIERGKNMCGL 322
>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
Length = 330
Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 123/295 (41%), Gaps = 52/295 (17%)
Query: 35 TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQW 94
++ G N +++ E + L++ + +Q R + TY + +PD D RE+
Sbjct: 72 SYDLGMNHLGDMTSEEVMS-LMSSLRVPNQWQRNI-----TYKSNPNQMLPDSVDWREK- 124
Query: 95 PNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDN 154
G + V G+C A F+AVGA + +K K + LS + + C + +Y N
Sbjct: 125 ---GCVTEVKYQGSCGACWAFSAVGALEAQ--LKLKTGKLVSLSAQNLVDCSE--KYG-N 176
Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL 214
K C+ G + + ++ D G P T C+ +
Sbjct: 177 KGCNGGFMTEAFQYII-----------DNKGIDSEASYP-----YKATDQKCQYDSKYRA 220
Query: 215 KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT-TATFALYDDFYHYK 273
++ T YGR ED +K+ + GP A + F+ Y+
Sbjct: 221 ATCSKYTELPYGR------------------EDVLKEAVANKGPVCVGVDASHPSFFLYR 262
Query: 274 SGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
SGVY + + N H +IG+G NG YWLV N+WG ++G++G +++ R K
Sbjct: 263 SGVYYDPACTQKVN--HGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGYIRMARNK 315
>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
PE=2 SV=2
Length = 362
Score = 69.7 bits (169), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 48/90 (53%), Gaps = 3/90 (3%)
Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVY--KHTSNAKLENYLHSGKLIGWGTENGT 303
ED +K + P + F + + F YKSGVY H + ++ H+ +G+G ENG
Sbjct: 264 EDELKNAVGLVRPVSVAFQVINGFRMYKSGVYTSDHCGTSPMD-VNHAVLAVGYGVENGV 322
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAF 333
PYWL+ N+WG WGD G K+ GK C
Sbjct: 323 PYWLIKNSWGADWGDNGYFKMEMGKNMCGI 352
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 69.7 bits (169), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 85/321 (26%), Positives = 135/321 (42%), Gaps = 52/321 (16%)
Query: 9 LGCTLVRGELYKFSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDR 67
LG R +++K + ++D+ N + T+ G A+L+ E R + K +++
Sbjct: 58 LGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYLR--KKMERTKD 115
Query: 68 PLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCI 127
+ +R Y + +PD D W G + V D G C + F+AVGA I
Sbjct: 116 SVKTERYLY--KEGDVLPDEVD----WRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQI 169
Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV-TGGDYGDRTGC 186
+ + LS + + C R N C G + + F+ K G + T DY
Sbjct: 170 TTG--ELISLSEQELVDCD---RGFVNAGCDGGIMNYAFEFIMKNGGIETDQDY------ 218
Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNE 246
P + A L C K +TR ++D R D+E
Sbjct: 219 ------PYN----ANDLGLCNADK----NNNTRVVTIDG----YEDVPR--------DDE 252
Query: 247 DAIKKEILAHGPTT-ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
++KK + AH P + A A F YKSGV T L+ H ++G+G+ +G Y
Sbjct: 253 KSLKKAV-AHQPVSVAIEASSQAFQLYKSGVMTGTCGISLD---HGVVVVGYGSTSGEDY 308
Query: 306 WLVINTWGPHWGDRGTVKILR 326
W++ N+WG +WGD G VK+ R
Sbjct: 309 WIIRNSWGLNWGDSGYVKLQR 329
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.137 0.455
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 145,290,624
Number of Sequences: 539616
Number of extensions: 6634845
Number of successful extensions: 12371
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 197
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 11913
Number of HSP's gapped (non-prelim): 296
length of query: 344
length of database: 191,569,459
effective HSP length: 118
effective length of query: 226
effective length of database: 127,894,771
effective search space: 28904218246
effective search space used: 28904218246
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (28.5 bits)