BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 027054
(229 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
Length = 339
Score = 197 bits (502), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
Length = 339
Score = 197 bits (501), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
GN=cpr-5 PE=2 SV=1
Length = 344
Score = 197 bits (501), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 102/208 (49%), Positives = 126/208 (60%), Gaps = 20/208 (9%)
Query: 21 NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS-- 69
N LS DLL+CC F CG+GC+GGYPI AW+++V HG+VT C PY +
Sbjct: 132 NTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPC 191
Query: 70 ----TGCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEI 121
G P C E PTPKCV C KN + KH+ +AY + E I EI
Sbjct: 192 GETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEI 251
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
NGP+EV+FTVYEDF Y +GVY H G +GGHAVK++GWG D+G YW++AN WN
Sbjct: 252 LTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNV 310
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
+WG GYF+I RG NECGIE VAG+P
Sbjct: 311 AWGEKGYFRIIRGLNECGIEHSAVAGIP 338
>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
Length = 339
Score = 197 bits (500), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 145/227 (63%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
Length = 339
Score = 194 bits (494), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 96/206 (46%), Positives = 130/206 (63%), Gaps = 14/206 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 69 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
+FKI RG N CGIE ++VAG+P ++
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRTQQ 334
>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
Length = 335
Score = 192 bits (489), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 132/208 (63%), Gaps = 16/208 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C ++ KH+ S+Y I+ + ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVY DF YKSGVY+H+TGD+MGGHA++++GWG ++G YW++ N WN WG +
Sbjct: 249 VEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNL 214
G+FKI RG + CGIE ++VAG+P + +
Sbjct: 308 GFFKILRGQDHCGIESEIVAGIPCTPHF 335
>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
GN=AC-2 PE=2 SV=1
Length = 342
Score = 189 bits (481), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 94/210 (44%), Positives = 131/210 (62%), Gaps = 17/210 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ +++S D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C
Sbjct: 135 KQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPC 193
Query: 73 SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G C PTP C RKC +++R K Y AY + + I +EI KN
Sbjct: 194 GHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKN 253
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV SF VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W++AN W+ WG
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWG 312
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
GYF+I RGSN+CGIE + AG+ +++L
Sbjct: 313 EKGYFRIVRGSNDCGIEGTIAAGIVDTESL 342
>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
Length = 339
Score = 189 bits (480), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 93/204 (45%), Positives = 129/204 (63%), Gaps = 14/204 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH 189
Query: 69 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S P C TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPV 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV+ DF YKSGVYKH GD+MGGHA++++GWG ++G YW+ AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
+FKI RG N CGIE ++VAG+P +
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRT 332
>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
GN=AC-1 PE=2 SV=1
Length = 342
Score = 188 bits (477), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 131/210 (62%), Gaps = 17/210 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ +++S D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C
Sbjct: 135 KQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPC 193
Query: 73 SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G C PTP C RKC +++R K Y AY + + I +EI +N
Sbjct: 194 GHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRN 253
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV SF VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W++AN W+ WG
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWG 312
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
GYF+I RG+N+CGIE + AG+ +++L
Sbjct: 313 EKGYFRIIRGTNDCGIEGTIAAGIVDTESL 342
>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
Length = 340
Score = 187 bits (474), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 94/205 (45%), Positives = 130/205 (63%), Gaps = 19/205 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGC 77
++ +S DLL+CCGF CG GC+GGYP AWRY+ G+V+ Y GC + P C
Sbjct: 130 SVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGL--YDSHVGCRAYTIPPC 187
Query: 78 E------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
E TP+C R C + ++ KHY I++Y + ++IMAEIYKN
Sbjct: 188 EHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKN 247
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +F VYEDF YKSGVY+H++G+ +GGHA++++GWG ++G YW+ AN WN WG
Sbjct: 248 GPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWG 306
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG + CGIE ++VAG+P
Sbjct: 307 ITGFFKILRGEDHCGIESEIVAGVP 331
>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
GN=cpr-4 PE=2 SV=1
Length = 335
Score = 186 bits (472), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 98/205 (47%), Positives = 127/205 (61%), Gaps = 18/205 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N LS D+L+CC CG GC+GGYPI+AW+Y V G T C PY +
Sbjct: 131 NTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGE 189
Query: 69 STG-CSHPGC-EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ G + P C + Y TP CV KC KN + KH+ +AY + I AEI +
Sbjct: 190 TVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAH 249
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVYEDF YK+GVY H TG +GGHA++++GWGT D+G YW++AN WN +WG
Sbjct: 250 GPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVNWG 308
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+GYF+I RG+NECGIE VV G+P
Sbjct: 309 ENGYFRIIRGTNECGIEHAVVGGVP 333
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
PE=1 SV=2
Length = 329
Score = 184 bits (468), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 95/195 (48%), Positives = 121/195 (62%), Gaps = 10/195 (5%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +S +DLL+CCG CG+GC+GGYPI A R++ GVVT C PY C+
Sbjct: 134 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPY-PIAPCT 192
Query: 74 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
C P TP C C + + KH+ +SAY + + I AEIY NGPVE +F+
Sbjct: 193 SGNC-PESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFS 251
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
VYEDF YKSGVYKH G +GGHA+K+IGWGT + G YW++AN W +WG G+FKI
Sbjct: 252 VYEDFYKYKSGVYKHTAGKYLGGHAIKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIY 310
Query: 193 RGSNECGIEEDVVAG 207
RG ++CGIE VVAG
Sbjct: 311 RGDDQCGIESAVVAG 325
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
GN=cpr-6 PE=1 SV=1
Length = 379
Score = 181 bits (460), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 101/224 (45%), Positives = 132/224 (58%), Gaps = 21/224 (9%)
Query: 9 DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 61
D + + + LQ ++LS +DLL+CC CG GC+GG P++AWRY+V G+VT
Sbjct: 144 DRICIASHGELQ-VTLSADDLLSCCKS-CGFGCNGGDPLAAWRYWVKDGIVTGSNYTANN 201
Query: 62 ECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRIN 111
C PY C H P YPTPKC +KCV ++ + K + SAY +
Sbjct: 202 GCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVK 260
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
D E I E+ +GP+E++F VYEDF +Y GVY H G + GGHAVKLIGWG DDG
Sbjct: 261 DDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-DDGIP 319
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
YW +AN WN WG DG+F+I RG +ECGIE VV G+P +L
Sbjct: 320 YWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSLT 363
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
SV=1
Length = 340
Score = 178 bits (452), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 94/202 (46%), Positives = 125/202 (61%), Gaps = 16/202 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
QN+ LS DLL CC CG GC+GG AW Y+V G+VT C+PY
Sbjct: 138 QNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCE 196
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C + C +K + + KH S+Y + +D + I EI K G
Sbjct: 197 HHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 256
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG ++ YW++AN WN WG
Sbjct: 257 PVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGE 315
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+GYF+I RG +EC IE +V+AG
Sbjct: 316 NGYFRIVRGRDECSIESEVIAG 337
>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
GN=cpr-3 PE=2 SV=1
Length = 370
Score = 176 bits (445), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 129/217 (59%), Gaps = 20/217 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +SV D+L+CCG CG GC GGY I A R++ G VT C PY S
Sbjct: 141 QQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPC 198
Query: 74 HPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEV 129
C P TP C C K + ++ KHY SAY++ + +I EIY GPVE
Sbjct: 199 TKNC-PESTTPSCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEA 257
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
S+ VYEDF HYKSGVY + +G ++GGHAVK+IGWG ++G DYW++AN W S+G G+F
Sbjct: 258 SYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFF 316
Query: 190 KIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
KI+RG+NEC IE +VVAG + K T ++ +ED
Sbjct: 317 KIRRGTNECQIEGNVVAG------IAKLGTHSETYED 347
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
GN=CATB PE=2 SV=1
Length = 342
Score = 169 bits (429), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 88/203 (43%), Positives = 121/203 (59%), Gaps = 16/203 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
Q+ LS DL++CC CGDGC GG+P AW Y+V G+VT C PY
Sbjct: 139 QSAELSALDLISCCKD-CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCE 197
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C + C K + + KHY +Y + ++ + I +I G
Sbjct: 198 HHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYG 257
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + YW++AN WN WG
Sbjct: 258 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWGE 316
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
G F++ RG +EC IE DVVAGL
Sbjct: 317 KGLFRMVRGRDECSIESDVVAGL 339
>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
GN=CP-1 PE=3 SV=3
Length = 341
Score = 168 bits (425), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 123/202 (60%), Gaps = 17/202 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
+ + +S D+++CC + CGDGC+GG+PISA+R+ GVVT C PY + C
Sbjct: 140 KQVLISAQDVVSCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPC 197
Query: 73 SHPGCEPAY-------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H G E Y TP+C R+C+ S Y AY++ + + I +I KNG
Sbjct: 198 GHHGNETYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNG 257
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PV ++TVYEDFAHY+SG+YKH G G HAVK+IGWG + G YWI+AN W+ WG
Sbjct: 258 PVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWG-EEKGTPYWIVANSWHDDWGE 316
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+G+F++ RGSN+CG EE + AG
Sbjct: 317 NGFFRMHRGSNDCGFEERMAAG 338
>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
Length = 311
Score = 159 bits (402), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 92/198 (46%), Positives = 115/198 (58%), Gaps = 20/198 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+N+ LS D++ C +GC+GG SAW + G V+EEC PY + P C P
Sbjct: 126 ENVQLSFMDMVTCDE--TDNGCEGGDAFSAWNWLRKQGAVSEECLPY------TIPTCPP 177
Query: 80 AYP-------TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
A TP C ++C + L + KH Y +SD E IM EI NGPVE F
Sbjct: 178 AQQPCLNFVNTPSCTKECQSNSSLIYSQDKHKMAKIYSFDSD-EAIMQEIVTNGPVEACF 236
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
TV+EDF YKSGVY H TG +GGH VKL+G+GT +G DY+ NQW SWG +G F I
Sbjct: 237 TVFEDFLAYKSGVYVHTTGKDLGGHCVKLVGFGTL-NGVDYYAANNQWTTSWGDNGTFLI 295
Query: 192 KRGSNECGIEEDVVAGLP 209
KRG +CGI +DVVAGLP
Sbjct: 296 KRG--DCGISDDVVAGLP 311
>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
Length = 335
Score = 158 bits (400), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 90/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H++G++MGGHA++++GWG ++G YW++ N WN WG +
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG + CGIE ++VAG+P
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMP 330
>sp|Q06544|CYSP3_OSTOS Cathepsin B-like cysteine proteinase 3 (Fragment) OS=Ostertagia
ostertagi GN=CP-3 PE=3 SV=1
Length = 174
Score = 145 bits (367), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/175 (45%), Positives = 104/175 (59%), Gaps = 17/175 (9%)
Query: 49 AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 94
AW+YF GVVT C PY + C G EP Y TPKC + C +
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59
Query: 95 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 153
+ ++ KH+ SAYR+ ++ + I +I KNGPV F VYEDFAHYKSG+YKH G +
Sbjct: 60 LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119
Query: 154 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
GGHAVK+IGWG + G YW++AN W+ WG G++++ RG N C IEE V AG+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173
>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
Length = 299
Score = 142 bits (359), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 75/168 (44%), Positives = 94/168 (55%), Gaps = 11/168 (6%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
CDGG+ S WR+ G T+EC PY G A T C KC + L
Sbjct: 140 CDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHLY 190
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
K Y + D IM + GP++ +FTVY DF +Y+SGVY+H G V GGHAV +
Sbjct: 191 KATKAVDYGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDM 248
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+G+GT DDG DYWI+ N W WG DGYF+I R +NECGIEE V+ G
Sbjct: 249 VGYGTDDDGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 296
>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
SV=3
Length = 476
Score = 131 bits (329), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
SV=1
Length = 476
Score = 129 bits (325), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 113/221 (51%), Gaps = 34/221 (15%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF +YK+G+Y+HIT HAVKL GWGT E +WI AN W +
Sbjct: 381 HEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 222
SWG +GYF+I RG NE IE+ ++A ++TSAD
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474
>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
Length = 300
Score = 126 bits (316), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 65/168 (38%), Positives = 94/168 (55%), Gaps = 11/168 (6%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
C+GG+ + W++ G T+EC PY + C PT KC + +
Sbjct: 141 CNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHLA 191
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
S Y + D +M + +GP++V+F V+ DF +Y+SGVY+H G + GGHAV++
Sbjct: 192 TATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVEM 249
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+G+GT DDG DYWI+ N W WG DGYF++ RG N+C IEE AG
Sbjct: 250 VGYGTDDDGVDYWIIKNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297
>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
SV=1
Length = 435
Score = 117 bits (292), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 71/201 (35%), Positives = 106/201 (52%), Gaps = 26/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY G P C+
Sbjct: 252 QTPILSPQEIVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CK 305
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P C R + +S++Y + + + + E+ ++GP+ V+F VY+DF
Sbjct: 306 PN----DCFR--------YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFF 353
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
HY+ G+Y H + HAV L+G+GT S G DYWI+ N W WG DGYF+I
Sbjct: 354 HYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRI 413
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA P K
Sbjct: 414 RRGTDECAIESIAVAATPIPK 434
>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
Length = 303
Score = 117 bits (292), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 15/188 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ +S S L++C L GCDGG W + G T EC Y D G
Sbjct: 126 EAVSYSQQHLISCS--LENFGCDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTV 177
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
A P P QL++ + +S S P IM + GP++ VY D ++
Sbjct: 178 ASPCPAVCDDG-SPIQLYKAHGYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSY 231
Query: 140 YKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
Y+SGVYKH G + +G HA++++G+GT+DDG DYWI+ N W WG +GYF+I RG NEC
Sbjct: 232 YESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNEC 291
Query: 199 GIEEDVVA 206
IE+++ A
Sbjct: 292 RIEDEIYA 299
>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
Length = 463
Score = 116 bits (291), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 279 QTPILSSQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY++G+Y H + HAV L+G+GT S G DYWI+ N W SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>sp|Q5RB02|CATC_PONAB Dipeptidyl peptidase 1 OS=Pongo abelii GN=CTSC PE=2 SV=1
Length = 463
Score = 115 bits (287), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG DGYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>sp|P53634|CATC_HUMAN Dipeptidyl peptidase 1 OS=Homo sapiens GN=CTSC PE=1 SV=2
Length = 463
Score = 113 bits (282), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>sp|Q3ZCJ8|CATC_BOVIN Dipeptidyl peptidase 1 OS=Bos taurus GN=CTSC PE=2 SV=1
Length = 463
Score = 112 bits (279), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 105/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E+C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ GP+ V+F VY+D
Sbjct: 331 -----------CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY+ GVY H + HAV L+G+GT + G DYWI+ N W SWG +GYF
Sbjct: 380 FLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE +A P K
Sbjct: 440 RIRRGTDECAIESIALAATPIPK 462
>sp|P80067|CATC_RAT Dipeptidyl peptidase 1 OS=Rattus norvegicus GN=Ctsc PE=1 SV=3
Length = 462
Score = 110 bits (276), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY +
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 328
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + +S++Y + + + + E+ K+GP+ V+F V++DF
Sbjct: 329 PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G G DYWI+ N W WG GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRI 440
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE +A +P K
Sbjct: 441 RRGTDECAIESIAMAAIPIPK 461
>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
elegans GN=F26E4.3 PE=1 SV=3
Length = 452
Score = 110 bits (275), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 70/198 (35%), Positives = 100/198 (50%), Gaps = 14/198 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N +LS LL+C GC+GGY AW Y GVV + C PY S PG
Sbjct: 232 NSTLSSQQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLI 289
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
R+ ++ ++S + ++ Y+++S EDI E+ NGPV+ +F V+EDF
Sbjct: 290 PKRDYTNRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFM 349
Query: 140 YKSGVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGY 188
Y GVY+H + G H+V+++GWG ++ YW+ AN W WG DGY
Sbjct: 350 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGY 409
Query: 189 FKIKRGSNECGIEEDVVA 206
FK+ RG N C IE V+
Sbjct: 410 FKVLRGENHCEIESFVIG 427
>sp|P97821|CATC_MOUSE Dipeptidyl peptidase 1 OS=Mus musculus GN=Ctsc PE=2 SV=1
Length = 462
Score = 108 bits (271), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 328
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + +S +Y + + + + E+ K+GP+ V+F V++DF
Sbjct: 329 PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G G +YWI+ N W +WG GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 440
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA +P K
Sbjct: 441 RRGTDECAIESIAVAAIPIPK 461
>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
GN=Tinagl1 PE=1 SV=1
Length = 466
Score = 107 bits (266), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 107/227 (47%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 234 AAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCY 292
Query: 65 PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
P+ A PTP+C+ R+ + Q+ N + AYR+
Sbjct: 293 PFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLG 346
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
SD ++IM E+ +NGPV+ V+EDF Y+ G+Y H G H+VK+ GW
Sbjct: 347 SDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 406
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W WG G+F+I RG+NEC IE V+
Sbjct: 407 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 453
>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
GN=TINAGL1 PE=1 SV=1
Length = 467
Score = 104 bits (259), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 107/221 (48%), Gaps = 25/221 (11%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCY 293
Query: 65 PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
P+ D G + P + + R+ N N+ Y ++ YR+ S+ ++I
Sbjct: 294 PFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEI 353
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
M E+ +NGPV+ V+EDF YK G+Y H + G H+VK+ GWG T
Sbjct: 354 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 413
Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 414 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454
>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
GN=Tinagl1 PE=2 SV=1
Length = 467
Score = 103 bits (258), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 76/227 (33%), Positives = 107/227 (47%), Gaps = 36/227 (15%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 234 AAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QKGCRGGRLDGAWWFLRRRGVVSDNCY 292
Query: 65 PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
P+ + A PTP+C+ R+ + +Q+ N + YR+
Sbjct: 293 PF-----SGREQNDEASPTPRCMMHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLA 347
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
SD ++IM E+ +NGPV+ V+EDF Y+ G+Y H G H+VK+ GW
Sbjct: 348 SDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 407
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W WG G+F+I RG NEC IE V+
Sbjct: 408 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIETFVLG 454
>sp|Q26563|CATC_SCHMA Cathepsin C OS=Schistosoma mansoni PE=2 SV=1
Length = 454
Score = 91.3 bits (225), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 91/190 (47%), Gaps = 29/190 (15%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
LS ++ C + +GC+GG+P + A +Y G+ + PY TG
Sbjct: 272 LSPQTVVDCSPY--SEGCNGGFPFLIAGKYGEDFGLPQKIVIPY---TGED--------- 317
Query: 83 TPKCVRKCVKKNQLWRNSKHYS-ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
T KC V KN + YS I Y ++ + + E+ NGP V F VYEDF YK
Sbjct: 318 TGKCT---VSKNCTRYYTTDYSYIGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQFYK 374
Query: 142 SGVYKHITGDV---------MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
G+Y H T + HAV L+G+G GE YW + N W WG GYF+I
Sbjct: 375 EGIYHHTTVQTDHYNFNPFELTNHAVLLVGYGVDKLSGEPYWKVKNSWGVEWGEQGYFRI 434
Query: 192 KRGSNECGIE 201
RG++ECG+E
Sbjct: 435 LRGTDECGVE 444
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 87.4 bits (215), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 64/193 (33%), Positives = 88/193 (45%), Gaps = 40/193 (20%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCE 78
+N+SLS L+ C G GC+GG P A+ Y ++G + TEE PY G H E
Sbjct: 187 KNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTEESYPYKGVNGVCHYKAE 246
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG-----PVEVSFTV 133
N+ + + I + ED + KN PV V+F V
Sbjct: 247 --------------------NAAVQVLDSVNITLNAEDEL----KNAVGLVRPVSVAFQV 282
Query: 134 YEDFAHYKSGVYKHITGDVMG------GHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
+ F YKSGVY T D G HAV +G+G ++G YW++ N W WG +G
Sbjct: 283 IDGFRQYKSGVY---TSDHCGTTPDDVNHAVLAVGYGV-ENGVPYWLIKNSWGADWGDNG 338
Query: 188 YFKIKRGSNECGI 200
YFK++ G N C I
Sbjct: 339 YFKMEMGKNMCAI 351
>sp|Q9R1T3|CATZ_RAT Cathepsin Z OS=Rattus norvegicus GN=Ctsz PE=1 SV=2
Length = 306
Score = 87.4 bits (215), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 83/196 (42%), Gaps = 21/196 (10%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPAY 81
LSV +++ C C+GG + W Y HG+ E C+ Y D C
Sbjct: 120 LSVQNVIDCGN---AGSCEGGNDLPVWEYAHKHGIPDETCNNYQAKDQECDKFNQCGTCT 176
Query: 82 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
+C ++ LWR + S+S E +MAEIY NGP+ E ++Y
Sbjct: 177 EFKEC--HTIQNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATERMSNYT 228
Query: 142 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 201
G+Y + H + + GWG S+DG +YWI+ N W WG G+ +I +
Sbjct: 229 GGIYTEYQNQAIINHIISVAGWGVSNDGIEYWIVRNSWGEPWGERGWMRI--------VT 280
Query: 202 EDVVAGLPSSKNLVKE 217
G SS NL E
Sbjct: 281 STYKGGTGSSYNLAIE 296
>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
Length = 333
Score = 86.7 bits (213), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 97/206 (47%), Gaps = 33/206 (16%)
Query: 4 TRTNRDALSSSPYV-SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTE 61
T + AL S+ + S + LSL+ L+ C GC GG P A+ Y +++ G++ E
Sbjct: 141 TFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEE 200
Query: 62 ECDPYF--DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA 119
+ PY DS+ +P A+ V+ V I + E M
Sbjct: 201 DSYPYIGKDSSCRFNPQKAVAF-----VKNVV-----------------NITLNDEAAMV 238
Query: 120 E-IYKNGPVEVSFTVYEDFAHYKSGVYK----HITGDVMGGHAVKLIGWGTSDDGEDYWI 174
E + PV +F V EDF YKSGVY H T D + HAV +G+G +G YWI
Sbjct: 239 EAVALYNPVSFAFEVTEDFLMYKSGVYSSKSCHKTPDKVN-HAVLAVGYG-EQNGLLYWI 296
Query: 175 LANQWNRSWGADGYFKIKRGSNECGI 200
+ N W WG +GYF I+RG N CG+
Sbjct: 297 VKNSWGSQWGENGYFLIERGKNMCGL 322
>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
Length = 333
Score = 86.3 bits (212), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 93/204 (45%), Gaps = 29/204 (14%)
Query: 4 TRTNRDALSSSPYV-SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTE 61
T + AL S+ + S + ++L+ L+ C GC GG P A+ Y +++ G++ E
Sbjct: 141 TFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGE 200
Query: 62 ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE- 120
+ PY G E A K V I + E M E
Sbjct: 201 DSYPYIGKNGQCKFNPEKAVAFVKNV--------------------VNITLNDEAAMVEA 240
Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYK----HITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
+ PV +F V EDF YKSGVY H T D + HAV +G+G +G YWI+
Sbjct: 241 VALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVN-HAVLAVGYG-EQNGLLYWIVK 298
Query: 177 NQWNRSWGADGYFKIKRGSNECGI 200
N W +WG +GYF I+RG N CG+
Sbjct: 299 NSWGSNWGNNGYFLIERGKNMCGL 322
>sp|Q9WUU7|CATZ_MOUSE Cathepsin Z OS=Mus musculus GN=Ctsz PE=2 SV=1
Length = 306
Score = 85.9 bits (211), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 80/175 (45%), Gaps = 17/175 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
++ LSV +++ C C+GG + W Y HG+ E C+ Y C+
Sbjct: 117 SILLSVQNVIDCGN---AGSCEGGNDLPVWEYAHKHGIPDETCNNY----QAKDQDCDKF 169
Query: 81 YPTPKCV--RKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C ++C ++ LWR + S+S E +MAEIY NGP+ E
Sbjct: 170 NQCGTCTEFKECHTIQNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATEM 223
Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
++Y G+Y + H + + GWG S+DG +YWI+ N W WG G+ +I
Sbjct: 224 MSNYTGGIYAEHQDQAVINHIISVAGWGVSNDGIEYWIVRNSWGEPWGEKGWMRI 278
>sp|Q9UBR2|CATZ_HUMAN Cathepsin Z OS=Homo sapiens GN=CTSZ PE=1 SV=1
Length = 303
Score = 84.7 bits (208), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 75/170 (44%), Gaps = 14/170 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPAY 81
LSV +++ C C+GG +S W Y HG+ E C+ Y D C
Sbjct: 118 LSVQNVIDCGN---AGSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCN 174
Query: 82 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
+C ++ LWR + S+S E +MAEIY NGP+ E A+Y
Sbjct: 175 EFKEC--HAIRNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATERLANYT 226
Query: 142 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
G+Y H V + GWG S DG +YWI+ N W WG G+ +I
Sbjct: 227 GGIYAEYQDTTYINHVVSVAGWGIS-DGTEYWIVRNSWGEPWGERGWLRI 275
>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
Length = 356
Score = 84.0 bits (206), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 87/188 (46%), Gaps = 30/188 (15%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCE 78
+ +SLS L+ C G GC+GG P A+ Y +G + TEE PY G C+
Sbjct: 182 KGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNGI----CK 237
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKH---YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
+ K + + +++ Y+++ R PV V+F V +
Sbjct: 238 --FSQANIGVKVISSVNITLGAEYELKYAVALVR----------------PVSVAFEVVK 279
Query: 136 DFAHYKSGVYKHIT-GDVMG--GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
F YKSGVY GD HAV +G+G ++G YW++ N W WG DGYFK++
Sbjct: 280 GFKQYKSGVYASTECGDTPMDVNHAVLAVGYGV-ENGTPYWLIKNSWGADWGEDGYFKME 338
Query: 193 RGSNECGI 200
G N CG+
Sbjct: 339 MGKNMCGV 346
>sp|P05689|CATZ_BOVIN Cathepsin Z OS=Bos taurus GN=CTSZ PE=2 SV=2
Length = 304
Score = 82.8 bits (203), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 77/183 (42%), Gaps = 17/183 (9%)
Query: 37 CGDG--CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN 94
CGD C+GG + W Y HG+ E C+ Y C+ C K+
Sbjct: 127 CGDAGSCEGGNDLPVWEYAHRHGIPDETCNNY----QAKDQECDKFNQCGTCTE--FKEC 180
Query: 95 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 154
+ +N + + Y S E +MAEIY NGP+ E ++Y G+Y
Sbjct: 181 HVIKNYTLWKVGDYGSLSGREKMMAEIYTNGPISCGIMATEKMSNYTGGIYSEYNDQAFI 240
Query: 155 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG--------IEEDVVA 206
H V + GWG S DG +YWI+ N W WG G+ +I + + G IEE
Sbjct: 241 NHIVSVAGWGVS-DGMEYWIVRNSWGEPWGEHGWMRIVTSTYKGGEGARYNLAIEESCTF 299
Query: 207 GLP 209
G P
Sbjct: 300 GDP 302
>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
PE=2 SV=1
Length = 358
Score = 80.9 bits (198), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 84/188 (44%), Gaps = 30/188 (15%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCE 78
+ +SLS L+ C G GC GG P A+ Y ++G + TEE PY G GC+
Sbjct: 184 KGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG----GCK 239
Query: 79 -PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
A VR V + +++ R PV V+F V +F
Sbjct: 240 FSAKNIGVQVRDSVNITLGAEDELKHAVGLVR----------------PVSVAFEVVHEF 283
Query: 138 AHYKSGVYKHIT-----GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
YK GV+ T DV HAV +G+G DD YW++ N W WG +GYFK++
Sbjct: 284 RFYKKGVFTSNTCGNTPMDV--NHAVLAVGYGVEDD-VPYWLIKNSWGGEWGDNGYFKME 340
Query: 193 RGSNECGI 200
G N CG+
Sbjct: 341 MGKNMCGV 348
>sp|P09668|CATH_HUMAN Pro-cathepsin H OS=Homo sapiens GN=CTSH PE=1 SV=4
Length = 335
Score = 80.9 bits (198), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 95/203 (46%), Gaps = 27/203 (13%)
Query: 4 TRTNRDALSSSPYVSL-QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTE 61
T + AL S+ ++ + LSL+ L+ C GC GG P A+ Y +++ G++ E
Sbjct: 143 TFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGE 202
Query: 62 ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 121
+ PY G C + K ++ + +I D E ++ +
Sbjct: 203 DTYPYQGKDG-------------YCKFQPGKAIGFVKDVANITIY------DEEAMVEAV 243
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYK----HITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
PV +F V +DF Y++G+Y H T D + HAV +G+G +G YWI+ N
Sbjct: 244 ALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVN-HAVLAVGYG-EKNGIPYWIVKN 301
Query: 178 QWNRSWGADGYFKIKRGSNECGI 200
W WG +GYF I+RG N CG+
Sbjct: 302 SWGPQWGMNGYFLIERGKNMCGL 324
>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
Length = 360
Score = 79.0 bits (193), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/187 (33%), Positives = 85/187 (45%), Gaps = 32/187 (17%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCEPA 80
+SLS L+ C GC+GG P A+ Y ++G + TEE PY G
Sbjct: 188 ISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGI-------- 239
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAH 139
C KN+ N + + I ED + + + PV V+F V F
Sbjct: 240 ---------CKFKNE---NVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRL 287
Query: 140 YKSGVYKHITGDVMG------GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
YKSGVY T D G HAV +G+G +DG YW++ N W WG +GYFK++
Sbjct: 288 YKSGVY---TSDHCGTTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDEGYFKMEM 343
Query: 194 GSNECGI 200
G N CG+
Sbjct: 344 GKNMCGV 350
>sp|Q3T0I2|CATH_BOVIN Pro-cathepsin H OS=Bos taurus GN=CTSH PE=2 SV=1
Length = 335
Score = 79.0 bits (193), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/203 (28%), Positives = 92/203 (45%), Gaps = 27/203 (13%)
Query: 4 TRTNRDALSSSPYVSLQNLS-LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTE 61
T + AL S+ ++ L L+ L+ C GC GG P A+ Y ++ G++ E
Sbjct: 143 TFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGE 202
Query: 62 ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 121
+ PY G C + K ++ + ++ +D E ++ +
Sbjct: 203 DTYPYRGQDG-------------DCKYQPSKAIAFVKDVANITL------NDEEAMVEAV 243
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYK----HITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
+ PV +F V DF Y+ G+Y H T D + HAV +G+G + G YWI+ N
Sbjct: 244 ALHNPVSFAFEVTADFMMYRKGIYSSTSCHKTPDKV-NHAVLAVGYG-EEKGIPYWIVKN 301
Query: 178 QWNRSWGADGYFKIKRGSNECGI 200
W +WG GYF I+RG N CG+
Sbjct: 302 SWGPNWGMKGYFLIERGKNMCGL 324
>sp|P22497|CYSP_THEPA Cysteine proteinase OS=Theileria parva GN=TP03_0285 PE=3 SV=2
Length = 440
Score = 77.8 bits (190), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/182 (29%), Positives = 86/182 (47%), Gaps = 32/182 (17%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYP 82
LSV +LL C F +GC GG SA+ Y +G+V+ + P+ D + CS P
Sbjct: 276 LSVQELLDCDSF--SNGCQGGLLESAYEYVRKYGLVSAKDLPFVDKARRCSVP------- 326
Query: 83 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
+K S+ +Y + E +M + P V +V + A YKS
Sbjct: 327 ----------------KAKKVSVPSYHVFKGKE-VMTRSLTSSPCSVYLSVSPELAKYKS 369
Query: 143 GVYKHITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKR---GSNEC 198
GV+ G + HAV L+G G + + YW++ N W WG +GY +++R G+++C
Sbjct: 370 GVFTGECGKSLN-HAVVLVGEGYDEVTKKRYWVVQNSWGTDWGENGYMRLERTNMGTDKC 428
Query: 199 GI 200
G+
Sbjct: 429 GV 430
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.135 0.439
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 98,162,947
Number of Sequences: 539616
Number of extensions: 4498991
Number of successful extensions: 8737
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 207
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 8237
Number of HSP's gapped (non-prelim): 254
length of query: 229
length of database: 191,569,459
effective HSP length: 113
effective length of query: 116
effective length of database: 130,592,851
effective search space: 15148770716
effective search space used: 15148770716
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (27.3 bits)