BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 025695
(249 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
Length = 339
Score = 234 bits (597), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
Length = 339
Score = 234 bits (596), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
Length = 339
Score = 233 bits (595), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 159/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
Length = 339
Score = 232 bits (592), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 117/246 (47%), Positives = 157/246 (63%), Gaps = 16/246 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 90 WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
C+GGYP AW ++ G+V+ C PY S P C TPKC +
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNK 209
Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH
Sbjct: 210 MCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GDVMGGHA++++GWG ++G YW++AN WN WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAG 328
Query: 228 LPSSKN 233
+P ++
Sbjct: 329 IPRTQQ 334
>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
Length = 335
Score = 227 bits (578), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 114/248 (45%), Positives = 158/248 (63%), Gaps = 18/248 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CI +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C ++ KH+ S+Y I+ + ++IMAEIYKNGPVE +FTVY DF YKSGVY+
Sbjct: 209 KICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TGD+MGGHA++++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327
Query: 227 GLPSSKNL 234
G+P + +
Sbjct: 328 GIPCTPHF 335
>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
Length = 339
Score = 226 bits (575), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 114/244 (46%), Positives = 156/244 (63%), Gaps = 16/244 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 90 WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
C+GGYP AW ++ G+V+ C PY S P C TP+C +
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEGDTPRCNK 209
Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + ++ KH+ ++Y +++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH
Sbjct: 210 SCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GD+MGGHA++++GWG ++G YW+ AN WN WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAG 328
Query: 228 LPSS 231
+P +
Sbjct: 329 IPRT 332
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
GN=cpr-5 PE=2 SV=1
Length = 344
Score = 225 bits (573), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 118/236 (50%), Positives = 145/236 (61%), Gaps = 22/236 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCG--FLCGDGCDGGYPISAW 70
Q CGSCWAF A EA+SDR CI + +N LS DLL+CC F CG+GC+GGYPI AW
Sbjct: 104 QSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAW 163
Query: 71 RYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKCVRKCVKKNQL 116
+++V HG+VT C PY + G P C E PTPKCV C KN
Sbjct: 164 KWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNY 223
Query: 117 ---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ KH+ +AY + E I EI NGP+EV+FTVYEDF Y +GVY H G +
Sbjct: 224 ATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASL 283
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GGHAVK++GWG D+G YW++AN WN +WG GYF+I RG NECGIE VAG+P
Sbjct: 284 GGHAVKILGWGV-DNGTPYWLVANSWNVAWGEKGYFRIIRGLNECGIEHSAVAGIP 338
>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
Length = 340
Score = 224 bits (570), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 114/233 (48%), Positives = 152/233 (65%), Gaps = 21/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR C+H +S+ V+ DLL+CCGF CG GC+GGYP AWRY
Sbjct: 102 QGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRY 161
Query: 73 FVHHGVVTEECDPYFDSTGC---SHPGCE------------PAYPTPKCVRKCVKK-NQL 116
+ G+V+ Y GC + P CE TP+C R C +
Sbjct: 162 WTERGLVSGGL--YDSHVGCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPS 219
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
++ KHY I++Y + ++IMAEIYKNGPVE +F VYEDF YKSGVY+H++G+ +GGH
Sbjct: 220 YKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGH 279
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
A++++GWG ++G YW+ AN WN WG G+FKI RG + CGIE ++VAG+P
Sbjct: 280 AIRILGWGV-ENGTPYWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAGVP 331
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
GN=cpr-6 PE=1 SV=1
Length = 379
Score = 218 bits (554), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 117/249 (46%), Positives = 153/249 (61%), Gaps = 24/249 (9%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGAVEA+SDR CI H + ++LS +DLL+CC CG GC+GG
Sbjct: 119 DSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-CGFGCNGG 177
Query: 65 YPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVR 108
P++AWRY+V G+VT Y + GC P CE YPTPKC +
Sbjct: 178 DPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEK 235
Query: 109 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
KCV ++ + K + SAY + D E I E+ +GP+E++F VYEDF +Y GVY
Sbjct: 236 KCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYV 295
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H G + GGHAVKLIGWG DDG YW +AN WN WG DG+F+I RG +ECGIE VV
Sbjct: 296 HTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVG 354
Query: 227 GLPSSKNLV 235
G+P +L
Sbjct: 355 GIPKLNSLT 363
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
PE=1 SV=2
Length = 329
Score = 213 bits (542), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 110/230 (47%), Positives = 143/230 (62%), Gaps = 12/230 (5%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGA E +SDR CI +S +DLL+CCG CG+GC+GG
Sbjct: 99 KSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 158
Query: 65 YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
YPI A R++ GVVT C PY + C+ C P TP C C + +
Sbjct: 159 YPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTPSCSMSCQSGYSTAY 216
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KH+ +SAY + + I AEIY NGPVE +F+VYEDF YKSGVYKH G +GGHA
Sbjct: 217 AKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHA 276
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWGT + G YW++AN W +WG G+FKI RG ++CGIE VVAG
Sbjct: 277 IKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAVVAG 325
>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
GN=AC-2 PE=2 SV=1
Length = 342
Score = 211 bits (536), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 107/237 (45%), Positives = 145/237 (61%), Gaps = 19/237 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA A+SDR CI +++S D++ CC CGDGC+GG+PI AW+Y
Sbjct: 108 QANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
F++ GVV+ + C PY C H G C PTP C RKC +++
Sbjct: 168 FIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMY 226
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
R K Y AY + + I +EI KNGPV SF VYEDF HYKSG+YKH G++ G HA
Sbjct: 227 RIDKRYGKDAYIVKQSVKAIQSEILKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
VK+IGWG +++ D+W++AN W+ WG GYF+I RGSN+CGIE + AG+ +++L
Sbjct: 287 VKMIGWG-NENNTDFWLIANSWHNDWGEKGYFRIVRGSNDCGIEGTIAAGIVDTESL 342
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
SV=1
Length = 340
Score = 210 bits (534), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 111/229 (48%), Positives = 144/229 (62%), Gaps = 18/229 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCW+FGAVEA+SDR CI G N+ LS DLL CC CG GC+GG AW Y
Sbjct: 111 QSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCES-CGLGCEGGILGPAWDY 169
Query: 73 FVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C+PY T +P C Y TP+C + C +K + +
Sbjct: 170 WVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYT 229
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH S+Y + +D + I EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+
Sbjct: 230 QDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAI 289
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
++IGWG ++ YW++AN WN WG +GYF+I RG +EC IE +V+AG
Sbjct: 290 RIIGWGV-ENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVIAG 337
>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
GN=AC-1 PE=2 SV=1
Length = 342
Score = 209 bits (532), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/237 (44%), Positives = 145/237 (61%), Gaps = 19/237 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA A+SDR CI +++S D++ CC CGDGC+GG+PI AW+Y
Sbjct: 108 QANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
F++ GVV+ + C PY C H G C PTP C RKC +++
Sbjct: 168 FIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMY 226
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
R K Y AY + + I +EI +NGPV SF VYEDF HYKSG+YKH G++ G HA
Sbjct: 227 RIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
VK+IGWG +++ D+W++AN W+ WG GYF+I RG+N+CGIE + AG+ +++L
Sbjct: 287 VKMIGWG-NENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAGIVDTESL 342
>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
GN=cpr-4 PE=2 SV=1
Length = 335
Score = 204 bits (520), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 115/233 (49%), Positives = 146/233 (62%), Gaps = 20/233 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF A EA SDRFCI + +N LS D+L+CC CG GC+GGYPI+AW+Y
Sbjct: 103 QSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSN-CGYGCEGGYPINAWKY 161
Query: 73 FVHHGVVTEE-------CDPYF-----DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQ--L 116
V G T C PY ++ G + P C + Y TP CV KC KN
Sbjct: 162 LVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVA 221
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KH+ +AY + I AEI +GPVE +FTVYEDF YK+GVY H TG +GGH
Sbjct: 222 YTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGGH 281
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
A++++GWGT D+G YW++AN WN +WG +GYF+I RG+NECGIE VV G+P
Sbjct: 282 AIRILGWGT-DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVP 333
>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
GN=cpr-3 PE=2 SV=1
Length = 370
Score = 204 bits (519), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 113/250 (45%), Positives = 149/250 (59%), Gaps = 22/250 (8%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
++++ Q CGSCWAFGA E +SDR CI +SV D+L+CCG CG GC GGY
Sbjct: 108 IKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYS 167
Query: 67 ISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWR 118
I A R++ G VT C PY S C P TP C C K + ++
Sbjct: 168 IEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTPSCKTTCQSSYKTEEYK 224
Query: 119 NSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
KHY SAY++ + +I EIY GPVE S+ VYEDF HYKSGVY + +G ++GGH
Sbjct: 225 KDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGH 284
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
AVK+IGWG ++G DYW++AN W S+G G+FKI+RG+NEC IE +VVAG + K
Sbjct: 285 AVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAG------IAK 337
Query: 237 EITSADMFED 246
T ++ +ED
Sbjct: 338 LGTHSETYED 347
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
GN=CATB PE=2 SV=1
Length = 342
Score = 202 bits (514), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 105/230 (45%), Positives = 140/230 (60%), Gaps = 18/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA++DR CI G + LS DL++CC CGDGC GG+P AW Y
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKD-CGDGCQGGFPGVAWDY 170
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C PY T +P C Y TP+C + C K + +
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYE 230
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY +Y + ++ + I +I GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+
Sbjct: 231 QDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAI 290
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
++IGWG + YW++AN WN WG G F++ RG +EC IE DVVAGL
Sbjct: 291 RIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRMVRGRDECSIESDVVAGL 339
>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
Length = 335
Score = 196 bits (497), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 111/243 (45%), Positives = 158/243 (65%), Gaps = 18/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KH+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 209 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G++MGGHA++++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327
Query: 227 GLP 229
G+P
Sbjct: 328 GMP 330
>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
Length = 311
Score = 195 bits (495), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 108/223 (48%), Positives = 132/223 (59%), Gaps = 20/223 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAFGA E+ +DR CIH N+ LS D++ C +GC+GG SAW +
Sbjct: 101 QARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDE--TDNGCEGGDAFSAWNWLR 158
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRNSKHYSIS 126
G V+EEC PY + P C PA TP C ++C + L + KH
Sbjct: 159 KQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQDKHKMAK 212
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
Y +SD E IM EI NGPVE FTV+EDF YKSGVY H TG +GGH VKL+G+GT
Sbjct: 213 IYSFDSD-EAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVKLVGFGTL 271
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+G DY+ NQW SWG +G F IKRG +CGI +DVVAGLP
Sbjct: 272 -NGVDYYAANNQWTTSWGDNGTFLIKRG--DCGISDDVVAGLP 311
>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
GN=CP-1 PE=3 SV=3
Length = 341
Score = 190 bits (482), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 102/229 (44%), Positives = 138/229 (60%), Gaps = 19/229 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA + A+SDR CI + +S D+++CC + CGDGC+GG+PISA+R+
Sbjct: 113 QANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVVSCCTW-CGDGCEGGWPISAFRF 171
Query: 73 FVHHGVVTE-------ECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKNQLWR 118
GVVT C PY + C H G E Y TP+C R+C+
Sbjct: 172 HADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGECVGMADTPRCKRRCLLGYPKSY 230
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
S Y AY++ + + I +I KNGPV ++TVYEDFAHY+SG+YKH G G HAV
Sbjct: 231 PSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAV 290
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
K+IGWG + G YWI+AN W+ WG +G+F++ RGSN+CG EE + AG
Sbjct: 291 KVIGWG-EEKGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 338
>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
Length = 299
Score = 162 bits (410), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/221 (42%), Positives = 123/221 (55%), Gaps = 19/221 (8%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
+V QG CGSCWAF +V ++ DR C G++ + S +++C GD CDGG+
Sbjct: 91 VVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDMACDGGWLP 146
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
S WR+ G T+EC PY G A T C KC + L K
Sbjct: 147 SVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHLYKATKAVD 197
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + D IM + GP++ +FTVY DF +Y+SGVY+H G V GGHAV ++G+GT D
Sbjct: 198 YGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDMVGYGTDD 255
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
DG DYWI+ N W WG DGYF+I R +NECGIEE V+ G
Sbjct: 256 DGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 296
>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
Length = 300
Score = 146 bits (368), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 84/221 (38%), Positives = 122/221 (55%), Gaps = 19/221 (8%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
+V QG CGSCWAF +V DR C+ G++ + S +++C GD C+GG+
Sbjct: 92 VVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSCDH---GDMACNGGWLP 147
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
+ W++ G T+EC PY + C PT KC + + S
Sbjct: 148 NVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHLATATSYKD 198
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + D +M + +GP++V+F V+ DF +Y+SGVY+H G + GGHAV+++G+GT D
Sbjct: 199 YGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDD 256
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
DG DYWI+ N W WG DGYF++ RG N+C IEE AG
Sbjct: 257 DGVDYWIIKNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297
>sp|Q06544|CYSP3_OSTOS Cathepsin B-like cysteine proteinase 3 (Fragment) OS=Ostertagia
ostertagi GN=CP-3 PE=3 SV=1
Length = 174
Score = 145 bits (365), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 79/175 (45%), Positives = 104/175 (59%), Gaps = 17/175 (9%)
Query: 69 AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 114
AW+YF GVVT C PY + C G EP Y TPKC + C +
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59
Query: 115 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ ++ KH+ SAYR+ ++ + I +I KNGPV F VYEDFAHYKSG+YKH G +
Sbjct: 60 LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
GGHAVK+IGWG + G YW++AN W+ WG G++++ RG N C IEE V AG+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173
>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
SV=3
Length = 476
Score = 143 bits (360), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V EDF HYK+G+Y+H+T +
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
SV=1
Length = 476
Score = 141 bits (355), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 87/251 (34%), Positives = 124/251 (49%), Gaps = 36/251 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKK-RHGCNSGSVDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPCPNSIEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V+EDF +YK+G+Y+HIT
Sbjct: 353 --PPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAW--- 467
Query: 232 KNLVKEITSAD 242
++TSAD
Sbjct: 468 ----GQLTSAD 474
>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
Length = 303
Score = 138 bits (348), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/216 (38%), Positives = 115/216 (53%), Gaps = 19/216 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
QG CGSCWAF A+ DR C G++ +S S L++C L GCDGG W
Sbjct: 99 QGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFGCDGGDFQPTWS 155
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
+ G T EC Y D G A P P QL++ + +S
Sbjct: 156 FLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAHGYGQVS----K 204
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGE 190
S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++++G+GT+DDG
Sbjct: 205 SVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGT 263
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
DYWI+ N W WG +GYF+I RG NEC IE+++ A
Sbjct: 264 DYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299
>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
SV=1
Length = 435
Score = 128 bits (321), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 80/228 (35%), Positives = 118/228 (51%), Gaps = 28/228 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC+AF + L R I + LS ++++C + GC+GG+P + A +
Sbjct: 225 QASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQY--AQGCEGGFPYLIAGK 282
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y G+V E C PY G P C+P C R + +S++Y + +
Sbjct: 283 YAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR--------YYSSEYYYVGGFYGA 326
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ ++GP+ V+F VY+DF HY+ G+Y H + HAV L+G+GT
Sbjct: 327 CNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGT 386
Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
S G DYWI+ N W WG DGYF+I+RG++EC IE VA P K
Sbjct: 387 DSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAATPIPK 434
>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
Length = 463
Score = 127 bits (320), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F +V L R I + + LS ++++C + GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSSQEVVSCSQY--AQGCEGGFPYLTAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY++G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W SWG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>sp|Q5RB02|CATC_PONAB Dipeptidyl peptidase 1 OS=Pongo abelii GN=CTSC PE=2 SV=1
Length = 463
Score = 124 bits (311), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
elegans GN=F26E4.3 PE=1 SV=3
Length = 452
Score = 123 bits (309), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 112/226 (49%), Gaps = 16/226 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGS W+ SDR I +N +LS LL+C GC+GGY AW Y
Sbjct: 204 QGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLSCNQHR-QKGCEGGYLDRAWWY 262
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA-YRIN 131
GVV + C PY S PG R+ ++ ++S + ++ Y+++
Sbjct: 263 IRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYTNRQGLRCPSGSQDSTAFKMTPPYKVS 321
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--------ITGDVMGGHAVKLIGW 183
S EDI E+ NGPV+ +F V+EDF Y GVY+H + G H+V+++GW
Sbjct: 322 SREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGW 381
Query: 184 G---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G ++ YW+ AN W WG DGYFK+ RG N C IE V+
Sbjct: 382 GVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRGENHCEIESFVIG 427
>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
GN=Tinagl1 PE=1 SV=1
Length = 466
Score = 123 bits (308), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 112/239 (46%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 222 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWF 280
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
GVV++ C P+ A PTP+C+ R+ + Q+ N
Sbjct: 281 LRRRGVVSDNCYPFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSN 334
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ AYR+ SD ++IM E+ +NGPV+ V+EDF Y+ G+Y H
Sbjct: 335 DIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYR 394
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W WG G+F+I RG+NEC IE V+
Sbjct: 395 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 453
>sp|Q3ZCJ8|CATC_BOVIN Dipeptidyl peptidase 1 OS=Bos taurus GN=CTSC PE=2 SV=1
Length = 463
Score = 122 bits (307), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 119/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
QG CGSC++F ++ + R I + LS ++++C + GC+GG+P + A +
Sbjct: 252 QGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E+C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEDCFPY---TGTDSP--------------CRLKEGCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ GP+ V+F VY+DF HY+ GVY H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT + G DYWI+ N W SWG +GYF+I+RG++EC IE +A P K
Sbjct: 413 GTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 462
>sp|P53634|CATC_HUMAN Dipeptidyl peptidase 1 OS=Homo sapiens GN=CTSC PE=1 SV=2
Length = 463
Score = 122 bits (307), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
GN=TINAGL1 PE=1 SV=1
Length = 467
Score = 120 bits (301), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 281
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ N N+ Y ++
Sbjct: 282 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 341
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 342 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 401
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 402 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454
>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
GN=Tinagl1 PE=2 SV=1
Length = 467
Score = 120 bits (300), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 83/239 (34%), Positives = 112/239 (46%), Gaps = 33/239 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 222 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QKGCRGGRLDGAWWF 280
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
GVV++ C P+ + A PTP+C+ R+ + +Q+ N
Sbjct: 281 LRRRGVVSDNCYPF-----SGREQNDEASPTPRCMMHSRAMGRGKRQATSRCPNSQVDSN 335
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ YR+ SD ++IM E+ +NGPV+ V+EDF Y+ G+Y H
Sbjct: 336 DIYQVTPVYRLASDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYR 395
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W WG G+F+I RG NEC IE V+
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIETFVLG 454
>sp|P80067|CATC_RAT Dipeptidyl peptidase 1 OS=Rattus norvegicus GN=Ctsc PE=1 SV=3
Length = 462
Score = 120 bits (300), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 117/228 (51%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GCDGG+P + A +
Sbjct: 251 QESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGK 308
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y GVV E C PY + P P C+R + +S++Y + +
Sbjct: 309 YAQDFGVVEENCFPYTATDA-------PCKPKENCLR--------YYSSEYYYVGGFYGG 353
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ K+GP+ V+F V++DF HY SG+Y H + HAV L+G+G
Sbjct: 354 CNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGK 413
Query: 186 SD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
G DYWI+ N W WG GYF+I+RG++EC IE +A +P K
Sbjct: 414 DPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 461
>sp|P97821|CATC_MOUSE Dipeptidyl peptidase 1 OS=Mus musculus GN=Ctsc PE=2 SV=1
Length = 462
Score = 118 bits (295), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 116/228 (50%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GCDGG+P + A +
Sbjct: 251 QESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGK 308
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y GVV E C PY P P C+R + +S +Y + +
Sbjct: 309 YAQDFGVVEESCFPYTAKDS-------PCKPRENCLR--------YYSSDYYYVGGFYGG 353
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ K+GP+ V+F V++DF HY SG+Y H + HAV L+G+G
Sbjct: 354 CNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGR 413
Query: 186 SD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
G +YWI+ N W +WG GYF+I+RG++EC IE VA +P K
Sbjct: 414 DPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIPIPK 461
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 108 bits (271), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 99/218 (45%), Gaps = 40/218 (18%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q HCGSCW F AL + G N+SLS L+ C G GC+GG P A+ Y
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221
Query: 75 HHGVV-TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
++G + TEE PY G H E N+ + + I +
Sbjct: 222 YNGGIDTEESYPYKGVNGVCHYKAE--------------------NAAVQVLDSVNITLN 261
Query: 134 PEDIMAEIYKNG-----PVEVSFTVYEDFAHYKSGVYKHITGDVMG------GHAVKLIG 182
ED + KN PV V+F V + F YKSGVY T D G HAV +G
Sbjct: 262 AEDEL----KNAVGLVRPVSVAFQVIDGFRQYKSGVY---TSDHCGTTPDDVNHAVLAVG 314
Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
+G ++G YW++ N W WG +GYFK++ G N C I
Sbjct: 315 YGV-ENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAI 351
>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
Length = 333
Score = 108 bits (270), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 76/214 (35%), Positives = 100/214 (46%), Gaps = 32/214 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QG CGSCW F AL I G LSL+ L+ C GC GG P A+ Y +
Sbjct: 133 QGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYIL 192
Query: 75 HH-GVVTEECDPYF--DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
++ G++ E+ PY DS+ +P A+ V+ V I
Sbjct: 193 YNKGIMEEDSYPYIGKDSSCRFNPQKAVAF-----VKNVV-----------------NIT 230
Query: 132 SDPEDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYK----HITGDVMGGHAVKLIGWGTS 186
+ E M E + PV +F V EDF YKSGVY H T D + HAV +G+G
Sbjct: 231 LNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKSCHKTPDKV-NHAVLAVGYG-E 288
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
+G YWI+ N W WG +GYF I+RG N CG+
Sbjct: 289 QNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGL 322
>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
Length = 333
Score = 107 bits (268), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 73/212 (34%), Positives = 96/212 (45%), Gaps = 28/212 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QG CGSCW F AL I G ++L+ L+ C GC GG P A+ Y +
Sbjct: 133 QGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIL 192
Query: 75 HH-GVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
++ G++ E+ PY G E A K V I +
Sbjct: 193 YNKGIMGEDSYPYIGKNGQCKFNPEKAVAFVKNV--------------------VNITLN 232
Query: 134 PEDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYK----HITGDVMGGHAVKLIGWGTSDD 188
E M E + PV +F V EDF YKSGVY H T D + HAV +G+G +
Sbjct: 233 DEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKV-NHAVLAVGYG-EQN 290
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
G YWI+ N W +WG +GYF I+RG N CG+
Sbjct: 291 GLLYWIVKNSWGSNWGNNGYFLIERGKNMCGL 322
>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
PE=2 SV=1
Length = 358
Score = 106 bits (265), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 74/213 (34%), Positives = 97/213 (45%), Gaps = 30/213 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QGHCGSCW F AL + FG +SLS L+ C G GC GG P A+ Y
Sbjct: 159 QGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIK 218
Query: 75 HHGVV-TEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
++G + TEE PY G GC+ A VR V + +++ R
Sbjct: 219 YNGGLDTEEAYPYTGKDG----GCKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR--- 271
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-----GDVMGGHAVKLIGWGTSD 187
PV V+F V +F YK GV+ T DV HAV +G+G D
Sbjct: 272 -------------PVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDV--NHAVLAVGYGVED 316
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
D YW++ N W WG +GYFK++ G N CG+
Sbjct: 317 D-VPYWLIKNSWGGEWGDNGYFKMEMGKNMCGV 348
>sp|Q9R1T3|CATZ_RAT Cathepsin Z OS=Rattus norvegicus GN=Ctsz PE=1 SV=2
Length = 306
Score = 105 bits (263), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 102/231 (44%), Gaps = 28/231 (12%)
Query: 14 IQGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAW 70
I +CGSCWA G+ AL+DR I + LSV +++ C C+GG + W
Sbjct: 87 IPQYCGSCWAHGSTSALADRINIKRKGAWPSTLLSVQNVIDCGN---AGSCEGGNDLPVW 143
Query: 71 RYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV--RKC--VKKNQLWRNSKHYSIS 126
Y HG+ E C+ Y C+ C ++C ++ LWR + S+S
Sbjct: 144 EYAHKHGIPDETCNNY----QAKDQECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLS 199
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
E +MAEIY NGP+ E ++Y G+Y + H + + GWG S
Sbjct: 200 GR------EKMMAEIYANGPISCGIMATERMSNYTGGIYTEYQNQAIINHIISVAGWGVS 253
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
+DG +YWI+ N W WG G+ +I + G SS NL E
Sbjct: 254 NDGIEYWIVRNSWGEPWGERGWMRI--------VTSTYKGGTGSSYNLAIE 296
>sp|Q26563|CATC_SCHMA Cathepsin C OS=Schistosoma mansoni PE=2 SV=1
Length = 454
Score = 104 bits (259), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 106/221 (47%), Gaps = 31/221 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
QG CGSC+A + AL R + +F LS ++ C + +GC+GG+P + A +
Sbjct: 241 QGICGSCYASPSAAALEARIRLVSNFSEQPILSPQTVVDCSPY--SEGCNGGFPFLIAGK 298
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS-ISAYRI 130
Y G+ + PY TG T KC V KN + YS I Y
Sbjct: 299 YGEDFGLPQKIVIPY---TGED---------TGKCT---VSKNCTRYYTTDYSYIGGYYG 343
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---------MGGHAVKLI 181
++ + + E+ NGP V F VYEDF YK G+Y H T + HAV L+
Sbjct: 344 ATNEKLMQLELISNGPFPVGFEVYEDFQFYKEGIYHHTTVQTDHYNFNPFELTNHAVLLV 403
Query: 182 GWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
G+G GE YW + N W WG GYF+I RG++ECG+E
Sbjct: 404 GYGVDKLSGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVE 444
>sp|Q9WUU7|CATZ_MOUSE Cathepsin Z OS=Mus musculus GN=Ctsz PE=2 SV=1
Length = 306
Score = 103 bits (258), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 96/205 (46%), Gaps = 20/205 (9%)
Query: 14 IQGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAW 70
I +CGSCWA G+ A++DR I ++ LSV +++ C C+GG + W
Sbjct: 87 IPQYCGSCWAHGSTSAMADRINIKRKGAWPSILLSVQNVIDCGN---AGSCEGGNDLPVW 143
Query: 71 RYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV--RKC--VKKNQLWRNSKHYSIS 126
Y HG+ E C+ Y C+ C ++C ++ LWR + S+S
Sbjct: 144 EYAHKHGIPDETCNNY----QAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLS 199
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
E +MAEIY NGP+ E ++Y G+Y + H + + GWG S
Sbjct: 200 GR------EKMMAEIYANGPISCGIMATEMMSNYTGGIYAEHQDQAVINHIISVAGWGVS 253
Query: 187 DDGEDYWILANQWNRSWGADGYFKI 211
+DG +YWI+ N W WG G+ +I
Sbjct: 254 NDGIEYWIVRNSWGEPWGEKGWMRI 278
>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
Length = 360
Score = 102 bits (253), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 72/211 (34%), Positives = 97/211 (45%), Gaps = 26/211 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QGHCGSCW F AL + G +SLS L+ C GC+GG P A+ Y
Sbjct: 161 QGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIK 220
Query: 75 HHGVV-TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
++G + TEE PY G C KN+ N + + I
Sbjct: 221 YNGGLDTEESYPYQGVNGI-----------------CKFKNE---NVGVKVLDSVNITLG 260
Query: 134 PEDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYKHI---TGDVMGGHAVKLIGWGTSDDG 189
ED + + + PV V+F V F YKSGVY T + HAV +G+G +DG
Sbjct: 261 AEDELKDAVGLVRPVSVAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGV-EDG 319
Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGI 220
YW++ N W WG +GYFK++ G N CG+
Sbjct: 320 VPYWLIKNSWGADWGDEGYFKMEMGKNMCGV 350
>sp|Q9UBR2|CATZ_HUMAN Cathepsin Z OS=Homo sapiens GN=CTSZ PE=1 SV=1
Length = 303
Score = 99.8 bits (247), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/203 (32%), Positives = 91/203 (44%), Gaps = 17/203 (8%)
Query: 14 IQGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAW 70
I +CGSCWA + A++DR I + LSV +++ C C+GG +S W
Sbjct: 85 IPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGN---AGSCEGGNDLSVW 141
Query: 71 RYFVHHGVVTEECDPYF--DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
Y HG+ E C+ Y D C +C ++ LWR + S+S
Sbjct: 142 DYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEFKEC--HAIRNYTLWRVGDYGSLSGR 199
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
E +MAEIY NGP+ E A+Y G+Y H V + GWG S D
Sbjct: 200 ------EKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYINHVVSVAGWGIS-D 252
Query: 189 GEDYWILANQWNRSWGADGYFKI 211
G +YWI+ N W WG G+ +I
Sbjct: 253 GTEYWIVRNSWGEPWGERGWLRI 275
>sp|P22497|CYSP_THEPA Cysteine proteinase OS=Theileria parva GN=TP03_0285 PE=3 SV=2
Length = 440
Score = 99.4 bits (246), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 101/211 (47%), Gaps = 32/211 (15%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q +CG CWAF V ++ + HF + LSV +LL C F +GC GG SA+ Y
Sbjct: 247 QSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCDSF--SNGCQGGLLESAYEYVR 304
Query: 75 HHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
+G+V+ + P+ D + CS P +K S+ +Y +
Sbjct: 305 KYGLVSAKDLPFVDKARRCSVP-----------------------KAKKVSVPSYHVFKG 341
Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGEDY 192
E +M + P V +V + A YKSGV+ G + HAV L+G G + + Y
Sbjct: 342 KE-VMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECGKSL-NHAVVLVGEGYDEVTKKRY 399
Query: 193 WILANQWNRSWGADGYFKIKR---GSNECGI 220
W++ N W WG +GY +++R G+++CG+
Sbjct: 400 WVVQNSWGTDWGENGYMRLERTNMGTDKCGV 430
>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
Length = 358
Score = 99.4 bits (246), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 99/210 (47%), Gaps = 24/210 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QG CGSCW F AL + FG +SLS L+ C G GC+GG P A+ Y
Sbjct: 159 QGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIK 218
Query: 75 HHGVV-TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
+G + TE+ PY TG T K + V L NS + ++ A
Sbjct: 219 SNGGLDTEKAYPY---TGKDE--------TCKFSAENVGVQVL--NSVNITLGA------ 259
Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY--KHITGDVMG-GHAVKLIGWGTSDDGE 190
+++ + PV ++F V F YKSGVY H M HAV +G+G +DG
Sbjct: 260 EDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGV-EDGV 318
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGI 220
YW++ N W WG GYFK++ G N CGI
Sbjct: 319 PYWLIKNSWGADWGDKGYFKMEMGKNMCGI 348
>sp|P05689|CATZ_BOVIN Cathepsin Z OS=Bos taurus GN=CTSZ PE=2 SV=2
Length = 304
Score = 99.0 bits (245), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 72/233 (30%), Positives = 102/233 (43%), Gaps = 33/233 (14%)
Query: 14 IQGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDG--CDGGYPIS 68
I +CGSCWA G+ A++DR I + LSV ++ C GD C+GG +
Sbjct: 86 IPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDC-----GDAGSCEGGNDLP 140
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV--RKC--VKKNQLWRNSKHYS 124
W Y HG+ E C+ Y C+ C ++C +K LW+ + S
Sbjct: 141 VWEYAHRHGIPDETCNNY----QAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGS 196
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
+S E +MAEIY NGP+ E ++Y G+Y H V + GWG
Sbjct: 197 LSGR------EKMMAEIYTNGPISCGIMATEKMSNYTGGIYSEYNDQAFINHIVSVAGWG 250
Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG--------IEEDVVAGLP 229
S DG +YWI+ N W WG G+ +I + + G IEE G P
Sbjct: 251 VS-DGMEYWIVRNSWGEPWGEHGWMRIVTSTYKGGEGARYNLAIEESCTFGDP 302
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 98.2 bits (243), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 98/211 (46%), Gaps = 25/211 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QGHCGSCWAF + AL + G+ +SLS +L+ C +GC+GG +A+RY
Sbjct: 172 QGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 231
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSD 133
+G + E +YP C K + + ++ I
Sbjct: 232 DNGGID----------------TEKSYPYEAIDDSCHFNKGTVGATDRGFT----DIPQG 271
Query: 134 PEDIMAE-IYKNGPVEVSFTV-YEDFAHYKSGVYKHITGDVMG-GHAVKLIGWGTSDDGE 190
E MAE + GPV V+ +E F Y GVY D H V ++G+GT + GE
Sbjct: 272 DEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGE 331
Query: 191 DYWILANQWNRSWGADGYFKIKRGS-NECGI 220
DYW++ N W +WG G+ K+ R N+CGI
Sbjct: 332 DYWLVKNSWGTTWGDKGFIKMLRNKENQCGI 362
>sp|Q3T0I2|CATH_BOVIN Pro-cathepsin H OS=Bos taurus GN=CTSH PE=2 SV=1
Length = 335
Score = 97.8 bits (242), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 95/211 (45%), Gaps = 26/211 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QG CGSCW F AL I G L+ L+ C GC GG P A+ Y
Sbjct: 135 QGSCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIR 194
Query: 75 HH-GVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
++ G++ E+ PY G C + K ++ + ++ +D
Sbjct: 195 YNKGIMGEDTYPYRGQDG-------------DCKYQPSKAIAFVKDVANITL------ND 235
Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK----HITGDVMGGHAVKLIGWGTSDDG 189
E ++ + + PV +F V DF Y+ G+Y H T D + HAV +G+G + G
Sbjct: 236 EEAMVEAVALHNPVSFAFEVTADFMMYRKGIYSSTSCHKTPDKV-NHAVLAVGYG-EEKG 293
Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGI 220
YWI+ N W +WG GYF I+RG N CG+
Sbjct: 294 IPYWIVKNSWGPNWGMKGYFLIERGKNMCGL 324
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.138 0.462
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 108,691,747
Number of Sequences: 539616
Number of extensions: 4947316
Number of successful extensions: 9354
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 208
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 8699
Number of HSP's gapped (non-prelim): 274
length of query: 249
length of database: 191,569,459
effective HSP length: 115
effective length of query: 134
effective length of database: 129,513,619
effective search space: 17354824946
effective search space used: 17354824946
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (27.7 bits)