BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 027054
         (229 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
          Length = 339

 Score =  197 bits (502), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
          Length = 339

 Score =  197 bits (501), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
           GN=cpr-5 PE=2 SV=1
          Length = 344

 Score =  197 bits (501), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 102/208 (49%), Positives = 126/208 (60%), Gaps = 20/208 (9%)

Query: 21  NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS-- 69
           N  LS  DLL+CC   F CG+GC+GGYPI AW+++V HG+VT         C PY  +  
Sbjct: 132 NTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPC 191

Query: 70  ----TGCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEI 121
                G   P C E   PTPKCV  C  KN     +   KH+  +AY +    E I  EI
Sbjct: 192 GETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEI 251

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
             NGP+EV+FTVYEDF  Y +GVY H  G  +GGHAVK++GWG  D+G  YW++AN WN 
Sbjct: 252 LTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNV 310

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
           +WG  GYF+I RG NECGIE   VAG+P
Sbjct: 311 AWGEKGYFRIIRGLNECGIEHSAVAGIP 338


>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
          Length = 339

 Score =  197 bits (500), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 145/227 (63%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +   DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
          Length = 339

 Score =  194 bits (494), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 96/206 (46%), Positives = 130/206 (63%), Gaps = 14/206 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189

Query: 69  STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
               S P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
           +FKI RG N CGIE ++VAG+P ++ 
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRTQQ 334


>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
          Length = 335

 Score =  192 bits (489), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 94/208 (45%), Positives = 132/208 (63%), Gaps = 16/208 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C       ++  KH+  S+Y I+ + ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVY DF  YKSGVY+H+TGD+MGGHA++++GWG  ++G  YW++ N WN  WG +
Sbjct: 249 VEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNL 214
           G+FKI RG + CGIE ++VAG+P + + 
Sbjct: 308 GFFKILRGQDHCGIESEIVAGIPCTPHF 335


>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
           GN=AC-2 PE=2 SV=1
          Length = 342

 Score =  189 bits (481), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 94/210 (44%), Positives = 131/210 (62%), Gaps = 17/210 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + +++S  D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C
Sbjct: 135 KQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPC 193

Query: 73  SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H G       C    PTP C RKC     +++R  K Y   AY +    + I +EI KN
Sbjct: 194 GHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKN 253

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  SF VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W++AN W+  WG
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWG 312

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
             GYF+I RGSN+CGIE  + AG+  +++L
Sbjct: 313 EKGYFRIVRGSNDCGIEGTIAAGIVDTESL 342


>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
          Length = 339

 Score =  189 bits (480), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 93/204 (45%), Positives = 129/204 (63%), Gaps = 14/204 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH 189

Query: 69  STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
               S P C     TP+C + C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPV 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV+ DF  YKSGVYKH  GD+MGGHA++++GWG  ++G  YW+ AN WN  WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
           +FKI RG N CGIE ++VAG+P +
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRT 332


>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
           GN=AC-1 PE=2 SV=1
          Length = 342

 Score =  188 bits (477), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 92/210 (43%), Positives = 131/210 (62%), Gaps = 17/210 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + +++S  D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C
Sbjct: 135 KQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPC 193

Query: 73  SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H G       C    PTP C RKC     +++R  K Y   AY +    + I +EI +N
Sbjct: 194 GHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRN 253

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  SF VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W++AN W+  WG
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWG 312

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
             GYF+I RG+N+CGIE  + AG+  +++L
Sbjct: 313 EKGYFRIIRGTNDCGIEGTIAAGIVDTESL 342


>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
          Length = 340

 Score =  187 bits (474), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 94/205 (45%), Positives = 130/205 (63%), Gaps = 19/205 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGC 77
           ++ +S  DLL+CCGF CG GC+GGYP  AWRY+   G+V+     Y    GC   + P C
Sbjct: 130 SVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGL--YDSHVGCRAYTIPPC 187

Query: 78  E------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           E                TP+C R C    +  ++  KHY I++Y +    ++IMAEIYKN
Sbjct: 188 EHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKN 247

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +F VYEDF  YKSGVY+H++G+ +GGHA++++GWG  ++G  YW+ AN WN  WG
Sbjct: 248 GPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWG 306

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             G+FKI RG + CGIE ++VAG+P
Sbjct: 307 ITGFFKILRGEDHCGIESEIVAGVP 331


>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
           GN=cpr-4 PE=2 SV=1
          Length = 335

 Score =  186 bits (472), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 98/205 (47%), Positives = 127/205 (61%), Gaps = 18/205 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N  LS  D+L+CC   CG GC+GGYPI+AW+Y V  G  T         C PY      +
Sbjct: 131 NTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGE 189

Query: 69  STG-CSHPGC-EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           + G  + P C +  Y TP CV KC  KN    +   KH+  +AY +      I AEI  +
Sbjct: 190 TVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAH 249

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVYEDF  YK+GVY H TG  +GGHA++++GWGT D+G  YW++AN WN +WG
Sbjct: 250 GPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVNWG 308

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +GYF+I RG+NECGIE  VV G+P
Sbjct: 309 ENGYFRIIRGTNECGIEHAVVGGVP 333


>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
           PE=1 SV=2
          Length = 329

 Score =  184 bits (468), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 95/195 (48%), Positives = 121/195 (62%), Gaps = 10/195 (5%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +S +DLL+CCG  CG+GC+GGYPI A R++   GVVT        C PY     C+
Sbjct: 134 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPY-PIAPCT 192

Query: 74  HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
              C P   TP C   C    +  +   KH+ +SAY +  +   I AEIY NGPVE +F+
Sbjct: 193 SGNC-PESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFS 251

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           VYEDF  YKSGVYKH  G  +GGHA+K+IGWGT + G  YW++AN W  +WG  G+FKI 
Sbjct: 252 VYEDFYKYKSGVYKHTAGKYLGGHAIKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIY 310

Query: 193 RGSNECGIEEDVVAG 207
           RG ++CGIE  VVAG
Sbjct: 311 RGDDQCGIESAVVAG 325


>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
           GN=cpr-6 PE=1 SV=1
          Length = 379

 Score =  181 bits (460), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 101/224 (45%), Positives = 132/224 (58%), Gaps = 21/224 (9%)

Query: 9   DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 61
           D +  + +  LQ ++LS +DLL+CC   CG GC+GG P++AWRY+V  G+VT        
Sbjct: 144 DRICIASHGELQ-VTLSADDLLSCCKS-CGFGCNGGDPLAAWRYWVKDGIVTGSNYTANN 201

Query: 62  ECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRIN 111
            C PY     C H        P     YPTPKC +KCV    ++ +   K +  SAY + 
Sbjct: 202 GCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVK 260

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
            D E I  E+  +GP+E++F VYEDF +Y  GVY H  G + GGHAVKLIGWG  DDG  
Sbjct: 261 DDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-DDGIP 319

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
           YW +AN WN  WG DG+F+I RG +ECGIE  VV G+P   +L 
Sbjct: 320 YWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSLT 363


>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
           SV=1
          Length = 340

 Score =  178 bits (452), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 94/202 (46%), Positives = 125/202 (61%), Gaps = 16/202 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
           QN+ LS  DLL CC   CG GC+GG    AW Y+V  G+VT         C+PY      
Sbjct: 138 QNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCE 196

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C + C +K +  +   KH   S+Y + +D + I  EI K G
Sbjct: 197 HHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 256

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  ++   YW++AN WN  WG 
Sbjct: 257 PVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGE 315

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +GYF+I RG +EC IE +V+AG
Sbjct: 316 NGYFRIVRGRDECSIESEVIAG 337


>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
           GN=cpr-3 PE=2 SV=1
          Length = 370

 Score =  176 bits (445), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 129/217 (59%), Gaps = 20/217 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +SV D+L+CCG  CG GC GGY I A R++   G VT        C PY  S    
Sbjct: 141 QQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPC 198

Query: 74  HPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEV 129
              C P   TP C   C    K + ++  KHY  SAY++ +     +I  EIY  GPVE 
Sbjct: 199 TKNC-PESTTPSCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEA 257

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           S+ VYEDF HYKSGVY + +G ++GGHAVK+IGWG  ++G DYW++AN W  S+G  G+F
Sbjct: 258 SYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFF 316

Query: 190 KIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
           KI+RG+NEC IE +VVAG      + K  T ++ +ED
Sbjct: 317 KIRRGTNECQIEGNVVAG------IAKLGTHSETYED 347


>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
           GN=CATB PE=2 SV=1
          Length = 342

 Score =  169 bits (429), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 88/203 (43%), Positives = 121/203 (59%), Gaps = 16/203 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
           Q+  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT         C PY      
Sbjct: 139 QSAELSALDLISCCKD-CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCE 197

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C + C K  +  +   KHY   +Y + ++ + I  +I   G
Sbjct: 198 HHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYG 257

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  +    YW++AN WN  WG 
Sbjct: 258 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWGE 316

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
            G F++ RG +EC IE DVVAGL
Sbjct: 317 KGLFRMVRGRDECSIESDVVAGL 339


>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
           GN=CP-1 PE=3 SV=3
          Length = 341

 Score =  168 bits (425), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 89/202 (44%), Positives = 123/202 (60%), Gaps = 17/202 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
           + + +S  D+++CC + CGDGC+GG+PISA+R+    GVVT         C PY +   C
Sbjct: 140 KQVLISAQDVVSCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPC 197

Query: 73  SHPGCEPAY-------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H G E  Y        TP+C R+C+        S  Y   AY++ +  + I  +I KNG
Sbjct: 198 GHHGNETYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNG 257

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PV  ++TVYEDFAHY+SG+YKH  G   G HAVK+IGWG  + G  YWI+AN W+  WG 
Sbjct: 258 PVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWG-EEKGTPYWIVANSWHDDWGE 316

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +G+F++ RGSN+CG EE + AG
Sbjct: 317 NGFFRMHRGSNDCGFEERMAAG 338


>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
          Length = 311

 Score =  159 bits (402), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 92/198 (46%), Positives = 115/198 (58%), Gaps = 20/198 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +N+ LS  D++ C      +GC+GG   SAW +    G V+EEC PY      + P C P
Sbjct: 126 ENVQLSFMDMVTCDE--TDNGCEGGDAFSAWNWLRKQGAVSEECLPY------TIPTCPP 177

Query: 80  AYP-------TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
           A         TP C ++C   + L +   KH     Y  +SD E IM EI  NGPVE  F
Sbjct: 178 AQQPCLNFVNTPSCTKECQSNSSLIYSQDKHKMAKIYSFDSD-EAIMQEIVTNGPVEACF 236

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
           TV+EDF  YKSGVY H TG  +GGH VKL+G+GT  +G DY+   NQW  SWG +G F I
Sbjct: 237 TVFEDFLAYKSGVYVHTTGKDLGGHCVKLVGFGTL-NGVDYYAANNQWTTSWGDNGTFLI 295

Query: 192 KRGSNECGIEEDVVAGLP 209
           KRG  +CGI +DVVAGLP
Sbjct: 296 KRG--DCGISDDVVAGLP 311


>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
          Length = 335

 Score =  158 bits (400), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 90/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RG + CGIE ++VAG+P
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMP 330


>sp|Q06544|CYSP3_OSTOS Cathepsin B-like cysteine proteinase 3 (Fragment) OS=Ostertagia
           ostertagi GN=CP-3 PE=3 SV=1
          Length = 174

 Score =  145 bits (367), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 79/175 (45%), Positives = 104/175 (59%), Gaps = 17/175 (9%)

Query: 49  AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 94
           AW+YF   GVVT         C PY +   C   G EP Y        TPKC + C +  
Sbjct: 1   AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59

Query: 95  -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 153
            + ++  KH+  SAYR+ ++ + I  +I KNGPV   F VYEDFAHYKSG+YKH  G + 
Sbjct: 60  LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119

Query: 154 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           GGHAVK+IGWG  + G  YW++AN W+  WG  G++++ RG N C IEE V AG+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173


>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
          Length = 299

 Score =  142 bits (359), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 75/168 (44%), Positives = 94/168 (55%), Gaps = 11/168 (6%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           CDGG+  S WR+    G  T+EC PY         G   A  T  C  KC   + L    
Sbjct: 140 CDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHLY 190

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
           K      Y +  D   IM  +   GP++ +FTVY DF +Y+SGVY+H  G V GGHAV +
Sbjct: 191 KATKAVDYGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDM 248

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +G+GT DDG DYWI+ N W   WG DGYF+I R +NECGIEE V+ G 
Sbjct: 249 VGYGTDDDGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 296


>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
           SV=3
          Length = 476

 Score =  131 bits (329), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
           SV=1
          Length = 476

 Score =  129 bits (325), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 113/221 (51%), Gaps = 34/221 (15%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF +YK+G+Y+HIT              HAVKL GWGT        E +WI AN W +
Sbjct: 381 HEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 222
           SWG +GYF+I RG NE  IE+ ++A          ++TSAD
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474


>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
          Length = 300

 Score =  126 bits (316), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 65/168 (38%), Positives = 94/168 (55%), Gaps = 11/168 (6%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           C+GG+  + W++    G  T+EC PY   +      C    PT     KC   +     +
Sbjct: 141 CNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHLA 191

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
              S   Y +  D   +M  +  +GP++V+F V+ DF +Y+SGVY+H  G + GGHAV++
Sbjct: 192 TATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVEM 249

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +G+GT DDG DYWI+ N W   WG DGYF++ RG N+C IEE   AG 
Sbjct: 250 VGYGTDDDGVDYWIIKNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297


>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
           SV=1
          Length = 435

 Score =  117 bits (292), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 71/201 (35%), Positives = 106/201 (52%), Gaps = 26/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY    G   P C+
Sbjct: 252 QTPILSPQEIVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CK 305

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P      C R        + +S++Y +  +    +   +  E+ ++GP+ V+F VY+DF 
Sbjct: 306 PN----DCFR--------YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFF 353

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
           HY+ G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF+I
Sbjct: 354 HYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRI 413

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA  P  K
Sbjct: 414 RRGTDECAIESIAVAATPIPK 434


>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
          Length = 303

 Score =  117 bits (292), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 15/188 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + +S S   L++C   L   GCDGG     W +    G  T EC  Y D       G   
Sbjct: 126 EAVSYSQQHLISCS--LENFGCDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTV 177

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           A P P          QL++   +  +S     S P  IM  +   GP++    VY D ++
Sbjct: 178 ASPCPAVCDDG-SPIQLYKAHGYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSY 231

Query: 140 YKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
           Y+SGVYKH  G + +G HA++++G+GT+DDG DYWI+ N W   WG +GYF+I RG NEC
Sbjct: 232 YESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNEC 291

Query: 199 GIEEDVVA 206
            IE+++ A
Sbjct: 292 RIEDEIYA 299


>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
          Length = 463

 Score =  116 bits (291), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSSQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY++G+Y H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>sp|Q5RB02|CATC_PONAB Dipeptidyl peptidase 1 OS=Pongo abelii GN=CTSC PE=2 SV=1
          Length = 463

 Score =  115 bits (287), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>sp|P53634|CATC_HUMAN Dipeptidyl peptidase 1 OS=Homo sapiens GN=CTSC PE=1 SV=2
          Length = 463

 Score =  113 bits (282), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>sp|Q3ZCJ8|CATC_BOVIN Dipeptidyl peptidase 1 OS=Bos taurus GN=CTSC PE=2 SV=1
          Length = 463

 Score =  112 bits (279), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 69/203 (33%), Positives = 105/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E+C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+   GP+ V+F VY+D
Sbjct: 331 -----------CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY+ GVY H           +  HAV L+G+GT +  G DYWI+ N W  SWG +GYF
Sbjct: 380 FLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   +A  P  K
Sbjct: 440 RIRRGTDECAIESIALAATPIPK 462


>sp|P80067|CATC_RAT Dipeptidyl peptidase 1 OS=Rattus norvegicus GN=Ctsc PE=1 SV=3
          Length = 462

 Score =  110 bits (276), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY  +         
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 328

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        + +S++Y +  +    +   +  E+ K+GP+ V+F V++DF 
Sbjct: 329 PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G     G DYWI+ N W   WG  GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRI 440

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   +A +P  K
Sbjct: 441 RRGTDECAIESIAMAAIPIPK 461


>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
           elegans GN=F26E4.3 PE=1 SV=3
          Length = 452

 Score =  110 bits (275), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 100/198 (50%), Gaps = 14/198 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N +LS   LL+C       GC+GGY   AW Y    GVV + C PY  S     PG    
Sbjct: 232 NSTLSSQQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLI 289

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
                  R+ ++     ++S  + ++  Y+++S  EDI  E+  NGPV+ +F V+EDF  
Sbjct: 290 PKRDYTNRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFM 349

Query: 140 YKSGVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGY 188
           Y  GVY+H         +    G H+V+++GWG   ++     YW+ AN W   WG DGY
Sbjct: 350 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGY 409

Query: 189 FKIKRGSNECGIEEDVVA 206
           FK+ RG N C IE  V+ 
Sbjct: 410 FKVLRGENHCEIESFVIG 427


>sp|P97821|CATC_MOUSE Dipeptidyl peptidase 1 OS=Mus musculus GN=Ctsc PE=2 SV=1
          Length = 462

 Score =  108 bits (271), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY            
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 328

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        + +S +Y +  +    +   +  E+ K+GP+ V+F V++DF 
Sbjct: 329 PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G     G +YWI+ N W  +WG  GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 440

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA +P  K
Sbjct: 441 RRGTDECAIESIAVAAIPIPK 461


>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
           GN=Tinagl1 PE=1 SV=1
          Length = 466

 Score =  107 bits (266), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 107/227 (47%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 234 AAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCY 292

Query: 65  PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
           P+             A PTP+C+          R+   +    Q+  N  +    AYR+ 
Sbjct: 293 PFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLG 346

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           SD ++IM E+ +NGPV+    V+EDF  Y+ G+Y H              G H+VK+ GW
Sbjct: 347 SDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 406

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W   WG  G+F+I RG+NEC IE  V+ 
Sbjct: 407 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 453


>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
           GN=TINAGL1 PE=1 SV=1
          Length = 467

 Score =  104 bits (259), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 107/221 (48%), Gaps = 25/221 (11%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCY 293

Query: 65  PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
           P+     D  G + P    +    +  R+      N    N+  Y ++  YR+ S+ ++I
Sbjct: 294 PFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEI 353

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
           M E+ +NGPV+    V+EDF  YK G+Y H    +         G H+VK+ GWG  T  
Sbjct: 354 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 413

Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 414 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454


>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
           GN=Tinagl1 PE=2 SV=1
          Length = 467

 Score =  103 bits (258), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 76/227 (33%), Positives = 107/227 (47%), Gaps = 36/227 (15%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 234 AAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QKGCRGGRLDGAWWFLRRRGVVSDNCY 292

Query: 65  PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
           P+           + A PTP+C+          R+   +   +Q+  N  +     YR+ 
Sbjct: 293 PF-----SGREQNDEASPTPRCMMHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLA 347

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           SD ++IM E+ +NGPV+    V+EDF  Y+ G+Y H              G H+VK+ GW
Sbjct: 348 SDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 407

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W   WG  G+F+I RG NEC IE  V+ 
Sbjct: 408 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIETFVLG 454


>sp|Q26563|CATC_SCHMA Cathepsin C OS=Schistosoma mansoni PE=2 SV=1
          Length = 454

 Score = 91.3 bits (225), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 66/190 (34%), Positives = 91/190 (47%), Gaps = 29/190 (15%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
           LS   ++ C  +   +GC+GG+P + A +Y    G+  +   PY   TG           
Sbjct: 272 LSPQTVVDCSPY--SEGCNGGFPFLIAGKYGEDFGLPQKIVIPY---TGED--------- 317

Query: 83  TPKCVRKCVKKNQLWRNSKHYS-ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
           T KC    V KN     +  YS I  Y   ++ + +  E+  NGP  V F VYEDF  YK
Sbjct: 318 TGKCT---VSKNCTRYYTTDYSYIGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQFYK 374

Query: 142 SGVYKHITGDV---------MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
            G+Y H T            +  HAV L+G+G     GE YW + N W   WG  GYF+I
Sbjct: 375 EGIYHHTTVQTDHYNFNPFELTNHAVLLVGYGVDKLSGEPYWKVKNSWGVEWGEQGYFRI 434

Query: 192 KRGSNECGIE 201
            RG++ECG+E
Sbjct: 435 LRGTDECGVE 444


>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
          Length = 362

 Score = 87.4 bits (215), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 64/193 (33%), Positives = 88/193 (45%), Gaps = 40/193 (20%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCE 78
           +N+SLS   L+ C G     GC+GG P  A+ Y  ++G + TEE  PY    G  H   E
Sbjct: 187 KNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTEESYPYKGVNGVCHYKAE 246

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG-----PVEVSFTV 133
                               N+    + +  I  + ED +    KN      PV V+F V
Sbjct: 247 --------------------NAAVQVLDSVNITLNAEDEL----KNAVGLVRPVSVAFQV 282

Query: 134 YEDFAHYKSGVYKHITGDVMG------GHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
            + F  YKSGVY   T D  G       HAV  +G+G  ++G  YW++ N W   WG +G
Sbjct: 283 IDGFRQYKSGVY---TSDHCGTTPDDVNHAVLAVGYGV-ENGVPYWLIKNSWGADWGDNG 338

Query: 188 YFKIKRGSNECGI 200
           YFK++ G N C I
Sbjct: 339 YFKMEMGKNMCAI 351


>sp|Q9R1T3|CATZ_RAT Cathepsin Z OS=Rattus norvegicus GN=Ctsz PE=1 SV=2
          Length = 306

 Score = 87.4 bits (215), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 83/196 (42%), Gaps = 21/196 (10%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPAY 81
           LSV +++ C        C+GG  +  W Y   HG+  E C+ Y   D        C    
Sbjct: 120 LSVQNVIDCGN---AGSCEGGNDLPVWEYAHKHGIPDETCNNYQAKDQECDKFNQCGTCT 176

Query: 82  PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
              +C    ++   LWR   + S+S        E +MAEIY NGP+       E  ++Y 
Sbjct: 177 EFKEC--HTIQNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATERMSNYT 228

Query: 142 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 201
            G+Y       +  H + + GWG S+DG +YWI+ N W   WG  G+ +I        + 
Sbjct: 229 GGIYTEYQNQAIINHIISVAGWGVSNDGIEYWIVRNSWGEPWGERGWMRI--------VT 280

Query: 202 EDVVAGLPSSKNLVKE 217
                G  SS NL  E
Sbjct: 281 STYKGGTGSSYNLAIE 296


>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
          Length = 333

 Score = 86.7 bits (213), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/206 (33%), Positives = 97/206 (47%), Gaps = 33/206 (16%)

Query: 4   TRTNRDALSSSPYV-SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTE 61
           T +   AL S+  + S + LSL+   L+ C       GC GG P  A+ Y +++ G++ E
Sbjct: 141 TFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEE 200

Query: 62  ECDPYF--DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA 119
           +  PY   DS+   +P    A+     V+  V                  I  + E  M 
Sbjct: 201 DSYPYIGKDSSCRFNPQKAVAF-----VKNVV-----------------NITLNDEAAMV 238

Query: 120 E-IYKNGPVEVSFTVYEDFAHYKSGVYK----HITGDVMGGHAVKLIGWGTSDDGEDYWI 174
           E +    PV  +F V EDF  YKSGVY     H T D +  HAV  +G+G   +G  YWI
Sbjct: 239 EAVALYNPVSFAFEVTEDFLMYKSGVYSSKSCHKTPDKVN-HAVLAVGYG-EQNGLLYWI 296

Query: 175 LANQWNRSWGADGYFKIKRGSNECGI 200
           + N W   WG +GYF I+RG N CG+
Sbjct: 297 VKNSWGSQWGENGYFLIERGKNMCGL 322


>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
          Length = 333

 Score = 86.3 bits (212), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 93/204 (45%), Gaps = 29/204 (14%)

Query: 4   TRTNRDALSSSPYV-SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTE 61
           T +   AL S+  + S + ++L+   L+ C       GC GG P  A+ Y +++ G++ E
Sbjct: 141 TFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGE 200

Query: 62  ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE- 120
           +  PY    G      E A    K V                      I  + E  M E 
Sbjct: 201 DSYPYIGKNGQCKFNPEKAVAFVKNV--------------------VNITLNDEAAMVEA 240

Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYK----HITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
           +    PV  +F V EDF  YKSGVY     H T D +  HAV  +G+G   +G  YWI+ 
Sbjct: 241 VALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVN-HAVLAVGYG-EQNGLLYWIVK 298

Query: 177 NQWNRSWGADGYFKIKRGSNECGI 200
           N W  +WG +GYF I+RG N CG+
Sbjct: 299 NSWGSNWGNNGYFLIERGKNMCGL 322


>sp|Q9WUU7|CATZ_MOUSE Cathepsin Z OS=Mus musculus GN=Ctsz PE=2 SV=1
          Length = 306

 Score = 85.9 bits (211), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 51/175 (29%), Positives = 80/175 (45%), Gaps = 17/175 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           ++ LSV +++ C        C+GG  +  W Y   HG+  E C+ Y          C+  
Sbjct: 117 SILLSVQNVIDCGN---AGSCEGGNDLPVWEYAHKHGIPDETCNNY----QAKDQDCDKF 169

Query: 81  YPTPKCV--RKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                C   ++C  ++   LWR   + S+S        E +MAEIY NGP+       E 
Sbjct: 170 NQCGTCTEFKECHTIQNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATEM 223

Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            ++Y  G+Y       +  H + + GWG S+DG +YWI+ N W   WG  G+ +I
Sbjct: 224 MSNYTGGIYAEHQDQAVINHIISVAGWGVSNDGIEYWIVRNSWGEPWGEKGWMRI 278


>sp|Q9UBR2|CATZ_HUMAN Cathepsin Z OS=Homo sapiens GN=CTSZ PE=1 SV=1
          Length = 303

 Score = 84.7 bits (208), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 75/170 (44%), Gaps = 14/170 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPAY 81
           LSV +++ C        C+GG  +S W Y   HG+  E C+ Y   D        C    
Sbjct: 118 LSVQNVIDCGN---AGSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCN 174

Query: 82  PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
              +C    ++   LWR   + S+S        E +MAEIY NGP+       E  A+Y 
Sbjct: 175 EFKEC--HAIRNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATERLANYT 226

Query: 142 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            G+Y          H V + GWG S DG +YWI+ N W   WG  G+ +I
Sbjct: 227 GGIYAEYQDTTYINHVVSVAGWGIS-DGTEYWIVRNSWGEPWGERGWLRI 275


>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
          Length = 356

 Score = 84.0 bits (206), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 59/188 (31%), Positives = 87/188 (46%), Gaps = 30/188 (15%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCE 78
           + +SLS   L+ C G     GC+GG P  A+ Y   +G + TEE  PY    G     C+
Sbjct: 182 KGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNGI----CK 237

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKH---YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
             +       K +    +   +++   Y+++  R                PV V+F V +
Sbjct: 238 --FSQANIGVKVISSVNITLGAEYELKYAVALVR----------------PVSVAFEVVK 279

Query: 136 DFAHYKSGVYKHIT-GDVMG--GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
            F  YKSGVY     GD      HAV  +G+G  ++G  YW++ N W   WG DGYFK++
Sbjct: 280 GFKQYKSGVYASTECGDTPMDVNHAVLAVGYGV-ENGTPYWLIKNSWGADWGEDGYFKME 338

Query: 193 RGSNECGI 200
            G N CG+
Sbjct: 339 MGKNMCGV 346


>sp|P05689|CATZ_BOVIN Cathepsin Z OS=Bos taurus GN=CTSZ PE=2 SV=2
          Length = 304

 Score = 82.8 bits (203), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 55/183 (30%), Positives = 77/183 (42%), Gaps = 17/183 (9%)

Query: 37  CGDG--CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN 94
           CGD   C+GG  +  W Y   HG+  E C+ Y          C+       C     K+ 
Sbjct: 127 CGDAGSCEGGNDLPVWEYAHRHGIPDETCNNY----QAKDQECDKFNQCGTCTE--FKEC 180

Query: 95  QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 154
            + +N   + +  Y   S  E +MAEIY NGP+       E  ++Y  G+Y         
Sbjct: 181 HVIKNYTLWKVGDYGSLSGREKMMAEIYTNGPISCGIMATEKMSNYTGGIYSEYNDQAFI 240

Query: 155 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG--------IEEDVVA 206
            H V + GWG S DG +YWI+ N W   WG  G+ +I   + + G        IEE    
Sbjct: 241 NHIVSVAGWGVS-DGMEYWIVRNSWGEPWGEHGWMRIVTSTYKGGEGARYNLAIEESCTF 299

Query: 207 GLP 209
           G P
Sbjct: 300 GDP 302


>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
           PE=2 SV=1
          Length = 358

 Score = 80.9 bits (198), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 61/188 (32%), Positives = 84/188 (44%), Gaps = 30/188 (15%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCE 78
           + +SLS   L+ C G     GC GG P  A+ Y  ++G + TEE  PY    G    GC+
Sbjct: 184 KGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG----GCK 239

Query: 79  -PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
             A      VR  V       +   +++   R                PV V+F V  +F
Sbjct: 240 FSAKNIGVQVRDSVNITLGAEDELKHAVGLVR----------------PVSVAFEVVHEF 283

Query: 138 AHYKSGVYKHIT-----GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
             YK GV+   T      DV   HAV  +G+G  DD   YW++ N W   WG +GYFK++
Sbjct: 284 RFYKKGVFTSNTCGNTPMDV--NHAVLAVGYGVEDD-VPYWLIKNSWGGEWGDNGYFKME 340

Query: 193 RGSNECGI 200
            G N CG+
Sbjct: 341 MGKNMCGV 348


>sp|P09668|CATH_HUMAN Pro-cathepsin H OS=Homo sapiens GN=CTSH PE=1 SV=4
          Length = 335

 Score = 80.9 bits (198), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 60/203 (29%), Positives = 95/203 (46%), Gaps = 27/203 (13%)

Query: 4   TRTNRDALSSSPYVSL-QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTE 61
           T +   AL S+  ++  + LSL+   L+ C       GC GG P  A+ Y +++ G++ E
Sbjct: 143 TFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGE 202

Query: 62  ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 121
           +  PY    G              C  +  K     ++  + +I       D E ++  +
Sbjct: 203 DTYPYQGKDG-------------YCKFQPGKAIGFVKDVANITIY------DEEAMVEAV 243

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYK----HITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
               PV  +F V +DF  Y++G+Y     H T D +  HAV  +G+G   +G  YWI+ N
Sbjct: 244 ALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVN-HAVLAVGYG-EKNGIPYWIVKN 301

Query: 178 QWNRSWGADGYFKIKRGSNECGI 200
            W   WG +GYF I+RG N CG+
Sbjct: 302 SWGPQWGMNGYFLIERGKNMCGL 324


>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
          Length = 360

 Score = 79.0 bits (193), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/187 (33%), Positives = 85/187 (45%), Gaps = 32/187 (17%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCEPA 80
           +SLS   L+ C       GC+GG P  A+ Y  ++G + TEE  PY    G         
Sbjct: 188 ISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGI-------- 239

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAH 139
                    C  KN+   N     + +  I    ED + + +    PV V+F V   F  
Sbjct: 240 ---------CKFKNE---NVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRL 287

Query: 140 YKSGVYKHITGDVMG------GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
           YKSGVY   T D  G       HAV  +G+G  +DG  YW++ N W   WG +GYFK++ 
Sbjct: 288 YKSGVY---TSDHCGTTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDEGYFKMEM 343

Query: 194 GSNECGI 200
           G N CG+
Sbjct: 344 GKNMCGV 350


>sp|Q3T0I2|CATH_BOVIN Pro-cathepsin H OS=Bos taurus GN=CTSH PE=2 SV=1
          Length = 335

 Score = 79.0 bits (193), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 58/203 (28%), Positives = 92/203 (45%), Gaps = 27/203 (13%)

Query: 4   TRTNRDALSSSPYVSLQNLS-LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTE 61
           T +   AL S+  ++   L  L+   L+ C       GC GG P  A+ Y  ++ G++ E
Sbjct: 143 TFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGE 202

Query: 62  ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 121
           +  PY    G              C  +  K     ++  + ++      +D E ++  +
Sbjct: 203 DTYPYRGQDG-------------DCKYQPSKAIAFVKDVANITL------NDEEAMVEAV 243

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYK----HITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
             + PV  +F V  DF  Y+ G+Y     H T D +  HAV  +G+G  + G  YWI+ N
Sbjct: 244 ALHNPVSFAFEVTADFMMYRKGIYSSTSCHKTPDKV-NHAVLAVGYG-EEKGIPYWIVKN 301

Query: 178 QWNRSWGADGYFKIKRGSNECGI 200
            W  +WG  GYF I+RG N CG+
Sbjct: 302 SWGPNWGMKGYFLIERGKNMCGL 324


>sp|P22497|CYSP_THEPA Cysteine proteinase OS=Theileria parva GN=TP03_0285 PE=3 SV=2
          Length = 440

 Score = 77.8 bits (190), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 54/182 (29%), Positives = 86/182 (47%), Gaps = 32/182 (17%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYP 82
           LSV +LL C  F   +GC GG   SA+ Y   +G+V+ +  P+ D +  CS P       
Sbjct: 276 LSVQELLDCDSF--SNGCQGGLLESAYEYVRKYGLVSAKDLPFVDKARRCSVP------- 326

Query: 83  TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
                            +K  S+ +Y +    E +M     + P  V  +V  + A YKS
Sbjct: 327 ----------------KAKKVSVPSYHVFKGKE-VMTRSLTSSPCSVYLSVSPELAKYKS 369

Query: 143 GVYKHITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKR---GSNEC 198
           GV+    G  +  HAV L+G G  +   + YW++ N W   WG +GY +++R   G+++C
Sbjct: 370 GVFTGECGKSLN-HAVVLVGEGYDEVTKKRYWVVQNSWGTDWGENGYMRLERTNMGTDKC 428

Query: 199 GI 200
           G+
Sbjct: 429 GV 430


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.317    0.135    0.439 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 98,162,947
Number of Sequences: 539616
Number of extensions: 4498991
Number of successful extensions: 8737
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 207
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 8237
Number of HSP's gapped (non-prelim): 254
length of query: 229
length of database: 191,569,459
effective HSP length: 113
effective length of query: 116
effective length of database: 130,592,851
effective search space: 15148770716
effective search space used: 15148770716
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (27.3 bits)