BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 023657
         (279 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
          Length = 339

 Score =  184 bits (467), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 111/267 (41%), Positives = 143/267 (53%), Gaps = 38/267 (14%)

Query: 11  CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
           CLL LG   S+              H L D ++  VN+     W+A  N  F N  V   
Sbjct: 10  CLLALGDARSRP-----------SFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYL 55

Query: 71  KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
           K L G     P P   ++        + LKLP+SFDAR  WPQC TI  I DQG CGSCW
Sbjct: 56  KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109

Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
           AFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVS 169

Query: 186 E-------ECDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
                    C PY           S P C     TPKC + C    +  ++  KHY  ++
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229

Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVY 259
           Y +++  +DIMAEIYKNGPVE +F+VY
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVY 256


>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
          Length = 339

 Score =  182 bits (463), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 105/242 (43%), Positives = 135/242 (55%), Gaps = 27/242 (11%)

Query: 36  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 92
           H L D ++  VN+     W+A  N  F N  V   K L G     P P   ++       
Sbjct: 24  HPLSDELVNYVNKR-NTTWQAGHN--FYNVDVSYLKKLCGTFLGGPKPPQRVM------F 74

Query: 93  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
            + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+
Sbjct: 75  TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134

Query: 153 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----DSTGCS 198
             DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S
Sbjct: 135 AEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGS 194

Query: 199 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
            P C     TPKC + C    +  ++  KHY  ++Y +++   DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFS 254

Query: 258 VY 259
           VY
Sbjct: 255 VY 256


>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
          Length = 339

 Score =  181 bits (459), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 104/242 (42%), Positives = 135/242 (55%), Gaps = 27/242 (11%)

Query: 36  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 92
           H L D ++  VN+     W+A  N  F N  +   K L G     P P   ++       
Sbjct: 24  HPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGTFLGGPKPPQRVM------F 74

Query: 93  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
            + LKLP SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+
Sbjct: 75  TEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134

Query: 153 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----DSTGCS 198
             DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S
Sbjct: 135 AEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGS 194

Query: 199 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
            P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFS 254

Query: 258 VY 259
           VY
Sbjct: 255 VY 256


>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
          Length = 339

 Score =  177 bits (449), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 103/247 (41%), Positives = 135/247 (54%), Gaps = 29/247 (11%)

Query: 32  KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 91
           K  SH L D +I  +N+     W+A RN  F N  +   K L G        +LG P   
Sbjct: 20  KPSSHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGT-------VLGGPNLP 69

Query: 92  H----DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 145
                 + + LP+SFDAR  W  C TI++I DQG CGSCWAFGAVEA+SDR CIH    +
Sbjct: 70  ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129

Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----D 193
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189

Query: 194 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
               S P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249

Query: 253 EVSFTVY 259
           E +FTV+
Sbjct: 250 EGAFTVF 256


>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
          Length = 339

 Score =  170 bits (431), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 98/245 (40%), Positives = 133/245 (54%), Gaps = 33/245 (13%)

Query: 36  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 91
           H L D +I  +N+     W+A RN  F N  +   K L    LG    P  +  G     
Sbjct: 24  HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75

Query: 92  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
             + + LP++FDAR  W  C TI +I DQG CGSCWAFGAVEA+SDR CIH    +N+ +
Sbjct: 76  --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133

Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------ 197
           S  DLL CCG  CGDGC+GGYP  AW ++   G+V+     Y    GC            
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGV--YNSHVGCLPYTIPPCEHHV 191

Query: 198 --SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
             S P C     TP+C + C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPVE 
Sbjct: 192 NGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEG 251

Query: 255 SFTVY 259
           +FTV+
Sbjct: 252 AFTVF 256


>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
           GN=cpr-6 PE=1 SV=1
          Length = 379

 Score =  169 bits (427), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 108/293 (36%), Positives = 151/293 (51%), Gaps = 50/293 (17%)

Query: 6   LFLTTCLLILGVISSQTFAEGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKAARN 59
           L   +C+++    +     E V+ K +   +DS   +   D +I  VNEN    W A + 
Sbjct: 4   LLFLSCIVVAAYCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTAKKQ 62

Query: 60  PQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSA 107
            +FS+        + G     K  L+GV              KT D  L +P+SFD+R  
Sbjct: 63  RRFSS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDN 114

Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 165
           WP+C +I  I DQ  CGSCWAFGAVEA+SDR CI  H  + ++LS +DLL+CC   CG G
Sbjct: 115 WPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCK-SCGFG 173

Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTP 209
           C+GG P++AWRY+V  G+VT     Y  + GC     P CE               YPTP
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTP 231

Query: 210 KCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
           KC +KCV    ++ +   K +  SAY +  D E I  E+  +GP+E++F VYE
Sbjct: 232 KCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYE 284


>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
           SV=1
          Length = 340

 Score =  168 bits (425), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 105/271 (38%), Positives = 144/271 (53%), Gaps = 22/271 (8%)

Query: 7   FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
            LT+ L I  +I   TF E  +S        L D II  +NE+P AGW+A ++ +F +  
Sbjct: 1   MLTSILCIASLI---TFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLD 57

Query: 67  VGQFKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
             + + +   +  P       P   H D ++++P +FD+R  WP C +I+ I DQ  CGS
Sbjct: 58  DARIQ-MGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGS 116

Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
           CW+FGAVEA+SDR CI  G   N+ LS  DLL CC   CG GC+GG    AW Y+V  G+
Sbjct: 117 CWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCE-SCGLGCEGGILGPAWDYWVKEGI 175

Query: 184 VTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 229
           VT         C+PY        T   +P C    Y TP+C + C +K +  +   KH  
Sbjct: 176 VTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRG 235

Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
            S+Y + +D + I  EI K GPVE SFTVYE
Sbjct: 236 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYE 266


>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
           GN=CATB PE=2 SV=1
          Length = 342

 Score =  166 bits (421), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 99/263 (37%), Positives = 142/263 (53%), Gaps = 22/263 (8%)

Query: 17  VISSQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 75
           ++S  TF E  V ++       L D +I  +NE+P AGWKA ++ +F  +++   + L+G
Sbjct: 8   IVSLFTFLEAHVTTRNNQRIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMG 65

Query: 76  VKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
            +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSCWAFGAVE
Sbjct: 66  ARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVE 125

Query: 134 ALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------ 185
           A++DR CI  G   S  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT      
Sbjct: 126 AMTDRICIQSGGGQSAELSALDLISCCK-DCGDGCQGGFPGVAWDYWVKRGIVTGGSKEN 184

Query: 186 -EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINS 237
              C PY        T   +P C    Y TP+C + C K  +  +   KHY   +Y + +
Sbjct: 185 HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQN 244

Query: 238 DPEDIMAEIYKNGPVEVSFTVYE 260
           + + I  +I   GPVE +F VYE
Sbjct: 245 NEKVIQRDIMMYGPVEAAFDVYE 267


>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
          Length = 340

 Score =  166 bits (420), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 100/245 (40%), Positives = 130/245 (53%), Gaps = 34/245 (13%)

Query: 38  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
           L   ++  +N+    G +A  N  F N  +   K L G         LG P         
Sbjct: 26  LSSDLVNHINKLNTTG-RAGHN--FHNTDMSYVKKLCGT-------FLGGPKAPERVDFA 75

Query: 94  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
           + + LP +FD R  WP C TIS I DQG CGSCWAFGAVEA+SDR C+H    +S+ V+ 
Sbjct: 76  EDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSA 135

Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCE----- 203
            DLL+CCGF CG GC+GGYP  AWRY+   G+V+     Y    GC   + P CE     
Sbjct: 136 EDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGL--YDSHVGCRAYTIPPCEHHVNG 193

Query: 204 -------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
                      TP+C R C    +  ++  KHY I++Y +    ++IMAEIYKNGPVE +
Sbjct: 194 SRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGA 253

Query: 256 FTVYE 260
           F VYE
Sbjct: 254 FIVYE 258


>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
          Length = 335

 Score =  165 bits (418), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 100/250 (40%), Positives = 129/250 (51%), Gaps = 29/250 (11%)

Query: 29  SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 88
           ++  L    L D ++  +N+     W A  N  F N  +   K L G         LG P
Sbjct: 17  ARESLHFQPLSDELVNFINKQ-NTTWTAGHN--FYNVDLSYVKKLCGT-------FLGGP 66

Query: 89  VKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
                 +      LPKSFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CI   
Sbjct: 67  KLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSN 126

Query: 145 --MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF--- 192
             +N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY    
Sbjct: 127 GRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPP 186

Query: 193 --DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 249
                  S P C     TPKC + C       ++  KH+  S+Y I+ + ++IMAEIYKN
Sbjct: 187 CEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKN 246

Query: 250 GPVEVSFTVY 259
           GPVE +FTVY
Sbjct: 247 GPVEGAFTVY 256


>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
           GN=cpr-5 PE=2 SV=1
          Length = 344

 Score =  160 bits (404), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 87/187 (46%), Positives = 107/187 (57%), Gaps = 21/187 (11%)

Query: 95  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
           S  +P  FDAR  WP C +I+ I DQ  CGSCWAF A EA+SDR CI  +  +N  LS  
Sbjct: 79  SDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSE 138

Query: 153 DLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDS------TGC 197
           DLL+CC   F CG+GC+GGYPI AW+++V HG+VT         C PY  +       G 
Sbjct: 139 DLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGV 198

Query: 198 SHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
             P C E   PTPKCV  C  KN     +   KH+  +AY +    E I  EI  NGP+E
Sbjct: 199 KWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIE 258

Query: 254 VSFTVYE 260
           V+FTVYE
Sbjct: 259 VAFTVYE 265


>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
          Length = 311

 Score =  154 bits (388), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 89/216 (41%), Positives = 119/216 (55%), Gaps = 23/216 (10%)

Query: 54  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCS 112
           W   +  QF N  VGQ   LLG K +P    L   +K++D   +++P SF+A++ WP C+
Sbjct: 39  WVEEQTDQFDNIKVGQ---LLGFKRSPNRPKL--QIKSYDPLGVQIPTSFNAQTNWPNCT 93

Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPI 172
           TIS+I +Q  CGSCWAFGA E+ +DR CIH   N+ LS  D++ C      +GC+GG   
Sbjct: 94  TISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTC--DETDNGCEGGDAF 151

Query: 173 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRN 224
           SAW +    G V+EEC PY      + P C PA         TP C ++C   + L +  
Sbjct: 152 SAWNWLRKQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQ 205

Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
            KH     Y  +SD E IM EI  NGPVE  FTV+E
Sbjct: 206 DKHKMAKIYSFDSD-EAIMQEIVTNGPVEACFTVFE 240


>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
           PE=1 SV=2
          Length = 329

 Score =  143 bits (360), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/172 (44%), Positives = 100/172 (58%), Gaps = 11/172 (6%)

Query: 98  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
           +P +FD+R+ W +C +I  I DQ  CGSCWAFGA E +SDR CI         +S +DLL
Sbjct: 85  VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144

Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
           +CCG  CG+GC+GGYPI A R++   GVVT        C PY  +  C+   C P   TP
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTP 202

Query: 210 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
            C   C    +  +   KH+ +SAY +  +   I AEIY NGPVE +F+VYE
Sbjct: 203 SCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYE 254


>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
          Length = 335

 Score =  138 bits (347), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 99/240 (41%), Positives = 131/240 (54%), Gaps = 27/240 (11%)

Query: 38  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--- 94
           L D ++  VN+     WKA  N  F N  +   K L G       +L G  +   D    
Sbjct: 26  LSDELVNFVNKQ-NTTWKAGHN--FYNVDLSYVKKLCGA------ILGGPKLPQRDAFAA 76

Query: 95  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
            + LP+SFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  
Sbjct: 77  DVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAE 136

Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----DSTGCSHP 200
           D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY           S P
Sbjct: 137 DMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRP 196

Query: 201 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
            C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY
Sbjct: 197 PCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVY 256


>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
           GN=cpr-4 PE=2 SV=1
          Length = 335

 Score =  137 bits (346), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 98/241 (40%), Positives = 128/241 (53%), Gaps = 24/241 (9%)

Query: 39  QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-KSLK 97
           Q++I + VN   ++ WKA   P+  + T+ Q K  L            V V  HD     
Sbjct: 25  QEAITEYVNSK-QSLWKA-EIPK--DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINEDT 80

Query: 98  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
           +P +FDAR+ WP C +I+ I DQ  CGSCWAF A EA SDRFCI  +  +N  LS  D+L
Sbjct: 81  IPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140

Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----DSTG-CSHPGC 202
           +CC   CG GC+GGYPI+AW+Y V  G  T         C PY      ++ G  + P C
Sbjct: 141 SCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSC 199

Query: 203 -EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
            +  Y TP CV KC  KN    +   KH+  +AY +      I AEI  +GPVE +FTVY
Sbjct: 200 PDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVY 259

Query: 260 E 260
           E
Sbjct: 260 E 260


>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
           GN=AC-2 PE=2 SV=1
          Length = 342

 Score =  136 bits (342), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/185 (40%), Positives = 99/185 (53%), Gaps = 19/185 (10%)

Query: 93  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
           D  + +P S+D R  W  C+T   I DQ +CGSCWA     A+SDR CI       +++S
Sbjct: 82  DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140

Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
             D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C H G  
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199

Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
                C    PTP C RKC     +++R  K Y   AY +    + I +EI KNGPV  S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVAS 259

Query: 256 FTVYE 260
           F VYE
Sbjct: 260 FAVYE 264


>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
           GN=cpr-3 PE=2 SV=1
          Length = 370

 Score =  135 bits (341), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 77/175 (44%), Positives = 94/175 (53%), Gaps = 15/175 (8%)

Query: 98  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
           LP +FDAR  WP C+TI  I +Q  CGSCWAFGA E +SDR CI         +SV D+L
Sbjct: 92  LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151

Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
           +CCG  CG GC GGY I A R++   G VT        C PY  S       C P   TP
Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTP 208

Query: 210 KCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYE 260
            C   C    K + ++  KHY  SAY++ +     +I  EIY  GPVE S+ VYE
Sbjct: 209 SCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYE 263


>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
           GN=AC-1 PE=2 SV=1
          Length = 342

 Score =  135 bits (340), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 74/185 (40%), Positives = 99/185 (53%), Gaps = 19/185 (10%)

Query: 93  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
           D  + +P S+D R  W  C+T   I DQ +CGSCWA     A+SDR CI       +++S
Sbjct: 82  DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140

Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
             D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C H G  
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199

Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
                C    PTP C RKC     +++R  K Y   AY +    + I +EI +NGPV  S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVAS 259

Query: 256 FTVYE 260
           F VYE
Sbjct: 260 FAVYE 264


>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
           GN=CP-1 PE=3 SV=3
          Length = 341

 Score =  123 bits (308), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 70/179 (39%), Positives = 99/179 (55%), Gaps = 18/179 (10%)

Query: 98  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
           +P+S+D R  W  CS++  I DQ +CGSCWA  +  A+SDR CI       + +S  D++
Sbjct: 91  IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150

Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY-- 206
           +CC + CGDGC+GG+PISA+R+    GVVT         C PY +   C H G E  Y  
Sbjct: 151 SCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGE 208

Query: 207 -----PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
                 TP+C R+C+        S  Y   AY++ +  + I  +I KNGPV  ++TVYE
Sbjct: 209 CVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYE 267


>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
           SV=1
          Length = 476

 Score = 93.2 bits (230), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 74/238 (31%), Positives = 109/238 (45%), Gaps = 24/238 (10%)

Query: 37  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
           ++Q  +I+ VN+    GW A    QF   T+ + FK+ LG  P P  LLL +   T    
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLT 212

Query: 94  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
           K+  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 213 KTTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270

Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
            +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 271 QNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329

Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
              + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+E
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHE 382


>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
           GN=Tinagl1 PE=1 SV=1
          Length = 466

 Score = 92.0 bits (227), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 74/255 (29%), Positives = 115/255 (45%), Gaps = 30/255 (11%)

Query: 37  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
           ++   +IK +N     GW+A  +  F   T+ +  ++ LG ++P+   + +        +
Sbjct: 140 LVDPDMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQ 198

Query: 95  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH-FG-MNLSLSVN 152
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH  G M   LS  
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256

Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
           +LL+C       GC GG    AW +    GVV++ C P+             A PTP+C+
Sbjct: 257 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQ------NEASPTPRCM 309

Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
                     R+   +    Q+  N  +    AYR+ SD ++IM E+ +NGPV+    + 
Sbjct: 310 MHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQA---LM 366

Query: 260 EVKQTLTLYSSTDFS 274
           EV +   LY    +S
Sbjct: 367 EVHEDFFLYQRGIYS 381


>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
           GN=Tinagl1 PE=2 SV=1
          Length = 467

 Score = 90.9 bits (224), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 73/255 (28%), Positives = 117/255 (45%), Gaps = 29/255 (11%)

Query: 37  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
           ++  ++IK +N     GW+A  +  F   T+ +  ++ LG ++P+   + +        +
Sbjct: 140 LVDPAMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQ 198

Query: 95  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH-FG-MNLSLSVN 152
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH  G M   LS  
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256

Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
           +LL+C       GC GG    AW +    GVV++ C P+           + A PTP+C+
Sbjct: 257 NLLSCDTHH-QKGCRGGRLDGAWWFLRRRGVVSDNCYPF-----SGREQNDEASPTPRCM 310

Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
                     R+   +   +Q+  N  +     YR+ SD ++IM E+ +NGPV+    + 
Sbjct: 311 MHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQA---LM 367

Query: 260 EVKQTLTLYSSTDFS 274
           EV +   LY    +S
Sbjct: 368 EVHEDFFLYQRGIYS 382


>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
           SV=3
          Length = 476

 Score = 89.0 bits (219), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 69/237 (29%), Positives = 106/237 (44%), Gaps = 22/237 (9%)

Query: 37  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
           +++  +I++VN+    GW A    QF   T+   FK  LG + P+P  L +     +   
Sbjct: 155 LVRSELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 95  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A       
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330

Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V E
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVRE 382


>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
           GN=TINAGL1 PE=1 SV=1
          Length = 467

 Score = 86.3 bits (212), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 71/249 (28%), Positives = 115/249 (46%), Gaps = 18/249 (7%)

Query: 37  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199

Query: 95  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH-FG-MNLSLSVN 152
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH  G M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316

Query: 209 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEVKQTL 265
            +  R+      N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    + EV +  
Sbjct: 317 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQA---LMEVHEDF 373

Query: 266 TLYSSTDFS 274
            LY    +S
Sbjct: 374 FLYKGGIYS 382


>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
          Length = 299

 Score = 86.3 bits (212), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 71/215 (33%), Positives = 97/215 (45%), Gaps = 24/215 (11%)

Query: 49  NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
           NP+  WKA    +F   T  +   LL      K     VP  T   + + P SFD R  +
Sbjct: 28  NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQAPDSFDFREEY 84

Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
           P C  I  ++DQG CGSCWAF +V ++ DR C   G++   +  S   +++C     GD 
Sbjct: 85  PHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSC---DRGDM 138

Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
            CDGG+  S WR+    G  T+EC PY         G   A  T  C  KC   + L   
Sbjct: 139 ACDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHL 189

Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
            K      Y +  D   IM  +   GP++ +FTVY
Sbjct: 190 YKATKAVDYGL--DAPAIMKALATGGPLQTAFTVY 222


>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
           elegans GN=F26E4.3 PE=1 SV=3
          Length = 452

 Score = 80.1 bits (196), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 82/170 (48%), Gaps = 7/170 (4%)

Query: 94  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 151
           K  +LP+ FDAR  W     I  + DQG CGS W+       SDR  I     +N +LS 
Sbjct: 180 KPRELPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSS 237

Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
             LL+C       GC+GGY   AW Y    GVV + C PY  S     PG          
Sbjct: 238 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 295

Query: 212 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYE 260
            R+ ++     ++S  + ++  Y+++S  EDI  E+  NGPV+ +F V+E
Sbjct: 296 NRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHE 345


>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
          Length = 300

 Score = 78.2 bits (191), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 63/215 (29%), Positives = 96/215 (44%), Gaps = 23/215 (10%)

Query: 49  NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
           NP+  WKA    +F   T  +   LL      K      P  T      +P+SFD R  +
Sbjct: 28  NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGAAPRGTFTDKDDVPESFDFREEY 85

Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
           P C  I  ++DQG CGSCWAF +V    DR C+  G++   +  S   +++C     GD 
Sbjct: 86  PHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSC---DHGDM 139

Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
            C+GG+  + W++    G  T+EC PY   +      C    PT     KC   +     
Sbjct: 140 ACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHL 190

Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
           +   S   Y +  D   +M  +  +GP++V+F V+
Sbjct: 191 ATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVH 223


>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
          Length = 303

 Score = 77.0 bits (188), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 69/232 (29%), Positives = 100/232 (43%), Gaps = 29/232 (12%)

Query: 54  WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 108
           WKA    +F N T  +F+ +L ++P       G L  + + +  +    +P  FD R  +
Sbjct: 31  WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89

Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 165
           PQC  +   LDQG CGSCWAF A+    DR C   G++   +S S   L++C   L   G
Sbjct: 90  PQC--VKPALDQGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144

Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 225
           CDGG     W +    G  T EC  Y D       G   A P P          QL++  
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197

Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEVKQTLTLYSSTDFSASF 277
            +  +S     S P  IM  +   GP++    VY     L+ Y S  +  ++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVY---ADLSYYESGVYKHTY 241


>sp|P22497|CYSP_THEPA Cysteine proteinase OS=Theileria parva GN=TP03_0285 PE=3 SV=2
          Length = 440

 Score = 74.7 bits (182), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 53/168 (31%), Positives = 80/168 (47%), Gaps = 24/168 (14%)

Query: 61  QFSNYTVGQFKHLLGVKPTPKG-------LLLGVPVKTHDKSLK----------LPKSFD 103
           +FS+ T  +F  L  V   PK        LL  +  KT+ K+LK          L K   
Sbjct: 171 RFSDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDEDVDLAKLTG 230

Query: 104 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCG 163
               W + S+++ + DQ +CG CWAF  V ++   +  HF  +  LSV +LL C  F   
Sbjct: 231 ENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCDSF--S 288

Query: 164 DGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCE----PAY 206
           +GC GG   SA+ Y   +G+V+ +  P+ D +  CS P  +    P+Y
Sbjct: 289 NGCQGGLLESAYEYVRKYGLVSAKDLPFVDKARRCSVPKAKKVSVPSY 336


>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
           SV=1
          Length = 435

 Score = 72.8 bits (177), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 57/223 (25%), Positives = 105/223 (47%), Gaps = 23/223 (10%)

Query: 41  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 100
             +K +N   K+ W A R  ++   T+      +G +  P+     +  + H++  +LP 
Sbjct: 148 EFVKAINTIQKS-WTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPT 206

Query: 101 SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACC 158
           S+D R+     + +S + +Q  CGSC+AF +   L  R  I      +  LS  ++++C 
Sbjct: 207 SWDWRNV-RGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCS 265

Query: 159 GFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
            +    GC+GG+P + A +Y    G+V E C PY    G   P C+P      C R    
Sbjct: 266 QY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR---- 311

Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
               + +S++Y +  +    +   +  E+ ++GP+ V+F VY+
Sbjct: 312 ----YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYD 350


>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
          Length = 362

 Score = 71.6 bits (174), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 69/147 (46%), Gaps = 8/147 (5%)

Query: 61  QFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 119
           +FS+ +  +F+   LG   T    L G  +     +  LP++ D    W +   +S + +
Sbjct: 108 RFSDMSWEEFQATRLGAAQTCSATLAGNHLMR--DAAALPETKD----WREDGIVSPVKN 161

Query: 120 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
           Q HCGSCW F    AL   +    G N+SLS   L+ C G     GC+GG P  A+ Y  
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221

Query: 180 HHGVV-TEECDPYFDSTGCSHPGCEPA 205
           ++G + TEE  PY    G  H   E A
Sbjct: 222 YNGGIDTEESYPYKGVNGVCHYKAENA 248


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score = 70.9 bits (172), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 80/173 (46%), Gaps = 23/173 (13%)

Query: 20  SQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 79
           +Q FAEG VS  KL  +   D +  E  +               NYT+   K L     +
Sbjct: 94  NQRFAEGKVS-FKLAVNKYADLLHHEFRQLMNG----------FNYTL--HKQLRAADES 140

Query: 80  PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139
            KG+    P       + LPKS D    W     ++ + DQGHCGSCWAF +  AL  + 
Sbjct: 141 FKGVTFISPA-----HVTLPKSVD----WRTKGAVTAVKDQGHCGSCWAFSSTGALEGQH 191

Query: 140 CIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPY 191
               G+ +SLS  +L+ C      +GC+GG   +A+RY   + G+ TE+  PY
Sbjct: 192 FRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 244


>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
           PE=2 SV=1
          Length = 358

 Score = 70.5 bits (171), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/172 (30%), Positives = 81/172 (47%), Gaps = 11/172 (6%)

Query: 28  VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLL 85
           V ++KL   + ++++    + N K   +K + N QF++ T  +F ++ LG        L 
Sbjct: 73  VEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLN-QFADLTWQEFQRYKLGAAQNCSATLK 131

Query: 86  GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 145
           G    T      +P + D    W +   +S + +QGHCGSCW F    AL   +   FG 
Sbjct: 132 GSHKITE---ATVPDTKD----WREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGK 184

Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTG 196
            +SLS   L+ C G     GC GG P  A+ Y  ++G + TEE  PY    G
Sbjct: 185 GISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG 236


>sp|Q24940|CATLL_FASHE Cathepsin L-like proteinase OS=Fasciola hepatica GN=Cat-1 PE=1 SV=1
          Length = 326

 Score = 70.1 bits (170), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 67/133 (50%), Gaps = 8/133 (6%)

Query: 61  QFSNYTVGQFK--HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 118
           QF++ T  +FK  +L  +      L  GVP + +++++  P   D    W +   ++ + 
Sbjct: 71  QFTDMTFEEFKAKYLTEMSRASDILSHGVPYEANNRAV--PDKID----WRESGYVTEVK 124

Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
           DQG+CGSCWAF     +  ++  +   ++S S   L+ C G    +GC GG   +A++Y 
Sbjct: 125 DQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL 184

Query: 179 VHHGVVTEECDPY 191
              G+ TE   PY
Sbjct: 185 KQFGLETESSYPY 197


>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
           SV=2
          Length = 490

 Score = 69.3 bits (168), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 48/145 (33%), Positives = 75/145 (51%), Gaps = 7/145 (4%)

Query: 49  NPKAGWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSA 107
           + + G++   N +F++ T G+F+   LG  P  +G  +G   + HD    LP S D R  
Sbjct: 107 DERGGFRLGMN-RFADLTNGEFRATYLGTTPAGRGRRVGEAYR-HDGVEALPDSVDWRD- 163

Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 167
             + + ++ + +QG CGSCWAF AV A+     I  G  +SLS  +L+ C       GC+
Sbjct: 164 --KGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCN 221

Query: 168 GGYPISAWRYFVHHGVV-TEECDPY 191
           GG    A+ +   +G + TEE  PY
Sbjct: 222 GGIMDDAFAFIARNGGLDTEEDYPY 246


>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
          Length = 323

 Score = 68.9 bits (167), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 41/112 (36%), Positives = 58/112 (51%), Gaps = 7/112 (6%)

Query: 86  GVPVKTHDKSLKLPKS-----FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 140
           G+  + H  S+ LPKS       A   W     ++ + DQG CGSCWAF AV AL     
Sbjct: 86  GMTRRRHPLSV-LPKSAPTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSAVAALEGAHF 144

Query: 141 IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV-HHGVVTEECDPY 191
           +  G  +SLS  +L+ C       GC+GG+P  A++Y + + G+ TE   PY
Sbjct: 145 LKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSYPY 196


>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
          Length = 356

 Score = 68.9 bits (167), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 56/176 (31%), Positives = 80/176 (45%), Gaps = 19/176 (10%)

Query: 28  VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQF-KHLLGVK----PTPK 81
           V ++K    I  D++    + N K   +K   N +F++ T  +F KH LG       T K
Sbjct: 71  VEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGIN-EFTDLTWDEFRKHKLGASQNCSATTK 129

Query: 82  GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
           G L          ++ LP++ D    W +   +S +  QG CGSCW F    AL   +  
Sbjct: 130 GNL-------KLTNVVLPETKD----WRKDGIVSPVKAQGKCGSCWTFSTTGALEAAYAQ 178

Query: 142 HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF-VHHGVVTEECDPYFDSTG 196
            FG  +SLS   L+ C G     GC+GG P  A+ Y   + G+ TEE  PY    G
Sbjct: 179 AFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNG 234


>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
          Length = 358

 Score = 68.6 bits (166), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 53/167 (31%), Positives = 78/167 (46%), Gaps = 11/167 (6%)

Query: 28  VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLL 85
           V ++KL   I ++++    + N K   +K   N QF++ T  +F+   LG        L 
Sbjct: 73  VEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVN-QFADLTWQEFQRTKLGAAQNCSATLK 131

Query: 86  GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 145
           G    T      LP++ D    W +   +S + DQG CGSCW F    AL   +   FG 
Sbjct: 132 GSHKVTE---AALPETKD----WREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGK 184

Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPY 191
            +SLS   L+ C G     GC+GG P  A+ Y   +G + TE+  PY
Sbjct: 185 GISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPY 231


>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
          Length = 339

 Score = 67.8 bits (164), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 55/97 (56%), Gaps = 5/97 (5%)

Query: 96  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLL 155
           + +PKS D    W +   ++ + DQGHCGSCWAF +  AL  +     G+ +SLS  +L+
Sbjct: 120 VTVPKSVD----WREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLV 175

Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPY 191
            C      +GC+GG   +A+RY   + G+ TE+  PY
Sbjct: 176 DCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 212


>sp|Q5E998|CATL2_BOVIN Cathepsin L2 OS=Bos taurus GN=CTSL2 PE=2 SV=1
          Length = 334

 Score = 67.0 bits (162), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 17/165 (10%)

Query: 51  KAGWKAARNPQFSNYTVGQFKHLLG---VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSA 107
           K G++ A N  F + T  +F+ ++     +   KG L   P+      + +PKS D    
Sbjct: 70  KHGFRMAMNA-FGDMTNEEFRQVMNGFQNQKHKKGKLFHEPL-----LVDVPKSVD---- 119

Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 167
           W +   ++ + +QG CGSCWAF A  AL  +     G  +SLS  +L+ C       GC+
Sbjct: 120 WTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCN 179

Query: 168 GGYPISAWRYFVHHGVV-TEECDPYF--DSTGCSH-PGCEPAYPT 208
           GG   +A++Y   +G + +EE  PY   D+  C++ P C  A  T
Sbjct: 180 GGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDT 224


>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
          Length = 334

 Score = 66.6 bits (161), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 17/165 (10%)

Query: 51  KAGWKAARNPQFSNYTVGQFKHLLG---VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSA 107
           K G++ A N  F + T  +F+ ++     +   KG L   P+      + +PKS D    
Sbjct: 70  KHGFRMAMNA-FGDMTNEEFRQVMNGFQNQKHKKGKLFHEPL-----LVDVPKSVD---- 119

Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 167
           W +   ++ + +QG CGSCWAF A  AL  +     G  +SLS  +L+ C       GC+
Sbjct: 120 WTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCN 179

Query: 168 GGYPISAWRYFVHH-GVVTEECDPYF--DSTGCSH-PGCEPAYPT 208
           GG   +A++Y   + G+ +EE  PY   D+  C++ P C  A  T
Sbjct: 180 GGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDT 224


>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
          Length = 333

 Score = 66.6 bits (161), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 17/165 (10%)

Query: 51  KAGWKAARNPQFSNYTVGQFKHLLG---VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSA 107
           K G+  A N  F + T  +F+ ++     +   KG +   P+       ++PKS D    
Sbjct: 70  KHGFTMAMNA-FGDMTNEEFRQVMNGFQNQKHKKGKMFQEPLFA-----EIPKSVD---- 119

Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 167
           W +   ++ + +QG CGSCWAF A  AL  +     G  +SLS  +L+ C      +GC+
Sbjct: 120 WREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCN 179

Query: 168 GGYPISAWRYFVHHGVV-TEECDPYF--DSTGCSH-PGCEPAYPT 208
           GG   +A+RY   +G + +EE  PY   D+  C++ P C  A  T
Sbjct: 180 GGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDT 224


>sp|Q90686|CATK_CHICK Cathepsin K OS=Gallus gallus GN=CTSK PE=2 SV=1
          Length = 334

 Score = 65.9 bits (159), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 61/117 (52%), Gaps = 12/117 (10%)

Query: 77  KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 136
           +P P G L  VP    D S + P + D    W +   ++ + DQG CGSCWAF +V AL 
Sbjct: 104 RPRPNGTLY-VP----DWSSRAPAAVD----WRRKGYVTPVKDQGQCGSCWAFSSVGALE 154

Query: 137 DRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF-VHHGVVTEECDPYF 192
            +     G  LSLS  +L+ C      +GC GGY  +A+ Y  ++ G+ +E+  PY 
Sbjct: 155 GQLKRRTGKLLSLSPQNLVYCVS--NNNGCGGGYMTNAFEYVRLNRGIDSEDAYPYI 209


>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
          Length = 334

 Score = 64.7 bits (156), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 62/117 (52%), Gaps = 8/117 (6%)

Query: 96  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLL 155
           L++PKS D    W +   ++ + +QG CGSCWAF A  AL  +     G  +SLS  +L+
Sbjct: 112 LEVPKSVD----WREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 167

Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYF--DSTGCSH-PGCEPAYPT 208
            C       GC+GG   +A++Y   +G + TEE  PY   ++  C++ P C  A  T
Sbjct: 168 DCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDT 224


>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
           SV=1
          Length = 368

 Score = 64.7 bits (156), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 62/210 (29%), Positives = 94/210 (44%), Gaps = 39/210 (18%)

Query: 61  QFSNYTVGQF-KHLLGVK---PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISR 116
           QFS+ T  +F K  LGV+     PK       + T +    LP+ FD    W     ++ 
Sbjct: 98  QFSDLTRSEFRKKHLGVRSGFKLPKDANKAPILPTEN----LPEDFD----WRDHGAVTP 149

Query: 117 ILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC-------GFLCGDGCDGG 169
           + +QG CGSCW+F A  AL     +  G  +SLS   L+ C           C  GC+GG
Sbjct: 150 VKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGG 209

Query: 170 YPISAWRYFVHHGVVTEECD-PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 228
              SA+ Y +  G + +E D PY   TG     C+            + K+++  +  ++
Sbjct: 210 LMNSAFEYTLKTGGLMKEEDYPY---TGKDGKTCK------------LDKSKIVASVSNF 254

Query: 229 SISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
           S+    I+ D E I A + KNGP+ V+   
Sbjct: 255 SV----ISIDEEQIAANLVKNGPLAVAINA 280


>sp|Q54TR1|CFAD_DICDI Counting factor associated protein D OS=Dictyostelium discoideum
           GN=cfaD PE=1 SV=1
          Length = 531

 Score = 64.3 bits (155), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 56/179 (31%), Positives = 80/179 (44%), Gaps = 25/179 (13%)

Query: 49  NPKAGWK--AARNPQFSNYTVG----------QFKHLLGVKP-TPKGLLLGVPVKTHDKS 95
           N KA  K  A  N + S+Y +G          +F  L  VKP   +  + G      D+S
Sbjct: 248 NFKAARKIIATHNAKESSYKLGMNHYADLSNKEFNTL--VKPKVARPSVTGADSVHDDES 305

Query: 96  LK-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL 154
           L+ +P + D    W   + ++ + DQG CGSCW FG+  +L    C+  G  +SLS   L
Sbjct: 306 LRSIPSTVD----WRNQNCVTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQL 361

Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHG-VVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
           + C       GC GG+  SA++Y +  G + TE   PY    G     C     TP  V
Sbjct: 362 VDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYPYLMQNGL----CRDRTVTPSGV 416


>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
          Length = 463

 Score = 64.3 bits (155), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 60/229 (26%), Positives = 105/229 (45%), Gaps = 32/229 (13%)

Query: 41  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 96
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPTPLTAEIQ--QKIL 229

Query: 97  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
            LP S+D R+     + +S + +Q  CGSC++F +V  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSSQEV 288

Query: 155 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
           ++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 214 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYD 378


>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1
          Length = 363

 Score = 64.3 bits (155), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 55/210 (26%), Positives = 89/210 (42%), Gaps = 40/210 (19%)

Query: 61  QFSNYTVGQFK-HLLGVKPTPKGLLLGVPVKTHDKSL----KLPKSFDARSAWPQCSTIS 115
           +FS+ T  +F+   LG+K       L +P       +     LP+ FD    W +   ++
Sbjct: 95  KFSDLTASEFRRQFLGLKKR-----LRLPAHAQKAPILPTTNLPEDFD----WREKGAVT 145

Query: 116 RILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC-------CGFLCGDGCDG 168
            + DQG CGSCWAF    AL     +  G  +SLS   L+ C           C  GC+G
Sbjct: 146 PVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNG 205

Query: 169 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 228
           G   +A+ Y +  G V +E D  +     S   C+              K+++  +  ++
Sbjct: 206 GLMNNAFEYLLESGGVVQEKDYAYTGRDGS---CK------------FDKSKVVASVSNF 250

Query: 229 SISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
           S+    +  D + I A + KNGP+ V+   
Sbjct: 251 SV----VTLDEDQIAANLVKNGPLAVAINA 276


>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
           PE=2 SV=2
          Length = 362

 Score = 64.3 bits (155), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 50/103 (48%), Gaps = 5/103 (4%)

Query: 98  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC 157
           LP++ D    W +   +S + DQGHCGSCW F    +L   +    G  +SLS   L+ C
Sbjct: 145 LPETKD----WREDGIVSPVKDQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDC 200

Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSH 199
                  GC GG P  A+ Y  ++G + TEE  PY    G  H
Sbjct: 201 ATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTGVNGICH 243


>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
          Length = 331

 Score = 63.9 bits (154), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 57/101 (56%), Gaps = 6/101 (5%)

Query: 93  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
           D + KLP S D    W +   ++ +  QG CGSCWAF AV AL  +  +  G  +SLS  
Sbjct: 110 DPNQKLPDSMD----WREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQ 165

Query: 153 DLLACCGFLCGD-GCDGGYPISAWRYFV-HHGVVTEECDPY 191
           +L+ C     G+ GC+GG+   A++Y + ++G+ +E   PY
Sbjct: 166 NLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPY 206


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.321    0.136    0.442 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 111,633,974
Number of Sequences: 539616
Number of extensions: 4842062
Number of successful extensions: 9538
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 195
Number of HSP's successfully gapped in prelim test: 33
Number of HSP's that attempted gapping in prelim test: 9142
Number of HSP's gapped (non-prelim): 239
length of query: 279
length of database: 191,569,459
effective HSP length: 116
effective length of query: 163
effective length of database: 128,974,003
effective search space: 21022762489
effective search space used: 21022762489
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 60 (27.7 bits)