BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 022267
         (300 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
          Length = 339

 Score =  234 bits (596), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 133/307 (43%), Positives = 176/307 (57%), Gaps = 39/307 (12%)

Query: 11  CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
           CLL LG   S+              H L D ++  VN+     W+A  N  F N  V   
Sbjct: 10  CLLALGDARSRP-----------SFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYL 55

Query: 71  KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
           K L G     P P   ++        + LKLP+SFDAR  WPQC TI  I DQG CGSCW
Sbjct: 56  KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109

Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
           AFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVS 169

Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
                    C PY           S P C     TPKC + C    +  ++  KHY  ++
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229

Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
           Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288

Query: 293 DGEDYWV 299
           +G  YW+
Sbjct: 289 NGTPYWL 295


>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
          Length = 339

 Score =  232 bits (592), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 127/282 (45%), Positives = 168/282 (59%), Gaps = 28/282 (9%)

Query: 36  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 92
           H L D ++  VN+     W+A  N  F N  V   K L G     P P   ++       
Sbjct: 24  HPLSDELVNYVNKR-NTTWQAGHN--FYNVDVSYLKKLCGTFLGGPKPPQRVM------F 74

Query: 93  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
            + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+
Sbjct: 75  TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134

Query: 153 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCS 198
             DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S
Sbjct: 135 AEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGS 194

Query: 199 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
            P C     TPKC + C    +  ++  KHY  ++Y +++   DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFS 254

Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW+
Sbjct: 255 VYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 295


>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
          Length = 339

 Score =  231 bits (590), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 131/307 (42%), Positives = 175/307 (57%), Gaps = 39/307 (12%)

Query: 11  CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
           CLL+L    S+              H L D ++  VN+     W+A  N  F N  +   
Sbjct: 10  CLLVLANARSRP-----------SFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYL 55

Query: 71  KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
           K L G     P P   ++        + LKLP SFDAR  WPQC TI  I DQG CGSCW
Sbjct: 56  KRLCGTFLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCW 109

Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
           AFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169

Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
                    C PY           S P C     TPKC + C    +  ++  KHY  ++
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229

Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
           Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288

Query: 293 DGEDYWV 299
           +G  YW+
Sbjct: 289 NGTPYWL 295


>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
          Length = 339

 Score =  227 bits (579), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 127/287 (44%), Positives = 166/287 (57%), Gaps = 30/287 (10%)

Query: 32  KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 91
           K  SH L D +I  +N+     W+A RN  F N  +   K L G        +LG P   
Sbjct: 20  KPSSHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGT-------VLGGPNLP 69

Query: 92  H----DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 145
                 + + LP+SFDAR  W  C TI++I DQG CGSCWAFGAVEA+SDR CIH    +
Sbjct: 70  ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129

Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 193
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189

Query: 194 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
               S P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249

Query: 253 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           E +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW+
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWL 295


>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
           SV=1
          Length = 340

 Score =  219 bits (558), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 127/310 (40%), Positives = 176/310 (56%), Gaps = 23/310 (7%)

Query: 7   FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
            LT+ L I  +I   TF E  +S        L D II  +NE+P AGW+A ++ +F +  
Sbjct: 1   MLTSILCIASLI---TFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLD 57

Query: 67  VGQFKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
             + + +   +  P       P   H D ++++P +FD+R  WP C +I+ I DQ  CGS
Sbjct: 58  DARIQ-MGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGS 116

Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
           CW+FGAVEA+SDR CI  G   N+ LS  DLL CC   CG GC+GG    AW Y+V  G+
Sbjct: 117 CWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGI 175

Query: 184 VTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 229
           VT         C+PY        T   +P C    Y TP+C + C +K +  +   KH  
Sbjct: 176 VTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRG 235

Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
            S+Y + +D + I  EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG
Sbjct: 236 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWG 295

Query: 290 TSDDGEDYWV 299
             ++   YW+
Sbjct: 296 V-ENKTPYWL 304


>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
          Length = 335

 Score =  219 bits (557), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 124/291 (42%), Positives = 163/291 (56%), Gaps = 32/291 (10%)

Query: 29  SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 88
           ++  L    L D ++  +N+     W A  N  F N  +   K L G         LG P
Sbjct: 17  ARESLHFQPLSDELVNFINKQ-NTTWTAGHN--FYNVDLSYVKKLCGT-------FLGGP 66

Query: 89  VKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
                 +      LPKSFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CI   
Sbjct: 67  KLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSN 126

Query: 145 --MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 195
             +N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY    
Sbjct: 127 GRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIP 185

Query: 196 GCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
            C H      P C     TPKC + C       ++  KH+  S+Y I+ + ++IMAEIYK
Sbjct: 186 PCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYK 245

Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           NGPVE +FTVY DF  YKSGVY+H+TGD+MGGHA++++GWG  ++G  YW+
Sbjct: 246 NGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPYWL 295


>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
          Length = 339

 Score =  218 bits (556), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 121/285 (42%), Positives = 164/285 (57%), Gaps = 34/285 (11%)

Query: 36  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 91
           H L D +I  +N+     W+A RN  F N  +   K L    LG    P  +  G     
Sbjct: 24  HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75

Query: 92  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
             + + LP++FDAR  W  C TI +I DQG CGSCWAFGAVEA+SDR CIH    +N+ +
Sbjct: 76  --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133

Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------ 197
           S  DLL CCG  CGDGC+GGYP  AW ++   G+V+     Y    GC            
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGV--YNSHVGCLPYTIPPCEHHV 191

Query: 198 --SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
             S P C     TP+C + C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPVE 
Sbjct: 192 NGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEG 251

Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           +FTV+ DF  YKSGVYKH  GD+MGGHA++++GWG  ++G  YW+
Sbjct: 252 AFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWL 295


>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
           GN=CATB PE=2 SV=1
          Length = 342

 Score =  213 bits (543), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 118/302 (39%), Positives = 173/302 (57%), Gaps = 23/302 (7%)

Query: 17  VISSQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 75
           ++S  TF E  V ++       L D +I  +NE+P AGWKA ++ +F  +++   + L+G
Sbjct: 8   IVSLFTFLEAHVTTRNNQRIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMG 65

Query: 76  VKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
            +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSCWAFGAVE
Sbjct: 66  ARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVE 125

Query: 134 ALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------ 185
           A++DR CI  G   +  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT      
Sbjct: 126 AMTDRICIQSGGGQSAELSALDLISCCK-DCGDGCQGGFPGVAWDYWVKRGIVTGGSKEN 184

Query: 186 -EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINS 237
              C PY        T   +P C    Y TP+C + C K  +  +   KHY   +Y + +
Sbjct: 185 HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQN 244

Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
           + + I  +I   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  +    Y
Sbjct: 245 NEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPY 303

Query: 298 WV 299
           W+
Sbjct: 304 WL 305


>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
           GN=cpr-6 PE=1 SV=1
          Length = 379

 Score =  212 bits (539), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 132/332 (39%), Positives = 177/332 (53%), Gaps = 51/332 (15%)

Query: 6   LFLTTCLLILGVISSQTFAEGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKAARN 59
           L   +C+++    +     E V+ K +   +DS   +   D +I  VNEN    W A + 
Sbjct: 4   LLFLSCIVVAAYCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTAKKQ 62

Query: 60  PQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSA 107
            +FS+        + G     K  L+GV              KT D  L +P+SFD+R  
Sbjct: 63  RRFSS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDN 114

Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 165
           WP+C +I  I DQ  CGSCWAFGAVEA+SDR CI  H  + ++LS +DLL+CC   CG G
Sbjct: 115 WPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCK-SCGFG 173

Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTP 209
           C+GG P++AWRY+V  G+VT     Y  + GC     P CE               YPTP
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTP 231

Query: 210 KCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
           KC +KCV    ++ +   K +  SAY +  D E I  E+  +GP+E++F VYEDF +Y  
Sbjct: 232 KCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDG 291

Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           GVY H  G + GGHAVKLIGWG  DDG  YW 
Sbjct: 292 GVYVHTGGKLGGGHAVKLIGWGI-DDGIPYWT 322


>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
          Length = 340

 Score =  212 bits (539), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 120/284 (42%), Positives = 162/284 (57%), Gaps = 35/284 (12%)

Query: 38  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
           L   ++  +N+    G +A  N  F N  +   K L G         LG P         
Sbjct: 26  LSSDLVNHINKLNTTG-RAGHN--FHNTDMSYVKKLCGT-------FLGGPKAPERVDFA 75

Query: 94  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
           + + LP +FD R  WP C TIS I DQG CGSCWAFGAVEA+SDR C+H    +S+ V+ 
Sbjct: 76  EDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSA 135

Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCE----- 203
            DLL+CCGF CG GC+GGYP  AWRY+   G+V+     Y    GC   + P CE     
Sbjct: 136 EDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGL--YDSHVGCRAYTIPPCEHHVNG 193

Query: 204 -------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
                      TP+C R C    +  ++  KHY I++Y +    ++IMAEIYKNGPVE +
Sbjct: 194 SRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGA 253

Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           F VYEDF  YKSGVY+H++G+ +GGHA++++GWG  ++G  YW+
Sbjct: 254 FIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWL 296


>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
           GN=cpr-5 PE=2 SV=1
          Length = 344

 Score =  203 bits (517), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 108/226 (47%), Positives = 134/226 (59%), Gaps = 22/226 (9%)

Query: 95  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
           S  +P  FDAR  WP C +I+ I DQ  CGSCWAF A EA+SDR CI  +  +N  LS  
Sbjct: 79  SDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSE 138

Query: 153 DLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGC 197
           DLL+CC   F CG+GC+GGYPI AW+++V HG+VT         C PY  +       G 
Sbjct: 139 DLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGV 198

Query: 198 SHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
             P C E   PTPKCV  C  KN     +   KH+  +AY +    E I  EI  NGP+E
Sbjct: 199 KWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIE 258

Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           V+FTVYEDF  Y +GVY H  G  +GGHAVK++GWG  D+G  YW+
Sbjct: 259 VAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWL 303


>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
          Length = 311

 Score =  196 bits (499), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 112/254 (44%), Positives = 147/254 (57%), Gaps = 24/254 (9%)

Query: 54  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCS 112
           W   +  QF N  VGQ   LLG K +P    L   +K++D   +++P SF+A++ WP C+
Sbjct: 39  WVEEQTDQFDNIKVGQ---LLGFKRSPNRPKL--QIKSYDPLGVQIPTSFNAQTNWPNCT 93

Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPI 172
           TIS+I +Q  CGSCWAFGA E+ +DR CIH   N+ LS  D++ C      +GC+GG   
Sbjct: 94  TISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDE--TDNGCEGGDAF 151

Query: 173 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRN 224
           SAW +    G V+EEC PY      + P C PA         TP C ++C   + L +  
Sbjct: 152 SAWNWLRKQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQ 205

Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
            KH     Y  +SD E IM EI  NGPVE  FTV+EDF  YKSGVY H TG  +GGH VK
Sbjct: 206 DKHKMAKIYSFDSD-EAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVK 264

Query: 285 LIGWGTSDDGEDYW 298
           L+G+GT  +G DY+
Sbjct: 265 LVGFGTL-NGVDYY 277


>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
           PE=1 SV=2
          Length = 329

 Score =  194 bits (492), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 100/211 (47%), Positives = 129/211 (61%), Gaps = 12/211 (5%)

Query: 98  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
           +P +FD+R+ W +C +I  I DQ  CGSCWAFGA E +SDR CI         +S +DLL
Sbjct: 85  VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144

Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
           +CCG  CG+GC+GGYPI A R++   GVVT        C PY  +  C+   C P   TP
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTP 202

Query: 210 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
            C   C    +  +   KH+ +SAY +  +   I AEIY NGPVE +F+VYEDF  YKSG
Sbjct: 203 SCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSG 262

Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           VYKH  G  +GGHA+K+IGWGT + G  YW+
Sbjct: 263 VYKHTAGKYLGGHAIKIIGWGT-ESGSPYWL 292


>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
          Length = 335

 Score =  188 bits (478), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 126/312 (40%), Positives = 173/312 (55%), Gaps = 39/312 (12%)

Query: 6   LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
           L   +CLL+L    S  +              L D ++  VN+     WKA  N  F N 
Sbjct: 5   LATLSCLLVLTSARSSLYFP-----------PLSDELVNFVNKQ-NTTWKAGHN--FYNV 50

Query: 66  TVGQFKHLLGVKPTPKGLLLGVPVKTHDK---SLKLPKSFDARSAWPQCSTISRILDQGH 122
            +   K L G       +L G  +   D     + LP+SFDAR  WP C TI  I DQG 
Sbjct: 51  DLSYVKKLCGA------ILGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGS 104

Query: 123 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
           CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L CCG  CGDGC+GG+P  AW ++  
Sbjct: 105 CGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTK 164

Query: 181 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 227
            G+V+         C PY           S P C     TPKC + C    +  ++  KH
Sbjct: 165 KGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKH 224

Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
           +  S+Y + ++ ++IMAEIYKNGPVE +F+VY DF  YKSGVY+H++G++MGGHA++++G
Sbjct: 225 FGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILG 284

Query: 288 WGTSDDGEDYWV 299
           WG  ++G  YW+
Sbjct: 285 WGV-ENGTPYWL 295


>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
           GN=cpr-3 PE=2 SV=1
          Length = 370

 Score =  187 bits (475), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 101/214 (47%), Positives = 126/214 (58%), Gaps = 16/214 (7%)

Query: 98  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
           LP +FDAR  WP C+TI  I +Q  CGSCWAFGA E +SDR CI         +SV D+L
Sbjct: 92  LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151

Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
           +CCG  CG GC GGY I A R++   G VT        C PY  S       C P   TP
Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTP 208

Query: 210 KCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHY 265
            C   C    K + ++  KHY  SAY++ +     +I  EIY  GPVE S+ VYEDF HY
Sbjct: 209 SCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHY 268

Query: 266 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           KSGVY + +G ++GGHAVK+IGWG  ++G DYW+
Sbjct: 269 KSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWL 301


>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
           GN=cpr-4 PE=2 SV=1
          Length = 335

 Score =  187 bits (474), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 120/280 (42%), Positives = 158/280 (56%), Gaps = 25/280 (8%)

Query: 39  QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-KSLK 97
           Q++I + VN   ++ WKA   P+  + T+ Q K  L            V V  HD     
Sbjct: 25  QEAITEYVNSK-QSLWKA-EIPK--DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINEDT 80

Query: 98  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
           +P +FDAR+ WP C +I+ I DQ  CGSCWAF A EA SDRFCI  +  +N  LS  D+L
Sbjct: 81  IPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140

Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTG-CSHPGC 202
           +CC   CG GC+GGYPI+AW+Y V  G  T         C PY      ++ G  + P C
Sbjct: 141 SCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSC 199

Query: 203 -EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
            +  Y TP CV KC  KN    +   KH+  +AY +      I AEI  +GPVE +FTVY
Sbjct: 200 PDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVY 259

Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           EDF  YK+GVY H TG  +GGHA++++GWGT D+G  YW+
Sbjct: 260 EDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGTPYWL 298


>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
           GN=AC-2 PE=2 SV=1
          Length = 342

 Score =  185 bits (470), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 108/284 (38%), Positives = 153/284 (53%), Gaps = 36/284 (12%)

Query: 33  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
           L +++ +   + EVN +P         P F        + ++ +K   + L L V  +  
Sbjct: 38  LVAYLRRSQNLFEVNSDP--------TPDFE-------QKIMSIKYKHQKLNLMVK-EDP 81

Query: 93  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
           D  + +P S+D R  W  C+T   I DQ +CGSCWA     A+SDR CI       +++S
Sbjct: 82  DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140

Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
             D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C H G  
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199

Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
                C    PTP C RKC     +++R  K Y   AY +    + I +EI KNGPV  S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVAS 259

Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           F VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWL 302


>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
           GN=AC-1 PE=2 SV=1
          Length = 342

 Score =  184 bits (467), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 96/224 (42%), Positives = 130/224 (58%), Gaps = 20/224 (8%)

Query: 93  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
           D  + +P S+D R  W  C+T   I DQ +CGSCWA     A+SDR CI       +++S
Sbjct: 82  DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140

Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
             D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C H G  
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199

Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
                C    PTP C RKC     +++R  K Y   AY +    + I +EI +NGPV  S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVAS 259

Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           F VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWL 302


>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
           GN=CP-1 PE=3 SV=3
          Length = 341

 Score =  169 bits (429), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 93/218 (42%), Positives = 127/218 (58%), Gaps = 19/218 (8%)

Query: 98  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
           +P+S+D R  W  CS++  I DQ +CGSCWA  +  A+SDR CI       + +S  D++
Sbjct: 91  IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150

Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCEPAY-- 206
           +CC + CGDGC+GG+PISA+R+    GVVT         C PY +   C H G E  Y  
Sbjct: 151 SCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGE 208

Query: 207 -----PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
                 TP+C R+C+        S  Y   AY++ +  + I  +I KNGPV  ++TVYED
Sbjct: 209 CVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYED 268

Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           FAHY+SG+YKH  G   G HAVK+IGWG  + G  YW+
Sbjct: 269 FAHYRSGIYKHKAGRKTGLHAVKVIGWG-EEKGTPYWI 305


>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
          Length = 299

 Score =  140 bits (354), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 95/255 (37%), Positives = 128/255 (50%), Gaps = 24/255 (9%)

Query: 49  NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
           NP+  WKA    +F   T  +   LL      K     VP  T   + + P SFD R  +
Sbjct: 28  NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQAPDSFDFREEY 84

Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
           P C  I  ++DQG CGSCWAF +V ++ DR C   G++   +  S   +++C     GD 
Sbjct: 85  PHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDM 138

Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
            CDGG+  S WR+    G  T+EC PY         G   A  T  C  KC   + L   
Sbjct: 139 ACDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHL 189

Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
            K      Y +  D   IM  +   GP++ +FTVY DF +Y+SGVY+H  G V GGHAV 
Sbjct: 190 YKATKAVDYGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVD 247

Query: 285 LIGWGTSDDGEDYWV 299
           ++G+GT DDG DYW+
Sbjct: 248 MVGYGTDDDGVDYWI 262


>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
          Length = 300

 Score =  131 bits (330), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 86/255 (33%), Positives = 128/255 (50%), Gaps = 23/255 (9%)

Query: 49  NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
           NP+  WKA    +F   T  +   LL      K      P  T      +P+SFD R  +
Sbjct: 28  NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGAAPRGTFTDKDDVPESFDFREEY 85

Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
           P C  I  ++DQG CGSCWAF +V    DR C+  G++   +  S   +++C     GD 
Sbjct: 86  PHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSCDH---GDM 139

Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
            C+GG+  + W++    G  T+EC PY   +      C    PT     KC   +     
Sbjct: 140 ACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHL 190

Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
           +   S   Y +  D   +M  +  +GP++V+F V+ DF +Y+SGVY+H  G + GGHAV+
Sbjct: 191 ATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVE 248

Query: 285 LIGWGTSDDGEDYWV 299
           ++G+GT DDG DYW+
Sbjct: 249 MVGYGTDDDGVDYWI 263


>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
           SV=1
          Length = 476

 Score =  127 bits (319), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 94/289 (32%), Positives = 135/289 (46%), Gaps = 36/289 (12%)

Query: 37  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
           ++Q  +I+ VN+    GW A    QF   T+ + FK+ LG  P P  LLL +   T    
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLT 212

Query: 94  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
           K+  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 213 KTTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270

Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
            +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 271 QNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329

Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
              + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+EDF
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384

Query: 263 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
            +YK+G+Y+HIT              HAVKL GWGT        E +W+
Sbjct: 385 FNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWI 433


>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
           SV=3
          Length = 476

 Score =  125 bits (315), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 89/288 (30%), Positives = 133/288 (46%), Gaps = 34/288 (11%)

Query: 37  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
           +++  +I++VN+    GW A    QF   T+   FK  LG + P+P  L +     +   
Sbjct: 155 LVRSELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 95  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A       
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330

Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V EDF 
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385

Query: 264 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
           HYK+G+Y+H+T           +  HAVKL GWGT        E +W+
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWI 433


>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
          Length = 303

 Score =  125 bits (314), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 87/255 (34%), Positives = 126/255 (49%), Gaps = 27/255 (10%)

Query: 54  WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 108
           WKA    +F N T  +F+ +L ++P       G L  + + +  +    +P  FD R  +
Sbjct: 31  WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89

Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 165
           PQC  +   LDQG CGSCWAF A+    DR C   G++   +S S   L++C   L   G
Sbjct: 90  PQC--VKPALDQGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144

Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 225
           CDGG     W +    G  T EC  Y D       G   A P P          QL++  
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197

Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 284
            +  +S     S P  IM  +   GP++    VY D ++Y+SGVYKH  G + +G HA++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALE 252

Query: 285 LIGWGTSDDGEDYWV 299
           ++G+GT+DDG DYW+
Sbjct: 253 IVGYGTTDDGTDYWI 267


>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
           GN=Tinagl1 PE=1 SV=1
          Length = 466

 Score =  113 bits (283), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 88/292 (30%), Positives = 131/292 (44%), Gaps = 39/292 (13%)

Query: 37  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
           ++   +IK +N     GW+A  +  F   T+ +  ++ LG ++P+   + +        +
Sbjct: 140 LVDPDMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQ 198

Query: 95  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256

Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
           +LL+C       GC GG    AW +    GVV++ C P+             A PTP+C+
Sbjct: 257 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQ------NEASPTPRCM 309

Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
                     R+   +    Q+  N  +    AYR+ SD ++IM E+ +NGPV+    V+
Sbjct: 310 MHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVH 369

Query: 260 EDFAHYKSGVYKHI--------TGDVMGGHAVKLIGWG--TSDDGE--DYWV 299
           EDF  Y+ G+Y H              G H+VK+ GWG  T  DG    YW 
Sbjct: 370 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 421


>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
           GN=Tinagl1 PE=2 SV=1
          Length = 467

 Score =  112 bits (280), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 87/292 (29%), Positives = 133/292 (45%), Gaps = 38/292 (13%)

Query: 37  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
           ++  ++IK +N     GW+A  +  F   T+ +  ++ LG ++P+   + +        +
Sbjct: 140 LVDPAMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQ 198

Query: 95  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256

Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
           +LL+C       GC GG    AW +    GVV++ C P+           + A PTP+C+
Sbjct: 257 NLLSCDTHH-QKGCRGGRLDGAWWFLRRRGVVSDNCYPF-----SGREQNDEASPTPRCM 310

Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
                     R+   +   +Q+  N  +     YR+ SD ++IM E+ +NGPV+    V+
Sbjct: 311 MHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQALMEVH 370

Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
           EDF  Y+ G+Y H              G H+VK+ GWG  T  DG    YW 
Sbjct: 371 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 422


>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
           GN=TINAGL1 PE=1 SV=1
          Length = 467

 Score =  110 bits (274), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 86/286 (30%), Positives = 132/286 (46%), Gaps = 27/286 (9%)

Query: 37  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199

Query: 95  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316

Query: 209 PKCVRKCVK--KNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
            +  R+      N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  Y
Sbjct: 317 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 376

Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
           K G+Y H    +         G H+VK+ GWG  T  DG    YW 
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 422


>sp|Q06544|CYSP3_OSTOS Cathepsin B-like cysteine proteinase 3 (Fragment) OS=Ostertagia
           ostertagi GN=CP-3 PE=3 SV=1
          Length = 174

 Score =  108 bits (271), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 63/141 (44%), Positives = 81/141 (57%), Gaps = 17/141 (12%)

Query: 174 AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 219
           AW+YF   GVVT         C PY +   C   G EP Y        TPKC + C +  
Sbjct: 1   AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59

Query: 220 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 278
            + ++  KH+  SAYR+ ++ + I  +I KNGPV   F VYEDFAHYKSG+YKH  G + 
Sbjct: 60  LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119

Query: 279 GGHAVKLIGWGTSDDGEDYWV 299
           GGHAVK+IGWG  + G  YW+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWL 139


>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
           SV=1
          Length = 435

 Score =  104 bits (259), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 76/269 (28%), Positives = 130/269 (48%), Gaps = 30/269 (11%)

Query: 41  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 100
             +K +N   K+ W A R  ++   T+      +G +  P+     +  + H++  +LP 
Sbjct: 148 EFVKAINTIQKS-WTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPT 206

Query: 101 SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACC 158
           S+D R+     + +S + +Q  CGSC+AF +   L  R  I      +  LS  ++++C 
Sbjct: 207 SWDWRNV-RGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCS 265

Query: 159 GFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
            +    GC+GG+P + A +Y    G+V E C PY    G   P C+P      C R    
Sbjct: 266 QY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR---- 311

Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH----- 272
               + +S++Y +  +    +   +  E+ ++GP+ V+F VY+DF HY+ G+Y H     
Sbjct: 312 ----YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRD 367

Query: 273 -ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
                 +  HAV L+G+GT S  G DYW+
Sbjct: 368 PFNPFELTNHAVLLVGYGTDSASGMDYWI 396


>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
           elegans GN=F26E4.3 PE=1 SV=3
          Length = 452

 Score =  103 bits (257), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 74/221 (33%), Positives = 107/221 (48%), Gaps = 18/221 (8%)

Query: 94  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 151
           K  +LP+ FDAR  W     I  + DQG CGS W+       SDR  I     +N +LS 
Sbjct: 180 KPRELPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSS 237

Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
             LL+C       GC+GGY   AW Y    GVV + C PY  S     PG          
Sbjct: 238 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 295

Query: 212 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
            R+ ++     ++S  + ++  Y+++S  EDI  E+  NGPV+ +F V+EDF  Y  GVY
Sbjct: 296 NRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 355

Query: 271 KH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWVC 300
           +H         +    G H+V+++GWG   ++     YW+C
Sbjct: 356 QHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLC 396


>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
          Length = 463

 Score = 96.3 bits (238), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 78/279 (27%), Positives = 132/279 (47%), Gaps = 47/279 (16%)

Query: 41  SIIKEVNENPKAGWKAARNPQFSNYTVGQF--------KHLLGVKPTPKGLLLGVPVKTH 92
           + +K +N   K+ W A    ++   T+G          + +   KPTP      +  +  
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPTP------LTAEIQ 225

Query: 93  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LS 150
            K L LP S+D R+     + +S + +Q  CGSC++F +V  L  R  I    + +  LS
Sbjct: 226 QKILHLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILS 284

Query: 151 VNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 209
             ++++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P         
Sbjct: 285 SQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--------- 330

Query: 210 KCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
                C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY++
Sbjct: 331 -----CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQN 385

Query: 268 GVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
           G+Y H           +  HAV L+G+GT S  G DYW+
Sbjct: 386 GIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWI 424


>sp|Q5RB02|CATC_PONAB Dipeptidyl peptidase 1 OS=Pongo abelii GN=CTSC PE=2 SV=1
          Length = 463

 Score = 94.7 bits (234), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 129/275 (46%), Gaps = 39/275 (14%)

Query: 41  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 96
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYKEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKVL 229

Query: 97  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNI-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEV 288

Query: 155 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 214 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 272 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
           H           +  HAV L+G+GT S  G DYW+
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWI 424


>sp|P53634|CATC_HUMAN Dipeptidyl peptidase 1 OS=Homo sapiens GN=CTSC PE=1 SV=2
          Length = 463

 Score = 94.4 bits (233), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 129/275 (46%), Gaps = 39/275 (14%)

Query: 41  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 96
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 97  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 155 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 214 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 272 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
           H           +  HAV L+G+GT S  G DYW+
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWI 424


>sp|Q3ZCJ8|CATC_BOVIN Dipeptidyl peptidase 1 OS=Bos taurus GN=CTSC PE=2 SV=1
          Length = 463

 Score = 92.0 bits (227), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 76/278 (27%), Positives = 129/278 (46%), Gaps = 47/278 (16%)

Query: 42  IIKEVNENPKAGWKAARNPQFSNYTV--------GQFKHLLGVKPTPKGLLLGVPVKTHD 93
            +K +N   K+ W AA   ++   T+        G  + +   KP P      +  +   
Sbjct: 174 FVKAINAIQKS-WTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPAP------ITAEIQK 226

Query: 94  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 151
           K L LP S+D R+     + ++ + +QG CGSC++F ++  +  R  I      +  LS 
Sbjct: 227 KILHLPTSWDWRNV-HGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSP 285

Query: 152 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 210
            ++++C  +    GC+GG+P + A +Y    G+V E+C PY   TG   P          
Sbjct: 286 QEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP---------- 330

Query: 211 CVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
               C  K   +R  +S+++ +  +    +   +  E+   GP+ V+F VY+DF HY+ G
Sbjct: 331 ----CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKG 386

Query: 269 VYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
           VY H           +  HAV L+G+GT +  G DYW+
Sbjct: 387 VYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWI 424


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score = 90.5 bits (223), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 83/285 (29%), Positives = 119/285 (41%), Gaps = 46/285 (16%)

Query: 20  SQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 79
           +Q FAEG VS  KL  +   D +  E  +               NYT+   K L     +
Sbjct: 94  NQRFAEGKVS-FKLAVNKYADLLHHEFRQLMNG----------FNYTL--HKQLRAADES 140

Query: 80  PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139
            KG+    P       + LPKS D    W     ++ + DQGHCGSCWAF +  AL  + 
Sbjct: 141 FKGVTFISPA-----HVTLPKSVD----WRTKGAVTAVKDQGHCGSCWAFSSTGALEGQH 191

Query: 140 CIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 199
               G+ +SLS  +L+ C      +GC+GG   +A+RY   +G +               
Sbjct: 192 FRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGID-------------- 237

Query: 200 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFT 257
              E +YP       C   K  +    + ++     I    E  MAE +   GPV V+  
Sbjct: 238 --TEKSYPYEAIDDSCHFNKGTVGATDRGFT----DIPQGDEKKMAEAVATVGPVSVAID 291

Query: 258 V-YEDFAHYKSGVYKHITGDVMG-GHAVKLIGWGTSDDGEDYWVC 300
             +E F  Y  GVY     D     H V ++G+GT + GEDYW+ 
Sbjct: 292 ASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLV 336


>sp|Q9R1T3|CATZ_RAT Cathepsin Z OS=Rattus norvegicus GN=Ctsz PE=1 SV=2
          Length = 306

 Score = 90.1 bits (222), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 97/212 (45%), Gaps = 24/212 (11%)

Query: 98  LPKSFDARSAWPQCSTISRILDQ---GHCGSCWAFGAVEALSDRFCIHFGM---NLSLSV 151
           LPK++D R+     +  S   +Q    +CGSCWA G+  AL+DR  I       +  LSV
Sbjct: 64  LPKNWDWRNV-NGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPSTLLSV 122

Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
            +++ C        C+GG  +  W Y   HG+  E C+ Y          C+       C
Sbjct: 123 QNVIDCGN---AGSCEGGNDLPVWEYAHKHGIPDETCNNY----QAKDQECDKFNQCGTC 175

Query: 212 V--RKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
              ++C  ++   LWR   + S+S        E +MAEIY NGP+       E  ++Y  
Sbjct: 176 TEFKECHTIQNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATERMSNYTG 229

Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           G+Y       +  H + + GWG S+DG +YW+
Sbjct: 230 GIYTEYQNQAIINHIISVAGWGVSNDGIEYWI 261


>sp|P80067|CATC_RAT Dipeptidyl peptidase 1 OS=Rattus norvegicus GN=Ctsc PE=1 SV=3
          Length = 462

 Score = 90.1 bits (222), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 78/285 (27%), Positives = 133/285 (46%), Gaps = 44/285 (15%)

Query: 32  KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV-------GQFKHLLGVKPTPKGLL 84
           +L SH    + +K +N   K+ W A    ++   ++       G    +L  KP P    
Sbjct: 166 RLYSH--NHNFVKAINSVQKS-WTATTYEEYEKLSIRDLIRRSGHSGRILRPKPAP---- 218

Query: 85  LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
             +  +   + L LP+S+D R+     + +S + +Q  CGSC++F ++  L  R  I   
Sbjct: 219 --ITDEIQQQILSLPESWDWRNV-RGINFVSPVRNQESCGSCYSFASLGMLEARIRILTN 275

Query: 145 MNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPG 201
            + +  LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY  +       
Sbjct: 276 NSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA----- 328

Query: 202 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
             P  P   C+R        + +S++Y +  +    +   +  E+ K+GP+ V+F V++D
Sbjct: 329 --PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDD 378

Query: 262 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWV 299
           F HY SG+Y H           +  HAV L+G+G     G DYW+
Sbjct: 379 FLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWI 423


>sp|P22497|CYSP_THEPA Cysteine proteinase OS=Theileria parva GN=TP03_0285 PE=3 SV=2
          Length = 440

 Score = 90.1 bits (222), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 75/259 (28%), Positives = 113/259 (43%), Gaps = 46/259 (17%)

Query: 61  QFSNYTVGQFKHLLGVKPTPKG-------LLLGVPVKTHDKSLK----------LPKSFD 103
           +FS+ T  +F  L  V   PK        LL  +  KT+ K+LK          L K   
Sbjct: 171 RFSDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDEDVDLAKLTG 230

Query: 104 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCG 163
               W + S+++ + DQ +CG CWAF  V ++   +  HF  +  LSV +LL C  F   
Sbjct: 231 ENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCDSF--S 288

Query: 164 DGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLW 222
           +GC GG   SA+ Y   +G+V+ +  P+ D +  CS P                      
Sbjct: 289 NGCQGGLLESAYEYVRKYGLVSAKDLPFVDKARRCSVP---------------------- 326

Query: 223 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 282
             +K  S+ +Y +    E +M     + P  V  +V  + A YKSGV+    G  +  HA
Sbjct: 327 -KAKKVSVPSYHVFKGKE-VMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECGKSL-NHA 383

Query: 283 VKLIGWGTSD-DGEDYWVC 300
           V L+G G  +   + YWV 
Sbjct: 384 VVLVGEGYDEVTKKRYWVV 402


>sp|Q9WUU7|CATZ_MOUSE Cathepsin Z OS=Mus musculus GN=Ctsz PE=2 SV=1
          Length = 306

 Score = 88.6 bits (218), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 98/212 (46%), Gaps = 24/212 (11%)

Query: 98  LPKSFDARSAWPQCSTISRILDQ---GHCGSCWAFGAVEALSDRFCIHFGM---NLSLSV 151
           LPK++D R+     +  S   +Q    +CGSCWA G+  A++DR  I       ++ LSV
Sbjct: 64  LPKNWDWRNV-NGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSILLSV 122

Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
            +++ C        C+GG  +  W Y   HG+  E C+ Y          C+       C
Sbjct: 123 QNVIDCGN---AGSCEGGNDLPVWEYAHKHGIPDETCNNY----QAKDQDCDKFNQCGTC 175

Query: 212 V--RKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
              ++C  ++   LWR   + S+S        E +MAEIY NGP+       E  ++Y  
Sbjct: 176 TEFKECHTIQNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATEMMSNYTG 229

Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           G+Y       +  H + + GWG S+DG +YW+
Sbjct: 230 GIYAEHQDQAVINHIISVAGWGVSNDGIEYWI 261


>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
          Length = 362

 Score = 87.4 bits (215), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 76/252 (30%), Positives = 109/252 (43%), Gaps = 47/252 (18%)

Query: 61  QFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 119
           +FS+ +  +F+   LG   T    L G  +     +  LP++ D    W +   +S + +
Sbjct: 108 RFSDMSWEEFQATRLGAAQTCSATLAGNHLMR--DAAALPETKD----WREDGIVSPVKN 161

Query: 120 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
           Q HCGSCW F    AL   +    G N+SLS   L+ C G     GC+GG P  A+ Y  
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221

Query: 180 HHGVV-TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
           ++G + TEE  PY    G  H   E                    N+    + +  I  +
Sbjct: 222 YNGGIDTEESYPYKGVNGVCHYKAE--------------------NAAVQVLDSVNITLN 261

Query: 239 PEDIMAEIYKNG-----PVEVSFTVYEDFAHYKSGVYKHITGDVMG------GHAVKLIG 287
            ED +    KN      PV V+F V + F  YKSGVY   T D  G       HAV  +G
Sbjct: 262 AEDEL----KNAVGLVRPVSVAFQVIDGFRQYKSGVY---TSDHCGTTPDDVNHAVLAVG 314

Query: 288 WGTSDDGEDYWV 299
           +G  ++G  YW+
Sbjct: 315 YGV-ENGVPYWL 325


>sp|Q24940|CATLL_FASHE Cathepsin L-like proteinase OS=Fasciola hepatica GN=Cat-1 PE=1 SV=1
          Length = 326

 Score = 87.0 bits (214), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 66/244 (27%), Positives = 108/244 (44%), Gaps = 31/244 (12%)

Query: 61  QFSNYTVGQFK--HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 118
           QF++ T  +FK  +L  +      L  GVP + +++++  P   D    W +   ++ + 
Sbjct: 71  QFTDMTFEEFKAKYLTEMSRASDILSHGVPYEANNRAV--PDKID----WRESGYVTEVK 124

Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
           DQG+CGSCWAF     +  ++  +   ++S S   L+ C G    +GC GG   +A++Y 
Sbjct: 125 DQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL 184

Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
              G+ TE   PY    G                 +C    QL           Y ++S 
Sbjct: 185 KQFGLETESSYPYTAVEG-----------------QCRYNKQL---GVAKVTGYYTVHSG 224

Query: 239 PE-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GDVMGGHAVKLIGWGTSDDGED 296
            E ++   +    P  V+  V  DF  Y+SG+Y+  T   +   HAV  +G+GT   G D
Sbjct: 225 SEVELKNLVGARRPAAVAVDVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGT-QGGTD 283

Query: 297 YWVC 300
           YW+ 
Sbjct: 284 YWIV 287


>sp|Q9UBR2|CATZ_HUMAN Cathepsin Z OS=Homo sapiens GN=CTSZ PE=1 SV=1
          Length = 303

 Score = 85.9 bits (211), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 65/211 (30%), Positives = 91/211 (43%), Gaps = 23/211 (10%)

Query: 98  LPKSFDARSAWPQCSTISRILDQGH----CGSCWAFGAVEALSDRFCIHFGM---NLSLS 150
           LPKS+D R+        + I    H    CGSCWA  +  A++DR  I       +  LS
Sbjct: 62  LPKSWDWRNV--DGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLS 119

Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPAYPT 208
           V +++ C        C+GG  +S W Y   HG+  E C+ Y   D        C      
Sbjct: 120 VQNVIDCGN---AGSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEF 176

Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
            +C    ++   LWR   + S+S        E +MAEIY NGP+       E  A+Y  G
Sbjct: 177 KEC--HAIRNYTLWRVGDYGSLSGR------EKMMAEIYANGPISCGIMATERLANYTGG 228

Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
           +Y          H V + GWG S DG +YW+
Sbjct: 229 IYAEYQDTTYINHVVSVAGWGIS-DGTEYWI 258


>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
           PE=2 SV=1
          Length = 358

 Score = 85.1 bits (209), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 82/281 (29%), Positives = 121/281 (43%), Gaps = 40/281 (14%)

Query: 28  VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLL 85
           V ++KL   + ++++    + N K   +K + N QF++ T  +F ++ LG        L 
Sbjct: 73  VEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLN-QFADLTWQEFQRYKLGAAQNCSATLK 131

Query: 86  GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 145
           G    T      +P + D    W +   +S + +QGHCGSCW F    AL   +   FG 
Sbjct: 132 GSHKITE---ATVPDTKD----WREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGK 184

Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPYFDSTGCSHPGCE- 203
            +SLS   L+ C G     GC GG P  A+ Y  ++ G+ TEE  PY    G    GC+ 
Sbjct: 185 GISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG----GCKF 240

Query: 204 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
            A      VR  V       +   +++   R                PV V+F V  +F 
Sbjct: 241 SAKNIGVQVRDSVNITLGAEDELKHAVGLVR----------------PVSVAFEVVHEFR 284

Query: 264 HYKSGVYKHIT-----GDVMGGHAVKLIGWGTSDDGEDYWV 299
            YK GV+   T      DV   HAV  +G+G  DD   YW+
Sbjct: 285 FYKKGVFTSNTCGNTPMDV--NHAVLAVGYGVEDD-VPYWL 322


>sp|P05689|CATZ_BOVIN Cathepsin Z OS=Bos taurus GN=CTSZ PE=2 SV=2
          Length = 304

 Score = 84.7 bits (208), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 66/214 (30%), Positives = 96/214 (44%), Gaps = 29/214 (13%)

Query: 98  LPKSFDARSAWPQCSTISRILDQ---GHCGSCWAFGAVEALSDRFCIHFGM---NLSLSV 151
           LPKS+D R+     +  S   +Q    +CGSCWA G+  A++DR  I       +  LSV
Sbjct: 63  LPKSWDWRNV-NGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSV 121

Query: 152 NDLLACCGFLCGDG--CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 209
             ++ C     GD   C+GG  +  W Y   HG+  E C+ Y          C+      
Sbjct: 122 QHVIDC-----GDAGSCEGGNDLPVWEYAHRHGIPDETCNNYQ----AKDQECDKFNQCG 172

Query: 210 KCV--RKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
            C   ++C  +K   LW+   + S+S        E +MAEIY NGP+       E  ++Y
Sbjct: 173 TCTEFKECHVIKNYTLWKVGDYGSLSGR------EKMMAEIYTNGPISCGIMATEKMSNY 226

Query: 266 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
             G+Y          H V + GWG S DG +YW+
Sbjct: 227 TGGIYSEYNDQAFINHIVSVAGWGVS-DGMEYWI 259


>sp|Q26563|CATC_SCHMA Cathepsin C OS=Schistosoma mansoni PE=2 SV=1
          Length = 454

 Score = 84.3 bits (207), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 84/282 (29%), Positives = 125/282 (44%), Gaps = 40/282 (14%)

Query: 38  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL- 96
           +  S + ++N + K+ W+    P+ S YT+ + ++  G   +       +  KT  K L 
Sbjct: 154 INPSFVGKINAHQKS-WRGEIYPELSKYTIDELRNRAGGVKSMVTRPSVLNRKTPSKELI 212

Query: 97  ----KLPKSFDARSAWPQCST--ISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLS 148
                LP  FD  S  P  S   ++ I +QG CGSC+A  +  AL  R  +  +F     
Sbjct: 213 SLTGNLPLEFDWTSP-PDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQPI 271

Query: 149 LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 207
           LS   ++ C  +   +GC+GG+P + A +Y    G+  +   PY   TG           
Sbjct: 272 LSPQTVVDCSPY--SEGCNGGFPFLIAGKYGEDFGLPQKIVIPY---TGED--------- 317

Query: 208 TPKCVRKCVKKNQLWRNSKHYS-ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
           T KC    V KN     +  YS I  Y   ++ + +  E+  NGP  V F VYEDF  YK
Sbjct: 318 TGKCT---VSKNCTRYYTTDYSYIGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQFYK 374

Query: 267 SGVYKHITGDV---------MGGHAVKLIGWGTSD-DGEDYW 298
            G+Y H T            +  HAV L+G+G     GE YW
Sbjct: 375 EGIYHHTTVQTDHYNFNPFELTNHAVLLVGYGVDKLSGEPYW 416


>sp|P97821|CATC_MOUSE Dipeptidyl peptidase 1 OS=Mus musculus GN=Ctsc PE=2 SV=1
          Length = 462

 Score = 84.0 bits (206), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 73/276 (26%), Positives = 127/276 (46%), Gaps = 42/276 (15%)

Query: 41  SIIKEVNENPKAGWKAARNPQFSNYTV-------GQFKHLLGVKPTPKGLLLGVPVKTHD 93
           + +K +N   K+ W A    ++   ++       G  + +   KP P      +  +   
Sbjct: 173 NFVKAINTVQKS-WTATAYKEYEKMSLRDLIRRSGHSQRIPRPKPAP------MTDEIQQ 225

Query: 94  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 151
           + L LP+S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS 
Sbjct: 226 QILNLPESWDWRNV-QGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSP 284

Query: 152 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 210
            ++++C  +    GCDGG+P + A +Y    GVV E C PY            P  P   
Sbjct: 285 QEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS-------PCKPREN 335

Query: 211 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
           C+R        + +S +Y +  +    +   +  E+ K+GP+ V+F V++DF HY SG+Y
Sbjct: 336 CLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY 387

Query: 271 KH------ITGDVMGGHAVKLIGWGTSD-DGEDYWV 299
            H           +  HAV L+G+G     G +YW+
Sbjct: 388 HHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWI 423


>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
          Length = 328

 Score = 84.0 bits (206), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 80/292 (27%), Positives = 129/292 (44%), Gaps = 38/292 (13%)

Query: 20  SQTFAEGVVSKLKLDSHILQDSI--IKEVNENPKAGWKAARNPQFSNYTVGQFKHL-LGV 76
           S + + G++++     +I +D++  I   NEN K          F+N T  +++ L LG 
Sbjct: 14  SNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGA 73

Query: 77  KPTPKGLLLGVPVKTHDKSLKLPKSFDARSA-----WPQCSTISRILDQGHCGSCWAFGA 131
           +  P   +     K  + ++K   + +         W Q   ++ I DQG CGSCWAF  
Sbjct: 74  RTEPVRRI----TKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFST 129

Query: 132 VEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD-P 190
             A+     I  G  +SLS  +L+ C       GC+GG    A+++ + +G +  E D P
Sbjct: 130 AAAVEGINKIVTGELVSLSEQELVDCDKSY-NQGCNGGLMDYAFQFIMKNGGLNTEKDYP 188

Query: 191 YFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR-INSDPEDIMAEIYKN 249
           Y  + G             KC       N L +NS+  +I  Y  + S  E  +      
Sbjct: 189 YHGTNG-------------KC-------NSLLKNSRVVTIDGYEDVPSKDETALKRAVSY 228

Query: 250 GPVEVSFTV-YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
            PV V+       F HY+SG++    G  M  HAV  +G+G S++G DYW+ 
Sbjct: 229 QPVSVAIDAGGRAFQHYQSGIFTGKCGTNM-DHAVVAVGYG-SENGVDYWIV 278


>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
          Length = 356

 Score = 83.2 bits (204), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 119/283 (42%), Gaps = 44/283 (15%)

Query: 28  VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQF-KHLLGVK----PTPK 81
           V ++K    I  D++    + N K   +K   N +F++ T  +F KH LG       T K
Sbjct: 71  VEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGIN-EFTDLTWDEFRKHKLGASQNCSATTK 129

Query: 82  GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
           G L          ++ LP++ D    W +   +S +  QG CGSCW F    AL   +  
Sbjct: 130 GNL-------KLTNVVLPETKD----WRKDGIVSPVKAQGKCGSCWTFSTTGALEAAYAQ 178

Query: 142 HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF-VHHGVVTEECDPYFDSTG-CSH 199
            FG  +SLS   L+ C G     GC+GG P  A+ Y   + G+ TEE  PY    G C  
Sbjct: 179 AFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNGICKF 238

Query: 200 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
                       V   +      +    Y+++  R                PV V+F V 
Sbjct: 239 SQANIGVKVISSVNITLGAEYELK----YAVALVR----------------PVSVAFEVV 278

Query: 260 EDFAHYKSGVYKHIT-GD--VMGGHAVKLIGWGTSDDGEDYWV 299
           + F  YKSGVY     GD  +   HAV  +G+G  ++G  YW+
Sbjct: 279 KGFKQYKSGVYASTECGDTPMDVNHAVLAVGYGV-ENGTPYWL 320


>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
           PE=1 SV=2
          Length = 466

 Score = 82.4 bits (202), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 68/254 (26%), Positives = 115/254 (45%), Gaps = 30/254 (11%)

Query: 49  NPKAGWKAARNPQFSNYTVGQFKH-LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSA 107
           + + G++   N +F++ T  +F+   LG K   +    G   + HD   +LP+S D    
Sbjct: 93  DERGGFRLGMN-RFADLTNEEFRATFLGAKVAERSRAAGERYR-HDGVEELPESVD---- 146

Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 167
           W +   ++ + +QG CGSCWAF AV  +     +  G  ++LS  +L+ C       GC+
Sbjct: 147 WREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCN 206

Query: 168 GGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 227
           GG    A+ + + +G +  E D                YP      KC    +   N+K 
Sbjct: 207 GGLMDDAFDFIIKNGGIDTEDD----------------YPYKAVDGKCDINRE---NAKV 247

Query: 228 YSISAYR-INSDPEDIMAEIYKNGPVEVSFTV-YEDFAHYKSGVYKHITGDVMGGHAVKL 285
            SI  +  +  + E  + +   + PV V+      +F  Y SGV+    G  +  H V  
Sbjct: 248 VSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSL-DHGVVA 306

Query: 286 IGWGTSDDGEDYWV 299
           +G+GT D+G+DYW+
Sbjct: 307 VGYGT-DNGKDYWI 319


>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
           GN=At3g43960 PE=2 SV=1
          Length = 376

 Score = 82.0 bits (201), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 75/266 (28%), Positives = 121/266 (45%), Gaps = 24/266 (9%)

Query: 37  ILQDSI--IKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGVKPTPKGLLLGVPVKTHD 93
           I +D++  I+E N +P   ++   N +FS+ T  +F+   LG K   K L        + 
Sbjct: 64  IFKDNLKRIEEHNSDPNRSYERGLN-KFSDLTADEFQASYLGGKMEKKSLSDVAERYQYK 122

Query: 94  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND 153
           +   LP   D R    + + + R+  QG CGSCWAF A  A+     I  G  +SLS  +
Sbjct: 123 EGDVLPDEVDWRE---RGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQE 179

Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
           L+ C       GC GG  + A+ +   +G +    D  +  TG     C       K + 
Sbjct: 180 LIDCDRGNDNFGCAGGGAVWAFEFIKENGGIV--SDEVYGYTGEDTAAC-------KAIE 230

Query: 214 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 273
             +K  ++   + H  +    +N +     A  Y+  P+ V  +   + + YKSGVYK  
Sbjct: 231 --MKTTRVVTINGHEVVP---VNDEMSLKKAVAYQ--PISVMISA-ANMSDYKSGVYKGA 282

Query: 274 TGDVMGGHAVKLIGWGTSDDGEDYWV 299
             ++ G H V ++G+GTS D  DYW+
Sbjct: 283 CSNLWGDHNVLIVGYGTSSDEGDYWL 308


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.321    0.138    0.455 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 126,105,014
Number of Sequences: 539616
Number of extensions: 5588416
Number of successful extensions: 10821
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 204
Number of HSP's successfully gapped in prelim test: 28
Number of HSP's that attempted gapping in prelim test: 10338
Number of HSP's gapped (non-prelim): 261
length of query: 300
length of database: 191,569,459
effective HSP length: 117
effective length of query: 183
effective length of database: 128,434,387
effective search space: 23503492821
effective search space used: 23503492821
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (28.1 bits)