BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 018707
         (351 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
          Length = 339

 Score =  283 bits (724), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 152/345 (44%), Positives = 206/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A+   ++ +   H L D ++  VN+     W+A  N  F N  V   K L G 
Sbjct: 8   LCCLLALAD---ARSRPSFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDVSYLKKLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
             DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 ERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
          Length = 339

 Score =  283 bits (724), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 149/326 (45%), Positives = 200/326 (61%), Gaps = 28/326 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 89
           H L D ++  VN+     W+A  N  F N  V   K L G     P P   ++       
Sbjct: 24  HPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYLKRLCGTFLGGPKPPQRVM------F 74

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
            + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+
Sbjct: 75  TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134

Query: 150 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCS 195
             DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S
Sbjct: 135 AEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGS 194

Query: 196 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFS 254

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI 
Sbjct: 255 VYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKIL 313

Query: 315 RGSNECGIEEDVVAGLPSSKNLVKEI 340
           RG + CGIE +VVAG+P +    ++I
Sbjct: 314 RGQDHCGIESEVVAGIPRTDQYWEKI 339


>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
          Length = 339

 Score =  281 bits (720), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 151/345 (43%), Positives = 205/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A    ++ +   H L D ++  VN+     W+A  N  F N  +   K L G 
Sbjct: 8   LCCLLVLAN---ARSRPSFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
          Length = 339

 Score =  277 bits (708), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 151/344 (43%), Positives = 201/344 (58%), Gaps = 36/344 (10%)

Query: 10  PILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 69
           P+ CL    +  +      K  SH L D +I  +N+     W+A RN  F N  +   K 
Sbjct: 7   PLSCLLALTSAHD------KPSSHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKK 57

Query: 70  LLGVKPTPKGLLLGVPVKTH----DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           L G        +LG P         + + LP+SFDAR  W  C TI++I DQG CGSCWA
Sbjct: 58  LCGT-------VLGGPNLPERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWA 110

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 183
           FGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+ 
Sbjct: 111 FGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSG 170

Query: 184 E-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAY 230
                   C PY           S P C     TPKC + C    +  ++  KHY  ++Y
Sbjct: 171 GVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSY 230

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 290
            ++   ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++
Sbjct: 231 SVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-EN 289

Query: 291 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           G  YW++AN WN  WG +G+FKI RG N CGIE ++VAG+P ++
Sbjct: 290 GVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPRTQ 333


>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
           SV=1
          Length = 340

 Score =  269 bits (687), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 144/340 (42%), Positives = 200/340 (58%), Gaps = 20/340 (5%)

Query: 7   IMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           ++  ILC+    TF E  +S        L D II  +NE+P AGW+A ++ +F +    +
Sbjct: 1   MLTSILCIASLITFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLDDAR 60

Query: 67  FKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
            + +   +  P       P   H D ++++P +FD+R  WP C +I+ I DQ  CGSCW+
Sbjct: 61  IQ-MGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWS 119

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 183
           FGAVEA+SDR CI  G   N+ LS  DLL CC   CG GC+GG    AW Y+V  G+VT 
Sbjct: 120 FGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGIVTA 178

Query: 184 E-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISA 229
                   C+PY        T   +P C    Y TP+C + C +K +  +   KH   S+
Sbjct: 179 SSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSS 238

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y + +D + I  EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  +
Sbjct: 239 YNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-E 297

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           +   YW++AN WN  WG +GYF+I RG +EC IE +V+AG
Sbjct: 298 NKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVIAG 337


>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
          Length = 339

 Score =  266 bits (680), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 142/320 (44%), Positives = 191/320 (59%), Gaps = 30/320 (9%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 88
           H L D +I  +N+     W+A RN  F N  +   K L    LG    P  +  G     
Sbjct: 24  HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 146
             + + LP++FDAR  W  C TI +I DQG CGSCWAFGAVEA+SDR CIH    +N+ +
Sbjct: 76  --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 194
           S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY           
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNG 193

Query: 195 SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           S P C     TP+C + C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPVE +F
Sbjct: 194 SRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAF 253

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TV+ DF  YKSGVYKH  GD+MGGHA++++GWG  ++G  YW+ AN WN  WG +G+FKI
Sbjct: 254 TVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKI 312

Query: 314 KRGSNECGIEEDVVAGLPSS 333
            RG N CGIE ++VAG+P +
Sbjct: 313 LRGENHCGIESEIVAGIPRT 332


>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
          Length = 335

 Score =  265 bits (677), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 194/343 (56%), Gaps = 35/343 (10%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L+C         ++  L    L D ++  +N+     W A  N  F N  +   K L G 
Sbjct: 8   LSCLVLLTS---ARESLHFQPLSDELVNFINKQ-NTTWTAGHN--FYNVDLSYVKKLCGT 61

Query: 74  KPTPKGLLLGVPVKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
                   LG P      +      LPKSFDAR  WP C TI  I DQG CGSCWAFGAV
Sbjct: 62  -------FLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114

Query: 130 EALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 184
           EA+SDR CI     +N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+     
Sbjct: 115 EAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYD 174

Query: 185 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 233
               C PY     C H      P C     TPKC + C       ++  KH+  S+Y I+
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSIS 233

Query: 234 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 293
            + ++IMAEIYKNGPVE +FTVY DF  YKSGVY+H+TGD+MGGHA++++GWG  ++G  
Sbjct: 234 RNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTP 292

Query: 294 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           YW++ N WN  WG +G+FKI RG + CGIE ++VAG+P + + 
Sbjct: 293 YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPCTPHF 335


>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
          Length = 340

 Score =  258 bits (660), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 146/343 (42%), Positives = 194/343 (56%), Gaps = 40/343 (11%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           ILCL      A  +     L S ++    I ++N   +AG        F N  +   K L
Sbjct: 7   ILCLLGAFANARSIPYYPPLSSDLVNH--INKLNTTGRAG------HNFHNTDMSYVKKL 58

Query: 71  LGVKPTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
            G         LG P         + + LP +FD R  WP C TIS I DQG CGSCWAF
Sbjct: 59  CGT-------FLGGPKAPERVDFAEDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAF 111

Query: 127 GAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
           GAVEA+SDR C+H    +S+ V+  DLL+CCGF CG GC+GGYP  AWRY+   G+V+  
Sbjct: 112 GAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGG 171

Query: 185 CDPYFDSTGC---SHPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSIS 228
              Y    GC   + P CE                TP+C R C    +  ++  KHY I+
Sbjct: 172 L--YDSHVGCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGIT 229

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           +Y +    ++IMAEIYKNGPVE +F VYEDF  YKSGVY+H++G+ +GGHA++++GWG  
Sbjct: 230 SYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV- 288

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           ++G  YW+ AN WN  WG  G+FKI RG + CGIE ++VAG+P
Sbjct: 289 ENGTPYWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAGVP 331


>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
           GN=cpr-6 PE=1 SV=1
          Length = 379

 Score =  257 bits (657), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 157/376 (41%), Positives = 205/376 (54%), Gaps = 59/376 (15%)

Query: 8   MDPILCLTCFATFA--------EGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKA 53
           M  +L L+C    A        E V+ K +   +DS   +   D +I  VNEN    W A
Sbjct: 1   MKTLLFLSCIVVAAYCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTA 59

Query: 54  ARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDA 101
            +  +FS+        + G     K  L+GV              KT D  L +P+SFD+
Sbjct: 60  KKQRRFSS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDS 111

Query: 102 RSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLC 159
           R  WP+C +I  I DQ  CGSCWAFGAVEA+SDR CI  H  + ++LS +DLL+CC   C
Sbjct: 112 RDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-C 170

Query: 160 GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAY 203
           G GC+GG P++AWRY+V  G+VT     Y  + GC     P CE               Y
Sbjct: 171 GFGCNGGDPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLY 228

Query: 204 PTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
           PTPKC +KCV    ++ +   K +  SAY +  D E I  E+  +GP+E++F VYEDF +
Sbjct: 229 PTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLN 288

Query: 262 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           Y  GVY H  G + GGHAVKLIGWG  DDG  YW +AN WN  WG DG+F+I RG +ECG
Sbjct: 289 YDGGVYVHTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECG 347

Query: 322 IEEDVVAGLPSSKNLV 337
           IE  VV G+P   +L 
Sbjct: 348 IESGVVGGIPKLNSLT 363


>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
           GN=CATB PE=2 SV=1
          Length = 342

 Score =  257 bits (656), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 135/343 (39%), Positives = 197/343 (57%), Gaps = 22/343 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F  +++ 
Sbjct: 1   MLKIAVYIVSLFTFLEAHVTTRNNQRIEPLSDEMISFINEHPDAGWKADKSDRF--HSLD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + L+G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSC
Sbjct: 59  DARILMGARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAFGAVEA++DR CI  G   +  LS  DL++CC   CGDGC GG+P  AW Y+V  G+V
Sbjct: 119 WAFGAVEAMTDRICIQSGGGQSAELSALDLISCCK-DCGDGCQGGFPGVAWDYWVKRGIV 177

Query: 182 T-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSI 227
           T         C PY        T   +P C    Y TP+C + C K  +  +   KHY  
Sbjct: 178 TGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGD 237

Query: 228 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
            +Y + ++ + I  +I   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG 
Sbjct: 238 ESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV 297

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
            +    YW++AN WN  WG  G F++ RG +EC IE DVVAGL
Sbjct: 298 -EKRTPYWLIANSWNEDWGEKGLFRMVRGRDECSIESDVVAGL 339


>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
           GN=cpr-5 PE=2 SV=1
          Length = 344

 Score =  253 bits (645), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 129/258 (50%), Positives = 159/258 (61%), Gaps = 22/258 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           +P  FDAR  WP C +I+ I DQ  CGSCWAF A EA+SDR CI  +  +N  LS  DLL
Sbjct: 82  IPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141

Query: 153 ACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHP 197
           +CC   F CG+GC+GGYPI AW+++V HG+VT         C PY  +       G   P
Sbjct: 142 SCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWP 201

Query: 198 GC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
            C E   PTPKCV  C  KN     +   KH+  +AY +    E I  EI  NGP+EV+F
Sbjct: 202 ACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAF 261

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TVYEDF  Y +GVY H  G  +GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I
Sbjct: 262 TVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNVAWGEKGYFRI 320

Query: 314 KRGSNECGIEEDVVAGLP 331
            RG NECGIE   VAG+P
Sbjct: 321 IRGLNECGIEHSAVAGIP 338


>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
          Length = 311

 Score =  238 bits (607), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 134/290 (46%), Positives = 172/290 (59%), Gaps = 26/290 (8%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCS 109
           W   +  QF N  VGQ   LLG K +P    L   +K++D   +++P SF+A++ WP C+
Sbjct: 39  WVEEQTDQFDNIKVGQ---LLGFKRSPNRPKL--QIKSYDPLGVQIPTSFNAQTNWPNCT 93

Query: 110 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPI 169
           TIS+I +Q  CGSCWAFGA E+ +DR CIH   N+ LS  D++ C      +GC+GG   
Sbjct: 94  TISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDE--TDNGCEGGDAF 151

Query: 170 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRN 221
           SAW +    G V+EEC PY      + P C PA         TP C ++C   + L +  
Sbjct: 152 SAWNWLRKQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQ 205

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
            KH     Y  +SD E IM EI  NGPVE  FTV+EDF  YKSGVY H TG  +GGH VK
Sbjct: 206 DKHKMAKIYSFDSD-EAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVK 264

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           L+G+GT  +G DY+   NQW  SWG +G F IKRG  +CGI +DVVAGLP
Sbjct: 265 LVGFGTL-NGVDYYAANNQWTTSWGDNGTFLIKRG--DCGISDDVVAGLP 311


>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
           GN=cpr-4 PE=2 SV=1
          Length = 335

 Score =  238 bits (607), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 193/339 (56%), Gaps = 28/339 (8%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           L L        G+V  L   +   Q++I + VN   ++ WKA   P+  + T+ Q K  L
Sbjct: 4   LILAALVAVTAGLVIPLVPKT---QEAITEYVNSK-QSLWKA-EIPK--DITIEQVKKRL 56

Query: 72  GVKPTPKGLLLGVPVKTHD-KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
                       V V  HD     +P +FDAR+ WP C +I+ I DQ  CGSCWAF A E
Sbjct: 57  MRTEFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAE 116

Query: 131 ALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A SDRFCI  +  +N  LS  D+L+CC   CG GC+GGYPI+AW+Y V  G  T      
Sbjct: 117 AASDRFCIASNGAVNTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEA 175

Query: 185 ---CDPYF-----DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRI 232
              C PY      ++ G  + P C +  Y TP CV KC  KN    +   KH+  +AY +
Sbjct: 176 QFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAV 235

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 292
                 I AEI  +GPVE +FTVYEDF  YK+GVY H TG  +GGHA++++GWGT D+G 
Sbjct: 236 GKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGT 294

Query: 293 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            YW++AN WN +WG +GYF+I RG+NECGIE  VV G+P
Sbjct: 295 PYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVP 333


>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
           PE=1 SV=2
          Length = 329

 Score =  236 bits (602), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 119/244 (48%), Positives = 153/244 (62%), Gaps = 12/244 (4%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 152
           +P +FD+R+ W +C +I  I DQ  CGSCWAFGA E +SDR CI         +S +DLL
Sbjct: 85  VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           +CCG  CG+GC+GGYPI A R++   GVVT        C PY  +  C+   C P   TP
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTP 202

Query: 207 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
            C   C    +  +   KH+ +SAY +  +   I AEIY NGPVE +F+VYEDF  YKSG
Sbjct: 203 SCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSG 262

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VYKH  G  +GGHA+K+IGWGT + G  YW++AN W  +WG  G+FKI RG ++CGIE  
Sbjct: 263 VYKHTAGKYLGGHAIKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESA 321

Query: 326 VVAG 329
           VVAG
Sbjct: 322 VVAG 325


>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
          Length = 335

 Score =  234 bits (598), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 139/315 (44%), Positives = 190/315 (60%), Gaps = 28/315 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--- 91
           L D ++  VN+     WKA  N  F N  +   K L G       +L G  +   D    
Sbjct: 26  LSDELVNFVNKQ-NTTWKAGHN--FYNVDLSYVKKLCGA------ILGGPKLPQRDAFAA 76

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
            + LP+SFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  
Sbjct: 77  DVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAE 136

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHP 197
           D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY           S P
Sbjct: 137 DMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRP 196

Query: 198 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY
Sbjct: 197 PCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVY 256

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG
Sbjct: 257 SDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRG 315

Query: 317 SNECGIEEDVVAGLP 331
            + CGIE ++VAG+P
Sbjct: 316 QDHCGIESEIVAGMP 330


>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
           GN=cpr-3 PE=2 SV=1
          Length = 370

 Score =  232 bits (592), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 125/266 (46%), Positives = 160/266 (60%), Gaps = 22/266 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 152
           LP +FDAR  WP C+TI  I +Q  CGSCWAFGA E +SDR CI         +SV D+L
Sbjct: 92  LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           +CCG  CG GC GGY I A R++   G VT        C PY  S       C P   TP
Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTP 208

Query: 207 KCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHY 262
            C   C    K + ++  KHY  SAY++ +     +I  EIY  GPVE S+ VYEDF HY
Sbjct: 209 SCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHY 268

Query: 263 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 322
           KSGVY + +G ++GGHAVK+IGWG  ++G DYW++AN W  S+G  G+FKI+RG+NEC I
Sbjct: 269 KSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQI 327

Query: 323 EEDVVAGLPSSKNLVKEITSADMFED 348
           E +VVAG      + K  T ++ +ED
Sbjct: 328 EGNVVAG------IAKLGTHSETYED 347


>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
           GN=AC-2 PE=2 SV=1
          Length = 342

 Score =  231 bits (590), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 128/324 (39%), Positives = 182/324 (56%), Gaps = 36/324 (11%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           L +++ +   + EVN +P         P F        + ++ +K   + L L V  +  
Sbjct: 38  LVAYLRRSQNLFEVNSDP--------TPDFE-------QKIMSIKYKHQKLNLMVK-EDP 81

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 147
           D  + +P S+D R  W  C+T   I DQ +CGSCWA     A+SDR CI       +++S
Sbjct: 82  DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 198
             D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C H G  
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199

Query: 199 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
                C    PTP C RKC     +++R  K Y   AY +    + I +EI KNGPV  S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVAS 259

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           F VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W++AN W+  WG  GYF+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWGEKGYFR 318

Query: 313 IKRGSNECGIEEDVVAGLPSSKNL 336
           I RGSN+CGIE  + AG+  +++L
Sbjct: 319 IVRGSNDCGIEGTIAAGIVDTESL 342


>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
           GN=AC-1 PE=2 SV=1
          Length = 342

 Score =  229 bits (585), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 115/264 (43%), Positives = 159/264 (60%), Gaps = 20/264 (7%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 147
           D  + +P S+D R  W  C+T   I DQ +CGSCWA     A+SDR CI       +++S
Sbjct: 82  DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 198
             D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C H G  
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199

Query: 199 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
                C    PTP C RKC     +++R  K Y   AY +    + I +EI +NGPV  S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVAS 259

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           F VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W++AN W+  WG  GYF+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWGEKGYFR 318

Query: 313 IKRGSNECGIEEDVVAGLPSSKNL 336
           I RG+N+CGIE  + AG+  +++L
Sbjct: 319 IIRGTNDCGIEGTIAAGIVDTESL 342


>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
           GN=CP-1 PE=3 SV=3
          Length = 341

 Score =  212 bits (540), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 111/251 (44%), Positives = 152/251 (60%), Gaps = 19/251 (7%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           +P+S+D R  W  CS++  I DQ +CGSCWA  +  A+SDR CI       + +S  D++
Sbjct: 91  IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCEPAY-- 203
           +CC + CGDGC+GG+PISA+R+    GVVT         C PY +   C H G E  Y  
Sbjct: 151 SCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGE 208

Query: 204 -----PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
                 TP+C R+C+        S  Y   AY++ +  + I  +I KNGPV  ++TVYED
Sbjct: 209 CVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYED 268

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           FAHY+SG+YKH  G   G HAVK+IGWG  + G  YWI+AN W+  WG +G+F++ RGSN
Sbjct: 269 FAHYRSGIYKHKAGRKTGLHAVKVIGWG-EEKGTPYWIVANSWHDDWGENGFFRMHRGSN 327

Query: 319 ECGIEEDVVAG 329
           +CG EE + AG
Sbjct: 328 DCGFEERMAAG 338


>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
          Length = 299

 Score =  186 bits (471), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 117/299 (39%), Positives = 155/299 (51%), Gaps = 28/299 (9%)

Query: 40  IKEVNE----NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 95
           + E+N     NP+  WKA    +F   T  +   LL      K     VP  T   + + 
Sbjct: 18  VSELNHIKSLNPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQA 74

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLL 152
           P SFD R  +P C  I  ++DQG CGSCWAF +V ++ DR C   G++   +  S   ++
Sbjct: 75  PDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVV 131

Query: 153 ACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           +C     GD  CDGG+  S WR+    G  T+EC PY         G   A  T  C  K
Sbjct: 132 SCDR---GDMACDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTK 179

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
           C   + L    K      Y +  D   IM  +   GP++ +FTVY DF +Y+SGVY+H  
Sbjct: 180 CADGSDLPHLYKATKAVDYGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTY 237

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           G V GGHAV ++G+GT DDG DYWI+ N W   WG DGYF+I R +NECGIEE V+ G 
Sbjct: 238 GRVEGGHAVDMVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 296


>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
          Length = 300

 Score =  172 bits (435), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 106/299 (35%), Positives = 153/299 (51%), Gaps = 27/299 (9%)

Query: 40  IKEVNE----NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 95
           + E+N     NP+  WKA    +F   T  +   LL      K      P  T      +
Sbjct: 18  VSELNHIKSLNPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGAAPRGTFTDKDDV 75

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLL 152
           P+SFD R  +P C  I  ++DQG CGSCWAF +V    DR C+  G++   +  S   ++
Sbjct: 76  PESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVV 132

Query: 153 ACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           +C     GD  C+GG+  + W++    G  T+EC PY   +      C    PT     K
Sbjct: 133 SCDH---GDMACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----K 180

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
           C   +     +   S   Y +  D   +M  +  +GP++V+F V+ DF +Y+SGVY+H  
Sbjct: 181 CADGSSKVHLATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTY 238

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           G + GGHAV+++G+GT DDG DYWI+ N W   WG DGYF++ RG N+C IEE   AG 
Sbjct: 239 GYMEGGHAVEMVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297


>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
           SV=1
          Length = 476

 Score =  166 bits (420), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 116/337 (34%), Positives = 164/337 (48%), Gaps = 43/337 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 90
           ++Q  +I+ VN+    GW A    QF   T+ + FK+ LG  P P  LLL +   T    
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLT 212

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
           K+  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 213 KTTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 202
            +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 271 QNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329

Query: 203 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+EDF
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384

Query: 260 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 307
            +YK+G+Y+HIT              HAVKL GWGT        E +WI AN W +SWG 
Sbjct: 385 FNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGE 444

Query: 308 DGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 344
           +GYF+I RG NE  IE+ ++A          ++TSAD
Sbjct: 445 NGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474


>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
          Length = 303

 Score =  164 bits (415), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 104/287 (36%), Positives = 148/287 (51%), Gaps = 27/287 (9%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 105
           WKA    +F N T  +F+ +L ++P       G L  + + +  +    +P  FD R  +
Sbjct: 31  WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89

Query: 106 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 162
           PQC  +   LDQG CGSCWAF A+    DR C   G++   +S S   L++C   L   G
Sbjct: 90  PQC--VKPALDQGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144

Query: 163 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 222
           CDGG     W +    G  T EC  Y D       G   A P P          QL++  
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 281
            +  +S     S P  IM  +   GP++    VY D ++Y+SGVYKH  G + +G HA++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALE 252

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           ++G+GT+DDG DYWI+ N W   WG +GYF+I RG NEC IE+++ A
Sbjct: 253 IVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299


>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
           SV=3
          Length = 476

 Score =  164 bits (414), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 156/321 (48%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           +++  +I++VN+    GW A    QF   T+   FK  LG + P+P  L +     +   
Sbjct: 155 LVRSELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A       
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V EDF 
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385

Query: 261 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG +
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
           SV=1
          Length = 435

 Score =  149 bits (375), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 96/306 (31%), Positives = 154/306 (50%), Gaps = 30/306 (9%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKS 98
            +K +N   K+ W A R  ++   T+      +G +  P+     +  + H++  +LP S
Sbjct: 149 FVKAINTIQKS-WTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPTS 207

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCG 156
           +D R+     + +S + +Q  CGSC+AF +   L  R  I      +  LS  ++++C  
Sbjct: 208 WDWRNV-RGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQ 266

Query: 157 FLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 215
           +    GC+GG+P + A +Y    G+V E C PY    G   P C+P      C R     
Sbjct: 267 Y--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR----- 311

Query: 216 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ 269
              + +S++Y +  +    +   +  E+ ++GP+ V+F VY+DF HY+ G+Y H      
Sbjct: 312 ---YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDP 368

Query: 270 ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
                +  HAV L+G+GT S  G DYWI+ N W   WG DGYF+I+RG++EC IE   VA
Sbjct: 369 FNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVA 428

Query: 329 GLPSSK 334
             P  K
Sbjct: 429 ATPIPK 434


>sp|Q06544|CYSP3_OSTOS Cathepsin B-like cysteine proteinase 3 (Fragment) OS=Ostertagia
           ostertagi GN=CP-3 PE=3 SV=1
          Length = 174

 Score =  145 bits (366), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 79/175 (45%), Positives = 104/175 (59%), Gaps = 17/175 (9%)

Query: 171 AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 216
           AW+YF   GVVT         C PY +   C   G EP Y        TPKC + C +  
Sbjct: 1   AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59

Query: 217 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 275
            + ++  KH+  SAYR+ ++ + I  +I KNGPV   F VYEDFAHYKSG+YKH  G + 
Sbjct: 60  LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119

Query: 276 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           GGHAVK+IGWG  + G  YW++AN W+  WG  G++++ RG N C IEE V AG+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173


>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
           GN=Tinagl1 PE=1 SV=1
          Length = 466

 Score =  145 bits (366), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 104/324 (32%), Positives = 151/324 (46%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N     GW+A  +  F   T+ +  ++ LG ++P+   + +        +
Sbjct: 140 LVDPDMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQ 198

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+             A PTP+C+
Sbjct: 257 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQ------NEASPTPRCM 309

Query: 210 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +    Q+  N  +    AYR+ SD ++IM E+ +NGPV+    V+
Sbjct: 310 MHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVH 369

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+ G+Y H              G H+VK+ GWG  T  DG    YW  AN W   
Sbjct: 370 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPW 429

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 430 WGERGHFRIVRGTNECDIETFVLG 453


>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
           GN=Tinagl1 PE=2 SV=1
          Length = 467

 Score =  143 bits (360), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 103/324 (31%), Positives = 152/324 (46%), Gaps = 38/324 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++  ++IK +N     GW+A  +  F   T+ +  ++ LG ++P+   + +        +
Sbjct: 140 LVDPAMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQ 198

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+           + A PTP+C+
Sbjct: 257 NLLSCDTHH-QKGCRGGRLDGAWWFLRRRGVVSDNCYPF-----SGREQNDEASPTPRCM 310

Query: 210 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +   +Q+  N  +     YR+ SD ++IM E+ +NGPV+    V+
Sbjct: 311 MHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQALMEVH 370

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+ G+Y H              G H+VK+ GWG  T  DG    YW  AN W   
Sbjct: 371 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPW 430

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG NEC IE  V+ 
Sbjct: 431 WGERGHFRIVRGINECDIETFVLG 454


>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
           GN=TINAGL1 PE=1 SV=1
          Length = 467

 Score =  143 bits (360), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 152/318 (47%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316

Query: 206 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+      N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  Y
Sbjct: 317 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 376

Query: 263 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           K G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGH 436

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG NEC IE  V+ 
Sbjct: 437 FRIVRGVNECDIESFVLG 454


>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
          Length = 463

 Score =  142 bits (358), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 99/317 (31%), Positives = 157/317 (49%), Gaps = 47/317 (14%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQF--------KHLLGVKPTPKGLLLGVPVKTH 89
           + +K +N   K+ W A    ++   T+G          + +   KPTP      +  +  
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPTP------LTAEIQ 225

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LS 147
            K L LP S+D R+     + +S + +Q  CGSC++F +V  L  R  I    + +  LS
Sbjct: 226 QKILHLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILS 284

Query: 148 VNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 206
             ++++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P         
Sbjct: 285 SQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--------- 330

Query: 207 KCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
                C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY++
Sbjct: 331 -----CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQN 385

Query: 265 GVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           G+Y H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF+I+RG+
Sbjct: 386 GIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGT 445

Query: 318 NECGIEEDVVAGLPSSK 334
           +EC IE   VA  P  K
Sbjct: 446 DECAIESIAVAATPIPK 462


>sp|Q5RB02|CATC_PONAB Dipeptidyl peptidase 1 OS=Pongo abelii GN=CTSC PE=2 SV=1
          Length = 463

 Score =  139 bits (350), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYKEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKVL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNI-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
           elegans GN=F26E4.3 PE=1 SV=3
          Length = 452

 Score =  138 bits (347), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 90/252 (35%), Positives = 125/252 (49%), Gaps = 18/252 (7%)

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 148
           K  +LP+ FDAR  W     I  + DQG CGS W+       SDR  I     +N +LS 
Sbjct: 180 KPRELPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSS 237

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
             LL+C       GC+GGY   AW Y    GVV + C PY  S     PG          
Sbjct: 238 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 295

Query: 209 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
            R+ ++     ++S  + ++  Y+++S  EDI  E+  NGPV+ +F V+EDF  Y  GVY
Sbjct: 296 NRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 355

Query: 268 KH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           +H         +    G H+V+++GWG   ++     YW+ AN W   WG DGYFK+ RG
Sbjct: 356 QHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRG 415

Query: 317 SNECGIEEDVVA 328
            N C IE  V+ 
Sbjct: 416 ENHCEIESFVIG 427


>sp|P53634|CATC_HUMAN Dipeptidyl peptidase 1 OS=Homo sapiens GN=CTSC PE=1 SV=2
          Length = 463

 Score =  137 bits (345), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>sp|Q3ZCJ8|CATC_BOVIN Dipeptidyl peptidase 1 OS=Bos taurus GN=CTSC PE=2 SV=1
          Length = 463

 Score =  135 bits (339), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 95/316 (30%), Positives = 154/316 (48%), Gaps = 47/316 (14%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTV--------GQFKHLLGVKPTPKGLLLGVPVKTHD 90
            +K +N   K+ W AA   ++   T+        G  + +   KP P      +  +   
Sbjct: 174 FVKAINAIQKS-WTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPAP------ITAEIQK 226

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 148
           K L LP S+D R+     + ++ + +QG CGSC++F ++  +  R  I      +  LS 
Sbjct: 227 KILHLPTSWDWRNV-HGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSP 285

Query: 149 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 207
            ++++C  +    GC+GG+P + A +Y    G+V E+C PY   TG   P          
Sbjct: 286 QEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP---------- 330

Query: 208 CVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
               C  K   +R  +S+++ +  +    +   +  E+   GP+ V+F VY+DF HY+ G
Sbjct: 331 ----CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKG 386

Query: 266 VYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           VY H           +  HAV L+G+GT +  G DYWI+ N W  SWG +GYF+I+RG++
Sbjct: 387 VYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTD 446

Query: 319 ECGIEEDVVAGLPSSK 334
           EC IE   +A  P  K
Sbjct: 447 ECAIESIALAATPIPK 462


>sp|P80067|CATC_RAT Dipeptidyl peptidase 1 OS=Rattus norvegicus GN=Ctsc PE=1 SV=3
          Length = 462

 Score =  132 bits (332), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 96/323 (29%), Positives = 157/323 (48%), Gaps = 44/323 (13%)

Query: 29  KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV-------GQFKHLLGVKPTPKGLL 81
           +L SH    + +K +N   K+ W A    ++   ++       G    +L  KP P    
Sbjct: 166 RLYSH--NHNFVKAINSVQKS-WTATTYEEYEKLSIRDLIRRSGHSGRILRPKPAP---- 218

Query: 82  LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 141
             +  +   + L LP+S+D R+     + +S + +Q  CGSC++F ++  L  R  I   
Sbjct: 219 --ITDEIQQQILSLPESWDWRNV-RGINFVSPVRNQESCGSCYSFASLGMLEARIRILTN 275

Query: 142 MNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPG 198
            + +  LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY  +       
Sbjct: 276 NSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA----- 328

Query: 199 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
             P  P   C+R        + +S++Y +  +    +   +  E+ K+GP+ V+F V++D
Sbjct: 329 --PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDD 378

Query: 259 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYF 311
           F HY SG+Y H           +  HAV L+G+G     G DYWI+ N W   WG  GYF
Sbjct: 379 FLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYF 438

Query: 312 KIKRGSNECGIEEDVVAGLPSSK 334
           +I+RG++EC IE   +A +P  K
Sbjct: 439 RIRRGTDECAIESIAMAAIPIPK 461


>sp|P97821|CATC_MOUSE Dipeptidyl peptidase 1 OS=Mus musculus GN=Ctsc PE=2 SV=1
          Length = 462

 Score =  127 bits (319), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 92/314 (29%), Positives = 152/314 (48%), Gaps = 42/314 (13%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTV-------GQFKHLLGVKPTPKGLLLGVPVKTHD 90
           + +K +N   K+ W A    ++   ++       G  + +   KP P      +  +   
Sbjct: 173 NFVKAINTVQKS-WTATAYKEYEKMSLRDLIRRSGHSQRIPRPKPAP------MTDEIQQ 225

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 148
           + L LP+S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS 
Sbjct: 226 QILNLPESWDWRNV-QGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSP 284

Query: 149 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 207
            ++++C  +    GCDGG+P + A +Y    GVV E C PY            P  P   
Sbjct: 285 QEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS-------PCKPREN 335

Query: 208 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
           C+R        + +S +Y +  +    +   +  E+ K+GP+ V+F V++DF HY SG+Y
Sbjct: 336 CLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY 387

Query: 268 KH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
            H           +  HAV L+G+G     G +YWI+ N W  +WG  GYF+I+RG++EC
Sbjct: 388 HHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDEC 447

Query: 321 GIEEDVVAGLPSSK 334
            IE   VA +P  K
Sbjct: 448 AIESIAVAAIPIPK 461


>sp|Q26563|CATC_SCHMA Cathepsin C OS=Schistosoma mansoni PE=2 SV=1
          Length = 454

 Score =  118 bits (295), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 98/310 (31%), Positives = 144/310 (46%), Gaps = 40/310 (12%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL- 93
           +  S + ++N + K+ W+    P+ S YT+ + ++  G   +       +  KT  K L 
Sbjct: 154 INPSFVGKINAHQKS-WRGEIYPELSKYTIDELRNRAGGVKSMVTRPSVLNRKTPSKELI 212

Query: 94  ----KLPKSFDARSAWPQCST--ISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLS 145
                LP  FD  S  P  S   ++ I +QG CGSC+A  +  AL  R  +  +F     
Sbjct: 213 SLTGNLPLEFDWTSP-PDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQPI 271

Query: 146 LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           LS   ++ C  +   +GC+GG+P + A +Y    G+  +   PY   TG           
Sbjct: 272 LSPQTVVDCSPY--SEGCNGGFPFLIAGKYGEDFGLPQKIVIPY---TGED--------- 317

Query: 205 TPKCVRKCVKKNQLWRNSKHYS-ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 263
           T KC    V KN     +  YS I  Y   ++ + +  E+  NGP  V F VYEDF  YK
Sbjct: 318 TGKCT---VSKNCTRYYTTDYSYIGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQFYK 374

Query: 264 SGVYKHITGDV---------MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 313
            G+Y H T            +  HAV L+G+G     GE YW + N W   WG  GYF+I
Sbjct: 375 EGIYHHTTVQTDHYNFNPFELTNHAVLLVGYGVDKLSGEPYWKVKNSWGVEWGEQGYFRI 434

Query: 314 KRGSNECGIE 323
            RG++ECG+E
Sbjct: 435 LRGTDECGVE 444


>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
          Length = 362

 Score =  116 bits (291), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 88/278 (31%), Positives = 125/278 (44%), Gaps = 47/278 (16%)

Query: 58  QFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 116
           +FS+ +  +F+   LG   T    L G  +     +  LP++ D    W +   +S + +
Sbjct: 108 RFSDMSWEEFQATRLGAAQTCSATLAGNHLMR--DAAALPETKD----WREDGIVSPVKN 161

Query: 117 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 176
           Q HCGSCW F    AL   +    G N+SLS   L+ C G     GC+GG P  A+ Y  
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221

Query: 177 HHGVV-TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 235
           ++G + TEE  PY    G  H   E                    N+    + +  I  +
Sbjct: 222 YNGGIDTEESYPYKGVNGVCHYKAE--------------------NAAVQVLDSVNITLN 261

Query: 236 PEDIMAEIYKNG-----PVEVSFTVYEDFAHYKSGVYKHITGDVMG------GHAVKLIG 284
            ED +    KN      PV V+F V + F  YKSGVY   T D  G       HAV  +G
Sbjct: 262 AEDEL----KNAVGLVRPVSVAFQVIDGFRQYKSGVY---TSDHCGTTPDDVNHAVLAVG 314

Query: 285 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 322
           +G  ++G  YW++ N W   WG +GYFK++ G N C I
Sbjct: 315 YGV-ENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAI 351


>sp|P22497|CYSP_THEPA Cysteine proteinase OS=Theileria parva GN=TP03_0285 PE=3 SV=2
          Length = 440

 Score =  115 bits (289), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 84/287 (29%), Positives = 132/287 (45%), Gaps = 49/287 (17%)

Query: 58  QFSNYTVGQFKHLLGVKPTPKG-------LLLGVPVKTHDKSLK----------LPKSFD 100
           +FS+ T  +F  L  V   PK        LL  +  KT+ K+LK          L K   
Sbjct: 171 RFSDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDEDVDLAKLTG 230

Query: 101 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCG 160
               W + S+++ + DQ +CG CWAF  V ++   +  HF  +  LSV +LL C  F   
Sbjct: 231 ENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCDSF--S 288

Query: 161 DGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLW 219
           +GC GG   SA+ Y   +G+V+ +  P+ D +  CS P                      
Sbjct: 289 NGCQGGLLESAYEYVRKYGLVSAKDLPFVDKARRCSVP---------------------- 326

Query: 220 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 279
             +K  S+ +Y +    E +M     + P  V  +V  + A YKSGV+    G  +  HA
Sbjct: 327 -KAKKVSVPSYHVFKGKE-VMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECGKSL-NHA 383

Query: 280 VKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKR---GSNECGI 322
           V L+G G  +   + YW++ N W   WG +GY +++R   G+++CG+
Sbjct: 384 VVLVGEGYDEVTKKRYWVVQNSWGTDWGENGYMRLERTNMGTDKCGV 430


>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
           PE=2 SV=1
          Length = 358

 Score =  115 bits (289), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 94/307 (30%), Positives = 138/307 (44%), Gaps = 40/307 (13%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLL 82
           V ++KL   + ++++    + N K   +K + N QF++ T  +F ++ LG        L 
Sbjct: 73  VEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLN-QFADLTWQEFQRYKLGAAQNCSATLK 131

Query: 83  GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 142
           G    T      +P + D    W +   +S + +QGHCGSCW F    AL   +   FG 
Sbjct: 132 GSHKITE---ATVPDTKD----WREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGK 184

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCE- 200
            +SLS   L+ C G     GC GG P  A+ Y  ++G + TEE  PY    G    GC+ 
Sbjct: 185 GISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG----GCKF 240

Query: 201 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
            A      VR  V       +   +++   R                PV V+F V  +F 
Sbjct: 241 SAKNIGVQVRDSVNITLGAEDELKHAVGLVR----------------PVSVAFEVVHEFR 284

Query: 261 HYKSGVYKHIT-----GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
            YK GV+   T      DV   HAV  +G+G  DD   YW++ N W   WG +GYFK++ 
Sbjct: 285 FYKKGVFTSNTCGNTPMDV--NHAVLAVGYGVEDD-VPYWLIKNSWGGEWGDNGYFKMEM 341

Query: 316 GSNECGI 322
           G N CG+
Sbjct: 342 GKNMCGV 348


>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
          Length = 333

 Score =  113 bits (283), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 81/235 (34%), Positives = 110/235 (46%), Gaps = 35/235 (14%)

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 155
           P S D R    + + +S + +QG CGSCW F    AL     I  G  LSL+   L+ C 
Sbjct: 115 PSSMDWRK---KGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCA 171

Query: 156 GFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPYF--DSTGCSHPGCEPAYPTPKCVRKC 212
                 GC GG P  A+ Y +++ G++ E+  PY   DS+   +P    A+     V+  
Sbjct: 172 QAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYIGKDSSCRFNPQKAVAF-----VKNV 226

Query: 213 VKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYK--- 268
           V                  I  + E  M E +    PV  +F V EDF  YKSGVY    
Sbjct: 227 V-----------------NITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKS 269

Query: 269 -HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 322
            H T D +  HAV  +G+G   +G  YWI+ N W   WG +GYF I+RG N CG+
Sbjct: 270 CHKTPDKV-NHAVLAVGYG-EQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGL 322


>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
          Length = 333

 Score =  113 bits (282), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 78/233 (33%), Positives = 106/233 (45%), Gaps = 31/233 (13%)

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 155
           P S D R    + + +S + +QG CGSCW F    AL     I  G  ++L+   L+ C 
Sbjct: 115 PSSMDWRK---KGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCA 171

Query: 156 GFLCGDGCDGGYPISAWRYFVHH-GVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 214
                 GC GG P  A+ Y +++ G++ E+  PY    G      E A    K V     
Sbjct: 172 QNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQCKFNPEKAVAFVKNV----- 226

Query: 215 KNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYK----H 269
                            I  + E  M E +    PV  +F V EDF  YKSGVY     H
Sbjct: 227 ---------------VNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCH 271

Query: 270 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 322
            T D +  HAV  +G+G   +G  YWI+ N W  +WG +GYF I+RG N CG+
Sbjct: 272 KTPDKV-NHAVLAVGYG-EQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGL 322


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score =  112 bits (281), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 83/267 (31%), Positives = 120/267 (44%), Gaps = 36/267 (13%)

Query: 61  NYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 120
           NYT+   K L     + KG+    P       + LPKS D    W     ++ + DQGHC
Sbjct: 127 NYTL--HKQLRAADESFKGVTFISPAH-----VTLPKSVD----WRTKGAVTAVKDQGHC 175

Query: 121 GSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 180
           GSCWAF +  AL  +     G+ +SLS  +L+ C      +GC+GG   +A+RY   +G 
Sbjct: 176 GSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 235

Query: 181 VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDI 239
           +                  E +YP       C   K  +    + ++     I    E  
Sbjct: 236 ID----------------TEKSYPYEAIDDSCHFNKGTVGATDRGFT----DIPQGDEKK 275

Query: 240 MAE-IYKNGPVEVSFTV-YEDFAHYKSGVYKHITGDVMG-GHAVKLIGWGTSDDGEDYWI 296
           MAE +   GPV V+    +E F  Y  GVY     D     H V ++G+GT + GEDYW+
Sbjct: 276 MAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWL 335

Query: 297 LANQWNRSWGADGYFKIKRG-SNECGI 322
           + N W  +WG  G+ K+ R   N+CGI
Sbjct: 336 VKNSWGTTWGDKGFIKMLRNKENQCGI 362


>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
          Length = 358

 Score =  111 bits (277), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 97/304 (31%), Positives = 139/304 (45%), Gaps = 34/304 (11%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLL 82
           V ++KL   I ++++    + N K   +K   N QF++ T  +F+   LG        L 
Sbjct: 73  VEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVN-QFADLTWQEFQRTKLGAAQNCSATLK 131

Query: 83  GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 142
           G    T      LP++ D    W +   +S + DQG CGSCW F    AL   +   FG 
Sbjct: 132 GSHKVTE---AALPETKD----WREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGK 184

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCEP 201
            +SLS   L+ C G     GC+GG P  A+ Y   +G + TE+  PY   TG        
Sbjct: 185 GISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPY---TGKDE----- 236

Query: 202 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
              T K   + V    L  NS + ++ A       +++   +    PV ++F V   F  
Sbjct: 237 ---TCKFSAENVGVQVL--NSVNITLGA------EDELKHAVGLVRPVSIAFEVIHSFRL 285

Query: 262 YKSGVY--KHITGDVMG-GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           YKSGVY   H     M   HAV  +G+G  +DG  YW++ N W   WG  GYFK++ G N
Sbjct: 286 YKSGVYTDSHCGSTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDKGYFKMEMGKN 344

Query: 319 ECGI 322
            CGI
Sbjct: 345 MCGI 348


>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
          Length = 360

 Score =  109 bits (273), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 79/244 (32%), Positives = 112/244 (45%), Gaps = 36/244 (14%)

Query: 87  KTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSL 146
           +    ++ LP++ D    W +   +S + +QGHCGSCW F    AL   +    G  +SL
Sbjct: 135 RMRAAAVALPETKD----WREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISL 190

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-TEECDPYFDSTGCSHPGCEPAYPT 205
           S   L+ C       GC+GG P  A+ Y  ++G + TEE  PY    G            
Sbjct: 191 SEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGI----------- 239

Query: 206 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAHYKS 264
                 C  KN+   N     + +  I    ED + + +    PV V+F V   F  YKS
Sbjct: 240 ------CKFKNE---NVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKS 290

Query: 265 GVYKHITGDVMG------GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           GVY   T D  G       HAV  +G+G  +DG  YW++ N W   WG +GYFK++ G N
Sbjct: 291 GVY---TSDHCGTTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDEGYFKMEMGKN 346

Query: 319 ECGI 322
            CG+
Sbjct: 347 MCGV 350


>sp|Q24940|CATLL_FASHE Cathepsin L-like proteinase OS=Fasciola hepatica GN=Cat-1 PE=1 SV=1
          Length = 326

 Score =  109 bits (272), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 126/279 (45%), Gaps = 34/279 (12%)

Query: 58  QFSNYTVGQFK--HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 115
           QF++ T  +FK  +L  +      L  GVP + +++++  P   D    W +   ++ + 
Sbjct: 71  QFTDMTFEEFKAKYLTEMSRASDILSHGVPYEANNRAV--PDKID----WRESGYVTEVK 124

Query: 116 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 175
           DQG+CGSCWAF     +  ++  +   ++S S   L+ C G    +GC GG   +A++Y 
Sbjct: 125 DQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL 184

Query: 176 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 235
              G+ TE   PY    G                 +C    QL           Y ++S 
Sbjct: 185 KQFGLETESSYPYTAVEG-----------------QCRYNKQL---GVAKVTGYYTVHSG 224

Query: 236 PE-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GDVMGGHAVKLIGWGTSDDGED 293
            E ++   +    P  V+  V  DF  Y+SG+Y+  T   +   HAV  +G+GT   G D
Sbjct: 225 SEVELKNLVGARRPAAVAVDVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGT-QGGTD 283

Query: 294 YWILANQWNRSWGADGYFKIKRGS-NECGIEEDVVAGLP 331
           YWI+ N W   WG  GY ++ R   N CGI    +A LP
Sbjct: 284 YWIVKNSWGTYWGERGYIRMARNRGNMCGIAS--LASLP 320


>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus GN=VCATH
           PE=3 SV=1
          Length = 337

 Score =  108 bits (270), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 75/249 (30%), Positives = 111/249 (44%), Gaps = 34/249 (13%)

Query: 82  LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 141
           L  P   HD+   LP++FD    W   + ++ + DQG CGSCWA  AV  L   + I   
Sbjct: 116 LDAPPDVHDE---LPQNFD----WRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHN 168

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD-PYFDSTG-CSHPGC 199
             ++LS   L+ C        CDGG   +A+   ++ G + EE D PY  + G C     
Sbjct: 169 YLINLSEQQLIDCDS--ANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTKGVCKIDNK 226

Query: 200 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
           + A     C R                     I  + E++  E+   GP+ ++       
Sbjct: 227 KFALSVSSCKR--------------------YIFQNEENLKKELITMGPIAMAIDA-ASI 265

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
           + Y  G+  H   ++   HAV L+G+GT + G  YW L N W   WG DGYF++KR  N 
Sbjct: 266 STYSKGII-HFCENLGLNHAVLLVGYGT-EGGVSYWTLKNSWGSDWGEDGYFRVKRNINA 323

Query: 320 CGIEEDVVA 328
           CG+   + A
Sbjct: 324 CGLNNQLAA 332


>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
          Length = 328

 Score =  108 bits (270), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 96/337 (28%), Positives = 152/337 (45%), Gaps = 42/337 (12%)

Query: 21  AEGVVSKLKLDSHILQDSI--IKEVNENPKAGWKAARNPQFSNYTVGQFKHL-LGVKPTP 77
           + G++++     +I +D++  I   NEN K          F+N T  +++ L LG +  P
Sbjct: 18  SNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEP 77

Query: 78  KGLLLGVPVKTHDKSLKLPKSFDARSA-----WPQCSTISRILDQGHCGSCWAFGAVEAL 132
              +     K  + ++K   + +         W Q   ++ I DQG CGSCWAF    A+
Sbjct: 78  VRRI----TKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAV 133

Query: 133 SDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD-PYFDS 191
                I  G  +SLS  +L+ C       GC+GG    A+++ + +G +  E D PY  +
Sbjct: 134 EGINKIVTGELVSLSEQELVDCDKSY-NQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGT 192

Query: 192 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR-INSDPEDIMAEIYKNGPVE 250
            G             KC       N L +NS+  +I  Y  + S  E  +       PV 
Sbjct: 193 NG-------------KC-------NSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVS 232

Query: 251 VSFTV-YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
           V+       F HY+SG++    G  M  HAV  +G+G S++G DYWI+ N W   WG DG
Sbjct: 233 VAIDAGGRAFQHYQSGIFTGKCGTNM-DHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDG 290

Query: 310 YFKIKRG----SNECGIEEDVVAGLPSSKNLVKEITS 342
           Y +++R     S +CGI  +    +  S N V+  +S
Sbjct: 291 YIRMERNVASKSGKCGIAIEASYPVKYSPNPVRGTSS 327


>sp|Q9PYY5|CATV_GVXN Viral cathepsin OS=Xestia c-nigrum granulosis virus GN=VCATH PE=3
           SV=1
          Length = 346

 Score =  107 bits (267), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 78/238 (32%), Positives = 117/238 (49%), Gaps = 28/238 (11%)

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
            V + D S K+P SFD    W   ++++ +  Q  CGSCWAF AV  +   + I   ++L
Sbjct: 123 TVISGDSSGKVPDSFD----WRDRNSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVSL 178

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
            LS   L+ C      +GC+GG  + +W +    G++         + G S+   E  YP
Sbjct: 179 DLSEQQLVDCDK--VNNGCNGG--LMSWAF---EGIIR--------AGGISY---EAPYP 220

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
                  C    +  + S  Y   AY + S+ + +   +++ GPV V+  V  D  +YKS
Sbjct: 221 YTGVDGVCKNTTRYVQLSGCY---AYDLRSEKK-LRQVLHEKGPVSVAIDVV-DLTNYKS 275

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 322
           GV KH + D    H V L+G+G  +D + YW L N W   WG  G+F+IKR  N CGI
Sbjct: 276 GVAKHCSVDHGLNHGVLLVGYGQENDVK-YWTLKNSWGSDWGEQGFFRIKRDVNSCGI 332


>sp|Q9R1T3|CATZ_RAT Cathepsin Z OS=Rattus norvegicus GN=Ctsz PE=1 SV=2
          Length = 306

 Score =  106 bits (265), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 76/255 (29%), Positives = 113/255 (44%), Gaps = 32/255 (12%)

Query: 95  LPKSFDARSAWPQCSTISRILDQ---GHCGSCWAFGAVEALSDRFCIHFGM---NLSLSV 148
           LPK++D R+     +  S   +Q    +CGSCWA G+  AL+DR  I       +  LSV
Sbjct: 64  LPKNWDWRNV-NGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPSTLLSV 122

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
            +++ C        C+GG  +  W Y   HG+  E C+ Y          C+       C
Sbjct: 123 QNVIDCGN---AGSCEGGNDLPVWEYAHKHGIPDETCNNY----QAKDQECDKFNQCGTC 175

Query: 209 V--RKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
              ++C  ++   LWR   + S+S        E +MAEIY NGP+       E  ++Y  
Sbjct: 176 TEFKECHTIQNYTLWRVGDYGSLSG------REKMMAEIYANGPISCGIMATERMSNYTG 229

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           G+Y       +  H + + GWG S+DG +YWI+ N W   WG  G+ +I        +  
Sbjct: 230 GIYTEYQNQAIINHIISVAGWGVSNDGIEYWIVRNSWGEPWGERGWMRI--------VTS 281

Query: 325 DVVAGLPSSKNLVKE 339
               G  SS NL  E
Sbjct: 282 TYKGGTGSSYNLAIE 296


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.320    0.137    0.449 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 148,977,252
Number of Sequences: 539616
Number of extensions: 6775746
Number of successful extensions: 13022
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 210
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 12348
Number of HSP's gapped (non-prelim): 280
length of query: 351
length of database: 191,569,459
effective HSP length: 118
effective length of query: 233
effective length of database: 127,894,771
effective search space: 29799481643
effective search space used: 29799481643
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (28.5 bits)