BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy15353
         (344 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
           GN=cpr-5 PE=2 SV=1
          Length = 344

 Score =  227 bits (579), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 125/321 (38%), Positives = 169/321 (52%), Gaps = 15/321 (4%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP-GDRKTYDPEYSA 82
           A ID +N     WTAG      + +E + + L+ D KY       +P  D      E S 
Sbjct: 31  ALIDYVNSAQKLWTAGHQV---IPKEKITKKLM-DVKYL------VPHKDEDIVATEVSD 80

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +PD FDAR+QWPNC +I ++ D   C +   FAA  A SDR CI S G  N  LS+E +
Sbjct: 81  AIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDL 140

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC    +     C  G   + W +  K G VTGG Y  + GC+P +I+PC    +   
Sbjct: 141 LSCCT-GMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVK 199

Query: 203 LPSCENQKVPKLKCHTRCTNP-TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
            P+C     P  KC   CT+   Y   + QDKH  +  Y V    + I+ EIL +GP   
Sbjct: 200 WPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEV 259

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
            F +Y+DFY Y +GVY HT+ A L    H+ K++GWG +NGTPYWLV N+W   WG++G 
Sbjct: 260 AFTVYEDFYQYTTGVYVHTAGASLGG--HAVKILGWGVDNGTPYWLVANSWNVAWGEKGY 317

Query: 322 VKILRGKYECAFEYLIAAGKP 342
            +I+RG  EC  E+   AG P
Sbjct: 318 FRIIRGLNECGIEHSAVAGIP 338


>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
           GN=cpr-4 PE=2 SV=1
          Length = 335

 Score =  221 bits (563), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 128/343 (37%), Positives = 180/343 (52%), Gaps = 16/343 (4%)

Query: 4   ILVFLLGCT--LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           IL  L+  T  LV   + K  +A  + +N + + W A    P +++ E +++ L+     
Sbjct: 5   ILAALVAVTAGLVIPLVPKTQEAITEYVNSKQSLWKA--EIPKDITIEQVKKRLMRT--- 59

Query: 62  FDQSDRPLPGDRKTYDPEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
             +   P   D +    + +  T+P  FDAR QWPNC +I ++ D   C +   FAA  A
Sbjct: 60  --EFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEA 117

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
            SDR CI S G  N  LS E V SCC  C Y     C  G     W +L K G  TGG Y
Sbjct: 118 ASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCEGGYPINAWKYLVKSGFCTGGSY 173

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
             + GC+P +++PC       T PSC +       C  +CTN  Y   +  DKH  +  Y
Sbjct: 174 EAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAY 233

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
            V      I+ EI+AHGP  A F +Y+DFY YK+GVY HT+  +L    H+ +++GWGT+
Sbjct: 234 AVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGG--HAIRILGWGTD 291

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           NGTPYWLV N+W  +WG+ G  +I+RG  EC  E+ +  G PK
Sbjct: 292 NGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334


>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
          Length = 335

 Score =  221 bits (562), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 130/344 (37%), Positives = 176/344 (51%), Gaps = 19/344 (5%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L  L+  T  R  L+    SD  ++ IN++  TWTAG NF  N+   Y+++     
Sbjct: 4   LLATLSCLVLLTSARESLHFQPLSDELVNFINKQNTTWTAGHNF-YNVDLSYVKKLC--- 59

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
             +      P    R  +  +    +P  FDAREQWPNC TI  + D G+C +   F AV
Sbjct: 60  GTFLGGPKLP---QRAAFAAD--MILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI+S G+ N  +S E + +CC     +    C+ G     WNF  K+G V+GG
Sbjct: 115 EAISDRICIRSNGRVNVEVSAEDMLTCCGD---ECGDGCNGGFPSGAWNFWTKKGLVSGG 171

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y    GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH    
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYTPSYKEDKHFGCS 228

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y +  NE  I  EI  +GP    F +Y DF  YKSGVY+H +   +    H+ +++GWG
Sbjct: 229 SYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGG--HAIRILGWG 286

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 287 VENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 330


>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
          Length = 339

 Score =  220 bits (561), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 126/325 (38%), Positives = 170/325 (52%), Gaps = 17/325 (5%)

Query: 19  YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
           +  SD  I+ IN++  TW AGRNF  N+   YL++            + P   +R  +  
Sbjct: 24  HPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLC---GTVLGGPNLP---ERVGFSE 76

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P+ FDAREQW NC TI  + D G+C +   F AV A SDR CI + G+ N  +S
Sbjct: 77  DIN--LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVS 134

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            E + +CC I   D    C+ G     WNF  ++G V+GG Y    GC P TI PC HH 
Sbjct: 135 AEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHV 191

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P       PK  C+  C    Y   + +DKH    +Y V D+E  I  EI  +GP
Sbjct: 192 NGSRPPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGP 248

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F ++ DF  YKSGVYKH +   +    H+ +++GWG ENG PYWLV N+W   WGD
Sbjct: 249 VEGAFTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNVDWGD 306

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G  KILRG+  C  E  I AG P+
Sbjct: 307 NGFFKILRGENHCGIESEIVAGIPR 331


>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
          Length = 339

 Score =  219 bits (558), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 125/338 (36%), Positives = 173/338 (51%), Gaps = 17/338 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           +  LG    R   +  SD  ++ +N++  TW AG NF  N+   YL++       +    
Sbjct: 11  LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P+ FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
           CI +    +  +S E + +CC I   D    C+ G     WNF  ++G V+GG Y    G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
           C+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
           E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
          Length = 339

 Score =  215 bits (547), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R   +  SD  ++ +N+   TW AG NF  N+   YL++       +    
Sbjct: 11  LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P  FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
           CI +    +  +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
           GC+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
           +E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           YWLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
          Length = 339

 Score =  215 bits (547), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 122/330 (36%), Positives = 169/330 (51%), Gaps = 19/330 (5%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
           R   +  SD  ++ +N+   TW AG NF  N+   YL++       +      P P  R 
Sbjct: 20  RPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDVSYLKKLC---GTFLGG---PKPPQRV 72

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  +    +P+ FDAREQWP C TI  + D G+C +   F AV A SDR CI +    +
Sbjct: 73  MFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVS 130

Query: 135 RPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
             +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    GC+P +I P
Sbjct: 131 VEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPP 186

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C HH +    P       PK  C   C  P Y   + QDKH    +Y V ++E  I  EI
Sbjct: 187 CEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSERDIMAEI 243

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
             +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W 
Sbjct: 244 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWN 301

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             WGD G  KILRG+  C  E  + AG P+
Sbjct: 302 TDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
          Length = 335

 Score =  215 bits (547), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 132/348 (37%), Positives = 175/348 (50%), Gaps = 27/348 (7%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L  LL  T  R  LY    SD  ++ +N++  TW AG NF  N+   Y+++   A 
Sbjct: 4   LLATLSCLLVLTSARSSLYFPPLSDELVNFVNKQNTTWKAGHNF-YNVDLSYVKKLCGAI 62

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
                     L G +      ++A V  P+ FDAREQWPNC TI  + D G+C +   F 
Sbjct: 63  ----------LGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFG 112

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT--WNFLHKRGS 174
           AV A SDR CI S G+ N  +S E +     +              F +  WNF  K+G 
Sbjct: 113 AVEAISDRICIHSNGRVNVEVSAEDM-----LTCCGGECGDGCNGGFPSGAWNFWTKKGL 167

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           V+GG Y    GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH
Sbjct: 168 VSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKTC-EPGYSPSYKEDKH 224

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V +NE  I  EI  +GP    F++Y DF  YKSGVY+H S   +    H+ ++
Sbjct: 225 FGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGG--HAIRI 282

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +GWG ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 283 LGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330


>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
          Length = 339

 Score =  214 bits (546), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 126/329 (38%), Positives = 171/329 (51%), Gaps = 23/329 (6%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
            +  SD  I+ IN++  TW AGRNF  N+   YL++    ++   K        LPG R 
Sbjct: 23  FHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-RV 72

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  +    +P+ FDAREQW NC TIG + D G+C +   F AV A SDR CI + G+ N
Sbjct: 73  AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CC I   D    C+ G     W+F  K+G V+GG Y    GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPC 187

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +    P       P  +C+  C    Y   + +DKH    +Y V ++   I  EI 
Sbjct: 188 EHHVNGSRPPCTGEGDTP--RCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F ++ DF  YKSGVYKH +   +    H+ +++GWG ENG PYWL  N+W  
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILGWGVENGVPYWLAANSWNL 302

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD G  KILRG+  C  E  I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331


>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
           SV=1
          Length = 340

 Score =  207 bits (528), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 120/348 (34%), Positives = 180/348 (51%), Gaps = 24/348 (6%)

Query: 1   MIHILVFLLGCTLVRGELYK-FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIAD 58
           +  ++ FL     V+ E ++  SD  I  IN   N  W A         E+  R   + D
Sbjct: 8   IASLITFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRA---------EKSNRFHSLDD 58

Query: 59  AKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
           A+    + R  P  R+   P     +++  +P  FD+R++WP C +I  + D   C +  
Sbjct: 59  ARIQMGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCW 118

Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
            F AV A SDR CI+S G+QN  LS   + +CC+ C       C  G +   W++  K G
Sbjct: 119 SFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCESC----GLGCEGGILGPAWDYWVKEG 174

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            VT     + TGC+P     C HH +    P C ++     +C   C    Y   + QDK
Sbjct: 175 IVTASSKENHTGCEPYPFPKCEHH-TKGKYPPCGSKIYNTPRCKQTCQR-KYKTPYTQDK 232

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           HR   +Y V ++E AI+KEI+ +GP  A+F +Y+DF +YKSG+YKH +   L    H+ +
Sbjct: 233 HRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGG--HAIR 290

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
           +IGWG EN TPYWL+ N+W   WG+ G  +I+RG+ EC+ E  + AG+
Sbjct: 291 IIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVIAGR 338


>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
           GN=cpr-6 PE=1 SV=1
          Length = 379

 Score =  206 bits (524), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 115/323 (35%), Positives = 160/323 (49%), Gaps = 10/323 (3%)

Query: 23  DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           D  ID +N   N WTA   R F +   E    ++ +    +   S +      KT D + 
Sbjct: 44  DDLIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 103

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FD+R+ WP C +I  + D  +C +   F AV A SDR CI S G+    LS +
Sbjct: 104 D--IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSAD 161

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCCK C +     C+ G     W +  K G VTG +Y    GC+P    PC HH   
Sbjct: 162 DLLSCCKSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKK 217

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                C +   P  KC  +C +    + + +DK      Y V D+ +AI+KE++ HGP  
Sbjct: 218 THFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLE 277

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y+DF +Y  GVY HT   KL    H+ KLIGWG ++G PYW V N+W   WG+ G
Sbjct: 278 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIDDGIPYWTVANSWNTDWGEDG 335

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             +ILRG  EC  E  +  G PK
Sbjct: 336 FFRILRGVDECGIESGVVGGIPK 358


>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
          Length = 340

 Score =  200 bits (508), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 114/323 (35%), Positives = 161/323 (49%), Gaps = 16/323 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            S   ++ IN+   T  AG NF  N    Y+++       +      P     +  D   
Sbjct: 26  LSSDLVNHINKLNTTGRAGHNF-HNTDMSYVKKLC---GTFLGGPKAP-----ERVDFAE 76

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +PD FD R+QWPNC TI  + D G+C +   F AV A SDR C+ +  + +  +S E
Sbjct: 77  DMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAE 136

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC    ++    C+ G     W +  +RG V+GG Y    GC+  TI PC HH + 
Sbjct: 137 DLLSCCG---FECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHVNG 193

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
            + P C  +     +C   C  P Y   + +DKH    +Y V  +E  I  EI  +GP  
Sbjct: 194 -SRPPCTGEGGETPRCSRHC-EPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVE 251

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y+DF  YKSGVY+H S  ++    H+ +++GWG ENGTPYWL  N+W   WG  G
Sbjct: 252 GAFIVYEDFLMYKSGVYQHVSGEQVGG--HAIRILGWGVENGTPYWLAANSWNTDWGITG 309

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             KILRG+  C  E  I AG P+
Sbjct: 310 FFKILRGEDHCGIESEIVAGVPR 332


>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
           GN=CP-1 PE=3 SV=3
          Length = 341

 Score =  197 bits (502), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 107/291 (36%), Positives = 155/291 (53%), Gaps = 10/291 (3%)

Query: 50  YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
           Y +Q L+ D KY DQ++ P          E +  +P+ +D R QW NC ++ H+PD   C
Sbjct: 58  YFKQRLM-DLKYIDQNNIPDEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIPDQANC 116

Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
            +    ++  A SDR CI SKG +   +S + V SCC  C       C  G     + F 
Sbjct: 117 GSCWAVSSAAAMSDRICIASKGAKQVLISAQDVVSCCTWC----GDGCEGGWPISAFRFH 172

Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF 229
              G VTGGDY  +  C+P  I PC HHG+      C        +C  RC    Y + +
Sbjct: 173 ADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGECVGM-ADTPRCKRRCL-LGYPKSY 230

Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
             D++     Y + ++  AI+K+I+ +GP  AT+ +Y+DF HY+SG+YKH +  K    L
Sbjct: 231 PSDRYYKK-AYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTG--L 287

Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           H+ K+IGWG E GTPYW+V N+W   WG+ G  ++ RG  +C FE  +AAG
Sbjct: 288 HAVKVIGWGEEKGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 338


>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
           GN=CATB PE=2 SV=1
          Length = 342

 Score =  196 bits (497), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 98/265 (36%), Positives = 149/265 (56%), Gaps = 8/265 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S G Q+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCCK C       C  G     W++  KRG VTGG   + TGCQP     C HH 
Sbjct: 145 ALDLISCCKDC----GDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C   C    Y   + QDKH    +Y V +NE  I+++I+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGP 258

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E  TPYWL+ N+W   WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKRTPYWLIANSWNEDWGE 316

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
           +G  +++RG+ EC+ E  + AG  K
Sbjct: 317 KGLFRMVRGRDECSIESDVVAGLIK 341


>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
           PE=1 SV=2
          Length = 329

 Score =  195 bits (495), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 117/321 (36%), Positives = 163/321 (50%), Gaps = 23/321 (7%)

Query: 23  DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
            A +D +N   + +   +     ++EE ++ F + D KY       +   R T      A
Sbjct: 31  QALVDYVNSAQSLF---KTEHVEITEEEMK-FKLMDGKYAAAHSDEI---RATEQEVVLA 83

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +VP  FD+R QW  C +I  + D   C +   F A    SDR CI++KG Q   +S + +
Sbjct: 84  SVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDL 143

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC          C  G   +   +   +G VTGGDY    GC+P  I+PC       T
Sbjct: 144 LSCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------T 192

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
             +C   K P   C   C +  Y   + +DKH     Y V  N  +I+ EI A+GP  A 
Sbjct: 193 SGNCPESKTPS--CSMSCQS-GYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAA 249

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F++Y+DFY YKSGVYKHT+   L    H+ K+IGWGTE+G+PYWLV N+WG +WG+ G  
Sbjct: 250 FSVYEDFYKYKSGVYKHTAGKYLGG--HAIKIIGWGTESGSPYWLVANSWGVNWGESGFF 307

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KI RG  +C  E  + AGK K
Sbjct: 308 KIYRGDDQCGIESAVVAGKAK 328


>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
           GN=cpr-3 PE=2 SV=1
          Length = 370

 Score =  194 bits (492), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 119/352 (33%), Positives = 171/352 (48%), Gaps = 33/352 (9%)

Query: 4   ILVFLLGCT-LVRGELYKFS------DAYIDQINREANTWTAGRNFPANLSEEYLRQFLI 56
           + +FL GC+  V  E+   +         +D +N    +W A  N  +    E+  +F +
Sbjct: 7   LALFLAGCSAFVLDEIRGINIGQSPQKVLVDHVNTVQTSWVAEHNEIS----EFEMKFKV 62

Query: 57  ADAKYFD--QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
            D K+ +  + D  +  +           +PD FDARE+WP+C TI  + +   C +   
Sbjct: 63  MDVKFAEPLEKDSDVASELFVRGEIVPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWA 122

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRG 173
           F A    SDR CI+S G Q   +S E + SCC   C Y     C  G       F    G
Sbjct: 123 FGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGY----GCKGGYSIEALRFWASSG 178

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
           +VTGGDYG   GC P + +PC+ +    T PSC+          T C +      + +DK
Sbjct: 179 AVTGGDYGGH-GCMPYSFAPCTKNCPESTTPSCK----------TTCQSSYKTEEYKKDK 227

Query: 234 HRTTLTYWVDDNEDA--IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
           H     Y V   +    I+ EI  +GP  A++ +Y+DFYHYKSGVY +TS   +    H+
Sbjct: 228 HYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGG--HA 285

Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            K+IGWG ENG  YWL+ N+WG  +G++G  KI RG  EC  E  + AG  K
Sbjct: 286 VKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAK 337


>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
           GN=AC-1 PE=2 SV=1
          Length = 342

 Score =  178 bits (452), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 145/285 (50%), Gaps = 12/285 (4%)

Query: 56  IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
           I D KY  Q    +  +    DP+    +P  +D R+ W NC T  ++ D   C +    
Sbjct: 63  IMDIKYKHQKLNLMVKE----DPDPEVDIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAV 117

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           +   A SDR CI SK ++   +S   + +CC   R      C  G     W +    G V
Sbjct: 118 STAAAISDRICIASKAEKQVNISATDIMTCC---RPQCGDGCEGGWPIEAWKYFIYDGVV 174

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           +GG+Y  +  C+P  I PC HHG+      C     P   C  +C  P   + +  DK  
Sbjct: 175 SGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGT-APTPPCKRKC-RPGVRKMYRIDKRY 232

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
               Y V  +  AI+ EIL +GP  A+FA+Y+DF HYKSG+YKHT+  +L  Y H+ K+I
Sbjct: 233 GKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKMI 290

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           GWG EN T +WL+ N+W   WG++G  +I+RG  +C  E  IAAG
Sbjct: 291 GWGNENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAG 335


>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
           GN=AC-2 PE=2 SV=1
          Length = 342

 Score =  178 bits (451), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 97/264 (36%), Positives = 138/264 (52%), Gaps = 8/264 (3%)

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
           DP+    +P  +D R+ W NC T  ++ D   C +    +   A SDR CI SK ++   
Sbjct: 80  DPDPEVDIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAVSTAAAISDRICIASKAEKQVN 138

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           +S   + +CC   R      C  G     W +    G V+GG+Y  +  C+P  I PC H
Sbjct: 139 ISATDIMTCC---RPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGH 195

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
           HG+      C     P   C  +C  P   + +  DK      Y V  +  AI+ EIL +
Sbjct: 196 HGNDTYYGECRGT-APTPPCKRKC-RPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKN 253

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
           GP  A+FA+Y+DF HYKSG+YKHT+  +L  Y H+ K+IGWG EN T +WL+ N+W   W
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKMIGWGNENNTDFWLIANSWHNDW 311

Query: 317 GDRGTVKILRGKYECAFEYLIAAG 340
           G++G  +I+RG  +C  E  IAAG
Sbjct: 312 GEKGYFRIVRGSNDCGIEGTIAAG 335


>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
          Length = 311

 Score =  139 bits (351), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 93/269 (34%), Positives = 129/269 (47%), Gaps = 27/269 (10%)

Query: 74  KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           K+YDP     +P  F+A+  WPNC TI  + +   C +   F A  + +DR CI +   +
Sbjct: 70  KSYDP-LGVQIPTSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNN--E 126

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  LS   + +C      + +  C  G  F  WN+L K+G+V+         C P TI  
Sbjct: 127 NVQLSFMDMVTC-----DETDNGCEGGDAFSAWNWLRKQGAVS-------EECLPYTIPT 174

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C      P    C N  V    C   C + +    + QDKH+    Y  D +E AI +EI
Sbjct: 175 C-----PPAQQPCLN-FVNTPSCTKECQSNS-SLIYSQDKHKMAKIYSFDSDE-AIMQEI 226

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
           + +GP  A F +++DF  YKSGVY HT+   L    H  KL+G+GT NG  Y+   N W 
Sbjct: 227 VTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGG--HCVKLVGFGTLNGVDYYAANNQWT 284

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
             WGD GT  I RG  +C     + AG P
Sbjct: 285 TSWGDNGTFLIKRG--DCGISDDVVAGLP 311


>sp|Q06544|CYSP3_OSTOS Cathepsin B-like cysteine proteinase 3 (Fragment) OS=Ostertagia
           ostertagi GN=CP-3 PE=3 SV=1
          Length = 174

 Score =  138 bits (347), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 69/177 (38%), Positives = 99/177 (55%), Gaps = 6/177 (3%)

Query: 165 TWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTRCTNP 223
            W +    G VTGG+Y  +  C+P    PC  HG  P    C +  K PK  C   C   
Sbjct: 1   AWQYFALEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDTAKTPK--CQKTCQR- 57

Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
            Y + + +DKH     Y + +N  AI+++I+ +GP  A F +Y+DF HYKSG+YKHT+  
Sbjct: 58  GYLKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGR 117

Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
                 H+ K+IGWG E GTPYWL+ N+W   WG++G  +++RG   C  E ++ AG
Sbjct: 118 MTGG--HAVKIIGWGKEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAG 172


>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
           SV=3
          Length = 476

 Score =  125 bits (314), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 152/352 (43%), Gaps = 64/352 (18%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRSEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y      
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
                GC  ++ S     G       C N  V K     +C+ P                
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
           Y V  NE  I KEI+ +GP  A   + +DF+HYK+G+Y+H  ++N + E Y     H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
          Length = 303

 Score =  120 bits (301), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 96/327 (29%), Positives = 148/327 (45%), Gaps = 53/327 (16%)

Query: 22  SDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           S A + +I      W AG  + F  N++E+  R  LI   +   +S   LP    T   E
Sbjct: 17  SRAELRRIQALNPPWKAGMPKRF-ENVTEDEFRSMLIRPDRLRARSGS-LPPISITEVQE 74

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P +FD R+++P C  +    D G+C +   F+A+G F DRRC     ++    S 
Sbjct: 75  LVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQ 132

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG-----DYGDRTGCQPSTISPC 194
           +++ SC       +N  C  G    TW+FL   G+ T       DYG             
Sbjct: 133 QHLISCSL-----ENFGCDGGDFQPTWSFLTFTGATTAECVKYVDYG------------- 174

Query: 195 SHHGSAPTLPSCENQKVPKL-KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
            H  ++P    C++    +L K H       YG+              V  +  AI   +
Sbjct: 175 -HTVASPCPAVCDDGSPIQLYKAHG------YGQ--------------VSKSVPAIMGML 213

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
           +A GP      +Y D  +Y+SGVYKHT    +    H+ +++G+GT ++GT YW++ N+W
Sbjct: 214 VAGGPLQTMIVVYADLSYYESGVYKHTYGT-INLGFHALEIVGYGTTDDGTDYWIIKNSW 272

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA 339
           GP WG+ G  +I+RG  EC  E  I A
Sbjct: 273 GPDWGENGYFRIVRGVNECRIEDEIYA 299


>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
           GN=Tinagl1 PE=1 SV=1
          Length = 466

 Score =  119 bits (299), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 86/269 (31%), Positives = 121/269 (44%), Gaps = 32/269 (11%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 257

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--GDRTGCQPSTISPCSHHGS 199
           + SC         + C  G +   W FL +RG V+   Y    R   + S    C  H  
Sbjct: 258 LLSC----DTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASPTPRCMMHSR 313

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
           A            K +  +RC N   G+    D ++ T  Y +  +E  I KE++ +GP 
Sbjct: 314 A--------MGRGKRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPV 362

Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLV 308
            A   +++DF+ Y+ G+Y HT  S  + E Y     HS K+ GWG E         YW  
Sbjct: 363 QALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTA 422

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
            N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 423 ANSWGPWWGERGHFRIVRGTNECDIETFV 451


>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
           GN=Tinagl1 PE=2 SV=1
          Length = 467

 Score =  119 bits (299), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 86/270 (31%), Positives = 119/270 (44%), Gaps = 33/270 (12%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 257

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHG 198
           + SC         K C  G +   W FL +RG V+   Y   G     + S    C  H 
Sbjct: 258 LLSC----DTHHQKGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNDEASPTPRCMMHS 313

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
            A            K +  +RC N         D ++ T  Y +  +E  I KE++ +GP
Sbjct: 314 RA--------MGRGKRQATSRCPNSQVDS---NDIYQVTPVYRLASDEKEIMKELMENGP 362

Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWL 307
             A   +++DF+ Y+ G+Y HT  S  + E Y     HS K+ GWG E         YW 
Sbjct: 363 VQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 422

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLI 337
             N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 423 AANSWGPWWGERGHFRIVRGINECDIETFV 452


>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
          Length = 299

 Score =  119 bits (299), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 80/265 (30%), Positives = 123/265 (46%), Gaps = 37/265 (13%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +   PD FD RE++P+C  I  V D G C +   F++V +  DRRC     ++    S +
Sbjct: 71  ATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVKYSPQ 128

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
           YV SC +      + +C  G +   W FL K G+ T         C P         G+ 
Sbjct: 129 YVVSCDR-----GDMACDGGWLPSVWRFLTKTGTTT-------DECVPYQSGSTGARGTC 176

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           PT    +   +P L   T+  +  YG                  +  AI K +   GP  
Sbjct: 177 PT-KCADGSDLPHLYKATKAVD--YGL-----------------DAPAIMKALATGGPLQ 216

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDR 319
             F +Y DF +Y+SGVY+HT   ++E   H+  ++G+GT++ G  YW++ N+WGP WG+ 
Sbjct: 217 TAFTVYSDFMYYESGVYQHT-YGRVEGG-HAVDMVGYGTDDDGVDYWIIKNSWGPDWGED 274

Query: 320 GTVKILRGKYECAFEYLIAAGKPKN 344
           G  +I+R   EC  E  +  G  +N
Sbjct: 275 GYFRIIRMTNECGIEEQVIGGFFEN 299


>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
          Length = 300

 Score =  118 bits (296), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 79/260 (30%), Positives = 123/260 (47%), Gaps = 41/260 (15%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           VP+ FD RE++P+C  I  V D G C +   F++V  F DRRC+    ++    S +YV 
Sbjct: 75  VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC        + +C+ G +   W FL K G+ T         C P      +  G+ PT 
Sbjct: 133 SCDH-----GDMACNGGWLPNVWKFLTKTGTTT-------DECVPYKSGSTTLRGTCPTK 180

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED--AIKKEILAHGPTTA 261
            +  + KV                      H  T T + D   D  A+ K +   GP   
Sbjct: 181 CADGSSKV----------------------HLATATSYKDYGLDIPAMMKALSTSGPLQV 218

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRG 320
            F ++ DF +Y+SGVY+HT         H+ +++G+GT++ G  YW++ N+WGP WG+ G
Sbjct: 219 AFLVHSDFMYYESGVYQHTYGYMEGG--HAVEMVGYGTDDDGVDYWIIKNSWGPDWGEDG 276

Query: 321 TVKILRGKYECAFEYLIAAG 340
             +++RG  +C+ E    AG
Sbjct: 277 YFRMIRGINDCSIEEQAYAG 296


>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
           GN=TINAGL1 PE=1 SV=1
          Length = 467

 Score =  118 bits (295), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 310

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +    C N         D ++ T  Y +  N+  I KE++ 
Sbjct: 311 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 359

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 419

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452


>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
           elegans GN=F26E4.3 PE=1 SV=3
          Length = 452

 Score =  117 bits (292), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 82/266 (30%), Positives = 129/266 (48%), Gaps = 36/266 (13%)

Query: 84  VPDRFDAREQWPNCGTIGH-VPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ FDAR++W   G + H V D G C +    +     SDR  I S+G+ N  LS++ +
Sbjct: 184 LPEHFDARDKW---GPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQL 240

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SC         K C  G + R W ++ K G V  GD+     C P  +S  S       
Sbjct: 241 LSC----NQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYP-YVSGQSREPGHCL 288

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
           +P  +      L+C +   + T          + T  Y V   E+ I+ E++ +GP  AT
Sbjct: 289 IPKRDYTNRQGLRCPSGSQDST--------AFKMTPPYKVSSREEDIQTELMTNGPVQAT 340

Query: 263 FALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINT 311
           F +++DF+ Y  GVY+H+       +++  E Y HS +++GWG ++ T     YWL  N+
Sbjct: 341 FVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGY-HSVRVLGWGVDHSTGKPIKYWLCANS 399

Query: 312 WGPHWGDRGTVKILRGKYECAFEYLI 337
           WG  WG+ G  K+LRG+  C  E  +
Sbjct: 400 WGTQWGEDGYFKVLRGENHCEIESFV 425


>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
           SV=1
          Length = 476

 Score =  117 bits (292), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 88/276 (31%), Positives = 126/276 (45%), Gaps = 45/276 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR  I+S+G+    LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
            SCC   R+     C+ GSV R W +L KRG V+   Y           GC  ++ S   
Sbjct: 274 ISCCAKKRH----GCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRS--D 327

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N  + K     +C+ P                Y V  NE  I +EI+ 
Sbjct: 328 GRGKRHATTPCPN-SIEKSNRIYQCSPP----------------YRVSSNETEIMREIMQ 370

Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
           +GP  A   +++DF++YK+G+Y+H  ++N   E Y     H+ KL GWGT  G       
Sbjct: 371 NGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEK 430

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 431 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
          Length = 463

 Score = 96.3 bits (238), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 88/341 (25%), Positives = 133/341 (39%), Gaps = 64/341 (18%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           LYK+   ++  IN    +WTA            L   +     +  +  RP P       
Sbjct: 167 LYKYDHNFVKAINAIQKSWTATTYM--EYETLTLGDMIKRSGGHSRKIPRPKPTPLTAEI 224

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +    +P  +D    W N   I  V    +  +C + + FA+VG    R  I +   Q 
Sbjct: 225 QQKILHLPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQT 280

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG-----CQP- 188
             LS++ V SC +       + C  G  +           +T G Y    G     C P 
Sbjct: 281 PILSSQEVVSCSQYA-----QGCEGGFPY-----------LTAGKYAQDFGLVEEACFPY 324

Query: 189 -STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED 247
             T SPC                    K    C        ++  ++     ++   NE 
Sbjct: 325 TGTDSPC--------------------KMKEDCFR------YYSSEYHYVGGFYGGCNEA 358

Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTEN-- 301
            +K E++ HGP    F +YDDF HY++G+Y HT         E   H+  L+G+GT++  
Sbjct: 359 LMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSAS 418

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G  YW+V N+WG  WG+ G  +I RG  ECA E +  A  P
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATP 459


>sp|Q5RB02|CATC_PONAB Dipeptidyl peptidase 1 OS=Pongo abelii GN=CTSC PE=2 SV=1
          Length = 463

 Score = 94.7 bits (234), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 89/341 (26%), Positives = 133/341 (39%), Gaps = 64/341 (18%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEY----LRQFLIADAKYFDQSDRPLPGDR 73
           LYK+   ++  IN    +WTA         +EY    L   +     +  +  RP P   
Sbjct: 167 LYKYDHNFVKAINAIQKSWTA------TTYKEYETLTLGDMIRRSGGHSRKIPRPKPAPL 220

Query: 74  KTYDPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSK 130
                +    +P  +D    W N   I  V    +  +C + + FA++G    R  I + 
Sbjct: 221 TAEIQQKVLHLPTSWD----WRNIHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTS 276

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG-DRTGCQP- 188
             Q   LS + V SC +       + C  G       F +        D+G     C P 
Sbjct: 277 NSQTPILSPQEVVSCSQYA-----QGCEGG-------FPYLIAGKYAQDFGLVEEACFPY 324

Query: 189 -STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED 247
             T SPC                    K    C        ++  ++     ++   NE 
Sbjct: 325 TGTDSPC--------------------KMKEDCFR------YYSSEYHYVGGFYGGCNEA 358

Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--N 301
            +K E++ HGP    F +YDDF HYK G+Y HT         E   H+  L+G+GT+  +
Sbjct: 359 LMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSAS 418

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G  YW+V N+WG  WG+ G  +I RG  ECA E +  A  P
Sbjct: 419 GMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATP 459


>sp|P53634|CATC_HUMAN Dipeptidyl peptidase 1 OS=Homo sapiens GN=CTSC PE=1 SV=2
          Length = 463

 Score = 94.4 bits (233), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 130/337 (38%), Gaps = 56/337 (16%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           LYK+   ++  IN    +WTA            L   +     +  +  RP P       
Sbjct: 167 LYKYDHNFVKAINAIQKSWTATTYM--EYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEI 224

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +    +P  +D    W N   I  V    +  +C + + FA++G    R  I +   Q 
Sbjct: 225 QQKILHLPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQT 280

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG-DRTGCQP--STI 191
             LS + V SC +       + C  G       F +        D+G     C P   T 
Sbjct: 281 PILSPQEVVSCSQYA-----QGCEGG-------FPYLIAGKYAQDFGLVEEACFPYTGTD 328

Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKK 251
           SPC                    K    C        ++  ++     ++   NE  +K 
Sbjct: 329 SPC--------------------KMKEDCFR------YYSSEYHYVGGFYGGCNEALMKL 362

Query: 252 EILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--NGTPY 305
           E++ HGP    F +YDDF HYK G+Y HT         E   H+  L+G+GT+  +G  Y
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDY 422

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           W+V N+WG  WG+ G  +I RG  ECA E +  A  P
Sbjct: 423 WIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATP 459


>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
           SV=1
          Length = 435

 Score = 93.6 bits (231), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 88/334 (26%), Positives = 132/334 (39%), Gaps = 53/334 (15%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           LYK++  ++  IN    +WTA R          LR  +        +  RP P       
Sbjct: 142 LYKYNYEFVKAINTIQKSWTATRYI--EYETLTLRDMMTRVGG--RKIPRPKPTPLTAEI 197

Query: 78  PEYSATVPDRFDAREQWPNC-GT--IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            E  + +P  +D    W N  GT  +  V +  +C + + FA+      R  I +   Q 
Sbjct: 198 HEEISRLPTSWD----WRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQT 253

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS + + SC +       + C  G  +           +  G Y    G       P 
Sbjct: 254 PILSPQEIVSCSQYA-----QGCEGGFPY-----------LIAGKYAQDFGLVEEACFPY 297

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
           +   S P  P+          C    ++  Y  G F              NE  +K E++
Sbjct: 298 AGSDS-PCKPN---------DCFRYYSSEYYYVGGFYGAC----------NEALMKLELV 337

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--NGTPYWLV 308
            HGP    F +YDDF+HY+ G+Y HT         E   H+  L+G+GT+  +G  YW+V
Sbjct: 338 RHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIV 397

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            N+WG  WG+ G  +I RG  ECA E +  A  P
Sbjct: 398 KNSWGSRWGEDGYFRIRRGTDECAIESIAVAATP 431


>sp|Q3ZCJ8|CATC_BOVIN Dipeptidyl peptidase 1 OS=Bos taurus GN=CTSC PE=2 SV=1
          Length = 463

 Score = 92.8 bits (229), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 133/335 (39%), Gaps = 52/335 (15%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEY-LRQFLIADAKYFDQSDRPLPGDRKTY 76
           LY+++  ++  IN    +WTA    P    E   L++ +     +  +  RP P      
Sbjct: 167 LYRYNHDFVKAINAIQKSWTAA---PYMEYETLTLKEMIRRGGGHSRRIPRPKPAPITAE 223

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
             +    +P  +D    W N   I  V    + G+C + + FA++G    R  I +   Q
Sbjct: 224 IQKKILHLPTSWD----WRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQ 279

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
              LS + V SC +       + C  G  +           +  G Y    G       P
Sbjct: 280 TPILSPQEVVSCSQYA-----QGCEGGFPY-----------LIAGKYAQDFGLVEEDCFP 323

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
            +   S   L         K  C            ++  ++     ++   NE  +K E+
Sbjct: 324 YTGTDSPCRL---------KEGCFR----------YYSSEYHYVGGFYGGCNEALMKLEL 364

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--NGTPYWL 307
           +  GP    F +YDDF HY+ GVY HT         E   H+  L+G+GT+  +G  YW+
Sbjct: 365 VHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWI 424

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           V N+WG  WG+ G  +I RG  ECA E +  A  P
Sbjct: 425 VKNSWGTSWGENGYFRIRRGTDECAIESIALAATP 459


>sp|P80067|CATC_RAT Dipeptidyl peptidase 1 OS=Rattus norvegicus GN=Ctsc PE=1 SV=3
          Length = 462

 Score = 92.4 bits (228), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 88/340 (25%), Positives = 137/340 (40%), Gaps = 63/340 (18%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLR---QFLIADAKYFDQSDRPLPGDRK 74
           LY  +  ++  IN    +WTA         EEY +   + LI  + +  +  RP P    
Sbjct: 167 LYSHNHNFVKAINSVQKSWTA------TTYEEYEKLSIRDLIRRSGHSGRILRPKPAPIT 220

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSKG 131
               +   ++P+ +D    W N   I  V    +  +C + + FA++G    R  I +  
Sbjct: 221 DEIQQQILSLPESWD----WRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNN 276

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD-RTGCQP-- 188
            Q   LS + V SC         + C  G       F +        D+G     C P  
Sbjct: 277 SQTPILSPQEVVSCSPYA-----QGCDGG-------FPYLIAGKYAQDFGVVEENCFPYT 324

Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
           +T +PC                 PK  C            ++  ++     ++   NE  
Sbjct: 325 ATDAPCK----------------PKENC----------LRYYSSEYYYVGGFYGGCNEAL 358

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--NG 302
           +K E++ HGP    F ++DDF HY SG+Y HT  +      E   H+  L+G+G +   G
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTG 418

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
             YW+V N+WG  WG+ G  +I RG  ECA E +  A  P
Sbjct: 419 LDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIP 458


>sp|P97821|CATC_MOUSE Dipeptidyl peptidase 1 OS=Mus musculus GN=Ctsc PE=2 SV=1
          Length = 462

 Score = 92.0 bits (227), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 84/337 (24%), Positives = 134/337 (39%), Gaps = 57/337 (16%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           LY  +  ++  IN    +WTA         E+   + LI  + +  +  RP P       
Sbjct: 167 LYTHNHNFVKAINTVQKSWTAT---AYKEYEKMSLRDLIRRSGHSQRIPRPKPAPMTDEI 223

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +    +P+ +D    W N   + +V    +  +C + + FA++G    R  I +   Q 
Sbjct: 224 QQQILNLPESWD----WRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQT 279

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD-RTGCQPSTI-- 191
             LS + V SC         + C  G       F +        D+G     C P T   
Sbjct: 280 PILSPQEVVSCSPYA-----QGCDGG-------FPYLIAGKYAQDFGVVEESCFPYTAKD 327

Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKK 251
           SPC                 P+  C            ++   +     ++   NE  +K 
Sbjct: 328 SPCK----------------PRENC----------LRYYSSDYYYVGGFYGGCNEALMKL 361

Query: 252 EILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTE--NGTPY 305
           E++ HGP    F ++DDF HY SG+Y HT  +      E   H+  L+G+G +   G  Y
Sbjct: 362 ELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEY 421

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           W++ N+WG +WG+ G  +I RG  ECA E +  A  P
Sbjct: 422 WIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIP 458


>sp|Q26563|CATC_SCHMA Cathepsin C OS=Schistosoma mansoni PE=2 SV=1
          Length = 454

 Score = 85.5 bits (210), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 136/344 (39%), Gaps = 53/344 (15%)

Query: 8   LLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDR 67
           L G       LY  + +++ +IN    +W  G  +P  LS+  + +             R
Sbjct: 141 LFGSKSFGRTLYHINPSFVGKINAHQKSW-RGEIYP-ELSKYTIDELRNRAGGVKSMVTR 198

Query: 68  PLPGDRKTYDPEY---SATVPDRFDAREQWPNCGT---IGHVPDTGACAAPHIFAAVGAF 121
           P   +RKT   E    +  +P  FD     P  G+   +  + + G C + +   +  A 
Sbjct: 199 PSVLNRKTPSKELISLTGNLPLEFDWTS--PPDGSRSPVTPIRNQGICGSCYASPSAAAL 256

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
             R  + S   +   LS + V  C        ++ C+ G  F           +  G YG
Sbjct: 257 EARIRLVSNFSEQPILSPQTVVDCSPY-----SEGCNGGFPF-----------LIAGKYG 300

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
           +  G     + P +             +   K      CT       ++   +     Y+
Sbjct: 301 EDFGLPQKIVIPYT------------GEDTGKCTVSKNCTR------YYTTDYSYIGGYY 342

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK-------LENYLHSGKL 294
              NE  ++ E++++GP    F +Y+DF  YK G+Y HT+           E   H+  L
Sbjct: 343 GATNEKLMQLELISNGPFPVGFEVYEDFQFYKEGIYHHTTVQTDHYNFNPFELTNHAVLL 402

Query: 295 IGWGTE--NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYL 336
           +G+G +  +G PYW V N+WG  WG++G  +ILRG  EC  E L
Sbjct: 403 VGYGVDKLSGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVESL 446


>sp|Q3T0I2|CATH_BOVIN Pro-cathepsin H OS=Bos taurus GN=CTSH PE=2 SV=1
          Length = 335

 Score = 78.6 bits (192), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 60/112 (53%), Gaps = 6/112 (5%)

Query: 223 PTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
           P+    F +D    TL     ++E+A+ + +  H P +  F +  DF  Y+ G+Y  TS 
Sbjct: 218 PSKAIAFVKDVANITL-----NDEEAMVEAVALHNPVSFAFEVTADFMMYRKGIYSSTSC 272

Query: 283 AKLENYL-HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAF 333
            K  + + H+   +G+G E G PYW+V N+WGP+WG +G   I RGK  C  
Sbjct: 273 HKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIERGKNMCGL 324


>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
          Length = 340

 Score = 77.0 bits (188), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 67/245 (27%), Positives = 107/245 (43%), Gaps = 44/245 (17%)

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           T+PD  D RE+    G +  V   G+C A   F+AVGA   +  +K K  +   LS + +
Sbjct: 122 TLPDTVDWREK----GCVTEVKYQGSCGACWAFSAVGALEGQ--LKLKTGKLISLSAQNL 175

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
             C    +Y  NK C  G +   + ++   G +      D +    +T   C H+ S   
Sbjct: 176 VDCSNEEKYG-NKGCGGGYMTEAFQYIIDNGGIEA----DASYPYKATDEKC-HYNSKNR 229

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
             +C           +R     +G                  +EDA+K+ +   GP +  
Sbjct: 230 AATC-----------SRYIQLPFG------------------DEDALKEAVATKGPVSVG 260

Query: 263 F-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
             A +  F+ YKSGVY   S     N  H   ++G+GT +G  YWLV N+WG ++GD+G 
Sbjct: 261 IDASHSSFFFYKSGVYDDPSCTG--NVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGY 318

Query: 322 VKILR 326
           +++ R
Sbjct: 319 IRMAR 323


>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
          Length = 329

 Score = 77.0 bits (188), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 62/254 (24%), Positives = 103/254 (40%), Gaps = 47/254 (18%)

Query: 76  YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
           Y PE+   VPD  D R++    G +  V + G C +   F++ GA   +  +K K  +  
Sbjct: 107 YTPEWEGRVPDSIDYRKK----GYVTPVKNQGQCGSCWAFSSAGALEGQ--LKKKTGKLL 160

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            LS + +  C       +N  C  G +   + ++ + G +   D     G   S    C 
Sbjct: 161 ALSPQNLVDCV-----SENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDES----CM 211

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           ++ +A              KC      P                     NE A+K+ +  
Sbjct: 212 YNATAKAA-----------KCRGYREIPV-------------------GNEKALKRAVAR 241

Query: 256 HGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            GP + +  A    F  Y  GVY +  N   +N  H+  ++G+GT+ G  YW++ N+WG 
Sbjct: 242 VGPVSVSIDASLTSFQFYSRGVY-YDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGE 300

Query: 315 HWGDRGTVKILRGK 328
            WG++G V + R K
Sbjct: 301 SWGNKGYVLLARNK 314


>sp|O46427|CATH_PIG Pro-cathepsin H OS=Sus scrofa GN=CTSH PE=1 SV=1
          Length = 335

 Score = 76.6 bits (187), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 59/112 (52%), Gaps = 6/112 (5%)

Query: 223 PTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
           P     F +D    T+     ++E+A+ + +  + P +  F + +DF  Y+ G+Y  TS 
Sbjct: 218 PDKAIAFVKDVANITM-----NDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSC 272

Query: 283 AKLENYL-HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAF 333
            K  + + H+   +G+G ENG PYW+V N+WGP WG  G   I RGK  C  
Sbjct: 273 HKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGL 324


>sp|P09668|CATH_HUMAN Pro-cathepsin H OS=Homo sapiens GN=CTSH PE=1 SV=4
          Length = 335

 Score = 75.1 bits (183), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 59/112 (52%), Gaps = 6/112 (5%)

Query: 223 PTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
           P    GF +D    T+      +E+A+ + +  + P +  F +  DF  Y++G+Y  TS 
Sbjct: 218 PGKAIGFVKDVANITIY-----DEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSC 272

Query: 283 AKLENYL-HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAF 333
            K  + + H+   +G+G +NG PYW+V N+WGP WG  G   I RGK  C  
Sbjct: 273 HKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGL 324


>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
          Length = 362

 Score = 72.8 bits (177), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 47/89 (52%), Gaps = 1/89 (1%)

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT-SNAKLENYLHSGKLIGWGTENGTP 304
           ED +K  +    P +  F + D F  YKSGVY         ++  H+   +G+G ENG P
Sbjct: 263 EDELKNAVGLVRPVSVAFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVP 322

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAF 333
           YWL+ N+WG  WGD G  K+  GK  CA 
Sbjct: 323 YWLIKNSWGADWGDNGYFKMEMGKNMCAI 351


>sp|P55097|CATK_MOUSE Cathepsin K OS=Mus musculus GN=Ctsk PE=2 SV=2
          Length = 329

 Score = 72.8 bits (177), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 60/254 (23%), Positives = 103/254 (40%), Gaps = 47/254 (18%)

Query: 76  YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
           Y PE+   VPD  D R++    G +  V + G C +   F++ GA   +  +K K  +  
Sbjct: 107 YTPEWEGRVPDSIDYRKK----GYVTPVKNQGQCGSCWAFSSAGALEGQ--LKKKTGKLL 160

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            LS + +  C       +N  C  G +   + ++ + G +   D     G   S    C 
Sbjct: 161 ALSPQNLVDCV-----TENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDES----CM 211

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           ++ +A              KC      P                     NE A+K+ +  
Sbjct: 212 YNATAKAA-----------KCRGYREIPV-------------------GNEKALKRAVAR 241

Query: 256 HGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            GP + +  A    F  Y  GVY +  N   +N  H+  ++G+GT+ G+ +W++ N+WG 
Sbjct: 242 VGPISVSIDASLASFQFYSRGVY-YDENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGE 300

Query: 315 HWGDRGTVKILRGK 328
            WG++G   + R K
Sbjct: 301 SWGNKGYALLARNK 314


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
           SV=1
          Length = 462

 Score = 72.0 bits (175), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 130/315 (41%), Gaps = 56/315 (17%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNFPANLS-EEYLRQFLIADAKYFDQSDRPLPGDR 73
           R E++K +  ++D+ N +  ++  G    A+L+ +EY  ++L A  +          G+R
Sbjct: 72  RFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEK--------KGER 123

Query: 74  KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           +T    Y A V D       W   G +  V D G C +   F+ +GA      I + G  
Sbjct: 124 RT-SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVT-GDL 181

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV-TGGDYGDRTGCQPSTIS 192
                 E V      C    N+ C+ G +   + F+ K G + T  DY  +      T  
Sbjct: 182 ITLSEQELVD-----CDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKG--VDGTCD 234

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
               +    T+ S E+              PTY                   +E+++KK 
Sbjct: 235 QIRKNAKVVTIDSYEDV-------------PTY-------------------SEESLKKA 262

Query: 253 ILAHGPTT-ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINT 311
           + AH P + A  A    F  Y SG++  +   +L+   H    +G+GTENG  YW+V N+
Sbjct: 263 V-AHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLD---HGVVAVGYGTENGKDYWIVRNS 318

Query: 312 WGPHWGDRGTVKILR 326
           WG  WG+ G +++ R
Sbjct: 319 WGKSWGESGYLRMAR 333


>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
          Length = 333

 Score = 71.6 bits (174), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 58/113 (51%), Gaps = 6/113 (5%)

Query: 222 NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTS 281
           NP     F ++    TL     ++E A+ + +  + P +  F + +DF  YKSGVY   S
Sbjct: 215 NPQKAVAFVKNVVNITL-----NDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKS 269

Query: 282 NAKLENYL-HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAF 333
             K  + + H+   +G+G +NG  YW+V N+WG  WG+ G   I RGK  C  
Sbjct: 270 CHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGL 322


>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
          Length = 333

 Score = 71.6 bits (174), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 59/251 (23%), Positives = 100/251 (39%), Gaps = 45/251 (17%)

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           P   D R++      +  V + GAC +   F+  GA      I S   +   L+ + +  
Sbjct: 115 PSSMDWRKK---GNVVSPVKNQGACGSCWTFSTTGALESAVAIASG--KMMTLAEQQLVD 169

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           C    +  +N  C  G   + + ++     + G D                         
Sbjct: 170 CA---QNFNNHGCQGGLPSQAFEYILYNKGIMGED------------------------- 201

Query: 205 SCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
                  P +  + +C  NP     F ++    TL     ++E A+ + +  + P +  F
Sbjct: 202 -----SYPYIGKNGQCKFNPEKAVAFVKNVVNITL-----NDEAAMVEAVALYNPVSFAF 251

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYL-HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
            + +DF  YKSGVY   S  K  + + H+   +G+G +NG  YW+V N+WG +WG+ G  
Sbjct: 252 EVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNGYF 311

Query: 323 KILRGKYECAF 333
            I RGK  C  
Sbjct: 312 LIERGKNMCGL 322


>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
          Length = 330

 Score = 70.1 bits (170), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 72/295 (24%), Positives = 123/295 (41%), Gaps = 52/295 (17%)

Query: 35  TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQW 94
           ++  G N   +++ E +   L++  +  +Q  R +     TY    +  +PD  D RE+ 
Sbjct: 72  SYDLGMNHLGDMTSEEVMS-LMSSLRVPNQWQRNI-----TYKSNPNQMLPDSVDWREK- 124

Query: 95  PNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDN 154
              G +  V   G+C A   F+AVGA   +  +K K  +   LS + +  C +  +Y  N
Sbjct: 125 ---GCVTEVKYQGSCGACWAFSAVGALEAQ--LKLKTGKLVSLSAQNLVDCSE--KYG-N 176

Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL 214
           K C+ G +   + ++            D  G       P        T   C+     + 
Sbjct: 177 KGCNGGFMTEAFQYII-----------DNKGIDSEASYP-----YKATDQKCQYDSKYRA 220

Query: 215 KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT-TATFALYDDFYHYK 273
              ++ T   YGR                  ED +K+ +   GP      A +  F+ Y+
Sbjct: 221 ATCSKYTELPYGR------------------EDVLKEAVANKGPVCVGVDASHPSFFLYR 262

Query: 274 SGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
           SGVY   +  +  N  H   +IG+G  NG  YWLV N+WG ++G++G +++ R K
Sbjct: 263 SGVYYDPACTQKVN--HGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGYIRMARNK 315


>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
           PE=2 SV=2
          Length = 362

 Score = 69.7 bits (169), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 34/90 (37%), Positives = 48/90 (53%), Gaps = 3/90 (3%)

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVY--KHTSNAKLENYLHSGKLIGWGTENGT 303
           ED +K  +    P +  F + + F  YKSGVY   H   + ++   H+   +G+G ENG 
Sbjct: 264 EDELKNAVGLVRPVSVAFQVINGFRMYKSGVYTSDHCGTSPMD-VNHAVLAVGYGVENGV 322

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAF 333
           PYWL+ N+WG  WGD G  K+  GK  C  
Sbjct: 323 PYWLIKNSWGADWGDNGYFKMEMGKNMCGI 352


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
           GN=At3g19400 PE=2 SV=1
          Length = 362

 Score = 69.7 bits (169), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 85/321 (26%), Positives = 135/321 (42%), Gaps = 52/321 (16%)

Query: 9   LGCTLVRGELYKFSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDR 67
           LG    R +++K +  ++D+ N   + T+  G    A+L+ E  R   +   K  +++  
Sbjct: 58  LGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYLR--KKMERTKD 115

Query: 68  PLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCI 127
            +  +R  Y  +    +PD  D    W   G +  V D G C +   F+AVGA      I
Sbjct: 116 SVKTERYLY--KEGDVLPDEVD----WRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQI 169

Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV-TGGDYGDRTGC 186
            +   +   LS + +  C    R   N  C  G +   + F+ K G + T  DY      
Sbjct: 170 TTG--ELISLSEQELVDCD---RGFVNAGCDGGIMNYAFEFIMKNGGIETDQDY------ 218

Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNE 246
                 P +    A  L  C   K      +TR          ++D  R        D+E
Sbjct: 219 ------PYN----ANDLGLCNADK----NNNTRVVTIDG----YEDVPR--------DDE 252

Query: 247 DAIKKEILAHGPTT-ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
            ++KK + AH P + A  A    F  YKSGV   T    L+   H   ++G+G+ +G  Y
Sbjct: 253 KSLKKAV-AHQPVSVAIEASSQAFQLYKSGVMTGTCGISLD---HGVVVVGYGSTSGEDY 308

Query: 306 WLVINTWGPHWGDRGTVKILR 326
           W++ N+WG +WGD G VK+ R
Sbjct: 309 WIIRNSWGLNWGDSGYVKLQR 329


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.320    0.137    0.455 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 145,290,624
Number of Sequences: 539616
Number of extensions: 6634845
Number of successful extensions: 12371
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 197
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 11913
Number of HSP's gapped (non-prelim): 296
length of query: 344
length of database: 191,569,459
effective HSP length: 118
effective length of query: 226
effective length of database: 127,894,771
effective search space: 28904218246
effective search space used: 28904218246
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (28.5 bits)