BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 022276
         (300 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1
          Length = 363

 Score =  409 bits (1050), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 196/262 (74%), Positives = 228/262 (87%), Gaps = 7/262 (2%)

Query: 18  LASAVA--VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
           +A+AV    N+DD +IRQVV    +  EDHLLNAEHHF+ FKSKFSK+YAT+EEHDYRF 
Sbjct: 15  VATAVTDDTNNDDFIIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYATKEEHDYRFG 70

Query: 76  VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
           VFK+NL +AK  Q  DPTA HG+TKFSDLT SEFRRQFLGL +RLRLPA AQKAPILPT 
Sbjct: 71  VFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKAPILPTT 130

Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
           +LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLSEQQLVDCDH 
Sbjct: 131 NLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHV 190

Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
           CDPE++GSCDSGCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFDKSK+ A+VSN
Sbjct: 191 CDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYAYTGRD-GSCKFDKSKVVASVSN 249

Query: 256 FSVISSDEDQMAANLVKHGPLA 277
           FSV++ DEDQ+AANLVK+GPLA
Sbjct: 250 FSVVTLDEDQIAANLVKNGPLA 271


>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
           SV=1
          Length = 368

 Score =  395 bits (1015), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 196/279 (70%), Positives = 227/279 (81%), Gaps = 6/279 (2%)

Query: 1   MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
           M+RL L  S+ +L    V  S+  VND DD +IRQVV      +E  +L +E HFSLFK 
Sbjct: 1   MDRLKLYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKR 56

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
           KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+  
Sbjct: 57  KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
             +LP DA KAPILPT +LP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+LVSLSEQQLVDCDHECDPEE+ SCDSGCNGGLMNSAFEY LK GG+ +E+DYPYTG D
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKD 236

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLA
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLA 275


>sp|P43295|A494_ARATH Probable cysteine proteinase A494 OS=Arabidopsis thaliana
           GN=At2g21430 PE=2 SV=2
          Length = 361

 Score =  391 bits (1004), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 186/272 (68%), Positives = 218/272 (80%), Gaps = 4/272 (1%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           L  L  + L  V  S     D+D +IRQVV    +++E  +L++E HF+LFK KF K Y 
Sbjct: 5   LRVLFSVSLIFVFVSVSVCGDEDVLIRQVV----DETEPKVLSSEDHFTLFKKKFGKVYG 60

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           + EEH YRF VFKANL RA R Q +DP+A HGVT+FSDLT SEFRR+ LG+    +LP D
Sbjct: 61  SIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKGGFKLPKD 120

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
           A +APILPT +LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGAHFL+TG+LVSLS
Sbjct: 121 ANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLS 180

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK GG+ REKDYPYTGTDGGSCK D
Sbjct: 181 EQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLD 240

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +SKI A+VSNFSV+S +EDQ+AANL+K+GPLA
Sbjct: 241 RSKIVASVSNFSVVSINEDQIAANLIKNGPLA 272


>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1
          Length = 371

 Score =  349 bits (896), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 173/278 (62%), Positives = 207/278 (74%), Gaps = 12/278 (4%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           +D +IRQVVP  G    D  LNAE HF  F  +F K+Y   +EH YR  VFK NLRRA+R
Sbjct: 24  EDPLIRQVVP--GGDDNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARR 81

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
            QLLDP+A HGVTKFSDLTP+EFRR +LGL +  R     L   A +AP+LPT+ LP DF
Sbjct: 82  HQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDF 141

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWRDHGAV  VK+QG+CGSCWSFSA+GALEGAH+L+TG+L  LSEQQ VDCDHECD  E 
Sbjct: 142 DWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEP 201

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
            SCDSGCNGGLM +AF Y+ KAGG+E EKDYPYTG+D G CKFDKSKI A+V NFSV+S 
Sbjct: 202 DSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSD-GKCKFDKSKIVASVQNFSVVSV 260

Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           DE Q++ANL+KHGPLA  + +  +     +++  VS P
Sbjct: 261 DEAQISANLIKHGPLAIGINAAYMQ----TYIGGVSCP 294


>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium discoideum GN=cprA PE=1 SV=2
          Length = 343

 Score =  215 bits (548), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 111/239 (46%), Positives = 145/239 (60%), Gaps = 10/239 (4%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFS 102
           L  +  F  F+ KF+K Y + EE+  RF +FK+NL + +   L+          GV KF+
Sbjct: 23  LEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFA 81

Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACG 159
           DL+  EF+  +L  N+      D   A  L     N +PT FDWR  GAVT VK+QG CG
Sbjct: 82  DLSSDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCG 140

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFE 218
           SCWSFS TG +EG HF+S  +LVSLSEQ LVDCDHEC + E   +CD GCNGGL  +A+ 
Sbjct: 141 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYN 200

Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YI+K GG++ E  YPYT   G  C F+ + I A +SNF++I  +E  MA  +V  GPLA
Sbjct: 201 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLA 259


>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
          Length = 339

 Score =  179 bits (455), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 105/258 (40%), Positives = 141/258 (54%), Gaps = 24/258 (9%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKF 101
           L+  E H   +K +  K YA + E  +R ++F  N  + AK  QL     V    G+ K+
Sbjct: 23  LIKEEWH--TYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKY 80

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN------DLPTDFDWRDHGAVTGVKDQ 155
           +D+   EF+    G N  LR     +   +  T        +P   DWR+HGAVTGVKDQ
Sbjct: 81  ADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQ 140

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
           G CGSCW+FS+TGALEG HF   G LVSLSEQ LVDC        +   ++GCNGGLM++
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCS-------TKYGNNGCNGGLMDN 193

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHG 274
           AF YI   GG++ EK YPY G D  SC F+K+ I A  + F  I   DE++M   +   G
Sbjct: 194 AFRYIKDNGGIDTEKSYPYEGID-DSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMG 252

Query: 275 PLAGNVASIELPHISFSF 292
           P++    +I+  H SF  
Sbjct: 253 PVS---VAIDASHESFQL 267


>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus GN=VCATH
           PE=3 SV=1
          Length = 337

 Score =  179 bits (454), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 97/265 (36%), Positives = 148/265 (55%), Gaps = 29/265 (10%)

Query: 30  MIRQVVPSDGEQSEDHLL----NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
           MI  ++     Q E HL     +A+H+F  F   ++K Y   +  +YRF++FK NL    
Sbjct: 5   MIFTILLVASSQIEGHLKFDIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNLEDIN 64

Query: 86  RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK------------APILP 133
            +  L+ +A++ + KFSDL+ +E   ++ GL  +   P++  +            AP   
Sbjct: 65  EKNKLNDSAIYNINKFSDLSKNELLTKYTGLTSKK--PSNMVRSTSNFCNVIHLDAPPDV 122

Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
            ++LP +FDWR +  +T VKDQGACGSCW+ +A G LE  + +    L++LSEQQL+DCD
Sbjct: 123 HDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCD 182

Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
                    S +  C+GGLM++AFE ++ AGG+  E DYPY GT  G CK D  K A +V
Sbjct: 183 ---------SANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTK-GVCKIDNKKFALSV 232

Query: 254 SNFS-VISSDEDQMAANLVKHGPLA 277
           S+    I  +E+ +   L+  GP+A
Sbjct: 233 SSCKRYIFQNEENLKKELITMGPIA 257


>sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium discoideum GN=cprB PE=2 SV=1
          Length = 376

 Score =  177 bits (448), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 99/247 (40%), Positives = 140/247 (56%), Gaps = 16/247 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
           F+ +  KF++ Y++ E  + R+ +FK+N+          D   V G+  F+D+T  E+R+
Sbjct: 36  FTEWTLKFNRQYSSSEFSN-RYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRK 94

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
            +LG               +L   DL   P   DWR   AVT +KDQG CGSCWSFS TG
Sbjct: 95  TYLGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTG 154

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           + EGAH L T +LVSLSEQ LVDC     PEE    + GC+GGLMN+AF+YI+K  G++ 
Sbjct: 155 STEGAHALKTKKLVSLSEQNLVDCS---GPEE----NFGCDGGLMNNAFDYIIKNKGIDT 207

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
           E  YPYT   G +C F+KS I A +  +  I++  +    N  +HGP++    +I+  H 
Sbjct: 208 ESSYPYTAETGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVS---VAIDASHN 264

Query: 289 SFSFLFT 295
           SF  L+T
Sbjct: 265 SFQ-LYT 270


>sp|P14658|CYSP_TRYBB Cysteine proteinase OS=Trypanosoma brucei brucei PE=1 SV=1
          Length = 450

 Score =  176 bits (447), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 109/288 (37%), Positives = 151/288 (52%), Gaps = 43/288 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +RLR      K   + T   P   DWR+ GAVT VK QG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + DSGCNGGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
           GN=At3g19400 PE=2 SV=1
          Length = 362

 Score =  171 bits (434), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 92/222 (41%), Positives = 134/222 (60%), Gaps = 15/222 (6%)

Query: 62  KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
           K Y    E + RF++FK NL+   +   + D T   G+T+F+DLT  EFR  +L   +++
Sbjct: 53  KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110

Query: 121 RLPADAQKAP--ILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
               D+ K    +    D LP + DWR +GAV  VKDQG CGSCW+FSA GA+EG + ++
Sbjct: 111 ERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
           TGEL+SLSEQ+LVDCD        G  ++GC+GG+MN AFE+I+K GG+E ++DYPY   
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223

Query: 238 DGGSCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLA 277
           D G C  DK+     V+   +  +  D+++     V H P++
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVS 265


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
           SV=1
          Length = 462

 Score =  169 bits (428), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 97/274 (35%), Positives = 152/274 (55%), Gaps = 20/274 (7%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           +L L    ++SAV ++      +  V + G +SE  +++    + L K   +++  +  E
Sbjct: 10  ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAW-LVKHGKAQSQNSLVE 68

Query: 70  HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLP 123
            D RF +FK NLR        + +   G+T+F+DLT  E+R ++LG        RR  L 
Sbjct: 69  KDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128

Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
            +A+       ++LP   DWR  GAV  VKDQG CGSCW+FS  GA+EG + + TG+L++
Sbjct: 129 YEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183

Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
           LSEQ+LVDCD         S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG   +
Sbjct: 184 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQ 235

Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             K+     + ++  + +  ++     V H P++
Sbjct: 236 IRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPIS 269


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score =  169 bits (428), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 104/252 (41%), Positives = 138/252 (54%), Gaps = 23/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K Y  + E  +R ++F  N  + AK  Q      V     V K++DL   E
Sbjct: 59  WHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 118

Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           FR+   G N    ++LR   ++ K    I P +  LP   DWR  GAVT VKDQG CGSC
Sbjct: 119 FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 178

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGALEG HF  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI 
Sbjct: 179 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDCS-------TKYGNNGCNGGLMDNAFRYIK 231

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
             GG++ EK YPY   D  SC F+K  + A    F+ I   DE +MA  +   GP++   
Sbjct: 232 DNGGIDTEKSYPYEAID-DSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 287

Query: 281 ASIELPHISFSF 292
            +I+  H SF F
Sbjct: 288 VAIDASHESFQF 299


>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
           SV=1
          Length = 323

 Score =  169 bits (427), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 101/253 (39%), Positives = 132/253 (52%), Gaps = 17/253 (6%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
           L  A   +  FK K+ + Y   EE  YR  +F+ N +      K+ +  + T    + KF
Sbjct: 13  LAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKF 72

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
            D+T  EF     G   R   P      P   T    T+ DWR  GAVT VKDQG CGSC
Sbjct: 73  GDMTLEEFNAVMKGNIPRRSAPVSV-FYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSC 131

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS TG+LEG HFL TG L+SL+EQQLVDC     P+       GCNGG MN AF+YI 
Sbjct: 132 WAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQ-------GCNGGWMNDAFDYIK 184

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNV 280
              G++ E  YPY   D GSC+FD + +AA  S  + I+S  +      V+  GP++   
Sbjct: 185 ANNGIDTEAAYPYEARD-GSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPIS--- 240

Query: 281 ASIELPHISFSFL 293
            +I+  H SF F 
Sbjct: 241 VTIDAAHSSFQFY 253


>sp|Q26534|CATL_SCHMA Cathepsin L OS=Schistosoma mansoni GN=CL1 PE=2 SV=1
          Length = 319

 Score =  166 bits (421), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 91/234 (38%), Positives = 136/234 (58%), Gaps = 17/234 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTP 106
           N +  +  FK K+ K Y   E+ + RF +FK+N+ +A+  Q+ +  +A++GVT +SDLT 
Sbjct: 15  NVDEKYVQFKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTT 73

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF R  L       +P+     P       N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 74  DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 131

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +E   F  TG+L+SLSEQQLVDCD           D GCNGGL ++A+E I+K 
Sbjct: 132 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 182

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG+  E +YPY   +   C      +A  +++   ++ DE ++AA L  +  ++
Sbjct: 183 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTIS 235


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
           SV=1
          Length = 355

 Score =  166 bits (420), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 97/257 (37%), Positives = 143/257 (55%), Gaps = 22/257 (8%)

Query: 44  DHLLNAEHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
           +HL N +    LF+S   + SK Y + EE  +RF VF+ NL    +R     +   G+ +
Sbjct: 39  EHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNE 98

Query: 101 FSDLTPSEFRRQFLGLNR----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQG 156
           F+DLT  EF+ ++LGL +    R R P+   +   +   DLP   DWR  GAV  VKDQG
Sbjct: 99  FADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQG 156

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
            CGSCW+FS   A+EG + ++TG L SLSEQ+L+DCD         + +SGCNGGLM+ A
Sbjct: 157 QCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDT--------TFNSGCNGGLMDYA 208

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA-AAVSNFSVISSDEDQMAANLVKHGP 275
           F+YI+  GG+ +E DYPY   + G C+  K  +    +S +  +  ++D+     + H P
Sbjct: 209 FQYIISTGGLHKEDDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQP 267

Query: 276 LAGNVASIELPHISFSF 292
           ++    +IE     F F
Sbjct: 268 VS---VAIEASGRDFQF 281


>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
          Length = 344

 Score =  165 bits (417), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 128/234 (54%), Gaps = 16/234 (6%)

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           K+Y T EE   R+ +FKAN+   ++        V G+  F+D+T  E+R  +LG      
Sbjct: 39  KSY-TSEEFGARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYRNTYLGTKFDAS 97

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
                Q+  +  T+   +  DWR  GAVT VK+QG CG CWSFS TG+ EGAHF S GEL
Sbjct: 98  SLIGTQEEKVFTTSSAASK-DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGEL 156

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQ L+DC  E         +SGC+GGLM  AFEYI+   G++ E  YPY   + G 
Sbjct: 157 VSLSEQNLIDCSTE---------NSGCDGGLMTYAFEYIINNNGIDTESSYPYKA-ENGK 206

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
           C++      A +S++  +++  +    + V   P++    +I+  H SF  L+T
Sbjct: 207 CEYKSENSGATLSSYKTVTAGSESSLESAVNVNPVS---VAIDASHQSFQ-LYT 256


>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
          Length = 323

 Score =  164 bits (416), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 97/231 (41%), Positives = 123/231 (53%), Gaps = 23/231 (9%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           FK+KF K YA  EE  +R  VF   L+      +R    + T    +  FSDLT  E   
Sbjct: 23  FKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEVLA 82

Query: 112 QFLGLNRRLR----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
              G+ RR      LP  A      PT  +  D DWR+ GAVT VKDQG CGSCW+FSA 
Sbjct: 83  TKTGMTRRRHPLSVLPKSA------PTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSAV 136

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
            ALEGAHFL TG+LVSLSEQ LVDC        S   + GCNGG    A++YI+   G++
Sbjct: 137 AALEGAHFLKTGDLVSLSEQNLVDC-------SSSYGNQGCNGGWPYQAYQYIIANRGID 189

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
            E  YPY   D  +C++D   I A VS++    S DE  +   +   GP++
Sbjct: 190 TESSYPYKAID-DNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVS 239


>sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1
          Length = 467

 Score =  163 bits (412), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P   +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC+GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258


>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Drosophila melanogaster
           GN=CG12163 PE=2 SV=2
          Length = 614

 Score =  162 bits (410), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 135/232 (58%), Gaps = 15/232 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSE 108
           +H F  F+ +F + Y +  E   R R+F+ NL+  +     +  +A +G+T+F+D+T SE
Sbjct: 305 DHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSE 364

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           ++ +  GL +R    A    A ++P    +LP +FDWR   AVT VK+QG+CGSCW+FS 
Sbjct: 365 YKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSV 423

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TG +EG + + TGEL   SEQ+L+DCD         + DS CNGGLM++A++ I   GG+
Sbjct: 424 TGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 474

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
           E E +YPY       C F+++     V+ F  +   +E  M   L+ +GP++
Sbjct: 475 EYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPIS 525


>sp|Q94504|CYSP7_DICDI Cysteine proteinase 7 OS=Dictyostelium discoideum GN=cprG PE=1 SV=1
          Length = 460

 Score =  161 bits (408), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 90/217 (41%), Positives = 122/217 (56%), Gaps = 22/217 (10%)

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           + EE + R+ +FKAN+             V G+  F+D++  E+R  +LG       P D
Sbjct: 42  SSEEFNGRYNIFKANMDYVNEWNTKGSETVLGLNVFADISNEEYRATYLGT------PFD 95

Query: 126 AQKAPILPTN---DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE-- 180
           A    +  ++   D     DWR  GAVT +K+QG CG CWSFS TGA EGA +L+ G+  
Sbjct: 96  ASSLEMTESDKIFDASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKN 155

Query: 181 LVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
           LVSLSEQ L+DC        SGS  ++GC GGLM  AFEYI+   G++ E  YPYT  DG
Sbjct: 156 LVSLSEQNLIDC--------SGSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDG 207

Query: 240 GSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGP 275
             CKF+   +AA +S++ +V S  E  +AA  V  GP
Sbjct: 208 KKCKFNPKNVAAQLSSYVNVTSGSESDLAAK-VTQGP 243


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
           GN=GCP1 PE=2 SV=2
          Length = 376

 Score =  160 bits (406), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 91/255 (35%), Positives = 138/255 (54%), Gaps = 22/255 (8%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLL 90
           +PSDG+   D  + +   +  + ++  KT         + D RF +FK NLR        
Sbjct: 33  LPSDGKWRTDEEVRS--IYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNED 90

Query: 91  DPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFD 142
           +  A +  G+TKF+DLT  E+R+ +LG      RR+    +  +      N  ++P   D
Sbjct: 91  NKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVD 150

Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
           WR  GAV  +KDQG CGSCW+FS T A+EG + + TGEL+SLSEQ+LVDCD         
Sbjct: 151 WRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK-------- 202

Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
           S + GCNGGLM+ AF++I+K GG+  EKDYPY G  G    F K+    ++  +  + + 
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTK 262

Query: 263 EDQMAANLVKHGPLA 277
           ++      + + P++
Sbjct: 263 DETALKKAISYQPVS 277


>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  160 bits (404), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 94/253 (37%), Positives = 133/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           D   NA+ H   +KS   + Y T EE ++R  V++ N+R  +          HG T    
Sbjct: 22  DQTFNAQWH--QWKSTHRRLYGTNEE-EWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML--QIPKTVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H+         + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD-------QGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEYAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
          Length = 358

 Score =  159 bits (403), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 102/257 (39%), Positives = 137/257 (53%), Gaps = 22/257 (8%)

Query: 26  DDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFK 78
           D+   IR V  SDG    E+S   +L    H   F+ F  ++ K Y   EE   RF +FK
Sbjct: 27  DESNPIRMV--SDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84

Query: 79  ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
            NL   +       +   GV +F+DLT  EF+R  LG  +     A  + +  +    LP
Sbjct: 85  ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNC--SATLKGSHKVTEAALP 142

Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
              DWR+ G V+ VKDQG CGSCW+FS TGALE A+  + G+ +SLSEQQLVDC    + 
Sbjct: 143 ETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFN- 201

Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
                 + GCNGGL + AFEYI   GG++ EK YPYTG D  +CKF    +   V    N
Sbjct: 202 ------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN 254

Query: 256 FSVISSDEDQMAANLVK 272
            ++ + DE + A  LV+
Sbjct: 255 ITLGAEDELKHAVGLVR 271


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
           PE=1 SV=2
          Length = 458

 Score =  159 bits (402), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 85/209 (40%), Positives = 120/209 (57%), Gaps = 16/209 (7%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+    A   ++ +K++  K+Y    E + R+  F+ NLR            
Sbjct: 25  IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
           VH    G+ +F+DLT  E+R  +LGL  + R         +   N+ LP   DWR  GAV
Sbjct: 82  VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
             +KDQG CGSCW+FSA  A+EG + + TG+L+SLSEQ+LVDCD         S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GGLM+ AF++I+  GG++ E DYPY G D
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 222


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
           SV=2
          Length = 356

 Score =  159 bits (402), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 96/255 (37%), Positives = 139/255 (54%), Gaps = 19/255 (7%)

Query: 28  DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
           D  I    P D E S D L+     F  + S F K Y T EE   RF VFK NL+     
Sbjct: 30  DYSIVGYSPEDLE-SHDKLIEL---FENWISNFEKAYETVEEKFLRFEVFKDNLKHIDET 85

Query: 88  QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWR 144
                +   G+ +F+DL+  EF++ +LGL   +    + +        D+   P   DWR
Sbjct: 86  NKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWR 145

Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
             GAV  VK+QG+CGSCW+FS   A+EG + + TG L +LSEQ+L+DCD         + 
Sbjct: 146 KKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT--------TY 197

Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF--DKSKIAAAVSNFSVISSD 262
           ++GCNGGLM+ AFEYI+K GG+ +E+DYPY+  + G+C+   D+S+      +  V ++D
Sbjct: 198 NNGCNGGLMDYAFEYIVKNGGLRKEEDYPYS-MEEGTCEMQKDESETVTINGHQDVPTND 256

Query: 263 EDQMAANLVKHGPLA 277
           E  +   L  H PL+
Sbjct: 257 EKSLLKALA-HQPLS 270


>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
           PE=2 SV=1
          Length = 358

 Score =  159 bits (401), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 106/279 (37%), Positives = 152/279 (54%), Gaps = 20/279 (7%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
           +L LSS +LL+L +  AS     D+   I+ V  +  + E +   +L    H   FS F 
Sbjct: 4   KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63

Query: 58  SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
            ++ K Y + EE   RF VFK NL   +       +    + +F+DLT  EF+R  LG  
Sbjct: 64  HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123

Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
           +     A  + +  +    +P   DWR+ G V+ VK+QG CGSCW+FS TGALE A+  +
Sbjct: 124 QNC--SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQA 181

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            G+ +SLSEQQLVDC        +G+ ++ GC+GGL + AFEYI   GG++ E+ YPYTG
Sbjct: 182 FGKGISLSEQQLVDC--------AGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 233

Query: 237 TDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVK 272
            DGG CKF    I   V    N ++ + DE + A  LV+
Sbjct: 234 KDGG-CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR 271


>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
          Length = 331

 Score =  158 bits (399), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 93/253 (36%), Positives = 136/253 (53%), Gaps = 23/253 (9%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           +HH++L+K  +SK Y  + E   R  +++ NL+      L     +H    G+    D+T
Sbjct: 25  DHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMT 84

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
             E     + L   LR+P+  Q+     +N    LP   DWR+ G VT VK QG+CG+CW
Sbjct: 85  GEEV----ISLMGSLRVPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACW 140

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA GALE    L TG+LVSLS Q LVD    C  E+ G  + GCNGG M +AF+YI+ 
Sbjct: 141 AFSAVGALEAQLKLKTGKLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTTAFQYIID 194

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
             G++ E  YPY   + G C++D  K AA  S ++ +    ED +   +   GP++    
Sbjct: 195 NNGIDSEASYPYKAMN-GKCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVS---V 250

Query: 282 SIELPHISFSFLF 294
           +I+  H SF FL+
Sbjct: 251 AIDASHYSF-FLY 262


>sp|P54639|CYSP4_DICDI Cysteine proteinase 4 OS=Dictyostelium discoideum GN=cprD PE=2 SV=2
          Length = 442

 Score =  158 bits (399), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 85/234 (36%), Positives = 128/234 (54%), Gaps = 13/234 (5%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L   + F+ +     +TY++ EE + R+++FK+N+    +        V G+  F+D+T 
Sbjct: 24  LQYRNAFTNWMQAHQRTYSS-EEFNARYQIFKSNMDYVHQWNSKGGETVLGLNVFADITN 82

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            E+R  +LG           ++  I  T   PT  DWR  GAVT +K+QG CG CWSFS 
Sbjct: 83  QEYRTTYLGTPFDGSALIGTEEEKIFST-PAPT-VDWRAQGAVTPIKNQGQCGGCWSFST 140

Query: 167 TGALEGAHFLSTG---ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           TG+ EGAHF+++G   +LVSLSEQ L+DC            ++GC GGLM  AFEYI+  
Sbjct: 141 TGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYG-------NNGCEGGLMTLAFEYIINN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G++ E  YPYT  DG  CKF  S I A + ++  ++S  +    +   + P++
Sbjct: 194 KGIDTESSYPYTAEDGKECKFKTSNIGAQIVSYQNVTSGSEASLQSASNNAPVS 247


>sp|Q8QLK1|CATV_NPVMC Viral cathepsin OS=Mamestra configurata nucleopolyhedrovirus
           GN=VCATH PE=3 SV=1
          Length = 337

 Score =  157 bits (398), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 96/290 (33%), Positives = 158/290 (54%), Gaps = 35/290 (12%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           ++ +L+LLL   L SAV  + D     QVV    + +  ++ +A  +F  F S+++K Y+
Sbjct: 1   MNKILILLL---LVSAVLTSHD-----QVVAVTIKPNLYNINSAPLYFEKFISQYNKQYS 52

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           +++E  YR+ +F+ N+     +   + +AV+ + +F+D+T +E       +NR   L + 
Sbjct: 53  SEDEKKYRYNIFRHNIESINAKNSRNDSAVYKINRFADMTKNEV------VNRHTGLASG 106

Query: 126 AQKAPILPT--------NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
              A    T           P +FDWR++  VT VKDQG CG+CW+F+  GALE  + + 
Sbjct: 107 DIGANFCETIVVDGPGQRQRPANFDWRNYNKVTSVKDQGMCGACWAFAGLGALESQYAIK 166

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
              L+ L+EQQLVDCD           D GC+GGL+++A+E I+  GGVE+E DYPY   
Sbjct: 167 YDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMHIGGVEQEYDYPYKAV 217

Query: 238 DGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKH-GPLAGNVASIEL 285
               C     K A  V N +  +   E+++  +L++H GP+A  V +++L
Sbjct: 218 R-LPCAVKPHKFAVGVRNCYRYVLLSEERL-EDLLRHVGPIAIAVDAVDL 265


>sp|Q8V5U0|CATV_NPVHZ Viral cathepsin OS=Heliothis zea nuclear polyhedrosis virus
           GN=VCATH PE=3 SV=1
          Length = 367

 Score =  157 bits (397), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 95/254 (37%), Positives = 137/254 (53%), Gaps = 28/254 (11%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQL----------LDPTAVH 96
           +E +F  F  +++K+Y   +E+ YR+ VFK NL +  ++ R+           L  +A  
Sbjct: 53  SEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQF 112

Query: 97  GVTKFSDLTPSEFRRQ----FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
           GV KFSD TP E        FL L++   L  + +     P   LP  +DWRD   VT +
Sbjct: 113 GVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAPDIRLPDYYDWRDTNKVTPI 171

Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
           KDQG CGSCW+F A G +E  + +   +L+ LSEQQL+DCD           D GCNGGL
Sbjct: 172 KDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGCNGGL 222

Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLV 271
           M+ AF+ +L  GGVE E DYPY G++   C  D  KIA  + S F     DE+++   + 
Sbjct: 223 MHLAFQELLLMGGVETEADYPYQGSE-QMCTLDNRKIAVKLNSCFKYDIRDENKLKELVY 281

Query: 272 KHGPLAGNVASIEL 285
             GP+A  V ++++
Sbjct: 282 TTGPVAIAVDAMDI 295


>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
          Length = 328

 Score =  157 bits (397), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 83/217 (38%), Positives = 122/217 (56%), Gaps = 16/217 (7%)

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRL 122
           + D RF +FK NLR        +  A +  G+T F++LT  E+R  +LG      RR+  
Sbjct: 24  QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITK 83

Query: 123 PADA--QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
             +   + +  +  +++P   DWR  GAV  +KDQG CGSCW+FS   A+EG + + TGE
Sbjct: 84  AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LVSLSEQ+LVDCD         S + GCNGGLM+ AF++I+K GG+  EKDYPY GT+G 
Sbjct: 144 LVSLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGK 195

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
                K+     +  +  + S ++      V + P++
Sbjct: 196 CNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVS 232


>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  156 bits (395), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
           D   +AE H   +KS   + Y T EE ++R  +++ N+R  +          HG    + 
Sbjct: 22  DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H          + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>sp|Q94503|CYSP6_DICDI Cysteine proteinase 6 OS=Dictyostelium discoideum GN=cprF PE=2 SV=1
          Length = 434

 Score =  156 bits (394), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 90/235 (38%), Positives = 128/235 (54%), Gaps = 26/235 (11%)

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           + EE + RF +FKAN+             V G+  F+D+T  E+R  +LG       P D
Sbjct: 42  SSEEFNGRFNIFKANMDYINEWNTKGSETVLGLNVFADITNEEYRATYLGT------PFD 95

Query: 126 AQKAPILPTNDL-----PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG- 179
           A    + P+  +         DWR  GAVT +K+QG CG CWSFSATGA EGA +++ G 
Sbjct: 96  ASSLEMTPSEKVFGGVQANSVDWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGD 155

Query: 180 -ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
            +L S+SEQQL+DC        SGS  ++GC GGLM  AFEYI+  GG++ E  YP+T  
Sbjct: 156 SDLTSVSEQQLIDC--------SGSYGNNGCEGGLMTLAFEYIINNGGIDTESSYPFT-A 206

Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           +   CK++ S I A +S++  ++S  +   A  V  GP +    +I+    SF F
Sbjct: 207 NTEKCKYNPSNIGAELSSYVNVTSGSESDLAAKVTQGPTS---VAIDASQPSFQF 258


>sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1
          Length = 484

 Score =  156 bits (394), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 101/238 (42%), Positives = 137/238 (57%), Gaps = 14/238 (5%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
           S+D  +     F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTK
Sbjct: 176 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 235

Query: 101 FSDLTPSEFRRQFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
           FSDLT  EFR  +L  N  LR  P +  K      +  P ++DWR  GAVT VKDQG CG
Sbjct: 236 FSDLTEEEFRTIYL--NTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCG 293

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FS TG +EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  
Sbjct: 294 SCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSA 344

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           I   GG+E E DY Y G    SC F   K    +++   +S +E ++AA L K GP++
Sbjct: 345 IKNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 401


>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
           SV=2
          Length = 322

 Score =  155 bits (392), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 132/258 (51%), Gaps = 24/258 (9%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
           L  A   +  FK KF + Y   EE  YR  VF  NL+      K+ +  + T    + +F
Sbjct: 13  LAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQF 72

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP--TDFDWRDHGAVTGVKDQGACG 159
           SD+T  +F     G  +  R PA    A    T+  P  T+ DWR  GAVT VKDQG CG
Sbjct: 73  SDMTNEKFNAVMKGYKKGPR-PA----AVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCG 127

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFE 218
           SCW+FS TG +EG HFL TG LVSLSEQQLVDC         GS  + GCNGG +  A  
Sbjct: 128 SCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDC-------AGGSYYNQGCNGGWVERAIM 180

Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLA 277
           Y+   GGV+ E  YPY   D  +C+F+ + I A  + +  I+   +       +  GP++
Sbjct: 181 YVRDNGGVDTESSYPYEARD-NTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPIS 239

Query: 278 GNVASIELPHISFSFLFT 295
               +I+  H SF   +T
Sbjct: 240 ---VAIDASHRSFQSYYT 254


>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
          Length = 356

 Score =  154 bits (390), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 108/285 (37%), Positives = 153/285 (53%), Gaps = 29/285 (10%)

Query: 1   MERLILSSLLLLLLSSVLASAVA---VNDDDAMIRQVVPSDGEQSEDHLLNAEHH----- 52
           M RL   SL+L+L++ + A+A+A      D   IRQVV  D  + E+ +L          
Sbjct: 1   MSRL---SLVLILVAGLFATALAGPATFADKNPIRQVVFPD--ELENGILQVVGQTRSAL 55

Query: 53  -FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ F  +  K Y + EE   RF +F  NL+  +       +   G+ +F+DLT  EFR+
Sbjct: 56  SFARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGINEFTDLTWDEFRK 115

Query: 112 QFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
             LG ++     +   K  +  TN  LP   DWR  G V+ VK QG CGSCW+FS TGAL
Sbjct: 116 HKLGASQNC---SATTKGNLKLTNVVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGAL 172

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           E A+  + G+ +SLSEQQLVDC    +       + GCNGGL + AFEYI   GG++ E+
Sbjct: 173 EAAYAQAFGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKFNGGLDTEE 225

Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
            YPYTG + G CKF ++ I   V    N ++ +  E + A  LV+
Sbjct: 226 AYPYTGKN-GICKFSQANIGVKVISSVNITLGAEYELKYAVALVR 269


>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
          Length = 334

 Score =  154 bits (389), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 92/249 (36%), Positives = 126/249 (50%), Gaps = 17/249 (6%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSD 103
           N + H+  +K+   + Y   EE ++R  V++ N +             HG    +  F D
Sbjct: 24  NLDAHWHQWKATHRRLYGMNEE-EWRRAVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGD 82

Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           +T  EFR+   G   +          P+L   D+P   DW   G VT VK+QG CGSCW+
Sbjct: 83  MTNEEFRQVMNGFQNQKHKKGKLFHEPLLV--DVPKSVDWTKKGYVTPVKNQGQCGSCWA 140

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATGALEG  F  TG+LVSLSEQ LVDC            + GCNGGLM++AF+YI   
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNQGCNGGLMDNAFQYIKDN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG++ E+ YPY  TD  SC +     AA  + F  I   E  +   +   GP++    +I
Sbjct: 194 GGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAI 250

Query: 284 ELPHISFSF 292
           +  H SF F
Sbjct: 251 DAGHTSFQF 259


>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
          Length = 333

 Score =  154 bits (388), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 91/253 (35%), Positives = 130/253 (51%), Gaps = 19/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           D  LNA+ +   +K+   + Y   EE  +R  V++ N++  +          HG T    
Sbjct: 22  DQSLNAQWY--QWKATHRRLYGMNEE-GWRRAVWEKNMKMIELHNREYSQGKHGFTMAMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P+    ++P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQVMNGFQNQKHKKGKMFQEPLFA--EIPKSVDWREKGYVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSATGALEG  F  TG+LVSLSEQ LVDC            + GCNGGLM++AF Y
Sbjct: 137 SCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNEGCNGGLMDNAFRY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           +   GG++ E+ YPY G D  +C +     AA  + F  +   E  +   +   GP++  
Sbjct: 190 VKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQREKALMKAVATLGPIS-- 247

Query: 280 VASIELPHISFSF 292
             +I+  H SF F
Sbjct: 248 -VAIDAGHQSFQF 259


>sp|Q9YMP9|CATV_NPVLD Viral cathepsin OS=Lymantria dispar multicapsid nuclear
           polyhedrosis virus GN=VCATH PE=3 SV=1
          Length = 356

 Score =  153 bits (387), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLD-PTAVHGVTKF 101
           +L  A  +F  F   ++K Y +  E + R+ +FK NL    AK     D PTA + + KF
Sbjct: 48  NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKF 107

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACG 159
           SDL+ SE   +F GL+   R+ ++  K  IL  P +  P  FDWR+   VT +K+QGACG
Sbjct: 108 SDLSKSELIAKFTGLSIPERV-SNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACG 166

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           +CW+F+   ++E    +    L+ LSEQQL+DCD         S D GCNGGL+++AFE 
Sbjct: 167 ACWAFATLASVESQFAMRHNRLIDLSEQQLIDCD---------SVDMGCNGGLLHTAFEE 217

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPL 276
           I++ GGV+ E DYP+ G +   C  D+ +  + + V  +  +  +E+++   L   GP+
Sbjct: 218 IMRMGGVQTELDYPFVGRN-RRCGLDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPI 275


>sp|P41721|CATV_NPVBM Viral cathepsin OS=Bombyx mori nuclear polyhedrosis virus GN=VCATH
           PE=1 SV=1
          Length = 323

 Score =  153 bits (386), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/236 (36%), Positives = 133/236 (56%), Gaps = 21/236 (8%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L A ++F  F  +F+K Y+++ E   RF++F+ NL     +   D +A + + KFSDL+ 
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80

Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
            E   ++ GL+    LP   Q   K  +L  P    P +FDWR    VT VK+QG CG+C
Sbjct: 81  DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+F+  G+LE    +   EL++LSEQQ++DCD           D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
           K GGV+ E DYPY   D  +C+ + +K    V + +  I   E+++   L   GP+
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLPLVGPI 242


>sp|Q9WGE0|CATV_NPVHC Viral cathepsin OS=Hyphantria cunea nuclear polyhedrosis virus
           GN=VCATH PE=3 SV=1
          Length = 324

 Score =  153 bits (386), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/236 (35%), Positives = 130/236 (55%), Gaps = 18/236 (7%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A  +F  F  KF+K Y+++ E   RF++F+ NL     +   D TA + + KFSDL+
Sbjct: 21  LLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYEINKFSDLS 80

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL     LP   Q   +  +L  P +  P +FDWR    VT VK+QG CG+
Sbjct: 81  KDETISKYTGL----ALPLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGA 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+   +LE    +   +L++LSEQQL+DCD+          D+GCNGGL+++A+E +
Sbjct: 137 CWAFATLASLESQFAIKHNQLINLSEQQLIDCDY---------VDAGCNGGLLHTAYEAV 187

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
           ++ GGV+ E DYPY G+DG         +      +  I+  E+++   L   GP+
Sbjct: 188 MQMGGVQAENDYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPI 243


>sp|P35591|CYSP1_LEIPI Cysteine proteinase 1 OS=Leishmania pifanoi GN=CYS1 PE=2 SV=2
          Length = 354

 Score =  153 bits (386), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 101/275 (36%), Positives = 143/275 (52%), Gaps = 27/275 (9%)

Query: 12  LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHD 71
           LL + V+     V    A+I Q  P       D+ + A  H+  FK +  K +    E  
Sbjct: 7   LLFAIVVTILFVVCYGSALIAQTPPP-----VDNFV-ASAHYGSFKKRHGKAFGGDAEEG 60

Query: 72  YRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
           +RF  FK N++ A      +P A + V+ KF+DLTP EF + +L  +   R   D  K  
Sbjct: 61  HRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKLYLNPDYYARHLKD-HKED 119

Query: 131 ILPTNDLPT---DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
           +   +  P+     DWRD GAVT VK+QG CGSCW+FSA G +EG    S   LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179

Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPYTGTDGGSCK-- 243
            LV CD         + D GCNGGLM+ A  +I+++  G V  E  YPY  T GG  +  
Sbjct: 180 MLVSCD---------NIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPY--TSGGGTRPP 228

Query: 244 -FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             D+ ++ A ++ F  +  DE+++A  + K GP+A
Sbjct: 229 CHDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVA 263


>sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata multicapsid polyhedrosis
           virus GN=VCATH PE=3 SV=1
          Length = 324

 Score =  153 bits (386), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 86/237 (36%), Positives = 132/237 (55%), Gaps = 20/237 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A ++F  F  KF+K Y+++ E  +RF++F+ NL     +   D TA + + KFSDL+
Sbjct: 21  LLKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQNDSTAQYEINKFSDLS 80

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL+    LP   Q   +  IL  P +  P +FDWR    VT VK+QG CG+
Sbjct: 81  KEEAISKYTGLS----LPHQTQNFCEVVILDRPPDRGPLEFDWRQFNKVTSVKNQGVCGA 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+  G+LE    +    L++LSEQQ +DCD           ++GC+GGL+++AFE  
Sbjct: 137 CWAFATLGSLESQFAIKYNRLINLSEQQFIDCDR---------VNAGCDGGLLHTAFESA 187

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPL 276
           ++ GGV+ E DYPY  T  G C+ + ++    V S    I   E+++   L   GP+
Sbjct: 188 MEMGGVQMESDYPYE-TANGQCRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPI 243


>sp|Q6VTL7|CATV_NPVCD Viral cathepsin OS=Choristoneura fumiferana defective polyhedrosis
           virus GN=Vcath PE=3 SV=1
          Length = 324

 Score =  153 bits (386), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 132/237 (55%), Gaps = 20/237 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A  +F  F   F+K Y+++ E  +RF++F+ NL     + L D +A + + KFSDL+
Sbjct: 21  LLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFSDLS 80

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL+    LP   Q   +  +L  P +  P +FDWR    VT VK+QG CG+
Sbjct: 81  KDETISKYTGLS----LPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGA 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+  G+LE    +   +L++LSEQQL+DCD           D GC+GGL+++A+E +
Sbjct: 137 CWAFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDMGCDGGLLHTAYEAV 187

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
           +  GG++ E DYPY   + G C+ + +K    V   +  +   E+++   L   GPL
Sbjct: 188 MNMGGIQAENDYPYEANN-GDCRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPL 243


>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
           SV=1
          Length = 321

 Score =  152 bits (384), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 95/253 (37%), Positives = 128/253 (50%), Gaps = 19/253 (7%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
           L  A   +  FK+++ + Y   +E  YR RVF+ N +      K+ +  + T    + +F
Sbjct: 13  LATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQF 72

Query: 102 SDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
            D+T  EF     G  +  R  P     A   P   +  D DWR    VT VKDQ  CGS
Sbjct: 73  GDMTNEEFNAVMKGYKKGSRGEPKAVFTAEAGP---MAADVDWRTKALVTPVKDQEQCGS 129

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FSATGALEG HFL   ELVSLSEQQLVDC  +         + GC GG M SAF+YI
Sbjct: 130 CWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYG-------NDGCGGGWMTSAFDYI 182

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
              GG++ E  YPY   D  SC+FD + I A  +    +   E+ +   +   GP++   
Sbjct: 183 KDNGGIDTESSYPYEAED-RSCRFDANSIGAICTGSVEVQHTEEALQEAVSGVGPIS--- 238

Query: 281 ASIELPHISFSFL 293
            +I+  H SF F 
Sbjct: 239 VAIDASHFSFQFY 251


>sp|P41715|CATV_NPVCF Viral cathepsin OS=Choristoneura fumiferana nuclear polyhedrosis
           virus GN=Vcath PE=3 SV=1
          Length = 324

 Score =  152 bits (384), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 82/237 (34%), Positives = 134/237 (56%), Gaps = 20/237 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           +L A ++F  F  KF+K+Y+++ E   RF++F+ NL     +   D TA + + KF+DL+
Sbjct: 21  VLKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIINKNHNDSTAQYEINKFADLS 80

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL+    LP   Q   +  +L  P +  P +FDWR    VT VK+QG CG+
Sbjct: 81  KDETISKYTGLS----LPLQTQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGA 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+  G+LE    +   + ++LSEQQL+DCD           D+GC+GGL+++AFE +
Sbjct: 137 CWAFATLGSLESQFAIKHNQFINLSEQQLIDCDF---------VDAGCDGGLLHTAFEAV 187

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
           +  GG++ E DYPY   + G C+ + +K    V   +  I+  E+++   L   GP+
Sbjct: 188 MNMGGIQAESDYPYEANN-GDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPI 243


>sp|P36400|LMCPB_LEIME Cysteine proteinase B OS=Leishmania mexicana GN=LMCPB PE=2 SV=2
          Length = 443

 Score =  152 bits (383), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + Y T  E   R   F+ NL   +  Q  +P A  G+TKF DL+ +EF  +
Sbjct: 38  FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L          R  A   +      + +P   DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98  YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG  +L+  ELVSLSEQQLV CD   D         GC+GGLM  AF+++L+   G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E  YPY   +G   +   S    + A +    +I S E  MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262


>sp|P25783|CATV_NPVAC Viral cathepsin OS=Autographa californica nuclear polyhedrosis
           virus GN=VCATH PE=1 SV=1
          Length = 323

 Score =  152 bits (383), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 86/237 (36%), Positives = 133/237 (56%), Gaps = 21/237 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A ++F  F  +F+K Y ++ E   RF++F+ NL     +   D +A + + KFSDL+
Sbjct: 21  LLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLS 79

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL+    LP   Q   K  +L  P    P +FDWR    VT VK+QG CG+
Sbjct: 80  KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGA 135

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+   +LE    +   +L++LSEQQ++DCD           D+GCNGGL+++AFE I
Sbjct: 136 CWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAI 186

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
           +K GGV+ E DYPY   D  +C+ + +K    V + +  I+  E+++   L   GP+
Sbjct: 187 IKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPI 242


>sp|Q05094|CYSP2_LEIPI Cysteine proteinase 2 OS=Leishmania pifanoi GN=CYS2 PE=1 SV=1
          Length = 444

 Score =  151 bits (382), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 92/235 (39%), Positives = 123/235 (52%), Gaps = 19/235 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + Y T  E   R   F+ NL   +  Q  +P A  G+TKF DL+ +EF  +
Sbjct: 38  FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L          R  A   +      + +P   DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98  YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG  +L+  ELVSLSEQQLV CD   D         GC+GGLM  AF+++L+   G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK----IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E  YPY   +G   +   S     + A +    +I S E  MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 263


>sp|P25775|LMCPA_LEIME Cysteine proteinase A OS=Leishmania mexicana GN=LMCPA PE=2 SV=1
          Length = 354

 Score =  151 bits (382), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 100/275 (36%), Positives = 143/275 (52%), Gaps = 27/275 (9%)

Query: 12  LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHD 71
           LL + V+     V    A+I Q  P       D+ + A  H+  FK +  K +    E  
Sbjct: 7   LLFAIVVTILFVVCYGSALIAQTPPP-----VDNFV-ASAHYGSFKKRHGKAFGGDAEEG 60

Query: 72  YRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
           +RF  FK N++ A      +P A + V+ KF+DLTP EF + +L  +   R   +  K  
Sbjct: 61  HRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKLYLNPDYYARHLKN-HKED 119

Query: 131 ILPTNDLPT---DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
           +   +  P+     DWRD GAVT VK+QG CGSCW+FSA G +EG    S   LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179

Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPYTGTDGGSCK-- 243
            LV CD         + D GCNGGLM+ A  +I+++  G V  E  YPY  T GG  +  
Sbjct: 180 MLVSCD---------NIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPY--TSGGGTRPP 228

Query: 244 -FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             D+ ++ A ++ F  +  DE+++A  + K GP+A
Sbjct: 229 CHDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVA 263


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.317    0.133    0.391 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 109,964,099
Number of Sequences: 539616
Number of extensions: 4597424
Number of successful extensions: 10531
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 195
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 9923
Number of HSP's gapped (non-prelim): 231
length of query: 300
length of database: 191,569,459
effective HSP length: 117
effective length of query: 183
effective length of database: 128,434,387
effective search space: 23503492821
effective search space used: 23503492821
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (28.1 bits)