BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 042468
         (346 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
          Length = 360

 Score =  380 bits (975), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 190/308 (61%), Positives = 222/308 (72%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R   EK+ RF +FK N  ++ + N    +KPYKL +N+FAD TN EFR
Sbjct: 38  YERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANK--MDKPYKLKLNKFADMTNHEFR 94

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              +G K +     R     + +F YE   +VPAS+DWRKKGAVT VKDQGQCG CWAFS
Sbjct: 95  NTYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFS 154

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            + A+EGIN I T KL SLSEQELVDCDT  ++QGC GGLMD AFEFI    G+ TEA Y
Sbjct: 155 TIVAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGGITTEANY 213

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY+A DG+C+  + N  A  I G+E+VP N+E AL+KAVANQPVSVAIDA GSDFQFYS 
Sbjct: 214 PYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSE 273

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG CGTELDHGV  VGYGT  DGTKYW VKNSWG  WGE GYIRM+R I  KEGLCG
Sbjct: 274 GVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCG 333

Query: 337 IAMQASYP 344
           IAM+ASYP
Sbjct: 334 IAMEASYP 341


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
          Length = 362

 Score =  377 bits (967), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 192/342 (56%), Positives = 233/342 (68%), Gaps = 14/342 (4%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMW-----MAQYGRVYRDNAEKEMRFKIFK 64
           +VL+  LVLGV    + S   +D  +     +W        +  V R   EK  RF +FK
Sbjct: 9   VVLSFSLVLGV----ANSFDFHDKDLASEESLWDLYERWRSHHTVSRSLGEKHKRFNVFK 64

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSV-RSSETTDVSFRY 123
            N+ ++   N    +KPYKL +N+FAD TN EFR+   G K   P + R +   + +F Y
Sbjct: 65  ANLMHV--HNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMY 122

Query: 124 ENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
           E   SVP S+DWRKKGAVT VKDQGQCG CWAFS V A+EGIN I T KL +LSEQELVD
Sbjct: 123 EKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVD 182

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           CD   E+QGC GGLM+ AFEFI    G+ TE+ YPYKA +G+C+  + N  A  I G+E+
Sbjct: 183 CDKE-ENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHEN 241

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           VP+N+E AL+KAVANQPVSVAIDA GSDFQFYS GVFTG C T+L+HGV  VGYGT  DG
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDG 301

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T YW+V+NSWG  WGE+GYIRMQR+I  KEGLCGIAM  SYP
Sbjct: 302 TNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYP 343


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
          Length = 362

 Score =  377 bits (967), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 186/308 (60%), Positives = 219/308 (71%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R   EK  RF +FK NV ++   N    +KPYKL +N+FAD TN EFR
Sbjct: 40  YERWRSHH-TVSRSLGEKHKRFNVFKANVMHV--HNTNKMDKPYKLKLNKFADMTNHEFR 96

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G K       R S+    +F YE   SVPAS+DWRKKGAVT VKDQGQCG CWAFS
Sbjct: 97  STYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFS 156

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            + A+EGIN I T KL SLSEQELVDCD   E+QGC GGLM+ AFEFI    G+ TE+ Y
Sbjct: 157 TIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGGITTESNY 215

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY A +G+C++ + N  A  I G+E+VP N+E AL+KAVANQPVSVAIDA GSDFQFYS 
Sbjct: 216 PYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSE 275

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG C T+L+HGV  VGYGT  DGT YW+V+NSWG  WGE GYIRMQR+I  KEGLCG
Sbjct: 276 GVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLCG 335

Query: 337 IAMQASYP 344
           IAM ASYP
Sbjct: 336 IAMMASYP 343


>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
           GN=CEP1 PE=2 SV=1
          Length = 361

 Score =  376 bits (965), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 190/310 (61%), Positives = 221/310 (71%), Gaps = 6/310 (1%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E +E W + +  V R   EK  RF +FK NV++I   N K  +K YKL +N+F D T+EE
Sbjct: 36  ELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKK--DKSYKLKLNKFGDMTSEE 92

Query: 97  FRAPRNGYKRRLPSVRSSETTDV-SFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           FR    G   +   +   E     SF Y N  ++P S+DWRK GAVT VK+QGQCG CWA
Sbjct: 93  FRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWA 152

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS V A+EGIN I T+KLTSLSEQELVDCDT+ ++QGC GGLMD AFEFI    GL +E 
Sbjct: 153 FSTVVAVEGINQIRTKKLTSLSEQELVDCDTN-QNQGCNGGLMDLAFEFIKEKGGLTSEL 211

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPYKASD +C+  + N     I G+EDVP N+E  LMKAVANQPVSVAIDA GSDFQFY
Sbjct: 212 VYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFY 271

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVFTG+CGTEL+HGV  VGYGT  DGTKYW+VKNSWG  WGE GYIRMQR I  KEGL
Sbjct: 272 SEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGL 331

Query: 335 CGIAMQASYP 344
           CGIAM+ASYP
Sbjct: 332 CGIAMEASYP 341


>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
          Length = 360

 Score =  374 bits (961), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 194/338 (57%), Positives = 234/338 (69%), Gaps = 7/338 (2%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           LV  + L +    P +     ++ ++   +E W   +  V RD  EK  RF +FKENV++
Sbjct: 11  LVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHH-TVARDLDEKNRRFNVFKENVKF 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENA-S 127
           I  FN K ++ PYKL +N+F D TN+EFR+   G K +   S R  +    SF YEN  S
Sbjct: 70  IHEFNQK-KDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYENVGS 128

Query: 128 VPA-SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           +PA SIDWR KGAVTGVKDQGQCG CWAFS +A++EGIN I T +L SLSEQELVDCDTS
Sbjct: 129 LPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTS 188

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             ++GC GGLMD AFEFI  N G+ TE  YPY   DG+C     N     I G++DVP+N
Sbjct: 189 -YNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDVPAN 246

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           NE ALM+AVANQP+SV+I+ASG  FQFYS GVFTG+CGTELDHGV  VGYG   DGTKYW
Sbjct: 247 NENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYW 306

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           +VKNSWG  WGE+GYIRMQR I  K G CGIAM+ASYP
Sbjct: 307 IVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYP 344


>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
          Length = 373

 Score =  352 bits (904), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 174/311 (55%), Positives = 215/311 (69%), Gaps = 7/311 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + + RV R +AEK  RF  FK N  +I S N +  + PY+L +N F D    EFR
Sbjct: 46  YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRG-DHPYRLHLNRFGDMDQAEFR 103

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
           A   G  RR    +        +   N S +P S+DWR+KGAVTGVKDQG+CG CWAFS 
Sbjct: 104 ATFVGDLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFST 163

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           V ++EGIN I T  L SLSEQEL+DCDT+  D GC+GGLMD+AFE+I +N GL TEA YP
Sbjct: 164 VVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNNGGLITEAAYP 222

Query: 218 YKASDGSCNKKEA---NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
           Y+A+ G+CN   A   +P    I G++DVP+N+E  L +AVANQPVSVA++ASG  F FY
Sbjct: 223 YRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFY 282

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVFTG+CGTELDHGV  VGYG A+DG  YW VKNSWG +WGE GYIR+++D  A  GL
Sbjct: 283 SEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGL 342

Query: 335 CGIAMQASYPT 345
           CGIAM+ASYP 
Sbjct: 343 CGIAMEASYPV 353


>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
          Length = 371

 Score =  352 bits (902), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 175/311 (56%), Positives = 215/311 (69%), Gaps = 7/311 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + + RV R +AEK  RF  FK N  +I S +NK  + PY+L +N F D    EFR
Sbjct: 46  YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHS-HNKRGDHPYRLHLNRFGDMDQAEFR 103

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
           A   G  RR    +        +   N S +P S+DWR+KGAVTGVKDQG+CG CWAFS 
Sbjct: 104 ATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFST 163

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           V ++EGIN I T  L SLSEQEL+DCDT+  D GC+GGLMD+AFE+I +N GL TEA YP
Sbjct: 164 VVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNNGGLITEAAYP 222

Query: 218 YKASDGSCNKKEA---NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
           Y+A+ G+CN   A   +P    I G++DVP+N+E  L +AVANQPVSVA++ASG  F FY
Sbjct: 223 YRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFY 282

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVFTG CGTELDHGV  VGYG A+DG  YW VKNSWG +WGE GYIR+++D  A  GL
Sbjct: 283 SEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGL 342

Query: 335 CGIAMQASYPT 345
           CGIAM+ASYP 
Sbjct: 343 CGIAMEASYPV 353


>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
           GN=CEP2 PE=2 SV=1
          Length = 361

 Score =  344 bits (883), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 173/311 (55%), Positives = 220/311 (70%), Gaps = 11/311 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           ++ W + +  V R   E+E RF +F+ NV ++ + N K  N+ YKL +N+FAD T  EF+
Sbjct: 38  YDRWRSHHS-VPRSLNEREKRFNVFRHNVMHVHNTNKK--NRSYKLKLNKFADLTINEFK 94

Query: 99  APRNG----YKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCW 153
               G    + R L   +   +    + +EN S +P+S+DWRKKGAVT +K+QG+CG CW
Sbjct: 95  NAYTGSNIKHHRMLQGPKRG-SKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCW 153

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS VAA+EGIN I T KL SLSEQELVDCDT  +++GC GGLM+ AFEFI  N G+ TE
Sbjct: 154 AFSTVAAVEGINKIKTNKLVSLSEQELVDCDTK-QNEGCNGGLMEIAFEFIKKNGGITTE 212

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY+  DG C+  + N     I G+EDVP N+E AL+KAVANQPVSVAIDA  SDFQF
Sbjct: 213 DSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQF 272

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS GVFTG CGTEL+HGV AVGYG+ + G KYW+V+NSWG  WGE GYI+++R+ID  EG
Sbjct: 273 YSEGVFTGSCGTELNHGVAAVGYGS-ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEG 331

Query: 334 LCGIAMQASYP 344
            CGIAM+ASYP
Sbjct: 332 RCGIAMEASYP 342


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
           SV=1
          Length = 355

 Score =  342 bits (877), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 170/316 (53%), Positives = 220/316 (69%), Gaps = 7/316 (2%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           N   + E  E WM+++ + Y+   EK  RF++F+EN+ +I   NN+  +  Y LG+NEFA
Sbjct: 43  NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS--YWLGLNEFA 100

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQC 149
           D T+EEF+    G  +  P          +FRY + + +P S+DWRKKGAV  VKDQGQC
Sbjct: 101 DLTHEEFKGRYLGLAK--PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQC 158

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS VAA+EGIN ITT  L+SLSEQEL+DCDT+  + GC GGLMD AF++IIS  G
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTF-NSGCNGGLMDYAFQYIISTGG 217

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           L  E  YPY   +G C +++ +     ISGYEDVP N++ +L+KA+A+QPVSVAI+ASG 
Sbjct: 218 LHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGR 277

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           DFQFY  GVF G+CGT+LDHGV AVGYG++  G+ Y +VKNSWG  WGE G+IRM+R+  
Sbjct: 278 DFQFYKGGVFNGKCGTDLDHGVAAVGYGSS-KGSDYVIVKNSWGPRWGEKGFIRMKRNTG 336

Query: 330 AKEGLCGIAMQASYPT 345
             EGLCGI   ASYPT
Sbjct: 337 KPEGLCGINKMASYPT 352


>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
           GN=CEP3 PE=2 SV=1
          Length = 364

 Score =  339 bits (869), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 172/310 (55%), Positives = 209/310 (67%), Gaps = 7/310 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W   +  V R + E   RF +F+ NV ++   N K  NKPYKL IN FAD T+ EFR
Sbjct: 38  YERWRGHHS-VSRASHEAIKRFNVFRHNVLHVHRTNKK--NKPYKLKINRFADITHHEFR 94

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G   +    +R  +     F YEN + VP+S+DWR+KGAVT VK+Q  CG CWAFS
Sbjct: 95  SSYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFS 154

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T KL SLSEQELVDCDT  E+QGC GGLM+ AFEFI +N G+ TE  Y
Sbjct: 155 TVAAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGGIKTEETY 213

Query: 217 PYKASDGS-CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           PY +SD   C           I G+E VP N+E  L+KAVA+QPVSVAIDA  SDFQ YS
Sbjct: 214 PYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYS 273

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
            GVF G+CGT+L+HGV  VGYG   +GTKYW+V+NSWG  WGE GY+R++R I   EG C
Sbjct: 274 EGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRC 333

Query: 336 GIAMQASYPT 345
           GIAM+ASYPT
Sbjct: 334 GIAMEASYPT 343


>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
           PE=1 SV=2
          Length = 466

 Score =  337 bits (863), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 165/311 (53%), Positives = 220/311 (70%), Gaps = 9/311 (2%)

Query: 39  HEMWMAQYGRVYRD--NAEKEMRFKIFKENVEYIASFNNKARNKP-YKLGINEFADQTNE 95
           +++W+A+ G    +    E E RF +F +N++++ + N +A  +  ++LG+N FAD TNE
Sbjct: 52  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111

Query: 96  EFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWA 154
           EFRA   G K    S  + E     +R++    +P S+DWR+KGAV  VK+QGQCG CWA
Sbjct: 112 EFRATFLGAKVAERSRAAGE----RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FSAV+ +E IN + T ++ +LSEQELV+C T+G++ GC GGLMDDAF+FII N G+ TE 
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPYKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
            SGVF+G+CGT LDHGV AVGYGT D+G  YW+V+NSWG  WGE+GY+RM+R+I+   G 
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346

Query: 335 CGIAMQASYPT 345
           CGIAM ASYPT
Sbjct: 347 CGIAMMASYPT 357


>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
          Length = 328

 Score =  334 bits (856), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 161/295 (54%), Positives = 207/295 (70%), Gaps = 7/295 (2%)

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
           +++ RF IFK+N+ +I   N   +N  YKLG+  FA+ TN+E+R+   G  R  P  R +
Sbjct: 24  QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLG-ARTEPVRRIT 82

Query: 115 ETTDVSFRYENA----SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
           +  +V+ +Y  A     VP ++DWR+KGAV  +KDQG CG CWAFS  AA+EGIN I T 
Sbjct: 83  KAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTG 142

Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
           +L SLSEQELVDCD S  +QGC GGLMD AF+FI+ N GL TE  YPY  ++G CN    
Sbjct: 143 ELVSLSEQELVDCDKS-YNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLK 201

Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
           N     I GYEDVPS +E AL +AV+ QPVSVAIDA G  FQ Y SG+FTG+CGT +DH 
Sbjct: 202 NSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHA 261

Query: 291 VTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           V AVGYG+ ++G  YW+V+NSWGT WGE+GYIRM+R++ +K G CGIA++ASYP 
Sbjct: 262 VVAVGYGS-ENGVDYWIVRNSWGTRWGEDGYIRMERNVASKSGKCGIAIEASYPV 315


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
           PE=1 SV=2
          Length = 458

 Score =  333 bits (854), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 165/305 (54%), Positives = 210/305 (68%), Gaps = 6/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W A++G+ Y    E+E R+  F++N+ YI   N  A      ++LG+N FAD TNEE+R 
Sbjct: 43  WKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRD 102

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G + +    R  + +D     +N ++P S+DWR KGAV  +KDQG CG CWAFSA+A
Sbjct: 103 TYLGLRNK--PRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIA 160

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  L SLSEQELVDCDTS  ++GC GGLMD AF+FII+N G+ TE  YPYK
Sbjct: 161 AVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYPYK 219

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
             D  C+    N     I  YEDV  N+E +L KAVANQPVSVAI+A G  FQ YSSG+F
Sbjct: 220 GKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIF 279

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG+CGT LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM+R+I A  G CGIA+
Sbjct: 280 TGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIAV 338

Query: 340 QASYP 344
           + SYP
Sbjct: 339 EPSYP 343


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
           SV=1
          Length = 462

 Score =  331 bits (849), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 165/319 (51%), Positives = 217/319 (68%), Gaps = 14/319 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNA--EKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
           ++A +   +E W+ ++G+    N+  EK+ RF+IFK+N+ ++   N K  N  Y+LG+  
Sbjct: 42  SEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK--NLSYRLGLTR 99

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKD 145
           FAD TN+E+R+   G K      R +     S RYE      +P SIDWRKKGAV  VKD
Sbjct: 100 FADLTNDEYRSKYLGAKMEKKGERRT-----SLRYEARVGDELPESIDWRKKGAVAEVKD 154

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFS + A+EGIN I T  L +LSEQELVDCDTS  ++GC GGLMD AFEFII
Sbjct: 155 QGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFII 213

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N G+ T+  YPYK  DG+C++   N     I  YEDVP+ +E +L KAVA+QP+S+AI+
Sbjct: 214 KNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A G  FQ Y SG+F G CGT+LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM 
Sbjct: 274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMA 332

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+I +  G CGIA++ SYP
Sbjct: 333 RNIASSSGKCGIAIEPSYP 351


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
           SV=2
          Length = 356

 Score =  331 bits (848), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 163/310 (52%), Positives = 216/310 (69%), Gaps = 6/310 (1%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E W++ + + Y    EK +RF++FK+N+++I   N K   K Y LG+NEFAD ++EE
Sbjct: 49  ELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG--KSYWLGLNEFADLSHEE 106

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           F+    G K  +   R  E +   F Y +  +VP S+DWRKKGAV  VK+QG CG CWAF
Sbjct: 107 FKKMYLGLKTDIVR-RDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAF 165

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S VAA+EGIN I T  LT+LSEQEL+DCDT+  + GC GGLMD AFE+I+ N GL  E  
Sbjct: 166 STVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKEED 224

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY   +G+C  ++       I+G++DVP+N+E +L+KA+A+QP+SVAIDASG +FQFYS
Sbjct: 225 YPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYS 284

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
            GVF G+CG +LDHGV AVGYG++  G+ Y +VKNSWG  WGE GYIR++R+    EGLC
Sbjct: 285 GGVFDGRCGVDLDHGVAAVGYGSS-KGSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLC 343

Query: 336 GIAMQASYPT 345
           GI   AS+PT
Sbjct: 344 GINKMASFPT 353


>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
           SV=2
          Length = 490

 Score =  330 bits (847), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 166/294 (56%), Positives = 209/294 (71%), Gaps = 8/294 (2%)

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKP-YKLGINEFADQTNEEFRAPRNGYKRRLPSVRS 113
           E E RF++F +N++++ + N +A  +  ++LG+N FAD TN EFRA    Y    P+ R 
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRAT---YLGTTPAGRG 140

Query: 114 SETTDVSFRYENA-SVPASIDWRKKGAVTG-VKDQGQCGCCWAFSAVAAMEGINHITTRK 171
               + ++R++   ++P S+DWR KGAV   VK+QGQCG CWAFSAVAA+EGIN I T +
Sbjct: 141 RRVGE-AYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199

Query: 172 LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEAN 231
           L SLSEQELV+C  +G++ GC GG+MDDAF FI  N GL TE  YPY A DG CN  + +
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259

Query: 232 PSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGV 291
                I G+EDVP N+E +L KAVA+QPVSVAIDA G +FQ Y SGVFTG+CGT LDHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319

Query: 292 TAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            AVGYGT A  G  YW V+NSWG  WGENGYIRM+R++ A+ G CGIAM ASYP
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 373


>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
          Length = 345

 Score =  327 bits (838), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 170/343 (49%), Positives = 223/343 (65%), Gaps = 19/343 (5%)

Query: 9   KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           +LV   + +  +WA P + S       M ++ E WMA+YGRVY+DN EK +RF+IFK NV
Sbjct: 6   QLVFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNV 65

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-----RRLPSVRSSETTDVSFR 122
            +I +FNN+  N  Y LGIN+F D TN EF A   G       +R P V S +  D+S  
Sbjct: 66  NHIETFNNRNGNS-YTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVV-SFDDVDIS-- 121

Query: 123 YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
               SVP SIDWR  GAVT VK+QG+CG CWAF+++A +E I  I    L SLSEQ+++D
Sbjct: 122 ----SVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLD 177

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           C  S    GC+GG ++ A+ FIISNKG+A+ A YPYKA+ G+C K    P++A I+ Y  
Sbjct: 178 CAVS---YGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTC-KTNGVPNSAYITRYTY 233

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           V  NNE  +M AV+NQP++ A+DASG +FQ Y  GVFTG CGT L+H +  +GYG    G
Sbjct: 234 VQRNNERNMMYAVSNQPIAAALDASG-NFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSG 292

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            K+W+V+NSWG  WGE GYIR+ RD+ +  GLCGIAM   YPT
Sbjct: 293 KKFWIVRNSWGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYPT 335


>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
          Length = 351

 Score =  324 bits (831), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 166/339 (48%), Positives = 223/339 (65%), Gaps = 11/339 (3%)

Query: 9   KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           +LV   + +  +WA P + SR   +  M +R E WMA+YGRVY+D+ EK  RF+IFK NV
Sbjct: 6   QLVFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNV 65

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
           ++I +FN++  N  Y LGIN+F D T  EF A   G    L   R      VSF   N S
Sbjct: 66  KHIETFNSRNENS-YTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPV---VSFDDVNIS 121

Query: 128 -VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
            VP SIDWR  GAV  VK+Q  CG CW+F+A+A +EGI  I T  L SLSEQE++DC  S
Sbjct: 122 AVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVS 181

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
               GC+GG ++ A++FIISN G+ TE  YPY A  G+CN   + P++A I+GY  V  N
Sbjct: 182 ---YGCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNAN-SFPNSAYITGYSYVRRN 237

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           +E ++M AV+NQP++  IDAS  +FQ+Y+ GVF+G CGT L+H +T +GYG    GTKYW
Sbjct: 238 DERSMMYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYW 296

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +V+NSWG++WGE GY+RM R + +  G+CGIAM   +PT
Sbjct: 297 IVRNSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFPT 335


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
           GN=At3g19400 PE=2 SV=1
          Length = 362

 Score =  323 bits (827), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 159/317 (50%), Positives = 210/317 (66%), Gaps = 6/317 (1%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           N+  +   +E W+ +  + Y    EKE RFKIFK+N++++   +N   ++ +++G+  FA
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDE-HNSVPDRTFEVGLTRFA 94

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TNEEFRA     ++++   + S  T+     E   +P  +DWR  GAV  VKDQG CG
Sbjct: 95  DLTNEEFRAIY--LRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSAV A+EGIN ITT +L SLSEQELVDCD    + GC+GG+M+ AFEFI+ N G+
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212

Query: 211 ATEAKYPYKASD-GSCN-KKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
            T+  YPY A+D G CN  K  N     I GYEDVP ++E +L KAVA+QPVSVAI+AS 
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
             FQ Y SGV TG CG  LDHGV  VGYG+   G  YW+++NSWG  WG++GY+++QR+I
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGST-SGEDYWIIRNSWGLNWGDSGYVKLQRNI 331

Query: 329 DAKEGLCGIAMQASYPT 345
           D   G CGIAM  SYPT
Sbjct: 332 DDPFGKCGIAMMPSYPT 348


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
           GN=GCP1 PE=2 SV=2
          Length = 376

 Score =  322 bits (825), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 165/323 (51%), Positives = 214/323 (66%), Gaps = 12/323 (3%)

Query: 32  DATMNERHEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKARNKPYKLGIN 87
           D  +   +  W A++G+   +N     +++ RF IFK+N+ +I   N   +N  YKLG+ 
Sbjct: 42  DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA----SVPASIDWRKKGAVTGV 143
           +F D TN+E+R    G  R  P+ R ++  +V+ +Y  A     VP ++DWR+KGAV  +
Sbjct: 102 KFTDLTNDEYRKLYLG-ARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           KDQG CG CWAFS  AA+EGIN I T +L SLSEQELVDCD S  +QGC GGLMD AF+F
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKS-YNQGCNGGLMDYAFQF 219

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           I+ N GL TE  YPY+   G CN    N     I GYEDVP+ +E AL KA++ QPVSVA
Sbjct: 220 IMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVA 279

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           I+A G  FQ Y SG+FTG CGT LDH V AVGYG+ ++G  YW+V+NSWG  WGE GYIR
Sbjct: 280 IEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGS-ENGVDYWIVRNSWGPRWGEEGYIR 338

Query: 324 MQRDIDA-KEGLCGIAMQASYPT 345
           M+R++ A K G CGIA++ASYP 
Sbjct: 339 MERNLAASKSGKCGIAVEASYPV 361


>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
          Length = 380

 Score =  317 bits (813), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 164/340 (48%), Positives = 215/340 (63%), Gaps = 14/340 (4%)

Query: 10  LVLAAILVLGV-WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L  + +L+L + +  ++ ++  ND  +   +E W+ +YG+ Y    E E RF+IFKE + 
Sbjct: 13  LFFSTLLILSLAFNAKNLTQRTNDE-VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLR 71

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---N 125
           +I   +N   N+ YK+G+N+FAD T+EEFR+   G+         S  T VS RYE    
Sbjct: 72  FIDE-HNADTNRSYKVGLNQFADLTDEEFRSTYLGF------TSGSNKTKVSNRYEPRVG 124

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P+ +DWR  GAV  +K QG+CG CWAFSA+A +EGIN I T  L SLSEQEL+DC  
Sbjct: 125 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           +   +GC GG + D F+FII+N G+ TE  YPY A DG CN    N     I  YE+VP 
Sbjct: 185 TQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPY 244

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE AL  AV  QPVSVA+DA+G  F+ YSSG+FTG CGT +DH VT VGYGT + G  Y
Sbjct: 245 NNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-EGGIDY 303

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           W+VKNSW TTWGE GY+R+ R++    G CGIA   SYP 
Sbjct: 304 WIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 342


>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
          Length = 380

 Score =  315 bits (806), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 164/340 (48%), Positives = 214/340 (62%), Gaps = 14/340 (4%)

Query: 10  LVLAAILVLGV-WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L  + +L+L + +  ++ ++  ND  +   +E W+ +YG+ Y    E E RF+IFKE + 
Sbjct: 13  LFFSTLLILSLAFNAKNLTQRTNDE-VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLR 71

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---N 125
           +I   +N   N+ YK+G+N+FAD T+EEFR+        L     S  T VS RYE    
Sbjct: 72  FIDE-HNADTNRSYKVGLNQFADLTDEEFRS------TYLRFTSGSNKTKVSNRYEPRVG 124

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P+ +DWR  GAV  +K QG+CG CWAFSA+A +EGIN I T  L SLSEQEL+DC  
Sbjct: 125 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           +   +GC GG + D F+FII+N G+ TE  YPY A DG CN    N     I  YE+VP 
Sbjct: 185 TQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPY 244

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE AL  AV  QPVSVA+DA+G  F+ YSSG+FTG CGT +DH VT VGYGT + G  Y
Sbjct: 245 NNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGT-EGGIDY 303

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           W+VKNSW TTWGE GY+R+ R++    G CGIA   SYP 
Sbjct: 304 WIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 342


>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
           GN=At4g11310 PE=2 SV=1
          Length = 364

 Score =  311 bits (798), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 163/352 (46%), Positives = 229/352 (65%), Gaps = 16/352 (4%)

Query: 2   AMILLENKLVLAAI-----LVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEK 56
           AM++L   +V+A+      + +  +   +   ++ DA  +   E WM ++G+VY   AEK
Sbjct: 7   AMLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAEK 66

Query: 57  EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSET 116
           E R  IF++N+ +I   N  A N  Y+LG+  FAD +  E++   +G   R P  R+   
Sbjct: 67  ERRLTIFEDNLRFIN--NRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPP--RNHVF 122

Query: 117 TDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLT 173
              S RY+ ++   +P S+DWR +GAVT VKDQG C  CWAFS V A+EG+N I T +L 
Sbjct: 123 MTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELV 182

Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK-EANP 232
           +LSEQ+L++C+   E+ GC GG ++ A+EFI+ N GL T+  YPYKA +G C+ + + N 
Sbjct: 183 TLSEQDLINCNK--ENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENN 240

Query: 233 SAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVT 292
               I GYE++P+N+E+ALMKAVA+QPV+  ID+S  +FQ Y SGVF G CGT L+HGV 
Sbjct: 241 KNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVV 300

Query: 293 AVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            VGYGT ++G  YWLVKNS G TWGE GY++M R+I    GLCGIAM+ASYP
Sbjct: 301 VVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYP 351


>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
           GN=At4g11320 PE=2 SV=1
          Length = 371

 Score =  309 bits (791), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 158/316 (50%), Positives = 210/316 (66%), Gaps = 9/316 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           DA      E WM ++G+VY   AEKE R  IF++N+ +I   N  A N  Y+LG+N FAD
Sbjct: 49  DAEATLMFESWMVKHGKVYDSVAEKERRLTIFEDNLRFIT--NRNAENLSYRLGLNRFAD 106

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-PASIDWRKKGAVTGVKDQGQCG 150
            +  E+    +G   R P      T+   ++  +  V P S+DWR +GAVT VKDQG C 
Sbjct: 107 LSLHEYGEICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCR 166

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS V A+EG+N I T +L +LSEQ+L++C+   E+ GC GG ++ A+EFI++N GL
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMNNGGL 224

Query: 211 ATEAKYPYKASDGSCNK--KEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
            T+  YPYKA +G C    KE N +   I GYE++P+N+EAALMKAVA+QPV+  +D+S 
Sbjct: 225 GTDNDYPYKALNGVCEGRLKEDNKNVM-IDGYENLPANDEAALMKAVAHQPVTAVVDSSS 283

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
            +FQ Y SGVF G CGT L+HGV  VGYGT ++G  YW+VKNS G TWGE GY++M R+I
Sbjct: 284 REFQLYESGVFDGTCGTNLNHGVVVVGYGT-ENGRDYWIVKNSRGDTWGEAGYMKMARNI 342

Query: 329 DAKEGLCGIAMQASYP 344
               GLCGIAM+ASYP
Sbjct: 343 ANPRGLCGIAMRASYP 358


>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
           lycopersicum PE=2 SV=1
          Length = 346

 Score =  285 bits (730), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 134/219 (61%), Positives = 164/219 (74%), Gaps = 2/219 (0%)

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           S+P SIDWR+KG + GVKDQG CG CWAFSAVAAME IN I T  L SLSEQELVDCD S
Sbjct: 17  SLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRS 76

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             ++GC+GGLMD AFEF+I N G+ TE  YPYK  +G C++   N    KI  YEDVP N
Sbjct: 77  -YNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVN 135

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           NE AL KAVA+QPVS+A++A G DFQ Y SG+FTG+CGT +DHGV   GYGT ++G  YW
Sbjct: 136 NEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT-ENGMDYW 194

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +V+NSWG    ENGY+R+QR++ +  GLCG+A++ SYP 
Sbjct: 195 IVRNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPV 233


>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
          Length = 352

 Score =  278 bits (712), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 148/347 (42%), Positives = 206/347 (59%), Gaps = 14/347 (4%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEM 58
           +I L   L++   L    +    +S+  +D T  ER     + WM ++ ++Y    EK  
Sbjct: 10  IIFLATCLIIHMGLSSADFYTVGYSQ--DDLTSIERLIQLFDSWMLKHNKIYESIDEKIY 67

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY-KRRLPSVRSSETT 117
           RF+IF++N+ YI   N K  N  Y LG+N FAD +N+EF+    G+       +   +  
Sbjct: 68  RFEIFRDNLMYIDETNKK--NNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNE 125

Query: 118 DVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           D ++++   + P SIDWR KGAVT VK+QG CG CWAFS +A +EGIN I T  L  LSE
Sbjct: 126 DFTYKHV-TNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSE 184

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDCD      GC+GG    + +++ +N G+ T   YPY+A    C   +      KI
Sbjct: 185 QELVDCDK--HSYGCKGGYQTTSLQYV-ANNGVHTSKVYPYQAKQYKCRATDKPGPKVKI 241

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           +GY+ VPSN E + + A+ANQP+SV ++A G  FQ Y SGVF G CGT+LDH VTAVGYG
Sbjct: 242 TGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYG 301

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T+ DG  Y ++KNSWG  WGE GY+R++R     +G CG+   + YP
Sbjct: 302 TS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
          Length = 339

 Score =  278 bits (710), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 152/320 (47%), Positives = 193/320 (60%), Gaps = 12/320 (3%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQ 92
           + E    +  Q+ + Y +  E+  R KIF EN   IA  N   A+ K  YKLG+N++AD 
Sbjct: 24  IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQC 149
            + EF+   NGY   L  +    T  V   Y    + +VP S+DWR+ GAVTGVKDQG C
Sbjct: 84  LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS+  A+EG +      L SLSEQ LVDC T   + GC GGLMD+AF +I  N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASG 268
           + TE  YPY+  D SC+  +A   A   +G+ D+P  +E  + KAVA   PVSVAIDAS 
Sbjct: 204 IDTEKSYPYEGIDDSCHFNKATIGATD-TGFVDIPEGDEEKMKKAVATMGPVSVAIDASH 262

Query: 269 SDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
             FQ YS GV+   +C  + LDHGV  VGYGT + G  YWLVKNSWGTTWGE GYI+M R
Sbjct: 263 ESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMAR 322

Query: 327 DIDAKEGLCGIAMQASYPTA 346
           + + +   CGIA  +SYPT 
Sbjct: 323 NQNNQ---CGIATASSYPTV 339


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score =  272 bits (695), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 151/327 (46%), Positives = 196/327 (59%), Gaps = 18/327 (5%)

Query: 29  TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGI 86
           +  D  M E H   + ++ + Y+D  E+  R KIF EN   IA  N + A  K  +KL +
Sbjct: 50  SFADVVMEEWHTFKL-EHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAV 108

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR------YENASVPASIDWRKKGAV 140
           N++AD  + EFR   NG+   L   +     D SF+        + ++P S+DWR KGAV
Sbjct: 109 NKYADLLHHEFRQLMNGFNYTLH--KQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAV 166

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           T VKDQG CG CWAFS+  A+EG +   +  L SLSEQ LVDC T   + GC GGLMD+A
Sbjct: 167 TAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 226

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QP 259
           F +I  N G+ TE  YPY+A D SC+  +    A    G+ D+P  +E  + +AVA   P
Sbjct: 227 FRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATD-RGFTDIPQGDEKKMAEAVATVGP 285

Query: 260 VSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           VSVAIDAS   FQFYS GV+   QC  + LDHGV  VG+GT + G  YWLVKNSWGTTWG
Sbjct: 286 VSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWG 345

Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYP 344
           + G+I+M R+   KE  CGIA  +SYP
Sbjct: 346 DKGFIKMLRN---KENQCGIASASSYP 369


>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
          Length = 345

 Score =  271 bits (692), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 154/362 (42%), Positives = 206/362 (56%), Gaps = 36/362 (9%)

Query: 1   MAMILLENKLVLAAI-------LVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRV 49
           MAMI   +KL+  AI       L  G ++   +S+  ND T  ER     E WM ++ ++
Sbjct: 1   MAMIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQ--NDLTSTERLIQLFESWMLKHNKI 58

Query: 50  YRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP 109
           Y++  EK  RF+IFK+N++YI   N K  N  Y LG+N FAD +N+EF+    G      
Sbjct: 59  YKNIDEKIYRFEIFKDNLKYIDETNKK--NNSYWLGLNVFADMSNDEFKEKYTG------ 110

Query: 110 SVRSSETTDVSFRYE------NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEG 163
           S+  + TT     YE      + ++P  +DWR+KGAVT VK+QG CG CWAFSAV  +EG
Sbjct: 111 SIAGNYTT-TELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEG 169

Query: 164 INHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDG 223
           I  I T  L   SEQEL+DCD      GC GG    A + +++  G+     YPY+    
Sbjct: 170 IIKIRTGNLNEYSEQELLDCDR--RSYGCNGGYPWSALQ-LVAQYGIHYRNTYPYEGVQR 226

Query: 224 SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQC 283
            C  +E  P AAK  G   V   NE AL+ ++ANQPVSV ++A+G DFQ Y  G+F G C
Sbjct: 227 YCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPC 286

Query: 284 GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASY 343
           G ++DH V AVGY     G  Y L+KNSWGT WGENGYIR++R      G+CG+   + Y
Sbjct: 287 GNKVDHAVAAVGY-----GPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFY 341

Query: 344 PT 345
           P 
Sbjct: 342 PV 343


>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
          Length = 348

 Score =  270 bits (691), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 158/357 (44%), Positives = 207/357 (57%), Gaps = 23/357 (6%)

Query: 1   MAMILLENKLVLAAILVL-------GVWAPQSWSRTLNDATMNER----HEMWMAQYGRV 49
           MAMI   +KL+  AI +        G ++   +S+  +D T  ER       WM  + + 
Sbjct: 1   MAMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQ--DDLTSTERLIQLFNSWMLNHNKF 58

Query: 50  YRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP 109
           Y +  EK  RF+IFK+N+ YI   N K  N  Y LG+NEFAD +N+EF      Y   L 
Sbjct: 59  YENVDEKLYRFEIFKDNLNYIDETNKK--NNSYWLGLNEFADLSNDEFNEK---YVGSLI 113

Query: 110 SVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHIT 168
                ++ D  F  E+  ++P ++DWRKKGAVT V+ QG CG CWAFSAVA +EGIN I 
Sbjct: 114 DATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIR 173

Query: 169 TRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK 228
           T KL  LSEQELVDC+      GC+GG    A E++  N G+   +KYPYKA  G+C  K
Sbjct: 174 TGKLVELSEQELVDCER--RSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAK 230

Query: 229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
           +      K SG   V  NNE  L+ A+A QPVSV +++ G  FQ Y  G+F G CGT++D
Sbjct: 231 QVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVD 290

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           H VTAVGYG +       L+KNSWGT WGE GYIR++R      G+CG+   + YPT
Sbjct: 291 HAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPT 346


>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
          Length = 221

 Score =  260 bits (665), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 124/218 (56%), Positives = 154/218 (70%), Gaps = 4/218 (1%)

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P SIDWR+ GAV  VK+QG CG CWAFS VAA+EGIN I T  L SLSEQ+LVDC T+ 
Sbjct: 3   LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA- 61

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
            + GC GG M+ AF+FI++N G+ +E  YPY+  DG CN    N     I  YE+VPS+N
Sbjct: 62  -NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNST-VNAPVVSIDSYENVPSHN 119

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E +L KAVANQPVSV +DA+G DFQ Y SG+FTG C    +H +T VGYGT +D   +W+
Sbjct: 120 EQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDFWI 178

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           VKNSWG  WGE+GYIR +R+I+  +G CGI   ASYP 
Sbjct: 179 VKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216


>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
           SV=1
          Length = 323

 Score =  260 bits (664), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 146/310 (47%), Positives = 191/310 (61%), Gaps = 14/310 (4%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
           E +  +YGR Y D  E   R  IF++N +YI  FN K  N    + L +N+F D T EEF
Sbjct: 21  EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
            A   G   R    RS+  +    + E       +DWR KGAVT VKDQGQCG CWAFS 
Sbjct: 81  NAVMKGNIPR----RSAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFST 136

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG + + T  L SL+EQ+LVDC      QGC GG M+DAF++I +N G+ TEA YP
Sbjct: 137 TGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYP 196

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           Y+A DGSC + ++N  AA  SG+ ++ S +E  L +AV +  P+SV IDA+ S FQFYSS
Sbjct: 197 YEARDGSC-RFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSS 255

Query: 277 GV-FTGQCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           GV +   C  + LDH V AVGYG+ + G  +WLVKNSW T+WG+ GYI+M R+   +   
Sbjct: 256 GVYYEPSCSPSYLDHAVLAVGYGS-EGGQDFWLVKNSWATSWGDAGYIKMSRN---RNNN 311

Query: 335 CGIAMQASYP 344
           CGIA  ASYP
Sbjct: 312 CGIATVASYP 321


>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
          Length = 215

 Score =  258 bits (659), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 123/218 (56%), Positives = 155/218 (71%), Gaps = 5/218 (2%)

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P+ +DWR KGAV  +K+Q QCG CWAFSAVAA+E IN I T +L SLSEQELVDCDT+ 
Sbjct: 1   LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA- 59

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
              GC GG M++AF++II+N G+ T+  YPY A  GSC  K        I+G++ V  NN
Sbjct: 60  -SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSC--KPYRLRVVSINGFQRVTRNN 116

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E+AL  AVA+QPVSV ++A+G+ FQ YSSG+FTG CGT  +HGV  VGYGT   G  YW+
Sbjct: 117 ESALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGT-QSGKNYWI 175

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           V+NSWG  WG  GYI M+R++ +  GLCGIA   SYPT
Sbjct: 176 VRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYPT 213


>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
           SV=1
          Length = 321

 Score =  258 bits (658), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 151/318 (47%), Positives = 194/318 (61%), Gaps = 16/318 (5%)

Query: 33  ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFA 90
           AT +   + +  QYGR Y D  E+  R ++F++N + I  FN K  N    +K+ +N+F 
Sbjct: 14  ATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFG 73

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TNEEF A   GYK+      S       F  E   + A +DWR K  VT VKDQ QCG
Sbjct: 74  DMTNEEFNAVMKGYKKG-----SRGEPKAVFTAEAGPMAADVDWRTKALVTPVKDQEQCG 128

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSA  A+EG + +   +L SLSEQ+LVDC T   + GC GG M  AF++I  N G+
Sbjct: 129 SCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI 188

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGS 269
            TE+ YPY+A D SC + +AN   A  +G  +V  + E AL +AV+   P+SVAIDAS  
Sbjct: 189 DTESSYPYEAEDRSC-RFDANSIGAICTGSVEV-QHTEEALQEAVSGVGPISVAIDASHF 246

Query: 270 DFQFYSSGVFTGQ-CG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
            FQFYSSGV+  Q C  T LDHGV AVGYGT +    YWLVKNSWG++WG+ GYI+M R+
Sbjct: 247 SFQFYSSGVYYEQNCSPTFLDHGVLAVGYGT-ESTKDYWLVKNSWGSSWGDAGYIKMSRN 305

Query: 328 IDAKEGLCGIAMQASYPT 345
            D     CGIA + SYPT
Sbjct: 306 RDNN---CGIASEPSYPT 320


>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
          Length = 348

 Score =  256 bits (655), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 150/354 (42%), Positives = 206/354 (58%), Gaps = 19/354 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTL-----NDATMNER----HEMWMAQYGRVYR 51
           MA+I   +KL+  AI + G  +      ++     +D T  ER       WM ++ + Y+
Sbjct: 1   MAIICSFSKLLFVAICLFGHMSLSYCDFSIVGYSQDDLTSTERLIQLFNSWMLKHNKNYK 60

Query: 52  DNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSV 111
           +  EK  RF+IFK+N++YI    NK  N  Y LG+NEF+D +N+EF+     Y   LP  
Sbjct: 61  NVDEKLYRFEIFKDNLKYIDE-RNKMING-YWLGLNEFSDLSNDEFKEK---YVGSLPED 115

Query: 112 RSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
            +++  D  F  E+   +P S+DWR KGAVT VK QG C  CWAFS VA +EGIN I T 
Sbjct: 116 YTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKIKTG 175

Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
            L  LSEQELVDCD   +  GC  G    + +++  N G+   AKYPY A   +C   + 
Sbjct: 176 NLVELSEQELVDCDK--QSYGCNRGYQSTSLQYVAQN-GIHLRAKYPYIAKQQTCRANQV 232

Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
                K +G   V SNNE +L+ A+A+QPVSV ++++G DFQ Y  G+F G CGT++DH 
Sbjct: 233 GGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHA 292

Query: 291 VTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VTAVGYG +       L+KNSWG  WGENGYIR++R      G+CG+   + YP
Sbjct: 293 VTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGNSPGVCGVYRSSYYP 345


>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  254 bits (650), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 192/323 (59%), Gaps = 19/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
           D T +     W + + R+Y  N E+E R  I+++N+  I   N +  N    + + +N F
Sbjct: 22  DQTFSAEWHQWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NGY+ +           +  +     +P S+DWR+KG VT VK+QGQC
Sbjct: 81  GDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-----IPKSVDWREKGCVTPVKNQGQC 135

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA   +EG   + T KL SLSEQ LVDC  +  +QGC GGLMD AF++I  N G
Sbjct: 136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGG 195

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L +E  YPY+A DGSC K  A  + A  +G+ D+P   E ALMKAVA   P+SVA+DAS 
Sbjct: 196 LDSEESYPYEAKDGSC-KYRAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASH 253

Query: 269 SDFQFYSSGV-FTGQCGTE-LDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
              QFYSSG+ +   C ++ LDHGV  VGY   GT  +  KYWLVKNSWG+ WG  GYI+
Sbjct: 254 PSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIK 313

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           + +D D     CG+A  ASYP  
Sbjct: 314 IAKDRDNH---CGLATAASYPVV 333


>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
           GN=At3g43960 PE=2 SV=1
          Length = 376

 Score =  254 bits (650), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 141/321 (43%), Positives = 193/321 (60%), Gaps = 15/321 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           N+  +   +E W+ + G+ Y    EKE RFKIFK+N++ I   N+   N+ Y+ G+N+F+
Sbjct: 33  NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDP-NRSYERGLNKFS 91

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTG-VKDQ 146
           D T +EF+A   G K    S+     +DV+ RY   E   +P  +DWR++GAV   VK Q
Sbjct: 92  DLTADEFQASYLGGKMEKKSL-----SDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQ 146

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G+CG CWAF+A  A+EGIN ITT +L SLSEQEL+DCD   ++ GC GG    AFEFI  
Sbjct: 147 GECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKE 206

Query: 207 NKGLATEAKYPYKASD-GSCNKKEANPS-AAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           N G+ ++  Y Y   D  +C   E   +    I+G+E VP N+E +L KAVA QP+SV I
Sbjct: 207 NGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMI 266

Query: 265 DASGSDFQFYSSGVFTGQCGTEL-DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
            A  ++   Y SGV+ G C     DH V  VGYGT+ D   YWL++NSWG  WGE GY+R
Sbjct: 267 SA--ANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLR 324

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           +QR+     G C +A+   YP
Sbjct: 325 LQRNFHEPTGKCAVAVAPVYP 345


>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
          Length = 333

 Score =  253 bits (646), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 153/347 (44%), Positives = 200/347 (57%), Gaps = 23/347 (6%)

Query: 8   NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           N   + A L LG+    S + T N  ++  +   W A + R+Y  N E+  R  ++++N+
Sbjct: 2   NPTFILAALCLGI---ASATLTFNH-SLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNM 56

Query: 68  EYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
           + I   N +       + + +N F D T+EEFR   NG++ R P  R  +       YE 
Sbjct: 57  KMIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKP--RKGKVFQEPLFYE- 113

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
              P S+DWR+KG VT VK+QGQCG CWAFSA  A+EG     T KL SLSEQ LVDC  
Sbjct: 114 --APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              ++GC GGLMD AF+++  N GL +E  YPY+A++ SC K     S A  +G+ D+P 
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESC-KYNPEYSVANDTGFVDIP- 229

Query: 246 NNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYG---TA 299
             E ALMKAVA   P+SVAIDA    F FY  G+ F   C +E +DHGV  VGYG   T 
Sbjct: 230 KQEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            D +KYWLVKNSWG  WG  GYI+M +D   +   CGIA  ASYPT 
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKD---RRNHCGIASAASYPTV 333


>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  253 bits (645), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/321 (45%), Positives = 192/321 (59%), Gaps = 19/321 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
           D T N +   W + + R+Y  N E+E R  ++++N+  I   N +  N    + + +N F
Sbjct: 22  DQTFNAQWHQWKSTHRRLYGTN-EEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NGY+ +           +  +     +P ++DWR+KG VT VK+QGQC
Sbjct: 81  GDMTNEEFRQIVNGYRHQKHKKGRLFQEPLMLQ-----IPKTVDWREKGCVTPVKNQGQC 135

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA   +EG   + T KL SLSEQ LVDC     +QGC GGLMD AF++I  N G
Sbjct: 136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGG 195

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L +E  YPY+A DGSC K  A  + A  +G+ D+P   E ALMKAVA   P+SVA+DAS 
Sbjct: 196 LDSEESYPYEAKDGSC-KYRAEYAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASH 253

Query: 269 SDFQFYSSGV-FTGQCGT-ELDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
              QFYSSG+ +   C + +LDHGV  VGY   GT  +  KYWLVKNSWG  WG +GYI+
Sbjct: 254 PSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIK 313

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           + +D   +   CG+A  ASYP
Sbjct: 314 IAKD---RNNHCGLATAASYP 331


>sp|P07711|CATL1_HUMAN Cathepsin L1 OS=Homo sapiens GN=CTSL1 PE=1 SV=2
          Length = 333

 Score =  251 bits (642), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 152/348 (43%), Positives = 200/348 (57%), Gaps = 25/348 (7%)

Query: 8   NKLVLAAILVLGVWAPQSWSRTLN-DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           N  ++ A   LG+      S TL  D ++  +   W A + R+Y  N E+  R  ++++N
Sbjct: 2   NPTLILAAFCLGIA-----SATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKN 55

Query: 67  VEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
           ++ I   N + R     + + +N F D T+EEFR   NG++ R P  R  +       YE
Sbjct: 56  MKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKP--RKGKVFQEPLFYE 113

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
               P S+DWR+KG VT VK+QGQCG CWAFSA  A+EG     T +L SLSEQ LVDC 
Sbjct: 114 ---APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCS 170

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               ++GC GGLMD AF+++  N GL +E  YPY+A++ SC K     S A  +G+ D+P
Sbjct: 171 GPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIP 229

Query: 245 SNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYG---T 298
              E ALMKAVA   P+SVAIDA    F FY  G+ F   C +E +DHGV  VGYG   T
Sbjct: 230 K-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFEST 288

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
             D  KYWLVKNSWG  WG  GY++M +D   +   CGIA  ASYPT 
Sbjct: 289 ESDNNKYWLVKNSWGEEWGMGGYVKMAKD---RRNHCGIASAASYPTV 333


>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
          Length = 334

 Score =  251 bits (641), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 154/350 (44%), Positives = 202/350 (57%), Gaps = 28/350 (8%)

Query: 8   NKLVLAAILVLGVW--APQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           N      +L LGV   AP+       D  ++     W A + R+Y  N E+E R  ++++
Sbjct: 2   NPSFFLTVLCLGVASAAPKL------DPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEK 54

Query: 66  NVEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           N + I   N +       +++ +N F D TNEEFR   NG++ +       +   +    
Sbjct: 55  NKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQ-----KHKKGKLFHEP 109

Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
               VP S+DW KKG VT VK+QGQCG CWAFSA  A+EG     T KL SLSEQ LVDC
Sbjct: 110 LLVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD-GSCNKKEANPSAAKISGYED 242
             +  +QGC GGLMD+AF++I  N GL +E  YPY A+D  SCN K    SAA  +G+ D
Sbjct: 170 SRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYK-PECSAANDTGFVD 228

Query: 243 VPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGY--- 296
           +P   E ALMKAVA   P+SVAIDA  + FQFY SG+ +   C + +LDHGV  VGY   
Sbjct: 229 IPQ-REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFE 287

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           GT  +  K+W+VKNSWG  WG NGY++M +D   +   CGIA  ASYPT 
Sbjct: 288 GTDSNNNKFWIVKNSWGPEWGWNGYVKMAKD---QNNHCGIATAASYPTV 334


>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
          Length = 340

 Score =  248 bits (633), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 154/343 (44%), Positives = 205/343 (59%), Gaps = 22/343 (6%)

Query: 13  AAILVLGVWAPQSWSRTL----NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           AAI  L  W P   S  +     D T++   ++W   + + Y+D  E+E+R  I+++N++
Sbjct: 7   AAIRWL-FWMPLVCSVAMEQLQRDPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLK 65

Query: 69  YIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR-YEN 125
           +I   N +       Y++G+N+  D TNEE    R G   R+P  R S  T V+FR Y N
Sbjct: 66  FIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC-RMG-ALRIP--RQSPKT-VTFRSYSN 120

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
            ++P ++DWR+KG VT VK QG CG CWAFSAV A+EG   + T KL SLS Q LVDC  
Sbjct: 121 RTLPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSN 180

Query: 186 SGE--DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
             +  ++GC GG M +AF++II N G+  +A YPYKA+D  C+    N  AA  S Y  +
Sbjct: 181 EEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKATDEKCHYNSKN-RAATCSRYIQL 239

Query: 244 PSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADD 301
           P  +E AL +AVA + PVSV IDAS S F FY SGV+    C   ++HGV  VGYGT  D
Sbjct: 240 PFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL-D 298

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  YWLVKNSWG  +G+ GYIRM R+    +  CGIA   SYP
Sbjct: 299 GKDYWLVKNSWGLNFGDQGYIRMARN---NKNHCGIASYCSYP 338


>sp|Q5E998|CATL2_BOVIN Cathepsin L2 OS=Bos taurus GN=CTSL2 PE=2 SV=1
          Length = 334

 Score =  248 bits (632), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 153/350 (43%), Positives = 201/350 (57%), Gaps = 28/350 (8%)

Query: 8   NKLVLAAILVLGVW--APQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           N      +L LGV   AP+       D  ++     W A + R+Y  N E+E R  ++++
Sbjct: 2   NPSFFLTVLCLGVASAAPKL------DPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEK 54

Query: 66  NVEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           N + I   N +       +++ +N F D TNEEFR   NG++ +       +   +    
Sbjct: 55  NKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQ-----KHKKGKLFHEP 109

Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
               VP S+DW KKG VT VK+QGQCG CWAFSA  A+EG     T KL SLSEQ LVDC
Sbjct: 110 LLVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD-GSCNKKEANPSAAKISGYED 242
             +  +QGC GGLMD+AF++I  N  L +E  YPY A+D  SCN K    SAA  +G+ D
Sbjct: 170 SRAQGNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYK-PECSAANDTGFVD 228

Query: 243 VPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGY--- 296
           +P   E ALMKAVA   P+SVAIDA  + FQFY SG+ +   C + +LDHGV  VGY   
Sbjct: 229 IPQ-REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFE 287

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           GT  +  K+W+VKNSWG  WG NGY++M +D   +   CGIA  ASYPT 
Sbjct: 288 GTDSNNNKFWIVKNSWGPEWGWNGYVKMAKD---QNNHCGIATAASYPTV 334


>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
          Length = 334

 Score =  248 bits (632), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 142/313 (45%), Positives = 189/313 (60%), Gaps = 18/313 (5%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W A +GR+Y  N E+  R  ++++N++ I   N +       + + +N F D TNEEFR 
Sbjct: 32  WKATHGRLYGMN-EEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQ 90

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
             NG++ +    +  +    S   E   VP S+DWR+KG VT VK+QGQCG CWAFSA  
Sbjct: 91  VMNGFQNQ--KHKKGKVFHESLVLE---VPKSVDWREKGYVTAVKNQGQCGSCWAFSATG 145

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EG     T KL SLSEQ LVDC     +QGC GGLMD+AF+++  N GL TE  YPY 
Sbjct: 146 ALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYL 205

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV 278
             + +    +   SAA  +G+ D+P   E ALMKAVA   P+SVAIDA  S FQFY SG+
Sbjct: 206 GRETNSCTYKPECSAANDTGFVDIPQ-REKALMKAVATVGPISVAIDAGHSSFQFYKSGI 264

Query: 279 FTG-QCGT-ELDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           +    C + +LDHGV  VGY   GT  + +K+W+VKNSWG  WG NGY++M +D   +  
Sbjct: 265 YYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKD---QNN 321

Query: 334 LCGIAMQASYPTA 346
            CGI+  ASYPT 
Sbjct: 322 HCGISTAASYPTV 334


>sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium discoideum GN=cprB PE=2 SV=1
          Length = 376

 Score =  246 bits (628), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 141/343 (41%), Positives = 194/343 (56%), Gaps = 43/343 (12%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W  ++ R Y  ++E   R+ IFK N++Y+ ++N+K  ++   LG+N FAD TNEE+R   
Sbjct: 39  WTLKFNRQY-SSSEFSNRYSIFKSNMDYVDNWNSKGDSQTV-LGLNNFADITNEEYRKTY 96

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
            G +    S    +  +V    +  + P SIDWR K AVT +KDQGQCG CW+FS   + 
Sbjct: 97  LGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGST 156

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG + + T+KL SLSEQ LVDC    E+ GC+GGLM++AF++II NKG+ TE+ YPY A 
Sbjct: 157 EGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAE 216

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV-FT 280
            GS      +   A I GY ++ + +E +L     + PVSVAIDAS + FQ Y+SG+ + 
Sbjct: 217 TGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYE 276

Query: 281 GQCG-TELDHGVTAVGYGTA---DDG---------------------------------T 303
            +C  TELDHGV  VGYG     D+G                                  
Sbjct: 277 PKCSPTELDHGVLVVGYGVQGKDDEGPVLNRKQTIVIHKNEDNKVESSDDSSDSVRPKAN 336

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            YW+VKNSWGT+WG  GYI M +D   ++  CGIA  +SYP A
Sbjct: 337 NYWIVKNSWGTSWGIKGYILMSKD---RKNNCGIASVSSYPLA 376


>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
          Length = 337

 Score =  246 bits (627), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 138/316 (43%), Positives = 195/316 (61%), Gaps = 26/316 (8%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR--- 98
           WM    + Y  + E   R++ FK+N++Y+ ++N+K       LG+N+ AD +NEE+R   
Sbjct: 37  WMRSNNKAYT-HKEFMPRYEEFKKNMDYVHNWNSKGSKTV--LGLNQHADLSNEEYRLNY 93

Query: 99  ------APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCC 152
                    NGY +R   +R +      F+      P ++DWR+K AVT VKDQGQCG C
Sbjct: 94  LGTRAHIKLNGYHKRNLGLRLNRP---QFKQ-----PLNVDWREKDAVTPVKDQGQCGSC 145

Query: 153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
           ++FS   ++EG+  I T KL SLSEQ ++DC +S  ++GC GGLM +AFE+II N GL +
Sbjct: 146 YSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNS 205

Query: 213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
           E +YPY+       K +    AAKI+ Y+++ + +E  L  A+   PVSVAIDAS + FQ
Sbjct: 206 EEQYPYEMKVNDECKFQEGSVAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQ 265

Query: 273 FYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
            Y++GV +   C +E LDHGV AVG GT D+G  Y++VKNSWG +WG NGYI M R+   
Sbjct: 266 LYTAGVYYEPACSSEDLDHGVLAVGMGT-DNGEDYYIVKNSWGPSWGLNGYIHMARN--- 321

Query: 331 KEGLCGIAMQASYPTA 346
           K+  CGI+  ASYP A
Sbjct: 322 KDNNCGISTMASYPIA 337


>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
          Length = 221

 Score =  245 bits (626), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 120/217 (55%), Positives = 150/217 (69%), Gaps = 4/217 (1%)

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P SIDWR+KGAV  VK+QG CG CWAF A+AA+EGIN I T  L SLSEQ+LVDC T  
Sbjct: 3   LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCST-- 60

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
            + GCEGG    AF++II+N G+ +E  YPY  ++G+C+ KE N     I  Y +VPSN+
Sbjct: 61  RNHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCDTKE-NAHVVSIDSYRNVPSND 119

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E +L KAVANQPVSV +DA+G DFQ Y +G+FTG C    +H  T  G  T +D   YW 
Sbjct: 120 EKSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGGRETEND-KDYWT 178

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VKNSWG  WGE+GYIR++R+I    G CGIA+  SYP
Sbjct: 179 VKNSWGKNWGESGYIRVERNIAESSGKCGIAISPSYP 215


>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
          Length = 333

 Score =  244 bits (624), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 192/323 (59%), Gaps = 19/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
           D ++N +   W A + R+Y  N E+  R  ++++N++ I   N +       + + +N F
Sbjct: 22  DQSLNAQWYQWKATHRRLYGMN-EEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NG++ +    +     +  F    A +P S+DWR+KG VT VK+QGQC
Sbjct: 81  GDMTNEEFRQVMNGFQNQ-KHKKGKMFQEPLF----AEIPKSVDWREKGYVTPVKNQGQC 135

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA  A+EG     T KL SLSEQ LVDC  +  ++GC GGLMD+AF ++  N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGG 195

Query: 210 LATEAKYPYKASDG-SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
           L +E  YPY   D  +CN K    SAA  +G+ D+P   E ALMKAVA   P+SVAIDA 
Sbjct: 196 LDSEESYPYLGRDTETCNYK-PECSAANDTGFVDLPQ-REKALMKAVATLGPISVAIDAG 253

Query: 268 GSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYG--TADDGTKYWLVKNSWGTTWGENGYIR 323
              FQFY SG+ F   C + +LDHGV  VGYG    D   K+W+VKNSWG  WG NGY++
Sbjct: 254 HQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVK 313

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           M +D   +   CGIA  ASYPT 
Sbjct: 314 MAKD---QNNHCGIATAASYPTV 333


>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
          Length = 344

 Score =  244 bits (623), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 144/323 (44%), Positives = 189/323 (58%), Gaps = 33/323 (10%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           WM  + + Y    E   R+ IFK N++Y+  +N+K       LG+N FAD TNEE+R   
Sbjct: 33  WMITHQKSYTSE-EFGARYNIFKANMDYVQQWNSKGSETV--LGLNNFADITNEEYRNTY 89

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
            G K    S+  ++   V       S  AS DWR +GAVT VK+QGQCG CW+FS   + 
Sbjct: 90  LGTKFDASSLIGTQEEKVF----TTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGST 145

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG +  +  +L SLSEQ L+DC T  E+ GC+GGLM  AFE+II+N G+ TE+ YPYKA 
Sbjct: 146 EGAHFQSKGELVSLSEQNLIDCST--ENSGCDGGLMTYAFEYIINNNGIDTESSYPYKAE 203

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV-FT 280
           +G C  K  N S A +S Y+ V + +E++L  AV   PVSVAIDAS   FQ Y+SG+ + 
Sbjct: 204 NGKCEYKSEN-SGATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYE 262

Query: 281 GQCGTE-LDHGVTAVGY------------------GTADDGTKYWLVKNSWGTTWGENGY 321
            +C +E LDHGV AVGY                   +A    +YW+VKNSWGT+WG  GY
Sbjct: 263 PECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGY 322

Query: 322 IRMQRDIDAKEGLCGIAMQASYP 344
           I M R+ D     CGIA  AS+P
Sbjct: 323 ILMSRNRDNN---CGIASSASFP 342


>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
           SV=2
          Length = 322

 Score =  244 bits (623), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 140/319 (43%), Positives = 188/319 (58%), Gaps = 17/319 (5%)

Query: 33  ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFA 90
           A  N   E +  ++GR Y D  E+  R  +F +N++YI  FN K       Y L IN+F+
Sbjct: 14  AAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFS 73

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TNE+F A   GYK+         +TD       A     +DWR KGAVT VKDQGQCG
Sbjct: 74  DMTNEKFNAVMKGYKKGPRPAAVFTSTDA------APESTEVDWRTKGAVTPVKDQGQCG 127

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDC-DTSGEDQGCEGGLMDDAFEFIISNKG 209
            CWAFS    +EG + + T +L SLSEQ+LVDC   S  +QGC GG ++ A  ++  N G
Sbjct: 128 SCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGG 187

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           + TE+ YPY+A D +C +  +N   A  +GY  +   +E+AL  A  +  P+SVAIDAS 
Sbjct: 188 VDTESSYPYEARDNTC-RFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASH 246

Query: 269 SDFQFYSSGV-FTGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
             FQ Y +GV +   C  ++LDH V AVGYG+ + G  +WLVKNSW T+WGE+GYI+M R
Sbjct: 247 RSFQSYYTGVYYEPSCSSSQLDHAVLAVGYGS-EGGQDFWLVKNSWATSWGESGYIKMAR 305

Query: 327 DIDAKEGLCGIAMQASYPT 345
           +   +   CGIA  A YPT
Sbjct: 306 N---RNNNCGIATDACYPT 321


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.315    0.130    0.392 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 129,859,603
Number of Sequences: 539616
Number of extensions: 5439372
Number of successful extensions: 12950
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 224
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 11911
Number of HSP's gapped (non-prelim): 276
length of query: 346
length of database: 191,569,459
effective HSP length: 118
effective length of query: 228
effective length of database: 127,894,771
effective search space: 29160007788
effective search space used: 29160007788
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (28.5 bits)