BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 018958
         (348 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
           SV=1
          Length = 355

 Score =  312 bits (799), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 148/314 (47%), Positives = 215/314 (68%), Gaps = 9/314 (2%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T+   ++E+ E WM++H ++YK   EK  R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42  TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLT++EF+  Y G   P  S +   S+ F+Y+++  TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T  N+GC GG  + AF YII   G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218

Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
             ED+YPY    G C   ++      IS YE+VP  D+++L+KA++ QPVS+AI A   +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ YK G+FNG CGT LDH V  VG+G+++ G++Y ++KNSWG  WG+ G++++ R+   
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGK 337

Query: 334 -EGLCGIGTRSSYP 346
            EGLCGI   +SYP
Sbjct: 338 PEGLCGINKMASYP 351


>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
           GN=CEP2 PE=2 SV=1
          Length = 361

 Score =  308 bits (789), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 153/342 (44%), Positives = 217/342 (63%), Gaps = 19/342 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHG--RSYKDELEKEMRLKIFKENL 75
           +F ++ L  +C           E+ +  ++++W + H   RS     E+E R  +F+ N+
Sbjct: 9   LFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPRSLN---EREKRFNVFRHNV 65

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQN 130
            ++   NK+ NR+YKL  N+F+DLT +EF+  YTG     ++M     R +    + ++N
Sbjct: 66  MHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHEN 124

Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
           LS   +P+S+DWR KGAVT IKNQ +CG CWAF+ VAAVEGI KI++  L+ LSEQ+L+D
Sbjct: 125 LSK--LPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182

Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEV 249
           C T  N GC GG  E AF +I +N GI TED YPY+ + G C A++       I  +E+V
Sbjct: 183 CDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDV 242

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
           P  DE ALLKAV+ QPVS+AI A S++FQ Y EG+F G CGT+L+H V  VG+G +E G 
Sbjct: 243 PENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYG-SERGK 301

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            YW+++NSWG  WG+ GY+KI R+    EG CGI   +SYP+
Sbjct: 302 KYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPI 343


>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
          Length = 345

 Score =  308 bits (788), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 148/333 (44%), Positives = 211/333 (63%), Gaps = 10/333 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L V  AS   +S       +++  E+WMA++GR YKD  EK +R +IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N     +Y LG NQF+D+TN+EF A YTG  +P    R    S   + ++ ++ VP
Sbjct: 68  IETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVS---FDDVDISSVP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
            S+DWRD GAVT +KNQ  CG CWAFA++A VE I KI+ GNL+ LSEQQ+LDC+   + 
Sbjct: 125 QSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAV--SY 182

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG   KA+++II N+G+A+   YPY+A  GTC     P +A I+ Y  V   +E+ +
Sbjct: 183 GCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNSAYITRYTYVQRNNERNM 242

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
           + AVS QP++ A+ A S  FQ YK G+F G CGT+L+HA+ I+G+G    G  +W+++NS
Sbjct: 243 MYAVSNQPIAAALDA-SGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNS 301

Query: 318 WGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           WG  WG+ GY+++ RD     GLCGI     YP
Sbjct: 302 WGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
           SV=1
          Length = 462

 Score =  307 bits (787), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 150/323 (46%), Positives = 216/323 (66%), Gaps = 12/323 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           V ++    E  V+ I+E W+ +HG+  S    +EK+ R +IFK+NL ++++ N E N +Y
Sbjct: 35  VSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSY 93

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
           +LG  +F+DLTNDE+R+ Y G KM     R T+    +Y+     ++P S+DWR KGAV 
Sbjct: 94  RLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVA 150

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
            +K+Q  CG CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF 
Sbjct: 151 EVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFE 210

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           +II+N GI T+ +YPY+ V GTC   +K A    I +YE+VP+  E++L KAV+ QP+SI
Sbjct: 211 FIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISI 270

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A    FQ Y  GIF+G CGTQLDH V  VG+G TE+G +YW+++NSWG +WG++GY+
Sbjct: 271 AIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGKSWGESGYL 329

Query: 329 KIVRD----EGLCGIGTRSSYPL 347
           ++ R+     G CGI    SYP+
Sbjct: 330 RMARNIASSSGKCGIAIEPSYPI 352


>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
          Length = 351

 Score =  305 bits (782), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 146/333 (43%), Positives = 212/333 (63%), Gaps = 10/333 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L    AS   +SR      +++  E+WMA++GR YKD+ EK  R +IFK N+++
Sbjct: 8   VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N     +Y LG NQF+D+T  EF A YTG  +P    R    S   + +++++ VP
Sbjct: 68  IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVS---FDDVNISAVP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
            S+DWRD GAV  +KNQ  CG CW+FAA+A VEGI KI++G L+ LSEQ++LDC+   + 
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV--SY 182

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG   KA+ +II N G+ TE+ YPY A  GTC+A   P +A I+ Y  V   DE+++
Sbjct: 183 GCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAYITGYSYVRRNDERSM 242

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
           + AVS QP++  I A S  FQ Y  G+F+G CGT L+HA+TI+G+G    G  YW+++NS
Sbjct: 243 MYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNS 301

Query: 318 WGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
           WG++WG+ GY+++ R      G+CGI     +P
Sbjct: 302 WGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
          Length = 362

 Score =  305 bits (780), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 155/344 (45%), Positives = 211/344 (61%), Gaps = 14/344 (4%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           T   + + L  S    V +S   H      E+S+ +++E+W + H  S +   EK  R  
Sbjct: 3   TKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRFN 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKY 128
           +FK NL ++   NK  ++ YKL  N+F+D+TN EFR+ Y G K+  P   R T      +
Sbjct: 62  VFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAF 120

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
               +  VP S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++  L+ LSEQ+L
Sbjct: 121 MYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQEL 180

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYE 247
           +DC    N GC GG  E AF +I Q  GI TE  YPY+A  GTC A++    A  I  +E
Sbjct: 181 VDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHE 240

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
            VP+ DE ALLKAV+ QPVS+AI A  ++FQ Y EG+F G C T L+H V IVG+GTT D
Sbjct: 241 NVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD 300

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G NYW+++NSWG  WG+ GY+++ R+    EGLCGI    SYP+
Sbjct: 301 GTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 344


>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
          Length = 373

 Score =  302 bits (773), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 140/317 (44%), Positives = 202/317 (63%), Gaps = 11/317 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++ +++E+W + H R  +   EK  R   FK N  +I   NK G+  Y+L  N+F D+
Sbjct: 39  EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97

Query: 100 TNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
              EFRA + G  +  +PS +  +   F Y  L+++D+P S+DWR KGAVT +K+Q +CG
Sbjct: 98  DQAEFRATFVGDLRRDTPS-KPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCG 156

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
            CWAF+ V +VEGI  IR+G+L+ LSEQ+L+DC T  N+GC GG  + AF YI  N G+ 
Sbjct: 157 SCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216

Query: 219 TEDEYPYQAVPGTCSAAQ----KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           TE  YPY+A  GTC+ A+     P    I  +++VP+  E+ L +AV+ QPVS+A+ A  
Sbjct: 217 TEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASG 276

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
             F  Y EG+F G CGT+LDH V +VG+G  EDG  YW +KNSWG +WG+ GY+++ +D 
Sbjct: 277 KAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDS 336

Query: 335 ----GLCGIGTRSSYPL 347
               GLCGI   +SYP+
Sbjct: 337 GASGGLCGIAMEASYPV 353


>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
          Length = 371

 Score =  301 bits (772), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 138/316 (43%), Positives = 198/316 (62%), Gaps = 9/316 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++ +++E+W + H R  +   EK  R   FK N  +I   NK G+  Y+L  N+F D+
Sbjct: 39  EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
              EFRA + G        +  +   F Y  L+++D+P S+DWR KGAVT +K+Q +CG 
Sbjct: 98  DQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGS 157

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ V +VEGI  IR+G+L+ LSEQ+L+DC T  N+GC GG  + AF YI  N G+ T
Sbjct: 158 CWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLIT 217

Query: 220 EDEYPYQAVPGTCSAAQ----KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           E  YPY+A  GTC+ A+     P    I  +++VP+  E+ L +AV+ QPVS+A+ A   
Sbjct: 218 EAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGK 277

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 334
            F  Y EG+F G CGT+LDH V +VG+G  EDG  YW +KNSWG +WG+ GY+++ +D  
Sbjct: 278 AFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337

Query: 335 ---GLCGIGTRSSYPL 347
              GLCGI   +SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353


>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
          Length = 360

 Score =  301 bits (771), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 150/312 (48%), Positives = 203/312 (65%), Gaps = 16/312 (5%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           ++E+W + H  S +   EK+ R  +FK N  ++  ANK  ++ YKL  N+F+D+TN EFR
Sbjct: 37  LYERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFR 94

Query: 106 ALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
             Y+G K+    HR        + TF Y+ +    VP S+DWR KGAVT +K+Q +CG C
Sbjct: 95  NTYSGSKVKH--HRMFRGGPRGNGTFMYEKVDT--VPASVDWRKKGAVTSVKDQGQCGSC 150

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
           WAF+ + AVEGI +I++  L+ LSEQ+L+DC T+ N GC GG  + AF +I Q  GI TE
Sbjct: 151 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTE 210

Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
             YPY+A  GTC  +++ A A  I  +E VP  DE ALLKAV+ QPVS+AI A  ++FQ 
Sbjct: 211 ANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQF 270

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEG 335
           Y EG+F G CGT+LDH V IVG+GTT DG  YW +KNSWG  WG+ GY+++ R     EG
Sbjct: 271 YSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEG 330

Query: 336 LCGIGTRSSYPL 347
           LCGI   +SYP+
Sbjct: 331 LCGIAMEASYPI 342


>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
           GN=CEP1 PE=2 SV=1
          Length = 361

 Score =  299 bits (765), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 151/345 (43%), Positives = 216/345 (62%), Gaps = 22/345 (6%)

Query: 19  FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           FI++ L +    +       H      E S+ E++E+W + H  +   E EK  R  +FK
Sbjct: 4   FIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFNVFK 62

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFK 127
            N+++I + NK+ +++YKL  N+F D+T++EFR  Y G     ++M     ++T S  F 
Sbjct: 63  HNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKS--FM 119

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y N++   +PTS+DWR  GAVTP+KNQ +CG CWAF+ V AVEGI +IR+  L  LSEQ+
Sbjct: 120 YANVNT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNY 246
           L+DC TN N GC GG  + AF +I +  G+ +E  YPY+A   TC   ++ A    I  +
Sbjct: 178 LVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGH 237

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           E+VP   E  L+KAV+ QPVS+AI A  ++FQ Y EG+F G CGT+L+H V +VG+GTT 
Sbjct: 238 EDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTI 297

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
           DG  YW++KNSWG  WG+ GY+++ R     EGLCGI   +SYPL
Sbjct: 298 DGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342


>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
           GN=At4g11320 PE=2 SV=1
          Length = 371

 Score =  298 bits (763), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 150/353 (42%), Positives = 220/353 (62%), Gaps = 26/353 (7%)

Query: 18  MFIIITLLVSCAS----QVVSSRSTHE--------QSVVE-----IHEKWMAQHGRSYKD 60
           +F++  ++ SCA+     VVSS   H         Q + +     + E WM +HG+ Y  
Sbjct: 10  IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDS 69

Query: 61  ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
             EKE RL IF++NL +I   N E N +Y+LG N+F+DL+  E+  +  G     P +  
Sbjct: 70  VAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV 128

Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
             +S+ +Y+      +P S+DWR++GAVT +K+Q  C  CWAF+ V AVEG+ KI +G L
Sbjct: 129 FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGEL 188

Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-- 238
           + LSEQ L++C+   NNGC GG  E A+ +I+ N G+ T+++YPY+A+ G C    K   
Sbjct: 189 VTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDN 247

Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
               I  YE +P+ DE AL+KAV+ QPV+  + + S EFQ Y+ G+F+G CGT L+H V 
Sbjct: 248 KNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVV 307

Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           +VG+G TE+G +YW++KNS G+TWG+AGYMK+ R+     GLCGI  R+SYPL
Sbjct: 308 VVGYG-TENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
          Length = 362

 Score =  297 bits (761), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 149/318 (46%), Positives = 203/318 (63%), Gaps = 16/318 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+ +++E+W + H  S +   EK  R  +FK N+ ++   NK  ++ YKL  N+F+D+
Sbjct: 33  EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90

Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           TN EFR+ Y G     +KM   S     S TF Y+ +    VP S+DWR KGAVT +K+Q
Sbjct: 91  TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ + AVEGI +I++  L+ LSEQ+L+DC    N GC GG  E AF +I Q 
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206

Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI TE  YPY A  GTC  ++    A  I  +E VP  DE ALLKAV+ QPVS+AI A 
Sbjct: 207 GGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
            ++FQ Y EG+F G C T L+H V IVG+GTT DG NYW+++NSWG  WG+ GY+++ R+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326

Query: 334 ----EGLCGIGTRSSYPL 347
               EGLCGI   +SYP+
Sbjct: 327 ISKKEGLCGIAMMASYPI 344


>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
           GN=At4g11310 PE=2 SV=1
          Length = 364

 Score =  297 bits (761), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 152/350 (43%), Positives = 221/350 (63%), Gaps = 21/350 (6%)

Query: 16  TPMFIIITLLV--SCASQVVSSRSTHE-----QSVVE-----IHEKWMAQHGRSYKDELE 63
           + M I++  +V  SCA+ +  S  +++      SV +     I E WM +HG+ Y    E
Sbjct: 6   SAMLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAE 65

Query: 64  KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS 123
           KE RL IF++NL +I   N E N +Y+LG   F+DL+  E++ +  G     P +    +
Sbjct: 66  KERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMT 124

Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
           S+ +Y+  +   +P S+DWR++GAVT +K+Q  C  CWAF+ V AVEG+ KI +G L+ L
Sbjct: 125 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 184

Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP--AAA 241
           SEQ L++C+   NNGC GG  E A+ +I++N G+ T+++YPY+AV G C    K      
Sbjct: 185 SEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNV 243

Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
            I  YE +P+ DE AL+KAV+ QPV+  I + S EFQ Y+ G+F+G CGT L+H V +VG
Sbjct: 244 MIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVG 303

Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           +G TE+G +YWL+KNS G TWG+AGYMK+ R+     GLCGI  R+SYPL
Sbjct: 304 YG-TENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352


>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
          Length = 380

 Score =  296 bits (757), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 154/340 (45%), Positives = 212/340 (62%), Gaps = 10/340 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++ + +F    L++S A    +        V  ++E W+ ++G+SY    E E R +IFK
Sbjct: 8   VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+   S S+++  S+  +Y+   
Sbjct: 68  ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P+ +DWR  GAV  IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC 
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
            T    GC GG     F +II N GI TE+ YPY A  G C+   Q      I  YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVP 243

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             +E AL  AV+ QPVS+A+ A    F+ Y  GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSW  TWG+ GYM+I+R+    G CGI T  SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
           GN=CEP3 PE=2 SV=1
          Length = 364

 Score =  295 bits (754), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 148/318 (46%), Positives = 205/318 (64%), Gaps = 17/318 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E++V +++E+W   H  S +   E   R  +F+ N+ ++ + NK+ N+ YKL  N+F+D+
Sbjct: 31  EENVWKLYERWRGHHSVS-RASHEAIKRFNVFRHNVLHVHRTNKK-NKPYKLKINRFADI 88

Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           T+ EFR+ Y G     ++M     R   S  F Y+N+  T VP+S+DWR+KGAVT +KNQ
Sbjct: 89  THHEFRSSYAGSNVKHHRMLRGPKRG--SGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 144

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
           ++CG CWAF+ VAAVEGI KIR+  L+ LSEQ+L+DC T  N GC GG  E AF +I  N
Sbjct: 145 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 204

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA--AKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
            GI TE+ YPY +       A         I  +E VP  DE+ LLKAV+ QPVS+AI A
Sbjct: 205 GGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDA 264

Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
            S++FQ Y EG+F G CGTQL+H V IVG+G T++G  YW+++NSWG  WG+ GY++I R
Sbjct: 265 GSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIER 324

Query: 333 ----DEGLCGIGTRSSYP 346
               +EG CGI   +SYP
Sbjct: 325 GISENEGRCGIAMEASYP 342


>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
          Length = 380

 Score =  294 bits (752), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 153/340 (45%), Positives = 211/340 (62%), Gaps = 10/340 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++ + +F    L++S A    +        V  ++E W+ ++G+SY    E E R +IFK
Sbjct: 8   VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y   +  S S+++  S+  +Y+   
Sbjct: 68  ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYL--RFTSGSNKTKVSN--RYEPRV 123

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P+ +DWR  GAV  IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC 
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
            T    GC GG     F +II N GI TE+ YPY A  G C+   Q      I  YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVP 243

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             +E AL  AV+ QPVS+A+ A    F+ Y  GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYG-TEGGID 302

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSW  TWG+ GYM+I+R+    G CGI T  SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
          Length = 360

 Score =  292 bits (747), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 150/349 (42%), Positives = 219/349 (62%), Gaps = 25/349 (7%)

Query: 17  PMFIIITLL----VSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
           P FI + L+    +S A  +  +      E S+  ++EKW   H  + +D  EK  R  +
Sbjct: 4   PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVA-RDLDEKNRRFNV 62

Query: 71  FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS-----TTSST 125
           FKEN+++I + N++ +  YKL  N+F D+TN EFR+ Y G K+    HRS       + +
Sbjct: 63  FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQH--HRSQRGIQKNTGS 120

Query: 126 FKYQNLSMTDVPT-SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
           F Y+N+    +P  S+DWR KGAVT +K+Q +CG CWAF+ +A+VEGI +I++G L+ LS
Sbjct: 121 FMYENVG--SLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLS 178

Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAK 242
           EQ+L+DC T+ N GC GG  + AF +I Q  GI TED YPY    GTC++     P  + 
Sbjct: 179 EQELVDCDTSYNEGCNGGLMDYAFEFI-QKNGITTEDSYPYAEQDGTCASNLLNSPVVS- 236

Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGF 302
           I  +++VP+ +E AL++AV+ QP+S++I A    FQ Y EG+F G CGT+LDH V IVG+
Sbjct: 237 IDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGY 296

Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
           G T DG  YW++KNSWG  WG++GY+++ R      G CGI   +SYP+
Sbjct: 297 GATRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
           SV=2
          Length = 356

 Score =  291 bits (744), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 140/315 (44%), Positives = 215/315 (68%), Gaps = 11/315 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           +H++ ++E+ E W++   ++Y+   EK +R ++FK+NL++I++ NK+G ++Y LG N+F+
Sbjct: 43  SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           DL+++EF+ +Y G K          S + F Y+++    VP S+DWR KGAV  +KNQ  
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ VAAVEGI KI +GNL  LSEQ+L+DC T  NNGC GG  + AF YI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218

Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           +  E++YPY    GTC   +  +    I+ +++VP+ DE++LLKA++ QP+S+AI A   
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGR 278

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
           EFQ Y  G+F+G CG  LDH V  VG+G+++ G++Y ++KNSWG  WG+ GY+++ R+  
Sbjct: 279 EFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTG 337

Query: 334 --EGLCGIGTRSSYP 346
             EGLCGI   +S+P
Sbjct: 338 KPEGLCGINKMASFP 352


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
           GN=GCP1 PE=2 SV=2
          Length = 376

 Score =  290 bits (741), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 209/322 (64%), Gaps = 16/322 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
           ++ V  I+ +W A+HG++  +      +++ R  IFK+NL +I+  N++  N TYKLG  
Sbjct: 42  DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
           +F+DLTNDE+R LY G +   P+ R   +     KY   ++  +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
           K+Q  CG CWAF+  AAVEGI KI +G LI LSEQ+L+DC  + N GC GG  + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           ++N G+ TE +YPY+   G C++  K +    I  YE+VP+ DE AL KA+S QPVS+AI
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A    FQ Y+ GIF G CGT LDHAV  VG+G +E+G +YW+++NSWG  WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339

Query: 331 VRD-----EGLCGIGTRSSYPL 347
            R+      G CGI   +SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
           PE=1 SV=2
          Length = 458

 Score =  287 bits (735), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 146/324 (45%), Positives = 202/324 (62%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+    ++ +W A+HG+SY    E+E R   F++NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTN+E+R  Y G +      R  +       N ++   P S+DWR KGAV
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             IK+Q  CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TED+YPY+     C   +K A    I +YE+V    E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A    FQ Y  GIF G CGT LDH V  VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344


>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
           PE=1 SV=2
          Length = 466

 Score =  287 bits (735), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 141/310 (45%), Positives = 212/310 (68%), Gaps = 15/310 (4%)

Query: 47  HEKWMAQHGRSYKDEL--EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTND 102
           ++ W+A++G    + L  E E R  +F +NL++++  N   +    ++LG N+F+DLTN+
Sbjct: 52  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EFRA + G K+   + RS  +   +Y++  + ++P S+DWR+KGAV P+KNQ +CG CWA
Sbjct: 112 EFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
           F+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG  + AF +II+N GI TED
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227

Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
           +YPY+AV G C   ++ A    I  +E+VP  DE++L KAV+ QPVS+AI A   EFQ Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
             G+F+G CGT LDH V  VG+G T++G +YW+++NSWG  WG++GY+++ R+     G 
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346

Query: 337 CGIGTRSSYP 346
           CGI   +SYP
Sbjct: 347 CGIAMMASYP 356


>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
          Length = 328

 Score =  285 bits (728), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 144/317 (45%), Positives = 204/317 (64%), Gaps = 15/317 (4%)

Query: 44  VEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSD 98
           + I+ +W  +HG+S  +      +++ R  IFK+NL +I+  N+   N TYKLG   F++
Sbjct: 1   MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKGAVTPIKNQK 155
           LTNDE+R+LY G +   P  R T +     KY   +++ +VP ++DWR KGAV  IK+Q 
Sbjct: 61  LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAF+  AAVEGI KI +G L+ LSEQ+L+DC  + N GC GG  + AF +I++N 
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179

Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           G+ TE +YPY    G C++  K +    I  YE+VPS DE AL +AVS QPVS+AI A  
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
             FQ Y+ GIF G CGT +DHAV  VG+G +E+G +YW+++NSWG  WG+ GY+++ R+ 
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNV 298

Query: 334 ---EGLCGIGTRSSYPL 347
               G CGI   +SYP+
Sbjct: 299 ASKSGKCGIAIEASYPV 315


>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
          Length = 352

 Score =  280 bits (717), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 143/318 (44%), Positives = 201/318 (63%), Gaps = 16/318 (5%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++ + WM +H + Y+   EK  R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39  TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97

Query: 98  DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           DL+NDEF+  Y G+         H      T+K+    +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98  DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC  + + GC GG +  +  Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            G+ T   YPYQA    C A  KP    KI+ Y+ VPS  E + L A++ QP+S+ + A 
Sbjct: 212 NGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAG 271

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
              FQ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG  WG+ GYM++ R 
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330

Query: 333 ---DEGLCGIGTRSSYPL 347
               +G CG+   S YP 
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
           GN=At3g19400 PE=2 SV=1
          Length = 362

 Score =  280 bits (715), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 142/316 (44%), Positives = 202/316 (63%), Gaps = 12/316 (3%)

Query: 39  HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
           +E  V  ++E+W+ ++ ++Y    EKE R KIFK+NL+++++ N   +RT+++G  +F+D
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           LTN+EFRA+Y   KM   +  S  +  + Y+   +  +P  +DWR  GAV  +K+Q  CG
Sbjct: 96  LTNEEFRAIYLRKKMER-TKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
            CWAF+AV AVEGI +I +G LI LSEQ+L+DC     N GC GG    AF +I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212

Query: 218 ATEDEYPYQAVP-GTCSAAQ--KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
            T+ +YPY A   G C+A +        I  YE+VP  DE++L KAV+ QPVS+AI A S
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
             FQ YK G+  G CG  LDH V +VG+G+T  G +YW+I+NSWG  WGD+GY+K+ R+ 
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNI 331

Query: 334 ---EGLCGIGTRSSYP 346
               G CGI    SYP
Sbjct: 332 DDPFGKCGIAMMPSYP 347


>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
           SV=2
          Length = 490

 Score =  275 bits (703), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 139/295 (47%), Positives = 194/295 (65%), Gaps = 14/295 (4%)

Query: 63  EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
           E E R ++F +NL++++  N   +    ++LG N+F+DLTN EFRA Y G   P+   R 
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142

Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVT-PIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
              +   Y++  +  +P S+DWRDKGAV  P+KNQ +CG CWAF+AVAAVEGI KI +G 
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199

Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
           L+ LSEQ+L++C+ NG N+GC GG  + AFA+I +N G+ TE++YPY A+ G C+ A++ 
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259

Query: 239 -AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAV 297
                I  +E+VP  DE +L KAV+ QPVS+AI A   EFQ Y  G+F G CGT LDH V
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319

Query: 298 TIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
             VG+GT    GA YW ++NSWG  WG+ GY+++ R+     G CGI   +SYP+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score =  273 bits (697), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 141/316 (44%), Positives = 195/316 (61%), Gaps = 14/316 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W     +H ++Y+DE E+  RLKIF EN   I K N+   EG  ++KL  N+++DL
Sbjct: 55  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
            + EFR L  G+             +FK   + + +   +P S+DWR KGAVT +K+Q  
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
           CG CWAF++  A+EG    +SG L+ LSEQ L+DCST  GNNGC GG  + AF YI  N 
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234

Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
           GI TE  YPY+A+  +C   +    A    + ++P GDE+ + +AV ++ PVS+AI A  
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 294

Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             FQ Y EG++N   C  Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 295 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 354

Query: 333 D-EGLCGIGTRSSYPL 347
           + E  CGI + SSYPL
Sbjct: 355 NKENQCGIASASSYPL 370


>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
          Length = 339

 Score =  261 bits (666), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 138/314 (43%), Positives = 191/314 (60%), Gaps = 13/314 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           I E+W     QH ++Y +E+E+  R+KIF EN   I K N+   +G  +YKLG N+++D+
Sbjct: 24  IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83

Query: 100 TNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
            + EF+    GY   +       T      Y   +   VP S+DWR+ GAVT +K+Q  C
Sbjct: 84  LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
           G CWAF++  A+EG    ++G L+ LSEQ L+DCST  GNNGC GG  + AF YI  N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203

Query: 217 IATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYST 275
           I TE  YPY+ +  +C   +    A  + + ++P GDE+ + KAV +M PVS+AI A   
Sbjct: 204 IDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHE 263

Query: 276 EFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
            FQ Y EG++N   C  Q LDH V +VG+GT E G +YWL+KNSWG TWG+ GY+K+ R+
Sbjct: 264 SFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARN 323

Query: 334 E-GLCGIGTRSSYP 346
           +   CGI T SSYP
Sbjct: 324 QNNQCGIATASSYP 337


>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
           GN=At3g43960 PE=2 SV=1
          Length = 376

 Score =  260 bits (665), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 142/351 (40%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           SF+        ++ + +S      +    +E  V+ ++E+W+ ++G++Y    EKE R K
Sbjct: 4   SFRTLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFK 63

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM     +S +    +YQ
Sbjct: 64  IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120

Query: 130 NLSMTDVPTSLDWRDKGAVTP-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
                 +P  +DWR++GAV P +K Q ECG CWAFAA  AVEGI +I +G L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180

Query: 189 LDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK----- 242
           +DC   N N GC GG    AF +I +N GI +++ Y Y    G  +AA K    K     
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYT---GEDTAACKAIEMKTTRVV 237

Query: 243 -ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL-DHAVTIV 300
            I+ +E VP  DE +L KAV+ QP+S+ I+A       YK G++ G C     DH V IV
Sbjct: 238 TINGHEVVPVNDEMSLKKAVAYQPISVMISA--ANMSDYKSGVYKGACSNLWGDHNVLIV 295

Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G+GT+ D  +YWLI+NSWG  WG+ GY+++ R+     G C +     YP+
Sbjct: 296 GYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPI 346


>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
          Length = 345

 Score =  259 bits (662), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 144/337 (42%), Positives = 206/337 (61%), Gaps = 17/337 (5%)

Query: 18  MFIIITLLVSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           +F+ + L     S V  S++  T  + ++++ E WM +H + YK+  EK  R +IFK+NL
Sbjct: 17  LFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNL 76

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
           +YI++ NK+ N +Y LG N F+D++NDEF+  YTG    + ++ +T  S  +  N    +
Sbjct: 77  KYIDETNKK-NNSYWLGLNVFADMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVN 133

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P  +DWR KGAVTP+KNQ  CG CWAF+AV  +EGI KIR+GNL + SEQ+LLDC    
Sbjct: 134 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR- 192

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDE 254
           + GC GG    A   + Q  GI   + YPY+ V   C + +K P AAK     +V   +E
Sbjct: 193 SYGCNGGYPWSALQLVAQ-YGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNE 251

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            ALL +++ QPVS+ + A   +FQ Y+ GIF G CG ++DHAV  VG+     G NY LI
Sbjct: 252 GALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILI 306

Query: 315 KNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
           KNSWG  WG+ GY++I R      G+CG+ T S YP+
Sbjct: 307 KNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 343


>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
          Length = 348

 Score =  251 bits (640), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 136/314 (43%), Positives = 192/314 (61%), Gaps = 12/314 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++   WM  H + Y++  EK  R +IFK+NL YI++ NK+ N +Y LG N+F+
Sbjct: 39  TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 97

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+NDEF   Y G  + +   +S      ++ N    ++P ++DWR KGAVTP+++Q  C
Sbjct: 98  DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDTVNLPENVDWRKKGAVTPVRHQGSC 154

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC    ++GC GG    A  Y+ +N GI
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 212

Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
               +YPY+A  GTC A Q      K S    V   +E  LL A++ QPVS+ + +    
Sbjct: 213 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 272

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
           FQ YK GIF G CGT++DHAVT VG+G +       LIKNSWG  WG+ GY++I R    
Sbjct: 273 FQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 331

Query: 333 DEGLCGIGTRSSYP 346
             G+CG+   S YP
Sbjct: 332 SPGVCGLYKSSYYP 345


>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
          Length = 348

 Score =  248 bits (633), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 137/315 (43%), Positives = 194/315 (61%), Gaps = 12/315 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++   WM +H ++YK+  EK  R +IFK+NL+YI++ NK  N  Y LG N+FS
Sbjct: 39  TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMIN-GYWLGLNEFS 97

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+NDEF+  Y G     P   +      ++ N  + D+P S+DWR KGAVTP+K+Q  C
Sbjct: 98  DLSNDEFKEKYVG---SLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYC 154

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
             CWAF+ VA VEGI KI++GNL++LSEQ+L+DC    + GC  G +  +  Y+ QN GI
Sbjct: 155 ESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQN-GI 212

Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
               +YPY A   TC A Q      K +    V S +E +LL A++ QPVS+ + +   +
Sbjct: 213 HLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRD 272

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
           FQ+YK GIF G CGT++DHAVT VG+G +       LIKNSWG  WG+ GY++I R    
Sbjct: 273 FQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGN 331

Query: 333 DEGLCGIGTRSSYPL 347
             G+CG+   S YP+
Sbjct: 332 SPGVCGVYRSSYYPI 346


>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
          Length = 337

 Score =  241 bits (616), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 132/344 (38%), Positives = 210/344 (61%), Gaps = 13/344 (3%)

Query: 11  FKINTTPMFIIITLLVSCASQ-VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
            +++ T +F +I L +S  S   V S   ++ S ++    WM  + ++Y  + E   R +
Sbjct: 1   MRLSITLIFTLIVLSISFISAGNVFSHKQYQDSFID----WMRSNNKAYTHK-EFMPRYE 55

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
            FK+N++Y+   N +G++T  LG NQ +DL+N+E+R  Y G +     +     +     
Sbjct: 56  EFKKNMDYVHNWNSKGSKTV-LGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRL 114

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           N      P ++DWR+K AVTP+K+Q +CG C++F+   +VEG+T I++G L+ LSEQ +L
Sbjct: 115 NRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNIL 174

Query: 190 DCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA-VPGTCSAAQKPAAAKISNYE 247
           DCS++ GN GC GG    AF YII+N G+ +E++YPY+  V   C   +   AAKI++Y+
Sbjct: 175 DCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSYK 234

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTT 305
           E+ +GDE  L  A+ + PVS+AI A    FQ Y  G+ +   C ++ LDH V  VG G T
Sbjct: 235 EIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMG-T 293

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           ++G +Y+++KNSWG +WG  GY+ + R+ +  CGI T +SYP+A
Sbjct: 294 DNGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMASYPIA 337


>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
          Length = 331

 Score =  240 bits (613), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 135/338 (39%), Positives = 199/338 (58%), Gaps = 18/338 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++  L+ C+S +      H    ++ H + W   +G+ YK++ E+  R  I+++NL+
Sbjct: 1   MNWLVWALLLCSSAMAH---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            +   N E   G  +Y+LG N   D+T++E  +L +  ++PS   R+ T  +   Q L  
Sbjct: 58  TVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKL-- 115

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 116 ---PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 172

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GN GC GG   +AF YII N GI +E  YPY+A+ G C    K  AA  S Y E+P 
Sbjct: 173 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPF 232

Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
           G E+AL +AV+ + PVS+ I A  + F  YK G+ ++  C   ++H V +VG+G   DG 
Sbjct: 233 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL-DGK 291

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           +YWL+KNSWG  +GD GY+++ R+ G  CGI    SYP
Sbjct: 292 DYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 329


>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
          Length = 331

 Score =  240 bits (613), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 135/338 (39%), Positives = 195/338 (57%), Gaps = 18/338 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++ LL  C+  V      H+   ++ H   W   + + YK+E E+  R  I+++NL+
Sbjct: 1   MKWLVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           ++   N E   G  +Y LG N   D+T +E  +L    ++PS   R+ T     Y++ S 
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT-----YRSNSN 112

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 113 QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GN GC GG    AF YII N GI +E  YPY+A+ G C    K  AA  S Y E+P 
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKRAATCSKYTELPF 232

Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
           G E AL +AV+ + PVS+AI A    F  Y+ G+ +   C   ++H V +VG+G   +G 
Sbjct: 233 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL-NGK 291

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           +YWL+KNSWG  +GD GY+++ R+ G  CGI +  SYP
Sbjct: 292 DYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 329


>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
          Length = 340

 Score =  238 bits (608), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 128/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   H + YKD+ E+E+R  I+++NL++I   N E   G  TY++G N   D+TN+E   
Sbjct: 39  WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 98

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
                ++P  S ++ T     +++ S   +P ++DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 99  RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
            A+EG  K+++G LI LS Q L+DCS     GN GC GG   +AF YII N GI  +  Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
           PY+A    C    K  AA  S Y ++P GDE AL +AV+ + PVS+ I A  + F  YK 
Sbjct: 214 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273

Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
           G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD GY+++ R ++  CGI 
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332

Query: 341 TRSSYP 346
           +  SYP
Sbjct: 333 SYCSYP 338


>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
           SV=2
          Length = 322

 Score =  238 bits (608), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 129/309 (41%), Positives = 181/309 (58%), Gaps = 19/309 (6%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E++  + GR Y D  E+  RL +F +NL+YIE+ NK+   G  TY L  NQFSD+TN++F
Sbjct: 21  EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80

Query: 105 RALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
            A+  GYK  P P+   T++              T +DWR KGAVTP+K+Q +CG CWAF
Sbjct: 81  NAVMKGYKKGPRPAAVFTSTDA--------APESTEVDWRTKGAVTPVKDQGQCGSCWAF 132

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG--NNGCLGGSREKAFAYIIQNQGIATED 221
           +    +EG   +++G L+ LSEQQL+DC+     N GC GG  E+A  Y+  N G+ TE 
Sbjct: 133 STTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTES 192

Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSY 280
            YPY+A   TC        A  + Y  +  G E AL  A   + P+S+AI A    FQSY
Sbjct: 193 SYPYEARDNTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSY 252

Query: 281 KEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLC 337
             G++       +QLDHAV  VG+G +E G ++WL+KNSW  +WG++GY+K+ R+    C
Sbjct: 253 YTGVYYEPSCSSSQLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGESGYIKMARNRNNNC 311

Query: 338 GIGTRSSYP 346
           GI T + YP
Sbjct: 312 GIATDACYP 320


>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
          Length = 331

 Score =  235 bits (600), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 128/338 (37%), Positives = 199/338 (58%), Gaps = 18/338 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++ +L+ C+S V      H+   ++ H   W   +G+ YK++ E+ +R  I+++NL+
Sbjct: 1   MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           ++   N E   G  +Y LG N   D+T++E  +L +  ++PS   R+ T     Y++   
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GN GC GG    AF YII N+GI ++  YPY+A+   C    K  AA  S Y E+P 
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY 232

Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
           G E  L +AV+ + PVS+ + A    F  Y+ G+ +   C   ++H V +VG+G   +G 
Sbjct: 233 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGK 291

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
            YWL+KNSWG+ +G+ GY+++ R++G  CGI +  SYP
Sbjct: 292 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329


>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
          Length = 333

 Score =  235 bits (599), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 131/341 (38%), Positives = 195/341 (57%), Gaps = 23/341 (6%)

Query: 17  PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           P FI+  L +  AS  +    T   S+     KW A H R Y    E+  R  ++++N++
Sbjct: 3   PTFILAALCLGIASATL----TFNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            IE  N+E   G  ++ +  N F D+T++EFR +  G++   P           +Q    
Sbjct: 58  MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKPRKGKV------FQEPLF 111

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
            + P S+DWR+KG VTP+KNQ +CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS 
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
             GN GC GG  + AF Y+  N G+ +E+ YPY+A   +C    + + A  + + ++P  
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIPK- 230

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TTE 306
            E+AL+KAV ++ P+S+AI A    F  YKEGI F   C ++ +DH V +VG+G   T  
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
           D + YWL+KNSWG  WG  GY+K+ +D    CGI + +SYP
Sbjct: 291 DNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYP 331


>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
          Length = 323

 Score =  234 bits (598), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 129/308 (41%), Positives = 190/308 (61%), Gaps = 14/308 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEF 104
           E +  + G+ Y +  E+  R+ +F + L++I++ N+   +G  TY L  N FSDLT++E 
Sbjct: 21  ENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEV 80

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            A  TG      + R    S    ++   T +   +DWR+KGAVTP+K+Q +CG CWAF+
Sbjct: 81  LATKTGM-----TRRRHPLSVLP-KSAPTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFS 134

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           AVAA+EG   +++G+L+ LSEQ L+DCS++ GN GC GG   +A+ YII N+GI TE  Y
Sbjct: 135 AVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSY 194

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
           PY+A+   C        A +S+Y E  SGDE AL  AV  + PVS+ I A  + F SY  
Sbjct: 195 PYKAIDDNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGSYGG 254

Query: 283 GI-FNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
           G+ +   C +   +HAVT VG+GT  +G +YW++KNSWG  WG++GY+K+ R+ +  C I
Sbjct: 255 GVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNRDNNCAI 314

Query: 340 GTRSSYPL 347
            T S YP+
Sbjct: 315 ATYSVYPV 322


>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
           SV=1
          Length = 323

 Score =  233 bits (594), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 132/308 (42%), Positives = 181/308 (58%), Gaps = 14/308 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E +  ++GR Y D  E   R  IF++N +YIE+ NK+   G  T+ L  N+F D+T +EF
Sbjct: 21  EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            A+  G  +P    RS   S F Y         T +DWR KGAVTP+K+Q +CG CWAF+
Sbjct: 81  NAVMKG-NIP---RRSAPVSVF-YPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFS 135

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
              ++EG   +++G+LI L+EQQL+DCS   G  GC GG    AF YI  N GI TE  Y
Sbjct: 136 TTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAY 195

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY+A  G+C       AA  S +  + SG E  L +AV  + P+S+ I A  + FQ Y  
Sbjct: 196 PYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSS 255

Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGI 339
           G++       + LDHAV  VG+G +E G ++WL+KNSW  +WGDAGY+K+ R+    CGI
Sbjct: 256 GVYYEPSCSPSYLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGI 314

Query: 340 GTRSSYPL 347
            T +SYPL
Sbjct: 315 ATVASYPL 322


>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
          Length = 330

 Score =  232 bits (592), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 126/337 (37%), Positives = 197/337 (58%), Gaps = 17/337 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++ +L  C+S V      H+   ++ H   W   +G+ YK++ E+ +R  I+++NL+
Sbjct: 1   MKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           ++   N E   G  +Y LG N   D+T++E  +L +  ++P+   R+ T  +   Q L  
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPNQWQRNITYKSNPNQML-- 115

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCS 
Sbjct: 116 ---PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSE 172

Query: 194 N-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
             GN GC GG   +AF YII N+GI +E  YPY+A    C    K  AA  S Y E+P G
Sbjct: 173 KYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKCQYDSKYRAATCSKYTELPYG 232

Query: 253 DEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGAN 310
            E  L +AV+ + PV + + A    F  Y+ G+ ++  C  +++H V ++G+G   +G  
Sbjct: 233 REDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYGDL-NGKE 291

Query: 311 YWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           YWL+KNSWG+ +G+ GY+++ R++G  CGI +  SYP
Sbjct: 292 YWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 328


>sp|P07711|CATL1_HUMAN Cathepsin L1 OS=Homo sapiens GN=CTSL1 PE=1 SV=2
          Length = 333

 Score =  232 bits (592), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 131/342 (38%), Positives = 193/342 (56%), Gaps = 23/342 (6%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
            P  I+    +  AS  +    T + S+     KW A H R Y    E+  R  ++++N+
Sbjct: 2   NPTLILAAFCLGIASATL----TFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNM 56

Query: 76  EYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           + IE  N   +EG  ++ +  N F D+T++EFR +  G++   P           +Q   
Sbjct: 57  KMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKV------FQEPL 110

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
             + P S+DWR+KG VTP+KNQ +CG CWAF+A  A+EG    ++G LI LSEQ L+DCS
Sbjct: 111 FYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCS 170

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GN GC GG  + AF Y+  N G+ +E+ YPY+A   +C    K + A  + + ++P 
Sbjct: 171 GPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK 230

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TT 305
             E+AL+KAV ++ P+S+AI A    F  YKEGI F   C ++ +DH V +VG+G   T 
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
            D   YWL+KNSWG  WG  GY+K+ +D    CGI + +SYP
Sbjct: 290 SDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYP 331


>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
           SV=1
          Length = 321

 Score =  231 bits (589), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 132/307 (42%), Positives = 183/307 (59%), Gaps = 16/307 (5%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           + +  Q+GR Y D  E+  R ++F++N + IE  NK+   G  T+K+  NQF D+TN+EF
Sbjct: 21  DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 80

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            A+  GYK  S   R    + F  +   M      +DWR K  VTP+K+Q++CG CWAF+
Sbjct: 81  NAVMKGYKKGS---RGEPKAVFTAEAGPMA---ADVDWRTKALVTPVKDQEQCGSCWAFS 134

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A  A+EG   +++  L+ LSEQQL+DCST+ GN+GC GG    AF YI  N GI TE  Y
Sbjct: 135 ATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSY 194

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKE 282
           PY+A   +C        A  +   EV    E+AL +AVS + P+S+AI A    FQ Y  
Sbjct: 195 PYEAEDRSCRFDANSIGAICTGSVEVQH-TEEALQEAVSGVGPISVAIDASHFSFQFYSS 253

Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
           G++       T LDH V  VG+G TE   +YWL+KNSWG++WGDAGY+K+ R+ +  CGI
Sbjct: 254 GVYYEQNCSPTFLDHGVLAVGYG-TESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGI 312

Query: 340 GTRSSYP 346
            +  SYP
Sbjct: 313 ASEPSYP 319


>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
          Length = 221

 Score =  229 bits (585), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 107/217 (49%), Positives = 149/217 (68%), Gaps = 6/217 (2%)

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
           D+P S+DWR+ GAV P+KNQ  CG CWAF+ VAAVEGI +I +G+LI LSEQQL+DC+T 
Sbjct: 2   DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT- 60

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
            N+GC GG    AF +I+ N GI +E+ YPY+   G C++        I +YE VPS +E
Sbjct: 61  ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNSTVNAPVVSIDSYENVPSHNE 120

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
           Q+L KAV+ QPVS+ + A   +FQ Y+ GIF G C    +HA+T+VG+GT  D  ++W++
Sbjct: 121 QSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDFWIV 179

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           KNSWG  WG++GY++  R+    +G CGI   +SYP+
Sbjct: 180 KNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216


>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
          Length = 329

 Score =  229 bits (583), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 129/326 (39%), Positives = 193/326 (59%), Gaps = 21/326 (6%)

Query: 33  VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
           V S + + + +++ H E W   H + Y  ++++  R  I+++NL+YI   N E   G  T
Sbjct: 11  VMSFALYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHT 70

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD----VPTSLDWRD 144
           Y+L  N   D+TN+E     TG K+P+   RS  +       L + D     P S+D+R 
Sbjct: 71  YELAMNHLGDMTNEEVVQKMTGLKVPASHSRSNDT-------LYIPDWEGRAPDSVDYRK 123

Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSR 204
           KG VTP+KNQ +CG CWAF++V A+EG  K ++G L+ LS Q L+DC +  N+GC GG  
Sbjct: 124 KGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYM 182

Query: 205 EKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-M 263
             AF Y+ +N+GI +ED YPY     +C       AAK   Y E+P G+E+AL +AV+ +
Sbjct: 183 TNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARV 242

Query: 264 QPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
            PVS+AI A  T FQ Y +G+ ++  C +  L+HAV  VG+G  + G  +W+IKNSWG  
Sbjct: 243 GPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQKGNKHWIIKNSWGEN 301

Query: 322 WGDAGYMKIVRDE-GLCGIGTRSSYP 346
           WG+ GY+ + R++   CGI   +S+P
Sbjct: 302 WGNKGYILMARNKNNACGIANLASFP 327


>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
          Length = 329

 Score =  229 bits (583), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 129/326 (39%), Positives = 193/326 (59%), Gaps = 21/326 (6%)

Query: 33  VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
           V S + + + +++ H E W   H + Y  ++++  R  I+++NL+YI   N E   G  T
Sbjct: 11  VMSFALYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHT 70

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD----VPTSLDWRD 144
           Y+L  N   D+TN+E     TG K+P+   RS  +       L + D     P S+D+R 
Sbjct: 71  YELAMNHLGDMTNEEVVQKMTGLKVPASHSRSNDT-------LYIPDWEGRAPDSVDYRK 123

Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSR 204
           KG VTP+KNQ +CG CWAF++V A+EG  K ++G L+ LS Q L+DC +  N+GC GG  
Sbjct: 124 KGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYM 182

Query: 205 EKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-M 263
             AF Y+ +N+GI +ED YPY     +C       AAK   Y E+P G+E+AL +AV+ +
Sbjct: 183 TNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARV 242

Query: 264 QPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
            PVS+AI A  T FQ Y +G+ ++  C +  L+HAV  VG+G  + G  +W+IKNSWG  
Sbjct: 243 GPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQKGNKHWIIKNSWGEN 301

Query: 322 WGDAGYMKIVRDE-GLCGIGTRSSYP 346
           WG+ GY+ + R++   CGI   +S+P
Sbjct: 302 WGNKGYILMARNKNNACGIANLASFP 327


>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
          Length = 334

 Score =  228 bits (581), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 130/340 (38%), Positives = 194/340 (57%), Gaps = 19/340 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M + + L   C   + S+    +Q++     +W A H R Y    E+  R  ++++N++ 
Sbjct: 1   MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAN-EEGWRRAVWEKNMKM 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N E   G   + +  N F D+TN+EFR +   ++    + +      F+       
Sbjct: 59  IELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFR----NQKFRKGKVFR--EPLFL 112

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
           D+P S+DWR KG VTP+KNQK+CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS  
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GN GC GG   +AF Y+ +N G+ +E+ YPY AV   C    + + A  + +  V  G 
Sbjct: 173 QGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGK 232

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTED 307
           E+AL+KAV ++ P+S+A+ A  + FQ YK GI F   C ++ LDH V +VG+   G   +
Sbjct: 233 EKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSN 292

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
            + YWL+KNSWG  WG  GY+KI +D+   CGI T +SYP
Sbjct: 293 NSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYP 332


>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  227 bits (579), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 125/311 (40%), Positives = 185/311 (59%), Gaps = 19/311 (6%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
            +W + H R Y    E+E R  I+++N+  I+  N E   G   + +  N F D+TN+EF
Sbjct: 30  HQWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEF 88

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           R +  GY+     H+        +Q   M  +P S+DWR+KG VTP+KNQ +CG CWAF+
Sbjct: 89  RQVVNGYR-----HQKHKKGRL-FQEPLMLKIPKSVDWREKGCVTPVKNQGQCGSCWAFS 142

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A   +EG   +++G LI LSEQ L+DCS   GN GC GG  + AF YI +N G+ +E+ Y
Sbjct: 143 ASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESY 202

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY+A  G+C    + A A  + + ++P   E+AL+KAV ++ P+S+A+ A     Q Y  
Sbjct: 203 PYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261

Query: 283 GI-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGL 336
           GI +   C ++ LDH V +VG+   GT  +   YWL+KNSWG+ WG  GY+KI +D +  
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321

Query: 337 CGIGTRSSYPL 347
           CG+ T +SYP+
Sbjct: 322 CGLATAASYPV 332


>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  226 bits (575), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 123/311 (39%), Positives = 184/311 (59%), Gaps = 19/311 (6%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
            +W + H R Y    E+E R  ++++N+  I+  N E   G   + +  N F D+TN+EF
Sbjct: 30  HQWKSTHRRLYGTN-EEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEF 88

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           R +  GY+     H+        +Q   M  +P ++DWR+KG VTP+KNQ +CG CWAF+
Sbjct: 89  RQIVNGYR-----HQKHKKGRL-FQEPLMLQIPKTVDWREKGCVTPVKNQGQCGSCWAFS 142

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A   +EG   +++G LI LSEQ L+DCS + GN GC GG  + AF YI +N G+ +E+ Y
Sbjct: 143 ASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESY 202

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY+A  G+C    + A A  + + ++P   E+AL+KAV ++ P+S+A+ A     Q Y  
Sbjct: 203 PYEAKDGSCKYRAEYAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261

Query: 283 GI-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
           GI +   C ++ LDH V +VG+   GT  +   YWL+KNSWG  WG  GY+KI +D    
Sbjct: 262 GIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNH 321

Query: 337 CGIGTRSSYPL 347
           CG+ T +SYP+
Sbjct: 322 CGLATAASYPI 332


>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
          Length = 208

 Score =  226 bits (575), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 112/213 (52%), Positives = 145/213 (68%), Gaps = 10/213 (4%)

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P  +DWR KGAVTP+KNQ  CG CWAF+ V+ VE I +IR+GNLI LSEQ+L+DC    
Sbjct: 1   LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           N+GCLGG+   A+ YII N GI T+  YPY+AV G C AA K     I  Y  VP  +E 
Sbjct: 60  NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAASK--VVSIDGYNGVPFCNEX 117

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           AL +AV++QP ++AI A S +FQ Y  GIF+G CGT+L+H VTIVG+      ANYW+++
Sbjct: 118 ALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGY-----QANYWIVR 172

Query: 316 NSWGNTWGDAGYMKIVR--DEGLCGIGTRSSYP 346
           NSWG  WG+ GY++++R    GLCGI     YP
Sbjct: 173 NSWGRYWGEKGYIRMLRVGGCGLCGIARLPYYP 205


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.315    0.130    0.387 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 127,069,099
Number of Sequences: 539616
Number of extensions: 5202453
Number of successful extensions: 16415
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 223
Number of HSP's successfully gapped in prelim test: 22
Number of HSP's that attempted gapping in prelim test: 15280
Number of HSP's gapped (non-prelim): 291
length of query: 348
length of database: 191,569,459
effective HSP length: 118
effective length of query: 230
effective length of database: 127,894,771
effective search space: 29415797330
effective search space used: 29415797330
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (28.5 bits)