BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 047400
         (88 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
 gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
          Length = 341

 Score = 88.6 bits (218), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 39/76 (51%), Positives = 52/76 (68%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+KIVTNNL+ +S Q+L+DCD   E   C GG ++  +Q+VI N G
Sbjct: 162 GGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDT--EDYGCQGGEMQKAFQFVIDNGG 219

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP +G    C
Sbjct: 220 IDTEADYPFIGTNGTC 235


>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
 gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
 gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
 gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score = 88.6 bits (218), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 37/78 (47%), Positives = 54/78 (69%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW FA V A+EGI+KI +  L+ +S Q+L+DCD +  ++ C GG +ET Y ++I+N G
Sbjct: 150 GGCWAFAAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGG 209

Query: 65  INTERDYPNVGVMDNCKV 82
           + TE+DYP  GV   CK+
Sbjct: 210 LTTEQDYPYEGVDGTCKM 227


>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score = 88.6 bits (218), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 38/77 (49%), Positives = 52/77 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI KI T  LV +S Q+LVDCD +G  + C GG++E  ++++I+N G
Sbjct: 148 GSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGTDQGCEGGYMEDGFEFIIKNGG 207

Query: 65  INTERDYPNVGVMDNCK 81
           I TE +YP   V  +CK
Sbjct: 208 ITTEANYPYKAVDGSCK 224


>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
 gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score = 88.2 bits (217), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 36/68 (52%), Positives = 50/68 (73%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI+++ T  LV +S Q+LVDCDNQGE + C GG +E  ++++I+N G
Sbjct: 146 GSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGEDQGCEGGLMEDGFEFIIKNHG 205

Query: 65  INTERDYP 72
           I TE +YP
Sbjct: 206 ITTEANYP 213


>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
 gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
          Length = 469

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Composition-based stats.
 Identities = 38/78 (48%), Positives = 54/78 (69%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA  G++EGI+ IVT +LV +S Q+LVDCD + + + C GG ++  Y ++I+N+G
Sbjct: 124 GSCWAFATTGSVEGINAIVTGSLVSLSEQELVDCDTE-QDKGCSGGLMDYAYAWIIKNKG 182

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE DYP   +   C V
Sbjct: 183 INTEEDYPYTAMDGQCDV 200


>gi|413922306|gb|AFW62238.1| hypothetical protein ZEAMMB73_802227 [Zea mays]
          Length = 490

 Score = 87.4 bits (215), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 38/81 (46%), Positives = 56/81 (69%), Gaps = 7/81 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD   NQG    C GG ++ +++++I 
Sbjct: 360 GSCWAFSTIAAVEGINQIVTGDLISLSKQELVDCDTSYNQG----CNGGLMDYVFEFIIN 415

Query: 62  NRGINTERDYPNVGVMDNCKV 82
           N GI+TE+DYP  G    C V
Sbjct: 416 NGGIDTEKDYPYKGTDGRCDV 436


>gi|414879924|tpg|DAA57055.1| TPA: hypothetical protein ZEAMMB73_175573 [Zea mays]
          Length = 336

 Score = 87.0 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 40/88 (45%), Positives = 59/88 (67%), Gaps = 7/88 (7%)

Query: 2  HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQY 58
          +P GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD   NQG    C GG ++  +++
Sbjct: 6  YPSGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEF 61

Query: 59 VIQNRGINTERDYPNVGVMDNCKVFQFN 86
          +I N GI+TE+DYP  G    C V + N
Sbjct: 62 IINNGGIDTEKDYPYKGTDGRCDVNRKN 89


>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 496

 Score = 87.0 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 38/82 (46%), Positives = 56/82 (68%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI+KIVT  L+ +S Q+LVDCD  G +  C GG ++  ++++I N G
Sbjct: 189 GSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDT-GYNEGCNGGLMDYAFEFIINNGG 247

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP  GV   C  ++ N
Sbjct: 248 IDSEEDYPYRGVDGRCDTYRKN 269


>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
 gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score = 86.7 bits (213), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 36/78 (46%), Positives = 50/78 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ + A EGI KI T  LV +S Q++VDCD +G    C GG+++  ++++IQN G
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAFKFIIQNHG 206

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE  YP  GV   C +
Sbjct: 207 INTEASYPYKGVDGKCNI 224


>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
 gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 35/68 (51%), Positives = 49/68 (72%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI+++ T  LV +S Q+LVDCD QGE + C GG +E  ++++I+N G
Sbjct: 146 GSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGEDQGCEGGLMEDGFEFIIKNHG 205

Query: 65  INTERDYP 72
           I TE +YP
Sbjct: 206 ITTEANYP 213


>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
 gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 34/80 (42%), Positives = 54/80 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+++ T+ L+ +S Q+LVDCD +GE + C GG ++  ++++ QN+G
Sbjct: 145 GSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQG 204

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           + TE +YP  G    C   Q
Sbjct: 205 LTTEANYPYEGSDGTCNTKQ 224


>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 476

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 37/82 (45%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI+KIVT  L+ +S Q+LVDCD  G ++ C GG ++  ++++I N G
Sbjct: 169 GSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDT-GYNQGCNGGLMDYAFEFIINNGG 227

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+++ DYP  GV   C  ++ N
Sbjct: 228 IDSDEDYPYRGVDGRCDTYRKN 249


>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
 gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 34/80 (42%), Positives = 54/80 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+++ T+ L+ +S Q+LVDCD +GE + C GG ++  ++++ QN+G
Sbjct: 145 GSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQG 204

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           + TE +YP  G    C   Q
Sbjct: 205 LTTEANYPYEGSDGTCNTKQ 224


>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
 gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
          Length = 341

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V + EGI K+ T NLV +S Q+LVDCD  GE + C GG ++  ++++IQN G
Sbjct: 145 GCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTNGEDQGCEGGLMDDAFEFIIQNNG 204

Query: 65  INTERDYPNVGVMDNC 80
           ++TE +YP  GV   C
Sbjct: 205 LSTEAEYPYQGVDGTC 220


>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI+K+ T  L+ +S Q+LVDCD +G  + C GG ++  +++++QN+G
Sbjct: 146 GCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKG 205

Query: 65  INTERDYPNVGVMDNC 80
           +NTE  YP  GV   C
Sbjct: 206 LNTEAKYPYQGVDATC 221


>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
          Length = 341

 Score = 85.5 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+++ T  L+ +S Q+LVDCD  GE + C GG ++  ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 205 LTTEANYPYAGTDGTC 220


>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+++ T  L+ +S Q+LVDCD  GE + C GG ++  ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 205 LTTEANYPYAGTDGTC 220


>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 346

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 52/77 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EG+ K+ T  LV +S Q+LVDCD  G  + C+GG+++  ++++I+N G
Sbjct: 150 GSCWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDVHGVDQGCMGGWMDDAFKFIIKNGG 209

Query: 65  INTERDYPNVGVMDNCK 81
           + TE +YP  G  D CK
Sbjct: 210 LTTEANYPYTGEDDKCK 226


>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI+K+ T  L+ +S Q+LVDCD +G  + C GG ++  +++++QN+G
Sbjct: 146 GCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKG 205

Query: 65  INTERDYPNVGVMDNC 80
           +NTE  YP  GV   C
Sbjct: 206 LNTEAKYPYQGVDATC 221


>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
          Length = 341

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+++ T  L+ +S Q+LVDCD  GE + C GG ++  ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 205 LTTEANYPYAGTDGTC 220


>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
          Length = 340

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 52/76 (68%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEGI++I T  L+ +S Q+LVDCD +GE + C GG +E  ++++I+N G
Sbjct: 146 GSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           I +E +YP      +C
Sbjct: 206 ITSETNYPYKAADGSC 221


>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 455

 Score = 85.1 bits (209), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 58/82 (70%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD  G +  C GG ++  ++++I+N G
Sbjct: 149 GSCWAFSAIGAVEGINKIVTGDLISLSEQELVDCDT-GYNMGCNGGLMDYAFEFIIKNGG 207

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP  GV   C  ++ N
Sbjct: 208 IDSEEDYPYKGVDGRCDEYRKN 229


>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
 gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
          Length = 425

 Score = 85.1 bits (209), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 37/82 (45%), Positives = 55/82 (67%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW FA  GAIEGI++IVT  LV +S Q+L+DCD + + + C GG +E  YQ++++N G
Sbjct: 127 GGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKAD-KGCDGGLMENAYQFIVENGG 185

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           ++TE DYP      +C + + N
Sbjct: 186 LDTETDYPYHASESHCNMKKLN 207


>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
          Length = 341

 Score = 85.1 bits (209), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+++ T  L+ +S Q+LVDCD  GE + C GG ++  ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 205 LTTEANYPYAGTDGTC 220


>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
          Length = 373

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F VV A+EGI+KIVT  L+ +S QQLVDCD  G+ + C GG ++  +++++ N G
Sbjct: 142 GSCWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGG 201

Query: 65  INTERDYPNVGVMDNC 80
           I +E +YP   V   C
Sbjct: 202 ITSEANYPYEEVQRLC 217


>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
 gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 52/76 (68%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEGI++I T  L+ +S Q+LVDCD +GE + C GG +E  ++++I+N G
Sbjct: 146 GSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           I +E +YP      +C
Sbjct: 206 ITSETNYPYKAADGSC 221


>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
 gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 41/76 (53%), Positives = 54/76 (71%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+  GAIEGI+KIVT +LV +S Q+LVDCD    S  C GG ++  YQ+VI+N+G
Sbjct: 135 GGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNS-GCEGGLMDYAYQFVIKNQG 193

Query: 65  INTERDYPNVGVMDNC 80
           I++E DYP VG+   C
Sbjct: 194 IDSEADYPYVGMDKPC 209


>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
          Length = 284

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 50/78 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ + A EGI KI T  LV +S Q++VDCD +G    C GG+++  ++++IQN G
Sbjct: 88  GCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAFKFIIQNHG 147

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE  YP  GV   C +
Sbjct: 148 INTEASYPYKGVDGKCNI 165


>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
          Length = 339

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 50/77 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T  L+ +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204

Query: 65  INTERDYPNVGVMDNCK 81
           + TE +YP     D CK
Sbjct: 205 LTTESNYPYAAADDKCK 221


>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
          Length = 348

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 50/77 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T  L+ +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204

Query: 65  INTERDYPNVGVMDNCK 81
           + TE +YP     D CK
Sbjct: 205 LTTESNYPYAAADDKCK 221


>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 38/72 (52%), Positives = 55/72 (76%), Gaps = 1/72 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EGI+KIVT  L+ +S Q+LVDCDN G ++ C GG ++  ++++++N G
Sbjct: 172 GSCWAFSTVGAVEGINKIVTGELISLSEQELVDCDN-GYNQGCNGGLMDYAFEFIVKNGG 230

Query: 65  INTERDYPNVGV 76
           I+TE DYP  GV
Sbjct: 231 IDTEDDYPYKGV 242


>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 343

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI+KI T NLV +S QQL+DCD    ++ C GG +ET ++++  N G
Sbjct: 149 GGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSNGG 208

Query: 65  INTERDYPNVGVMDNC 80
           + TE DYP  G+   C
Sbjct: 209 LTTETDYPYTGIEGTC 224


>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1 [Vitis vinifera]
          Length = 341

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+++ T  L+ +S Q+LVDCD  GE + C GG ++  ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIKQNHG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 205 LTTEANYPYAGTDGTC 220


>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
 gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
          Length = 339

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 50/77 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T  L+ +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204

Query: 65  INTERDYPNVGVMDNCK 81
           + TE +YP     D CK
Sbjct: 205 LTTESNYPYAAADDKCK 221


>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
 gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 35/68 (51%), Positives = 49/68 (72%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI+++ T  LV +S Q+LVDCD QGE + C GG +E  ++++I+N G
Sbjct: 146 GSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGEDQGCEGGLMEDGFEFIIKNHG 205

Query: 65  INTERDYP 72
           I TE +YP
Sbjct: 206 ITTEANYP 213


>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
          Length = 339

 Score = 84.7 bits (208), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 50/77 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T  L+ +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204

Query: 65  INTERDYPNVGVMDNCK 81
           + TE +YP     D CK
Sbjct: 205 LTTESNYPYAAADDKCK 221


>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
 gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
          Length = 372

 Score = 84.3 bits (207), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 39/82 (47%), Positives = 53/82 (64%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI++IVT NLV +S Q+++DCD Q     C GG ++  +Q+VI N G
Sbjct: 164 GGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQ--DGGCNGGEMQNAFQFVINNGG 221

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 222 IDTEADYPYLGTDAACDANRVN 243


>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
 gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
          Length = 446

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 55/82 (67%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW FA  GAIEGI++IVT  L+ +S Q+L+DCD + + + C GG +E  YQ++++N G
Sbjct: 127 GGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKAD-KGCDGGLMENAYQFIVENGG 185

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           ++TE DYP      +C + + N
Sbjct: 186 LDTETDYPYHASESHCNMKKLN 207


>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+++ T  L+ +S Q+LVDCD  GE + C GG ++  ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIEQNHG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 205 LATEANYPYAGTDGTC 220


>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T NLV +S Q+LVDCD  G  + C GG ++  ++++IQN G
Sbjct: 146 GCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQGGLMDDAFKFIIQNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           +NTE  YP  GV   C
Sbjct: 206 LNTEAQYPYQGVDGTC 221


>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
          Length = 1039

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 713 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 768

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI+TE+DYP  G    C V + N
Sbjct: 769 NGGIDTEKDYPYKGTDGRCDVNRKN 793


>gi|147836416|emb|CAN75313.1| hypothetical protein VITISV_033592 [Vitis vinifera]
          Length = 201

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+++ T  L+ +S Q+LVDCD  GE + C GG ++  ++++ QN G
Sbjct: 69  GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIEQNHG 128

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 129 LTTEANYPYAGTDGTC 144


>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T NLV +S Q+LVDCD  G  + C GG ++  ++++IQN G
Sbjct: 146 GCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQGGLMDDAFKFIIQNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           +NTE  YP  GV   C
Sbjct: 206 LNTEAQYPYQGVDGTC 221


>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
          Length = 890

 Score = 84.3 bits (207), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI  + +  L+ +S Q+LVDCD +G  + C GG ++  +++VIQN G
Sbjct: 694 GCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHG 753

Query: 65  INTERDYPNVGVMDNC 80
           +NTE +YP  GV   C
Sbjct: 754 LNTEANYPYKGVDGKC 769


>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
           vulgaris gb|U52970 and is a member of the papain
           cysteine protease family PF|00112 [Arabidopsis thaliana]
 gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 343

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI+KI T NLV +S QQL+DCD    ++ C GG +ET ++++  N G
Sbjct: 149 GGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGG 208

Query: 65  INTERDYPNVGVMDNC 80
           + TE DYP  G+   C
Sbjct: 209 LATETDYPYTGIEGTC 224


>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
 gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
          Length = 356

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 54/76 (71%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT  L+ +S Q+LVDCD +  +  C GG ++  +Q++I N G
Sbjct: 139 GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCD-RFYNAGCNGGLMDYAFQFIINNGG 197

Query: 65  INTERDYPNVGVMDNC 80
           ++TE+DYP +G  D C
Sbjct: 198 LDTEKDYPYLGNDDTC 213


>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
          Length = 470

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 38/82 (46%), Positives = 56/82 (68%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT +L+ +S Q+LVDCDN G ++ C GG ++  ++++I N G
Sbjct: 163 GSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDN-GYNQGCNGGLMDYGFEFIINNGG 221

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP       C  ++ N
Sbjct: 222 IDTEEDYPYTARDGKCDQYRKN 243


>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
          Length = 343

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 50/77 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T+NLV +S Q+LVDCD       C GG++++ +++VI+N G
Sbjct: 149 GCCWAFSAVAAMEGIVKLSTDNLVSLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 208

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP   V   CK
Sbjct: 209 LATESSYPYKAVDGKCK 225


>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
 gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
          Length = 344

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+K+ T  L+ +S Q+LVDCD  G  + C GG ++  ++++I+N G
Sbjct: 148 GCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGMDQGCEGGLMDDAFEFIIENNG 207

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  GV  +C
Sbjct: 208 LTTEANYPYEGVDGSC 223


>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
          Length = 525

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCDN G+++ C GG ++  ++++I N G
Sbjct: 163 GSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDN-GQNQGCNGGLMDYAFEFIINNGG 221

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP       C  ++ N
Sbjct: 222 IDTEEDYPYKARDGKCDQYRKN 243


>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 337

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI +I T  LV +S Q+LVDCD +G  + C GG++E  ++++I+N G
Sbjct: 144 GSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGG 203

Query: 65  INTERDYPNVGVMDNC 80
           I +E +YP   V   C
Sbjct: 204 ITSEANYPYKAVDGKC 219


>gi|213514640|ref|NP_001134963.1| Cathepsin S precursor [Salmo salar]
 gi|209155506|gb|ACI33985.1| Cathepsin S precursor [Salmo salar]
 gi|209737594|gb|ACI69666.1| Cathepsin S precursor [Salmo salar]
 gi|223647278|gb|ACN10397.1| Cathepsin S precursor [Salmo salar]
 gi|223673157|gb|ACN12760.1| Cathepsin S precursor [Salmo salar]
          Length = 330

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG     T  L+DIS+Q LVDC ++  ++ C GGF+   +QYVI N+G
Sbjct: 137 GSCWAFSAVGALEGQLMKTTGKLIDISSQNLVDCSSKYGNKGCNGGFMSQAFQYVIDNQG 196

Query: 65  INTERDYPNVGVMDNC 80
           I++++ YP  GV   C
Sbjct: 197 IDSDQSYPYKGVQQQC 212


>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
 gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
 gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
 gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI++I T  LV +S Q+LVDCD +GE + C GG +E  ++++I+N G
Sbjct: 146 GSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           I +E +YP      +C
Sbjct: 206 ITSETNYPYKAADGSC 221


>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
          Length = 472

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 35/82 (42%), Positives = 54/82 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I++IVT  +V +S Q+LV+CD  G+S  C GG ++  ++++I+N G
Sbjct: 169 GSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGG 228

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   +   C V + N
Sbjct: 229 IDTEDDYPYKAIDGRCDVLRKN 250


>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
 gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
 gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
          Length = 345

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  LV +S Q+LVDCD +G  + C GG ++  ++++IQN G
Sbjct: 149 GCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           +NTE  YP  GV   C  
Sbjct: 209 LNTEAQYPYQGVDGTCSA 226


>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
           Precursor
 gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
 gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 55/85 (64%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+   A+EGI+KIVT  L+ +S Q+LVDCD   NQG    C GG ++  +Q++++
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQG----CNGGLMDYAFQFIMK 222

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N G+NTE+DYP  G    C  F  N
Sbjct: 223 NGGLNTEKDYPYRGFGGKCNSFLKN 247


>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
 gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
          Length = 376

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 55/85 (64%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+   A+EGI+KIVT  L+ +S Q+LVDCD   NQG    C GG ++  +Q++++
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQG----CNGGLMDYAFQFIMK 222

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N G+NTE+DYP  G    C  F  N
Sbjct: 223 NGGLNTEKDYPYRGFGGKCNSFLKN 247


>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 55/85 (64%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+   A+EGI+KIVT  L+ +S Q+LVDCD   NQG    C GG ++  +Q++++
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQG----CNGGLMDYAFQFIMK 222

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N G+NTE+DYP  G    C  F  N
Sbjct: 223 NGGLNTEKDYPYRGFGGKCNSFLKN 247


>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI  + +  L+ +S Q++VDCD +GE + C GGF++  ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHG 206

Query: 65  INTERDYPNVGVMDNC 80
           +NTE +YP   V   C
Sbjct: 207 LNTEANYPYKAVDGKC 222


>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
 gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
          Length = 347

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 52/77 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI T NLV +S Q+LVDCD  G+++ C GGF+E  + ++    G
Sbjct: 151 GSCWAFSAVAAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGG 210

Query: 65  INTERDYPNVGVMDNCK 81
           + TE DYP  G   +C+
Sbjct: 211 LTTENDYPYKGTDGSCE 227


>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
 gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
          Length = 422

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 38/68 (55%), Positives = 51/68 (75%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GAIEGI+KIVT +LV +S Q+LVDCD +  +  C GG ++  YQ+VI+N G
Sbjct: 141 GACWSFSATGAIEGINKIVTGSLVSLSEQELVDCD-RSYNNGCEGGLMDYAYQFVIENNG 199

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 200 IDTEEDYP 207


>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
 gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  LV +S Q+LVDCD +G  + C GG ++  ++++IQN G
Sbjct: 148 GCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 207

Query: 65  INTERDYPNVGVMDNC 80
           +NTE  YP  GV   C
Sbjct: 208 LNTEAQYPYQGVDGTC 223


>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI T  LV +S QQLVDCD  G+   C GG ++  ++Y+I   G
Sbjct: 157 GCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGG 216

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP  G   +C+
Sbjct: 217 LTTESSYPYRGTDGSCR 233


>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI T  LV +S QQLVDCD  G+   C GG ++  ++Y+I   G
Sbjct: 157 GCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGG 216

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP  G   +C+
Sbjct: 217 LTTESSYPYRGTDGSCR 233


>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
          Length = 469

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 54/82 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +  +E I++IVT  +V +S Q+LV+CD  G+S  C GG ++  ++++I+N G
Sbjct: 166 GSCWAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGG 225

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   +   C V + N
Sbjct: 226 IDTEDDYPYKAIDGRCDVLRKN 247


>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
 gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
 gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
 gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  LV +S Q+LVDCD +G  + C GG ++  ++++IQN G
Sbjct: 149 GCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           +NTE  YP  GV   C  
Sbjct: 209 LNTEAQYPYQGVDGTCSA 226


>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
 gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
          Length = 343

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 35/76 (46%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  L+ +S Q+LVDCD +G  + C GG ++  ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 206

Query: 65  INTERDYPNVGVMDNC 80
           +NTE +YP  GV   C
Sbjct: 207 LNTEANYPYQGVDGTC 222


>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI  + +  L+ +S Q++VDCD +GE + C GGF++  ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHG 206

Query: 65  INTERDYPNVGVMDNC 80
           +NTE +YP   V   C
Sbjct: 207 LNTEANYPYKAVDGKC 222


>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 54/82 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I++IVT  +V +S Q+LV+CD  G+S  C GG ++  ++++I+N G
Sbjct: 170 GSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGG 229

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C V + N
Sbjct: 230 IDTEDDYPYKAVDGRCDVLRKN 251


>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
          Length = 473

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 54/82 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I++IVT  +V +S Q+LV+CD  G+S  C GG ++  ++++I+N G
Sbjct: 170 GSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGG 229

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C V + N
Sbjct: 230 IDTEDDYPYKAVDGRCDVLRKN 251


>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
          Length = 433

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  ++++I+N G
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 217

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+T++DYP  GV   C   + N
Sbjct: 218 IDTDKDYPYKGVDGTCDQIRKN 239


>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
 gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
           Precursor
 gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
 gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
 gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
          Length = 462

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  ++++I+N G
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 217

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+T++DYP  GV   C   + N
Sbjct: 218 IDTDKDYPYKGVDGTCDQIRKN 239


>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
 gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
          Length = 352

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 35/71 (49%), Positives = 51/71 (71%), Gaps = 1/71 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT  L+ +S Q+LVDCD    +  C GG ++  +Q++I N G
Sbjct: 115 GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRTYNA-GCNGGLMDYAFQFIINNGG 173

Query: 65  INTERDYPNVG 75
           ++TE+DYP VG
Sbjct: 174 LDTEKDYPYVG 184


>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
 gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
          Length = 462

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  ++++I+N G
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 217

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+T++DYP  GV   C   + N
Sbjct: 218 IDTDKDYPYKGVDGTCDQIRKN 239


>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
 gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
 gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
 gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+K+ T  L+ +S Q++VDCD +GE + C GG ++  ++++ QN+G
Sbjct: 145 GCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 205 LTTEANYPYKGTDGTC 220


>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
          Length = 343

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 37/78 (47%), Positives = 49/78 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ + A EGI KI T  LV +S Q+LVDCD  G  + C GG ++  ++++IQN G
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNG 206

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE  YP  GV   CK 
Sbjct: 207 ISTEAGYPYQGVDGTCKA 224


>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/76 (51%), Positives = 52/76 (68%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GAIEGI+KIVT +LV +S Q+L+DCD    S  C GG ++  YQ+VI N+G
Sbjct: 145 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNS-GCGGGLMDFAYQFVIDNKG 203

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP      +C
Sbjct: 204 IDTEDDYPYQARQRSC 219


>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
 gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
          Length = 375

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+   A+EGI+KIVT  L+ +S Q+LVDCDN   ++ C GG ++  +Q++++N G
Sbjct: 167 GSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDNS-YNQGCNGGLMDYAFQFIMKNGG 225

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           + TE+DYP  G    C  F  N
Sbjct: 226 LKTEKDYPYRGFGGKCNSFLKN 247


>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
          Length = 343

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 37/78 (47%), Positives = 49/78 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ + A EGI KI T  LV +S Q+LVDCD  G  + C GG ++  ++++IQN G
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNG 206

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE  YP  GV   CK 
Sbjct: 207 ISTEAGYPYQGVDGTCKA 224


>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
          Length = 339

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+K+ T NL+ +S Q+LVDCD +G  + C GG ++  + ++I N+G
Sbjct: 143 GCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSFIINNKG 202

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G   +C
Sbjct: 203 LTTESNYPYQGTDGSC 218


>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
 gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
          Length = 397

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI+ IVT NLV +S Q+++DCD Q     C GG +E  +Q+VI N G
Sbjct: 185 GGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQ--DSGCNGGQMENAFQFVIDNGG 242

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP +     C   + N
Sbjct: 243 IDSEADYPFIATDGTCDANKAN 264


>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
          Length = 344

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 38/85 (44%), Positives = 56/85 (65%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 157 GSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 212

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI+TE DYP  G    C V + N
Sbjct: 213 NGGIDTEEDYPYKGTDGRCDVNRKN 237


>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
          Length = 361

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A EGI+K+ T  L+ +S Q+LVDCD  GE + C GG++E  ++++++N+G
Sbjct: 165 GSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKTGEDQGCEGGYMEDGFEFIVKNKG 224

Query: 65  INTERDYPNVGVMDNC 80
           I  E  YP       C
Sbjct: 225 IALEASYPYTAADGTC 240


>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 54/82 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I++IVT  +V +S Q+LV+CD  G+S  C GG ++  ++++I+N G
Sbjct: 170 GSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGG 229

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C V + N
Sbjct: 230 IDTEDDYPYKAVDGRCDVLRKN 251


>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 431

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 56/82 (68%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD    +  C GG ++  ++++I+N G
Sbjct: 148 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 206

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP  GV   C   + N
Sbjct: 207 IDTEEDYPYKGVDGRCDQTRKN 228


>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
 gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
          Length = 471

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 35/78 (44%), Positives = 52/78 (66%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EGI++IVT  LV +S Q+LVDC   G++  C GG ++  + +++ N G
Sbjct: 163 GSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVGNGG 222

Query: 65  INTERDYPNVGVMDNCKV 82
           I+T++DYP       C V
Sbjct: 223 IDTDKDYPYTARDGKCDV 240


>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
 gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score = 82.8 bits (203), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI+++ T  L+ +S Q+LVDCD  GE + C GG ++  + ++IQN+G
Sbjct: 112 GCCWAFSAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKG 171

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 172 LTTEANYPYQGADGAC 187


>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
          Length = 435

 Score = 82.8 bits (203), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI+ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 213 GGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 270

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 271 IDTEADYPFIGTDGTCDASKEN 292


>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
 gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
          Length = 337

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 50/77 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T NL+ +S Q+LVDCD       C GG++++ +++VI+N G
Sbjct: 143 GCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 202

Query: 65  INTERDYPNVGVMDNCK 81
           + TE +YP   V   CK
Sbjct: 203 LATESNYPYKAVDGKCK 219


>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
 gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+++ T  L+ +S Q++VDCD +GE + C GG ++  ++++ QN+G
Sbjct: 145 GCCWAFSAVAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 205 LTTEANYPYTGTDGTC 220


>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 50/78 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI  + +  L+ +S Q+LVDCD +G  + C GG ++  +++VIQN G
Sbjct: 147 GCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHG 206

Query: 65  INTERDYPNVGVMDNCKV 82
           +NTE +YP  GV   C V
Sbjct: 207 LNTEANYPYKGVDGKCNV 224


>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 337

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI +I T NLV +  Q+LV CD +G  + C GG++E  ++++I+N G
Sbjct: 142 GSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGVDQGCEGGYMEDGFEFIIKNGG 201

Query: 65  INTERDYPNVGVMDNC 80
           I T+ +YP  GV   C
Sbjct: 202 ITTKANYPYKGVNGTC 217


>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
          Length = 422

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 57/85 (67%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD   NQG    C GG ++  Y+++I 
Sbjct: 115 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAYEFIIN 170

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI++E DYP   V   C  ++ N
Sbjct: 171 NGGIDSEEDYPYRAVDGTCDQYRKN 195


>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
 gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
          Length = 358

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 35/76 (46%), Positives = 56/76 (73%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EG+++IVT  LV +S Q+LVDCD Q +++ C GG +++ ++++IQN G
Sbjct: 140 GSCWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQ-KNQGCNGGLMDSAFEFIIQNGG 198

Query: 65  INTERDYPNVGVMDNC 80
           +++E DYP   V  +C
Sbjct: 199 LDSEADYPYKAVSGSC 214


>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
 gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
          Length = 489

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA  G++EGI+ IVT  L  +S Q+LVDCD   E R C GG ++  YQ++I+N G
Sbjct: 156 GSCWAFATTGSVEGINAIVTGELASLSEQELVDCDTD-EDRGCSGGLMDYAYQWIIKNGG 214

Query: 65  INTERDYP 72
           ++TE DYP
Sbjct: 215 LDTEDDYP 222


>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA V A EGI+K+ T  L+ +S Q+L+DCD  G++  C  G I+  +++++QN+G
Sbjct: 147 GSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGDNGGCKWGIIQEAFKFIVQNKG 206

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP   V   C
Sbjct: 207 LATEASYPYQAVDGTC 222


>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
 gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
          Length = 358

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 35/76 (46%), Positives = 56/76 (73%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EG+++IVT  LV +S Q+LVDCD Q +++ C GG +++ ++++IQN G
Sbjct: 140 GSCWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQ-KNQGCNGGLMDSAFEFIIQNGG 198

Query: 65  INTERDYPNVGVMDNC 80
           +++E DYP   V  +C
Sbjct: 199 LDSEADYPYKAVSGSC 214


>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
 gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
 gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
 gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 50/77 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI K+ T  L+ +S QQLVDCD +G  + C GG ++  +Q++++N G
Sbjct: 111 GCCWAFSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGG 170

Query: 65  INTERDYPNVGVMDNCK 81
           + +E  YP  GV   CK
Sbjct: 171 LTSEATYPYQGVDGTCK 187


>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 361

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 47/76 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA V +IEG+ +I T  LV +S Q++VDCD  G    C GG+  +  ++V +N G
Sbjct: 159 GSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGG 218

Query: 65  INTERDYPNVGVMDNC 80
           + TE DYP VG    C
Sbjct: 219 LTTESDYPYVGSQRQC 234


>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
 gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
          Length = 306

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+K+ T  L+ +S Q++VDCD +GE + C GG ++  ++++ QN+G
Sbjct: 111 GCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKG 170

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 171 LTTEANYPYKGTDGTC 186


>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 359

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 47/76 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA V +IEG+ +I T  LV +S Q++VDCD  G    C GG+  +  ++V +N G
Sbjct: 159 GSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGG 218

Query: 65  INTERDYPNVGVMDNC 80
           + TE DYP VG    C
Sbjct: 219 LTTESDYPYVGSQRQC 234


>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
           [Oryza sativa Japonica Group]
 gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
          Length = 350

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 34/72 (47%), Positives = 48/72 (66%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI T  LV +S Q+LVDCD +GE + C GG ++T +QY+ +  G
Sbjct: 156 GCCWAFSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGG 215

Query: 65  INTERDYPNVGV 76
           +  E  YP  GV
Sbjct: 216 LAAESSYPYRGV 227


>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
 gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
          Length = 340

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI KI T+ L+ +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 146 GCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 205

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP       CK
Sbjct: 206 LTTESSYPYTATDGKCK 222


>gi|432910514|ref|XP_004078393.1| PREDICTED: cathepsin S-like [Oryzias latipes]
          Length = 339

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 38/76 (50%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG     T  LVD+S Q LVDC ++  +  C GGF+   +QYVI N+G
Sbjct: 146 GSCWAFSAVGALEGQLCRKTGKLVDLSPQNLVDCSSKYGNHGCNGGFMHQAFQYVIDNQG 205

Query: 65  INTERDYPNVGVMDNC 80
           I+++  YP VGV  NC
Sbjct: 206 IDSDAGYPYVGVTQNC 221


>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 339

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+K+ T NL+ +S Q+LVDCD +G  + C GG ++  + ++I N+G
Sbjct: 143 GCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFTFIINNKG 202

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G   +C
Sbjct: 203 LTTESNYPYQGTDGSC 218


>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
           [Zea mays]
 gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
           mays]
          Length = 465

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 155 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 210

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI+TE+DYP  G    C V + N
Sbjct: 211 NGGIDTEKDYPYKGTDGRCDVNRKN 235


>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
          Length = 468

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 157 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 212

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI+TE+DYP  G    C V + N
Sbjct: 213 NGGIDTEKDYPYKGTDGRCDVNRKN 237


>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
 gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
          Length = 463

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 152 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 207

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI+TE+DYP  G    C V + N
Sbjct: 208 NGGIDTEKDYPYKGTDGRCDVNRKN 232


>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 361

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI  + +  L+ +S Q+LVDCD +G  + C GG ++  +++VIQN G
Sbjct: 165 GCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHG 224

Query: 65  INTERDYPNVGVMDNC 80
           +NTE +YP  GV   C
Sbjct: 225 LNTEANYPYKGVDGKC 240


>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
 gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
          Length = 480

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 38/85 (44%), Positives = 57/85 (67%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           G+CW F+ + A+EGI++IVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 155 GTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 210

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI+TE+DYP  G    C V + N
Sbjct: 211 NGGIDTEKDYPYKGTDGRCDVNRKN 235


>gi|54300682|gb|AAV32964.1| cathepsin S-like [Oncorhynchus mykiss]
          Length = 246

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG     T  L+DIS+Q LVDC ++  ++ C GGF+   +QYVI N+G
Sbjct: 150 GSCWAFSSVGALEGQLMKTTGKLIDISSQNLVDCSSKYGNKGCNGGFMSQAFQYVIDNQG 209

Query: 65  INTERDYPNVGVMDNC 80
           I++++ YP  GV   C
Sbjct: 210 IDSDQSYPYXGVQQQC 225


>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 36/68 (52%), Positives = 52/68 (76%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EGI++IVT NL  +S Q+LVDCD +G +  C GG ++  +++++QN G
Sbjct: 161 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCD-RGYNMGCNGGLMDYAFEFIVQNGG 219

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 220 IDTEEDYP 227


>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
 gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
          Length = 479

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 38/82 (46%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEG++K+ T  LV +S Q+LVDCD +GE   C GG ++  + +VI+N G
Sbjct: 174 GSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCD-KGEDEGCNGGLMDYAFGFVIKNGG 232

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           ++TE DYP  G    C   + N
Sbjct: 233 LDTEADYPYKGYGTRCDRSKMN 254


>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
          Length = 423

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 56/76 (73%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I T +L+ +S Q+LVDCD +G ++ C GGF++  ++++++N G
Sbjct: 115 GSCWAFSTVAAVEGINQIATGDLISLSEQELVDCD-KGFNQGCNGGFMDYAFEFIVKNGG 173

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP  GV   C
Sbjct: 174 IDTEDDYPYKGVDGQC 189


>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
 gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
          Length = 479

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 38/82 (46%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEG++K+ T  LV +S Q+LVDCD +GE   C GG ++  + +VI+N G
Sbjct: 174 GSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCD-KGEDEGCNGGLMDYAFGFVIKNGG 232

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           ++TE DYP  G    C   + N
Sbjct: 233 LDTEADYPYKGYGTRCDRSKMN 254


>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
 gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 39/79 (49%), Positives = 54/79 (68%), Gaps = 7/79 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ VGA+EGI++IVT NL  +S Q+LVDCD   NQG    C GG ++  ++++++
Sbjct: 160 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQG----CNGGLMDYAFEFIMK 215

Query: 62  NRGINTERDYPNVGVMDNC 80
           N GI+TE DYP   V   C
Sbjct: 216 NGGIDTEEDYPYKAVDSMC 234


>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
 gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
          Length = 341

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+K+ T  L+ +S Q+LVDCD  GE + C GG ++  ++++ QN G
Sbjct: 146 GCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 206 LTTEANYPYQGTDGTC 221


>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
           Precursor
 gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
 gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD + ++  C GG +E  ++++ +N G
Sbjct: 150 GSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTK-QNEGCNGGLMEIAFEFIKKNGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE  YP  G+   C   + N
Sbjct: 209 ITTEDSYPYEGIDGKCDASKDN 230


>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 463

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 37/71 (52%), Positives = 53/71 (74%), Gaps = 7/71 (9%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I+
Sbjct: 160 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIK 215

Query: 62  NRGINTERDYP 72
           N GI+TE DYP
Sbjct: 216 NGGIDTEADYP 226


>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
 gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
          Length = 338

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 49/77 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T NL+ +S Q+LVDCD       C GG++++ +++VI+N G
Sbjct: 144 GCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 203

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP   V   CK
Sbjct: 204 LATESSYPYKAVDGKCK 220


>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
 gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+K+ T  L+ +S Q+LVDCD  GE + C GG ++  ++++ QN G
Sbjct: 146 GCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 206 LTTEANYPYQGTDGTC 221


>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
           [Vitis vinifera]
          Length = 374

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 53/78 (67%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT  L+ +S Q+LVDCD + +   C GG ++  + ++I+N G
Sbjct: 157 GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYD-MGCNGGLMDYAFDFIIKNGG 215

Query: 65  INTERDYPNVGVMDNCKV 82
           ++TE+DYP  G    C +
Sbjct: 216 LDTEKDYPYTGFDGECNL 233


>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
 gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 35/76 (46%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI+K+ T  LV +S Q+LVDCD +G  + C GG ++  ++++IQN G
Sbjct: 149 GCCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 208

Query: 65  INTERDYPNVGVMDNC 80
           ++TE  YP  GV   C
Sbjct: 209 LSTEAAYPYQGVDGTC 224


>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 463

 Score = 82.0 bits (201), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 37/71 (52%), Positives = 53/71 (74%), Gaps = 7/71 (9%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I+
Sbjct: 160 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIK 215

Query: 62  NRGINTERDYP 72
           N GI+TE DYP
Sbjct: 216 NGGIDTEADYP 226


>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
          Length = 340

 Score = 82.0 bits (201), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI KI T  L+ +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 146 GCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 205

Query: 65  INTERDYPNVGVMDNCK 81
           + TE +YP       CK
Sbjct: 206 LTTESNYPYTAADGKCK 222


>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 460

 Score = 82.0 bits (201), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 37/71 (52%), Positives = 53/71 (74%), Gaps = 7/71 (9%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I+
Sbjct: 159 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIK 214

Query: 62  NRGINTERDYP 72
           N GI+TE DYP
Sbjct: 215 NGGIDTEEDYP 225


>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
          Length = 377

 Score = 82.0 bits (201), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 53/77 (68%), Gaps = 2/77 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEGI++I    LV +S Q+LVDCD +  +  C GG++   +++V++NRG
Sbjct: 173 GSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEFVMKNRG 230

Query: 65  INTERDYPNVGVMDNCK 81
           + TER+YP  G+   C+
Sbjct: 231 LTTERNYPYQGLNGACQ 247


>gi|118140100|gb|ABK63481.1| cathepsin S [Channa argus]
          Length = 335

 Score = 82.0 bits (201), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG     T  LVD+S Q LVDC  +  +  C GGF+   +QYVI+N+G
Sbjct: 142 GSCWAFSAVGALEGQLAKTTGKLVDLSPQNLVDCSGKYGNHGCDGGFMTNAFQYVIENQG 201

Query: 65  INTERDYPNVGVMDNC 80
           I +E  YP +G+   C
Sbjct: 202 IESEASYPYIGLEQQC 217


>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 40/83 (48%), Positives = 53/83 (63%), Gaps = 3/83 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD NQ E   C GG +E  ++++ +N 
Sbjct: 150 GSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTNQNE--GCNGGLMEIAFEFIKKNG 207

Query: 64  GINTERDYPNVGVMDNCKVFQFN 86
           GI TE  YP  G+   C   + N
Sbjct: 208 GITTEDSYPYEGIDGKCDASKDN 230


>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
           cycling base population CrGC5, Peptide, 328 aa]
          Length = 328

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 53/76 (69%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+   A+EGI+KIVT  LV +S Q+LVDCD +  ++ C GG ++  +Q++++N G
Sbjct: 122 GSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGG 180

Query: 65  INTERDYPNVGVMDNC 80
           +NTE+DYP  G    C
Sbjct: 181 LNTEKDYPYHGTNGKC 196


>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
          Length = 328

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 53/76 (69%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+   A+EGI+KIVT  LV +S Q+LVDCD +  ++ C GG ++  +Q++++N G
Sbjct: 122 GSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGG 180

Query: 65  INTERDYPNVGVMDNC 80
           +NTE+DYP  G    C
Sbjct: 181 LNTEKDYPYHGTNGKC 196


>gi|357439999|ref|XP_003590277.1| Cysteine protease [Medicago truncatula]
 gi|355479325|gb|AES60528.1| Cysteine protease [Medicago truncatula]
          Length = 514

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 38/79 (48%), Positives = 55/79 (69%), Gaps = 2/79 (2%)

Query: 4   LGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNR 63
           +GSCW F+  GAIEG++ IVT +L+ +S Q+LVDCD   +   C GG+++  +++VI N 
Sbjct: 205 VGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTTND--GCEGGYMDYAFEWVINNG 262

Query: 64  GINTERDYPNVGVMDNCKV 82
           GI+TE DYP +GV   C V
Sbjct: 263 GIDTEADYPYIGVGGTCNV 281


>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 415

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V ++EGI K+ T  L+ +S Q+LVDCD  G  + C GG ++  ++++I N G
Sbjct: 219 GCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDGMDQGCEGGLMDNAFEFIIDNGG 278

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G  D+C
Sbjct: 279 LTTEGNYPYTGTDDSC 294


>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI +I T  LV +S Q+LVDC  +GES  C+GG+++  ++++ +  G
Sbjct: 146 GSCWAFSAVAATEGIHQITTGKLVPLSEQELVDC-VKGESEGCIGGYVDDAFEFIAKKGG 204

Query: 65  INTERDYPNVGVMDNCKV 82
           I +E  YP  GV   CKV
Sbjct: 205 IASETHYPYKGVNKTCKV 222


>gi|413953048|gb|AFW85697.1| hypothetical protein ZEAMMB73_051316 [Zea mays]
          Length = 298

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 47/76 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA V +IEG+ +I T  LV +S QQ+VDCD  G    C GG+  +  ++V +N G
Sbjct: 98  GSCWAFATVASIEGVHQIKTGRLVSLSEQQIVDCDRGGNDHGCHGGYPRSAMEWVTRNGG 157

Query: 65  INTERDYPNVGVMDNC 80
           + TE DYP VG    C
Sbjct: 158 LTTESDYPYVGSQRQC 173


>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
 gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 49/78 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  LV +S Q+LVDCD +G  + C GG ++  ++++IQN G
Sbjct: 149 GCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           ++TE  YP  GV   C  
Sbjct: 209 LHTEAQYPYQGVDGTCSA 226


>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 336

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A EGI +I T  LV +S Q+LVDCD +G  + C GG++E  ++++I+N G
Sbjct: 143 GSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGG 202

Query: 65  INTERDYPNVGVMDNC 80
           I +E +YP   V   C
Sbjct: 203 ITSETNYPYKAVDGKC 218


>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
 gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
          Length = 475

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 54/78 (69%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEG++ IVT +L+ +S Q+LVDCD   +   C GG+++  +++VI N G
Sbjct: 146 GSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTTND--GCEGGYMDYAFEWVINNGG 203

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +GV   C V
Sbjct: 204 IDTEADYPYIGVGGTCNV 221


>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
          Length = 372

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 56/82 (68%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIV+  L+ +S Q+LVDCD   ++  C GG ++  +Q++I N G
Sbjct: 158 GSCWAFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDA-GCNGGLMDYAFQFIIDNGG 216

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE+DYP +G  + C   + N
Sbjct: 217 IDTEKDYPYLGFNNQCDPTKKN 238


>gi|413947586|gb|AFW80235.1| hypothetical protein ZEAMMB73_542371 [Zea mays]
          Length = 264

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 49/77 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T NLV +S Q+LVDCD       C GG++++ +++VI+N G
Sbjct: 149 GCCWAFSAVAAVEGIVKLSTGNLVSLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 208

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP   V   CK
Sbjct: 209 LATESSYPYKAVDGKCK 225


>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++++ T  L+ +S Q+LVDCD  GE + C GG +++ ++++I N G
Sbjct: 124 GCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGG 183

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  GV   C
Sbjct: 184 LTTEANYPYKGVDATC 199


>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
          Length = 340

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++++ T  L+ +S Q+LVDCD  GE + C GG +++ ++++I N G
Sbjct: 144 GCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGG 203

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  GV   C
Sbjct: 204 LTTEANYPYKGVDATC 219


>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 154 GSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 212

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP  GV   C
Sbjct: 213 IDTEEDYPYKGVDGRC 228


>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
 gi|194703250|gb|ACF85709.1| unknown [Zea mays]
 gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
          Length = 356

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 53/77 (68%), Gaps = 2/77 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEGI++I    LV +S Q+LVDCD +  +  C GG++   +++V++NRG
Sbjct: 152 GSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEFVMKNRG 209

Query: 65  INTERDYPNVGVMDNCK 81
           + TER+YP  G+   C+
Sbjct: 210 LTTERNYPYQGLNGACQ 226


>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score = 81.3 bits (199), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI +I T  LV +S Q+LVDC  +GES  C+GG+++  ++++ +  G
Sbjct: 146 GSCWAFSAVAATEGIHQITTGKLVPLSEQELVDC-VKGESEGCIGGYVDDAFEFIAKKGG 204

Query: 65  INTERDYPNVGVMDNCKV 82
           I +E  YP  GV   CKV
Sbjct: 205 IASETHYPYKGVNKTCKV 222


>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
          Length = 441

 Score = 81.3 bits (199), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 55/82 (67%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 148 GSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 206

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP  GV   C   + N
Sbjct: 207 IDTEEDYPYKGVDGRCDQTRKN 228


>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score = 81.3 bits (199), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 39/76 (51%), Positives = 53/76 (69%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GAIEGI+KIVT +LV +S Q+LVDCD +  +  C GG ++  +Q+VI N G
Sbjct: 140 GACWSFSATGAIEGINKIVTGSLVSLSEQELVDCD-KSYNNGCEGGIMDYAFQFVIDNHG 198

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP  G   +C
Sbjct: 199 IDTEEDYPYQGRDRSC 214


>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
           Precursor
 gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
           thaliana]
 gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
 gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score = 81.3 bits (199), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 35/68 (51%), Positives = 48/68 (70%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD + E++ C GG +E  ++++  N G
Sbjct: 148 GSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGG 206

Query: 65  INTERDYP 72
           I TE  YP
Sbjct: 207 IKTEETYP 214


>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 348

 Score = 81.3 bits (199), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 37/81 (45%), Positives = 53/81 (65%), Gaps = 3/81 (3%)

Query: 1   PHPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVI 60
           P+P GSCW F+ V  +EGI+KIVT  L+ +S Q+L+DCD +  S  C GG+  T  QYV+
Sbjct: 152 PNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR--SHGCKGGYQTTSLQYVV 209

Query: 61  QNRGINTERDYPNVGVMDNCK 81
            N G++TE++YP       C+
Sbjct: 210 DN-GVHTEKEYPYEKKQGKCR 229


>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
          Length = 433

 Score = 81.3 bits (199), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI KI T  LV ++ Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 239 GCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 298

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP       CK
Sbjct: 299 LTTESSYPYTAADGKCK 315


>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score = 81.3 bits (199), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI +I T  LV +S Q+LVDC  +GES  C+GG+++  ++++ +  G
Sbjct: 146 GSCWAFSAVAATEGIHQITTGKLVPLSEQELVDC-VKGESEGCIGGYVDDAFEFIAKKGG 204

Query: 65  INTERDYPNVGVMDNCKV 82
           I +E  YP  GV   CKV
Sbjct: 205 IASETHYPYKGVNKTCKV 222


>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
 gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score = 81.3 bits (199), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 52/76 (68%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EG++KIVT  L+ +S Q+LVDCD +  +  C GG ++  +Q++I N G
Sbjct: 160 GSCWAFSTIAAVEGVNKIVTGELISLSEQELVDCD-RSYNAGCNGGLMDNAFQFIINNGG 218

Query: 65  INTERDYPNVGVMDNC 80
           I+T++DYP   V   C
Sbjct: 219 IDTDKDYPYQAVDGKC 234


>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 363

 Score = 81.3 bits (199), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 35/68 (51%), Positives = 48/68 (70%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD + E++ C GG +E  ++++  N G
Sbjct: 147 GSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGG 205

Query: 65  INTERDYP 72
           I TE  YP
Sbjct: 206 IKTEETYP 213


>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
 gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
          Length = 456

 Score = 81.3 bits (199), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 55/82 (67%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 150 GSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C  ++ N
Sbjct: 209 IDTEDDYPYLGRDGRCDTYRKN 230


>gi|357518983|ref|XP_003629780.1| Cysteine proteinase [Medicago truncatula]
 gi|355523802|gb|AET04256.1| Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score = 81.3 bits (199), Expect = 8e-14,   Method: Composition-based stats.
 Identities = 39/76 (51%), Positives = 52/76 (68%), Gaps = 2/76 (2%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           S W F+V GAIEG++KIVT NL+++S Q+LVDCD    S+ C GGF    + YVI+N GI
Sbjct: 164 SHWAFSVTGAIEGLNKIVTGNLINLSAQELVDCDPA--SKGCAGGFYFNAFGYVIENGGI 221

Query: 66  NTERDYPNVGVMDNCK 81
           +TE +YP +     CK
Sbjct: 222 DTEANYPYLAKNGTCK 237


>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
          Length = 343

 Score = 81.3 bits (199), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  L+ +S Q+LVDCD +G  + C GG ++  ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 206

Query: 65  INTERDYPNVGVMDNC 80
           ++TE  YP  GV   C
Sbjct: 207 LSTEAQYPYEGVDGTC 222


>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
 gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
          Length = 455

 Score = 81.3 bits (199), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  ++++I+N G
Sbjct: 152 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 210

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+T++DYP  GV   C   + N
Sbjct: 211 IDTDKDYPYKGVDGTCDQIRKN 232


>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
          Length = 344

 Score = 80.9 bits (198), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  L+ +S Q+LVDCD +G  + C GG ++  ++++IQN G
Sbjct: 148 GCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 207

Query: 65  INTERDYPNVGVMDNC 80
           ++TE  YP  GV   C
Sbjct: 208 LSTEAQYPYEGVDGTC 223


>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
 gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
          Length = 398

 Score = 80.9 bits (198), Expect = 8e-14,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 49/76 (64%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 179 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 236

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP +G    C
Sbjct: 237 IDTEADYPFIGTDGTC 252


>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
          Length = 292

 Score = 80.9 bits (198), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI ++ T  LV +S Q+L+DCD +G  + C GG ++  ++++IQN G
Sbjct: 96  GSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHG 155

Query: 65  INTERDYPNVGVMDNC 80
           ++TE  YP  GV   C
Sbjct: 156 LSTEVQYPYEGVDGTC 171


>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
          Length = 272

 Score = 80.9 bits (198), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 50/76 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI ++ T  LV +S Q+L+DCD +G  + C GG ++  ++++IQN G
Sbjct: 76  GSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHG 135

Query: 65  INTERDYPNVGVMDNC 80
           ++TE  YP  GV   C
Sbjct: 136 LSTEVQYPYEGVDGTC 151


>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
 gi|194689328|gb|ACF78748.1| unknown [Zea mays]
 gi|219886279|gb|ACL53514.1| unknown [Zea mays]
 gi|238010470|gb|ACR36270.1| unknown [Zea mays]
 gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
          Length = 354

 Score = 80.9 bits (198), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 36/80 (45%), Positives = 49/80 (61%), Gaps = 1/80 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW FA V A+EGI +I T NLV +S QQ++DCD  G +  C GG+I+  +QY++ N G
Sbjct: 165 GCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNN-GCNGGYIDNAFQYIVGNGG 223

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           + TE  YP       C+  Q
Sbjct: 224 LGTEDAYPYTAAQAMCQSVQ 243


>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
 gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 343

 Score = 80.9 bits (198), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI  +    L+ +S Q++VDCD +GE + C GGF++  ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHG 206

Query: 65  INTERDYPNVGVMDNC 80
           +N E +YP   V   C
Sbjct: 207 LNNEPNYPYKAVDGKC 222


>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 304

 Score = 80.9 bits (198), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW FA V A+EG++KI    LV +S QQLVDC     +  C GG   T Y Y+ +N+G
Sbjct: 127 GCCWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQG 186

Query: 65  INTERDYPNVGVMDNCK 81
           I +E +YP   V   CK
Sbjct: 187 ITSEENYPYQAVQQTCK 203


>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score = 80.9 bits (198), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 55/82 (67%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 159 GSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 217

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C  ++ N
Sbjct: 218 IDTEDDYPYLGRDGRCDTYRKN 239


>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 457

 Score = 80.9 bits (198), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 37/82 (45%), Positives = 55/82 (67%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VG++EGI+ I T + + +S Q+LVDCD +  ++ C GG ++  + +VIQN G
Sbjct: 155 GSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKK-YNQGCNGGLMDYAFDFVIQNGG 213

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE+DYP  G    C V + N
Sbjct: 214 IDTEKDYPYQGYDGRCDVNKMN 235


>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
          Length = 385

 Score = 80.9 bits (198), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 36/71 (50%), Positives = 49/71 (69%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA + A+EGI+KIVT NL+ +S Q++VDC  +  +  C GG +   YQ++I N G
Sbjct: 155 GSCWTFASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGG 214

Query: 65  INTERDYPNVG 75
           INTE +YP  G
Sbjct: 215 INTEANYPYTG 225


>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 80.9 bits (198), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI  +    L+ +S Q++VDCD +GE + C GGF++  ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHG 206

Query: 65  INTERDYPNVGVMDNC 80
           +N E +YP   V   C
Sbjct: 207 LNNEPNYPYKAVDGKC 222


>gi|413956349|gb|AFW88998.1| hypothetical protein ZEAMMB73_678859 [Zea mays]
          Length = 1140

 Score = 80.9 bits (198), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 780 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 835

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI+TE+DYP  G    C V + N
Sbjct: 836 NGGIDTEKDYPYKGTDGRCDVNRKN 860


>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
          Length = 368

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 51/78 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT NL+ +S Q+LVDC     ++ C GG++   ++++I N G
Sbjct: 145 GSCWAFSAIAAVEGINKIVTGNLISLSEQELVDCGRTQSTKGCDGGYMTDGFEFIINNGG 204

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C +
Sbjct: 205 INTEENYPYTAQEGQCDL 222


>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
          Length = 396

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 30/77 (38%), Positives = 52/77 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI+ I T  LV +S Q+LV C  +G ++ C GG ++  ++++++N G
Sbjct: 187 GSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFEWIVENGG 246

Query: 65  INTERDYPNVGVMDNCK 81
           +++E+ Y      D+CK
Sbjct: 247 VDSEKQYQYKASFDDCK 263


>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
 gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
          Length = 364

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+ IVT   V +S Q+LVDCD + +   C GG ++  +Q++IQN G
Sbjct: 147 GSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYD-EGCNGGLMDYAFQFIIQNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP  G+   C
Sbjct: 206 IDTEEDYPYQGIDGTC 221


>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
 gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
          Length = 299

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 167 GSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGG 225

Query: 65  INTERDYPNVGVMDNC 80
           I++E DYP   V   C
Sbjct: 226 IDSEDDYPYKAVDGRC 241


>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG  K+ T  LV +S QQLV CD +GE + C GG ++  + ++I+N G
Sbjct: 151 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 210

Query: 65  INTERDYPNVGVMDNC 80
           +  E DYP     D C
Sbjct: 211 LAAESDYPYTASDDKC 226


>gi|413945959|gb|AFW78608.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
          Length = 289

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 35/68 (51%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI+KI T +LV +S Q+L+DCD    S  C GG ++  Y++VI+N G
Sbjct: 159 GACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNS-GCGGGLMDYAYKFVIKNGG 217

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 218 IDTEEDYP 225


>gi|157278117|ref|NP_001098157.1| cathepsin S precursor [Oryzias latipes]
 gi|50251130|dbj|BAD27582.1| cathepsin S [Oryzias latipes]
          Length = 327

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 36/77 (46%), Positives = 49/77 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L  +S Q LVDC  +  +  C GGF+   +QYVI+N+G
Sbjct: 134 GSCWAFSAVGALEGQLKKTTGILTSLSPQNLVDCSTKYGNYGCKGGFMSNAFQYVIKNQG 193

Query: 65  INTERDYPNVGVMDNCK 81
           I+++  YP +G  D CK
Sbjct: 194 ISSDAAYPYIGKRDKCK 210


>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 364

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+ IVT   V +S Q+LVDCD + +   C GG ++  +Q++IQN G
Sbjct: 147 GSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYD-EGCNGGLMDYAFQFIIQNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP  G+   C
Sbjct: 206 IDTEEDYPYQGIDGTC 221


>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 439

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI  +    L+ +S Q+LVDCD +G  + C GG ++  Y+++IQN G
Sbjct: 243 GCCWAFSAVAATEGIHALSGGKLISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNHG 302

Query: 65  INTERDYPNVGVMDNC 80
           +NTE +YP  GV   C
Sbjct: 303 LNTEANYPYKGVDGKC 318


>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
          Length = 427

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 53/76 (69%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD    S  C GG ++  ++++I N G
Sbjct: 117 GSCWAFSAIGAVEGINKIVTGDLITLSEQELVDCDTSYNS-GCDGGLMDYAFRFIINNGG 175

Query: 65  INTERDYPNVGVMDNC 80
           I+T++DYP      +C
Sbjct: 176 IDTDKDYPYKATDGSC 191


>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
          Length = 365

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+++ T  L+ +S Q+LVDCD  GE + C GG ++  + ++ QN G
Sbjct: 145 GCCWAFSAVAAMEGITQLKTGKLISLSEQELVDCDTNGEDQGCEGGLMDYAFDFIQQNHG 204

Query: 65  INTERDYPNVGVMDNC 80
           ++TE +YP  G    C
Sbjct: 205 LSTETNYPYSGTDGTC 220


>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
 gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
          Length = 314

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG  K+ T  LV +S QQLV CD +GE + C GG ++  + ++I+N G
Sbjct: 116 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 175

Query: 65  INTERDYPNVGVMDNC 80
           +  E DYP     D C
Sbjct: 176 LAAESDYPYTASDDKC 191


>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI  +    L+ +S Q+LVDCD +G  + C GG ++  ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 206

Query: 65  INTERDYPNVGVMDNC 80
           +NTE +YP  GV   C
Sbjct: 207 LNTEANYPYKGVDGKC 222


>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 345

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 38/78 (48%), Positives = 52/78 (66%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEGI +I T+ LV +S Q+LVDC  +GES  C GG++E  +++V +  G
Sbjct: 150 GSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDC-VKGESEGCNGGYMEDAFEFVAKKGG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           I +E  YP  G   +CKV
Sbjct: 209 IASESYYPYKGKDKSCKV 226


>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
          Length = 349

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 30/71 (42%), Positives = 48/71 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EG+ ++ T  L+ +S Q+LVDCD +GE   C GG ++T + ++++N+G
Sbjct: 153 GCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKG 212

Query: 65  INTERDYPNVG 75
           + TE +YP  G
Sbjct: 213 LTTEANYPYKG 223


>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
          Length = 340

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI KI T  LV ++ Q+LVDCD  GE + C GG ++  ++++I N G
Sbjct: 146 GCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIINNGG 205

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP       CK
Sbjct: 206 LTTESSYPYTAADGKCK 222


>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
          Length = 469

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 38/85 (44%), Positives = 56/85 (65%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 157 GSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 212

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI+TE DYP  G    C V + N
Sbjct: 213 NGGIDTEEDYPYKGTDGRCDVNRKN 237


>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
 gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
          Length = 469

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 38/85 (44%), Positives = 56/85 (65%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 157 GSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 212

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI+TE DYP  G    C V + N
Sbjct: 213 NGGIDTEEDYPYKGTDGRCDVNRKN 237


>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
 gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
          Length = 469

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 55/82 (67%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD +  +  C GG ++  +Q++I N G
Sbjct: 163 GSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCD-RSYNEGCNGGLMDYAFQFIINNGG 221

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP +     C  ++ N
Sbjct: 222 IDSEEDYPYLARDGTCDTYRKN 243


>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
 gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
          Length = 341

 Score = 80.1 bits (196), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+K+ T NL+ +S Q+LVDCD +G  + C GG ++  + ++I N+G
Sbjct: 145 GCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSFIINNKG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G   +C
Sbjct: 205 LTTESNYPYQGTDGSC 220


>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 80.1 bits (196), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI  +    L+ +S Q++VDCD +GE + C GGF++  ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHG 206

Query: 65  INTERDYPNVGVMDNC 80
           +N E +YP   V   C
Sbjct: 207 LNNEPNYPYKAVDGKC 222


>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
          Length = 340

 Score = 80.1 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 31/68 (45%), Positives = 48/68 (70%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI+K+ T  L+ +S Q++VDCD   + + C GG ++  ++Y+I+N+G
Sbjct: 144 GSCWAFSAVAATEGITKLSTGKLISLSEQEVVDCDVTSDDQGCNGGEMDDAFEYIIKNKG 203

Query: 65  INTERDYP 72
           I TE +YP
Sbjct: 204 ITTEANYP 211


>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score = 80.1 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 35/82 (42%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 161 GSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 219

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +     C  ++ N
Sbjct: 220 IDTEEDYPYLARDGRCDTYRKN 241


>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
          Length = 365

 Score = 80.1 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KI T  LV +S Q+L+DC N GE+  C GG ++  +Q++ QN G
Sbjct: 154 GSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDC-NIGENDGCNGGLMDVAFQFIQQNGG 212

Query: 65  INTERDYPNVGVMDNC 80
           I TE  YP  G  ++C
Sbjct: 213 ITTEASYPYQGEQNSC 228


>gi|226821425|gb|ACO82388.1| cathepsin S [Lutjanus argentimaculatus]
          Length = 337

 Score = 80.1 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG     T  LVD+S Q LVDC  +  +  C GGF++  +QYVI N+G
Sbjct: 144 GSCWAFSAVGALEGQLAKKTGKLVDLSPQNLVDCSTKYGNHGCNGGFMDHAFQYVIDNQG 203

Query: 65  INTERDYPNVGVMDNC 80
           I+++  YP  G  D C
Sbjct: 204 IDSDASYPYTGRSDQC 219


>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
          Length = 380

 Score = 80.1 bits (196), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 51/75 (68%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW FA +  +E I++I+T +L+ +S Q+LVDC+    +  C GGF++  Y+++I N GI
Sbjct: 149 SCWAFATIATVESINQIITGDLISLSEQELVDCNRTPINEGCKGGFMDDAYEFIINNGGI 208

Query: 66  NTERDYPNVGVMDNC 80
           NTE +YP +G  D C
Sbjct: 209 NTEENYPYIGQDDQC 223


>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
          Length = 314

 Score = 80.1 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG  K+ T  LV +S QQLV CD +GE + C GG ++  + ++I+N G
Sbjct: 116 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 175

Query: 65  INTERDYPNVGVMDNC 80
           +  E DYP     D C
Sbjct: 176 LAAESDYPYTASDDKC 191


>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
          Length = 457

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 55/82 (67%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT +L+ +S Q+LVDCDN   +  C GG ++  ++++I N G
Sbjct: 151 GSCWAFSAVAAVEGINKIVTGDLISLSEQELVDCDNS-YNEGCNGGLMDYGFEFIINNGG 209

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP +     C  ++ N
Sbjct: 210 IDSEEDYPYLARDGRCDTYRKN 231


>gi|310942960|pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
          Length = 220

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 50/78 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KI T +L+ +S Q+LVDC     +R C GGF+   +Q++I N G
Sbjct: 23  GSCWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQNTRGCDGGFMTDGFQFIINNGG 82

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C +
Sbjct: 83  INTEANYPYTAEEGQCNL 100


>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
 gi|223948637|gb|ACN28402.1| unknown [Zea mays]
 gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
          Length = 354

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 36/80 (45%), Positives = 49/80 (61%), Gaps = 1/80 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW FA V A+EGI +I T NLV +S QQ++DCD +G +  C GG+I+  +QY+  N G
Sbjct: 165 GCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTEGNN-GCNGGYIDNAFQYIAGNGG 223

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           + TE  YP       C+  Q
Sbjct: 224 LATEDAYPYTAAQAMCQSVQ 243


>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 470

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 35/82 (42%), Positives = 53/82 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+E I+++VT  LV +S Q+LV+CD  G+S  C GG ++  + ++I N G
Sbjct: 167 GSCWAFSAVSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINNGG 226

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   +   C + + N
Sbjct: 227 IDTEDDYPYKALDGKCDINRRN 248


>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
 gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT  LV +S Q+LVDCD    +  C GG +E  ++++I N G
Sbjct: 167 GSCWAFSTIAAVEGINKIVTGELVSLSEQELVDCDRTVNA-GCDGGLMEYAFEFIINNGG 225

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+++ DYP  GV   C  ++ N
Sbjct: 226 IDSDEDYPYRGVDGKCDQYKKN 247


>gi|223673161|gb|ACN12762.1| Cathepsin S precursor [Salmo salar]
          Length = 330

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG     T  L+D+S Q LVDC ++  ++ C GGF+   +QYVI N+G
Sbjct: 137 GSCWAFSSVGALEGQLMKTTGKLIDLSPQNLVDCSSKYGNKGCHGGFMTKAFQYVIDNQG 196

Query: 65  INTERDYPNVGVMDNC 80
           I +++ YP  GV   C
Sbjct: 197 IASDQSYPYKGVQQQC 212


>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
          Length = 349

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 30/71 (42%), Positives = 49/71 (69%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG+ ++ T  L+ +S Q+LVDCD +GE   C GG ++T + ++++N+G
Sbjct: 153 GCCWAFSAVAAMEGLHQLKTGELIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKG 212

Query: 65  INTERDYPNVG 75
           + TE +YP  G
Sbjct: 213 LTTEVNYPYKG 223


>gi|222625810|gb|EEE59942.1| hypothetical protein OsJ_12596 [Oryza sativa Japonica Group]
          Length = 213

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 34/72 (47%), Positives = 48/72 (66%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          G CW F+ V A+EG++KI T  LV +S Q+LVDCD +GE + C GG ++T +QY+ +  G
Sbjct: 19 GCCWAFSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGG 78

Query: 65 INTERDYPNVGV 76
          +  E  YP  GV
Sbjct: 79 LAAESSYPYRGV 90


>gi|46576360|sp|P60994.1|ERVB_TABDI RecName: Full=Ervatamin-B; Short=ERV-B
 gi|30749291|pdb|1IWD|A Chain A, Proposed Amino Acid Sequence And The 1.63 Angstrom X-ray
           Crystal Structure Of A Plant Cysteine Protease Ervatamin
           B: Insight Into The Structural Basis Of Its Stability
           And Substrate Specificity
          Length = 215

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 53/81 (65%), Gaps = 2/81 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+E I+KI T  L+ +S Q+LVDCD    S  C GG++   +QY+I N G
Sbjct: 23  GSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDT--ASHGCNGGWMNNAFQYIITNGG 80

Query: 65  INTERDYPNVGVMDNCKVFQF 85
           I+T+++YP   V  +CK ++ 
Sbjct: 81  IDTQQNYPYSAVQGSCKPYRL 101


>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A EGI +I T NLV +S Q+LVDCD+  +   C GGF+E  ++++I+N G
Sbjct: 149 GSCWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSVDD--GCEGGFMEDGFEFIIKNGG 206

Query: 65  INTERDYPNVGVMDNC 80
           I +E +YP  GV   C
Sbjct: 207 ITSETNYPYKGVDGTC 222


>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
 gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
          Length = 323

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 36/77 (46%), Positives = 49/77 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EG   + T +LV +S Q LVDC +   ++ C GG+    YQY+I NRG
Sbjct: 128 GSCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRG 187

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP   + DNC+
Sbjct: 188 IDTESSYPYKAIDDNCR 204


>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 341

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI+K+ T  L+ +S Q+LVDCD +G  + C GG ++  +++++QN+G
Sbjct: 145 GCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP  G    C
Sbjct: 205 LATEAIYPYEGFDGTC 220


>gi|414591546|tpg|DAA42117.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
 gi|414591547|tpg|DAA42118.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
          Length = 268

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 53/76 (69%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EG++KI+T  LV +S Q+LVDCD+  +++ C GG ++  +QY+ +N G
Sbjct: 161 GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDV-DNQGCDGGLMDYAFQYIQRNGG 219

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP +    +C
Sbjct: 220 VTTESNYPYLAEQRSC 235


>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
          Length = 382

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA V +IEG+ +I T  LV +S Q++VDCD  G    C GG   +  ++V +N G
Sbjct: 183 GSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGG 242

Query: 65  INTERDYPNVGVMDNC 80
           + TE DYP VG    C
Sbjct: 243 LTTESDYPYVGSQRQC 258


>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI+K+ T  L+ +S Q+LVDCD +G  + C GG ++  +++++QN+G
Sbjct: 147 GCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKG 206

Query: 65  INTERDYPNVGVMDNC 80
           +  E  YP  GV   C
Sbjct: 207 LAAEAIYPYEGVDGTC 222


>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
 gi|194706676|gb|ACF87422.1| unknown [Zea mays]
 gi|413920745|gb|AFW60677.1| vignain [Zea mays]
          Length = 363

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 36/80 (45%), Positives = 50/80 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ VGA+EG+  I T NLV +S QQ++DCD    ++ C GG+++  +QYVI N G
Sbjct: 173 GCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVINNGG 232

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           + TE  YP   V   C+  Q
Sbjct: 233 VTTEDAYPYSAVQGTCQNVQ 252


>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
 gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
          Length = 362

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 38/82 (46%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+L+DCDNQ E++ C GG +E  ++Y+ Q  G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCDNQ-ENQGCNGGLMEYAFEYIKQKGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE  YP      +C   + N
Sbjct: 209 ITTESYYPYTANDGSCDATKEN 230


>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 36/79 (45%), Positives = 54/79 (68%), Gaps = 7/79 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+  G++EG++KIVT +L+ +S Q+LV+CD   NQG    C GG ++  ++++I+
Sbjct: 162 GSCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTSYNQG----CNGGLMDYAFEFIIK 217

Query: 62  NRGINTERDYPNVGVMDNC 80
           N GI+TE DYP  G    C
Sbjct: 218 NGGIDTEEDYPYTGKDGKC 236


>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
 gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD +  +  C GG ++  +QY++ N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RSFNNGCYGGLMDYAFQYIMSNSG 211

Query: 65  INTERDYP 72
           +  E DYP
Sbjct: 212 LRKEEDYP 219


>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
          Length = 337

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LVD+S Q LVDC  +  ++ C GGF++  +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSLKYGNKGCNGGFMDRAFQYVIDNKG 203

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP  G +  C
Sbjct: 204 IDSEASYPYRGQLQQC 219


>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
 gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
          Length = 371

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 52/76 (68%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KI T  LV +S Q+LVDCD+  +++ C GG ++  +QY+ +N G
Sbjct: 160 GSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDV-DNQGCNGGLMDYAFQYIKRNGG 218

Query: 65  INTERDYPNVGVMDNC 80
           I TE +YP +    +C
Sbjct: 219 ITTESNYPYLAEQRSC 234


>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
          Length = 494

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 51/78 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT  LV +S Q+LV+C   G++  C GG ++  + ++ +N G
Sbjct: 178 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGG 237

Query: 65  INTERDYPNVGVMDNCKV 82
           ++TE DYP   +   C +
Sbjct: 238 LDTEEDYPYTAMDGKCNL 255


>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD +  +  C GG ++  +QY++ N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RSFNNGCYGGLMDYAFQYIMSNSG 211

Query: 65  INTERDYP 72
           +  E DYP
Sbjct: 212 LRKEEDYP 219


>gi|214015305|gb|ACJ62269.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 255

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI+ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211


>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
          Length = 344

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI++I T  L+ +S Q+LVDCD +G    C GG ++T ++++I N G
Sbjct: 148 GCCWAFSAVAAIEGITQISTGKLISLSEQELVDCDTKGIDHGCEGGLMDTAFEFIINNGG 207

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP  G    C
Sbjct: 208 LTTESNYPYKGEDGTC 223


>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
          Length = 462

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT  L+ +S Q+LVDCD    +  C GG ++  ++++I+N G
Sbjct: 161 GSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 219

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP  G    C
Sbjct: 220 IDTEADYPYTGRYGRC 235


>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
           Precursor
 gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
 gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 490

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 51/78 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT  LV +S Q+LV+C   G++  C GG ++  + ++ +N G
Sbjct: 178 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGG 237

Query: 65  INTERDYPNVGVMDNCKV 82
           ++TE DYP   +   C +
Sbjct: 238 LDTEEDYPYTAMDGKCNL 255


>gi|116788286|gb|ABK24823.1| unknown [Picea sitchensis]
          Length = 294

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/82 (50%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+  GAIEGI+KIVT +LV +S Q+L DCD    S  C GG ++  +Q+VI N G
Sbjct: 148 GDCWAFSATGAIEGINKIVTGSLVSLSEQELCDCDTSYNS-GCDGGLMDYAFQWVIVNGG 206

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP  GV   C   + N
Sbjct: 207 IDTEVDYPYKGVQKACNSKKVN 228


>gi|214015353|gb|ACJ62293.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 254

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI+ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210


>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 54/76 (71%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++EGI+KIVT +L+ +S Q+LVDCDN+  S  C GG ++  +Q+++ N G
Sbjct: 150 GSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNS-GCNGGSMDYAFQFIVSNGG 208

Query: 65  INTERDYPNVGVMDNC 80
           I++E DYP  GV   C
Sbjct: 209 IDSESDYPYKGVGAVC 224


>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
 gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
          Length = 356

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA V +IEG+ +I T  LV +S Q++VDCD  G    C GG   +  ++V +N G
Sbjct: 157 GSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGG 216

Query: 65  INTERDYPNVGVMDNC 80
           + TE DYP VG    C
Sbjct: 217 LTTESDYPYVGSQRQC 232


>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
          Length = 458

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  + ++I N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP  G  + C V + N
Sbjct: 210 IDTEDDYPYKGKDERCDVNRKN 231


>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
          Length = 371

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 55/82 (67%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +  +EGI+KIV+  LV +S Q+LVDCD   ++  C GG ++  +Q+++ N G
Sbjct: 157 GSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDA-GCNGGLMDYAFQFIMDNGG 215

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE+DYP +G  + C   + N
Sbjct: 216 IDTEKDYPYLGFNNQCDPTKKN 237


>gi|75994616|gb|ABA33829.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 248

 Score = 79.3 bits (194), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI+ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 182 IDTEADYPFIGTDGTCDASKEN 203


>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
          Length = 289

 Score = 79.3 bits (194), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 38/79 (48%), Positives = 54/79 (68%), Gaps = 7/79 (8%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
          GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I+
Sbjct: 25 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIK 80

Query: 62 NRGINTERDYPNVGVMDNC 80
          N GI+TE DYP       C
Sbjct: 81 NGGIDTEEDYPYKAADGRC 99


>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
          Length = 333

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG     T  L+D+S Q LVDC ++  ++ C GGF+   +QYVI N+G
Sbjct: 140 GSCWAFSSVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKG 199

Query: 65  INTERDYPNVGVMDNC 80
           I+++  YP  GV   C
Sbjct: 200 IDSDTSYPYQGVQGTC 215


>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
          Length = 381

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 51/81 (62%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ V A+EGI+KIVT NL+ +S Q+LVDC     +R C  G++   +Q++I N GI
Sbjct: 151 SCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQRTRGCNRGYMNDAFQFIIDNGGI 210

Query: 66  NTERDYPNVGVMDNCKVFQFN 86
           NTE +YP       C  ++ N
Sbjct: 211 NTEDNYPYTAQDGQCDWYRKN 231


>gi|219362839|ref|NP_001136636.1| uncharacterized protein LOC100216764 precursor [Zea mays]
 gi|194696462|gb|ACF82315.1| unknown [Zea mays]
 gi|413934556|gb|AFW69107.1| hypothetical protein ZEAMMB73_554980 [Zea mays]
          Length = 361

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 36/77 (46%), Positives = 49/77 (63%), Gaps = 2/77 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA + A+EGI+ I T +LV +S QQLVDCDN      C GG+I +   ++++NRG
Sbjct: 169 GSCWAFAAIAAVEGINAIRTWSLVTLSEQQLVDCDNV--DHGCAGGWIPSALDFIVRNRG 226

Query: 65  INTERDYPNVGVMDNCK 81
           I  E  YP +G    C+
Sbjct: 227 IVPEGTYPYIGTQGRCR 243


>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
          Length = 374

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 34/83 (40%), Positives = 56/83 (67%), Gaps = 1/83 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT  ++ +S Q+LVDCD + ++  C GG ++  ++++I N G
Sbjct: 162 GSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCD-RVQNSGCNGGLMDYAFEFIISNGG 220

Query: 65  INTERDYPNVGVMDNCKVFQFNW 87
           ++TE+ YP  GV   C   + N+
Sbjct: 221 MDTEKHYPYRGVEGRCDPVRKNY 243


>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
 gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EG+ K+ T  LV +S Q+LVDCD +GE + C GG +E  ++++ +N G
Sbjct: 146 GSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGEDKGCQGGLMEDAFKFIKRNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           I TE +Y   G    C
Sbjct: 206 ITTEANYAYRGRDGKC 221


>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
          Length = 341

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/80 (46%), Positives = 53/80 (66%), Gaps = 3/80 (3%)

Query: 3   PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
           P GSCW F+ V  +EGI+KIVT NL+ +S Q+L+DCD +  S  C GG+  T  +YV+ N
Sbjct: 155 PCGSCWAFSTVATVEGINKIVTGNLISLSEQELLDCDRR--SHGCKGGYQTTSLKYVVDN 212

Query: 63  RGINTERDYPNVGVMDNCKV 82
            G++TE++YP      NC+ 
Sbjct: 213 -GVHTEKEYPYEKKQGNCRA 231


>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
          Length = 362

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 50/80 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ VGA+EG+  I T NLV +S QQ++DCD    ++ C GG+++  +QYV+ N G
Sbjct: 172 GCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGG 231

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           + TE  YP   V   C+  Q
Sbjct: 232 VTTEDAYPYSAVQGTCQNVQ 251


>gi|214015295|gb|ACJ62264.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 247

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP VG    C   + N
Sbjct: 182 IDTEADYPFVGTDGTCDASKEN 203


>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
          Length = 343

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI ++ T  L+ +S Q+LVDCD +G  + C GG ++  ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 206

Query: 65  INTERDYPNVGVMDNC 80
           ++TE  YP  GV   C
Sbjct: 207 LDTEAKYPYQGVDGTC 222


>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
          Length = 384

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 53/76 (69%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EG++KI+T  LV +S Q+LVDCD+  +++ C GG ++  +QY+ +N G
Sbjct: 161 GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDV-DNQGCDGGLMDYAFQYIQRNGG 219

Query: 65  INTERDYPNVGVMDNC 80
           + TE +YP +    +C
Sbjct: 220 VTTESNYPYLAEQRSC 235


>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
          Length = 454

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 38/85 (44%), Positives = 55/85 (64%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ V A+EGI++IVT NL  +S Q+LVDCD   NQG    C GG ++  +Q++I 
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQG----CNGGLMDYAFQFIIS 209

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N G+++E DYP      +C  ++ N
Sbjct: 210 NGGLDSEDDYPYKANNGSCDAYRKN 234


>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT NL  +S Q+L+DCD    +  C GG ++  ++Y+++N G
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGG 218

Query: 65  INTERDYP 72
           +  E DYP
Sbjct: 219 LRKEEDYP 226


>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
          Length = 232

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI KI T  L+ +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 38  GCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVYGEDQGCEGGLMDDAFKFIIKNGG 97

Query: 65  INTERDYPNVGVMDNCK 81
           + TE +YP       CK
Sbjct: 98  LTTESNYPYTAADGKCK 114


>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 427

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EG+++I    LV +S Q+LVDCD   E+  C GGF+   +++V+ N G
Sbjct: 233 GSCWAFSAVAAMEGLNQIKNGKLVSLSEQELVDCD--AEAVGCAGGFMSWAFEFVMANHG 290

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           + TE  YP  G+   C+  + N
Sbjct: 291 LTTEASYPYKGINGACQTAKLN 312


>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
 gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
          Length = 484

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 54/78 (69%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+ V A+EGI+KIVT +L+ +S Q+L+DCD + + + C GG ++  + ++I+N G
Sbjct: 177 GACWAFSAVAAVEGINKIVTGSLISLSEQELIDCD-KFQDQGCDGGLMDNAFVFMIKNGG 235

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP  G    C +
Sbjct: 236 IDTEADYPFTGHDGTCDL 253


>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
          Length = 459

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  + ++I N G
Sbjct: 152 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 210

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP  G  + C V + N
Sbjct: 211 IDTEDDYPYKGKDERCDVNRKN 232


>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
 gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
          Length = 458

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  + ++I N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP  G  + C V + N
Sbjct: 210 IDTEDDYPYKGKDERCDVNRKN 231


>gi|76574390|gb|ABA46965.1| cysteine protease Mir1 [Zea diploperennis]
          Length = 256

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211


>gi|214015368|gb|ACJ62300.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 257

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 134 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 191

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 192 IDTEADYPFIGTDGTCDASKEN 213


>gi|76574402|gb|ABA46971.1| cysteine protease Mir1 [Zea diploperennis]
          Length = 256

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211


>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
 gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
          Length = 350

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW FA V A+E I +I T NLV +S QQ++DCD  G +  C GG+I+  +QY+I N G
Sbjct: 159 GCCWAFAAVAAVESIHQITTGNLVSLSEQQVLDCDTDGNN-GCNGGYIDNAFQYIISNGG 217

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP       C+
Sbjct: 218 LATEDAYPYAAAQGTCQ 234


>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
          Length = 458

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  + ++I N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP  G  + C V + N
Sbjct: 210 IDTEDDYPYKGKDERCDVNRKN 231


>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
          Length = 339

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 47/76 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T  L+ +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP       C
Sbjct: 205 LTTESKYPYTAADGKC 220


>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
          Length = 339

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 47/76 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T  L+ +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP       C
Sbjct: 205 LTTESKYPYTAADGKC 220


>gi|76574394|gb|ABA46967.1| cysteine protease Mir1 [Zea diploperennis]
          Length = 256

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211


>gi|214015297|gb|ACJ62265.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 251

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207


>gi|214015259|gb|ACJ62246.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 255

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211


>gi|76574404|gb|ABA46972.1| cysteine protease Mir1 [Zea diploperennis]
          Length = 250

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 184 IDTEADYPFIGTDGTCDASKEN 205


>gi|214015355|gb|ACJ62294.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 252

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 129 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 186

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 187 IDTEADYPFIGTDGTCDASKEN 208


>gi|214015351|gb|ACJ62292.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 255

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211


>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
 gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
          Length = 339

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 47/76 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T  L+ +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP       C
Sbjct: 205 LTTESKYPYTAADGKC 220


>gi|75994620|gb|ABA33831.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 255

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210


>gi|75994618|gb|ABA33830.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 255

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210


>gi|214015291|gb|ACJ62262.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 251

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207


>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase EP-C1; Flags: Precursor
 gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
          Length = 362

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q  G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP       C   + N
Sbjct: 209 ITTESNYPYKAQEGTCDASKVN 230


>gi|214015279|gb|ACJ62256.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 255

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211


>gi|214015331|gb|ACJ62282.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 251

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207


>gi|76574392|gb|ABA46966.1| cysteine protease Mir1 [Zea diploperennis]
 gi|76574396|gb|ABA46968.1| cysteine protease Mir1 [Zea diploperennis]
 gi|76574398|gb|ABA46969.1| cysteine protease Mir1 [Zea diploperennis]
 gi|76574406|gb|ABA46973.1| cysteine protease Mir1 [Zea diploperennis]
          Length = 250

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 184 IDTEADYPFIGTDGTCDASKEN 205


>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
          Length = 361

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q  G
Sbjct: 149 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 207

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP       C   + N
Sbjct: 208 ITTESNYPYKAQEGTCDASKVN 229


>gi|75994632|gb|ABA33837.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 248

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 182 IDTEADYPFIGTDGTCDASKEN 203


>gi|214015339|gb|ACJ62286.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015343|gb|ACJ62288.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015347|gb|ACJ62290.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015349|gb|ACJ62291.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 255

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211


>gi|214015307|gb|ACJ62270.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015309|gb|ACJ62271.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015313|gb|ACJ62273.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015315|gb|ACJ62274.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015317|gb|ACJ62275.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015319|gb|ACJ62276.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015321|gb|ACJ62277.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015323|gb|ACJ62278.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 249

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 184 IDTEADYPFIGTDGTCDASKEN 205


>gi|214015269|gb|ACJ62251.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 247

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 182 IDTEADYPFIGTDGTCDASKEN 203


>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
          Length = 362

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+L+DCDNQ E++ C GG +E  ++Y+ Q  G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCDNQ-ENQGCNGGLMEYAFEYIKQKGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           + TE  YP      +C   + N
Sbjct: 209 VTTESYYPYTANDGSCDATKEN 230


>gi|214015289|gb|ACJ62261.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 254

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210


>gi|214015235|gb|ACJ62234.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015271|gb|ACJ62252.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015275|gb|ACJ62254.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 254

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210


>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
 gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
          Length = 362

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+L+DCDNQ E++ C GG +E  ++Y+ Q  G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCDNQ-ENQGCNGGLMEYAFEYIKQKGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           + TE  YP      +C   + N
Sbjct: 209 VTTESYYPYTANDGSCDATKEN 230


>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
          Length = 470

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 35/78 (44%), Positives = 52/78 (66%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  + ++I N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP  G  + C V
Sbjct: 210 IDTEDDYPYKGKDERCDV 227


>gi|214015357|gb|ACJ62295.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 251

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207


>gi|214015267|gb|ACJ62250.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 254

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210


>gi|75994622|gb|ABA33832.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 254

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 130 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 187

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 188 IDTEADYPFIGTDGTCDASKEN 209


>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 445

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 36/75 (48%), Positives = 50/75 (66%), Gaps = 1/75 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI++I T  LV +S Q+LVDCD    +  C GG ++  +Q++I N G
Sbjct: 145 GSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTS-YNNGCGGGLMDYAFQFIISNGG 203

Query: 65  INTERDYPNVGVMDN 79
           I+TE DYP     DN
Sbjct: 204 IDTEEDYPYTATDDN 218


>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
 gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI K+ T NL+ +S QQLVDC     ++ C GG ++T +QY+I+N G
Sbjct: 146 GCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDC--TAGNKGCQGGLMDTAFQYIIRNGG 203

Query: 65  INTERDYPNVGVMDNC 80
           + +E +YP  GV   C
Sbjct: 204 LTSEDNYPYQGVDGTC 219


>gi|214015327|gb|ACJ62280.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015329|gb|ACJ62281.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015345|gb|ACJ62289.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 251

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207


>gi|214015301|gb|ACJ62267.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 253

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 130 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 187

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 188 IDTEADYPFIGTDGTCDASKEN 209


>gi|214015241|gb|ACJ62237.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015243|gb|ACJ62238.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015255|gb|ACJ62244.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015261|gb|ACJ62247.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 254

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210


>gi|214015303|gb|ACJ62268.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 251

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207


>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
          Length = 463

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 38/85 (44%), Positives = 55/85 (64%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ V A+EGI++IVT NL  +S Q+LVDCD   NQG    C GG ++  +Q++I 
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQG----CNGGLMDYAFQFIIN 209

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N G+++E DYP      +C  ++ N
Sbjct: 210 NGGLDSEDDYPYKANDGSCDAYRKN 234


>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI  +    L+ +S Q++VDCD +G+ + C GGF++  ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQDQGCAGGFMDGAFKFIIQNHG 206

Query: 65  INTERDYPNVGVMDNC 80
           +NTE +YP       C
Sbjct: 207 LNTEPNYPYKAADGKC 222


>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 436

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 33/71 (46%), Positives = 51/71 (71%), Gaps = 7/71 (9%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 154 GSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 209

Query: 62  NRGINTERDYP 72
           N GI++E DYP
Sbjct: 210 NGGIDSEEDYP 220


>gi|214015366|gb|ACJ62299.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 247

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 182 IDTEADYPFIGTDGTCDASKEN 203


>gi|214015233|gb|ACJ62233.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015239|gb|ACJ62236.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015265|gb|ACJ62249.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 253

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 130 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 187

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 188 IDTEADYPFIGTDGTCDASKEN 209


>gi|297602258|ref|NP_001052246.2| Os04g0208200 [Oryza sativa Japonica Group]
 gi|255675225|dbj|BAF14160.2| Os04g0208200, partial [Oryza sativa Japonica Group]
          Length = 219

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 46/76 (60%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          G CW F+ V A+EG  K+ T  LV +S QQLV CD +GE + C GG ++  + ++I+N G
Sbjct: 21 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 80

Query: 65 INTERDYPNVGVMDNC 80
          +  E DYP     D C
Sbjct: 81 LAAESDYPYTASDDKC 96


>gi|214015247|gb|ACJ62240.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 255

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSGQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211


>gi|214015237|gb|ACJ62235.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015245|gb|ACJ62239.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015249|gb|ACJ62241.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015257|gb|ACJ62245.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015273|gb|ACJ62253.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015277|gb|ACJ62255.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 255

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211


>gi|214015253|gb|ACJ62243.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 255

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211


>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT  L+ +S Q+LVDCD +  +  C GG ++  +Q++I N G
Sbjct: 156 GSCWAFSTISAVEGINQIVTGELISLSEQELVDCD-KSYNMGCNGGLMDYGFQFIINNGG 214

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C  F+ N
Sbjct: 215 IDTEEDYPYRAVDGTCDQFRKN 236


>gi|214015372|gb|ACJ62302.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 250

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 127 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 184

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 185 IDTEADYPFIGTDGTCDASKEN 206


>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
          Length = 376

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EG+++I T  ++ +S Q+LVDCD   ++  C GG ++  ++++I N G
Sbjct: 158 GSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDA-GCNGGLMDYAFEFIINNGG 216

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP  GV   C
Sbjct: 217 IDTEEDYPYRGVDGTC 232


>gi|214015325|gb|ACJ62279.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 251

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207


>gi|214015361|gb|ACJ62297.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 247

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 182 IDTEADYPFIGTDGTCDASKEN 203


>gi|75994612|gb|ABA33827.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 254

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 130 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 187

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP VG    C   + N
Sbjct: 188 IDTEADYPFVGTDGTCDANKEN 209


>gi|214015380|gb|ACJ62306.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015386|gb|ACJ62309.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 249

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 184 IDTEADYPFIGTDGTCDASKEN 205


>gi|214015263|gb|ACJ62248.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 252

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 129 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 186

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 187 IDTEADYPFIGTDGTCDASKEN 208


>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 471

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 53/76 (69%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +G++EG++ IVT  L+ +S Q+LVDCD +G+++ C GG ++  + ++I+N G
Sbjct: 159 GSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCD-RGQNQGCNGGLMDYAFDFIIKNGG 217

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP       C
Sbjct: 218 IDTEEDYPYKATDGQC 233


>gi|214015333|gb|ACJ62283.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015335|gb|ACJ62284.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015337|gb|ACJ62285.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015341|gb|ACJ62287.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 251

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207


>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 53/68 (77%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VG++EGI++IVT +L+ +S Q+LVDCD +  ++ C GG ++  ++++I+N G
Sbjct: 164 GSCWAFSTVGSVEGINQIVTGDLISLSEQELVDCD-KAYNQGCNGGLMDYAFEFIIKNGG 222

Query: 65  INTERDYP 72
           I++E DYP
Sbjct: 223 IDSEADYP 230


>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
 gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
          Length = 366

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 36/77 (46%), Positives = 49/77 (63%), Gaps = 2/77 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I T NLV +S QQLVDCD    +  C GG + T + +V++NRG
Sbjct: 174 GSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCDKL--NHGCNGGLMTTAFSFVVRNRG 231

Query: 65  INTERDYPNVGVMDNCK 81
           +  E  YP +G    CK
Sbjct: 232 VVPEGAYPYMGREGRCK 248


>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
          Length = 499

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 50/78 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT  LV +S Q+LV+C   G +  C GG ++  + ++ +N G
Sbjct: 179 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGG 238

Query: 65  INTERDYPNVGVMDNCKV 82
           ++TE DYP   +   C +
Sbjct: 239 LDTEEDYPYTAMDGKCNL 256


>gi|214015378|gb|ACJ62305.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 254

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ IVT NLV +S Q+++DCD Q     C GG +E  + +VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIVTGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFLFVIGNGG 188

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210


>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
          Length = 499

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 50/78 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT  LV +S Q+LV+C   G +  C GG ++  + ++ +N G
Sbjct: 179 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGG 238

Query: 65  INTERDYPNVGVMDNCKV 82
           ++TE DYP   +   C +
Sbjct: 239 LDTEEDYPYTAMDGKCNL 256


>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
          Length = 366

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 34/68 (50%), Positives = 48/68 (70%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT  LV +S Q+L+DCDN   +  C GG ++  + Y++ N+G
Sbjct: 165 GSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDNTF-NHGCRGGLMDFAFAYIMGNQG 223

Query: 65  INTERDYP 72
           I TE DYP
Sbjct: 224 IYTEEDYP 231


>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
          Length = 501

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EGI+ IVT +L+ +S Q+LVDCD    +  C GG+++  +++VI N G
Sbjct: 158 GSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVISNGG 215

Query: 65  INTERDYPNVGVMDNC 80
           I++E DYP  G    C
Sbjct: 216 IDSESDYPYTGTDGTC 231


>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
 gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEGI+ IVT++L+ +S Q+LVDCD    +  C GG+++  +++VI N G
Sbjct: 155 GSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVINNGG 212

Query: 65  INTERDYPNVGVMDNC 80
           I+TE +YP  GV   C
Sbjct: 213 IDTEANYPYTGVDGTC 228


>gi|255586666|ref|XP_002533962.1| cysteine protease, putative [Ricinus communis]
 gi|223526059|gb|EEF28418.1| cysteine protease, putative [Ricinus communis]
          Length = 417

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 54/80 (67%), Gaps = 2/80 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEGI+ IVT +LV +S Q+L+DCD    +  C GG+++  +++VI N G
Sbjct: 163 GSCWAFSSTGAIEGINAIVTGDLVSLSEQELMDCDTT--NYGCDGGYMDYAFEWVINNGG 220

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           I+TE DYP  GV   C + +
Sbjct: 221 IDTEIDYPYTGVDGTCNIAK 240


>gi|214015293|gb|ACJ62263.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 256

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 133 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 190

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 191 IDTEADYPFIGTDGTCDANKEN 212


>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
 gi|255645733|gb|ACU23360.1| unknown [Glycine max]
          Length = 362

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 41/77 (53%), Positives = 48/77 (62%), Gaps = 2/77 (2%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           S W F+V GAIEGI+KIVT NLV +S QQ+VDCD    S  C GGF    + YVI+N GI
Sbjct: 160 SHWAFSVTGAIEGINKIVTGNLVSLSVQQVVDCDPA--SHGCAGGFYFNAFGYVIENGGI 217

Query: 66  NTERDYPNVGVMDNCKV 82
           +TE  YP       CK 
Sbjct: 218 DTEAHYPYTAQNGTCKA 234


>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
          Length = 357

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 34/68 (50%), Positives = 48/68 (70%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT  LV +S Q+L+DCDN   +  C GG ++  + Y++ N+G
Sbjct: 156 GSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDNTF-NHGCRGGLMDFAFAYIMGNQG 214

Query: 65  INTERDYP 72
           I TE DYP
Sbjct: 215 IYTEEDYP 222


>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
          Length = 315

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT NL  +S Q+L+DCD    +  C GG ++  ++Y+++N G
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGG 218

Query: 65  INTERDYP 72
           +  E DYP
Sbjct: 219 LRKEEDYP 226


>gi|214015287|gb|ACJ62260.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 253

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 130 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 187

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 188 IDTEADYPFIGTDGTCDANKEN 209


>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
 gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
          Length = 493

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 53/78 (67%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+KIVT +L+ +S Q+L+DCD + + + C GG ++  + ++I+N G
Sbjct: 186 GGCWAFSAVAAVEGINKIVTGSLISLSEQELIDCD-KFQDQGCDGGLMDNAFVFMIKNGG 244

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP  G    C +
Sbjct: 245 IDTEADYPFTGHDGTCDL 262


>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 49/77 (63%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI   NLV +S QQL+DCD + + R C GG +   + Y+IQNRG
Sbjct: 148 GCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDCDREYD-RGCDGGIMSDAFNYIIQNRG 206

Query: 65  INTERDYPNVGVMDNCK 81
           I +E DY   G    C+
Sbjct: 207 IASENDYSYQGSDGRCR 223


>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
          Length = 245

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 53/78 (67%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT  L+ +S Q+LVDCD + +   C GG ++  + ++I+N G
Sbjct: 28  GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYD-MGCNGGLMDYAFDFIIKNGG 86

Query: 65  INTERDYPNVGVMDNCKV 82
           ++TE+DYP  G    C +
Sbjct: 87  LDTEKDYPYTGFDGECNL 104


>gi|75994614|gb|ABA33828.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 251

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 127 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 184

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 185 IDTEADYPFIGTDGTCDANKEN 206


>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
 gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
           Precursor
 gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
 gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
 gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
          Length = 356

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT NL  +S Q+L+DCD    +  C GG ++  ++Y+++N G
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGG 218

Query: 65  INTERDYP 72
           +  E DYP
Sbjct: 219 LRKEEDYP 226


>gi|214015370|gb|ACJ62301.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015384|gb|ACJ62308.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015392|gb|ACJ62312.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 247

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 182 IDTEADYPFIGTDGTCDANKEN 203


>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T  LV +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204

Query: 65  INTERDYPNVGVMDNCK 81
           +  E +YP       CK
Sbjct: 205 LTQESNYPYDAADGKCK 221


>gi|214015374|gb|ACJ62303.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 247

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 182 IDTEADYPFIGTDGTCDANKEN 203


>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
 gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
          Length = 452

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 36/85 (42%), Positives = 56/85 (65%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ V A+EGI++IVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 151 GSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 206

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N G+++E DYP      +C  ++ N
Sbjct: 207 NGGLDSEEDYPYTAYDGSCDSYRKN 231


>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
          Length = 565

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 34/68 (50%), Positives = 50/68 (73%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GAIEGI+KI T +L+ +S Q+L+DCD +  +  C GG ++  Y++VI+N G
Sbjct: 160 GACWSFSATGAIEGINKIKTGSLISLSEQELIDCD-RSYNAGCGGGLMDYAYRFVIKNGG 218

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 219 IDTEDDYP 226


>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
 gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 34/68 (50%), Positives = 47/68 (69%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEGI KI + NLV +S QQLVDCD  G ++ C  G +   ++++++N G
Sbjct: 145 GSCWAFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGG 204

Query: 65  INTERDYP 72
           I TE +YP
Sbjct: 205 IATEANYP 212


>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
          Length = 369

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 50/82 (60%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++EGI+ I T NLV +S QQLVDC    E+  C GG ++T +QY+I N G
Sbjct: 155 GSCWAFSTVASVEGINYITTGNLVSLSEQQLVDCST--ENSGCNGGLMDTAFQYIINNGG 212

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP       C   + N
Sbjct: 213 IVTEDNYPYTAEATECSSTKIN 234


>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
 gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
          Length = 338

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T NL+ +S Q+LVDCD       C GG++++ +++VI+N G
Sbjct: 144 GCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 203

Query: 65  INTERDYPNVGVMDNCK 81
           + T   YP   V   CK
Sbjct: 204 LATVSSYPYKAVDGKCK 220


>gi|214015299|gb|ACJ62266.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 251

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 186 IDTEADYPFIGTDGTCDANKEN 207


>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
 gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
          Length = 365

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 48/68 (70%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT  L+ +S Q+LV CD +  S  C GG ++  +Q++I N G
Sbjct: 146 GSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNS-GCNGGLMDYAFQFIIDNGG 204

Query: 65  INTERDYP 72
           ++TE DYP
Sbjct: 205 LDTEEDYP 212


>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 35/82 (42%), Positives = 56/82 (68%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT  L+ +S Q+LVDCD +  +  C GG ++  ++++I N G
Sbjct: 158 GSCWAFSTVNAVEGINQIVTGELITLSEQELVDCD-KSYNEGCDGGLMDYGFEFIINNGG 216

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+T++DYP +G    C  ++ N
Sbjct: 217 IDTDKDYPYLGRDARCDQYRKN 238


>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase; AltName:
           Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
           RecName: Full=Vignain-1; Contains: RecName:
           Full=Vignain-2; Flags: Precursor
 gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
 gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
          Length = 362

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 37/81 (45%), Positives = 54/81 (66%), Gaps = 4/81 (4%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q  G
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208

Query: 65  INTERDYPNV---GVMDNCKV 82
           I TE +YP     G  D  KV
Sbjct: 209 ITTESNYPYTAQEGTCDESKV 229


>gi|326520659|dbj|BAJ92693.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 289

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 33/71 (46%), Positives = 51/71 (71%), Gaps = 7/71 (9%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 154 GSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 209

Query: 62  NRGINTERDYP 72
           N GI++E DYP
Sbjct: 210 NGGIDSEEDYP 220


>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
          Length = 469

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 37/85 (43%), Positives = 56/85 (65%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI+ IVT +L+ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 162 GSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQG----CNGGLMDYAFEFIIS 217

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI+T+ DYP  G   +C  ++ N
Sbjct: 218 NGGIDTDEDYPYTGRDGSCDQYRKN 242


>gi|445927|prf||1910332A Cys endopeptidase
          Length = 362

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 37/81 (45%), Positives = 54/81 (66%), Gaps = 4/81 (4%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q  G
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208

Query: 65  INTERDYP---NVGVMDNCKV 82
           I TE +YP     G  D  KV
Sbjct: 209 ITTESNYPYKAQEGTCDESKV 229


>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
 gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
 gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
          Length = 362

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + E+  C GG +E+ +Q++ Q  G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTE-ENAGCNGGLMESAFQFIKQKGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE  YP       C   + N
Sbjct: 209 ITTESYYPYTAQDGTCDASKAN 230


>gi|75994608|gb|ABA33825.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 250

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 37/78 (47%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 183

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP VG    C  
Sbjct: 184 IDTEADYPFVGTDGTCDA 201


>gi|348525687|ref|XP_003450353.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
          Length = 170

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 35/75 (46%), Positives = 48/75 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+V GA+EG     T  LVD+S Q LVDC  +  +  C GG+I   +QYVI N+G
Sbjct: 64  GACWAFSVAGALEGQLAKTTGKLVDLSPQNLVDCSGKYGNHGCNGGYISRAFQYVIDNQG 123

Query: 65  INTERDYPNVGVMDN 79
           I+++  YP  G M+N
Sbjct: 124 IDSDASYPYTGRMEN 138


>gi|214015394|gb|ACJ62313.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 256

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 133 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 190

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +G    C  
Sbjct: 191 IDTEADYPFIGTDGTCDA 208


>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
 gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
          Length = 258

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 36/80 (45%), Positives = 49/80 (61%), Gaps = 1/80 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW FA V A+EGI +I T NLV +S QQ++DCD  G +  C GG+I+  +QY++ N G
Sbjct: 69  GCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDG-NNGCNGGYIDNAFQYIVGNGG 127

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           + TE  YP       C+  Q
Sbjct: 128 LATEDAYPYTAAQAMCQSVQ 147


>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
 gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 345

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 49/77 (63%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI   NLV +S QQL+DCD + + R C GG +   + YV+QNRG
Sbjct: 152 GCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYD-RGCDGGIMSDAFNYVVQNRG 210

Query: 65  INTERDYPNVGVMDNCK 81
           I +E DY   G    C+
Sbjct: 211 IASENDYSYQGSDGGCR 227


>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
          Length = 345

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 52/78 (66%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD +  S  C GG ++  + ++++N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYSNGCNGGLMDYAFSFIVENGG 211

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 212 LHKEEDYPYIMEEGTCEM 229


>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 345

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 49/77 (63%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI   NLV +S QQL+DCD + + R C GG +   + YV+QNRG
Sbjct: 152 GCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYD-RDCDGGIMSDAFNYVVQNRG 210

Query: 65  INTERDYPNVGVMDNCK 81
           I +E DY   G    C+
Sbjct: 211 IASENDYSYQGSDGGCR 227


>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
 gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 52/76 (68%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEGI+ IVT +L+ +S Q+LVDCD    +  C GG+++  +++VI N G
Sbjct: 163 GSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVINNGG 220

Query: 65  INTERDYPNVGVMDNC 80
           I+TE +YP  GV   C
Sbjct: 221 IDTEANYPYTGVDGTC 236


>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
          Length = 347

 Score = 78.2 bits (191), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI KI T  L  +S Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 153 GCCWAFSAVAATEGIVKISTGKLTSLSEQELVDCDVHGEDQGCNGGEMDDAFKFIIKNGG 212

Query: 65  INTERDYPNVGVMDNCK 81
           + TE +YP       CK
Sbjct: 213 LTTESNYPYTAQDGQCK 229


>gi|214015283|gb|ACJ62258.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 255

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +G    C  
Sbjct: 190 IDTEADYPFIGTDGTCDA 207


>gi|214015231|gb|ACJ62232.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 255

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +G    C  
Sbjct: 190 IDTEADYPFIGTDGTCDA 207


>gi|214015382|gb|ACJ62307.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 249

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +G    C  
Sbjct: 184 IDTEADYPFIGTDGTCDA 201


>gi|214015396|gb|ACJ62314.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 246

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 123 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 180

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +G    C  
Sbjct: 181 IDTEADYPFIGTDGTCDA 198


>gi|214015251|gb|ACJ62242.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015281|gb|ACJ62257.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015285|gb|ACJ62259.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 255

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +G    C  
Sbjct: 190 IDTEADYPFIGTDGTCDA 207


>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
 gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 52/77 (67%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI++I+T +L+ +S Q+L+DCD    S  C GG ++  YQ+VI N G
Sbjct: 136 GACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNS-GCGGGLMDYAYQFVISNHG 194

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE DYP      +C+
Sbjct: 195 IDTENDYPYQARDGSCR 211


>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 370

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + ++EGI+KIVT +L+ +S Q+LVDCD    +  C GG ++  +Q++I N G
Sbjct: 63  GSCWAFSTIASVEGINKIVTGDLISLSEQELVDCDKT-YNDGCNGGLMDYAFQFIIDNGG 121

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE+DYP       C  ++ N
Sbjct: 122 IDTEKDYPYTEQDGRCDSYRKN 143


>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
          Length = 359

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +  +E I+KIVT  LV +S Q+LVDCD +  +  C GG ++  ++++++N G
Sbjct: 147 GSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCD-RAFNEGCNGGLMDYAFEFIVENGG 205

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+DYP  G    C
Sbjct: 206 IDTEQDYPYKGFEGRC 221


>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
 gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
          Length = 324

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 46/68 (67%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD    S  C GG ++  + Y++ N G
Sbjct: 128 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTSFNS-GCNGGLMDYAFDYIVNNGG 186

Query: 65  INTERDYP 72
           ++ E DYP
Sbjct: 187 LHKEEDYP 194


>gi|214015359|gb|ACJ62296.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 247

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +G    C  
Sbjct: 182 IDTEADYPFIGTDGTCDA 199


>gi|214015311|gb|ACJ62272.1| cysteine protease [Zea mays subsp. parviglumis]
 gi|214015376|gb|ACJ62304.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 247

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +G    C  
Sbjct: 182 IDTEADYPFIGTDGTCDA 199


>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
          Length = 362

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 35/82 (42%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I T+ LV +S Q+LVDCD + E++ C GG +E+ ++++ Q  G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP       C   + N
Sbjct: 209 ITTESNYPYTAQEGTCDASKVN 230


>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
 gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
 gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 167 GSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGG 225

Query: 65  INTERDYPNVGVMDNC 80
           I++E DYP   V   C
Sbjct: 226 IDSEDDYPYKAVDGRC 241


>gi|214015388|gb|ACJ62310.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 249

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +G    C  
Sbjct: 184 IDTEADYPFIGTDGTCDA 201


>gi|75994610|gb|ABA33826.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 248

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +G    C  
Sbjct: 182 IDTEADYPFIGTDGTCDA 199


>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 338

 Score = 77.8 bits (190), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 30/76 (39%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V ++EGI K+ T  L+ +S Q+LVDCD   +++ C GG ++  +++++ N G
Sbjct: 142 GCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQNKGCGGGLMDNAFEFIVNNGG 201

Query: 65  INTERDYPNVGVMDNC 80
           ++TE DYP  G    C
Sbjct: 202 LDTEADYPYTGADGTC 217


>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD +  +  C GG ++  +++VI N G
Sbjct: 160 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNEGCDGGLMDYAFEFVINNGG 218

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP     D C  ++ N
Sbjct: 219 IDTEEDYPYKERNDVCDQYRKN 240


>gi|75994628|gb|ABA33835.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 248

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP +G    C  
Sbjct: 182 IDTEADYPFIGTDGTCDA 199


>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
          Length = 378

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EG++KI T  LV +S Q+LVDCD  G+++ C GG ++  +Q++ +N G
Sbjct: 165 GSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDT-GDNQGCDGGLMDYAFQFIKRNGG 223

Query: 65  INTERDYP 72
           I TE +YP
Sbjct: 224 ITTESNYP 231


>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
          Length = 364

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG + I T  LV +S Q LVDCD + ++  C GGF+++ + +++ N G
Sbjct: 158 GSCWAFSTTGAVEGANAIATGKLVSLSEQMLVDCDREYDT-GCRGGFMDSAFDFIVNNGG 216

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 217 IDTEDDYP 224


>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
 gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 52/76 (68%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EGI++IVT NL  +S Q+LVDCD +  +  C GG ++  + ++I+N G
Sbjct: 136 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCD-KTYNLGCNGGLMDYAFDFIIENGG 194

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP   +   C
Sbjct: 195 IDTEEDYPYKAIDSMC 210


>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
 gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
 gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 378

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EG++KI T  LV +S Q+LVDCD  G+++ C GG ++  +Q++ +N G
Sbjct: 165 GSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDT-GDNQGCDGGLMDYAFQFIKRNGG 223

Query: 65  INTERDYP 72
           I TE +YP
Sbjct: 224 ITTESNYP 231


>gi|75994624|gb|ABA33833.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 255

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 50/82 (60%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  + +VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFLFVIGNGG 188

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210


>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
          Length = 394

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 35/84 (41%), Positives = 50/84 (59%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG + + T  L+ +S QQLVDCD++ +S         C GG + T YQ
Sbjct: 182 GSCWTFSTTGAMEGANFMKTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMTTAYQ 241

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y ++  G+  E DYP  G+  +CK
Sbjct: 242 YALKAGGLQREEDYPYTGIDGSCK 265


>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
           sativus]
          Length = 235

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+    +EGI+KIVT  L+ +S Q+LVDCD +  ++ C GG ++  +Q++++N G
Sbjct: 26  GSCWAFSTAAVVEGINKIVTGELISLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGG 84

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           +NTE+DYP  G    C     N
Sbjct: 85  LNTEQDYPYRGSDGKCNSLLKN 106


>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 50/77 (64%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KIV NNLV +S QQL+DCD + ++  C GG +   + Y+I+NRG
Sbjct: 161 GCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDN-GCNGGIMSDAFSYIIKNRG 219

Query: 65  INTERDYPNVGVMDNCK 81
           I +E  YP       C+
Sbjct: 220 IASEASYPYQAAEGTCR 236


>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
 gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
 gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
 gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
 gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
          Length = 365

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 48/77 (62%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+ IVT NL  +S Q+L+DCD  G +  C GG ++  + Y+  N G
Sbjct: 161 GSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNN-GCSGGLMDYAFSYIAANGG 219

Query: 65  INTERDYPNVGVMDNCK 81
           ++TE  YP +     C+
Sbjct: 220 LHTEESYPYLMEEGTCR 236


>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
           vinifera]
          Length = 340

 Score = 77.8 bits (190), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 51/76 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+K+ T+  + +S Q+LVDCD  G +  C GG ++  ++++IQNRG
Sbjct: 144 GGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCEGGCMDDAFKFIIQNRG 203

Query: 65  INTERDYPNVGVMDNC 80
           +N+E  Y   GV  +C
Sbjct: 204 LNSEARYLYKGVEGHC 219


>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
          Length = 379

 Score = 77.8 bits (190), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 34/75 (45%), Positives = 50/75 (66%), Gaps = 2/75 (2%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ VGA+EG++KIVT  LV +S Q L++C+   E+  C GG +ET Y++++ N G+
Sbjct: 175 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIVSNGGL 232

Query: 66  NTERDYPNVGVMDNC 80
            T+ DYP   V   C
Sbjct: 233 GTDNDYPYKAVNGAC 247


>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
 gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
          Length = 471

 Score = 77.8 bits (190), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 34/68 (50%), Positives = 51/68 (75%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EGI++IVT  L+ +S Q+LVDCD +  ++ C GG ++  ++++I N G
Sbjct: 160 GSCWAFSTVGAVEGINQIVTGELISLSEQELVDCD-KSYNQGCNGGLMDYAFEFIINNGG 218

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 219 IDTEEDYP 226


>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
 gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
          Length = 338

 Score = 77.8 bits (190), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI T NLV +S QQL+DCD +  +  C GG +   + Y+ ++ G
Sbjct: 142 GSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIKSGNEGCEGGDMYIAFNYIKKHGG 201

Query: 65  INTERDYPNVGVMDNC 80
           I T ++YP  G   NC
Sbjct: 202 IATAKEYPYKGRDGNC 217


>gi|75994630|gb|ABA33836.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 248

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 48/78 (61%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI+ I T NLV +S Q+++DCD Q     C GG +E  + +VI N G
Sbjct: 124 GGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFLFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE DYP VG    C  
Sbjct: 182 IDTEADYPFVGTDGTCDA 199


>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
 gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 35/68 (51%), Positives = 50/68 (73%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GAIEGI+KIVT +LV +S Q+L++CD +  +  C GG ++  +Q+VI N G
Sbjct: 136 GACWSFSATGAIEGINKIVTGSLVSLSEQELIECD-KSYNDGCGGGLMDYAFQFVINNHG 194

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 195 IDTEEDYP 202


>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
 gi|194706024|gb|ACF87096.1| unknown [Zea mays]
 gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 35/68 (51%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI+KI T +LV +S Q+L+DCD    S  C GG ++  Y++VI+N G
Sbjct: 159 GACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNS-GCGGGLMDYAYKFVIKNGG 217

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 218 IDTEEDYP 225


>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+LVDCD   +++ C GG ++  ++++ +  G
Sbjct: 148 GSCWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTS-QNQGCNGGLMDMAFEFIKKKGG 206

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           INTE +YP +     C + + N
Sbjct: 207 INTEENYPYMAEGGECDIQKRN 228


>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
           [Arabidopsis thaliana]
          Length = 416

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 32/68 (47%), Positives = 52/68 (76%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI++IVT +L+ +S Q+L+DCD +  +  C GG ++  +++VI+N G
Sbjct: 138 GACWSFSATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHG 196

Query: 65  INTERDYP 72
           I+TE+DYP
Sbjct: 197 IDTEKDYP 204


>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
 gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
 gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
 gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
          Length = 437

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 32/68 (47%), Positives = 52/68 (76%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI++IVT +L+ +S Q+L+DCD +  +  C GG ++  +++VI+N G
Sbjct: 140 GACWSFSATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHG 198

Query: 65  INTERDYP 72
           I+TE+DYP
Sbjct: 199 IDTEKDYP 206


>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
          Length = 331

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 50/77 (64%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KIV NNLV +S QQL+DCD + ++  C GG +   + Y+I+NRG
Sbjct: 137 GCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDN-GCNGGIMSDAFSYIIKNRG 195

Query: 65  INTERDYPNVGVMDNCK 81
           I +E  YP       C+
Sbjct: 196 IASEASYPYQAAEGTCR 212


>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332642714|gb|AEE76235.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 290

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 31/68 (45%), Positives = 48/68 (70%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EGI++I T  L+ +S Q+LVDCD    +  C GG +   ++++++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211

Query: 65  INTERDYP 72
           I T++DYP
Sbjct: 212 IETDQDYP 219


>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
          Length = 294

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 45/68 (66%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G+ EG   I T NLV +S QQLVDC     ++ C GG ++  ++Y+I N+G
Sbjct: 104 GSCWSFSTTGSTEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKG 163

Query: 65  INTERDYP 72
           ++TE DYP
Sbjct: 164 LDTEEDYP 171


>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
          Length = 380

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 49/78 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +  +EGI+KIVT +L+ +S Q+LVDC     +R C GG I   +Q++I N G
Sbjct: 149 GSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRTQNTRGCDGGSITDGFQFIINNGG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C +
Sbjct: 209 INTEANYPYTAEDGQCNL 226


>gi|33242886|gb|AAQ01147.1| cathepsin [Haplochromis chilotes]
          Length = 334

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LVD+S Q LVDC  +  +  C GGF+   +QYVI N G
Sbjct: 141 GSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVIDNHG 200

Query: 65  INTERDYPNVGVMDNC 80
           I+++  YP +G  D C
Sbjct: 201 IDSDASYPYIGRDDQC 216


>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
 gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 34/68 (50%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI+KI T +LV +S Q+L+DCD    S  C GG ++  Y++V++N G
Sbjct: 161 GACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNS-GCGGGLMDYAYKFVVKNGG 219

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 220 IDTEEDYP 227


>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
 gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
 gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
          Length = 498

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 52/80 (65%), Gaps = 1/80 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIE I+ IVT +L+ +S Q+LVDCD    +  C GG +++ +Q+VI N G
Sbjct: 159 GSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTN-NYGCEGGDMDSAFQWVIGNGG 217

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           I+TE DYP  GV   C   +
Sbjct: 218 IDTEADYPYTGVDGTCNTAK 237


>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 32/68 (47%), Positives = 52/68 (76%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI++IVT +L+ +S Q+L+DCD +  +  C GG ++  +++VI+N G
Sbjct: 140 GACWSFSATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHG 198

Query: 65  INTERDYP 72
           I+TE+DYP
Sbjct: 199 IDTEKDYP 206


>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 32/68 (47%), Positives = 52/68 (76%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI++IVT +L+ +S Q+L+DCD +  +  C GG ++  +++VI+N G
Sbjct: 140 GACWSFSATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHG 198

Query: 65  INTERDYP 72
           I+TE+DYP
Sbjct: 199 IDTEKDYP 206


>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
 gi|255640677|gb|ACU20623.1| unknown [Glycine max]
          Length = 366

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 50/76 (65%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I+KIVT   V +S Q+LVDCD +  +  C GG ++  ++++IQN G
Sbjct: 150 GSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCD-RAYNEGCNGGLMDYAFEFIIQNGG 208

Query: 65  INTERDYPNVGVMDNC 80
           I+T++DYP  G    C
Sbjct: 209 IDTDKDYPYRGFDGIC 224


>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 463

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 32/68 (47%), Positives = 52/68 (76%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VG++EG++ I T  LV +S Q+LVDCD + +++ C GG ++  ++++I+N G
Sbjct: 153 GSCWAFSAVGSVEGVNAIKTGELVSLSEQELVDCDRK-QNQGCNGGLMDYAFEFIIKNGG 211

Query: 65  INTERDYP 72
           I+TE+DYP
Sbjct: 212 IDTEKDYP 219


>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
 gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
          Length = 457

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 167 GSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGG 225

Query: 65  INTERDYPNVGVMDNC 80
           I++E DYP   V   C
Sbjct: 226 IDSEDDYPYKAVDGRC 241


>gi|214015364|gb|ACJ62298.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 252

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 50/82 (60%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V  IEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 129 GGCWAFSAVAGIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 186

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 187 IDTEADYPFIGTDGTCDASKEN 208


>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/80 (45%), Positives = 50/80 (62%), Gaps = 3/80 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG   I T NLV +S QQLVDC     ++ C GG ++  ++Y+I N G
Sbjct: 128 GSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGG 187

Query: 65  INTERDYPNV---GVMDNCK 81
           ++TE+DYP     GV D  K
Sbjct: 188 LDTEQDYPYTARDGVCDKSK 207


>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+LVDCD   +++ C GG ++  ++++ +  G
Sbjct: 148 GSCWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTS-QNQGCNGGLMDMAFEFIKKKGG 206

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           INTE +YP +     C + + N
Sbjct: 207 INTEENYPYMAEGGECDIQKRN 228


>gi|297740510|emb|CBI30692.3| unnamed protein product [Vitis vinifera]
          Length = 377

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 35/76 (46%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EGI+ IVT +L+ +S Q+LVDCD    +  C GG+++  +++VI N G
Sbjct: 34  GSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVISNGG 91

Query: 65  INTERDYPNVGVMDNC 80
           I++E DYP  G    C
Sbjct: 92  IDSESDYPYTGTDGTC 107


>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/80 (45%), Positives = 50/80 (62%), Gaps = 3/80 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG   I T NLV +S QQLVDC     ++ C GG ++  ++Y+I N G
Sbjct: 138 GSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGG 197

Query: 65  INTERDYPNV---GVMDNCK 81
           ++TE+DYP     GV D  K
Sbjct: 198 LDTEQDYPYTARDGVCDKSK 217


>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 1/75 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI++I T  L+ +S Q+LVDCD    +  C GG ++  ++++I+N G
Sbjct: 151 GSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTS-YNGGCGGGLMDYAFKFIIENGG 209

Query: 65  INTERDYPNVGVMDN 79
           I+TE DYP     DN
Sbjct: 210 IDTEEDYPYTATDDN 224


>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
          Length = 467

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 33/82 (40%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 160 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 218

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP       C  ++ N
Sbjct: 219 IDSEEDYPYRAADQKCDQYRKN 240


>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 360

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 32/71 (45%), Positives = 46/71 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI T  LV +S QQLVDCD  G+ + C GG ++  +QY+ +  G
Sbjct: 161 GCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISRQGG 220

Query: 65  INTERDYPNVG 75
           + +E  YP  G
Sbjct: 221 LASESAYPYSG 231


>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
 gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|223946183|gb|ACN27175.1| unknown [Zea mays]
 gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 385

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 49/77 (63%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD  G +  C GG ++  + Y+  N G
Sbjct: 175 GSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNN-GCNGGLMDYAFSYIAHNGG 233

Query: 65  INTERDYPNVGVMDNCK 81
           ++TE  YP +     C+
Sbjct: 234 LHTEEAYPYLMEEGTCQ 250


>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT  L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 160 GSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 218

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP       C  ++ N
Sbjct: 219 IDSEEDYPYKASDGRCDQYRKN 240


>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 32/82 (39%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +EGI++I T  L+ +S QQL+DCD + +   C GG +E+ ++++ +N G
Sbjct: 150 GSCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCD-RSDDHGCNGGLMESAFEFIKKNGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP     + C + + N
Sbjct: 209 ITTENNYPYKAKDERCDMLKMN 230


>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
          Length = 357

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 32/82 (39%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +EGI++I T  L+ +S QQL+DCD + +   C GG +E+ ++++ +N G
Sbjct: 148 GSCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCD-RSDDHGCNGGLMESAFEFIKKNGG 206

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP     + C + + N
Sbjct: 207 ITTENNYPYKAKDERCDMLKMN 228


>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 2/75 (2%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ VGA+EG++KIVT  LV +S Q L++C+   E+  C GG +ET Y+++++N G+
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMKNGGL 224

Query: 66  NTERDYPNVGVMDNC 80
            T+ DYP   V   C
Sbjct: 225 GTDNDYPYKAVNGVC 239


>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
          Length = 333

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 36/77 (46%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+E   K+ T NLV +S Q LVDC +   +  C GG+I   +QYVI N G
Sbjct: 139 GSCWAFSAVGALECQLKLKTGNLVSLSPQNLVDCSSAFGNHGCNGGYISAAFQYVIYNNG 198

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP  G    C+
Sbjct: 199 IDSEASYPYTGQSGTCR 215


>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT  L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 162 GSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 220

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP       C  ++ N
Sbjct: 221 IDSEEDYPYKASDGRCDQYRKN 242


>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
          Length = 379

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 34/68 (50%), Positives = 46/68 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA + A+E I++IVT NL+ +S QQ+VDC  +  +  C GG     YQ++I N G
Sbjct: 155 GSCWTFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDNGG 214

Query: 65  INTERDYP 72
           INTE +YP
Sbjct: 215 INTEANYP 222


>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
 gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
          Length = 378

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 46/68 (67%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+LVDCD  G +  C GG ++  + Y+  N G
Sbjct: 178 GSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDGNN-GCNGGLMDYAFSYIAHNGG 236

Query: 65  INTERDYP 72
           ++TE  YP
Sbjct: 237 LHTEEAYP 244


>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
          Length = 337

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LVD+S Q LVDC  +  +  C GGF+   +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHQAFQYVIDNQG 203

Query: 65  INTERDYPNVGVMDNCK 81
           I+++  YP  G    C+
Sbjct: 204 IDSDASYPYTGRNGECR 220


>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 51/78 (65%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD    +  C GG ++  + +++QN G
Sbjct: 155 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIVQNGG 213

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 214 LHKEDDYPYIMEESTCEM 231


>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/85 (42%), Positives = 55/85 (64%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EG++++ T NL+ +S Q+LVDCD   NQG    C GG +   +Q++I+
Sbjct: 160 GSCWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQG----CNGGDMGYAFQFIIK 215

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N GI++E DYP  G    C  ++ N
Sbjct: 216 NGGIDSEEDYPYTGKDGKCDSYRQN 240


>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 369

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 36/84 (42%), Positives = 49/84 (58%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG + + T  LV +S QQLVDCD+  +          C GG + T Y+
Sbjct: 159 GSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHLCDPEEAGACDSGCNGGLMTTAYE 218

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           YV+Q+ G+  E+DYP  G    CK
Sbjct: 219 YVLQSGGLEKEKDYPYTGKDGTCK 242


>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
          Length = 522

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 32/82 (39%), Positives = 52/82 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++E +++IVT  +V +S Q+LV+C   G +  C GG ++  + ++I+N G
Sbjct: 221 GSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGG 280

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 281 IDTEGDYPYKAVDGKCDINREN 302


>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
          Length = 378

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 49/78 (62%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ V A+EGI+KI+T NL+ +S Q+LVDC     +R C  G++   +Q++I N GI
Sbjct: 149 SCWAFSAVAAVEGINKIMTGNLLSLSEQELVDCGRTQSTRGCNRGYMTDAFQFIINNGGI 208

Query: 66  NTERDYPNVGVMDNCKVF 83
           NTE +YP       C  +
Sbjct: 209 NTEDNYPYTAQDGQCNRY 226


>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
           1; Flags: Precursor
 gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
          Length = 380

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 48/78 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ +  +EGI+KIVT  L+ +S Q+L+DC     +R C GG+I   +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C V
Sbjct: 209 INTEENYPYTAQDGECNV 226


>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
 gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 50/78 (64%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+LVDCD    +  C GG ++  + Y+I N G
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTT-YNNGCNGGLMDYAFAYIISNGG 198

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 199 LHKEEDYPYIMEEGTCEM 216


>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
          Length = 473

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT  L+ +S Q+LVDCD    S  C GG ++  Y+++I N G
Sbjct: 169 GSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNS-GCDGGLMDYAYEFIINNGG 227

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+T+ DYP       C  ++ N
Sbjct: 228 IDTDADYPYTAKDGKCDQYRKN 249


>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
 gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
          Length = 380

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 48/78 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ +  +EGI+KIVT  L+ +S Q+L+DC     +R C GG+I   +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C V
Sbjct: 209 INTEENYPYTAQDGECNV 226


>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
          Length = 233

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI KI T  LV ++ Q+LVDCD   E + C GG ++  ++++I+N G
Sbjct: 39  GCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHDEDQGCEGGLMDDAFKFIIKNGG 98

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP       CK
Sbjct: 99  LTTESSYPYTAADGKCK 115


>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
          Length = 380

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 48/78 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ +  +EGI+KIVT  L+ +S Q+L+DC     +R C GG+I   +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C V
Sbjct: 209 INTEENYPYTAQDGECNV 226


>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
           Precursor
 gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 2/75 (2%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ VGA+EG++KIVT  LV +S Q L++C+   E+  C GG +ET Y+++++N G+
Sbjct: 160 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKLETAYEFIMKNGGL 217

Query: 66  NTERDYPNVGVMDNC 80
            T+ DYP   V   C
Sbjct: 218 GTDNDYPYKAVNGVC 232


>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
 gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
          Length = 467

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 33/82 (40%), Positives = 52/82 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++E I++IVT  +V +S Q+LV+C   G +  C GG ++  + ++I+N G
Sbjct: 167 GSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGG 226

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 227 IDTEDDYPYKAVDGKCDINRRN 248


>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
 gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
          Length = 362

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 47/68 (69%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I TN LV +S Q+LVDCD   E++ C GG +E  ++++ + RG
Sbjct: 150 GSCWAFSTIVAVEGINYIKTNELVSLSEQELVDCDTT-ENQGCNGGLMEYAFEFIKKKRG 208

Query: 65  INTERDYP 72
           I TE  YP
Sbjct: 209 ITTESTYP 216


>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
 gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
          Length = 372

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 48/77 (62%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I T NL  +S QQLVDCD +G +  C GG ++  +QY+ ++ G
Sbjct: 158 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNA-GCDGGLMDYAFQYIAKHGG 216

Query: 65  INTERDYPNVGVMDNCK 81
           +  E  YP      +CK
Sbjct: 217 VAAEDAYPYKARQASCK 233


>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 306

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI +  LV +S Q+  DCD +  ++ C GG ++T + ++ +N G
Sbjct: 108 GSCWAFSAVAAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGG 167

Query: 65  INTERDYPNVGVMDNC 80
           + T +DYP  GV   C
Sbjct: 168 LTTSKDYPYEGVDGTC 183


>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
          Length = 307

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ + A+EGI K+ T NLV +S Q+ VDCD       C GG+++  +++VI+N G
Sbjct: 113 GCCWAFSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGG 172

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP   V   CK
Sbjct: 173 LATESSYPYKVVDGKCK 189


>gi|327322928|gb|AEA48885.1| cathepsin S [Oplegnathus fasciatus]
          Length = 337

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  L+D+S Q LVDC ++  +  C GGF+   +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLLDLSPQNLVDCSSKYGNHGCNGGFMHRAFQYVIDNQG 203

Query: 65  INTERDYPNVGVMDNC 80
           I+++  YP  G    C
Sbjct: 204 IDSDASYPYTGQSQQC 219


>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
 gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
          Length = 333

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG  +I + N+V +S Q LVDC  Q  ++ C GG +   ++Y+I N G
Sbjct: 139 GSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGG 198

Query: 65  INTERDYPNVGVMDNCK 81
           I TE  YP       CK
Sbjct: 199 IATESSYPYTAAQGRCK 215


>gi|63101996|gb|AAH95694.1| Cathepsin S, b.1 [Danio rerio]
          Length = 330

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 36/72 (50%), Positives = 47/72 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  LVD+S Q LVDC ++  ++ C GGF+   +QYVI N G
Sbjct: 137 GSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVIDNGG 196

Query: 65  INTERDYPNVGV 76
           I ++  YP  GV
Sbjct: 197 IASDSAYPYRGV 208


>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
          Length = 358

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LV +S Q LVDCD  G+   C GG+++  +QYV  N+G
Sbjct: 161 GSCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKG 220

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP  G    C+
Sbjct: 221 IDTEASYPYKGRDGRCR 237


>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
          Length = 357

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 2/75 (2%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ VGA+EG++KIVT  LV +S Q L++C+   E+  C GG +ET Y+++++N G+
Sbjct: 153 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKLETAYEFIMKNGGL 210

Query: 66  NTERDYPNVGVMDNC 80
            T+ DYP   V   C
Sbjct: 211 GTDNDYPYKAVNGVC 225


>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
 gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEGI K+ T NL+ +S QQLVDC     ++ C GG ++T +QY+I+N G
Sbjct: 111 GCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDC--TAGNKGCQGGLMDTAFQYIIRNGG 168

Query: 65  INTERDYPNVGVMDNC 80
           + +E +YP  GV   C
Sbjct: 169 LTSEDNYPYQGVDGTC 184


>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  ++++I+N G
Sbjct: 156 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 214

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP       C  ++ N
Sbjct: 215 IDTEEDYPYNARDGRCDQYRKN 236


>gi|62955291|ref|NP_001017661.1| cathepsin S, b.2 precursor [Danio rerio]
 gi|62204682|gb|AAH93339.1| Cathepsin S, b.2 [Danio rerio]
 gi|182891354|gb|AAI64362.1| Ctssb.2 protein [Danio rerio]
          Length = 330

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG     T  LVD+S Q LVDC ++  +  C GG++   +QYVI N G
Sbjct: 137 GSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGG 196

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP  G   +C+
Sbjct: 197 IDSESSYPYQGTQGSCR 213


>gi|75994626|gb|ABA33834.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 248

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKV 82
           I++E DYP +G    C  
Sbjct: 182 IDSEADYPFIGTDGTCDA 199


>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
          Length = 359

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 36/78 (46%), Positives = 50/78 (64%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++I T  LV +S QQLVDCD + E+  C GG +E  ++++ QN G
Sbjct: 150 GSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCDTE-ENEGCNGGLMEYAFEFIKQN-G 207

Query: 65  INTERDYPNVGVMDNCKV 82
           I TE +YP       C V
Sbjct: 208 ITTESNYPYAAKDGTCDV 225


>gi|310942958|pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
 gi|310942959|pdb|3P5V|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
 gi|310942961|pdb|3P5X|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
          Length = 220

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 49/78 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GS W F+ + A+EGI+KI T +L+ +S Q+LVDC     +R C GGF+   +Q++I N G
Sbjct: 23  GSXWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQNTRGCDGGFMTDGFQFIINNGG 82

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C +
Sbjct: 83  INTEANYPYTAEEGQCNL 100


>gi|463046|gb|AAA49207.1| cysteine proteinase [Cyprinus carpio]
          Length = 331

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 36/77 (46%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG     T  LVD+S Q LVDC +   +  C GG +   +QYVI N G
Sbjct: 137 GSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSSYGNYGCGGGLMSAAFQYVIDNGG 196

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP  GV   C+
Sbjct: 197 IDSESSYPYEGVQGQCR 213


>gi|47213723|emb|CAF95154.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 334

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG+    T  LVD+S Q LVDC  +  +  C GG++   +QYVI N G
Sbjct: 143 GSCWAFSAAGALEGLLAKTTGKLVDLSPQNLVDCTRKYGNHGCNGGYMHHTFQYVIDNHG 202

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP  G    C+
Sbjct: 203 IDSEASYPYTGQEGVCR 219


>gi|312100382|gb|ADQ27799.1| mitogenic proteinase [Vasconcellea cundinamarcensis]
          Length = 214

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 52/81 (64%), Gaps = 3/81 (3%)

Query: 2  HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
          +P GSCW F+ V  +EGI+KIVT  L+ +S Q+L+DCD +  S  C GG+  T  QYV+ 
Sbjct: 20 NPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR--SHGCNGGYQTTSLQYVVD 77

Query: 62 NRGINTERDYPNVGVMDNCKV 82
          N G++TE +YP      NC+ 
Sbjct: 78 N-GVHTEYEYPYEKKQGNCRA 97


>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
          Length = 458

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 35/82 (42%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+E I++IVT +L+ +S Q+LVDCD    +  C GG ++  + ++I N G
Sbjct: 151 GSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP  G  + C V + N
Sbjct: 210 IDTEDDYPYKGKDERCDVNRKN 231


>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 479

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 35/71 (49%), Positives = 50/71 (70%), Gaps = 1/71 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT  L+ +S Q+LVDCD +  +  C GG ++  +Q++I N G
Sbjct: 171 GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCD-KSFNMGCNGGLMDYAFQFIIGNGG 229

Query: 65  INTERDYPNVG 75
           I+TE DYP  G
Sbjct: 230 IDTEEDYPYKG 240


>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
          Length = 340

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI+K+ T  L+ +S Q+LVDCD  G  + C GG ++  + ++  N G
Sbjct: 144 GCCWAFSAVAATEGITKLTTGELISLSEQELVDCDTSGVDQGCEGGLMDNAFTFIQHNHG 203

Query: 65  INTERDYPNVGVMDNC 80
           + +E +YP  GV   C
Sbjct: 204 LASEANYPYKGVDGTC 219


>gi|224809458|ref|NP_001019580.2| cathepsin S, b.1 precursor [Danio rerio]
 gi|63101450|gb|AAH95788.1| Cathepsin S, b.1 [Danio rerio]
 gi|77748418|gb|AAI07613.1| Cathepsin S, b.1 [Danio rerio]
          Length = 330

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 36/72 (50%), Positives = 47/72 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  LVD+S Q LVDC ++  ++ C GGF+   +QYVI N G
Sbjct: 137 GSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVIDNGG 196

Query: 65  INTERDYPNVGV 76
           I ++  YP  GV
Sbjct: 197 IASDSAYPYRGV 208


>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 52/78 (66%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD +  +  C GG ++  + ++++N G
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENDG 212

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 213 LHKEEDYPYIMEEGTCEM 230


>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
          Length = 330

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LVD+S Q LVDC  +  +  C GGF+   +QYVI N G
Sbjct: 137 GSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVIDNHG 196

Query: 65  INTERDYPNVGVMDNCK 81
           I+++  YP  G  + C+
Sbjct: 197 IDSDASYPYTGRDEQCR 213


>gi|342305192|dbj|BAK55650.1| cathepsin S [Oplegnathus fasciatus]
          Length = 337

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  L+D+S Q LVDC ++  +  C GGF+   +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLLDLSPQNLVDCSSKYGNHGCNGGFMHRAFQYVIDNQG 203

Query: 65  INTERDYPNVGVMDNC 80
           I+++  YP  G    C
Sbjct: 204 IDSDASYPYTGQSQQC 219


>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
 gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 48/76 (63%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD    +  C GG ++  + Y++ N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFAYIVANGG 211

Query: 65  INTERDYPNVGVMDNC 80
           ++ E DYP +     C
Sbjct: 212 LHKEEDYPYIMEEGTC 227


>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
 gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
 gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
          Length = 350

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 51/78 (65%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD    +  C GG ++  + ++++N G
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIVKNGG 212

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 213 LHKEEDYPYIMEESTCEM 230


>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
          Length = 358

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI T  LV +S Q+L+DCD    +  C GG++   ++++ QN G
Sbjct: 151 GSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGG 210

Query: 65  INTERDYPNVGVMDNC 80
           I T R+YP +G    C
Sbjct: 211 ITTARNYPYIGEQGIC 226


>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
          Length = 389

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 52/76 (68%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EGI+ +VT +L+ +S Q+LV+CD    +  C GG+++  +++VI N G
Sbjct: 162 GSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS--NYGCEGGYMDYAFEWVINNGG 219

Query: 65  INTERDYPNVGVMDNC 80
           I++E DYP  GV   C
Sbjct: 220 IDSESDYPYTGVDGTC 235


>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 52/78 (66%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD +  +  C GG ++  + ++++N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENGG 211

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 212 LHKEEDYPYIMEEGTCEM 229


>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
          Length = 401

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 50/82 (60%), Gaps = 9/82 (10%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC-----DNQGESRSCVGGFIETIYQYV 59
           GSCW F+  G+ EGI+ I T+ LV +S Q LVDC     DN G    C GGF++  ++Y+
Sbjct: 206 GSCWAFSTTGSTEGINAITTSRLVPLSEQNLVDCATAAYDNYG----CNGGFMDNAFRYI 261

Query: 60  IQNRGINTERDYPNVGVMDNCK 81
           I N+GI++E  YP V     C+
Sbjct: 262 IDNKGIDSEASYPYVAADGQCR 283


>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
          Length = 365

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 48/77 (62%), Gaps = 2/77 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I++I T NL+ +S QQLVDC+ +  +  C GG     YQY+I N G
Sbjct: 156 GSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKK--NHGCKGGAFVYAYQYIIDNGG 213

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE +YP   V   C+
Sbjct: 214 IDTEANYPYKAVQGPCR 230


>gi|410904753|ref|XP_003965856.1| PREDICTED: cathepsin S-like [Takifugu rubripes]
          Length = 334

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LVD+S Q LVDC  +  +  C GG++   +QYVI N+G
Sbjct: 141 GSCWAFSAAGALEGQLAKTTGRLVDLSPQNLVDCSGKYGNHGCNGGYMHRAFQYVIDNQG 200

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP  G +  C
Sbjct: 201 IDSEASYPYRGQVQQC 216


>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 458

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 33/82 (40%), Positives = 56/82 (68%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++E I++IVT +L+ +S Q+LVDCD +  +  C GG ++  ++++I+N G
Sbjct: 150 GSCWAFSTVASVEAINQIVTGDLIALSEQELVDCD-RSYNEGCNGGLMDYAFEFIIENGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           ++TE DYP  G   +C  ++ N
Sbjct: 209 LDTEEDYPYYGFDSSCIQYKKN 230


>gi|125592009|gb|EAZ32359.1| hypothetical protein OsJ_16569 [Oryza sativa Japonica Group]
          Length = 480

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 31/82 (37%), Positives = 52/82 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I+++VT  ++ +S Q+LV+C   G++  C GG ++  + ++I+N G
Sbjct: 177 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 236

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 237 IDTEDDYPYKAVDGKCDINREN 258


>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
          Length = 354

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI T  LV +S Q+L+DCD    +  C GG++   ++++ QN G
Sbjct: 147 GSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGG 206

Query: 65  INTERDYPNVGVMDNC 80
           I T R+YP +G    C
Sbjct: 207 ITTARNYPYIGEQGIC 222


>gi|116666824|pdb|2BDZ|A Chain A, Mexicain From Jacaratia Mexicana
 gi|116666825|pdb|2BDZ|B Chain B, Mexicain From Jacaratia Mexicana
 gi|116666826|pdb|2BDZ|C Chain C, Mexicain From Jacaratia Mexicana
 gi|116666827|pdb|2BDZ|D Chain D, Mexicain From Jacaratia Mexicana
          Length = 214

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 52/81 (64%), Gaps = 3/81 (3%)

Query: 2  HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
          +P GSCW F+ V  IEGI+KI+T  L+ +S Q+L+DC+ +  S  C GG+  T  QYV+ 
Sbjct: 20 NPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERR--SHGCDGGYQTTSLQYVVD 77

Query: 62 NRGINTERDYPNVGVMDNCKV 82
          N G++TER+YP       C+ 
Sbjct: 78 N-GVHTEREYPYEKKQGRCRA 97


>gi|351705687|gb|EHB08606.1| Cathepsin S [Heterocephalus glaber]
          Length = 331

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQG-ESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+EG  K+ T  LV +S Q LVDC  +   ++ C GGF+   +QYVI N 
Sbjct: 137 GSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEKYRNKGCSGGFMTEAFQYVIDNN 196

Query: 64  GINTERDYPNVGVMDNC 80
           GI++E  YP     + C
Sbjct: 197 GIDSETSYPYKATDEKC 213


>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 459

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 33/82 (40%), Positives = 56/82 (68%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++E I++IVT +L+ +S Q+LVDCD +  +  C GG ++  ++++I+N G
Sbjct: 150 GSCWAFSTVASVEAINQIVTGDLIALSEQELVDCD-RSYNEGCNGGLMDYAFEFIIENGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           ++TE DYP  G   +C  ++ N
Sbjct: 209 LDTEEDYPYYGFDSSCIQYKKN 230


>gi|157829826|pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+
          Length = 218

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 48/78 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ +  +EGI+KIVT  L+ +S Q+L+DC     +R C GG+I   +Q++I N G
Sbjct: 23  GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 82

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C V
Sbjct: 83  INTEENYPYTAQDGECNV 100


>gi|440906717|gb|ELR56946.1| Cathepsin K [Bos grunniens mutus]
          Length = 338

 Score = 76.3 bits (186), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 146 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 203

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  +NC
Sbjct: 204 IDSEDAYPYVGQDENC 219


>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 323

 Score = 76.3 bits (186), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 51/81 (62%), Gaps = 3/81 (3%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           +P GSCW F+ V  +EGI+KIVT  L+ +S Q+L+DCD +  S  C GG+  T  QYV  
Sbjct: 128 NPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR--SHGCKGGYQTTSLQYVAD 185

Query: 62  NRGINTERDYPNVGVMDNCKV 82
           N G++TE++YP       C+ 
Sbjct: 186 N-GVHTEKEYPYEKKQGKCRA 205


>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score = 76.3 bits (186), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 52/78 (66%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD +  +  C GG ++  + ++++N G
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENGG 212

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 213 LHKEEDYPYIMEEGTCEM 230


>gi|255636047|gb|ACU18368.1| unknown [Glycine max]
          Length = 227

 Score = 76.3 bits (186), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 35/68 (51%), Positives = 48/68 (70%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + E+  C GG +E+ +Q++ Q  G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTE-ENAGCNGGLMESAFQFIKQKGG 208

Query: 65  INTERDYP 72
           I TE  YP
Sbjct: 209 ITTESYYP 216


>gi|222632170|gb|EEE64302.1| hypothetical protein OsJ_19139 [Oryza sativa Japonica Group]
          Length = 1105

 Score = 76.3 bits (186), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI+KI T +L+ +S Q+L+DCD    S  C GG ++  Y++V++N G
Sbjct: 151 GACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSYNS-GCGGGLMDYAYKFVVKNGG 209

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 210 IDTEADYP 217


>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
 gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
 gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
 gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
 gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
 gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
          Length = 466

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 31/82 (37%), Positives = 52/82 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I+++VT  ++ +S Q+LV+C   G++  C GG ++  + ++I+N G
Sbjct: 163 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 222

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 223 IDTEDDYPYKAVDGKCDINREN 244


>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
          Length = 380

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 48/78 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ +  +EGI+KIVT  L+ +S Q+L+DC     +R C GG+I   +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C +
Sbjct: 209 INTEENYPYTAQDGECNL 226


>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
 gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
          Length = 462

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 32/82 (39%), Positives = 52/82 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++E +++IVT  +V +S Q+LV+C   G +  C GG ++  + ++I+N G
Sbjct: 161 GSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGG 220

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 221 IDTEGDYPYKAVDGKCDINREN 242


>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 33/82 (40%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++I T  L+ +S Q+LVDCD +  +  C GG ++  ++++I N G
Sbjct: 155 GSCWAFSTISAVEGINQIATGKLITLSEQELVDCD-RSYNEGCNGGLMDYAFEFIINNGG 213

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+T+ DYP  G    C  ++ N
Sbjct: 214 IDTDVDYPYTGRDGKCDQYRKN 235


>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 52/78 (66%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD +  +  C GG ++  + ++++N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENGG 211

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 212 LHKEEDYPYIMEEGTCEM 229


>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
 gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
          Length = 384

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 33/71 (46%), Positives = 48/71 (67%), Gaps = 2/71 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEGI++I    LV +S Q+LVDCD +  +  C GG++   +++V+ N G
Sbjct: 152 GSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEFVMNNSG 209

Query: 65  INTERDYPNVG 75
           + TER+YP  G
Sbjct: 210 LTTERNYPYQG 220


>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
          Length = 362

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 35/82 (42%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + ++  C GG +E+ ++++ Q  G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTK-KNAGCNGGLMESAFEFIKQKGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP       C   + N
Sbjct: 209 ITTESNYPYTAQDGTCDASKAN 230


>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
           Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
 gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
          Length = 380

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 48/78 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ +  +EGI+KIVT  L+ +S Q+L+DC     +R C GG+I   +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C +
Sbjct: 209 INTEENYPYTAQDGECNL 226


>gi|77735825|ref|NP_001029607.1| cathepsin K precursor [Bos taurus]
 gi|59858469|gb|AAX09069.1| cathepsin K preproprotein [Bos taurus]
 gi|83638771|gb|AAI09854.1| Cathepsin K [Bos taurus]
 gi|296489554|tpg|DAA31667.1| TPA: cathepsin K [Bos taurus]
          Length = 334

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 142 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 199

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  +NC
Sbjct: 200 IDSEDAYPYVGQDENC 215


>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
 gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
 gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
          Length = 362

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 35/82 (42%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + ++  C GG +E+ ++++ Q  G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTK-KNAGCNGGLMESAFEFIKQKGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP       C   + N
Sbjct: 209 ITTESNYPYTAQDGTCDASKAN 230


>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
          Length = 465

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 32/82 (39%), Positives = 52/82 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++E +++IVT  +V +S Q+LV+C   G +  C GG ++  + ++I+N G
Sbjct: 164 GSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGG 223

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 224 IDTEGDYPYKAVDGKCDINREN 245


>gi|47523662|ref|NP_999467.1| cathepsin K precursor [Sus scrofa]
 gi|15213940|sp|Q9GLE3.1|CATK_PIG RecName: Full=Cathepsin K; Flags: Precursor
 gi|10048286|gb|AAG12340.1|AF292030_1 cathepsin K precursor [Sus scrofa]
          Length = 330

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  +NC
Sbjct: 196 IDSEDAYPYVGQDENC 211


>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
 gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
          Length = 360

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 31/68 (45%), Positives = 48/68 (70%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I TN L+ +S Q+LVDC N GE+  C GG ++  ++++ + +G
Sbjct: 148 GSCWAFSTIVAVEGINFIKTNKLISLSEQELVDC-NTGENHGCNGGLMDYAFEFITKQKG 206

Query: 65  INTERDYP 72
           I TE +YP
Sbjct: 207 ITTEANYP 214


>gi|67678376|gb|AAH96862.1| Cathepsin S, b.2 [Danio rerio]
          Length = 330

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG     T  LVD+S Q LVDC ++  +  C GG++   +QYVI N G
Sbjct: 137 GSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGG 196

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP  G   +C+
Sbjct: 197 IDSESSYPYQGTQGSCR 213


>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
          Length = 465

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 31/82 (37%), Positives = 52/82 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I+++VT  ++ +S Q+LV+C   G++  C GG ++  + ++I+N G
Sbjct: 162 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 221

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 222 IDTEDDYPYKAVDGKCDINREN 243


>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
          Length = 461

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 31/82 (37%), Positives = 52/82 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I+++VT  ++ +S Q+LV+C   G++  C GG ++  + ++I+N G
Sbjct: 158 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 217

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 218 IDTEDDYPYKAVDGKCDINREN 239


>gi|109940312|sp|Q5E968.2|CATK_BOVIN RecName: Full=Cathepsin K; Flags: Precursor
          Length = 329

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  +NC
Sbjct: 195 IDSEDAYPYVGQDENC 210


>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
          Length = 388

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT  L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 81  GSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 139

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP       C  ++ N
Sbjct: 140 IDSEEDYPYKASDGRCDQYRKN 161


>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
 gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 50/78 (64%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+LVDCD    +  C GG ++  + Y+I N G
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTT-NNYGCNGGLMDYAFSYIISNGG 198

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 199 LHKEVDYPYIMEEGTCEM 216


>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
          Length = 350

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI K+ T  L+ +S Q+LVDCD  G  + C GG I+  +Q+++ N G
Sbjct: 156 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGG 215

Query: 65  INTERDYPNVGVMDNCK 81
           +  E +YP       CK
Sbjct: 216 LTAEANYPYTAEDGRCK 232


>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
           Precursor
 gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 371

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 32/67 (47%), Positives = 48/67 (71%), Gaps = 2/67 (2%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ VGA+EG++KIVT  LV +S Q L++C+   E+  C GG +ET Y++++ N G+
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMNNGGL 224

Query: 66  NTERDYP 72
            T+ DYP
Sbjct: 225 GTDNDYP 231


>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 493

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 31/82 (37%), Positives = 52/82 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I+++VT  ++ +S Q+LV+C   G++  C GG ++  + ++I+N G
Sbjct: 190 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 249

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 250 IDTEDDYPYKAVDGKCDINREN 271


>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
          Length = 328

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 46  GSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGG 104

Query: 65  INTERDYPNVGVMDNC 80
           I++E DYP   V   C
Sbjct: 105 IDSEDDYPYKAVDGRC 120


>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  LV +S Q+LVDCD  GE + C GG ++  ++++I N G
Sbjct: 145 GCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNGG 204

Query: 65  INTERDYPNVGVMDNCK 81
           +  E  YP       CK
Sbjct: 205 LTQESSYPYDAEDGKCK 221


>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
 gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
          Length = 334

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 44/76 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA  GA+EG  +I T N+V  S Q LVDC  +  +  C GG + + ++Y+I N G
Sbjct: 141 GSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDCSGRYGNNGCDGGLMTSAFKYIIDNDG 200

Query: 65  INTERDYPNVGVMDNC 80
           I TE  YP     + C
Sbjct: 201 IATEEAYPYTATQNRC 216


>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 34/68 (50%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD +  +  C GG ++  +++VI+N G
Sbjct: 160 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-RSYNEGCDGGLMDYAFEFVIKNGG 218

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 219 IDTEEDYP 226


>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  LV +S Q+LVDCD  GE + C GG ++  ++++I N G
Sbjct: 145 GCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIISNGG 204

Query: 65  INTERDYPNVGVMDNCK 81
           +  E  YP       CK
Sbjct: 205 LTQESSYPYDAEDGKCK 221


>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
 gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 452

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 32/71 (45%), Positives = 50/71 (70%), Gaps = 1/71 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI++I T  L+ +S Q+LVDCD    +  C GG ++  ++++I+N G
Sbjct: 151 GSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTS-YNDGCGGGLMDYAFKFIIENGG 209

Query: 65  INTERDYPNVG 75
           I+TE DYP + 
Sbjct: 210 IDTEEDYPYIA 220


>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
          Length = 350

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 52/78 (66%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD +  +  C GG ++  + ++++N G
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENGG 212

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 213 LHKEEDYPYIMEEGACEM 230


>gi|351694420|gb|EHA97338.1| Cathepsin K [Heterocephalus glaber]
          Length = 329

 Score = 75.5 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV QNRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQQNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210


>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
          Length = 316

 Score = 75.5 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 36/77 (46%), Positives = 51/77 (66%), Gaps = 3/77 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG + + T  LV +S QQLVDCD   E   C GGF++T ++YV++ +G
Sbjct: 129 GSCWAFSATGALEGGNFVATGKLVSLSEQQLVDCDT--EDAGCGGGFMDTAFEYVMK-KG 185

Query: 65  INTERDYPNVGVMDNCK 81
           + TE DYP     ++CK
Sbjct: 186 LCTEEDYPYHAKDEDCK 202


>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score = 75.5 bits (184), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++EG   + T  LV +S Q LVDC        C GG+++  ++YVIQNRG
Sbjct: 143 GSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRG 202

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP   + ++C+
Sbjct: 203 IDTEASYPYKAIDESCE 219


>gi|403302732|ref|XP_003942007.1| PREDICTED: cathepsin S isoform 2 [Saimiri boliviensis boliviensis]
          Length = 289

 Score = 75.5 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +  ++ C GGF+   +QY+I N+G
Sbjct: 96  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKG 155

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP       C+
Sbjct: 156 IDSEASYPYKATDQKCQ 172


>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score = 75.5 bits (184), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++EG   + T  LV +S Q LVDC        C GG+++  ++YVIQNRG
Sbjct: 143 GSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRG 202

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP   + ++C+
Sbjct: 203 IDTEASYPYKAIDESCE 219


>gi|426216528|ref|XP_004002514.1| PREDICTED: cathepsin K [Ovis aries]
          Length = 330

 Score = 75.5 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  +NC
Sbjct: 196 IDSEDAYPYVGQDENC 211


>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 288

 Score = 75.5 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 45/68 (66%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I T NL  +S Q+L+DCD    S  C GG ++  +QY+I   G
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNS-GCNGGLMDYAFQYIISTGG 217

Query: 65  INTERDYP 72
           ++ E DYP
Sbjct: 218 LHKEDDYP 225


>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 419

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  LV +S Q+LVDCD  G  + C GG ++  ++++I+N G
Sbjct: 145 GCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGVDQGCEGGEMDNAFKFIIKNGG 204

Query: 65  INTERDYPNVGVMDNCK 81
           + TE +YP       CK
Sbjct: 205 LTTEANYPYTAQDGQCK 221


>gi|414591548|tpg|DAA42119.1| TPA: hypothetical protein ZEAMMB73_388689, partial [Zea mays]
          Length = 229

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 53/76 (69%), Gaps = 1/76 (1%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ + A+EG++KI+T  LV +S Q+LVDCD+  +++ C GG ++  +QY+ +N G
Sbjct: 13 GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDV-DNQGCDGGLMDYAFQYIQRNGG 71

Query: 65 INTERDYPNVGVMDNC 80
          + TE +YP +    +C
Sbjct: 72 VTTESNYPYLAEQRSC 87


>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 366

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 50/71 (70%), Gaps = 1/71 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I+KIVT   V +S Q+LVDCD +  ++ C GG ++  ++++IQN G
Sbjct: 152 GSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCD-RAYNQGCNGGLMDYAFEFIIQNGG 210

Query: 65  INTERDYPNVG 75
           I+T++DYP  G
Sbjct: 211 IDTDKDYPYRG 221


>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 377

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 49/84 (58%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
           GSCW F+  G+IEG + I T  L+++S QQLVDCD+Q +          C+GG +   Y+
Sbjct: 160 GSCWAFSTTGSIEGANFIATGKLLNLSEQQLVDCDSQCDITESTTCDNGCMGGLMTNAYK 219

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++Q+ G+  E  YP  G    CK
Sbjct: 220 YLLQSGGLEEESSYPYTGAKGECK 243


>gi|313118766|gb|ADR32295.1| C14 cysteine protease [Solanum demissum]
 gi|313118774|gb|ADR32299.1| C14 cysteine protease [Solanum verrucosum]
 gi|313118776|gb|ADR32300.1| C14 cysteine protease [Solanum verrucosum]
 gi|313118778|gb|ADR32301.1| C14 cysteine protease [Solanum verrucosum]
          Length = 217

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD +  +  C GG ++  +++VI N G
Sbjct: 23  GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNEGCDGGLMDYAFEFVINNGG 81

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP     D C  ++ N
Sbjct: 82  IDSEEDYPYKERNDVCDQYRKN 103


>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
          Length = 330

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +  ++ C GGF+   +QY+I N+G
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKG 196

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP   +   C+
Sbjct: 197 IDSEASYPYKAMDQKCQ 213


>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  LV +S Q+LVDCD  GE + C GG ++  ++++I N G
Sbjct: 145 GCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNGG 204

Query: 65  INTERDYPNVGVMDNCK 81
           +  E  YP       CK
Sbjct: 205 LTQESSYPYDAEDGKCK 221


>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
          Length = 286

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 51/78 (65%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD    S  C GG ++  + ++++N G
Sbjct: 114 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNS-GCNGGLMDYAFSFIVENGG 172

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 173 LHKEDDYPYIMEEGTCEM 190


>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
          Length = 378

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 33/67 (49%), Positives = 46/67 (68%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ V A+EGI+KIVT NL+ +S Q+LVDC     ++ C  G +   +Q++I N GI
Sbjct: 149 SCWAFSAVTAVEGINKIVTGNLISLSEQELVDCGRTQRTKGCNRGLMTDAFQFIINNGGI 208

Query: 66  NTERDYP 72
           NTE +YP
Sbjct: 209 NTEDNYP 215


>gi|59798094|sp|P84347.1|MEX2_JACME RecName: Full=Chymomexicain
          Length = 215

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 49/81 (60%), Gaps = 2/81 (2%)

Query: 2  HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
          +P GSCW F+ V  +EGI+KI T  L+ +S Q+L+DCD +  S  C GG+     QYV  
Sbjct: 20 NPCGSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRR--SHGCKGGYQTGSIQYVAD 77

Query: 62 NRGINTERDYPNVGVMDNCKV 82
          N G++TE++YP       C+ 
Sbjct: 78 NGGVHTEKEYPYEKKQGKCRA 98


>gi|313118762|gb|ADR32293.1| C14 cysteine protease [Solanum stoloniferum]
          Length = 217

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD +  +  C GG ++  +++VI N G
Sbjct: 23  GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNEGCDGGLMDYAFEFVINNGG 81

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP     D C  ++ N
Sbjct: 82  IDSEEDYPYKERNDVCDQYRKN 103


>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
 gi|194701798|gb|ACF84983.1| unknown [Zea mays]
 gi|194704800|gb|ACF86484.1| unknown [Zea mays]
 gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
 gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
          Length = 470

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 32/82 (39%), Positives = 52/82 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++E +++IVT  +V +S Q+LV+C   G +  C GG ++  + ++I+N G
Sbjct: 172 GSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGG 231

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 232 IDTEDDYPYRAVDGKCDMNRKN 253


>gi|328872971|gb|EGG21338.1| cysteine proteinase 5 precursor [Dictyostelium fasciculatum]
          Length = 358

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G+ EG  +I T+NLV +S Q L+DC +   +  C GG ++  ++Y+I N G
Sbjct: 138 GSCWSFSATGSTEGAHQISTSNLVALSEQNLIDCSSSYGNDGCNGGLMDNAFKYIIANGG 197

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP V  +  CK
Sbjct: 198 IDTEASYPYVAKVQKCK 214


>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
          Length = 217

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+E I+ IVT +L+ +S Q+LVDCD +  ++ C GG ++  +++VI N G
Sbjct: 23  GSCWAFSAVAAMESINAIVTGDLISLSEQELVDCD-KSYNQGCDGGLMDYAFEFVINNGG 81

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP     D C  ++ N
Sbjct: 82  IDTEEDYPYKERNDVCDQYRKN 103


>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 33/82 (40%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++I T  L+ +S Q+LVDCD +  +  C GG ++  +Q++I N G
Sbjct: 154 GSCWAFSTISAVEGINQIATGKLITLSEQELVDCD-RSYNEGCNGGLMDDAFQFIINNGG 212

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+++ DYP  G    C  ++ N
Sbjct: 213 IDSDADYPYTGRDGQCDQYRKN 234


>gi|313118760|gb|ADR32292.1| C14 cysteine protease [Solanum stoloniferum]
          Length = 217

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD +  +  C GG ++  +++VI N G
Sbjct: 23  GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNEGCDGGLMDYAFEFVINNGG 81

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I++E DYP     D C  ++ N
Sbjct: 82  IDSEEDYPYKERNDVCDQYRKN 103


>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
 gi|255636658|gb|ACU18666.1| unknown [Glycine max]
          Length = 367

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 51/80 (63%), Gaps = 1/80 (1%)

Query: 7   CWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGIN 66
           CW F+ + A+EGI+KIVT NL  +S Q+L+DCD    +  C GG ++  ++++I N GI+
Sbjct: 162 CWAFSAIAAVEGINKIVTGNLTALSEQELLDCDRTVNA-GCSGGLVDYAFEFIINNGGID 220

Query: 67  TERDYPNVGVMDNCKVFQFN 86
           TE DYP  G    C  ++ N
Sbjct: 221 TEEDYPFQGADGICDQYKIN 240


>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
 gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
          Length = 349

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 46/68 (67%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IV  NL  +S QQL+DCD    +  C GG ++  +++++ N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVAGNLTSLSEQQLIDCDTSFNN-GCNGGLMDYAFEFIVNNGG 211

Query: 65  INTERDYP 72
           ++ E DYP
Sbjct: 212 LHKEEDYP 219


>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
 gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
          Length = 353

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI T  LV +S Q+LVDCD  GE + C GG ++  +Q++ +  G
Sbjct: 159 GCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGG 218

Query: 65  INTERDYPNVGVMDNC 80
           + +E  YP  G   +C
Sbjct: 219 LASESGYPYQGDDGSC 234


>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score = 75.1 bits (183), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 33/71 (46%), Positives = 51/71 (71%), Gaps = 7/71 (9%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 154 GSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 209

Query: 62  NRGINTERDYP 72
           N GI++E DYP
Sbjct: 210 NGGIDSEEDYP 220


>gi|42794048|dbj|BAD11762.1| cahepsin L-like cysteine protease [Brugia malayi]
          Length = 371

 Score = 75.1 bits (183), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 50/78 (64%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC-DNQGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+EG   + T  LV++S Q L+DC D+   +  C GG +   ++YV++N 
Sbjct: 165 GSCWTFSAVGALEGQHFLQTGKLVELSMQNLLDCSDDTYGNYGCDGGLMMEAFEYVVKND 224

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+TE+ YP  G  + C+
Sbjct: 225 GIDTEKSYPYQGYQNTCR 242


>gi|403302730|ref|XP_003942006.1| PREDICTED: cathepsin S isoform 1 [Saimiri boliviensis boliviensis]
          Length = 339

 Score = 75.1 bits (183), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +  ++ C GGF+   +QY+I N+G
Sbjct: 146 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKG 205

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP       C+
Sbjct: 206 IDSEASYPYKATDQKCQ 222


>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score = 75.1 bits (183), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 32/67 (47%), Positives = 48/67 (71%), Gaps = 2/67 (2%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ VGA+EG++KIVT  LV +S Q L++C+   E+  C GG +ET Y++++ N G+
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMNNGGL 224

Query: 66  NTERDYP 72
            T+ DYP
Sbjct: 225 GTDNDYP 231


>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 355

 Score = 75.1 bits (183), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 45/68 (66%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I T NL  +S Q+L+DCD    S  C GG ++  +QY+I   G
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNS-GCNGGLMDYAFQYIISTGG 217

Query: 65  INTERDYP 72
           ++ E DYP
Sbjct: 218 LHKEDDYP 225


>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
          Length = 462

 Score = 75.1 bits (183), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 33/71 (46%), Positives = 51/71 (71%), Gaps = 7/71 (9%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 153 GSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 208

Query: 62  NRGINTERDYP 72
           N GI++E DYP
Sbjct: 209 NGGIDSEEDYP 219


>gi|290997496|ref|XP_002681317.1| cysteine protease [Naegleria gruberi]
 gi|284094941|gb|EFC48573.1| cysteine protease [Naegleria gruberi]
          Length = 350

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 40/90 (44%), Positives = 53/90 (58%), Gaps = 8/90 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-----QGES---RSCVGGFIETIY 56
           GSCW F+  G +EGI +I T  LV +S QQLVDCD+     QG+      C GG + + +
Sbjct: 147 GSCWTFSTTGNVEGIHQIKTGKLVSLSEQQLVDCDHNCVTYQGQQACDAGCNGGLMWSAF 206

Query: 57  QYVIQNRGINTERDYPNVGVMDNCKVFQFN 86
           QYVI+  G+ TE  YP  GV D C+  + N
Sbjct: 207 QYVIKTGGLVTEDSYPYEGVDDTCRFNKSN 236


>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 50/78 (64%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD    +  C GG ++  + ++ QN G
Sbjct: 155 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIGQNGG 213

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ E DYP +     C++
Sbjct: 214 LHKEEDYPYIMEESTCEM 231


>gi|62510453|sp|Q8HY82.1|CATS_SAIBB RecName: Full=Cathepsin S; Flags: Precursor
 gi|27497536|gb|AAO13008.1| cathepsin S preproprotein [Saimiri boliviensis]
          Length = 330

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +  ++ C GGF+   +QY+I N+G
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKG 196

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP       C+
Sbjct: 197 IDSEASYPYKATDQKCQ 213


>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
          Length = 351

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 48/76 (63%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD +  +  C GG ++  + ++I N G
Sbjct: 155 GSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCD-KPFNNGCNGGLMDYAFAFIISNGG 213

Query: 65  INTERDYPNVGVMDNC 80
           +  E DYP V     C
Sbjct: 214 LRKEEDYPYVMEEGTC 229


>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 461

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 37/79 (46%), Positives = 50/79 (63%), Gaps = 7/79 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ VG++EGI+ I     V +S Q+LVDCD   NQG    C GG ++  + ++IQ
Sbjct: 159 GSCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQG----CNGGLMDYAFDFIIQ 214

Query: 62  NRGINTERDYPNVGVMDNC 80
           N GI+TE+DYP  G    C
Sbjct: 215 NGGIDTEKDYPYKGFDGRC 233


>gi|301612003|ref|XP_002935514.1| PREDICTED: cathepsin K-like [Xenopus (Silurana) tropicalis]
          Length = 331

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 39/79 (49%), Positives = 50/79 (63%), Gaps = 6/79 (7%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC--DNQGESRSCVGGFIETIYQYVIQN 62
           GSCW F+ VGA+EG     T  LV IS Q LVDC  DN G    C GG++ T ++YV +N
Sbjct: 139 GSCWAFSTVGALEGQLMKKTGKLVGISPQNLVDCVKDNFG----CGGGYMTTAFKYVKKN 194

Query: 63  RGINTERDYPNVGVMDNCK 81
           +GI++E  YP VG+   CK
Sbjct: 195 KGIDSEEAYPYVGMDQKCK 213


>gi|15145801|gb|AAK83567.1| cysteine proteinase CC23 [Vasconcellea cundinamarcensis]
          Length = 176

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 51/81 (62%), Gaps = 3/81 (3%)

Query: 2  HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
          +P GSCW F+ V  +EGI+KIVT  L+ +S Q+L+DCD +  S  C GG+  T  QYV+ 
Sbjct: 19 NPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR--SHGCKGGYQTTSLQYVVD 76

Query: 62 NRGINTERDYPNVGVMDNCKV 82
          N G++TE+ YP       C+ 
Sbjct: 77 N-GVHTEKVYPYEKKQGKCRA 96


>gi|444515095|gb|ELV10757.1| Aryl hydrocarbon receptor nuclear translocator [Tupaia chinensis]
          Length = 786

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 626 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 683

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 684 IDSEDAYPYVGQDESC 699


>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI+KI T +L+ +S Q+L+DCD +  +  C GG +   Y++VI+N G
Sbjct: 156 GACWSFSATGAMEGINKITTGSLLSLSEQELIDCD-RSYNTGCGGGLMTYAYKFVIKNGG 214

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 215 IDTEDDYP 222


>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
          Length = 366

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 49/71 (69%), Gaps = 1/71 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I+KIVT   V +S Q+LVDCD +  +  C GG ++  ++++IQN G
Sbjct: 152 GSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCD-RAYNEGCNGGLMDYAFEFIIQNGG 210

Query: 65  INTERDYPNVG 75
           I+T++DYP  G
Sbjct: 211 IDTDKDYPYRG 221


>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
          Length = 341

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     TN LV +S Q L+DC     +  C GG ++  ++Y+  NRG
Sbjct: 146 GSCWSFSSTGALEGQHYRRTNILVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNRG 205

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE+ YP  G+ D C+
Sbjct: 206 IDTEKSYPYEGIDDKCR 222


>gi|118145|sp|P20721.1|CYSPL_SOLLC RecName: Full=Low-temperature-induced cysteine proteinase; Flags:
           Precursor
 gi|806314|gb|AAA66308.1| thiol protease, partial [Solanum lycopersicum]
          Length = 346

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 34/68 (50%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD +  +  C GG ++  +++VI+N G
Sbjct: 40  GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-RSYNEGCDGGLMDYAFEFVIKNGG 98

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 99  IDTEEDYP 106


>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
           Precursor
 gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
 gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
 gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 355

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 45/68 (66%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I T NL  +S Q+L+DCD    S  C GG ++  +QY+I   G
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNS-GCNGGLMDYAFQYIISTGG 217

Query: 65  INTERDYP 72
           ++ E DYP
Sbjct: 218 LHKEDDYP 225


>gi|147903593|ref|NP_001080822.1| cathepsin S precursor [Xenopus laevis]
 gi|33417128|gb|AAH56059.1| Ctss-a protein [Xenopus laevis]
          Length = 333

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG   + T  LV +S Q LVDC ++  ++ C GGF+ + +QYVI N G
Sbjct: 140 GSCWAFSAVGALEGQLMLKTGKLVSLSPQNLVDCASKYGNKGCSGGFMTSAFQYVIDNNG 199

Query: 65  INTERDYPNVGVMDNC 80
           I+++  YP   + + C
Sbjct: 200 IDSDSYYPYHAMDEKC 215


>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 368

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 50/76 (65%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+E I+KI T  LV +S Q+L+DCDN  + + C GG ++  +Q++ +N G
Sbjct: 159 GSCWAFSTIAAVESINKIRTGKLVSLSEQELMDCDNVND-QGCDGGLMDYAFQFIQKNGG 217

Query: 65  INTERDYPNVGVMDNC 80
           + +E +YP  G  + C
Sbjct: 218 VTSEANYPYQGQQNTC 233


>gi|403302736|ref|XP_003942009.1| PREDICTED: cathepsin K isoform 2 [Saimiri boliviensis boliviensis]
          Length = 383

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 191 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 248

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 249 IDSEDAYPYVGQEESC 264


>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 48/77 (62%), Gaps = 1/77 (1%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ V  IEG+ +I    LV +S Q+LVDC  +G+S  C GG++E  ++++ +  G+
Sbjct: 148 SCWAFSTVATIEGLHQITKGELVSLSEQELVDC-VKGDSEGCYGGYVEDAFEFIAKKGGV 206

Query: 66  NTERDYPNVGVMDNCKV 82
            +E  YP  GV   CKV
Sbjct: 207 ASETHYPYKGVNKTCKV 223


>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
          Length = 460

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 34/68 (50%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  +Q++I N G
Sbjct: 152 GSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDT-AYNEGCNGGLMDYAFQFIISNGG 210

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 211 IDTEEDYP 218


>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI+KI T +L+ +S Q+L+DCD +  +  C GG +   Y++VI+N G
Sbjct: 156 GACWSFSATGAMEGINKITTGSLLSLSEQELIDCD-RSYNTGCGGGLMTYAYKFVIKNGG 214

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 215 IDTEDDYP 222


>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
          Length = 330

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 31/68 (45%), Positives = 44/68 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG   I T  LV +S QQL+DC  +  +  C GG ++  ++YVI N G
Sbjct: 129 GSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCSTRYGNHGCNGGLMDYAFEYVIANGG 188

Query: 65  INTERDYP 72
           ++TE DYP
Sbjct: 189 LDTEEDYP 196


>gi|113120267|gb|ABI30273.1| VXH-B, partial [Vasconcellea x heilbornii]
          Length = 266

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 39/80 (48%), Positives = 51/80 (63%), Gaps = 3/80 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT  LV +S Q+L+DC+ +  S  C GGF     QYV QN G
Sbjct: 157 GSCWTFSSVAAVEGINKIVTGRLVSLSEQELLDCERR--SYGCRGGFPPYALQYVAQN-G 213

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           I+  ++YP  GV   C+  Q
Sbjct: 214 IHLRQNYPYEGVQRQCRARQ 233


>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
          Length = 471

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 31/82 (37%), Positives = 51/82 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I+++VT  ++ +S Q+LV+C   G++  C GG +   + ++I+N G
Sbjct: 162 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGG 221

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 222 IDTEDDYPYKAVDGKCDINREN 243


>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
          Length = 372

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 46/76 (60%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + ++EGI+ I T  LV +S QQLVDC    E+  C GG ++  +QY+I N G
Sbjct: 158 GSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSK--ENAGCNGGLMDNAFQYIIDNGG 215

Query: 65  INTERDYPNVGVMDNC 80
           I TE +YP       C
Sbjct: 216 IVTEDEYPYTAEAGEC 231


>gi|255572401|ref|XP_002527138.1| cysteine protease, putative [Ricinus communis]
 gi|223533498|gb|EEF35240.1| cysteine protease, putative [Ricinus communis]
          Length = 96

 Score = 75.1 bits (183), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 54/82 (65%), Gaps = 4/82 (4%)

Query: 6  SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
          SCW F++V A+EGI+KIVT  L+ +S Q+LVDCD +  +  C G  ++  +Q++I N GI
Sbjct: 9  SCWAFSIVAAVEGINKIVTGKLISLSDQELVDCD-RSYNAGCNGDLVDNAFQFIINNGGI 67

Query: 66 NTERDYPNVGV---MDNCKVFQ 84
          +T++DYP   V    D  KV Q
Sbjct: 68 DTDKDYPYQAVDGKRDMTKVLQ 89


>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 349

 Score = 75.1 bits (183), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEGI++I    LV +S Q+LVDCD+  E+  C GG++   +++V+ N G
Sbjct: 144 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDD--EAVGCGGGYMSWAFEFVVGNHG 201

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           + TE  YP       C+  + N
Sbjct: 202 LTTEASYPYHAANGACQAAKLN 223


>gi|5901663|gb|AAD55363.1| cysteine protease [Hordeum vulgare subsp. vulgare]
          Length = 163

 Score = 75.1 bits (183), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 52/82 (63%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V  +E I+++VT  ++ +S Q+LV+C   G++  C GG ++  + ++I+N G
Sbjct: 1  GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 60

Query: 65 INTERDYPNVGVMDNCKVFQFN 86
          I+TE DYP   V   C + + N
Sbjct: 61 IDTEEDYPYKAVDGKCDINREN 82


>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
          Length = 350

 Score = 75.1 bits (183), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEGI++I    LV +S Q+LVDCD+  E+  C GG++   +++V+ N G
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDD--EAVGCGGGYMSWAFEFVVGNHG 202

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           + TE  YP       C+  + N
Sbjct: 203 LTTEASYPYHAANGACQAAKLN 224


>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
          Length = 350

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V AIEGI++I    LV +S Q+LVDCD+  E+  C GG++   +++V+ N G
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDD--EAVGCGGGYMSWAFEFVVGNHG 202

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           + TE  YP       C+  + N
Sbjct: 203 LTTEASYPYHAANGACQAAKLN 224


>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
          Length = 509

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEGI+ +   +L+ +S Q+LVDCD+  +   C GG+++  +++V+ N G
Sbjct: 169 GSCWAFSSTGAIEGINALANGDLISLSEQELVDCDSTND--GCEGGYMDYAFEWVMSNGG 226

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP  G    C
Sbjct: 227 IDTETDYPYTGEDGTC 242


>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
          Length = 338

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T NLV +S Q LVDC  +   ++ C GGF+   +QY+I N 
Sbjct: 145 GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGFMTKAFQYIIDNN 204

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +  NC+
Sbjct: 205 GIDSEVSYPYKAMDGNCR 222


>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
          Length = 359

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 48/76 (63%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + ++EGI+KI TN LV +S QQLVDCD   ++  C GG ++  ++++  N G
Sbjct: 150 GSCWAFSTIASVEGINKIKTNQLVPLSGQQLVDCDTD-QNEGCNGGLMDYAFEFIKSNGG 208

Query: 65  INTERDYPNVGVMDNC 80
           I +E  YP      +C
Sbjct: 209 ITSESAYPYTAEQGSC 224


>gi|255626679|gb|ACU13684.1| unknown [Glycine max]
          Length = 229

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 47/71 (66%), Gaps = 1/71 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E  +KIVT   V +S Q+LVDCD     R C GG ++  ++++IQN G
Sbjct: 152 GSCWAFSTVATVEATNKIVTGKFVSLSEQELVDCDRAYNER-CNGGLMDYAFEFIIQNGG 210

Query: 65  INTERDYPNVG 75
           I+T++DYP  G
Sbjct: 211 IDTDKDYPYRG 221


>gi|149510440|ref|XP_001518002.1| PREDICTED: cathepsin K-like [Ornithorhynchus anatinus]
          Length = 618

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 38/79 (48%), Positives = 49/79 (62%), Gaps = 6/79 (7%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC--DNQGESRSCVGGFIETIYQYVIQN 62
           GSCW F+ VGA+EG  K  T  L+D+S Q LVDC   N G    C GG++   +QYV  N
Sbjct: 426 GSCWAFSSVGALEGQLKKKTGRLLDLSPQNLVDCVASNDG----CGGGYMTNAFQYVHDN 481

Query: 63  RGINTERDYPNVGVMDNCK 81
           RGI++E  YP VG  + C+
Sbjct: 482 RGIDSEDAYPYVGQDEPCR 500


>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI+KI T +L+ +S Q+L+DCD    S  C GG ++  Y++V++N G
Sbjct: 147 GACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSYNS-GCGGGLMDYAYKFVVKNGG 205

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 206 IDTEADYP 213


>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
 gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
          Length = 450

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EGI+KI T +L+ +S Q+L+DCD    S  C GG ++  Y++V++N G
Sbjct: 148 GACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSYNS-GCGGGLMDYAYKFVVKNGG 206

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 207 IDTEADYP 214


>gi|402856109|ref|XP_003892642.1| PREDICTED: cathepsin K [Papio anubis]
          Length = 348

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 156 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 213

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 214 IDSEDAYPYVGQEESC 229


>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
          Length = 360

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 50/78 (64%), Gaps = 7/78 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-------RSCVGGFIETIYQ 57
           GSCW F+  GA+EG + + T NLV +S QQLVDCD++ +S       + C GG + T ++
Sbjct: 151 GSCWSFSAAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQGCNGGLMTTAFE 210

Query: 58  YVIQNRGINTERDYPNVG 75
           Y++++ G+  E DYP  G
Sbjct: 211 YILKSGGLEREADYPYTG 228


>gi|403302734|ref|XP_003942008.1| PREDICTED: cathepsin K isoform 1 [Saimiri boliviensis boliviensis]
          Length = 329

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>gi|310975577|gb|ADP55137.1| cathepsin S [Miichthys miiuy]
          Length = 338

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 33/71 (46%), Positives = 43/71 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LVD+S Q LVDC  +  +  C GGF+   +QYVI N G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVIDNHG 203

Query: 65  INTERDYPNVG 75
           I+++  YP  G
Sbjct: 204 IDSDAAYPYTG 214


>gi|390476660|ref|XP_003735160.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin K [Callithrix jacchus]
          Length = 329

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>gi|149030666|gb|EDL85703.1| cathepsin S [Rattus norvegicus]
          Length = 291

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           GSCW F+ VGA+EG  K+ T  LV +S Q LVDC  + +  ++ C GGF+   +QY+I N
Sbjct: 96  GSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDN 155

Query: 63  RGINTERDYPNVGVMDNC 80
            GI++E  YP   + + C
Sbjct: 156 GGIDSEASYPYKAMDEKC 173


>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
          Length = 502

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 34/75 (45%), Positives = 50/75 (66%), Gaps = 2/75 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EGI+ I T  L+ +S Q+LVDCD   E   C GG+++  +++VI N G
Sbjct: 168 GSCWAFSSTGAMEGINAITTGELISLSEQELVDCDTTNE--GCDGGYMDYAFEWVINNGG 225

Query: 65  INTERDYPNVGVMDN 79
           I++E +YP  G  D+
Sbjct: 226 IDSEANYPYTGQADS 240


>gi|18202414|sp|P82473.1|CPGP1_ZINOF RecName: Full=Zingipain-1; AltName: Full=Cysteine proteinase GP-I
          Length = 221

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 49/80 (61%), Gaps = 2/80 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F  + A+EGI++IVT +L+ +S QQLVDC  +  +  C GG+    +QY+I N G
Sbjct: 25  GSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTR--NHGCEGGWPYRAFQYIINNGG 82

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           IN+E  YP  G    C   +
Sbjct: 83  INSEEHYPYTGTNGTCDTKE 102


>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 456

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 33/71 (46%), Positives = 51/71 (71%), Gaps = 7/71 (9%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD   NQG    C GG ++  ++++I 
Sbjct: 153 GSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 208

Query: 62  NRGINTERDYP 72
           N GI++E DYP
Sbjct: 209 NGGIDSEEDYP 219


>gi|432117576|gb|ELK37815.1| Cathepsin L1 [Myotis davidii]
          Length = 299

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 43/77 (55%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC     +  C GG ++  +QYV  N G
Sbjct: 81  GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCSGGLMDNAFQYVKDNEG 140

Query: 65  INTERDYPNVGVMDNCK 81
           ++TE  YP  G  D CK
Sbjct: 141 LDTEESYPYYGTDDTCK 157


>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
          Length = 331

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 35/78 (44%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQG-ESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+   +QY+I N 
Sbjct: 137 GSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNN 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   V   CK
Sbjct: 197 GIDSEASYPYKAVDGKCK 214


>gi|348586441|ref|XP_003478977.1| PREDICTED: cathepsin K-like [Cavia porcellus]
          Length = 329

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQENRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
          Length = 343

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 49/77 (63%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI KI   NL+ +S QQLVDC +  +++ C GGF++  + Y+ +N G
Sbjct: 149 GSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCASNEQNQGCGGGFMDNAFSYITEN-G 207

Query: 65  INTERDYPNVGVMDNCK 81
           I +E DY   G    C+
Sbjct: 208 IASENDYQYRGGAGTCQ 224


>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
 gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
          Length = 328

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC     ++ C GGF+   +QY+I N G
Sbjct: 135 GACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNG 194

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP +     C+
Sbjct: 195 IDSEESYPYMAQNGTCQ 211


>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
 gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
          Length = 217

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 34/68 (50%), Positives = 48/68 (70%), Gaps = 1/68 (1%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD +  +  C GG ++  +++VI N G
Sbjct: 23 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNEGCDGGLMDYAFEFVINNGG 81

Query: 65 INTERDYP 72
          I+TE DYP
Sbjct: 82 IDTEEDYP 89


>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
          Length = 360

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 35/82 (42%), Positives = 50/82 (60%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+ I TN LV +S Q+LVDCD   E++ C GG +   ++++ +  G
Sbjct: 148 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTS-ENQGCNGGLMGYAFEFIKEKGG 206

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE+ YP       C V + N
Sbjct: 207 ITTEQSYPYTAEDGTCDVSKVN 228


>gi|161172356|pdb|3BCN|A Chain A, Crystal Structure Of A Papain-Like Cysteine Protease
           Ervatamin-A Complexed With Irreversible Inhibitor E-64
 gi|161172357|pdb|3BCN|B Chain B, Crystal Structure Of A Papain-Like Cysteine Protease
           Ervatamin-A Complexed With Irreversible Inhibitor E-64
          Length = 209

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 49/80 (61%), Gaps = 2/80 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I++I T NL+ +S QQLVDC  +  +  C GG+ +  YQY+I N G
Sbjct: 23  GSCWAFSTVTTVESINQIRTGNLISLSEQQLVDCSKK--NHGCKGGYFDRAYQYIIANGG 80

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           I+TE +YP       C+  +
Sbjct: 81  IDTEANYPYKAFQGPCRAAK 100


>gi|395856027|ref|XP_003800444.1| PREDICTED: cathepsin K [Otolemur garnettii]
          Length = 329

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 38/78 (48%), Positives = 51/78 (65%), Gaps = 6/78 (7%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC--DNQGESRSCVGGFIETIYQYVIQN 62
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC  DN G    C GG++   +QYV +N
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSDNDG----CGGGYMTNAFQYVQKN 192

Query: 63  RGINTERDYPNVGVMDNC 80
           RGI++E  YP VG  ++C
Sbjct: 193 RGIDSEDAYPYVGQDESC 210


>gi|431896622|gb|ELK06034.1| Cathepsin K [Pteropus alecto]
          Length = 330

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211


>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
          Length = 314

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 45/76 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G C  F+ V A EGI KI T  LV ++ Q+LVDCD  GE + C GG ++  ++++I+N G
Sbjct: 120 GCCSAFSAVAATEGIVKISTGKLVSLADQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 179

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP       C
Sbjct: 180 LTTESSYPYTAADGKC 195


>gi|60654335|gb|AAX29858.1| cathepsin K [synthetic construct]
 gi|60654337|gb|AAX29859.1| cathepsin K [synthetic construct]
          Length = 330

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>gi|395729888|ref|XP_002810309.2| PREDICTED: cathepsin K [Pongo abelii]
          Length = 343

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 151 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 208

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 209 IDSEDAYPYVGQEESC 224


>gi|74136185|ref|NP_001027984.1| cathepsin K precursor [Macaca mulatta]
 gi|47117667|sp|P61276.1|CATK_MACFA RecName: Full=Cathepsin K; Flags: Precursor
 gi|47117668|sp|P61277.1|CATK_MACMU RecName: Full=Cathepsin K; Flags: Precursor
 gi|3236470|gb|AAC23694.1| cathepsin K [Macaca fascicularis]
 gi|4927694|gb|AAD33249.1| cathepsin K [Macaca mulatta]
 gi|355558400|gb|EHH15180.1| hypothetical protein EGK_01237 [Macaca mulatta]
 gi|355763132|gb|EHH62118.1| hypothetical protein EGM_20317 [Macaca fascicularis]
 gi|380809978|gb|AFE76864.1| cathepsin K preproprotein [Macaca mulatta]
 gi|383416065|gb|AFH31246.1| cathepsin K preproprotein [Macaca mulatta]
 gi|384945478|gb|AFI36344.1| cathepsin K preproprotein [Macaca mulatta]
          Length = 329

 Score = 74.3 bits (181), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>gi|49456399|emb|CAG46520.1| CTSK [Homo sapiens]
          Length = 329

 Score = 74.3 bits (181), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>gi|332220191|ref|XP_003259241.1| PREDICTED: cathepsin K [Nomascus leucogenys]
          Length = 329

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
          Length = 484

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I T NL  +S QQLVDCD +  +  C GG ++  +QY+ ++ G
Sbjct: 270 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANA-GCNGGLMDYAFQYIAKHGG 328

Query: 65  INTERDYPNVGVMDNCK 81
           +  E  YP      +CK
Sbjct: 329 VAAEDAYPYRARQASCK 345


>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
 gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
          Length = 376

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I T NL  +S QQLVDCD +  +  C GG ++  +QY+ ++ G
Sbjct: 162 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANA-GCNGGLMDYAFQYIAKHGG 220

Query: 65  INTERDYPNVGVMDNCK 81
           +  E  YP      +CK
Sbjct: 221 VAAEDAYPYRARQASCK 237


>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
          Length = 377

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I T NL  +S QQLVDCD +  +  C GG ++  +QY+ ++ G
Sbjct: 163 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANA-GCNGGLMDYAFQYIAKHGG 221

Query: 65  INTERDYPNVGVMDNCK 81
           +  E  YP      +CK
Sbjct: 222 VAAEDAYPYRARQASCK 238


>gi|4503151|ref|NP_000387.1| cathepsin K preproprotein [Homo sapiens]
 gi|1168793|sp|P43235.1|CATK_HUMAN RecName: Full=Cathepsin K; AltName: Full=Cathepsin O; AltName:
           Full=Cathepsin O2; AltName: Full=Cathepsin X; Flags:
           Precursor
 gi|562757|emb|CAA57649.1| Cathepsin O [Homo sapiens]
 gi|606923|gb|AAA65233.1| cathepsin O [Homo sapiens]
 gi|1195556|gb|AAB35521.1| cathepsin O2 [Homo sapiens]
 gi|16359188|gb|AAH16058.1| Cathepsin K [Homo sapiens]
 gi|49456311|emb|CAG46476.1| CTSK [Homo sapiens]
 gi|60823594|gb|AAX36649.1| cathepsin K [synthetic construct]
 gi|119573901|gb|EAW53516.1| cathepsin K (pycnodysostosis), isoform CRA_b [Homo sapiens]
 gi|307685681|dbj|BAJ20771.1| cathepsin K [synthetic construct]
 gi|312150424|gb|ADQ31724.1| cathepsin K [synthetic construct]
          Length = 329

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>gi|426331364|ref|XP_004026652.1| PREDICTED: cathepsin K [Gorilla gorilla gorilla]
          Length = 329

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>gi|836934|gb|AAA95998.1| cathepsin X [Homo sapiens]
          Length = 329

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>gi|397492864|ref|XP_003817340.1| PREDICTED: cathepsin K [Pan paniscus]
          Length = 343

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 151 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 208

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 209 IDSEDAYPYVGQEESC 224


>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 346

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 49/77 (63%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KIV  NLV +S QQL+DCD + ++  C GG +   + Y+I+NRG
Sbjct: 152 GCCWAFSSVAAVEGLTKIVGGNLVSLSEQQLLDCDRERDN-GCNGGIMSDAFSYIIKNRG 210

Query: 65  INTERDYPNVGVMDNCK 81
           I +E  YP       C+
Sbjct: 211 IASEASYPYQETEGTCR 227


>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
          Length = 279

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I T NL  +S QQLVDCD +  +  C GG ++  +QY+ ++ G
Sbjct: 65  GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANA-GCNGGLMDYAFQYIAKHGG 123

Query: 65  INTERDYPNVGVMDNCK 81
           +  E  YP      +CK
Sbjct: 124 VAAEDAYPYRARQASCK 140


>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 348

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 35/71 (49%), Positives = 47/71 (66%), Gaps = 7/71 (9%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           G CW F+ V A+EGI+KI    LV +S QQL+DCD   NQG    C GG +   ++Y+I+
Sbjct: 150 GGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQG----CRGGIMSKAFEYIIK 205

Query: 62  NRGINTERDYP 72
           N+GI TE +YP
Sbjct: 206 NQGITTEDNYP 216


>gi|42563538|gb|AAS20467.1| cysteine protease-like protein [Pelargonium x hortorum]
          Length = 234

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 36/85 (42%), Positives = 53/85 (62%), Gaps = 7/85 (8%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
          G CW F+ + A+EGI+ IVT  L+ +S Q+LVDCD   NQG    C GG ++  ++++I+
Sbjct: 2  GRCWAFSTIAAVEGINHIVTGELISLSEQELVDCDRSYNQG----CNGGLMDYAFEFIIK 57

Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
          N GI++E DYP   V   C   + N
Sbjct: 58 NGGIDSEEDYPYKAVDGTCDPIRKN 82


>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
          Length = 337

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LVD+S Q LVDC  +  +  C GG +   +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGLMHHAFQYVIDNQG 203

Query: 65  INTERDYPNVGVMDNCK 81
           I+++  YP  G    C+
Sbjct: 204 IDSDASYPYTGRNGECR 220


>gi|449532567|ref|XP_004173252.1| PREDICTED: oryzain alpha chain-like [Cucumis sativus]
          Length = 321

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V A+EGI++IVT  L+ +S Q+LVDCD +  +  C GG ++  +Q++I N G
Sbjct: 13 GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCD-KSFNMGCNGGLMDYAFQFIIGNGG 71

Query: 65 INTERDYPNVGVMDNC 80
          I+TE DYP  G    C
Sbjct: 72 IDTEEDYPYKGRDAAC 87


>gi|54020908|ref|NP_001005695.1| cathepsin S precursor [Xenopus (Silurana) tropicalis]
 gi|49522293|gb|AAH75261.1| cathepsin S [Xenopus (Silurana) tropicalis]
          Length = 333

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 49/76 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG   + T  LV +S Q LVDC ++  ++ C GGF+   +QYVI N+G
Sbjct: 140 GSCWAFSAVGALEGQLMLKTGKLVSLSPQNLVDCSSKYGNKGCGGGFMTQAFQYVIDNKG 199

Query: 65  INTERDYPNVGVMDNC 80
           I+++  YP   + + C
Sbjct: 200 IDSDSYYPYHAMDEKC 215


>gi|312381834|gb|EFR27484.1| hypothetical protein AND_05795 [Anopheles darlingi]
          Length = 508

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 43/68 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     TN LV +S Q LVDC +   ++ C GG I   +QY+ QN G
Sbjct: 313 GSCWAFSSTGAVEGQHFRKTNKLVSLSEQNLVDCTSNYRNKGCKGGAIYRSFQYIEQNHG 372

Query: 65  INTERDYP 72
           I+TE+ YP
Sbjct: 373 IDTEKSYP 380


>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
 gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
          Length = 330

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LV +S Q LVDC  Q  ++ C GG ++  +QY+IQN+G
Sbjct: 136 GSCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQEGNKGCEGGDMDQGFQYIIQNKG 195

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE+ YP       CK
Sbjct: 196 IDTEQCYPYKAKNHRCK 212


>gi|359492179|ref|XP_002280808.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
 gi|302142580|emb|CBI19783.3| unnamed protein product [Vitis vinifera]
          Length = 365

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 49/84 (58%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
           GSCW F+  GA+EG   + T NLV +S QQLVDCD++ +        R C GG + T ++
Sbjct: 157 GSCWSFSTTGALEGAHFLATGNLVSLSEQQLVDCDHECDPEEYGACDRGCNGGLMNTAFE 216

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y+++  G+    DYP  G   +CK
Sbjct: 217 YILKAGGVVRGEDYPYTGTDGHCK 240


>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
 gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
          Length = 461

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD    +  C GG ++  ++++I N G
Sbjct: 152 GSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 210

Query: 65  INTERDYP 72
           I++E DYP
Sbjct: 211 IDSEEDYP 218


>gi|6435586|pdb|7PCK|A Chain A, Crystal Structure Of Wild Type Human Procathepsin K
 gi|6435587|pdb|7PCK|B Chain B, Crystal Structure Of Wild Type Human Procathepsin K
 gi|6435588|pdb|7PCK|C Chain C, Crystal Structure Of Wild Type Human Procathepsin K
 gi|6435589|pdb|7PCK|D Chain D, Crystal Structure Of Wild Type Human Procathepsin K
 gi|6435592|pdb|1BY8|A Chain A, The Crystal Structure Of Human Procathepsin K
          Length = 314

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 122 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 179

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 180 IDSEDAYPYVGQEESC 195


>gi|214015390|gb|ACJ62311.1| cysteine protease [Zea mays subsp. parviglumis]
          Length = 247

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 50/82 (60%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G  W F+ V AIEG++ I T NLV +S Q+++DCD Q     C GG +E  +++VI N G
Sbjct: 124 GRYWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP +G    C   + N
Sbjct: 182 IDTEADYPFIGTDGTCDANKEN 203


>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
          Length = 300

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 48/76 (63%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD +  +  C GG ++  + ++I N G
Sbjct: 104 GSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCD-KPFNNGCNGGLMDYAFAFIISNGG 162

Query: 65  INTERDYPNVGVMDNC 80
           +  E DYP V     C
Sbjct: 163 LRKEEDYPYVMEEGTC 178


>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
          Length = 344

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 45/76 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC  +  +  C GG ++  +QYV  N+G
Sbjct: 149 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYVKDNKG 208

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+ YP   + D C
Sbjct: 209 IDTEKAYPYEAIDDEC 224


>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
 gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI K+ T  LV +S Q+LVDCD +G+ + C GG +   ++++ ++ G
Sbjct: 146 GSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDCDVKGQDKGCQGGLMVDAFKFIKRHGG 205

Query: 65  INTERDYPNVGVMDNC 80
           + +E +YP  G    C
Sbjct: 206 MTSEANYPYQGRDGKC 221


>gi|148362116|gb|ABQ59635.1| ervatamin-A [Tabernaemontana divaricata]
          Length = 184

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 49/80 (61%), Gaps = 2/80 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V  +E I++I T NL+ +S QQLVDC    ++  C GG+ +  YQY+I N G
Sbjct: 12 GSCWAFSTVTTVESINQIRTGNLISLSEQQLVDCSK--KNHGCKGGYFDRAYQYIIANGG 69

Query: 65 INTERDYPNVGVMDNCKVFQ 84
          I+TE +YP       C+  +
Sbjct: 70 IDTEANYPYKAFQGPCRAAK 89


>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
          Length = 362

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 48/68 (70%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EGI++I T  L+ +S Q+LVDCD    +  C GG +   ++++++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211

Query: 65  INTERDYP 72
           I T++DYP
Sbjct: 212 IETDQDYP 219


>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
          Length = 331

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQG-ESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC  +   +  C GGF+   +QY+I N 
Sbjct: 137 GSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEKYRNEGCHGGFMTEAFQYIIDNN 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   + + C+
Sbjct: 197 GIDSEASYPYKAMDEKCQ 214


>gi|66354492|gb|AAY44882.1| papain family cysteine protease [Vigna unguiculata]
          Length = 178

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 50/78 (64%), Gaps = 1/78 (1%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V  IEG+  I    LV +S Q+LVDC  +G+S  C GG++E  ++++ +  G
Sbjct: 8  GSCWAFSAVATIEGLHHIKKGELVSLSEQELVDC-VRGDSEGCNGGYVEDAFEFLAKKGG 66

Query: 65 INTERDYPNVGVMDNCKV 82
          I +E +YP  GV  +CKV
Sbjct: 67 IASETNYPYKGVNKSCKV 84


>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
           Precursor
 gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 362

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 48/68 (70%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EGI++I T  L+ +S Q+LVDCD    +  C GG +   ++++++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211

Query: 65  INTERDYP 72
           I T++DYP
Sbjct: 212 IETDQDYP 219


>gi|334332716|ref|XP_001367365.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 335

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 38/76 (50%), Positives = 46/76 (60%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ VGAIEG     T  LV +S Q LVDC       SC GGF++  +QYV  N GI
Sbjct: 140 SCWAFSAVGAIEGQWFRKTGELVSLSIQNLVDCTTSDSISSCHGGFMDRAFQYVQDNGGI 199

Query: 66  NTERDYPNVGVMDNCK 81
           +TE  YP VG ++ CK
Sbjct: 200 DTEECYPYVGEVNECK 215


>gi|77404197|ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris]
 gi|122056102|sp|Q3ZKN1.1|CATK_CANFA RecName: Full=Cathepsin K; Flags: Precursor
 gi|58047562|gb|AAW65150.1| cathepsin K [Canis lupus familiaris]
          Length = 330

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211


>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 380

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 45/68 (66%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I TNNL  +S QQLVDCD +  +  C GG ++  + Y+ ++ G
Sbjct: 166 GSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNA-GCDGGLMDDAFSYIAKHGG 224

Query: 65  INTERDYP 72
           +  E+ YP
Sbjct: 225 VAAEKSYP 232


>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 347

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 35/71 (49%), Positives = 47/71 (66%), Gaps = 7/71 (9%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           G CW F+ V A+EGI+KI    LV +S QQL+DCD   NQG    C GG +   ++Y+I+
Sbjct: 149 GGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDTDYNQG----CHGGIMSKAFEYIIK 204

Query: 62  NRGINTERDYP 72
           N+GI TE +YP
Sbjct: 205 NQGITTEDNYP 215


>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score = 74.3 bits (181), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI++I T  L+ +S Q+LVDCD +  +  C GG ++  + ++I+N G
Sbjct: 160 GSCWAFSAIGAVEGINQITTGELITLSEQELVDCD-RSYNEGCEGGLMDYAFNFIIKNGG 218

Query: 65  INTERDYPNVGVMDNC 80
           I+++ DYP  G    C
Sbjct: 219 IDSDLDYPYTGRDGTC 234


>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
          Length = 464

 Score = 74.3 bits (181), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 49/78 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT  LV +S Q+LV+C     +  C GG ++  + ++ +N G
Sbjct: 180 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFITRNGG 239

Query: 65  INTERDYPNVGVMDNCKV 82
           ++TE DYP   +   C +
Sbjct: 240 LDTEEDYPYTAMDGKCDL 257


>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
          Length = 329

 Score = 74.3 bits (181), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 42/68 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG  +I T N V +S QQL+DC     +  C GG +++   Y+++  G
Sbjct: 134 GSCWAFSTTGAVEGAHQIATGNFVSLSEQQLMDCSRSYGNHGCQGGLMDSAMSYIVKQGG 193

Query: 65  INTERDYP 72
           INTE  YP
Sbjct: 194 INTEESYP 201


>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 400

 Score = 74.3 bits (181), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 35/79 (44%), Positives = 54/79 (68%), Gaps = 2/79 (2%)

Query: 4   LGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNR 63
           +GSCW  + V AIEGI +I T+ L+ +S Q+LVD   +GES  C+GG++E  ++++++  
Sbjct: 225 IGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVD-SVKGESEGCIGGYVEDAFEFIVKKG 283

Query: 64  GINTERDYPNVGVMDNCKV 82
           GI +E  YP  GV + CKV
Sbjct: 284 GILSETHYPYKGV-NXCKV 301


>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
 gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
          Length = 350

 Score = 74.3 bits (181), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG  K+ T  L+ +S Q+LVDCD  G  + C GG I+  +Q+++ N G
Sbjct: 156 GCCWAFSAVAAMEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGG 215

Query: 65  INTERDYPNVGVMDNCK 81
           +  E +YP       CK
Sbjct: 216 LTAEANYPYTAEDGRCK 232


>gi|12621903|gb|AAB60643.2|AAB60643 cathepsin S [Homo sapiens]
          Length = 267

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214


>gi|281207557|gb|EFA81740.1| hypothetical protein PPL_05734 [Polysphondylium pallidum PN500]
          Length = 387

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 44/68 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G+IEG  +I T NLV +S Q L+DC     ++ C GG +   ++YVI+N G
Sbjct: 143 GSCWSFSTTGSIEGAHEIATGNLVSLSEQNLIDCSTAEGNQGCNGGLMTNAFEYVIKNGG 202

Query: 65  INTERDYP 72
           I+TE  YP
Sbjct: 203 IDTEASYP 210


>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 325

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LV +S Q LVDC +Q  +  C GG ++  ++Y+I+N G
Sbjct: 130 GSCWSFSTTGSVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGG 189

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP       CK
Sbjct: 190 IDTEASYPYTATTGTCK 206


>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 49/76 (64%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G  W F+ + A EGI +I T NLV +S Q+LVDCD+  +   C GGF+E  ++++I+N G
Sbjct: 149 GRFWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSVDD--GCEGGFMEDGFEFIIKNGG 206

Query: 65  INTERDYPNVGVMDNC 80
           I +E +YP  GV   C
Sbjct: 207 ITSETNYPYKGVDGTC 222


>gi|119573900|gb|EAW53515.1| cathepsin K (pycnodysostosis), isoform CRA_a [Homo sapiens]
          Length = 288

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 96  GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 153

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 154 IDSEDAYPYVGQEESC 169


>gi|440893559|gb|ELR46281.1| Cathepsin L1 [Bos grunniens mutus]
          Length = 330

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 45/76 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC     +R C GGFI+  +QYV+   G
Sbjct: 133 GSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGG 192

Query: 65  INTERDYPNVGVMDNC 80
           +++E  YP  G++  C
Sbjct: 193 LDSEESYPYTGLVGTC 208


>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
 gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
          Length = 498

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/67 (49%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VG+IEG + + T  LV +S QQLVDCD    +  C GG ++  ++YV+ N G
Sbjct: 155 GSCWAFSAVGSIEGANALATGQLVALSEQQLVDCDT-ASNMGCSGGLMDDAFKYVLDNGG 213

Query: 65  INTERDY 71
           I+TE DY
Sbjct: 214 IDTEEDY 220


>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
 gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
           Precursor
 gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
 gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
          Length = 360

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++I TN LV +S Q+LVDCD   +++ C GG ++  ++++ Q  G
Sbjct: 148 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGG 206

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP       C V + N
Sbjct: 207 ITTEANYPYEAYDGTCDVSKEN 228


>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
 gi|255636729|gb|ACU18700.1| unknown [Glycine max]
          Length = 341

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA V  +E + +I T  LV +S Q+LVDC  +G+S  C GG++E  ++++    G
Sbjct: 145 GSCWAFATVATVESLHQITTGELVSLSEQELVDC-VRGDSEGCRGGYVENAFEFIANKGG 203

Query: 65  INTERDYPNVGVMDNCKV 82
           I +E  YP  G   +CKV
Sbjct: 204 ITSEAYYPYKGKDRSCKV 221


>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
          Length = 359

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 35/82 (42%), Positives = 49/82 (59%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +EGI+KI T  LV +S Q+LVDC+   E   C GG +E  Y+++ ++ G
Sbjct: 148 GSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETDNE--GCNGGLMENAYEFIKKSGG 205

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TER YP      +C   + N
Sbjct: 206 ITTERLYPYKARDGSCDSSKMN 227


>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
 gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KI T  LV +S Q+L+DCDN   ++ C GG ++  +Q+ IQ  G
Sbjct: 154 GSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCEGGLMDYAFQF-IQKNG 211

Query: 65  INTERDYPNVGVMDNC 80
           I TE +YP  G   +C
Sbjct: 212 ITTESNYPYQGEQGSC 227


>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
          Length = 379

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 34/83 (40%), Positives = 45/83 (54%), Gaps = 6/83 (7%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE------SRSCVGGFIETIYQY 58
           GSCW F   G+IEG + + T  LV +S QQLVDCDN+ +         C GG + T Y Y
Sbjct: 162 GSCWAFTTTGSIEGANFLATGKLVSLSEQQLVDCDNKCDITKTSCDNGCNGGLMTTAYDY 221

Query: 59  VIQNRGINTERDYPNVGVMDNCK 81
           +++  G+  E  YP  G    CK
Sbjct: 222 LMEAGGLEEETSYPYTGAQGECK 244


>gi|410968296|ref|XP_003990643.1| PREDICTED: cathepsin K [Felis catus]
          Length = 330

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211


>gi|149751227|ref|XP_001490649.1| PREDICTED: cathepsin K-like [Equus caballus]
          Length = 329

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210


>gi|355681653|gb|AER96814.1| cathepsin K [Mustela putorius furo]
          Length = 329

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211


>gi|301767944|ref|XP_002919404.1| PREDICTED: cathepsin K-like [Ailuropoda melanoleuca]
 gi|281352889|gb|EFB28473.1| hypothetical protein PANDA_008011 [Ailuropoda melanoleuca]
          Length = 330

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211


>gi|295321664|pdb|3H7D|A Chain A, The Crystal Structure Of The Cathepsin K Variant M5 In
          Compl Chondroitin-4-Sulfate
 gi|295321665|pdb|3H7D|E Chain E, The Crystal Structure Of The Cathepsin K Variant M5 In
          Compl Chondroitin-4-Sulfate
          Length = 215

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ VGA+EG  K  T  L+++S Q LVDC +  E+  C GG++   +QYV +NRG
Sbjct: 23 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 80

Query: 65 INTERDYPNVGVMDNC 80
          I++E  YP VG  ++C
Sbjct: 81 IDSEDAYPYVGQEESC 96


>gi|139947602|ref|NP_001077155.1| cathepsin L1 precursor [Bos taurus]
 gi|134025180|gb|AAI34742.1| CTSL1 protein [Bos taurus]
 gi|296484500|tpg|DAA26615.1| TPA: cathepsin L1 [Bos taurus]
          Length = 333

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 45/76 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC     +R C GGFI+  +QYV+   G
Sbjct: 136 GSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGG 195

Query: 65  INTERDYPNVGVMDNC 80
           +++E  YP  G++  C
Sbjct: 196 LDSEESYPYTGLVGTC 211


>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 357

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 49/77 (63%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT  LV +S Q+L+DCD   +++ C GG ++  ++++  N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTKELVPLSEQELIDCDTD-QNQGCSGGLMDYAFEFIKNNGG 209

Query: 65  INTERDYPNVGVMDNCK 81
           I TE  YP       CK
Sbjct: 210 ITTEDVYPYQAEDATCK 226


>gi|130502110|ref|NP_001076110.1| cathepsin K precursor [Oryctolagus cuniculus]
 gi|1168794|sp|P43236.1|CATK_RABIT RecName: Full=Cathepsin K; AltName: Full=Protein OC-2; Flags:
           Precursor
 gi|454187|dbj|BAA03125.1| OC-2 protein [Oryctolagus cuniculus]
          Length = 329

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGGYMTNAFQYVQRNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210


>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
          Length = 707

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 31/68 (45%), Positives = 45/68 (66%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD    S  C GG ++  + ++  N G
Sbjct: 511 GSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNS-GCNGGLMDYAFAFIASNGG 569

Query: 65  INTERDYP 72
           ++ E DYP
Sbjct: 570 LHKEDDYP 577


>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
          Length = 360

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I T  LV +S Q+LVDCD   E++ C GG ++  + ++ +  G
Sbjct: 148 GSCWAFSTVVAVEGINQIKTKKLVSLSEQELVDCDTT-ENQGCNGGLMDPAFDFIKKRGG 206

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE  YP     D C + + N
Sbjct: 207 ITTEERYPYKAEDDKCDIQKRN 228


>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
          Length = 374

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 55/83 (66%), Gaps = 1/83 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+ GI++IVT  ++ +S Q+LVDCD + ++  C GG ++  ++++I N G
Sbjct: 162 GSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCD-RVQNSGCNGGLMDYAFEFIISNGG 220

Query: 65  INTERDYPNVGVMDNCKVFQFNW 87
           ++TE+ YP  GV   C   + N+
Sbjct: 221 MDTEKHYPYRGVEGRCDPVRKNY 243


>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
          Length = 291

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 48/81 (59%), Gaps = 3/81 (3%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           +P GSCW F+ +  +EGI KIVT  LV +S Q+++DC     S  C GGF++  Y ++I 
Sbjct: 143 NPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDC---AVSNGCDGGFVDNAYDFIIS 199

Query: 62  NRGINTERDYPNVGVMDNCKV 82
           N G+ +E DYP      +C  
Sbjct: 200 NNGVASEADYPYQAYQGDCAA 220


>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 454

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 35/79 (44%), Positives = 51/79 (64%), Gaps = 7/79 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+ +G++EGI+ I T   V +S Q+LVDCD   NQG    C GG ++  + ++++
Sbjct: 152 GSCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQG----CNGGLMDYAFDFILE 207

Query: 62  NRGINTERDYPNVGVMDNC 80
           N GI+TE DYP  G+   C
Sbjct: 208 NGGIDTENDYPYKGLDGRC 226


>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
          Length = 367

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 48/76 (63%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEG++ I T  LV +S Q+LV CD    +  C GG ++  + +VIQN G
Sbjct: 164 GSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACD--ATNYGCEGGDMDYAFTWVIQNGG 221

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+DY   GV   C
Sbjct: 222 IDTEKDYSYTGVDSTC 237


>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
          Length = 459

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EG+++IVT +L+ +S Q+LV+CD    +  C GG ++  ++++I+N G
Sbjct: 154 GSCWAFSAIAAVEGVNQIVTGDLISLSEQELVECDTS-YNDGCDGGLMDYAFEFIIKNEG 212

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+++ DYP  G    C   + N
Sbjct: 213 IDSDEDYPYTGRDGRCDTNRKN 234


>gi|2914594|pdb|1MEM|A Chain A, Crystal Structure Of Cathepsin K Complexed With A Potent
          Vinyl Sulfone Inhibitor
 gi|28374044|pdb|1NL6|A Chain A, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With A Covalent Azepanone
          Inhibitor
 gi|28374045|pdb|1NL6|B Chain B, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With A Covalent Azepanone
          Inhibitor
 gi|28374047|pdb|1NLJ|A Chain A, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With A Covalent Azepanone
          Inhibitor
 gi|28374048|pdb|1NLJ|B Chain B, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With A Covalent Azepanone
          Inhibitor
 gi|47168617|pdb|1Q6K|A Chain A, Cathepsin K Complexed With T-butyl(1s)-1-cyclohexyl-2-
          Oxoethylcarbamate
 gi|55670045|pdb|1TU6|A Chain A, Cathepsin K Complexed With A Ketoamide Inhibitor
 gi|55670046|pdb|1TU6|B Chain B, Cathepsin K Complexed With A Ketoamide Inhibitor
 gi|62738654|pdb|1YK7|A Chain A, Cathepsin K Complexed With A Cyanopyrrolidine Inhibitor
 gi|73535690|pdb|1YK8|A Chain A, Cathepsin K Complexed With A Cyanamide-Based Inhibitor
 gi|73535721|pdb|1YT7|A Chain A, Cathepsin K Complexed With A Constrained Ketoamide
          Inhibitor
 gi|93278849|pdb|2BDL|A Chain A, Cathepsin K Complexed With A Pyrrolidine Ketoamide-Based
          Inhibitor
 gi|114793438|pdb|2ATO|A Chain A, Crystal Structure Of Human Cathepsin K In Complex With
          Myocrisin
 gi|114793448|pdb|2AUX|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
 gi|114793451|pdb|2AUZ|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
 gi|126030469|pdb|2FTD|A Chain A, Crystal Structure Of Cathepsin K Complexed With
          7-Methyl- Substituted Azepan-3-One Compound
 gi|126030470|pdb|2FTD|B Chain B, Crystal Structure Of Cathepsin K Complexed With
          7-Methyl- Substituted Azepan-3-One Compound
 gi|157830076|pdb|1ATK|A Chain A, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With The Covalent Inhibitor E-64
 gi|157830085|pdb|1AU0|A Chain A, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With A Covalent Symmetric
          Diacylaminomethyl Ketone Inhibitor
 gi|157830086|pdb|1AU2|A Chain A, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With A Covalent Propanone
          Inhibitor
 gi|157830087|pdb|1AU3|A Chain A, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With A Covalent Pyrrolidinone
          Inhibitor
 gi|157830088|pdb|1AU4|A Chain A, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With A Covalent Pyrrolidinone
          Inhibitor
 gi|157830146|pdb|1AYU|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
          In Complex With A Covalent Symmetric Biscarbohydrazide
          Inhibitor
 gi|157830147|pdb|1AYV|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
          In Complex With A Covalent Thiazolhydrazide Inhibitor
 gi|157830148|pdb|1AYW|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
          In Complex With A Covalent
          Benzyloxybenzoylcarbohydrazide Inhibitor
 gi|157830300|pdb|1BGO|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
          In Complex With A Covalent Peptidomimetic Inhibitor
 gi|197305045|pdb|3C9E|A Chain A, Crystal Structure Of The Cathepsin K : Chondroitin
          Sulfate Complex.
 gi|290560385|pdb|3KW9|A Chain A, X-Ray Structure Of Cathepsin K Covalently Bound To A
          Triazine Ligand
 gi|290560386|pdb|3KWZ|A Chain A, Cathepsin K In Complex With A Non-Selective 2-Cyano-
          Pyrimidine Inhibitor
 gi|290560387|pdb|3KX1|A Chain A, Cathepsin K In Complex With A Selective
          2-Cyano-Pyrimidine Inhibitor
 gi|293651910|pdb|3KWB|X Chain X, Structure Of Catk Covalently Bound To A Dioxo-Triazine
          Inhibitor
 gi|293651911|pdb|3KWB|Y Chain Y, Structure Of Catk Covalently Bound To A Dioxo-Triazine
          Inhibitor
 gi|308198615|pdb|3O1G|A Chain A, Cathepsin K Covalently Bound To A 2-Cyano Pyrimidine
          Inhibitor With A Benzyl P3 Group.
 gi|327200584|pdb|3O0U|A Chain A, Cathepsin K Covalently Bound To A Cyano-Pyrimidine
          Inhibitor With Improved Selectivity Over Herg
 gi|394986262|pdb|4DMX|A Chain A, Cathepsin K Inhibitor
 gi|394986263|pdb|4DMY|A Chain A, Cathepsin K Inhibitor
 gi|394986264|pdb|4DMY|B Chain B, Cathepsin K Inhibitor
          Length = 215

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ VGA+EG  K  T  L+++S Q LVDC +  E+  C GG++   +QYV +NRG
Sbjct: 23 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 80

Query: 65 INTERDYPNVGVMDNC 80
          I++E  YP VG  ++C
Sbjct: 81 IDSEDAYPYVGQEESC 96


>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
          Length = 378

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 32/67 (47%), Positives = 46/67 (68%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ V A+EGI+KIVT NL+ +S Q+LVDC     ++ C  G +   ++++I N GI
Sbjct: 149 SCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQITKGCNRGLMTDAFKFIINNGGI 208

Query: 66  NTERDYP 72
           NTE +YP
Sbjct: 209 NTENNYP 215


>gi|281211531|gb|EFA85693.1| cysteine protease [Polysphondylium pallidum PN500]
          Length = 366

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 44/68 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG  +I T N+V++S Q LVDC +   +  C GG +   + Y+I N G
Sbjct: 136 GSCWSFSTTGSVEGAHQIKTGNMVELSEQNLVDCSSAEGNMGCNGGLMNNAFDYIISNHG 195

Query: 65  INTERDYP 72
           I+TE+ YP
Sbjct: 196 IDTEQSYP 203


>gi|85000505|ref|XP_954971.1| cysteine proteinase precursor, tacP [Theileria annulata strain
           Ankara]
 gi|65303117|emb|CAI75495.1| cysteine proteinase precursor, tacP, putative [Theileria annulata]
          Length = 447

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 51/78 (65%), Gaps = 3/78 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA +G++E + KI  +  +D+S Q+LVDC+ +  S+ C GGF +T  QY IQN+G
Sbjct: 259 GSCWAFATIGSVESLYKIYRDVTLDLSEQELVDCETK--SKGCEGGFGDTALQY-IQNKG 315

Query: 65  INTERDYPNVGVMDNCKV 82
           ++ + D P V   + C V
Sbjct: 316 VSNDNDIPYVAKKNTCVV 333


>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
 gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 47/81 (58%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           H  GSCW FA V AIEGI +I T  LV +S Q+LVDC     +  C GG++E    ++++
Sbjct: 145 HLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDACDFIVK 204

Query: 62  NRGINTERDYPNVGVMDNCKV 82
             GI +E +YP   V   C V
Sbjct: 205 KGGITSETNYPYTRVDGKCNV 225


>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
 gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
          Length = 352

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 48/81 (59%), Gaps = 3/81 (3%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           +P GSCW F+ +  +EGI KIVT  LV +S Q+++DC     S  C GGF++  Y ++I 
Sbjct: 143 NPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDC---AVSNGCDGGFVDNAYDFIIS 199

Query: 62  NRGINTERDYPNVGVMDNCKV 82
           N G+ +E DYP      +C  
Sbjct: 200 NNGVASEADYPYQAYQGDCAA 220


>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
          Length = 331

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/69 (47%), Positives = 45/69 (65%), Gaps = 1/69 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-RSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC  +  S + C GGF+ + +QY+I N 
Sbjct: 137 GSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYSNKGCNGGFMTSAFQYIIDNN 196

Query: 64  GINTERDYP 72
           GI++E  YP
Sbjct: 197 GIDSEASYP 205


>gi|313118764|gb|ADR32294.1| C14 cysteine protease [Solanum stoloniferum]
          Length = 217

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD +  ++ C GG ++  +++VI N G
Sbjct: 23 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNQGCDGGLMDYAFEFVINNGG 81

Query: 65 INTERDYP 72
          I++E DYP
Sbjct: 82 IDSEEDYP 89


>gi|75765285|pdb|1U9V|A Chain A, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With The Covalent Inhibitor
          Nvp-Abe854
 gi|75765286|pdb|1U9W|A Chain A, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With The Covalent Inhibitor
          Nvp-Abi491
 gi|75765287|pdb|1U9X|A Chain A, Crystal Structure Of The Cysteine Protease Human
          Cathepsin K In Complex With The Covalent Inhibitor
          Nvp-Abj688
 gi|160286063|pdb|2R6N|A Chain A, Crystal Structure Of A Pyrrolopyrimidine Inhibitor In
          Complex With Human Cathepsin K
          Length = 217

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ VGA+EG  K  T  L+++S Q LVDC +  E+  C GG++   +QYV +NRG
Sbjct: 25 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 82

Query: 65 INTERDYPNVGVMDNC 80
          I++E  YP VG  ++C
Sbjct: 83 IDSEDAYPYVGQEESC 98


>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
          Length = 368

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 49/76 (64%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +  +E I+KIVT  LV +S Q+LVDCD +  +  C GG ++  ++++I N G
Sbjct: 150 GSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCD-RAFNEGCNGGLMDYAFEFIIGNGG 208

Query: 65  INTERDYPNVGVMDNC 80
           I+T++ YP  G    C
Sbjct: 209 IDTDQHYPYKGFEGRC 224


>gi|315075311|ref|NP_001186668.1| cathepsin S isoform 2 preproprotein [Homo sapiens]
 gi|194376464|dbj|BAG62991.1| unnamed protein product [Homo sapiens]
          Length = 281

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 87  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 146

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 147 GIDSDASYPYKAMDQKCQ 164


>gi|50513589|pdb|1SNK|A Chain A, Cathepsin K Complexed With Carbamate Derivatized
          Norleucine Aldehyde
          Length = 214

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ VGA+EG  K  T  L+++S Q LVDC +  E+  C GG++   +QYV +NRG
Sbjct: 22 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 79

Query: 65 INTERDYPNVGVMDNC 80
          I++E  YP VG  ++C
Sbjct: 80 IDSEDAYPYVGQEESC 95


>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
          Length = 370

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD+  +          C GG +   ++
Sbjct: 161 GSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE 220

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++Q+ G+  E+DYP  G    CK
Sbjct: 221 YILQSGGVQKEKDYPYTGRDGTCK 244


>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
          Length = 339

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N 
Sbjct: 145 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNN 204

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +   C+
Sbjct: 205 GIDSEASYPYKAMNGKCR 222


>gi|315364648|pdb|3OVZ|A Chain A, Cathepsin K In Complex With A Covalent Inhibitor With A
          Ketoamide Warhead
          Length = 213

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ VGA+EG  K  T  L+++S Q LVDC +  E+  C GG++   +QYV +NRG
Sbjct: 21 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 78

Query: 65 INTERDYPNVGVMDNC 80
          I++E  YP VG  ++C
Sbjct: 79 IDSEDAYPYVGQEESC 94


>gi|344275468|ref|XP_003409534.1| PREDICTED: cathepsin K-like [Loxodonta africana]
          Length = 329

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210


>gi|71027309|ref|XP_763298.1| cysteine proteinase [Theileria parva strain Muguga]
 gi|68350251|gb|EAN31015.1| cysteine proteinase, putative [Theileria parva]
          Length = 460

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 35/78 (44%), Positives = 52/78 (66%), Gaps = 3/78 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA V ++E + KI  N  +D+S Q+LVDC+    S+ C GGF +T  +Y IQN+G
Sbjct: 272 GSCWAFASVSSVESLYKIYRNVTLDLSEQELVDCETS--SKGCEGGFGDTALKY-IQNKG 328

Query: 65  INTERDYPNVGVMDNCKV 82
           ++T+ + P +G  +NC V
Sbjct: 329 VSTDSEIPYLGKKNNCLV 346


>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 348

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 49/76 (64%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIV  NL  +S Q+L+DCD +  +  C GG ++  + +++ + G
Sbjct: 152 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCD-RPYNNGCHGGLMDYAFSFIVSSGG 210

Query: 65  INTERDYPNVGVMDNC 80
           ++ E DYP + V   C
Sbjct: 211 LHKEEDYPYLEVESTC 226


>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 473

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 49/77 (63%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I T  L  +S Q+L+DCD   +   C GGF++  + Y++ N G
Sbjct: 155 GSCWAFSTVAAVEGINQIATGKLESLSEQELMDCDTTFD-HGCGGGFMDFAFAYIMGNLG 213

Query: 65  INTERDYPNVGVMDNCK 81
           I+T+ DYP +     CK
Sbjct: 214 IHTDDDYPYLMEEGYCK 230


>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
           GSCW F+  GA+EG + I T NL+++S QQLVDCD+  +       +  C GG +   Y+
Sbjct: 200 GSCWAFSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYK 259

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y+IQ+ G+  E  YP  G    C
Sbjct: 260 YLIQSGGLEEESSYPYTGRSGQC 282


>gi|46948144|gb|AAT07054.1| cathepsin L-like cysteine proteinase [Brugia malayi]
          Length = 368

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 50/78 (64%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC-DNQGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA++G   + T  LV++S Q L+DC D+   +  C GG +   ++YV++N 
Sbjct: 162 GSCWTFSAVGALKGQHFLQTGKLVELSMQNLLDCSDDTYGNYGCDGGLMMEAFEYVVKND 221

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+TE+ YP  G  + C+
Sbjct: 222 GIDTEKSYPYQGYQNTCR 239


>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
          Length = 367

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD+  +          C GG +   ++
Sbjct: 158 GSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE 217

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++Q+ G+  E+DYP  G    CK
Sbjct: 218 YILQSGGVQKEKDYPYTGRDGTCK 241


>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
          Length = 360

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 47/84 (55%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD+  +          C GG +   + 
Sbjct: 149 GSCWAFSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAGCNGGLMNNAFD 208

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++Q  G+ TE+DYP  G  + CK
Sbjct: 209 YILQAGGVQTEKDYPYSGRDETCK 232


>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 351

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 49/76 (64%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIV  NL  +S Q+L+DCD +  +  C GG ++  + +++ + G
Sbjct: 155 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCD-RPYNNGCHGGLMDYAFSFIVSSGG 213

Query: 65  INTERDYPNVGVMDNC 80
           ++ E DYP + V   C
Sbjct: 214 LHKEEDYPYLEVESTC 229


>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
          Length = 365

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KI T  LV +S Q+L+DCDN   ++ C GG ++  +Q+ IQ  G
Sbjct: 154 GSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCDGGLMDYAFQF-IQKNG 211

Query: 65  INTERDYPNVGVMDNC 80
           I TE +YP  G   +C
Sbjct: 212 ITTESNYPYQGEQGSC 227


>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KI T  LV +S Q+L+DCDN   ++ C GG ++  +Q+ IQ  G
Sbjct: 154 GSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCDGGLMDYAFQF-IQKNG 211

Query: 65  INTERDYPNVGVMDNC 80
           I TE +YP  G   +C
Sbjct: 212 ITTESNYPYQGEQGSC 227


>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
 gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
 gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
 gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
 gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
 gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
          Length = 331

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214


>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
 gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
          Length = 331

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N 
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNN 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +   C+
Sbjct: 197 GIDSEASYPYKAMNGKCR 214


>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
           GSCW F+  GA+EG + I T NL+++S QQLVDCD+  +       +  C GG +   Y+
Sbjct: 200 GSCWAFSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYK 259

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y+IQ+ G+  E  YP  G    C
Sbjct: 260 YLIQSGGLEEESSYPYTGRSGQC 282


>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
          Length = 332

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214


>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
          Length = 341

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC  + +  ++ C GGF+   +QY+I N
Sbjct: 146 GSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCDGGFMTRAFQYIIDN 205

Query: 63  RGINTERDYPNVGVMDNC 80
            GI+++  YP   V + C
Sbjct: 206 GGIDSDASYPYKAVAEKC 223


>gi|229366026|gb|ACQ57993.1| Cathepsin H precursor [Anoplopoma fimbria]
          Length = 247

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 45/80 (56%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G +E ++ I T  LV +S QQLVDC     +  C GG     ++Y++ ++G
Sbjct: 130 GSCWTFSTTGCLESVTAISTGKLVPLSEQQLVDCAQDFNNHGCNGGLPSQAFEYIMYSKG 189

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           + TE+DYP     D C   Q
Sbjct: 190 LMTEKDYPYTAFEDTCAYKQ 209


>gi|334324659|ref|XP_001371004.2| PREDICTED: cathepsin K-like [Monodelphis domestica]
          Length = 332

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 140 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 197

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP +G  ++C
Sbjct: 198 IDSEDAYPYIGEDESC 213


>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
          Length = 312

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 48/81 (59%), Gaps = 3/81 (3%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           +P GSCW F+ +  +EGI KIVT  LV +S Q+++DC     S  C GGF++  Y ++I 
Sbjct: 103 NPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDC---AVSNGCDGGFVDNAYDFIIS 159

Query: 62  NRGINTERDYPNVGVMDNCKV 82
           N G+ +E DYP      +C  
Sbjct: 160 NNGVASEADYPYQAYQGDCAA 180


>gi|59798093|sp|P84346.1|MEX1_JACME RecName: Full=Mexicain
          Length = 214

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 51/81 (62%), Gaps = 3/81 (3%)

Query: 2  HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
          +P GSCW F+ V  IEGI+KI+T  L+ +S Q+L+DC+ +  S  C GG+     QYV+ 
Sbjct: 20 NPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYR--SHGCDGGYQTPSLQYVVD 77

Query: 62 NRGINTERDYPNVGVMDNCKV 82
          N G++TER+YP       C+ 
Sbjct: 78 N-GVHTEREYPYEKKQGRCRA 97


>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
          Length = 341

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 31/71 (43%), Positives = 45/71 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI T  LV +S Q+LVDCD  G  + C GG ++  +Q+V +  G
Sbjct: 147 GCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGG 206

Query: 65  INTERDYPNVG 75
           + +E  YP  G
Sbjct: 207 LASESGYPYQG 217


>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
          Length = 341

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     TN LV +S Q L+DC     +  C GG ++  ++Y+  N+G
Sbjct: 146 GSCWSFSATGALEGQHYRQTNILVSLSEQNLIDCSTAYGNNGCNGGLMDNAFKYIKDNKG 205

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE+ YP   V D C+
Sbjct: 206 IDTEKSYPYEAVDDKCR 222


>gi|114559420|ref|XP_001171183.1| PREDICTED: cathepsin S isoform 1 [Pan troglodytes]
 gi|397492868|ref|XP_003817342.1| PREDICTED: cathepsin S isoform 2 [Pan paniscus]
          Length = 281

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 87  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 146

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP       C+
Sbjct: 147 GIDSDASYPYKATDQKCQ 164


>gi|224106333|ref|XP_002333699.1| predicted protein [Populus trichocarpa]
 gi|222837985|gb|EEE76350.1| predicted protein [Populus trichocarpa]
          Length = 197

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 51/77 (66%), Gaps = 2/77 (2%)

Query: 4  LGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNR 63
          +G CW F+ V AIEGI K+ T NL+ +S QQLV+ D    ++ C GG ++T +QY+I+N 
Sbjct: 2  VGCCWAFSAVAAIEGIIKLKTGNLISLSKQQLVNRDVG--NKGCHGGLMDTAFQYIIRNE 59

Query: 64 GINTERDYPNVGVMDNC 80
          G+ +E +YP  GV   C
Sbjct: 60 GLTSEDNYPYQGVDGTC 76


>gi|113120273|gb|ABI30276.1| VXH-C [Vasconcellea x heilbornii]
          Length = 282

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/80 (45%), Positives = 50/80 (62%), Gaps = 3/80 (3%)

Query: 3   PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
           P GSCW F+ V  +EGI+KIVT  L+ +S Q+L+DCD +  S  C GG+  T  QYV+ N
Sbjct: 155 PCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR--SHGCDGGYQRTSLQYVVDN 212

Query: 63  RGINTERDYPNVGVMDNCKV 82
            G++TE +Y       NC+ 
Sbjct: 213 -GVHTEYEYQYEKKQGNCRA 231


>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
 gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
 gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
          Length = 381

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 52/84 (61%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  L  +S QQ+VDCD++    ESR+    C GG + T + 
Sbjct: 167 GSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFS 226

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++++ G+ +E+DYP  G  + CK
Sbjct: 227 YLMKSGGLQSEKDYPYAGRENTCK 250


>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
 gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
 gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
 gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
 gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
          Length = 331

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214


>gi|348513412|ref|XP_003444236.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
          Length = 328

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+  GA+EG  K  T  L  +STQ L+DC     +R C GG I   ++YV+ N+G
Sbjct: 135 GACWAFSAAGALEGQLKKSTGILRSLSTQNLIDCTTDYGNRGCNGGLIARAFKYVVDNQG 194

Query: 65  INTERDYPNVGVMDNCK 81
           I +E  YP +G  + CK
Sbjct: 195 IASEDAYPYIGRHNQCK 211


>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
 gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
          Length = 371

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 47/76 (61%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+LVDC   G +  C GG ++  + Y+  + G
Sbjct: 174 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNN-GCNGGVMDNAFSYIASSGG 232

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP +    +C
Sbjct: 233 LRTEEAYPYLMEEGDC 248


>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 341

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI K+ T  L+ +S Q+LVDCD  G  + C GG ++  ++++I+N G
Sbjct: 145 GCCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVDQGCEGGEMDDAFKFIIKNGG 204

Query: 65  INTERDYPNVGVMDNCK 81
           + TE +YP       CK
Sbjct: 205 LTTEANYPYTAQDGQCK 221


>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 349

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI ++ T  LV +S Q+LVDCD  G    C GG ++  ++++I+N G
Sbjct: 153 GCCWAFSAVAATEGIVQLSTGKLVPLSEQELVDCDANGADHGCEGGEMDDAFEFIIKNGG 212

Query: 65  INTERDYPNVGVMDNCK 81
           + +E +YP       CK
Sbjct: 213 LTSETNYPYTAQDGQCK 229


>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
 gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 385

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 44/68 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I T+NL  +S QQLVDCD +  +  C GG ++  +QY+ ++ G
Sbjct: 158 GSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGG 217

Query: 65  INTERDYP 72
           +     YP
Sbjct: 218 VAASSAYP 225


>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
          Length = 369

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 44/68 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I T+NL  +S QQLVDCD +  +  C GG ++  +QY+ ++ G
Sbjct: 142 GSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGG 201

Query: 65  INTERDYP 72
           +     YP
Sbjct: 202 VAASSAYP 209


>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
          Length = 325

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 50/76 (65%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +  +E I+KIVT   V +S Q+LVDCD +  +  C GG ++  ++++I+N G
Sbjct: 114 GSCWAFSTIATVEAINKIVTGKFVSLSEQELVDCD-RAFNEGCNGGLMDYAFEFIIRNGG 172

Query: 65  INTERDYPNVGVMDNC 80
           I+T++DYP  G    C
Sbjct: 173 IDTDQDYPYNGFERKC 188


>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 356

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 7/82 (8%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD--NQGESRSCVG-GFIETIYQYVIQN 62
           SCW F+ V A+EG++KIVT  L+ +S Q+LVDC+  N G    C G G ++T +Q++I N
Sbjct: 156 SCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNLVNNG----CYGSGLMDTAFQFLINN 211

Query: 63  RGINTERDYPNVGVMDNCKVFQ 84
            G+++E+DYP  G   +C   Q
Sbjct: 212 NGLDSEKDYPYQGTQGSCNRKQ 233


>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
          Length = 361

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 50/77 (64%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT  LV +S Q+LVDCD   +   C GG ++  + Y++ ++G
Sbjct: 157 GSCWAFSSVAAVEGINQIVTGKLVSLSEQELVDCDTTLD-HGCEGGTMDLAFAYMMGSQG 215

Query: 65  INTERDYPNVGVMDNCK 81
           I+ E DYP +     CK
Sbjct: 216 IHAEDDYPYLMEEGYCK 232


>gi|395535911|ref|XP_003769964.1| PREDICTED: cathepsin K [Sarcophilus harrisii]
          Length = 332

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC ++ +   C GG++   +QYV +NRG
Sbjct: 140 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSKND--GCGGGYMTNAFQYVQENRG 197

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP +G  ++C
Sbjct: 198 IDSEDAYPYIGQDESC 213


>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
 gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
          Length = 340

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 44/68 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI T  LV +S Q+LVDCD  G  + C GG ++  +Q+V +  G
Sbjct: 147 GCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGG 206

Query: 65  INTERDYP 72
           + +E  YP
Sbjct: 207 LASESGYP 214


>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
 gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
          Length = 356

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 50/77 (64%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT  LV +S Q+L+DCD   +   C GG ++  + Y++ ++G
Sbjct: 155 GSCWAFSSVAAVEGINQIVTGKLVSLSEQELMDCDTMLD-HGCEGGLMDFAFAYIMGSQG 213

Query: 65  INTERDYPNVGVMDNCK 81
           I+ E DYP +     CK
Sbjct: 214 IHAEDDYPYLMEEGYCK 230


>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
          Length = 365

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 33/83 (39%), Positives = 47/83 (56%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD++ +S         C GG + T ++
Sbjct: 154 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFE 213

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y ++  G+  E+DYP  G    C
Sbjct: 214 YTLKAGGLQLEKDYPYTGKDGKC 236


>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
          Length = 340

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 48/81 (59%), Gaps = 3/81 (3%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           +P GSCW F+ +  +EGI KIVT  LV +S Q+++DC     S  C GGF++  Y ++I 
Sbjct: 142 NPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDC---AVSNGCDGGFVDNAYDFIIS 198

Query: 62  NRGINTERDYPNVGVMDNCKV 82
           N G+ +E DYP      +C  
Sbjct: 199 NNGVASEADYPYQAYEGDCTA 219


>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
          Length = 359

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 49/76 (64%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S QQLVDCD +  +  C GG ++  + ++  N G
Sbjct: 151 GSCWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTK--NSGCNGGLMDYAFDFIKNNGG 208

Query: 65  INTERDYPNVGVMDNC 80
           +++E  YP +    +C
Sbjct: 209 LSSEDSYPYLAEQKSC 224


>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
          Length = 364

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 52/78 (66%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+KI T  LV +S Q+L+DCDN   ++ C GG ++  +Q++ +N G
Sbjct: 153 GSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCDGGLMDYAFQFIHKN-G 210

Query: 65  INTERDYPNVGVMDNCKV 82
           I TE +YP  G   +C +
Sbjct: 211 ITTESNYPYQGEQGSCDL 228


>gi|150261413|pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of A Plant
           Cysteine Protease Ervatamin-C Refinement With Cdna
           Derived Amino Acid Sequence
 gi|150261414|pdb|2PNS|B Chain B, 1.9 Angstrom Resolution Crystal Structure Of A Plant
           Cysteine Protease Ervatamin-C Refinement With Cdna
           Derived Amino Acid Sequence
 gi|166007115|pdb|2PRE|A Chain A, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
           Complexed With Irreversible Inhibitor E-64 At 2.7 A
           Resolution
 gi|166007116|pdb|2PRE|B Chain B, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
           Complexed With Irreversible Inhibitor E-64 At 2.7 A
           Resolution
          Length = 208

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 49/80 (61%), Gaps = 2/80 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I++I T NL+ +S QQLVDC+ +  +  C GG     YQY+I N G
Sbjct: 23  GSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKK--NHGCKGGAFVYAYQYIIDNGG 80

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           I+TE +YP   V   C+  +
Sbjct: 81  IDTEANYPYKAVQGPCRAAK 100


>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
 gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
          Length = 344

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 45/76 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC  +  +  C GG ++  +QY+  N+G
Sbjct: 149 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQKYGNNGCNGGMMDFAFQYIKDNKG 208

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+ YP   + D C
Sbjct: 209 IDTEKSYPYEAIDDEC 224


>gi|149751225|ref|XP_001490531.1| PREDICTED: cathepsin S-like [Equus caballus]
          Length = 332

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-RSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T NLV +S Q LVDC  +  S + C GGF+   +QY+I N 
Sbjct: 138 GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYIIDNN 197

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 198 GIDSDASYPYKAMDGKCR 215


>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
          Length = 342

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC   +  +R C GGF+   +QY+I N 
Sbjct: 148 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSVGKYSNRGCNGGFMTEAFQYIIDNN 207

Query: 64  GINTERDYPNVGVMDNCK 81
           GI +E  YP   +   C+
Sbjct: 208 GIESEASYPYKAMDGKCQ 225


>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
          Length = 334

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG +   T  LV +S Q LVDC     +  C GG ++  +QY+ +N G
Sbjct: 140 GSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCSGSYGNNGCEGGLMDNAFQYIKENHG 199

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE+ YP  G  + C+
Sbjct: 200 IDTEKSYPYEGEDETCR 216


>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
          Length = 331

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214


>gi|114559412|ref|XP_001171151.1| PREDICTED: cathepsin K isoform 4 [Pan troglodytes]
 gi|410221358|gb|JAA07898.1| cathepsin K [Pan troglodytes]
 gi|410248298|gb|JAA12116.1| cathepsin K [Pan troglodytes]
 gi|410301088|gb|JAA29144.1| cathepsin K [Pan troglodytes]
 gi|410351445|gb|JAA42326.1| cathepsin K [Pan troglodytes]
          Length = 329

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   ++YV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFEYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
          Length = 373

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/70 (47%), Positives = 47/70 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEG + + T NLV +S QQLVDC ++  + +C GG ++  ++YV  + G
Sbjct: 174 GSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNG 233

Query: 65  INTERDYPNV 74
           I+TE  YP V
Sbjct: 234 IDTEASYPYV 243


>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
 gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
          Length = 331

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214


>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
          Length = 475

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/68 (48%), Positives = 47/68 (69%), Gaps = 2/68 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA + A+EGI++IVT +L+ +S QQLVDC  +  +  C GG+    +QY+I N G
Sbjct: 165 GSCWAFAAIAAVEGINQIVTGDLISLSEQQLVDCSTR--NYGCEGGWPYRAFQYIINNGG 222

Query: 65  INTERDYP 72
           +N+E  YP
Sbjct: 223 VNSEEHYP 230


>gi|148224682|ref|NP_001086670.1| cathepsin S [Xenopus laevis]
 gi|50418223|gb|AAH77285.1| Ctss-prov protein [Xenopus laevis]
          Length = 320

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 48/76 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG   + T  +V +S Q LVDC ++  ++ C GGF+   +QYVI N G
Sbjct: 127 GSCWAFSAVGALEGQLMLKTGKIVSLSPQNLVDCSSKYGNKGCSGGFMTRAFQYVIDNNG 186

Query: 65  INTERDYPNVGVMDNC 80
           I+++  YP   + + C
Sbjct: 187 IDSDTYYPYHAMDEKC 202


>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++I T +L+ +S Q+LVDCD +  ++ C GG ++  ++++I N G
Sbjct: 164 GSCWAFSTIAAVEGINQIATGDLISLSEQELVDCD-KSYNQGCNGGLMDYAFEFIINNGG 222

Query: 65  INTERDYPNVGVMDNC 80
           I++E DYP       C
Sbjct: 223 IDSEEDYPYRAADTTC 238


>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
          Length = 379

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 48/71 (67%), Gaps = 2/71 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT +L+ +S QQLVDC     +  C GG++   +Q+++ N G
Sbjct: 164 GSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGG 221

Query: 65  INTERDYPNVG 75
           IN+E  YP  G
Sbjct: 222 INSEETYPYRG 232


>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
          Length = 364

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 52/84 (61%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  L  +S QQ+VDCD++    ESR+    C GG + T + 
Sbjct: 150 GSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFS 209

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++++ G+ +E+DYP  G  + CK
Sbjct: 210 YLMKSGGLQSEKDYPYAGRENTCK 233


>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
          Length = 385

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/70 (47%), Positives = 47/70 (67%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEG + + T NLV +S QQLVDC ++  + +C GG ++  ++YV  + G
Sbjct: 186 GSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNG 245

Query: 65  INTERDYPNV 74
           I+TE  YP V
Sbjct: 246 IDTEASYPYV 255


>gi|349604730|gb|AEQ00199.1| Cathepsin K-like protein, partial [Equus caballus]
          Length = 219

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC +  E+  C GG++   +QYV +NRG
Sbjct: 27  GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 84

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 85  IDSEDAYPYVGQDESC 100


>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 7/82 (8%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD--NQGESRSCVG-GFIETIYQYVIQN 62
           SCW F+ V A+EG++KIVT  L+ +S Q+LVDC+  N G    C G G ++T +Q++I N
Sbjct: 156 SCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNLVNNG----CYGSGLMDTAFQFLINN 211

Query: 63  RGINTERDYPNVGVMDNCKVFQ 84
            G+++E+DYP  G   +C   Q
Sbjct: 212 NGLDSEKDYPYQGTQGSCNRKQ 233


>gi|93279455|pdb|2F7D|A Chain A, A Mutant Rabbit Cathepsin K With A Nitrile Inhibitor
          Length = 215

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ VGA+EG  K  T  L+++S Q LVDC +  E+  C GG++   +QYV +NRG
Sbjct: 23 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQRNRG 80

Query: 65 INTERDYPNVGVMDNC 80
          I++E  YP VG  ++C
Sbjct: 81 IDSEDAYPYVGQDESC 96


>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 360

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 31/82 (37%), Positives = 51/82 (62%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GS W F+ + A+E I++IVT  L+ +S Q+L+DCD    +  C GG ++  ++++I N G
Sbjct: 156 GSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNA-GCDGGLMDDAFEFIISNGG 214

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+T+ DYP     D+C   + N
Sbjct: 215 IDTDEDYPYKARNDSCDANKRN 236


>gi|114559418|ref|XP_001171268.1| PREDICTED: cathepsin S isoform 3 [Pan troglodytes]
 gi|397492866|ref|XP_003817341.1| PREDICTED: cathepsin S isoform 1 [Pan paniscus]
 gi|410225070|gb|JAA09754.1| cathepsin S [Pan troglodytes]
 gi|410251608|gb|JAA13771.1| cathepsin S [Pan troglodytes]
 gi|410328325|gb|JAA33109.1| cathepsin S [Pan troglodytes]
 gi|410328327|gb|JAA33110.1| cathepsin S [Pan troglodytes]
          Length = 331

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP       C+
Sbjct: 197 GIDSDASYPYKATDQKCQ 214


>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
          Length = 381

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 48/71 (67%), Gaps = 2/71 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT +L+ +S QQLVDC     +  C GG++   +Q+++ N G
Sbjct: 166 GSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGG 223

Query: 65  INTERDYPNVG 75
           IN+E  YP  G
Sbjct: 224 INSEETYPYRG 234


>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
          Length = 367

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 53/76 (69%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+ V A+E I+KIVT +LV +S Q+LVDCD + +++ C GG     Y+++++N G
Sbjct: 143 GACWAFSAVAAVEAINKIVTGSLVSLSEQELVDCD-RTKNKGCNGGNQVNAYRFIVENGG 201

Query: 65  INTERDYPNVGVMDNC 80
           ++++ DYP +G    C
Sbjct: 202 LDSQIDYPYLGRQSTC 217


>gi|297684916|ref|XP_002820055.1| PREDICTED: cathepsin L2 isoform 3 [Pongo abelii]
          Length = 345

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC +   ++ C GGF++  +QYV +N G
Sbjct: 147 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMDKAFQYVKENGG 206

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V + + CK
Sbjct: 207 LDSEESYPYVAMDEICK 223


>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
 gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
 gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
          Length = 366

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 35/83 (42%), Positives = 50/83 (60%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
           GSCW F+  GA+EG + + T  LV +S QQLVDCD++    ++RS    C GG + + YQ
Sbjct: 161 GSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQ 220

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y +++ G+  E DYP  G    C
Sbjct: 221 YALKSGGLEKEEDYPYTGKDGTC 243


>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
 gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
 gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
 gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
 gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
          Length = 366

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 35/83 (42%), Positives = 50/83 (60%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
           GSCW F+  GA+EG + + T  LV +S QQLVDCD++    ++RS    C GG + + YQ
Sbjct: 161 GSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQ 220

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y +++ G+  E DYP  G    C
Sbjct: 221 YALKSGGLEKEEDYPYTGKDGTC 243


>gi|209731972|gb|ACI66855.1| Cathepsin H precursor [Salmo salar]
          Length = 328

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 43/77 (55%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G +E ++ I T  L+ +S QQLVDC     +  C GG     ++Y+  N+G
Sbjct: 132 GSCWTFSTTGCLESVTAIATGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAFEYIKFNKG 191

Query: 65  INTERDYPNVGVMDNCK 81
           I TE DYP     D CK
Sbjct: 192 IMTEDDYPYTAHDDTCK 208


>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
          Length = 466

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 31/68 (45%), Positives = 46/68 (67%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG S I T  L  +S Q LVDCD + ++  C GG ++  ++++++N G
Sbjct: 147 GSCWAFSTTGAVEGASAIATGKLASLSEQMLVDCDRERDN-GCHGGLMDFAFEFIMKNGG 205

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 206 IDTEDDYP 213


>gi|410968392|ref|XP_003990691.1| PREDICTED: cathepsin S, partial [Felis catus]
          Length = 310

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T NLV +S Q LVDC  +   ++ C GGF+   +QY+I N 
Sbjct: 149 GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNN 208

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +   C+
Sbjct: 209 GIDSEASYPYKAMDGKCQ 226


>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++I T +L+ +S Q+LVDCD +  ++ C GG ++  ++++I N G
Sbjct: 81  GSCWAFSTIAAVEGINQIATGDLISLSEQELVDCD-KSYNQGCNGGLMDYAFEFIINNGG 139

Query: 65  INTERDYPNVGVMDNC 80
           I++E DYP       C
Sbjct: 140 IDSEEDYPYRAADTTC 155


>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
          Length = 366

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 35/83 (42%), Positives = 50/83 (60%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
           GSCW F+  GA+EG + + T  LV +S QQLVDCD++    ++RS    C GG + + YQ
Sbjct: 161 GSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQ 220

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y +++ G+  E DYP  G    C
Sbjct: 221 YALKSGGLEKEEDYPYTGKDGTC 243


>gi|146386366|gb|ABQ23971.1| cathepsin K [Oryctolagus cuniculus]
          Length = 183

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ VGA+EG  K  T  L+++S Q LVDC +  E+  C GG++   +QYV +NRG
Sbjct: 2  GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENYGCGGGYMTNAFQYVQRNRG 59

Query: 65 INTERDYPNVGVMDNC 80
          I++E  YP VG  ++C
Sbjct: 60 IDSEDAYPYVGQDESC 75


>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 337

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI +I T+ L+ +S Q+LVDCD+      C GG++E  ++++I+N G
Sbjct: 143 GSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV--DHGCDGGYMEGGFEFIIKNGG 200

Query: 65  INTERDYPNVGVMDNC 80
           I++E +YP   V   C
Sbjct: 201 ISSEANYPYTAVDGTC 216


>gi|297684914|ref|XP_002820054.1| PREDICTED: cathepsin L2 isoform 2 [Pongo abelii]
          Length = 334

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 48/77 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC +   ++ C GGF++  +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMDKAFQYVKENGG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V + + CK
Sbjct: 196 LDSEESYPYVAMDEICK 212


>gi|139002720|dbj|BAF51966.1| cathepsin K [Carassius auratus]
 gi|139002725|dbj|BAF51967.1| tartrate-resistant acid phosphatase [Carassius auratus]
          Length = 332

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 37/78 (47%), Positives = 47/78 (60%), Gaps = 6/78 (7%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC--DNQGESRSCVGGFIETIYQYVIQN 62
           GSCW F+ VGA+EG  K     LVD+S Q LVDC  DN G    C GG++   ++YV  N
Sbjct: 139 GSCWAFSSVGALEGQLKKTKGQLVDLSPQNLVDCVTDNDG----CGGGYMTNAFRYVKDN 194

Query: 63  RGINTERDYPNVGVMDNC 80
           +GI++E  YP VG    C
Sbjct: 195 QGIDSEEGYPYVGTDQQC 212


>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
          Length = 367

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ--GESRS-----CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD++   E +S     C GG + T ++
Sbjct: 155 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEQKSECDAGCGGGLMTTAFE 214

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y ++  G+  E+DYP  G    C
Sbjct: 215 YTLKAGGLQREKDYPYTGRNGQC 237


>gi|30749499|pdb|1MS6|A Chain A, Dipeptide Nitrile Inhibitor Bound To Cathepsin S.
 gi|163310952|pdb|2R9M|A Chain A, Cathepsin S Complexed With Compound 15
 gi|163310953|pdb|2R9M|B Chain B, Cathepsin S Complexed With Compound 15
 gi|163310954|pdb|2R9N|A Chain A, Cathepsin S Complexed With Compound 26
 gi|163310955|pdb|2R9N|B Chain B, Cathepsin S Complexed With Compound 26
 gi|163310956|pdb|2R9O|A Chain A, Cathepsin S Complexed With Compound 8
 gi|163310957|pdb|2R9O|B Chain B, Cathepsin S Complexed With Compound 8
          Length = 222

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 23  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 82

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 83  GIDSDASYPYKAMDQKCQ 100


>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
          Length = 355

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD++ +          C GG + T ++
Sbjct: 154 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDSCDAGCSGGLMTTAFE 213

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y ++  G+  E+DYP  G    C
Sbjct: 214 YTLKAGGLQREKDYPYTGKXGKC 236


>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
 gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
          Length = 337

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI +I T+ L+ +S Q+LVDCD+      C GG++E  ++++I+N G
Sbjct: 143 GSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV--DHGCDGGYMEGGFEFIIKNGG 200

Query: 65  INTERDYPNVGVMDNC 80
           I++E +YP   V   C
Sbjct: 201 ISSEANYPYTAVDGTC 216


>gi|46576373|sp|P83654.1|ERVC_TABDI RecName: Full=Ervatamin-C; Short=ERV-C
 gi|46014979|pdb|1O0E|A Chain A, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
          Protease Ervatamin C
 gi|46014980|pdb|1O0E|B Chain B, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
          Protease Ervatamin C
          Length = 208

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V  +E I++I T NL+ +S Q+LVDCD   ++  C+GG     YQY+I N G
Sbjct: 23 GSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDK--KNHGCLGGAFVFAYQYIINNGG 80

Query: 65 INTERDYPNVGVMDNCKV 82
          I+T+ +YP   V   C+ 
Sbjct: 81 IDTQANYPYKAVQGPCQA 98


>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
          Length = 348

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 52/84 (61%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  L  +S QQ+VDCD++    ESR+    C GG + T + 
Sbjct: 134 GSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFS 193

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++++ G+ +E+DYP  G  + CK
Sbjct: 194 YLMKSGGLQSEKDYPYAGRENTCK 217


>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
          Length = 273

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 50/82 (60%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+ I TN LV +S Q+LVDCD   E++ C GG +   ++++ +  G
Sbjct: 61  GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTS-ENQGCNGGLMGYAFEFIKEKGG 119

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE+ YP       C V + N
Sbjct: 120 ITTEQSYPYTAEDGTCDVSKVN 141


>gi|291398027|ref|XP_002715626.1| PREDICTED: cathepsin S [Oryctolagus cuniculus]
          Length = 331

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T NLV +S Q LVDC   +  ++ C GGF+   +QY+I N 
Sbjct: 137 GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTTKYGNKGCNGGFMTEAFQYIIDNN 196

Query: 64  GINTERDYPNVGVMDNC 80
           GI++E  YP   +   C
Sbjct: 197 GIDSEASYPYKAMDQKC 213


>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
          Length = 347

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/90 (42%), Positives = 50/90 (55%), Gaps = 8/90 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN--------QGESRSCVGGFIETIY 56
           GSCW F+  G +EG   I    LV +S QQLVDCD+        Q     C GG + + +
Sbjct: 144 GSCWTFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGGLMWSAF 203

Query: 57  QYVIQNRGINTERDYPNVGVMDNCKVFQFN 86
           QYVI+N G++TE  YP  GV D C+  + N
Sbjct: 204 QYVIKNGGLDTEDSYPYEGVDDTCRFNKSN 233


>gi|432114312|gb|ELK36240.1| Aryl hydrocarbon receptor nuclear translocator [Myotis davidii]
          Length = 897

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 49/76 (64%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG     T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 705 GSCWAFSSVGALEGQLMKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQRNRG 762

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 763 IDSEDAYPYVGQDESC 778


>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
          Length = 346

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG ++I    L+ +S QQLVDCD       C GG ++T +++++   G
Sbjct: 152 GCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCSGGLMDTAFEHIMATGG 209

Query: 65  INTERDYPNVGVMDNCKV 82
           + TE +YP  G   NCK+
Sbjct: 210 LTTESNYPYKGEDANCKI 227


>gi|293334313|ref|NP_001170085.1| hypothetical protein [Zea mays]
 gi|224033359|gb|ACN35755.1| unknown [Zea mays]
 gi|414589091|tpg|DAA39662.1| TPA: hypothetical protein ZEAMMB73_231678 [Zea mays]
          Length = 385

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +EGI +I T  LV +S Q+LVDCD   +   C GG      +++  N G
Sbjct: 173 GSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDD--GCDGGISYRALRWIASNGG 230

Query: 65  INTERDYPNVGVMDNC 80
           I TE DYP  G  D C
Sbjct: 231 ITTEADYPYTGTTDAC 246


>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
           mansoni]
          Length = 1471

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 33/70 (47%), Positives = 43/70 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEG     TN LV++S QQLVDC     +  C GG + + ++YV  N G
Sbjct: 170 GSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRDNEG 229

Query: 65  INTERDYPNV 74
           I++E  YP V
Sbjct: 230 IDSEISYPYV 239


>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
          Length = 331

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 31/68 (45%), Positives = 45/68 (66%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S Q+L+DCD    S  C GG ++  + ++  N G
Sbjct: 135 GSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNS-GCNGGLMDYAFAFIASNGG 193

Query: 65  INTERDYP 72
           ++ E DYP
Sbjct: 194 LHKEDDYP 201


>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
          Length = 333

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T NLV +S Q LVDC     ++ C GG ++  +QYV  N+G
Sbjct: 136 GSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCSRPQGNQGCNGGLMDFAFQYVKDNKG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +  E+ YP VG    CK
Sbjct: 196 LEAEKSYPYVGKDGECK 212


>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 46/77 (59%), Gaps = 2/77 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI    LV +S QQL+DC    E+  C GG +   + Y+++N+G
Sbjct: 149 GCCWAFSAVAAVEGMTKIAKGELVSLSEQQLLDCST--ENDGCDGGIMWKAFDYIVENQG 206

Query: 65  INTERDYPNVGVMDNCK 81
           I  E +YP  G    C+
Sbjct: 207 ITAEDNYPYQGAQQTCE 223


>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
           [Brachypodium distachyon]
          Length = 334

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/82 (39%), Positives = 47/82 (57%), Gaps = 1/82 (1%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           H    CW F+   A+EGI +I T N V +S QQLVDC N    + C  G I+  Y+Y+ +
Sbjct: 133 HLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEK-CKAGEIDKAYEYIAR 191

Query: 62  NRGINTERDYPNVGVMDNCKVF 83
           + G+  ++DYP  G    C+V+
Sbjct: 192 SGGLVADQDYPYEGHSGTCRVY 213


>gi|315364646|pdb|3OVX|A Chain A, Cathepsin S In Complex With A Covalent Inhibitor With An
           Aldehyde Warhead
 gi|315364647|pdb|3OVX|B Chain B, Cathepsin S In Complex With A Covalent Inhibitor With An
           Aldehyde Warhead
          Length = 218

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 24  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 83

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 84  GIDSDASYPYKAMDQKCQ 101


>gi|300508731|pdb|3N3G|A Chain A, 4-(3-Trifluoromethylphenyl)-Pyrimidine-2-Carbonitrile As
           Cathepsin S Inhibitors: N3, Not N1 Is Critically
           Important
 gi|300508732|pdb|3N3G|B Chain B, 4-(3-Trifluoromethylphenyl)-Pyrimidine-2-Carbonitrile As
           Cathepsin S Inhibitors: N3, Not N1 Is Critically
           Important
 gi|327533626|pdb|3N4C|A Chain A, 6-Phenyl-1h-Imidazo[4,5-C]pyridine-4-Carbonitrile As
           Cathepsin S Inhibitors
 gi|327533627|pdb|3N4C|B Chain B, 6-Phenyl-1h-Imidazo[4,5-C]pyridine-4-Carbonitrile As
           Cathepsin S Inhibitors
          Length = 217

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 23  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 82

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 83  GIDSDASYPYKAMDQKCQ 100


>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
 gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
          Length = 339

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC  +  +  C GG ++  ++Y+  N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKDNGG 203

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+ YP  G+ D+C
Sbjct: 204 IDTEKSYPYEGIDDSC 219


>gi|93279887|pdb|2G6D|A Chain A, Human Cathepsin S Mutant With Vinyl Sulfone Inhibitor Cra-
           14009
          Length = 217

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 23  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 82

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 83  GIDSDASYPYKAMDQKCQ 100


>gi|114793879|pdb|2FYE|A Chain A, Mutant Human Cathepsin S With Irreversible Inhibitor Cra-
           14013
          Length = 217

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 23  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTKKYGNKGCNGGFMTTAFQYIIDNK 82

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 83  GIDSDASYPYKAMDQKCQ 100


>gi|328909405|gb|AEB61370.1| cathepsin S-like protein, partial [Equus caballus]
          Length = 281

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-RSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T NLV +S Q LVDC  +  S + C GGF+   +QY+I N 
Sbjct: 87  GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYIIDNN 146

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 147 GIDSDASYPYKAMDGKCR 164


>gi|145351119|ref|XP_001419933.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580166|gb|ABO98226.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 272

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 38/85 (44%), Positives = 49/85 (57%), Gaps = 9/85 (10%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD--------NQGESRSCVGGFIETIY 56
           GSCW F+  GAIEG   I T  LV++S QQLVDCD        N  +S  C GG      
Sbjct: 66  GSCWTFSTTGAIEGAHFISTGKLVELSEQQLVDCDVGCDPDVPNACDS-GCNGGLPSNAM 124

Query: 57  QYVIQNRGINTERDYPNVGVMDNCK 81
           +Y++++ GI+TE+ YP VG    CK
Sbjct: 125 EYIVEHGGIDTEKSYPYVGEKGECK 149


>gi|93279396|pdb|2F1G|A Chain A, Cathepsin S In Complex With Non-Covalent
           2-(Benzoxazol-2-Ylamino)- Acetamide
 gi|93279397|pdb|2F1G|B Chain B, Cathepsin S In Complex With Non-Covalent
           2-(Benzoxazol-2-Ylamino)- Acetamide
 gi|114794366|pdb|2HH5|B Chain B, Crystal Structure Of Cathepsin S In Complex With A Zinc
           Mediated Non-Covalent Arylaminoethyl Amide
 gi|114794367|pdb|2HH5|A Chain A, Crystal Structure Of Cathepsin S In Complex With A Zinc
           Mediated Non-Covalent Arylaminoethyl Amide
 gi|118137884|pdb|2H7J|A Chain A, Crystal Structure Of Cathepsin S In Complex With A
           Nonpeptidic Inhibitor.
 gi|118137885|pdb|2H7J|B Chain B, Crystal Structure Of Cathepsin S In Complex With A
           Nonpeptidic Inhibitor.
 gi|118138002|pdb|2HXZ|A Chain A, Crystal Structure Of Cathepsin S In Complex With A
           Nonpeptidic Inhibitor (hexagonal Spacegroup)
 gi|118138003|pdb|2HXZ|B Chain B, Crystal Structure Of Cathepsin S In Complex With A
           Nonpeptidic Inhibitor (hexagonal Spacegroup)
 gi|118138004|pdb|2HXZ|C Chain C, Crystal Structure Of Cathepsin S In Complex With A
           Nonpeptidic Inhibitor (hexagonal Spacegroup)
 gi|149241966|pdb|2HHN|A Chain A, Cathepsin S In Complex With Non Covalent Arylaminoethyl
           Amide.
 gi|149241967|pdb|2HHN|B Chain B, Cathepsin S In Complex With Non Covalent Arylaminoethyl
           Amide.
 gi|149242657|pdb|2OP3|A Chain A, The Structure Of Cathepsin S With A Novel 2-
           Arylphenoxyacetaldehyde Inhibitor Derived By The
           Substrate Activity Screening (Sas) Method
 gi|149242658|pdb|2OP3|B Chain B, The Structure Of Cathepsin S With A Novel 2-
           Arylphenoxyacetaldehyde Inhibitor Derived By The
           Substrate Activity Screening (Sas) Method
          Length = 220

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 26  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 85

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 86  GIDSDASYPYKAMDQKCQ 103


>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase
          GP-II
 gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
          Proline Specificity From Ginger Rhizome, Zingiber
          Officinale
 gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
          Proline Specificity From Ginger Rhizome, Zingiber
          Officinale
 gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
          Proline Specificity From Ginger Rhizome, Zingiber
          Officinale
 gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
          Proline Specificity From Ginger Rhizome, Zingiber
          Officinale
          Length = 221

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 48/71 (67%), Gaps = 2/71 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V A+EGI++IVT +L+ +S QQLVDC     +  C GG++   +Q+++ N G
Sbjct: 25 GSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGG 82

Query: 65 INTERDYPNVG 75
          IN+E  YP  G
Sbjct: 83 INSEETYPYRG 93


>gi|93279711|pdb|2FQ9|A Chain A, Cathepsin S With Nitrile Inhibitor
 gi|93279712|pdb|2FQ9|B Chain B, Cathepsin S With Nitrile Inhibitor
 gi|112490596|pdb|2FRA|A Chain A, Human Cathepsin S With Cra-27934, A Nitrile Inhibitor
 gi|112490597|pdb|2FRA|B Chain B, Human Cathepsin S With Cra-27934, A Nitrile Inhibitor
 gi|112490599|pdb|2FRQ|A Chain A, Human Cathepsin S With Inhibitor Cra-26871
 gi|112490600|pdb|2FRQ|B Chain B, Human Cathepsin S With Inhibitor Cra-26871
 gi|112490616|pdb|2FT2|A Chain A, Human Cathepsin S With Inhibitor Cra-29728
 gi|112490617|pdb|2FT2|B Chain B, Human Cathepsin S With Inhibitor Cra-29728
 gi|112490630|pdb|2FUD|A Chain A, Human Cathepsin S With Inhibitor Cra-27566
 gi|112490631|pdb|2FUD|B Chain B, Human Cathepsin S With Inhibitor Cra-27566
 gi|114793976|pdb|2G7Y|A Chain A, Human Cathepsin S With Inhibitor Cra-16981
 gi|114793977|pdb|2G7Y|B Chain B, Human Cathepsin S With Inhibitor Cra-16981
          Length = 225

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 24  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 83

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 84  GIDSDASYPYKAMDQKCQ 101


>gi|356984263|gb|AET43955.1| cathepsin L2, partial [Reishia clavigera]
          Length = 278

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 30/77 (38%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LV +S Q LVDC  +  +  C GG ++  ++Y+ +N+G
Sbjct: 107 GSCWAFSATGSLEGQHFKKTGTLVSLSEQNLVDCSKKEGNEGCEGGLMDQAFEYIKRNKG 166

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE+ YP   V + C+
Sbjct: 167 IDTEQSYPYRAVDEKCR 183


>gi|3087790|emb|CAA75029.1| cathepsin L2 [Homo sapiens]
          Length = 334

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC     ++ C GGF+   +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V V + CK
Sbjct: 196 LDSEESYPYVAVDEICK 212


>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
          Length = 352

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 48/77 (62%), Gaps = 3/77 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +  +EGI+KIVT NL+++S Q+LVDCD    S  C GG+  T  QYV  N G
Sbjct: 157 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYV-ANNG 213

Query: 65  INTERDYPNVGVMDNCK 81
           ++T + YP       C+
Sbjct: 214 VHTSKVYPYQAKQYKCR 230


>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
          Length = 331

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYP 72
           GI+++  YP
Sbjct: 197 GIDSDASYP 205


>gi|146386731|pdb|1VSN|A Chain A, Crystal Structure Of A Potent Small Molecule Inhibitor
          Bound To Cathepsin K
          Length = 215

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 35/76 (46%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ VGA+EG  K  T  L++++ Q LVDC +  E+  C GG++   +QYV +NRG
Sbjct: 23 GSCWAFSSVGALEGQLKKATGALLNLAPQNLVDCVS--ENDGCGGGYMTNAFQYVQRNRG 80

Query: 65 INTERDYPNVGVMDNC 80
          I++E  YP VG  ++C
Sbjct: 81 IDSEDAYPYVGQDESC 96


>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
          Length = 371

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/84 (38%), Positives = 49/84 (58%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  G +EG + + T  L+ ++ Q+LVDCD+  + +        C GG + T Y+
Sbjct: 161 GSCWSFSTTGTLEGTNFLATGELLSLNEQELVDCDHLCDPKKAGACDAGCNGGLMTTAYE 220

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           YV+Q+ G+  E+DYP  G    CK
Sbjct: 221 YVLQSGGLEKEKDYPYTGRDGTCK 244


>gi|23110960|ref|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens]
 gi|320118898|ref|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens]
 gi|12644075|sp|O60911.2|CATL2_HUMAN RecName: Full=Cathepsin L2; AltName: Full=Cathepsin U; AltName:
           Full=Cathepsin V; Flags: Precursor
 gi|3107915|dbj|BAA25909.1| cathepsin V [Homo sapiens]
 gi|3228672|gb|AAC23598.1| cathepsin U [Homo sapiens]
 gi|3869129|dbj|BAA34365.1| cathepsin L2 [Homo sapiens]
 gi|23958123|gb|AAH23504.1| CTSL2 protein [Homo sapiens]
 gi|37182404|gb|AAQ89004.1| cathepsin L2 [Homo sapiens]
 gi|83405150|gb|AAI10513.1| Cathepsin L2 [Homo sapiens]
 gi|119579235|gb|EAW58831.1| cathepsin L2, isoform CRA_a [Homo sapiens]
 gi|119579236|gb|EAW58832.1| cathepsin L2, isoform CRA_a [Homo sapiens]
          Length = 334

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC     ++ C GGF+   +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V V + CK
Sbjct: 196 LDSEESYPYVAVDEICK 212


>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
 gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
          Length = 362

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I T  L+ +S Q+LVDCD   E + C GG ++  +++ I+  G
Sbjct: 145 GSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSEDQGCQGGLMDDAFKF-IEQHG 203

Query: 65  INTERDYPNVGVMDNCK 81
           + +E  YP       CK
Sbjct: 204 LASEATYPYDAADSTCK 220


>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
           Short=PPII; Flags: Precursor
 gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
          Length = 352

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 48/77 (62%), Gaps = 3/77 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +  +EGI+KIVT NL+++S Q+LVDCD    S  C GG+  T  QYV  N G
Sbjct: 157 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYV-ANNG 213

Query: 65  INTERDYPNVGVMDNCK 81
           ++T + YP       C+
Sbjct: 214 VHTSKVYPYQAKQYKCR 230


>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
          Length = 363

 Score = 72.4 bits (176), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ--GESRS-----CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD++   E +S     C GG + T ++
Sbjct: 152 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEKSECDAGCNGGLMTTAFE 211

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y ++  G+  E+DYP  G    C
Sbjct: 212 YTLKAGGLQREKDYPYTGRDGKC 234


>gi|37905511|gb|AAO64477.1| cathepsin S precursor [Fundulus heteroclitus]
          Length = 337

 Score = 72.4 bits (176), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  L ++S Q LVDC  +  +  C GGF+   +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKKTGKLQNLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVIDNQG 203

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP  G    C+
Sbjct: 204 IDSEDSYPYRGRDQQCQ 220


>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
          Length = 588

 Score = 72.4 bits (176), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC +   ++ C GGF+   +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNNAFQYVKENGG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V    +CK
Sbjct: 196 LDSEASYPYVAKDGSCK 212


>gi|15826035|pdb|1FH0|A Chain A, Crystal Structure Of Human Cathepsin V Complexed With An
          Irreversible Vinyl Sulfone Inhibitor
 gi|15826036|pdb|1FH0|B Chain B, Crystal Structure Of Human Cathepsin V Complexed With An
          Irreversible Vinyl Sulfone Inhibitor
          Length = 221

 Score = 72.4 bits (176), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 46/77 (59%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+  GA+EG     T  LV +S Q LVDC     ++ C GGF+   +QYV +N G
Sbjct: 23 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 82

Query: 65 INTERDYPNVGVMDNCK 81
          +++E  YP V V + CK
Sbjct: 83 LDSEESYPYVAVDEICK 99


>gi|119573902|gb|EAW53517.1| cathepsin S, isoform CRA_a [Homo sapiens]
          Length = 220

 Score = 72.4 bits (176), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYP 72
           GI+++  YP
Sbjct: 197 GIDSDASYP 205


>gi|115457680|ref|NP_001052440.1| Os04g0311400 [Oryza sativa Japonica Group]
 gi|113564011|dbj|BAF14354.1| Os04g0311400, partial [Oryza sativa Japonica Group]
          Length = 384

 Score = 72.4 bits (176), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 52/84 (61%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  L  +S QQ+VDCD++    ESR+    C GG + T + 
Sbjct: 170 GSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFS 229

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++++ G+ +E+DYP  G  + CK
Sbjct: 230 YLMKSGGLQSEKDYPYAGRENTCK 253


>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
 gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
          Length = 354

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 44/76 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC  +  +  C GG ++  ++Y+ +N G
Sbjct: 159 GSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENHG 218

Query: 65  INTERDYPNVGVMDNC 80
           ++TE  YP VG    C
Sbjct: 219 VDTEDSYPYVGRETKC 234


>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
          Length = 355

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 44/76 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC  +  +  C GG ++  ++Y+ +N G
Sbjct: 160 GSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENHG 219

Query: 65  INTERDYPNVGVMDNC 80
           ++TE  YP VG    C
Sbjct: 220 VDTEDSYPYVGRETKC 235


>gi|30749675|pdb|1NPZ|A Chain A, Crystal Structures Of Cathepsin S Inhibitor Complexes
 gi|30749676|pdb|1NPZ|B Chain B, Crystal Structures Of Cathepsin S Inhibitor Complexes
 gi|30749688|pdb|1NQC|A Chain A, Crystal Structures Of Cathepsin S Inhibitor Complexes
          Length = 217

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 23  GACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 82

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 83  GIDSDASYPYKAMDQKCQ 100


>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
 gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
          Length = 343

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 46/76 (60%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + +IE    + T  LV +S QQL+DCD       C GG +ET +++V++N G
Sbjct: 149 GSCWAFSAIASIESAHFLATKELVSLSEQQLMDCDTV--DAGCDGGLMETAFKFVVKNGG 206

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP  G + +C
Sbjct: 207 VTTEASYPYTGSVGSC 222


>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
          Length = 359

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 47/68 (69%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I T  L+ +S Q+L+DCD   E+  C GG ++  + ++ +N G
Sbjct: 153 GSCWAFSTVAAVEGINQIKTKKLLSLSEQELIDCDTD-ENNGCNGGLMDYAFDFIKKNGG 211

Query: 65  INTERDYP 72
           I++E +YP
Sbjct: 212 ISSEAEYP 219


>gi|410904751|ref|XP_003965855.1| PREDICTED: cathepsin K-like [Takifugu rubripes]
          Length = 331

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 47/77 (61%), Gaps = 2/77 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG+    T  LVD+S Q LVDC    E+  C GG++   ++YV  NRG
Sbjct: 138 GSCWAFSSAGALEGMQAKKTGKLVDLSPQNLVDCVK--ENDGCGGGYMTNAFRYVATNRG 195

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP V    +C+
Sbjct: 196 IDSEASYPYVAQEQSCQ 212


>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
          Length = 466

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 32/68 (47%), Positives = 46/68 (67%), Gaps = 2/68 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA +  +EGI++IVT +L+ +S QQLVDC  +  +  C GG+    +QY+I N G
Sbjct: 156 GSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTR--NHGCEGGWPYRAFQYIINNGG 213

Query: 65  INTERDYP 72
           +N+E  YP
Sbjct: 214 VNSEEHYP 221


>gi|426331346|ref|XP_004026643.1| PREDICTED: cathepsin S [Gorilla gorilla gorilla]
          Length = 220

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYP 72
           GI+++  YP
Sbjct: 197 GIDSDASYP 205


>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
          Length = 380

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 46/78 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ +  +EGI+KIVT  L+ +S Q+L+DC     +R C G +I   + ++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGSYITDGFPFIINNGG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C V
Sbjct: 209 INTEENYPYTAQDGECNV 226


>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
          Length = 361

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 48/77 (62%), Gaps = 3/77 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +  +EGI+KIVT NL+++S Q+LVDCD    S  C GG+  T  QYV  N G
Sbjct: 157 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYV-ANNG 213

Query: 65  INTERDYPNVGVMDNCK 81
           ++T + YP       C+
Sbjct: 214 VHTSKVYPCQAKQYKCR 230


>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 308

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 30/68 (44%), Positives = 47/68 (69%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EGI++I T  L+ +S Q+L+DCD    +  C GG +   ++++I N G
Sbjct: 98  GSCWAFSAVGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGG 157

Query: 65  INTERDYP 72
           I +++DYP
Sbjct: 158 IESDQDYP 165


>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
          Length = 388

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/67 (47%), Positives = 46/67 (68%), Gaps = 1/67 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EGI+ I T  LV +S QQLVDCD++ +   C GG ++  + Y+ +N G
Sbjct: 154 GSCWAFSATGAVEGINAIRTGKLVSLSEQQLVDCDSE-KDLGCGGGLMDFAFDYITKNGG 212

Query: 65  INTERDY 71
           I++E DY
Sbjct: 213 IDSEDDY 219


>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
 gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|219884977|gb|ACL52863.1| unknown [Zea mays]
          Length = 377

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 46/76 (60%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S QQLVDC   G +  C GG ++  + ++    G
Sbjct: 180 GSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNN-GCSGGVMDNAFSFIATGAG 238

Query: 65  INTERDYPNVGVMDNC 80
           + +E  YP +    +C
Sbjct: 239 LRSEEAYPYLMEEGDC 254


>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
          Length = 377

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++EGI+ I T  LV +S Q+L+DCD   ++  C GG ++  ++Y+ +N G
Sbjct: 160 GSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDT-ADNDGCEGGLMDNAFEYIKKNGG 218

Query: 65  INTERDYPNVGVMDNCK 81
           + TE  YP       CK
Sbjct: 219 LTTEAAYPYRAANGTCK 235


>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
 gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
          Length = 340

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 46/76 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  L+ +S Q LVDC  +  +  C GG ++  ++Y+  N G
Sbjct: 145 GSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 204

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+ YP  G+ D+C
Sbjct: 205 IDTEKSYPYEGIDDSC 220


>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
          Length = 340

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC   +  ++ C GGF+   +QY+I N 
Sbjct: 146 GSCWAFSAVGALEAQLKLKTGKLVSLSVQNLVDCSTGKYSNKGCNGGFMTEAFQYIIDNN 205

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +   C+
Sbjct: 206 GIDSEASYPYKAMDGKCQ 223


>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
 gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
          Length = 327

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 44/77 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +EG     T  LV +S Q LVDC     ++ C GG ++  +QYVI+N G
Sbjct: 130 GSCWAFSAVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGG 189

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP   V   CK
Sbjct: 190 IDTEASYPYKAVDQKCK 206


>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
           [Glycine max]
          Length = 374

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
           GSCW F+  G+IEG + + T  LV +S QQL+DCDN+ E          C GG +   Y 
Sbjct: 157 GSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCEITEKTSCDNGCNGGLMTNAYN 216

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++++ G+  E  YP  G    CK
Sbjct: 217 YLLESGGLEEESSYPYTGERGECK 240


>gi|3929823|emb|CAA77184.1| cathepsin S [Mus musculus]
          Length = 163

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
          GSCW F+ VGA+EG  K+ T  L+ +S Q LVDC N+ +  ++ C GG++   +QY+I N
Sbjct: 2  GSCWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 61

Query: 63 RGINTERDYPNVGVMDNC 80
           GI  +  YP     + C
Sbjct: 62 GGIEADASYPYKATDEKC 79


>gi|297663703|ref|XP_002810310.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin S [Pongo abelii]
          Length = 330

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYP 72
           GI+++  YP
Sbjct: 197 GIDSDASYP 205


>gi|56758090|gb|AAW27185.1| SJCHGC06231 protein [Schistosoma japonicum]
          Length = 372

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/70 (45%), Positives = 44/70 (62%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEG     TN LV++S QQL+DC     +  C GG ++  +QYV  N+G
Sbjct: 172 GSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNKG 231

Query: 65  INTERDYPNV 74
           I++E  YP +
Sbjct: 232 IDSEISYPYI 241


>gi|359492709|ref|XP_002280798.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
 gi|147841854|emb|CAN73591.1| hypothetical protein VITISV_022889 [Vitis vinifera]
 gi|302142582|emb|CBI19785.3| unnamed protein product [Vitis vinifera]
          Length = 371

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 46/78 (58%), Gaps = 7/78 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+ +GA+EG   + T NLV +STQQL+DCD + +          C GG +   ++
Sbjct: 161 GSCWSFSTIGALEGAHFLATGNLVSLSTQQLLDCDTECDPEEYDACDDGCNGGLMNNAFE 220

Query: 58  YVIQNRGINTERDYPNVG 75
           Y+++  G+  E DYP  G
Sbjct: 221 YILKAGGVAQEEDYPYTG 238


>gi|300175245|emb|CBK20556.2| unnamed protein product [Blastocystis hominis]
          Length = 325

 Score = 72.0 bits (175), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 36/81 (44%), Positives = 46/81 (56%), Gaps = 3/81 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW FA V +IEG+    T  ++D S QQLVDCD    S  C GG +   Y+YV+ N G
Sbjct: 134 GACWAFAAVASIEGVYAQKTGKILDFSPQQLVDCDYS--SLGCSGGLMTYAYEYVMNN-G 190

Query: 65  INTERDYPNVGVMDNCKVFQF 85
           I+ E DYP      +CK   F
Sbjct: 191 ISLESDYPYKASQGSCKKVDF 211


>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 356

 Score = 72.0 bits (175), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 31/68 (45%), Positives = 44/68 (64%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+ IVT NL  +S Q+L+DC   G S  C GG ++  + Y+  + G
Sbjct: 157 GSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNS-GCNGGLMDYAFSYIASSGG 215

Query: 65  INTERDYP 72
           ++TE  YP
Sbjct: 216 LHTEEAYP 223


>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
 gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
          Length = 345

 Score = 72.0 bits (175), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG+    T  LV +S Q L+DC  +  +  C GG ++  +QYV  N G
Sbjct: 147 GSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTEEGNNGCNGGLMDQAFQYVRINGG 206

Query: 65  INTERDYPNVGVMDNCK 81
           I+TER YP  G  D C+
Sbjct: 207 IDTERSYPYEGNNDVCR 223


>gi|328870624|gb|EGG18997.1| cysteine proteinase [Dictyostelium fasciculatum]
          Length = 521

 Score = 72.0 bits (175), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 44/76 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G+ EG   + T NLV +S Q LVDC     +  C GG ++  + Y+I+N+G
Sbjct: 141 GSCWSFSTTGSTEGAHFLSTGNLVSLSEQNLVDCSGPEGNDGCNGGLMDQAFTYIIKNKG 200

Query: 65  INTERDYPNVGVMDNC 80
           I+TE  YP   V   C
Sbjct: 201 IDTESSYPYKAVQGKC 216


>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
 gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
 gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
 gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score = 72.0 bits (175), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 49/77 (63%), Gaps = 1/77 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V +IEGI +I T  LV +S Q+L+DC  +G S  C GG++E  ++++ +  G
Sbjct: 147 GSCWAFSTVASIEGIHQITTGELVSLSEQELIDC-VRGNSSGCSGGYLEDAFKFIAKKGG 205

Query: 65  INTERDYPNVGVMDNCK 81
           + +E +YP     + CK
Sbjct: 206 MASETNYPYKETDEKCK 222


>gi|359811751|emb|CCE67159.1| cysteine peptidase, partial [Vasconcellea quercifolia]
          Length = 211

 Score = 72.0 bits (175), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 50/80 (62%), Gaps = 3/80 (3%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V A+EGI+KIVT  L+ +S Q+L+DC+ +  S  C GGF     QYV QN G
Sbjct: 21 GSCWTFSSVAAVEGINKIVTGQLLWLSEQELLDCERR--SYGCRGGFPPYALQYVAQN-G 77

Query: 65 INTERDYPNVGVMDNCKVFQ 84
          I+  + YP  GV   C+  Q
Sbjct: 78 IHLRQYYPYEGVQRQCRASQ 97


>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
 gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
          Length = 340

 Score = 72.0 bits (175), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 34/75 (45%), Positives = 48/75 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EGI +I + NLV +S Q+LVD      +  C GG++   +++V++N G
Sbjct: 143 GSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVRSNWTNGCNGGYLIDAFEFVLENGG 202

Query: 65  INTERDYPNVGVMDN 79
           I TE  YP  GV  N
Sbjct: 203 IATEASYPYRGVKGN 217


>gi|1619903|gb|AAB16996.1| thiol protease isoform B, partial [Glycine max]
          Length = 319

 Score = 72.0 bits (175), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD+  +          C GG +   ++
Sbjct: 110 GSCWSFSTTGALEGAYYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE 169

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++Q+ G+  E+DYP  G    CK
Sbjct: 170 YILQSGGVQKEKDYPYTGRDGTCK 193


>gi|334324657|ref|XP_003340546.1| PREDICTED: cathepsin S-like isoform 2 [Monodelphis domestica]
          Length = 281

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC  ++ ++  C GGF+ + +QYVI N 
Sbjct: 87  GSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYVIDNN 146

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP       C+
Sbjct: 147 GIDSDVSYPYKATDGKCQ 164


>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
          Length = 352

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 48/77 (62%), Gaps = 3/77 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +  +EG++KIVT NL+++S Q+LVDCD    S  C GG+  T  QYV  N G
Sbjct: 157 GSCWAFSTIATVEGVNKIVTGNLLELSEQELVDCDKN--SHGCKGGYQTTSLQYVADN-G 213

Query: 65  INTERDYPNVGVMDNCK 81
           ++T + YP       C+
Sbjct: 214 VHTSKVYPYQAKAMQCR 230


>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
 gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
          Length = 374

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +EGI +I T  LV +S Q+LVDCD   +   C GG      +++  N G
Sbjct: 178 GSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDD--GCDGGISYRALRWIASNGG 235

Query: 65  INTERDYPNVGVMDNC 80
           I TE DYP  G  D C
Sbjct: 236 ITTETDYPYTGTTDAC 251


>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
 gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
          Length = 337

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 46/76 (60%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + +IE    + T  LV +S QQL+DCD       C GG +ET +++V++N G
Sbjct: 145 GSCWAFSAIASIESAHFLATKELVSLSEQQLMDCDTV--DAGCDGGLMETAFKFVVKNGG 202

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP  G + +C
Sbjct: 203 VTTEAAYPYTGSVGSC 218


>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 391

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 46/76 (60%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++IVT NL  +S QQLVDC   G +  C GG ++  + ++    G
Sbjct: 194 GSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNN-GCSGGVMDNAFSFIATGAG 252

Query: 65  INTERDYPNVGVMDNC 80
           + +E  YP +    +C
Sbjct: 253 LRSEEAYPYLMEEGDC 268


>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
          Length = 397

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 35/84 (41%), Positives = 46/84 (54%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GAIEG + I T  L+ +S QQLVDCD+  + +        C GG + T + 
Sbjct: 190 GSCWAFSTTGAIEGANFIATGKLLSLSEQQLVDCDHMCDLKEKDDCDDGCSGGLMTTAFN 249

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y+I+  GI  E  YP  G    CK
Sbjct: 250 YLIEAGGIEEEVTYPYTGKRGECK 273


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 44/77 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LV +S Q LVDC     ++ C GG ++  +QY+I N+G
Sbjct: 140 GSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKG 199

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP       CK
Sbjct: 200 IDTEASYPYTAKDGTCK 216


>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LVD+S QQLVDC     ++ C GG ++  +QY+  N G
Sbjct: 136 GSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYITANGG 195

Query: 65  INTERDYPNVGVMDN-CK 81
           ++TE  YP     D  CK
Sbjct: 196 LDTEESYPYTATDDEPCK 213


>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
          Length = 344

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 48/76 (63%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI +I T  L+ +S Q+LVDCD+      C GG +E  ++++I+N G
Sbjct: 149 GSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV--DHGCDGGLMEDGFEFIIKNGG 206

Query: 65  INTERDYPNVGVMDNC 80
           I++E +YP   V   C
Sbjct: 207 ISSEANYPYTAVDGTC 222


>gi|351629617|gb|AEQ54772.1| KDEL-tailed cysteine proteinase CP4, partial [Coffea canephora]
          Length = 215

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 49/82 (59%), Gaps = 2/82 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V  +EGI+KI T  LV +S Q+LVDC+   E   C GG +E  Y+++ ++ G
Sbjct: 4  GSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETDNE--GCNGGLMENAYEFIKKSGG 61

Query: 65 INTERDYPNVGVMDNCKVFQFN 86
          I TER YP      +C   + N
Sbjct: 62 ITTERLYPYKARDGSCDSSKMN 83


>gi|344275472|ref|XP_003409536.1| PREDICTED: cathepsin S-like isoform 2 [Loxodonta africana]
          Length = 281

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-RSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +  S + C GGF+   +QY+I N 
Sbjct: 87  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTRAFQYIIDNN 146

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP       C+
Sbjct: 147 GIDSEASYPYKATDGKCQ 164


>gi|281208825|gb|EFA83000.1| cysteine proteinase [Polysphondylium pallidum PN500]
          Length = 531

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 45/68 (66%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G+IEG+ ++ T NLV +S Q L+DC     ++ C GG +   ++YVI+N G
Sbjct: 116 GSCWSFSTTGSIEGVHELQTGNLVALSEQNLIDCSVAEGNQGCNGGLMPNAFEYVIKNGG 175

Query: 65  INTERDYP 72
           I+TE  YP
Sbjct: 176 IDTEASYP 183


>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
 gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
          Length = 374

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +EGI +I T  LV +S Q+LVDCD   +   C GG      +++  N G
Sbjct: 178 GSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDD--GCDGGISYRALRWIASNGG 235

Query: 65  INTERDYPNVGVMDNC 80
           I TE DYP  G  D C
Sbjct: 236 ITTEADYPYTGTTDAC 251


>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
           Precursor
 gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
          Length = 351

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           +P GSCW FA +  +EGI KI T  LV +S Q+++DC     S  C GG++   Y ++I 
Sbjct: 142 NPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDC---AVSYGCKGGWVNKAYDFIIS 198

Query: 62  NRGINTERDYPNVGVMDNCKVFQF 85
           N G+ TE +YP +     C    F
Sbjct: 199 NNGVTTEENYPYLAYQGTCNANSF 222


>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
 gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
          Length = 229

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++I TN LV +S Q+LVDCD   +++ C GG ++  ++++ Q  G
Sbjct: 24  GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGG 82

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP       C V + N
Sbjct: 83  ITTEANYPYEAYDGTCDVSKEN 104


>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP2-like [Glycine max]
          Length = 342

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 31/71 (43%), Positives = 47/71 (66%), Gaps = 1/71 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I+KI T  LV +S QQL+DCDN+  +  C GG +ET + ++ +  G
Sbjct: 147 GSCWSFSAVATVEDINKIKTGKLVSLSEQQLIDCDNRNGNEGCNGGHMET-FTFITKRGG 205

Query: 65  INTERDYPNVG 75
           + T+++YP  G
Sbjct: 206 LTTDKNYPYQG 216


>gi|67968401|dbj|BAE00562.1| unnamed protein product [Macaca fascicularis]
          Length = 433

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC +   ++ C GGF+ + ++YV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSAFRYVKENGG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V +   CK
Sbjct: 196 LDSEESYPYVAMDGICK 212


>gi|444515096|gb|ELV10758.1| Cathepsin S [Tupaia chinensis]
          Length = 240

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC   Q  ++ C GGF+   +QY+I N 
Sbjct: 46  GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCATIQYGNKGCNGGFMTRAFQYIIDNN 105

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP     + C+
Sbjct: 106 GIDSEASYPYKATDEKCQ 123


>gi|31982433|ref|NP_031828.2| cathepsin K precursor [Mus musculus]
 gi|12644320|sp|P55097.2|CATK_MOUSE RecName: Full=Cathepsin K; Flags: Precursor
 gi|3550487|emb|CAA06825.1| cathepsin K [Mus musculus]
 gi|12834090|dbj|BAB22783.1| unnamed protein product [Mus musculus]
 gi|28277388|gb|AAH46320.1| Cathepsin K [Mus musculus]
 gi|74209960|dbj|BAE21279.1| unnamed protein product [Mus musculus]
 gi|148706870|gb|EDL38817.1| cathepsin K, isoform CRA_a [Mus musculus]
          Length = 329

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 48/76 (63%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG  K  T  L+ +S Q LVDC    E+  C GG++ T +QYV QN G
Sbjct: 137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVT--ENYGCGGGYMTTAFQYVQQNGG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210


>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 36/84 (42%), Positives = 45/84 (53%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   + +  LV +S QQLVDCD+Q +          C GGF+   YQ
Sbjct: 161 GSCWAFSTTGAVEGAHFLNSGKLVSLSEQQLVDCDHQCDREEADACDAGCNGGFMTNAYQ 220

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           YV    G+  E DYP  G    CK
Sbjct: 221 YVEAAGGLELESDYPYEGRDGKCK 244


>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 342

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 48/76 (63%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI +I T NLV +S Q+LVDCD+      C GG +E  ++++I+N G
Sbjct: 148 GICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSV--DHGCDGGLMEHGFEFIIKNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           I++E +YP   V   C
Sbjct: 206 ISSEANYPYTAVNGTC 221


>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
          Length = 331

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-RSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +  S + C GGF+   +QY+I N 
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTRAFQYIIDNN 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP       C+
Sbjct: 197 GIDSEASYPYKATDGKCQ 214


>gi|225706914|gb|ACO09303.1| Cathepsin H precursor [Osmerus mordax]
          Length = 328

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 31/79 (39%), Positives = 43/79 (54%)

Query: 3   PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
           P GSCW F+  G +E ++ I T  L+ +S QQLVDC     +  C GG     ++Y+  N
Sbjct: 132 PCGSCWTFSTTGCLESVTAISTGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAFEYIKYN 191

Query: 63  RGINTERDYPNVGVMDNCK 81
           +G+ TE DYP       CK
Sbjct: 192 KGLMTEDDYPYTAQDGTCK 210


>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
          Length = 347

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG +KI    L+ +S QQLVDCD       C GG ++T +++++   G
Sbjct: 153 GCCWAFSAVAAIEGATKIKKGKLISLSEQQLVDCDTN--DFGCSGGLMDTAFEHIMATGG 210

Query: 65  INTERDYPNVGVMDNCKV 82
           + TE +YP  G    CK+
Sbjct: 211 LTTESNYPYKGKDATCKI 228


>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
 gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
           mansoni]
          Length = 370

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 33/70 (47%), Positives = 43/70 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEG     TN LV++S QQLVDC     +  C GG + + ++YV  N G
Sbjct: 170 GSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRDNEG 229

Query: 65  INTERDYPNV 74
           I++E  YP V
Sbjct: 230 IDSEISYPYV 239


>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
          Length = 352

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 43/76 (56%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC  +  +  C GG ++  ++Y+  N G
Sbjct: 157 GSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHG 216

Query: 65  INTERDYPNVGVMDNC 80
           I+TE  YP VG    C
Sbjct: 217 IDTEEGYPYVGKEMRC 232


>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
          Length = 347

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 43/76 (56%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC  +  +  C GG ++  ++Y+  N G
Sbjct: 152 GSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHG 211

Query: 65  INTERDYPNVGVMDNC 80
           I+TE  YP VG    C
Sbjct: 212 IDTEEGYPYVGKEMRC 227


>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
          Length = 347

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 43/76 (56%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC  +  +  C GG ++  ++Y+  N G
Sbjct: 152 GSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHG 211

Query: 65  INTERDYPNVGVMDNC 80
           I+TE  YP VG    C
Sbjct: 212 IDTEEGYPYVGKEMRC 227


>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 30/76 (39%), Positives = 43/76 (56%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     +  +V +S Q LVDC  +  +  C GG ++  ++Y+  N G
Sbjct: 159 GSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHG 218

Query: 65  INTERDYPNVGVMDNC 80
           I+TE  YP VG    C
Sbjct: 219 IDTEESYPYVGRETKC 234


>gi|229596403|ref|XP_001009843.3| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|225565321|gb|EAR89598.3| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 324

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 44/76 (57%), Gaps = 3/76 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VG +EG   I T NL   S QQ+VDC     +  C GG +   Y+YV+QN G
Sbjct: 137 GSCWAFSTVGGLEGAYAIATGNLTSFSEQQIVDCSK--ANAGCNGGDLPPAYKYVVQN-G 193

Query: 65  INTERDYPNVGVMDNC 80
           I TE DYP  GV   C
Sbjct: 194 IETEADYPYKGVNQKC 209


>gi|13928758|ref|NP_113748.1| cathepsin K precursor [Rattus norvegicus]
 gi|12585195|sp|O35186.1|CATK_RAT RecName: Full=Cathepsin K; Flags: Precursor
 gi|2305208|gb|AAB65743.1| cathepsin K [Rattus norvegicus]
 gi|50927597|gb|AAH78793.1| Cathepsin K [Rattus norvegicus]
 gi|149030667|gb|EDL85704.1| cathepsin K, isoform CRA_a [Rattus norvegicus]
          Length = 329

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 48/76 (63%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG  K  T  L+ +S Q LVDC    E+  C GG++ T +QYV QN G
Sbjct: 137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDC--VSENYGCGGGYMTTAFQYVQQNGG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210


>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LVD+S QQLVDC     ++ C GG ++  +QY+  N G
Sbjct: 136 GSCWAFSTTGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 195

Query: 65  INTERDYPNVGVMDN-CK 81
           ++TE  YP     D  CK
Sbjct: 196 LDTEESYPYTATDDKPCK 213


>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
          Length = 331

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC  ++ ++  C GGF+ + +QYVI N 
Sbjct: 137 GSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYVIDNN 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP       C+
Sbjct: 197 GIDSDVSYPYKATDGKCQ 214


>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
          Length = 381

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 33/84 (39%), Positives = 48/84 (57%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN------QGE-SRSCVGGFIETIYQ 57
           GSCW F+  GA+EG + + T  LVD+S QQLVDCD+      Q E +  C GG +   Y 
Sbjct: 172 GSCWAFSTTGAVEGANFLATGELVDLSEQQLVDCDHTCSAVAQNECNNGCAGGLMTNAYS 231

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++++ G+  +  YP  G    C+
Sbjct: 232 YLMESGGLMEQSAYPYTGAAGPCR 255


>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 343

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 49/76 (64%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+ V A EGI +I T NLV +S ++LVDCD+      C GG +E  ++++I+N G
Sbjct: 148 GNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSV--DHGCDGGLMEHGFEFIIKNGG 205

Query: 65  INTERDYPNVGVMDNC 80
           I++E +YP   V   C
Sbjct: 206 ISSEANYPYTAVNGTC 221


>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
          Length = 363

 Score = 71.2 bits (173), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 30/71 (42%), Positives = 44/71 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+K+    LV +S Q+LVDCD  G  + C GG +E  +Q++ + +G
Sbjct: 164 GCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFIEKRKG 223

Query: 65  INTERDYPNVG 75
           +  E  YP  G
Sbjct: 224 LAAESVYPYTG 234


>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
          Length = 318

 Score = 71.2 bits (173), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 49/80 (61%), Gaps = 3/80 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT  LV +S Q+L+DC+ +  S  C GGF     QYV  N G
Sbjct: 157 GSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR--SYGCRGGFPPYALQYV-ANSG 213

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           I+  + YP  GV   C+  Q
Sbjct: 214 IHLRQYYPYEGVQRQCRAAQ 233


>gi|330842502|ref|XP_003293216.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
 gi|325076482|gb|EGC30264.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
          Length = 376

 Score = 71.2 bits (173), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 43/68 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG  +I T  LV +S Q LVDC     +  C GG ++  + Y+IQN+G
Sbjct: 144 GSCWSFSTTGSVEGAHQIKTGKLVSLSEQNLVDCSGAEGNLGCDGGLMDNAFIYIIQNKG 203

Query: 65  INTERDYP 72
           I+TE  YP
Sbjct: 204 IDTESSYP 211


>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 291

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 44/68 (64%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I T NL  +S QQLVDCD +  +  C GG ++  +QY+ ++ G
Sbjct: 83  GSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCDTKSNA-GCNGGLMDYAFQYIAKHGG 141

Query: 65  INTERDYP 72
           +  E  YP
Sbjct: 142 VAAEDAYP 149


>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
 gi|255639509|gb|ACU20049.1| unknown [Glycine max]
          Length = 366

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 46/78 (58%), Gaps = 7/78 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+ VGA+EG   + T  LV +S QQLVDCD++ +          C GG + T ++
Sbjct: 156 GSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDHECDPEERGACDSGCNGGLMTTAFE 215

Query: 58  YVIQNRGINTERDYPNVG 75
           Y +Q  G+  E+DYP  G
Sbjct: 216 YTLQAGGLMREKDYPYTG 233


>gi|348513249|ref|XP_003444155.1| PREDICTED: cathepsin K-like [Oreochromis niloticus]
          Length = 330

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 43/68 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EG  K  T  LV +S Q LVDC  Q  +  C GG+I   Y YVI+N G
Sbjct: 136 GSCWAFSSLGALEGQLKKRTGTLVSLSPQNLVDCSTQDGNLGCRGGYITKAYSYVIRNGG 195

Query: 65  INTERDYP 72
           +++E  YP
Sbjct: 196 VDSESFYP 203


>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 30/76 (39%), Positives = 43/76 (56%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     +  +V +S Q LVDC  +  +  C GG ++  ++Y+  N G
Sbjct: 159 GSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHG 218

Query: 65  INTERDYPNVGVMDNC 80
           I+TE  YP VG    C
Sbjct: 219 IDTEESYPYVGRETKC 234


>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
 gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
          Length = 341

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     +  LV +S Q L+DC  Q  +  C GG ++  ++Y+  N G
Sbjct: 146 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGG 205

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE+ YP  GV D C+
Sbjct: 206 IDTEQTYPYEGVDDKCR 222


>gi|27462834|gb|AAO15606.1| cathepsin L-like protease [Sarcoptes scabiei type hominis]
          Length = 245

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 53/82 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++E  + + T  LV++S Q+LVDC     +  C GG++++ +++VI+  G
Sbjct: 142 GSCWAFSAVASMESQNALKTGQLVELSEQELVDCSVGEGNEGCDGGWMDSAFEFVIKADG 201

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE+ YP  GV   C+ +Q N
Sbjct: 202 IDTEKSYPYHGVNQVCRSYQKN 223


>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
          Length = 318

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 49/80 (61%), Gaps = 3/80 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT  LV +S Q+L+DC+ +  S  C GGF     QYV  N G
Sbjct: 157 GSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR--SYGCRGGFPPYALQYV-ANSG 213

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           I+  + YP  GV   C+  Q
Sbjct: 214 IHLRQYYPYEGVQRQCRAAQ 233


>gi|308808478|ref|XP_003081549.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
 gi|116060014|emb|CAL56073.1| Cysteine proteinase Cathepsin F (ISS), partial [Ostreococcus tauri]
          Length = 293

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 37/85 (43%), Positives = 49/85 (57%), Gaps = 9/85 (10%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD--------NQGESRSCVGGFIETIY 56
           GSCW F+  GAIEG   I T  LV++S QQL+DCD        N  +S  C GG      
Sbjct: 87  GSCWTFSTTGAIEGAHFISTGKLVELSEQQLLDCDVGCDPDVPNACDS-GCNGGLPSNAM 145

Query: 57  QYVIQNRGINTERDYPNVGVMDNCK 81
           +Y++++ GI+TE+ YP VG    CK
Sbjct: 146 EYIVEHGGIDTEKSYPYVGEKGECK 170


>gi|4469157|emb|CAB38316.1| chymopapain isoform IV [Carica papaya]
          Length = 226

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 48/78 (61%), Gaps = 3/78 (3%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ +  +EGI+KIVT NL+++S Q+LVDCD    S  C GG+  T  QYV  N G
Sbjct: 22 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDRH--SYGCKGGYQTTSLQYVANN-G 78

Query: 65 INTERDYPNVGVMDNCKV 82
          ++T + YP       C+ 
Sbjct: 79 VHTSKVYPYQAKQYKCRA 96


>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 381

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 35/84 (41%), Positives = 49/84 (58%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
           GSCW F+  GA+EG + + T  L  +S QQLVDCD++    E R+    C GG + T + 
Sbjct: 169 GSCWSFSTSGALEGANYLATGKLEVLSEQQLVDCDHECDPSEPRACDAGCNGGLMTTAFS 228

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y+ +  G+ TE+DYP  G    CK
Sbjct: 229 YLAKAGGLETEKDYPYTGRNSACK 252


>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
 gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
 gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
 gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
          Length = 331

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC   +  ++ C GGF+   +QY+I N 
Sbjct: 137 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +   C+
Sbjct: 197 GIDSEASYPYKAMDGKCQ 214


>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 332

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 31/81 (38%), Positives = 49/81 (60%), Gaps = 1/81 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQ-LVDCDNQGESRSCVGGFIETIYQYVIQNR 63
           G  W  + V A EGI  +    L+ +S++Q LVDCD +G  + C GG ++  ++++IQN 
Sbjct: 133 GCFWALSAVAATEGIHALXAGKLILLSSEQELVDCDTKGVDQDCQGGLMDDAFKFIIQNH 192

Query: 64  GINTERDYPNVGVMDNCKVFQ 84
           G+NTE +YP  GV   C  ++
Sbjct: 193 GLNTEANYPYKGVDGKCNAYE 213


>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 351

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 31/68 (45%), Positives = 44/68 (64%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+ IVT NL  +S Q+L+DC   G S  C GG ++  + Y+  + G
Sbjct: 152 GSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNS-GCNGGMMDYAFSYIASSGG 210

Query: 65  INTERDYP 72
           ++TE  YP
Sbjct: 211 LHTEEAYP 218


>gi|397499865|ref|XP_003820654.1| PREDICTED: cathepsin L2 isoform 1 [Pan paniscus]
 gi|397499867|ref|XP_003820655.1| PREDICTED: cathepsin L2 isoform 2 [Pan paniscus]
          Length = 334

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC     ++ C GGF+   +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V + + CK
Sbjct: 196 LDSEESYPYVAMDEICK 212


>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
          Length = 443

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 29/76 (38%), Positives = 44/76 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V ++EG+ K+ T  LV +S Q+LVDCD  G  + C GG ++  + +++ N G
Sbjct: 156 GCCWAFSAVASMEGVVKLSTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGNGG 215

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP       C
Sbjct: 216 LTTESRYPYTASDGTC 231


>gi|19195|emb|CAA78403.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
          Length = 361

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD++ +          C GG + T ++
Sbjct: 150 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPVEKNDCDAGCNGGLMTTAFE 209

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y ++  G+  E+DYP  G    C
Sbjct: 210 YTLKAGGLQLEKDYPYTGRNGKC 232


>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
          Length = 342

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC   +  ++ C GGF+   +QY+I N 
Sbjct: 148 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 207

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +   C+
Sbjct: 208 GIDSEASYPYKAMDGKCQ 225


>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD++ +          C GG + T ++
Sbjct: 152 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFE 211

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y ++  G+  E+DYP  G    C
Sbjct: 212 YTLKAGGLQLEKDYPYTGKDGKC 234


>gi|222637029|gb|EEE67161.1| hypothetical protein OsJ_24244 [Oryza sativa Japonica Group]
          Length = 309

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG + + T NL+D+S QQLVDCD+  ++         C GG +   Y 
Sbjct: 160 GSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTECDSGCGGGLMTNAYA 219

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++ + G+  +  YP  G    C+
Sbjct: 220 YLMSSGGLMEQSAYPYTGAQGTCR 243


>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD++ +          C GG + T ++
Sbjct: 152 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFE 211

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y ++  G+  E+DYP  G    C
Sbjct: 212 YTLKAGGLQLEKDYPYTGKDGKC 234


>gi|115472081|ref|NP_001059639.1| Os07g0480900 [Oryza sativa Japonica Group]
 gi|27261016|dbj|BAC45132.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113611175|dbj|BAF21553.1| Os07g0480900 [Oryza sativa Japonica Group]
 gi|215693312|dbj|BAG88694.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 376

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG + + T NL+D+S QQLVDCD+  ++         C GG +   Y 
Sbjct: 160 GSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTECDSGCGGGLMTNAYA 219

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++ + G+  +  YP  G    C+
Sbjct: 220 YLMSSGGLMEQSAYPYTGAQGTCR 243


>gi|357446993|ref|XP_003593772.1| Cysteine proteinase [Medicago truncatula]
 gi|355482820|gb|AES64023.1| Cysteine proteinase [Medicago truncatula]
          Length = 339

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 49/80 (61%), Gaps = 2/80 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGAIEGI+ I T  L+++S Q+L+DCD    S  C  G++   + +VI+N+G
Sbjct: 128 GSCWAFSAVGAIEGINAITTGKLINLSEQELLDCD--PISGGCNSGWVNKAFDWVIRNKG 185

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           +  + DYP       CK  Q
Sbjct: 186 VALDNDYPYTAEKGVCKASQ 205


>gi|384247445|gb|EIE20932.1| hypothetical protein COCSUDRAFT_18161 [Coccomyxa subellipsoidea
           C-169]
          Length = 387

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 47/67 (70%), Gaps = 1/67 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG + + T +LV +S QQLVDCD + + + C GG ++  + Y+I+N G
Sbjct: 138 GSCWAFSTTGSVEGANFLATGDLVSLSEQQLVDCDTK-KDQGCGGGLMDYAFDYIIKNGG 196

Query: 65  INTERDY 71
           ++TE DY
Sbjct: 197 LDTEEDY 203


>gi|52546912|gb|AAU81589.1| cysteine proteinase [Petunia x hybrida]
          Length = 257

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 47/83 (56%), Gaps = 7/83 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESR-------SCVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  LV +S QQLVDCD++ ++         C GG + T ++
Sbjct: 46  GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEQQNECDAGCGGGLMTTAFE 105

Query: 58  YVIQNRGINTERDYPNVGVMDNC 80
           Y ++  G+  E+DYP  G    C
Sbjct: 106 YTLKAGGLQREKDYPYTGRDGKC 128


>gi|52546920|gb|AAU81593.1| cysteine proteinase [Petunia x hybrida]
          Length = 210

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 46/68 (67%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I T NL  +S Q+L+DCD    +  C GG ++  +Q++I N G
Sbjct: 107 GSCWAFSTVAAVEGINQIKTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFQFIISNGG 165

Query: 65  INTERDYP 72
           ++ E DYP
Sbjct: 166 LHKEDDYP 173


>gi|281206749|gb|EFA80934.1| counting factor associated protein [Polysphondylium pallidum PN500]
          Length = 530

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F   G++EG+S + T  LV +S QQLVDC   G+S+ C GGF    +QY++   G
Sbjct: 333 GSCWTFGSTGSLEGVSCLATGKLVSLSEQQLVDCAYLGQSQGCNGGFASDAFQYIMNFGG 392

Query: 65  INTERDYPNVGVMDNCK 81
           I  E  YP +     CK
Sbjct: 393 IAYESTYPYLMQNGYCK 409


>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LVD+S QQLVDC     ++ C GG ++  +QY+  N G
Sbjct: 136 GSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 195

Query: 65  INTERDYPNVGVMDN-CK 81
           ++TE  YP     D  CK
Sbjct: 196 LDTEESYPYTATDDKPCK 213


>gi|47213724|emb|CAF95155.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 336

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 47/77 (61%), Gaps = 2/77 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG+    T  LVD+S Q LVDC    E+  C GG++   ++YV  N+G
Sbjct: 143 GSCWAFSSAGALEGMLAKKTGKLVDLSPQNLVDCVK--ENSGCGGGYMTNAFKYVATNKG 200

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP VG    C+
Sbjct: 201 LDSEAAYPYVGQEQPCQ 217


>gi|114625736|ref|XP_001153919.1| PREDICTED: cathepsin L2 isoform 2 [Pan troglodytes]
 gi|114625742|ref|XP_520130.2| PREDICTED: cathepsin L2 isoform 5 [Pan troglodytes]
          Length = 334

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC     ++ C GGF+   +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V + + CK
Sbjct: 196 LDSEESYPYVAMDEICK 212


>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
 gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
          Length = 300

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + +IE    + T  LV +S QQL+DCD     + C GGF E  +++V++N G
Sbjct: 110 GSCWAFSAIASIESAHFLATKELVSLSEQQLIDCDTV--DQGCQGGFPEDAFKFVVENGG 167

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP  G   +C
Sbjct: 168 VTTEEAYPYTGFAGSC 183


>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LVD+S QQLVDC     ++ C GG ++  +QY+  N G
Sbjct: 136 GSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 195

Query: 65  INTERDYPNVGVMDN-CK 81
           ++TE  YP     D  CK
Sbjct: 196 LDTEESYPYTATDDKPCK 213


>gi|348586359|ref|XP_003478936.1| PREDICTED: cathepsin S-like [Cavia porcellus]
          Length = 344

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 31/69 (44%), Positives = 44/69 (63%), Gaps = 1/69 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+   +QY+I N 
Sbjct: 200 GACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNN 259

Query: 64  GINTERDYP 72
           GI++E  YP
Sbjct: 260 GIDSETSYP 268


>gi|426362423|ref|XP_004048364.1| PREDICTED: cathepsin L2 isoform 1 [Gorilla gorilla gorilla]
 gi|426362425|ref|XP_004048365.1| PREDICTED: cathepsin L2 isoform 2 [Gorilla gorilla gorilla]
          Length = 334

 Score = 70.9 bits (172), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC     ++ C GGF+   +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V + + CK
Sbjct: 196 LDSEESYPYVAMDEICK 212


>gi|356557734|ref|XP_003547166.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
          Length = 369

 Score = 70.9 bits (172), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 45/78 (57%), Gaps = 3/78 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEG S + T  L+ +S Q+L+DC     S  C GG+I+    +VI NRG
Sbjct: 162 GSCWAFSATGAIEGASALATGKLISVSEQELLDC---AYSFGCGGGWIDKALDWVIGNRG 218

Query: 65  INTERDYPNVGVMDNCKV 82
           I +E DYP       C+ 
Sbjct: 219 IASEIDYPYTARKGTCRA 236


>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
 gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
          Length = 300

 Score = 70.9 bits (172), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + +IE    + T  LV +S QQL+DCD     + C GGF E  +++V++N G
Sbjct: 110 GSCWAFSAIASIESAHFLATKELVSLSEQQLIDCDTV--DQGCQGGFPEDAFKFVVENGG 167

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP  G   +C
Sbjct: 168 VTTEEAYPYTGFAGSC 183


>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
 gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
           proteinase II; Flags: Precursor
 gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
          Length = 337

 Score = 70.9 bits (172), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 30/78 (38%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSC+ F+  G++EG++ I T  LV +S Q ++DC +   +  C GG +   ++Y+I+N G
Sbjct: 143 GSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNG 202

Query: 65  INTERDYP-NVGVMDNCK 81
           +N+E  YP  + V D CK
Sbjct: 203 LNSEEQYPYEMKVNDECK 220


>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score = 70.9 bits (172), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LVD+S QQLVDC     ++ C GG ++  +QY+  N G
Sbjct: 136 GSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 195

Query: 65  INTERDYPNVGVMDN-CK 81
           ++TE  YP     D  CK
Sbjct: 196 LDTEESYPYTATDDKPCK 213


>gi|94421566|gb|ABF18890.1| cathepsin-L-like cysteine proteinase 2 [Lygus lineolaris]
          Length = 216

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 46/82 (56%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG  K+    L  +S QQLVDC     +  C GG ++  ++Y+ +N G
Sbjct: 93  GSCWAFSTTGSLEGQHKLKQGKLYSLSEQQLVDCSAAEGNMGCEGGLMDDGFKYIKKNGG 152

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE+ YP  G    C   + N
Sbjct: 153 IDTEKSYPYTGEDGKCHATKKN 174


>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
 gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
          Length = 341

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q L+DC     +  C GG ++  ++Y+  N G
Sbjct: 146 GSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNGG 205

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE+ YP  GV D C+
Sbjct: 206 IDTEKAYPYEGVDDKCR 222


>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
          Length = 344

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     +  LV +S Q L+DC  Q  +  C GG ++  ++Y+  N G
Sbjct: 149 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGG 208

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE+ YP  GV D C+
Sbjct: 209 IDTEQAYPYEGVDDKCR 225


>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 517

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+  GAIEGI+ IV+ +L+ +S  +LVDCD   +   C GG ++  +++V+ N G
Sbjct: 159 GCCWAFSSTGAIEGINAIVSGDLISLSEPELVDCDRTND--GCDGGHMDYAFEWVMHNGG 216

Query: 65  INTERDYPNVGVMDNCKV 82
           I+TE +YP  G    C V
Sbjct: 217 IDTETNYPYSGADGTCNV 234


>gi|41055337|ref|NP_956720.1| cathepsin S, a [Danio rerio]
 gi|32451845|gb|AAH54668.1| Cathepsin S, a [Danio rerio]
          Length = 239

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 33/70 (47%), Positives = 43/70 (61%)

Query: 3   PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
           P GSCW F+ VG++E   K  T  LV +S Q L+DC     +R C GGF+   + YVIQN
Sbjct: 44  PCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQN 103

Query: 63  RGINTERDYP 72
           RGI++   YP
Sbjct: 104 RGIDSSTFYP 113


>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
 gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
          Length = 339

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 44/76 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC  +  +  C GG ++  +QY+  N G
Sbjct: 144 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGG 203

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+ YP   + D C
Sbjct: 204 IDTEKSYPYEAIDDTC 219


>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
          Length = 362

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 33/82 (40%), Positives = 48/82 (58%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD   E++ C GG ++  + ++ +  G
Sbjct: 149 GSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTL-ENQGCNGGLMDLAFDFIKKTGG 207

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           +  E  YP       C   + N
Sbjct: 208 LTREDAYPYAAEDGKCDSNKMN 229


>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
 gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
          Length = 358

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 32/82 (39%), Positives = 49/82 (59%), Gaps = 2/82 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI T  L+ +S Q+LVDCD+  ++  C GG +E  + ++ Q  G
Sbjct: 149 GSCWAFSTVAAVEGINKIKTGELISLSEQELVDCDS--DNHGCNGGLMEDAFNFIKQIGG 206

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           + +E  YP     + C   + N
Sbjct: 207 LTSENTYPYRAKEEPCDSNKMN 228


>gi|89266543|gb|ABD65563.1| cathepsin S [Ictalurus punctatus]
          Length = 165

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 47/80 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG  K  T  +  +S Q LVDC ++  ++ C GGF+   +QYVI N G
Sbjct: 22  GSCWAFSAAGALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTQAFQYVIDNGG 81

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           I+++  YP   +   C+  Q
Sbjct: 82  IDSDEAYPYTAMDGQCRYDQ 101


>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
 gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
          Length = 372

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 32/70 (45%), Positives = 43/70 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEG     TN LV++S QQL+DC     +  C GG ++  +QYV  N G
Sbjct: 172 GSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNEG 231

Query: 65  INTERDYPNV 74
           I++E  YP +
Sbjct: 232 IDSEISYPYI 241


>gi|226469954|emb|CAX70258.1| Cathepsin L precursor [Schistosoma japonicum]
          Length = 372

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 32/70 (45%), Positives = 43/70 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEG     TN LV++S QQL+DC     +  C GG ++  +QYV  N G
Sbjct: 172 GSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNEG 231

Query: 65  INTERDYPNV 74
           I++E  YP +
Sbjct: 232 IDSEISYPYI 241


>gi|440797325|gb|ELR18416.1| cathepsin Llike cysteine protease [Acanthamoeba castellanii str.
           Neff]
          Length = 345

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 40/68 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GAIEG   + T  LVD+S + L+DC        C GG     +QYVI N+G
Sbjct: 135 GSCWAFSAAGAIEGQQALRTGRLVDLSEENLIDCSWAQGDMGCGGGLPSQAFQYVIDNKG 194

Query: 65  INTERDYP 72
           I+TE  YP
Sbjct: 195 IDTEARYP 202


>gi|297596716|ref|NP_001042970.2| Os01g0347600 [Oryza sativa Japonica Group]
 gi|255673204|dbj|BAF04884.2| Os01g0347600 [Oryza sativa Japonica Group]
          Length = 211

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 43/78 (55%), Gaps = 2/78 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW FA V AIEG++KI T  L  +S Q+LVDCD    S  C GG  +  ++ V    G
Sbjct: 15 GSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN--SNGCGGGHTDRAFELVASKGG 72

Query: 65 INTERDYPNVGVMDNCKV 82
          I  E DY   G    C+V
Sbjct: 73 ITAESDYRYEGFQGKCRV 90


>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
          Length = 340

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           G+CW F+ VGA+EG  K+ T  L+ +S Q LVDC N+ +  ++ C GG++   +QY+I N
Sbjct: 145 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 204

Query: 63  RGINTERDYPNVGVMDNC 80
            GI  +  YP   + + C
Sbjct: 205 GGIEADASYPYKAMDEKC 222


>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 367

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 44/68 (64%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI+ I + NL  +S QQLVDCD +  +  C GG ++  +QY+ ++ G
Sbjct: 159 GSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNA-GCNGGLMDYAFQYIAKHGG 217

Query: 65  INTERDYP 72
           +  E  YP
Sbjct: 218 VAAEDAYP 225


>gi|426216526|ref|XP_004002513.1| PREDICTED: cathepsin S isoform 2 [Ovis aries]
          Length = 281

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC   +  ++ C GGF+   +QY+I N 
Sbjct: 87  GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAFQYIIDNN 146

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +   C+
Sbjct: 147 GIDSEASYPYKAMDGRCQ 164


>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
          Length = 331

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC   +  ++ C GGF+   +QY+I N 
Sbjct: 137 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAFQYIIDNN 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +   C+
Sbjct: 197 GIDSEASYPYKAMDGRCQ 214


>gi|1085731|pir||S46476 cysteine proteinase (EC 3.4.22.-) III - mountain papaya
 gi|926847|gb|AAB32657.1| cysteine proteinase CC-III [Carica candamarcensis=mountain
          papaya, Hook, latex, Peptide, 214 aa]
          Length = 214

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 49/78 (62%), Gaps = 3/78 (3%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ +  +EGI+KIV  NL  +S Q+LVDCD +  S  C GG+  T  +YV+ + G
Sbjct: 23 GSCWAFSTIATVEGINKIVHGNLTSLSEQELVDCDRR--SHGCKGGYQTTSLKYVV-DHG 79

Query: 65 INTERDYPNVGVMDNCKV 82
          ++TE++YP       C+ 
Sbjct: 80 VHTEKEYPYEEKQYKCRA 97


>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
          Length = 340

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+   +QY+I N 
Sbjct: 146 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNN 205

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP       C+
Sbjct: 206 GIDSEASYPYKATDGKCR 223


>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
          Length = 339

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 47/76 (61%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A EGI++I T  L+ +S Q+LVDCD  GE++ C GG ++  +++ I+  G
Sbjct: 144 GCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRF-IKIHG 202

Query: 65  INTERDYPNVGVMDNC 80
           + +E  YP  G    C
Sbjct: 203 LASEATYPYEGDDGTC 218


>gi|442539990|gb|AGC54590.1| bromelain, partial [Ananas comosus]
          Length = 241

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 3/84 (3%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           +P GSCW FA +  +EGI KI T  LV +S Q+++DC     S  C GG++   Y ++I 
Sbjct: 32  NPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDC---AVSYGCKGGWVNKAYDFIIS 88

Query: 62  NRGINTERDYPNVGVMDNCKVFQF 85
           N G+ TE +YP       C    F
Sbjct: 89  NNGVTTEENYPYQAYQGTCNANSF 112


>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
          Length = 340

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           G+CW F+ VGA+EG  K+ T  L+ +S Q LVDC N+ +  ++ C GG++   +QY+I N
Sbjct: 145 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 204

Query: 63  RGINTERDYPNVGVMDNC 80
            GI  +  YP   + + C
Sbjct: 205 GGIEADASYPYKAMDEKC 222


>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
          Length = 334

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG +   T  LV +S Q L+DC  +  +  C GG ++  +QY+  N+G
Sbjct: 140 GSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKG 199

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP     D C+
Sbjct: 200 IDTENTYPYEAEDDVCR 216


>gi|294938848|ref|XP_002782226.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239893730|gb|EER14021.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 334

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 1/80 (1%)

Query: 3   PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
           P GSCW F+  GA+E    I T  L+ +S QQL+DC +   +  C GG +E  Y Y I++
Sbjct: 132 PCGSCWAFSATGALEAQYAIATGKLLSLSEQQLIDCSSSYGNEGCSGGLMENAYTY-IKS 190

Query: 63  RGINTERDYPNVGVMDNCKV 82
            G++ E  YP +   + C+V
Sbjct: 191 AGLDQESTYPYIAKNNACQV 210


>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
 gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
          Length = 430

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 46/68 (67%), Gaps = 2/68 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EGI+KI T  LV +S Q++V C  Q  +  C GG ++  ++++++N G
Sbjct: 223 GSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQ--NMGCNGGLMDYAFRWIVKNGG 280

Query: 65  INTERDYP 72
           I++E  YP
Sbjct: 281 IDSEFQYP 288


>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LVD+S QQLVDC     ++ C GG ++  +QY+  N G
Sbjct: 136 GSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 195

Query: 65  INTERDYPNVGVMDN-CK 81
           ++TE  YP     D  CK
Sbjct: 196 LDTEESYPYTATDDKPCK 213


>gi|47086663|ref|NP_997853.1| cathepsin H precursor [Danio rerio]
 gi|45709087|gb|AAH67615.1| Cathepsin H [Danio rerio]
          Length = 330

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 28/79 (35%), Positives = 45/79 (56%)

Query: 3   PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
           P GSCW F+  G +E ++ I T  L+ ++ QQL+DC    ++  C GG     ++Y++ N
Sbjct: 132 PCGSCWTFSTTGCLESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYN 191

Query: 63  RGINTERDYPNVGVMDNCK 81
           +G+ TE DYP       C+
Sbjct: 192 KGLMTEDDYPYQAKGGQCR 210


>gi|146386360|gb|ABQ23968.1| cathepsin S [Oryctolagus cuniculus]
          Length = 162

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 32/69 (46%), Positives = 45/69 (65%), Gaps = 1/69 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T NLV +S Q LVDC   +  ++ C GGF+   +QY+I N 
Sbjct: 90  GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTTKYGNKGCNGGFMTEAFQYIIDNN 149

Query: 64  GINTERDYP 72
           GI++E  YP
Sbjct: 150 GIDSEASYP 158


>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
          Length = 339

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 44/76 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC  +  +  C GG ++  +QY+  N G
Sbjct: 144 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGG 203

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+ YP   + D C
Sbjct: 204 IDTEKSYPYEAIDDTC 219


>gi|410907221|ref|XP_003967090.1| PREDICTED: pro-cathepsin H-like [Takifugu rubripes]
          Length = 324

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/76 (39%), Positives = 42/76 (55%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G +E ++ I +  LV +S QQLVDC     +  C GG     ++Y+  N+G
Sbjct: 130 GSCWTFSTTGCLESVTAINSGKLVPLSEQQLVDCAQDFNNHGCNGGLPSQAFEYIKYNKG 189

Query: 65  INTERDYPNVGVMDNC 80
           + TE DYP     D C
Sbjct: 190 LMTESDYPYTAFEDKC 205


>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
          Length = 328

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+   +QY+I N 
Sbjct: 134 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNN 193

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP       C+
Sbjct: 194 GIDSEASYPYKATDGKCR 211


>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
 gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
          Length = 299

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + +IE    + T  LV +S QQL+DCD     + C GGF E  +++V++N G
Sbjct: 110 GSCWAFSAIASIESAHFLATKELVSLSEQQLIDCDTV--DQGCQGGFPEDAFKFVVENGG 167

Query: 65  INTERDYPNVGVMDNC 80
           + TE  YP  G   +C
Sbjct: 168 VTTEEAYPYTGFAGSC 183


>gi|4469159|emb|CAB38317.1| chymopapain isoform V [Carica papaya]
          Length = 227

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 48/78 (61%), Gaps = 3/78 (3%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ +  +EGI+KIVT NL+++S Q+LVDCD    S  C GG+  T  QYV  N G
Sbjct: 23 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYVANN-G 79

Query: 65 INTERDYPNVGVMDNCKV 82
          ++T + YP       C+ 
Sbjct: 80 VHTSKVYPCQAKQYKCRA 97


>gi|313507179|pdb|2ACT|A Chain A, Crystallographic Refinement Of The Structure Of Actinidin
           At 1.7 Angstroms Resolution By Fast Fourier
           Least-Squares Methods
          Length = 220

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 48/78 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G  W F+ +  +EGI+KI + +L+ +S Q+L+DC     +R C GG+I   +Q++I + G
Sbjct: 23  GGXWAFSAIATVEGINKITSGSLISLSEQELIDCGRTQNTRGCDGGYITDGFQFIINDGG 82

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP      +C V
Sbjct: 83  INTEENYPYTAQDGDCDV 100


>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
 gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
          Length = 380

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 42/76 (55%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +EGI +I T  LV +S Q+LVDCD       C GG      +++  N G
Sbjct: 184 GSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTL--DAGCDGGISYRALRWITSNGG 241

Query: 65  INTERDYPNVGVMDNC 80
           + TE DYP  G  D C
Sbjct: 242 LTTEEDYPYTGTTDAC 257


>gi|197258082|gb|ACH56225.1| cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 282

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 42/76 (55%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG  K  T  LV +S Q LVDC     +  C GG ++  ++YV QN G
Sbjct: 87  GSCWAFSATGSLEGQHKRATGKLVSLSEQNLVDCSADFGNNGCNGGLMDFAFEYVKQNHG 146

Query: 65  INTERDYPNVGVMDNC 80
           I+TE  YP       C
Sbjct: 147 IDTEESYPYKAKQKKC 162


>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
 gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
 gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
 gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
          Length = 358

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 42/68 (61%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+ IVT NL  +S Q+L+DC   G +  C GG ++  + Y+    G
Sbjct: 163 GSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNN-GCNGGLMDYAFSYIASTGG 221

Query: 65  INTERDYP 72
           + TE  YP
Sbjct: 222 LRTEEAYP 229


>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
          Length = 341

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  L  +S Q L+DC  +  +  C GG ++  + Y+  N+G
Sbjct: 145 GSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCSGKYGNNGCSGGLMDNAFAYIKSNKG 204

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE+ YP  G+ D C+
Sbjct: 205 IDTEQSYPYEGIDDKCR 221


>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
 gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
          Length = 343

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           G+CW F+ VGA+EG  K+ T  L+ +S Q LVDC N+ +  ++ C GG++   +QY+I N
Sbjct: 148 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 207

Query: 63  RGINTERDYPNVGVMDNC 80
            GI  +  YP     + C
Sbjct: 208 GGIEADASYPYKATDEKC 225


>gi|318054062|ref|NP_001187179.1| cathepsin S precursor [Ictalurus punctatus]
 gi|190351079|gb|ACE75948.1| cathepsin S [Ictalurus punctatus]
          Length = 329

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/80 (40%), Positives = 47/80 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG  K  T  +  +S Q LVDC ++  ++ C GGF+   +QYVI N G
Sbjct: 136 GSCWAFSAAGALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTEAFQYVIDNGG 195

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           I+++  YP   +   C+  Q
Sbjct: 196 IDSDEAYPYTAMDGQCRYDQ 215


>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
 gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
          Length = 340

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 44/76 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC  +  +  C GG ++  +QY+  N G
Sbjct: 145 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDNGG 204

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+ YP   + D C
Sbjct: 205 IDTEKAYPYEAIDDTC 220


>gi|42744610|gb|AAH66625.1| Ctssa protein [Danio rerio]
          Length = 321

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/70 (47%), Positives = 43/70 (61%)

Query: 3   PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
           P GSCW F+ VG++E   K  T  LV +S Q L+DC     +R C GGF+   + YVIQN
Sbjct: 126 PCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQN 185

Query: 63  RGINTERDYP 72
           RGI++   YP
Sbjct: 186 RGIDSSTFYP 195


>gi|413933048|gb|AFW67599.1| hypothetical protein ZEAMMB73_513726 [Zea mays]
          Length = 205

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 46/76 (60%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          G CW F+ V A+EG++KI T  LV +S Q+LVDCD  G  + C GG ++  +Q+V +  G
Sbjct: 11 GCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGG 70

Query: 65 INTERDYPNVGVMDNC 80
          + +E  YP  G    C
Sbjct: 71 LASESGYPYQGRDGPC 86


>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
          Length = 352

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 3/84 (3%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           +P GSCW FA +  +EGI KI T  LV +S Q+++DC     S  C GG++   Y ++I 
Sbjct: 143 NPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDC---AVSYGCKGGWVNKAYDFIIS 199

Query: 62  NRGINTERDYPNVGVMDNCKVFQF 85
           N G+ TE +YP       C    F
Sbjct: 200 NNGVTTEENYPYQAYQGTCNANSF 223


>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
          Length = 324

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 3/84 (3%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           +P GSCW FA +  +EGI KI T  LV +S Q+++DC     S  C GG++   Y ++I 
Sbjct: 115 NPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDC---AVSYGCKGGWVNKAYDFIIS 171

Query: 62  NRGINTERDYPNVGVMDNCKVFQF 85
           N G+ TE +YP       C    F
Sbjct: 172 NNGVTTEENYPYQAYQGTCNANSF 195


>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
 gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
          Length = 336

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LVD+S QQLVDC     ++ C GG ++  +QY+  N G
Sbjct: 138 GSCWAFSTTGSLEGQHANKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 197

Query: 65  INTERDYPNVGVMDN-CK 81
           ++TE  YP     D  CK
Sbjct: 198 LDTEESYPYTATDDKPCK 215


>gi|353441136|gb|AEQ94152.1| drought-inducible cysteine proteinase [Elaeis guineensis]
          Length = 252

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG + + T  L  +S QQLVDCD++ +S         C GG + T ++
Sbjct: 164 GSCWSFSASGALEGANFLATGQLESLSEQQLVDCDHECDSSEPDSCDSGCNGGLMTTAFE 223

Query: 58  YVIQNRGINTERDYPNVGV 76
           Y++++ G+  E+DYP  G 
Sbjct: 224 YLLKSGGLELEKDYPYTGT 242


>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
          Length = 340

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           G+CW F+ VGA+EG  K+ T  L+ +S Q LVDC N+ +  ++ C GG++   +QY+I N
Sbjct: 145 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 204

Query: 63  RGINTERDYPNVGVMDNC 80
            GI  +  YP     + C
Sbjct: 205 GGIEADASYPYKATDEKC 222


>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
          Length = 298

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 46/76 (60%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI++I T  L+ +S Q+LVDCD  GE++ C GG  +  +++ I   G
Sbjct: 103 GSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGCSGGLXDDAFRF-IXIHG 161

Query: 65  INTERDYPNVGVMDNC 80
           + +E  YP  G    C
Sbjct: 162 LASEATYPYEGDDGTC 177


>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
          Length = 330

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           G+CW F+ VGA+EG  K+ T  L+ +S Q LVDC N+ +  ++ C GG++   +QY+I N
Sbjct: 135 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 194

Query: 63  RGINTERDYPNVGVMDNC 80
            GI  +  YP   + + C
Sbjct: 195 GGIEADASYPYKAMDEKC 212


>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
          Length = 351

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 44/77 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  +VD+S Q LVDC     +  C GG +   ++Y+  N+G
Sbjct: 151 GSCWAFSTTGSLEGQHMRKTGTMVDLSEQNLVDCSTSYGNDGCNGGLMTNAFKYIKDNKG 210

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP  G   +CK
Sbjct: 211 IDTEEAYPYAGRDGDCK 227


>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
 gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
 gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
          Length = 342

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           G+CW F+ VGA+EG  K+ T  L+ +S Q LVDC N+ +  ++ C GG++   +QY+I N
Sbjct: 147 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 206

Query: 63  RGINTERDYPNVGVMDNC 80
            GI  +  YP     + C
Sbjct: 207 GGIEADASYPYKATDEKC 224


>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 414

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 28/80 (35%), Positives = 49/80 (61%), Gaps = 1/80 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG++ I T  L+ +S ++L+ C   G +  C GG ++  +++++ NRG
Sbjct: 179 GSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNG-NMGCNGGLMDNGFEWIVNNRG 237

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           I+TE  +  V   + C  F+
Sbjct: 238 IDTEDGWEYVAKEEKCGFFR 257


>gi|74152091|dbj|BAE32077.1| unnamed protein product [Mus musculus]
          Length = 245

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           G+CW F+ VGA+EG  K+ T  L+ +S Q LVDC N+ +  ++ C GG++   +QY+I N
Sbjct: 50  GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 109

Query: 63  RGINTERDYPNVGVMDNC 80
            GI  +  YP     + C
Sbjct: 110 GGIEADASYPYKATDEKC 127


>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
          Length = 505

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 29/68 (42%), Positives = 44/68 (64%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG  +I + N+V++S Q LVDC     +  C GG ++  ++Y+I N G
Sbjct: 288 GSCWSFSTTGSVEGAHQIKSGNMVELSEQNLVDCSTSEGNMGCNGGLMDYAFEYIITNNG 347

Query: 65  INTERDYP 72
           I+TE  YP
Sbjct: 348 IDTESSYP 355


>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
 gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
          Length = 373

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++EGI+ I T +LV +S Q+L+DCD   ++  C GG ++  ++Y+  N G
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGG 214

Query: 65  INTERDYPNVGVMDNCKV 82
           + TE  YP       C V
Sbjct: 215 LITEAAYPYRAARGTCNV 232


>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
          Length = 350

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 48/76 (63%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A EGI +I T  L+ +S Q+LVDCD+      C GG +E  ++++I+N G
Sbjct: 143 GSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV--DHGCDGGLMEDGFEFIIKNGG 200

Query: 65  INTERDYPNVGVMDNC 80
           I++E +YP   V   C
Sbjct: 201 ISSEANYPYTAVDGTC 216


>gi|441593109|ref|XP_003260582.2| PREDICTED: cathepsin L2 isoform 1 [Nomascus leucogenys]
          Length = 334

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC     ++ C GGF+   +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMGKAFQYVKENGG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V + + CK
Sbjct: 196 LDSEESYPYVAMDEICK 212


>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 342

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     +  LV +S Q L+DC +   +  C GG ++  ++Y+  N G
Sbjct: 147 GSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYIKDNGG 206

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE+ YP  GV D C+
Sbjct: 207 IDTEKTYPYEGVDDKCR 223


>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
          Length = 325

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 46/78 (58%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+V G +EG   + T  LV +S QQLVDCD Q     C GG+  T Y  +I+  G
Sbjct: 134 GSCWAFSVAGNVEGQWFLKTGQLVSLSKQQLVDCDVQ--DSGCDGGYPPTTYGEIIRMGG 191

Query: 65  INTERDYPNVGVMDNCKV 82
           +  +RDYP VG    CK+
Sbjct: 192 LEAQRDYPYVGREQPCKL 209


>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
 gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
 gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 341

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 46/77 (59%), Gaps = 2/77 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EG++KI    LV +S QQL+DC    E+  C GG +   + Y+ +N+G
Sbjct: 149 GCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDCST--ENNGCGGGIMWKAFDYIKENQG 206

Query: 65  INTERDYPNVGVMDNCK 81
           I TE +YP  G    C+
Sbjct: 207 ITTEDNYPYQGAQQTCE 223


>gi|66814630|ref|XP_641494.1| cysteine protease [Dictyostelium discoideum AX4]
 gi|118121|sp|P04989.1|CYSP2_DICDI RecName: Full=Cysteine proteinase 2; AltName: Full=Prestalk
           cathepsin; Flags: Precursor
 gi|167860|gb|AAA33240.1| pst-cathepsin [Dictyostelium discoideum]
 gi|1834417|emb|CAA27050.1| cysteine proteinase 2 [Dictyostelium discoideum]
 gi|60469522|gb|EAL67513.1| cysteine protease [Dictyostelium discoideum AX4]
 gi|225484|prf||1304284A cathepsin,prestalk
          Length = 376

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 30/68 (44%), Positives = 41/68 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G+ EG   + T  LV +S Q LVDC    E+  C GG +   + Y+I+N+G
Sbjct: 145 GSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKG 204

Query: 65  INTERDYP 72
           I+TE  YP
Sbjct: 205 IDTESSYP 212


>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 385

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 45/76 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VG++EG     T  LV +S Q LVDC     +  C GG+++  ++YV  N G
Sbjct: 190 GSCWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHG 249

Query: 65  INTERDYPNVGVMDNC 80
           I+TE  YP VG   +C
Sbjct: 250 IDTEDSYPYVGTDGSC 265


>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
           parachinensis]
          Length = 260

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V AIEG ++I    L+ +S QQLVDCD       C GG I+T +++++   G
Sbjct: 66  GCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCSGGLIDTAFEHIMATGG 123

Query: 65  INTERDYPNVGVMDNCKV 82
           + TE +YP  G    CK+
Sbjct: 124 LTTESNYPYKGEDATCKI 141


>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
          Length = 321

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 45/84 (53%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   I T  L+ +S QQLVDCD+  + R        C GG +   Y+
Sbjct: 114 GSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYK 173

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y+I+  G+  E  YP  G    CK
Sbjct: 174 YLIEAGGLEEESSYPYTGKHGECK 197


>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
          Length = 340

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           G+CW F+ VGA+EG  K+ T  L+ +S Q LVDC N+ +  ++ C GG++   +QY+I N
Sbjct: 145 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 204

Query: 63  RGINTERDYPNVGVMDNC 80
            GI  +  YP     + C
Sbjct: 205 GGIEADASYPYKATDEKC 222


>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F++V AIEGI +I T  LV +S Q+LVDC  +G+S  C  G+ E  +++V +N G
Sbjct: 146 GSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDC-VKGKSEGCNFGYKEEAFEFVAKNGG 204

Query: 65  INTERDYPNVGVMDNCKV 82
           + +E  YP       C V
Sbjct: 205 LASEISYPYKANNKTCMV 222


>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
          Length = 316

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 44/77 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG   + T  LV +S Q LVDC     +  C GG +   +QYV  N+G
Sbjct: 122 GSCWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKG 181

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP     +NC+
Sbjct: 182 IDTEASYPYEARENNCR 198


>gi|37788267|gb|AAO64473.1| cathepsin H precursor [Fundulus heteroclitus]
          Length = 345

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 42/68 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G +E ++ I T  LV +S QQLVDC     +  C GG     ++Y++ N+G
Sbjct: 151 GSCWTFSTTGCLESVTAIATVKLVPLSEQQLVDCAQDFNNHGCNGGLPSQAFEYIMYNKG 210

Query: 65  INTERDYP 72
           + TE+DYP
Sbjct: 211 LMTEQDYP 218


>gi|358339356|dbj|GAA47436.1| cathepsin L [Clonorchis sinensis]
          Length = 236

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FAV G IEG     T  LV +S QQL+DCD + E  +C GGF E  Y+ +++  G
Sbjct: 43  GSCWAFAVTGNIEGQWYKKTKKLVSLSEQQLLDCDKKDE--ACNGGFPEWAYESIVKMGG 100

Query: 65  INTERDYPNVGVMDNCKV 82
           + +E+DYP     + C +
Sbjct: 101 LMSEKDYPYEAHKETCNL 118


>gi|391333248|ref|XP_003741031.1| PREDICTED: uncharacterized protein LOC100898636 [Metaseiulus
           occidentalis]
          Length = 642

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  L  +S Q LVDC    ES+ C GGF E  +QY+  N G
Sbjct: 450 GSCWAFSATGAVEGQHFKATGRLESLSEQNLVDCVK--ESKGCDGGFFEQAFQYIKDNGG 507

Query: 65  INTERDYPNVGVMDNCK 81
           INTE  YP      +C+
Sbjct: 508 INTEDSYPYEAFDGSCR 524



 Score = 62.8 bits (151), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 30/80 (37%), Positives = 39/80 (48%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW FA  GAIEG     T NLV +S Q ++DC     S  C GG     + Y+  + G
Sbjct: 128 GACWTFAATGAIEGQHFKATGNLVSLSEQNILDCVKTATSNGCSGGLFVEAFDYLKNSGG 187

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           I+ E  YP       C+  Q
Sbjct: 188 IDAEESYPYEASGGTCRFRQ 207


>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
           (fragment)
 gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
 gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
 gi|226542|prf||1601514A actinidin
          Length = 302

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 46/78 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ +  +EGI+KIVT  L+ +S Q+L+ C     +R C GG+I   +Q++I N G
Sbjct: 80  GGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGTQNTRGCNGGYITDGFQFIINNGG 139

Query: 65  INTERDYPNVGVMDNCKV 82
           INT  +YP       C +
Sbjct: 140 INTGENYPYTAQDGECNL 157


>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
 gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
          Length = 371

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++EGI+ I T +LV +S Q+L+DCD   ++  C GG ++  ++Y+  N G
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGG 214

Query: 65  INTERDYPNVGVMDNCKV 82
           + TE  YP       C V
Sbjct: 215 LITEAAYPYRAARGTCNV 232


>gi|417409774|gb|JAA51378.1| Putative cathepsin k, partial [Desmodus rotundus]
          Length = 331

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 34/76 (44%), Positives = 49/76 (64%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   + YV +N+G
Sbjct: 139 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFHYVQKNQG 196

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 197 IDSEDAYPYVGQDESC 212


>gi|118136313|gb|ABK62794.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
 gi|118136315|gb|ABK62795.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
          Length = 335

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/77 (38%), Positives = 42/77 (54%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G+IEG  K  T  L+  S QQLVDC     +  C GG ++  + Y+I N+G
Sbjct: 140 GSCWAFSSTGSIEGAVKRATGKLISFSEQQLVDCSTAFGNHGCNGGIMDNSFNYLIHNKG 199

Query: 65  INTERDYPNVGVMDNCK 81
           + +E  YP       C+
Sbjct: 200 LESEASYPYEAQKKECR 216


>gi|162815|gb|AAA30435.1| cathepsin S, partial [Bos taurus]
 gi|312895|emb|CAA43971.1| cathepsin S [Bos taurus]
          Length = 196

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
          GSCW F+ VGA+E   K+ T  LV +S Q LVDC   +  ++ C GGF+   +QY+I N 
Sbjct: 2  GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 61

Query: 64 GINTERDYPNVGVMDNCK 81
          GI++E  YP   +   C+
Sbjct: 62 GIDSEASYPYKAMDGKCQ 79


>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 47/77 (61%), Gaps = 2/77 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ VG++EG  KI T NL++ S Q+L+DC     +  C GGF+   + ++I+N G
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGG 209

Query: 65  INTERDYPNVGVMDNCK 81
           I+ E DY  +G    C+
Sbjct: 210 ISRESDYEYLGQQYTCR 226


>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
 gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
 gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
          Length = 326

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           G+CW F+ VGA+EG  K+ T  L+ +S Q LVDC N+ +  ++ C GG++   +QY+I N
Sbjct: 131 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 190

Query: 63  RGINTERDYPNVGVMDNC 80
            GI  +  YP     + C
Sbjct: 191 GGIEADASYPYKATDEKC 208


>gi|255211|gb|AAB23202.1| cathepsin S [cattle, spleen, Peptide Partial, 217 aa]
 gi|227966|prf||1714236A cathepsin S
          Length = 217

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC   +  ++ C GGF+   +QY+I N 
Sbjct: 23  GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 82

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +   C+
Sbjct: 83  GIDSEASYPYKAMDGKCQ 100


>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
           distachyon]
          Length = 377

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 28/69 (40%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNR- 63
           GSCW F+ V ++EG++ I T +LV +S Q+L+DCD  G+   C GG +E+ ++++  +  
Sbjct: 158 GSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEFIAHSAG 217

Query: 64  GINTERDYP 72
           G+ TE  YP
Sbjct: 218 GLATEAAYP 226


>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
          Length = 338

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/77 (38%), Positives = 45/77 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG +   T  L+ +S Q L+DC  +  +  C GG ++  +QY+  N+G
Sbjct: 144 GSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKG 203

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP     D C+
Sbjct: 204 IDTENTYPYEAEDDVCR 220


>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
 gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
 gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
          Length = 371

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++EGI+ I T  LV +S Q+L+DCD   ++  C GG +E  ++Y+  + G
Sbjct: 157 GSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDT-ADNSGCQGGLMENAFEYIKHSGG 215

Query: 65  INTERDYPNVGVMDNC 80
           I TE  YP       C
Sbjct: 216 ITTESAYPYRAANGTC 231


>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
 gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
 gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
 gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
 gi|1096153|prf||2111244A Cys protease
          Length = 380

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/84 (38%), Positives = 46/84 (54%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
           GSCW F+  G+IEG + + T  LV +S QQL+DCDN+ +          C GG +   Y 
Sbjct: 162 GSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYN 221

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y++++ G+  E  YP  G    CK
Sbjct: 222 YLLESGGLEEESSYPYTGERGECK 245


>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 36/89 (40%), Positives = 47/89 (52%), Gaps = 10/89 (11%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   + T  L+ +S QQLVDCD+Q +          C GG +   Y+
Sbjct: 161 GSCWAFSTTGAVEGAHFLATGKLLSLSEQQLVDCDHQCDPEEAQACDAGCGGGLMTNAYK 220

Query: 58  YVIQNRGINTERDYPNVGVMDNCKVFQFN 86
           YV +  G+  E DYP  G    C   QFN
Sbjct: 221 YVEEAGGLELESDYPYKGRDGKC---QFN 246


>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
          Length = 375

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 45/84 (53%), Gaps = 7/84 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
           GSCW F+  GA+EG   I T  L+ +S QQLVDCD+  + R        C GG +   Y+
Sbjct: 168 GSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYK 227

Query: 58  YVIQNRGINTERDYPNVGVMDNCK 81
           Y+I+  G+  E  YP  G    CK
Sbjct: 228 YLIEAGGLEEESSYPYTGKHGECK 251


>gi|405977173|gb|EKC41636.1| Cathepsin K [Crassostrea gigas]
          Length = 942

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW FA  G +EG     T  LV +S Q LVDC    E+  C GG   T Y+Y+ +N G
Sbjct: 750 GSCWAFATTGGLEGQHFRKTKKLVSLSEQNLVDCCK--ENLGCTGGLPVTAYKYIARNGG 807

Query: 65  INTERDYPNVGVMDNC 80
           I+TE  YP +G   NC
Sbjct: 808 IDTEESYPYLGKNGNC 823


>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
 gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
          Length = 367

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 29/78 (37%), Positives = 48/78 (61%), Gaps = 7/78 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
           GSCW F+ +GA+EG   + T NL+ +S QQLVDCD++ +        + C GG + + ++
Sbjct: 157 GSCWSFSAIGALEGAHFLTTGNLISMSEQQLVDCDHECDPEEYGACDQGCNGGLMTSAFE 216

Query: 58  YVIQNRGINTERDYPNVG 75
           Y+++  G+  E  YP +G
Sbjct: 217 YILKAGGVEREETYPYIG 234


>gi|357127811|ref|XP_003565571.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 364

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 37/77 (48%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW FA V AIEG++KI T  LV +S QQLVDCD    S  C GG  +T    V +  GI
Sbjct: 168 SCWAFAAVAAIEGMNKIRTGTLVSLSEQQLVDCDKG--SSGCAGGRTDTALDLVAKRGGI 225

Query: 66  NTERDYPNVGVMDNCKV 82
            +E  YP  G    C V
Sbjct: 226 TSEEKYPYGGFNGKCNV 242


>gi|163658591|gb|ABY28387.1| cathepsin L [Gnathostoma spinigerum]
          Length = 398

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/72 (43%), Positives = 43/72 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T+ LV +S Q LVDC  +  +  C GG ++  ++Y+  N G
Sbjct: 202 GSCWAFSATGALEGQHMRKTHQLVSLSEQNLVDCSRKYGNNGCNGGLMDNAFEYIKDNHG 261

Query: 65  INTERDYPNVGV 76
           I+TE  YP  GV
Sbjct: 262 IDTEESYPYKGV 273


>gi|310656787|gb|ADP02216.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 195

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 46/78 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A E I K+ T  LV +S Q+LVDCD  G  + C GG ++  ++++I+N G
Sbjct: 74  GCCWAFSAVAATECIVKLSTGKLVSLSEQELVDCDIHGVDQGCEGGEMDDAFKFIIKNGG 133

Query: 65  INTERDYPNVGVMDNCKV 82
           + TE +YP       CK 
Sbjct: 134 LTTEANYPYTAQDGQCKT 151


>gi|294878199|ref|XP_002768307.1| cryptopain precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239870555|gb|EER01025.1| cryptopain precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 337

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 44/68 (64%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG+ K VT  LVD+S QQL+DC  +  +  C GG ++  Y+YV ++ G
Sbjct: 139 GSCWAFSTVGALEGLYKEVTGKLVDLSEQQLMDCSKEYGNEGCGGGNMDRAYEYV-EDHG 197

Query: 65  INTERDYP 72
           I     YP
Sbjct: 198 IKLNATYP 205


>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
           endopeptidase; AltName: Full=Papaya peptidase B;
           AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
           Precursor
 gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
          Length = 348

 Score = 70.1 bits (170), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 37/79 (46%), Positives = 47/79 (59%), Gaps = 3/79 (3%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ V  +EGI+KI T NLV++S Q+LVDCD Q  S  C  G+  T  QYV QN GI
Sbjct: 156 SCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ--SYGCNRGYQSTSLQYVAQN-GI 212

Query: 66  NTERDYPNVGVMDNCKVFQ 84
           +    YP +     C+  Q
Sbjct: 213 HLRAKYPYIAKQQTCRANQ 231


>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
          Length = 365

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 44/77 (57%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G++EG     T  LV +S Q LVDC     +  C GG ++  ++YV +N G
Sbjct: 170 GSCWAFSATGSLEGQWFRKTGKLVSLSEQNLVDCSTAQGNSGCQGGLMDNAFEYVKENGG 229

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP +   D C+
Sbjct: 230 IDTEESYPYIAADDTCQ 246


>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
 gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
          Length = 339

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/76 (39%), Positives = 45/76 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG        L+ +S Q LVDC  +  +  C GG ++  ++Y+  N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+ YP  G+ D+C
Sbjct: 204 IDTEKSYPYEGIDDSC 219


>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
          Length = 331

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 32/79 (40%), Positives = 44/79 (55%)

Query: 3   PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
           P GSCW F+  GA+EG     T  LV +S Q LVDC     +  C GG ++  +QYV  N
Sbjct: 134 PCGSCWAFSATGALEGQMFRKTKRLVSLSEQNLVDCSQAEGNEGCSGGLMDYAFQYVKDN 193

Query: 63  RGINTERDYPNVGVMDNCK 81
            G+++E  YP     ++CK
Sbjct: 194 GGLDSEESYPYRAQDESCK 212


>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
 gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
 gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
          Length = 334

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC +   ++ C GGF+ + ++YV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSAFRYVKENGG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V +   CK
Sbjct: 196 LDSEESYPYVAMDGICK 212


>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
 gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
          Length = 298

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 46/68 (67%)

Query: 13  VGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYP 72
           V A+EGI+++ T  L+ +S Q++VDCD +GE + C GG ++  ++++ QN+G+ TE +YP
Sbjct: 111 VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 170

Query: 73  NVGVMDNC 80
             G    C
Sbjct: 171 YTGTDGTC 178


>gi|225707828|gb|ACO09760.1| Cathepsin S precursor [Osmerus mordax]
          Length = 282

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 43/68 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EG  K    +LV +S Q LVDC  +  +  C GG++   Y YVI NRG
Sbjct: 139 GSCWAFSSIGALEGQMKRRNGSLVPLSPQNLVDCSTRFGNHGCKGGYLSKSYLYVISNRG 198

Query: 65  INTERDYP 72
           I++E  YP
Sbjct: 199 IDSESFYP 206


>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
          Length = 336

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 42/68 (61%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+ IVT NL  +S Q+L+DC   G +  C GG ++  + Y+    G
Sbjct: 141 GSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNN-GCNGGLMDYAFSYIASTGG 199

Query: 65  INTERDYP 72
           + TE  YP
Sbjct: 200 LRTEEAYP 207


>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 357

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 43/68 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
            SCW F+ V A+EGI +I ++NLV +STQQL+DC     +  C  G ++  ++Y+  N G
Sbjct: 157 ASCWAFSAVAAVEGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGG 216

Query: 65  INTERDYP 72
           I  E DYP
Sbjct: 217 IAAESDYP 224


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.323    0.140    0.458 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,409,333,940
Number of Sequences: 23463169
Number of extensions: 48738852
Number of successful extensions: 92530
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3846
Number of HSP's successfully gapped in prelim test: 810
Number of HSP's that attempted gapping in prelim test: 86587
Number of HSP's gapped (non-prelim): 4694
length of query: 88
length of database: 8,064,228,071
effective HSP length: 58
effective length of query: 30
effective length of database: 6,703,364,269
effective search space: 201100928070
effective search space used: 201100928070
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 69 (31.2 bits)