BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047400
(88 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 88.6 bits (218), Expect = 4e-16, Method: Composition-based stats.
Identities = 39/76 (51%), Positives = 52/76 (68%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+KIVTNNL+ +S Q+L+DCD E C GG ++ +Q+VI N G
Sbjct: 162 GGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDT--EDYGCQGGEMQKAFQFVIDNGG 219
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP +G C
Sbjct: 220 IDTEADYPFIGTNGTC 235
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 88.6 bits (218), Expect = 4e-16, Method: Composition-based stats.
Identities = 37/78 (47%), Positives = 54/78 (69%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW FA V A+EGI+KI + L+ +S Q+L+DCD + ++ C GG +ET Y ++I+N G
Sbjct: 150 GGCWAFAAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGG 209
Query: 65 INTERDYPNVGVMDNCKV 82
+ TE+DYP GV CK+
Sbjct: 210 LTTEQDYPYEGVDGTCKM 227
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 88.6 bits (218), Expect = 5e-16, Method: Composition-based stats.
Identities = 38/77 (49%), Positives = 52/77 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI KI T LV +S Q+LVDCD +G + C GG++E ++++I+N G
Sbjct: 148 GSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGTDQGCEGGYMEDGFEFIIKNGG 207
Query: 65 INTERDYPNVGVMDNCK 81
I TE +YP V +CK
Sbjct: 208 ITTEANYPYKAVDGSCK 224
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 88.2 bits (217), Expect = 5e-16, Method: Composition-based stats.
Identities = 36/68 (52%), Positives = 50/68 (73%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI+++ T LV +S Q+LVDCDNQGE + C GG +E ++++I+N G
Sbjct: 146 GSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGEDQGCEGGLMEDGFEFIIKNHG 205
Query: 65 INTERDYP 72
I TE +YP
Sbjct: 206 ITTEANYP 213
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 87.8 bits (216), Expect = 8e-16, Method: Composition-based stats.
Identities = 38/78 (48%), Positives = 54/78 (69%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA G++EGI+ IVT +LV +S Q+LVDCD + + + C GG ++ Y ++I+N+G
Sbjct: 124 GSCWAFATTGSVEGINAIVTGSLVSLSEQELVDCDTE-QDKGCSGGLMDYAYAWIIKNKG 182
Query: 65 INTERDYPNVGVMDNCKV 82
INTE DYP + C V
Sbjct: 183 INTEEDYPYTAMDGQCDV 200
>gi|413922306|gb|AFW62238.1| hypothetical protein ZEAMMB73_802227 [Zea mays]
Length = 490
Score = 87.4 bits (215), Expect = 1e-15, Method: Composition-based stats.
Identities = 38/81 (46%), Positives = 56/81 (69%), Gaps = 7/81 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD NQG C GG ++ +++++I
Sbjct: 360 GSCWAFSTIAAVEGINQIVTGDLISLSKQELVDCDTSYNQG----CNGGLMDYVFEFIIN 415
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N GI+TE+DYP G C V
Sbjct: 416 NGGIDTEKDYPYKGTDGRCDV 436
>gi|414879924|tpg|DAA57055.1| TPA: hypothetical protein ZEAMMB73_175573 [Zea mays]
Length = 336
Score = 87.0 bits (214), Expect = 1e-15, Method: Composition-based stats.
Identities = 40/88 (45%), Positives = 59/88 (67%), Gaps = 7/88 (7%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQY 58
+P GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD NQG C GG ++ +++
Sbjct: 6 YPSGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEF 61
Query: 59 VIQNRGINTERDYPNVGVMDNCKVFQFN 86
+I N GI+TE+DYP G C V + N
Sbjct: 62 IINNGGIDTEKDYPYKGTDGRCDVNRKN 89
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 87.0 bits (214), Expect = 1e-15, Method: Composition-based stats.
Identities = 38/82 (46%), Positives = 56/82 (68%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI+KIVT L+ +S Q+LVDCD G + C GG ++ ++++I N G
Sbjct: 189 GSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDT-GYNEGCNGGLMDYAFEFIINNGG 247
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP GV C ++ N
Sbjct: 248 IDSEEDYPYRGVDGRCDTYRKN 269
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats.
Identities = 36/78 (46%), Positives = 50/78 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + A EGI KI T LV +S Q++VDCD +G C GG+++ ++++IQN G
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAFKFIIQNHG 206
Query: 65 INTERDYPNVGVMDNCKV 82
INTE YP GV C +
Sbjct: 207 INTEASYPYKGVDGKCNI 224
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 86.7 bits (213), Expect = 2e-15, Method: Composition-based stats.
Identities = 35/68 (51%), Positives = 49/68 (72%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI+++ T LV +S Q+LVDCD QGE + C GG +E ++++I+N G
Sbjct: 146 GSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGEDQGCEGGLMEDGFEFIIKNHG 205
Query: 65 INTERDYP 72
I TE +YP
Sbjct: 206 ITTEANYP 213
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 34/80 (42%), Positives = 54/80 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+++ T+ L+ +S Q+LVDCD +GE + C GG ++ ++++ QN+G
Sbjct: 145 GSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQG 204
Query: 65 INTERDYPNVGVMDNCKVFQ 84
+ TE +YP G C Q
Sbjct: 205 LTTEANYPYEGSDGTCNTKQ 224
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 37/82 (45%), Positives = 57/82 (69%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI+KIVT L+ +S Q+LVDCD G ++ C GG ++ ++++I N G
Sbjct: 169 GSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDT-GYNQGCNGGLMDYAFEFIINNGG 227
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+++ DYP GV C ++ N
Sbjct: 228 IDSDEDYPYRGVDGRCDTYRKN 249
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 34/80 (42%), Positives = 54/80 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+++ T+ L+ +S Q+LVDCD +GE + C GG ++ ++++ QN+G
Sbjct: 145 GSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQG 204
Query: 65 INTERDYPNVGVMDNCKVFQ 84
+ TE +YP G C Q
Sbjct: 205 LTTEANYPYEGSDGTCNTKQ 224
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V + EGI K+ T NLV +S Q+LVDCD GE + C GG ++ ++++IQN G
Sbjct: 145 GCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTNGEDQGCEGGLMDDAFEFIIQNNG 204
Query: 65 INTERDYPNVGVMDNC 80
++TE +YP GV C
Sbjct: 205 LSTEAEYPYQGVDGTC 220
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI+K+ T L+ +S Q+LVDCD +G + C GG ++ +++++QN+G
Sbjct: 146 GCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKG 205
Query: 65 INTERDYPNVGVMDNC 80
+NTE YP GV C
Sbjct: 206 LNTEAKYPYQGVDATC 221
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 85.5 bits (210), Expect = 3e-15, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+++ T L+ +S Q+LVDCD GE + C GG ++ ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 205 LTTEANYPYAGTDGTC 220
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+++ T L+ +S Q+LVDCD GE + C GG ++ ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 205 LTTEANYPYAGTDGTC 220
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 52/77 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EG+ K+ T LV +S Q+LVDCD G + C+GG+++ ++++I+N G
Sbjct: 150 GSCWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDVHGVDQGCMGGWMDDAFKFIIKNGG 209
Query: 65 INTERDYPNVGVMDNCK 81
+ TE +YP G D CK
Sbjct: 210 LTTEANYPYTGEDDKCK 226
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI+K+ T L+ +S Q+LVDCD +G + C GG ++ +++++QN+G
Sbjct: 146 GCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKG 205
Query: 65 INTERDYPNVGVMDNC 80
+NTE YP GV C
Sbjct: 206 LNTEAKYPYQGVDATC 221
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+++ T L+ +S Q+LVDCD GE + C GG ++ ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 205 LTTEANYPYAGTDGTC 220
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 52/76 (68%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEGI++I T L+ +S Q+LVDCD +GE + C GG +E ++++I+N G
Sbjct: 146 GSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGG 205
Query: 65 INTERDYPNVGVMDNC 80
I +E +YP +C
Sbjct: 206 ITSETNYPYKAADGSC 221
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 85.1 bits (209), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 58/82 (70%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD G + C GG ++ ++++I+N G
Sbjct: 149 GSCWAFSAIGAVEGINKIVTGDLISLSEQELVDCDT-GYNMGCNGGLMDYAFEFIIKNGG 207
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP GV C ++ N
Sbjct: 208 IDSEEDYPYKGVDGRCDEYRKN 229
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 85.1 bits (209), Expect = 4e-15, Method: Composition-based stats.
Identities = 37/82 (45%), Positives = 55/82 (67%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW FA GAIEGI++IVT LV +S Q+L+DCD + + + C GG +E YQ++++N G
Sbjct: 127 GGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKAD-KGCDGGLMENAYQFIVENGG 185
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
++TE DYP +C + + N
Sbjct: 186 LDTETDYPYHASESHCNMKKLN 207
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 85.1 bits (209), Expect = 4e-15, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+++ T L+ +S Q+LVDCD GE + C GG ++ ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 205 LTTEANYPYAGTDGTC 220
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 85.1 bits (209), Expect = 5e-15, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F VV A+EGI+KIVT L+ +S QQLVDCD G+ + C GG ++ +++++ N G
Sbjct: 142 GSCWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGG 201
Query: 65 INTERDYPNVGVMDNC 80
I +E +YP V C
Sbjct: 202 ITSEANYPYEEVQRLC 217
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 85.1 bits (209), Expect = 5e-15, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 52/76 (68%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEGI++I T L+ +S Q+LVDCD +GE + C GG +E ++++I+N G
Sbjct: 146 GSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGG 205
Query: 65 INTERDYPNVGVMDNC 80
I +E +YP +C
Sbjct: 206 ITSETNYPYKAADGSC 221
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 41/76 (53%), Positives = 54/76 (71%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ GAIEGI+KIVT +LV +S Q+LVDCD S C GG ++ YQ+VI+N+G
Sbjct: 135 GGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNS-GCEGGLMDYAYQFVIKNQG 193
Query: 65 INTERDYPNVGVMDNC 80
I++E DYP VG+ C
Sbjct: 194 IDSEADYPYVGMDKPC 209
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 50/78 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + A EGI KI T LV +S Q++VDCD +G C GG+++ ++++IQN G
Sbjct: 88 GCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAFKFIIQNHG 147
Query: 65 INTERDYPNVGVMDNCKV 82
INTE YP GV C +
Sbjct: 148 INTEASYPYKGVDGKCNI 165
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 85.1 bits (209), Expect = 5e-15, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T L+ +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204
Query: 65 INTERDYPNVGVMDNCK 81
+ TE +YP D CK
Sbjct: 205 LTTESNYPYAAADDKCK 221
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 85.1 bits (209), Expect = 5e-15, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T L+ +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204
Query: 65 INTERDYPNVGVMDNCK 81
+ TE +YP D CK
Sbjct: 205 LTTESNYPYAAADDKCK 221
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 38/72 (52%), Positives = 55/72 (76%), Gaps = 1/72 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EGI+KIVT L+ +S Q+LVDCDN G ++ C GG ++ ++++++N G
Sbjct: 172 GSCWAFSTVGAVEGINKIVTGELISLSEQELVDCDN-GYNQGCNGGLMDYAFEFIVKNGG 230
Query: 65 INTERDYPNVGV 76
I+TE DYP GV
Sbjct: 231 IDTEDDYPYKGV 242
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI+KI T NLV +S QQL+DCD ++ C GG +ET ++++ N G
Sbjct: 149 GGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSNGG 208
Query: 65 INTERDYPNVGVMDNC 80
+ TE DYP G+ C
Sbjct: 209 LTTETDYPYTGIEGTC 224
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 84.7 bits (208), Expect = 6e-15, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+++ T L+ +S Q+LVDCD GE + C GG ++ ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIKQNHG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 205 LTTEANYPYAGTDGTC 220
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 84.7 bits (208), Expect = 6e-15, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T L+ +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204
Query: 65 INTERDYPNVGVMDNCK 81
+ TE +YP D CK
Sbjct: 205 LTTESNYPYAAADDKCK 221
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 84.7 bits (208), Expect = 6e-15, Method: Composition-based stats.
Identities = 35/68 (51%), Positives = 49/68 (72%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI+++ T LV +S Q+LVDCD QGE + C GG +E ++++I+N G
Sbjct: 146 GSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGEDQGCEGGLMEDGFEFIIKNHG 205
Query: 65 INTERDYP 72
I TE +YP
Sbjct: 206 ITTEANYP 213
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 84.7 bits (208), Expect = 7e-15, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T L+ +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204
Query: 65 INTERDYPNVGVMDNCK 81
+ TE +YP D CK
Sbjct: 205 LTTESNYPYAAADDKCK 221
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 84.3 bits (207), Expect = 7e-15, Method: Composition-based stats.
Identities = 39/82 (47%), Positives = 53/82 (64%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI++IVT NLV +S Q+++DCD Q C GG ++ +Q+VI N G
Sbjct: 164 GGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQ--DGGCNGGEMQNAFQFVINNGG 221
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 222 IDTEADYPYLGTDAACDANRVN 243
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 84.3 bits (207), Expect = 8e-15, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 55/82 (67%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW FA GAIEGI++IVT L+ +S Q+L+DCD + + + C GG +E YQ++++N G
Sbjct: 127 GGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKAD-KGCDGGLMENAYQFIVENGG 185
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
++TE DYP +C + + N
Sbjct: 186 LDTETDYPYHASESHCNMKKLN 207
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 84.3 bits (207), Expect = 8e-15, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+++ T L+ +S Q+LVDCD GE + C GG ++ ++++ QN G
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIEQNHG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 205 LATEANYPYAGTDGTC 220
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 84.3 bits (207), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T NLV +S Q+LVDCD G + C GG ++ ++++IQN G
Sbjct: 146 GCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQGGLMDDAFKFIIQNGG 205
Query: 65 INTERDYPNVGVMDNC 80
+NTE YP GV C
Sbjct: 206 LNTEAQYPYQGVDGTC 221
>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
Length = 1039
Score = 84.3 bits (207), Expect = 8e-15, Method: Composition-based stats.
Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 713 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 768
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI+TE+DYP G C V + N
Sbjct: 769 NGGIDTEKDYPYKGTDGRCDVNRKN 793
>gi|147836416|emb|CAN75313.1| hypothetical protein VITISV_033592 [Vitis vinifera]
Length = 201
Score = 84.3 bits (207), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+++ T L+ +S Q+LVDCD GE + C GG ++ ++++ QN G
Sbjct: 69 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIEQNHG 128
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 129 LTTEANYPYAGTDGTC 144
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 84.3 bits (207), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T NLV +S Q+LVDCD G + C GG ++ ++++IQN G
Sbjct: 146 GCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQGGLMDDAFKFIIQNGG 205
Query: 65 INTERDYPNVGVMDNC 80
+NTE YP GV C
Sbjct: 206 LNTEAQYPYQGVDGTC 221
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 84.3 bits (207), Expect = 9e-15, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI + + L+ +S Q+LVDCD +G + C GG ++ +++VIQN G
Sbjct: 694 GCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHG 753
Query: 65 INTERDYPNVGVMDNC 80
+NTE +YP GV C
Sbjct: 754 LNTEANYPYKGVDGKC 769
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI+KI T NLV +S QQL+DCD ++ C GG +ET ++++ N G
Sbjct: 149 GGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGG 208
Query: 65 INTERDYPNVGVMDNC 80
+ TE DYP G+ C
Sbjct: 209 LATETDYPYTGIEGTC 224
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 54/76 (71%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT L+ +S Q+LVDCD + + C GG ++ +Q++I N G
Sbjct: 139 GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCD-RFYNAGCNGGLMDYAFQFIINNGG 197
Query: 65 INTERDYPNVGVMDNC 80
++TE+DYP +G D C
Sbjct: 198 LDTEKDYPYLGNDDTC 213
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats.
Identities = 38/82 (46%), Positives = 56/82 (68%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT +L+ +S Q+LVDCDN G ++ C GG ++ ++++I N G
Sbjct: 163 GSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDN-GYNQGCNGGLMDYGFEFIINNGG 221
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP C ++ N
Sbjct: 222 IDTEEDYPYTARDGKCDQYRKN 243
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 50/77 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T+NLV +S Q+LVDCD C GG++++ +++VI+N G
Sbjct: 149 GCCWAFSAVAAMEGIVKLSTDNLVSLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 208
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP V CK
Sbjct: 209 LATESSYPYKAVDGKCK 225
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+K+ T L+ +S Q+LVDCD G + C GG ++ ++++I+N G
Sbjct: 148 GCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGMDQGCEGGLMDDAFEFIIENNG 207
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP GV +C
Sbjct: 208 LTTEANYPYEGVDGSC 223
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 57/82 (69%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCDN G+++ C GG ++ ++++I N G
Sbjct: 163 GSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDN-GQNQGCNGGLMDYAFEFIINNGG 221
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP C ++ N
Sbjct: 222 IDTEEDYPYKARDGKCDQYRKN 243
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI +I T LV +S Q+LVDCD +G + C GG++E ++++I+N G
Sbjct: 144 GSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGG 203
Query: 65 INTERDYPNVGVMDNC 80
I +E +YP V C
Sbjct: 204 ITSEANYPYKAVDGKC 219
>gi|213514640|ref|NP_001134963.1| Cathepsin S precursor [Salmo salar]
gi|209155506|gb|ACI33985.1| Cathepsin S precursor [Salmo salar]
gi|209737594|gb|ACI69666.1| Cathepsin S precursor [Salmo salar]
gi|223647278|gb|ACN10397.1| Cathepsin S precursor [Salmo salar]
gi|223673157|gb|ACN12760.1| Cathepsin S precursor [Salmo salar]
Length = 330
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG T L+DIS+Q LVDC ++ ++ C GGF+ +QYVI N+G
Sbjct: 137 GSCWAFSAVGALEGQLMKTTGKLIDISSQNLVDCSSKYGNKGCNGGFMSQAFQYVIDNQG 196
Query: 65 INTERDYPNVGVMDNC 80
I++++ YP GV C
Sbjct: 197 IDSDQSYPYKGVQQQC 212
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI++I T LV +S Q+LVDCD +GE + C GG +E ++++I+N G
Sbjct: 146 GSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGG 205
Query: 65 INTERDYPNVGVMDNC 80
I +E +YP +C
Sbjct: 206 ITSETNYPYKAADGSC 221
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 83.6 bits (205), Expect = 2e-14, Method: Composition-based stats.
Identities = 35/82 (42%), Positives = 54/82 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I++IVT +V +S Q+LV+CD G+S C GG ++ ++++I+N G
Sbjct: 169 GSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGG 228
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP + C V + N
Sbjct: 229 IDTEDDYPYKAIDGRCDVLRKN 250
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 83.6 bits (205), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T LV +S Q+LVDCD +G + C GG ++ ++++IQN G
Sbjct: 149 GCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 208
Query: 65 INTERDYPNVGVMDNCKV 82
+NTE YP GV C
Sbjct: 209 LNTEAQYPYQGVDGTCSA 226
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 83.6 bits (205), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 55/85 (64%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ A+EGI+KIVT L+ +S Q+LVDCD NQG C GG ++ +Q++++
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQG----CNGGLMDYAFQFIMK 222
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N G+NTE+DYP G C F N
Sbjct: 223 NGGLNTEKDYPYRGFGGKCNSFLKN 247
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 83.6 bits (205), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 55/85 (64%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ A+EGI+KIVT L+ +S Q+LVDCD NQG C GG ++ +Q++++
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQG----CNGGLMDYAFQFIMK 222
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N G+NTE+DYP G C F N
Sbjct: 223 NGGLNTEKDYPYRGFGGKCNSFLKN 247
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 55/85 (64%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ A+EGI+KIVT L+ +S Q+LVDCD NQG C GG ++ +Q++++
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQG----CNGGLMDYAFQFIMK 222
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N G+NTE+DYP G C F N
Sbjct: 223 NGGLNTEKDYPYRGFGGKCNSFLKN 247
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI + + L+ +S Q++VDCD +GE + C GGF++ ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHG 206
Query: 65 INTERDYPNVGVMDNC 80
+NTE +YP V C
Sbjct: 207 LNTEANYPYKAVDGKC 222
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 52/77 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI T NLV +S Q+LVDCD G+++ C GGF+E + ++ G
Sbjct: 151 GSCWAFSAVAAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGG 210
Query: 65 INTERDYPNVGVMDNCK 81
+ TE DYP G +C+
Sbjct: 211 LTTENDYPYKGTDGSCE 227
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 38/68 (55%), Positives = 51/68 (75%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GAIEGI+KIVT +LV +S Q+LVDCD + + C GG ++ YQ+VI+N G
Sbjct: 141 GACWSFSATGAIEGINKIVTGSLVSLSEQELVDCD-RSYNNGCEGGLMDYAYQFVIENNG 199
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 200 IDTEEDYP 207
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T LV +S Q+LVDCD +G + C GG ++ ++++IQN G
Sbjct: 148 GCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 207
Query: 65 INTERDYPNVGVMDNC 80
+NTE YP GV C
Sbjct: 208 LNTEAQYPYQGVDGTC 223
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI T LV +S QQLVDCD G+ C GG ++ ++Y+I G
Sbjct: 157 GCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGG 216
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP G +C+
Sbjct: 217 LTTESSYPYRGTDGSCR 233
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI T LV +S QQLVDCD G+ C GG ++ ++Y+I G
Sbjct: 157 GCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGG 216
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP G +C+
Sbjct: 217 LTTESSYPYRGTDGSCR 233
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 54/82 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +E I++IVT +V +S Q+LV+CD G+S C GG ++ ++++I+N G
Sbjct: 166 GSCWAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGG 225
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP + C V + N
Sbjct: 226 IDTEDDYPYKAIDGRCDVLRKN 247
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T LV +S Q+LVDCD +G + C GG ++ ++++IQN G
Sbjct: 149 GCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 208
Query: 65 INTERDYPNVGVMDNCKV 82
+NTE YP GV C
Sbjct: 209 LNTEAQYPYQGVDGTCSA 226
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T L+ +S Q+LVDCD +G + C GG ++ ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 206
Query: 65 INTERDYPNVGVMDNC 80
+NTE +YP GV C
Sbjct: 207 LNTEANYPYQGVDGTC 222
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI + + L+ +S Q++VDCD +GE + C GGF++ ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHG 206
Query: 65 INTERDYPNVGVMDNC 80
+NTE +YP V C
Sbjct: 207 LNTEANYPYKAVDGKC 222
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 54/82 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I++IVT +V +S Q+LV+CD G+S C GG ++ ++++I+N G
Sbjct: 170 GSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGG 229
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C V + N
Sbjct: 230 IDTEDDYPYKAVDGRCDVLRKN 251
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 54/82 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I++IVT +V +S Q+LV+CD G+S C GG ++ ++++I+N G
Sbjct: 170 GSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGG 229
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C V + N
Sbjct: 230 IDTEDDYPYKAVDGRCDVLRKN 251
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 57/82 (69%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI++IVT +L+ +S Q+LVDCD + C GG ++ ++++I+N G
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 217
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+T++DYP GV C + N
Sbjct: 218 IDTDKDYPYKGVDGTCDQIRKN 239
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 57/82 (69%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI++IVT +L+ +S Q+LVDCD + C GG ++ ++++I+N G
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 217
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+T++DYP GV C + N
Sbjct: 218 IDTDKDYPYKGVDGTCDQIRKN 239
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 35/71 (49%), Positives = 51/71 (71%), Gaps = 1/71 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT L+ +S Q+LVDCD + C GG ++ +Q++I N G
Sbjct: 115 GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRTYNA-GCNGGLMDYAFQFIINNGG 173
Query: 65 INTERDYPNVG 75
++TE+DYP VG
Sbjct: 174 LDTEKDYPYVG 184
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 57/82 (69%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI++IVT +L+ +S Q+LVDCD + C GG ++ ++++I+N G
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 217
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+T++DYP GV C + N
Sbjct: 218 IDTDKDYPYKGVDGTCDQIRKN 239
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+K+ T L+ +S Q++VDCD +GE + C GG ++ ++++ QN+G
Sbjct: 145 GCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 205 LTTEANYPYKGTDGTC 220
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 49/78 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + A EGI KI T LV +S Q+LVDCD G + C GG ++ ++++IQN G
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNG 206
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE YP GV CK
Sbjct: 207 ISTEAGYPYQGVDGTCKA 224
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/76 (51%), Positives = 52/76 (68%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GAIEGI+KIVT +LV +S Q+L+DCD S C GG ++ YQ+VI N+G
Sbjct: 145 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNS-GCGGGLMDFAYQFVIDNKG 203
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP +C
Sbjct: 204 IDTEDDYPYQARQRSC 219
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ A+EGI+KIVT L+ +S Q+LVDCDN ++ C GG ++ +Q++++N G
Sbjct: 167 GSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDNS-YNQGCNGGLMDYAFQFIMKNGG 225
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
+ TE+DYP G C F N
Sbjct: 226 LKTEKDYPYRGFGGKCNSFLKN 247
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 49/78 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + A EGI KI T LV +S Q+LVDCD G + C GG ++ ++++IQN G
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNG 206
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE YP GV CK
Sbjct: 207 ISTEAGYPYQGVDGTCKA 224
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+K+ T NL+ +S Q+LVDCD +G + C GG ++ + ++I N+G
Sbjct: 143 GCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSFIINNKG 202
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G +C
Sbjct: 203 LTTESNYPYQGTDGSC 218
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI+ IVT NLV +S Q+++DCD Q C GG +E +Q+VI N G
Sbjct: 185 GGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQ--DSGCNGGQMENAFQFVIDNGG 242
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP + C + N
Sbjct: 243 IDSEADYPFIATDGTCDANKAN 264
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 38/85 (44%), Positives = 56/85 (65%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 157 GSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 212
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI+TE DYP G C V + N
Sbjct: 213 NGGIDTEEDYPYKGTDGRCDVNRKN 237
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A EGI+K+ T L+ +S Q+LVDCD GE + C GG++E ++++++N+G
Sbjct: 165 GSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKTGEDQGCEGGYMEDGFEFIVKNKG 224
Query: 65 INTERDYPNVGVMDNC 80
I E YP C
Sbjct: 225 IALEASYPYTAADGTC 240
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 54/82 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I++IVT +V +S Q+LV+CD G+S C GG ++ ++++I+N G
Sbjct: 170 GSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGG 229
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C V + N
Sbjct: 230 IDTEDDYPYKAVDGRCDVLRKN 251
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 56/82 (68%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD + C GG ++ ++++I+N G
Sbjct: 148 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 206
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP GV C + N
Sbjct: 207 IDTEEDYPYKGVDGRCDQTRKN 228
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 35/78 (44%), Positives = 52/78 (66%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EGI++IVT LV +S Q+LVDC G++ C GG ++ + +++ N G
Sbjct: 163 GSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVGNGG 222
Query: 65 INTERDYPNVGVMDNCKV 82
I+T++DYP C V
Sbjct: 223 IDTDKDYPYTARDGKCDV 240
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 82.8 bits (203), Expect = 3e-14, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI+++ T L+ +S Q+LVDCD GE + C GG ++ + ++IQN+G
Sbjct: 112 GCCWAFSAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKG 171
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 172 LTTEANYPYQGADGAC 187
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 82.8 bits (203), Expect = 3e-14, Method: Composition-based stats.
Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI+ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 213 GGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 270
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 271 IDTEADYPFIGTDGTCDASKEN 292
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T NL+ +S Q+LVDCD C GG++++ +++VI+N G
Sbjct: 143 GCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 202
Query: 65 INTERDYPNVGVMDNCK 81
+ TE +YP V CK
Sbjct: 203 LATESNYPYKAVDGKCK 219
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+++ T L+ +S Q++VDCD +GE + C GG ++ ++++ QN+G
Sbjct: 145 GCCWAFSAVAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 205 LTTEANYPYTGTDGTC 220
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 50/78 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI + + L+ +S Q+LVDCD +G + C GG ++ +++VIQN G
Sbjct: 147 GCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHG 206
Query: 65 INTERDYPNVGVMDNCKV 82
+NTE +YP GV C V
Sbjct: 207 LNTEANYPYKGVDGKCNV 224
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI +I T NLV + Q+LV CD +G + C GG++E ++++I+N G
Sbjct: 142 GSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGVDQGCEGGYMEDGFEFIIKNGG 201
Query: 65 INTERDYPNVGVMDNC 80
I T+ +YP GV C
Sbjct: 202 ITTKANYPYKGVNGTC 217
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 57/85 (67%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD NQG C GG ++ Y+++I
Sbjct: 115 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAYEFIIN 170
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI++E DYP V C ++ N
Sbjct: 171 NGGIDSEEDYPYRAVDGTCDQYRKN 195
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 56/76 (73%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EG+++IVT LV +S Q+LVDCD Q +++ C GG +++ ++++IQN G
Sbjct: 140 GSCWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQ-KNQGCNGGLMDSAFEFIIQNGG 198
Query: 65 INTERDYPNVGVMDNC 80
+++E DYP V +C
Sbjct: 199 LDSEADYPYKAVSGSC 214
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 36/68 (52%), Positives = 48/68 (70%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA G++EGI+ IVT L +S Q+LVDCD E R C GG ++ YQ++I+N G
Sbjct: 156 GSCWAFATTGSVEGINAIVTGELASLSEQELVDCDTD-EDRGCSGGLMDYAYQWIIKNGG 214
Query: 65 INTERDYP 72
++TE DYP
Sbjct: 215 LDTEDDYP 222
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA V A EGI+K+ T L+ +S Q+L+DCD G++ C G I+ +++++QN+G
Sbjct: 147 GSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGDNGGCKWGIIQEAFKFIVQNKG 206
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP V C
Sbjct: 207 LATEASYPYQAVDGTC 222
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 56/76 (73%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EG+++IVT LV +S Q+LVDCD Q +++ C GG +++ ++++IQN G
Sbjct: 140 GSCWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQ-KNQGCNGGLMDSAFEFIIQNGG 198
Query: 65 INTERDYPNVGVMDNC 80
+++E DYP V +C
Sbjct: 199 LDSEADYPYKAVSGSC 214
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 50/77 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI K+ T L+ +S QQLVDCD +G + C GG ++ +Q++++N G
Sbjct: 111 GCCWAFSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGG 170
Query: 65 INTERDYPNVGVMDNCK 81
+ +E YP GV CK
Sbjct: 171 LTSEATYPYQGVDGTCK 187
>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 361
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 47/76 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA V +IEG+ +I T LV +S Q++VDCD G C GG+ + ++V +N G
Sbjct: 159 GSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGG 218
Query: 65 INTERDYPNVGVMDNC 80
+ TE DYP VG C
Sbjct: 219 LTTESDYPYVGSQRQC 234
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+K+ T L+ +S Q++VDCD +GE + C GG ++ ++++ QN+G
Sbjct: 111 GCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKG 170
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 171 LTTEANYPYKGTDGTC 186
>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 359
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 47/76 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA V +IEG+ +I T LV +S Q++VDCD G C GG+ + ++V +N G
Sbjct: 159 GSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGG 218
Query: 65 INTERDYPNVGVMDNC 80
+ TE DYP VG C
Sbjct: 219 LTTESDYPYVGSQRQC 234
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 34/72 (47%), Positives = 48/72 (66%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI T LV +S Q+LVDCD +GE + C GG ++T +QY+ + G
Sbjct: 156 GCCWAFSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGG 215
Query: 65 INTERDYPNVGV 76
+ E YP GV
Sbjct: 216 LAAESSYPYRGV 227
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI KI T+ L+ +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 146 GCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 205
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP CK
Sbjct: 206 LTTESSYPYTATDGKCK 222
>gi|432910514|ref|XP_004078393.1| PREDICTED: cathepsin S-like [Oryzias latipes]
Length = 339
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 38/76 (50%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG T LVD+S Q LVDC ++ + C GGF+ +QYVI N+G
Sbjct: 146 GSCWAFSAVGALEGQLCRKTGKLVDLSPQNLVDCSSKYGNHGCNGGFMHQAFQYVIDNQG 205
Query: 65 INTERDYPNVGVMDNC 80
I+++ YP VGV NC
Sbjct: 206 IDSDAGYPYVGVTQNC 221
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+K+ T NL+ +S Q+LVDCD +G + C GG ++ + ++I N+G
Sbjct: 143 GCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFTFIINNKG 202
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G +C
Sbjct: 203 LTTESNYPYQGTDGSC 218
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 155 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 210
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI+TE+DYP G C V + N
Sbjct: 211 NGGIDTEKDYPYKGTDGRCDVNRKN 235
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 157 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 212
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI+TE+DYP G C V + N
Sbjct: 213 NGGIDTEKDYPYKGTDGRCDVNRKN 237
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 152 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 207
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI+TE+DYP G C V + N
Sbjct: 208 NGGIDTEKDYPYKGTDGRCDVNRKN 232
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI + + L+ +S Q+LVDCD +G + C GG ++ +++VIQN G
Sbjct: 165 GCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHG 224
Query: 65 INTERDYPNVGVMDNC 80
+NTE +YP GV C
Sbjct: 225 LNTEANYPYKGVDGKC 240
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 38/85 (44%), Positives = 57/85 (67%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
G+CW F+ + A+EGI++IVT +L+ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 155 GTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 210
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI+TE+DYP G C V + N
Sbjct: 211 NGGIDTEKDYPYKGTDGRCDVNRKN 235
>gi|54300682|gb|AAV32964.1| cathepsin S-like [Oncorhynchus mykiss]
Length = 246
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG T L+DIS+Q LVDC ++ ++ C GGF+ +QYVI N+G
Sbjct: 150 GSCWAFSSVGALEGQLMKTTGKLIDISSQNLVDCSSKYGNKGCNGGFMSQAFQYVIDNQG 209
Query: 65 INTERDYPNVGVMDNC 80
I++++ YP GV C
Sbjct: 210 IDSDQSYPYXGVQQQC 225
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 36/68 (52%), Positives = 52/68 (76%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EGI++IVT NL +S Q+LVDCD +G + C GG ++ +++++QN G
Sbjct: 161 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCD-RGYNMGCNGGLMDYAFEFIVQNGG 219
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 220 IDTEEDYP 227
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 38/82 (46%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEG++K+ T LV +S Q+LVDCD +GE C GG ++ + +VI+N G
Sbjct: 174 GSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCD-KGEDEGCNGGLMDYAFGFVIKNGG 232
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
++TE DYP G C + N
Sbjct: 233 LDTEADYPYKGYGTRCDRSKMN 254
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 56/76 (73%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I T +L+ +S Q+LVDCD +G ++ C GGF++ ++++++N G
Sbjct: 115 GSCWAFSTVAAVEGINQIATGDLISLSEQELVDCD-KGFNQGCNGGFMDYAFEFIVKNGG 173
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP GV C
Sbjct: 174 IDTEDDYPYKGVDGQC 189
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 38/82 (46%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEG++K+ T LV +S Q+LVDCD +GE C GG ++ + +VI+N G
Sbjct: 174 GSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCD-KGEDEGCNGGLMDYAFGFVIKNGG 232
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
++TE DYP G C + N
Sbjct: 233 LDTEADYPYKGYGTRCDRSKMN 254
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 39/79 (49%), Positives = 54/79 (68%), Gaps = 7/79 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ VGA+EGI++IVT NL +S Q+LVDCD NQG C GG ++ ++++++
Sbjct: 160 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQG----CNGGLMDYAFEFIMK 215
Query: 62 NRGINTERDYPNVGVMDNC 80
N GI+TE DYP V C
Sbjct: 216 NGGIDTEEDYPYKAVDSMC 234
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+K+ T L+ +S Q+LVDCD GE + C GG ++ ++++ QN G
Sbjct: 146 GCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGG 205
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 206 LTTEANYPYQGTDGTC 221
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD + ++ C GG +E ++++ +N G
Sbjct: 150 GSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTK-QNEGCNGGLMEIAFEFIKKNGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE YP G+ C + N
Sbjct: 209 ITTEDSYPYEGIDGKCDASKDN 230
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 37/71 (52%), Positives = 53/71 (74%), Gaps = 7/71 (9%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD NQG C GG ++ ++++I+
Sbjct: 160 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIK 215
Query: 62 NRGINTERDYP 72
N GI+TE DYP
Sbjct: 216 NGGIDTEADYP 226
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 49/77 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T NL+ +S Q+LVDCD C GG++++ +++VI+N G
Sbjct: 144 GCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 203
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP V CK
Sbjct: 204 LATESSYPYKAVDGKCK 220
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+K+ T L+ +S Q+LVDCD GE + C GG ++ ++++ QN G
Sbjct: 146 GCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGG 205
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 206 LTTEANYPYQGTDGTC 221
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 53/78 (67%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT L+ +S Q+LVDCD + + C GG ++ + ++I+N G
Sbjct: 157 GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYD-MGCNGGLMDYAFDFIIKNGG 215
Query: 65 INTERDYPNVGVMDNCKV 82
++TE+DYP G C +
Sbjct: 216 LDTEKDYPYTGFDGECNL 233
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI+K+ T LV +S Q+LVDCD +G + C GG ++ ++++IQN G
Sbjct: 149 GCCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 208
Query: 65 INTERDYPNVGVMDNC 80
++TE YP GV C
Sbjct: 209 LSTEAAYPYQGVDGTC 224
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 82.0 bits (201), Expect = 5e-14, Method: Composition-based stats.
Identities = 37/71 (52%), Positives = 53/71 (74%), Gaps = 7/71 (9%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD NQG C GG ++ ++++I+
Sbjct: 160 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIK 215
Query: 62 NRGINTERDYP 72
N GI+TE DYP
Sbjct: 216 NGGIDTEADYP 226
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 82.0 bits (201), Expect = 5e-14, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI KI T L+ +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 146 GCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 205
Query: 65 INTERDYPNVGVMDNCK 81
+ TE +YP CK
Sbjct: 206 LTTESNYPYTAADGKCK 222
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 82.0 bits (201), Expect = 5e-14, Method: Composition-based stats.
Identities = 37/71 (52%), Positives = 53/71 (74%), Gaps = 7/71 (9%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD NQG C GG ++ ++++I+
Sbjct: 159 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIK 214
Query: 62 NRGINTERDYP 72
N GI+TE DYP
Sbjct: 215 NGGIDTEEDYP 225
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 82.0 bits (201), Expect = 5e-14, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 53/77 (68%), Gaps = 2/77 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEGI++I LV +S Q+LVDCD + + C GG++ +++V++NRG
Sbjct: 173 GSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEFVMKNRG 230
Query: 65 INTERDYPNVGVMDNCK 81
+ TER+YP G+ C+
Sbjct: 231 LTTERNYPYQGLNGACQ 247
>gi|118140100|gb|ABK63481.1| cathepsin S [Channa argus]
Length = 335
Score = 82.0 bits (201), Expect = 5e-14, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG T LVD+S Q LVDC + + C GGF+ +QYVI+N+G
Sbjct: 142 GSCWAFSAVGALEGQLAKTTGKLVDLSPQNLVDCSGKYGNHGCDGGFMTNAFQYVIENQG 201
Query: 65 INTERDYPNVGVMDNC 80
I +E YP +G+ C
Sbjct: 202 IESEASYPYIGLEQQC 217
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 81.6 bits (200), Expect = 5e-14, Method: Composition-based stats.
Identities = 40/83 (48%), Positives = 53/83 (63%), Gaps = 3/83 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD NQ E C GG +E ++++ +N
Sbjct: 150 GSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTNQNE--GCNGGLMEIAFEFIKKNG 207
Query: 64 GINTERDYPNVGVMDNCKVFQFN 86
GI TE YP G+ C + N
Sbjct: 208 GITTEDSYPYEGIDGKCDASKDN 230
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 53/76 (69%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ A+EGI+KIVT LV +S Q+LVDCD + ++ C GG ++ +Q++++N G
Sbjct: 122 GSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGG 180
Query: 65 INTERDYPNVGVMDNC 80
+NTE+DYP G C
Sbjct: 181 LNTEKDYPYHGTNGKC 196
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 53/76 (69%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ A+EGI+KIVT LV +S Q+LVDCD + ++ C GG ++ +Q++++N G
Sbjct: 122 GSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGG 180
Query: 65 INTERDYPNVGVMDNC 80
+NTE+DYP G C
Sbjct: 181 LNTEKDYPYHGTNGKC 196
>gi|357439999|ref|XP_003590277.1| Cysteine protease [Medicago truncatula]
gi|355479325|gb|AES60528.1| Cysteine protease [Medicago truncatula]
Length = 514
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 38/79 (48%), Positives = 55/79 (69%), Gaps = 2/79 (2%)
Query: 4 LGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNR 63
+GSCW F+ GAIEG++ IVT +L+ +S Q+LVDCD + C GG+++ +++VI N
Sbjct: 205 VGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTTND--GCEGGYMDYAFEWVINNG 262
Query: 64 GINTERDYPNVGVMDNCKV 82
GI+TE DYP +GV C V
Sbjct: 263 GIDTEADYPYIGVGGTCNV 281
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 81.6 bits (200), Expect = 5e-14, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V ++EGI K+ T L+ +S Q+LVDCD G + C GG ++ ++++I N G
Sbjct: 219 GCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDGMDQGCEGGLMDNAFEFIIDNGG 278
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G D+C
Sbjct: 279 LTTEGNYPYTGTDDSC 294
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 81.6 bits (200), Expect = 5e-14, Method: Composition-based stats.
Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI +I T LV +S Q+LVDC +GES C+GG+++ ++++ + G
Sbjct: 146 GSCWAFSAVAATEGIHQITTGKLVPLSEQELVDC-VKGESEGCIGGYVDDAFEFIAKKGG 204
Query: 65 INTERDYPNVGVMDNCKV 82
I +E YP GV CKV
Sbjct: 205 IASETHYPYKGVNKTCKV 222
>gi|413953048|gb|AFW85697.1| hypothetical protein ZEAMMB73_051316 [Zea mays]
Length = 298
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 47/76 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA V +IEG+ +I T LV +S QQ+VDCD G C GG+ + ++V +N G
Sbjct: 98 GSCWAFATVASIEGVHQIKTGRLVSLSEQQIVDCDRGGNDHGCHGGYPRSAMEWVTRNGG 157
Query: 65 INTERDYPNVGVMDNC 80
+ TE DYP VG C
Sbjct: 158 LTTESDYPYVGSQRQC 173
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 49/78 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T LV +S Q+LVDCD +G + C GG ++ ++++IQN G
Sbjct: 149 GCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 208
Query: 65 INTERDYPNVGVMDNCKV 82
++TE YP GV C
Sbjct: 209 LHTEAQYPYQGVDGTCSA 226
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 81.6 bits (200), Expect = 6e-14, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A EGI +I T LV +S Q+LVDCD +G + C GG++E ++++I+N G
Sbjct: 143 GSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGG 202
Query: 65 INTERDYPNVGVMDNC 80
I +E +YP V C
Sbjct: 203 ITSETNYPYKAVDGKC 218
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 38/78 (48%), Positives = 54/78 (69%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEG++ IVT +L+ +S Q+LVDCD + C GG+++ +++VI N G
Sbjct: 146 GSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTTND--GCEGGYMDYAFEWVINNGG 203
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +GV C V
Sbjct: 204 IDTEADYPYIGVGGTCNV 221
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 56/82 (68%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIV+ L+ +S Q+LVDCD ++ C GG ++ +Q++I N G
Sbjct: 158 GSCWAFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDA-GCNGGLMDYAFQFIIDNGG 216
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE+DYP +G + C + N
Sbjct: 217 IDTEKDYPYLGFNNQCDPTKKN 238
>gi|413947586|gb|AFW80235.1| hypothetical protein ZEAMMB73_542371 [Zea mays]
Length = 264
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 49/77 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T NLV +S Q+LVDCD C GG++++ +++VI+N G
Sbjct: 149 GCCWAFSAVAAVEGIVKLSTGNLVSLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 208
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP V CK
Sbjct: 209 LATESSYPYKAVDGKCK 225
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 81.6 bits (200), Expect = 6e-14, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++++ T L+ +S Q+LVDCD GE + C GG +++ ++++I N G
Sbjct: 124 GCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGG 183
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP GV C
Sbjct: 184 LTTEANYPYKGVDATC 199
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 81.6 bits (200), Expect = 6e-14, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++++ T L+ +S Q+LVDCD GE + C GG +++ ++++I N G
Sbjct: 144 GCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGG 203
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP GV C
Sbjct: 204 LTTEANYPYKGVDATC 219
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 154 GSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 212
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP GV C
Sbjct: 213 IDTEEDYPYKGVDGRC 228
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 81.6 bits (200), Expect = 6e-14, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 53/77 (68%), Gaps = 2/77 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEGI++I LV +S Q+LVDCD + + C GG++ +++V++NRG
Sbjct: 152 GSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEFVMKNRG 209
Query: 65 INTERDYPNVGVMDNCK 81
+ TER+YP G+ C+
Sbjct: 210 LTTERNYPYQGLNGACQ 226
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 81.3 bits (199), Expect = 6e-14, Method: Composition-based stats.
Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI +I T LV +S Q+LVDC +GES C+GG+++ ++++ + G
Sbjct: 146 GSCWAFSAVAATEGIHQITTGKLVPLSEQELVDC-VKGESEGCIGGYVDDAFEFIAKKGG 204
Query: 65 INTERDYPNVGVMDNCKV 82
I +E YP GV CKV
Sbjct: 205 IASETHYPYKGVNKTCKV 222
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 81.3 bits (199), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 55/82 (67%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 148 GSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 206
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP GV C + N
Sbjct: 207 IDTEEDYPYKGVDGRCDQTRKN 228
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 81.3 bits (199), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 39/76 (51%), Positives = 53/76 (69%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GAIEGI+KIVT +LV +S Q+LVDCD + + C GG ++ +Q+VI N G
Sbjct: 140 GACWSFSATGAIEGINKIVTGSLVSLSEQELVDCD-KSYNNGCEGGIMDYAFQFVIDNHG 198
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP G +C
Sbjct: 199 IDTEEDYPYQGRDRSC 214
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 81.3 bits (199), Expect = 7e-14, Method: Composition-based stats.
Identities = 35/68 (51%), Positives = 48/68 (70%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD + E++ C GG +E ++++ N G
Sbjct: 148 GSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGG 206
Query: 65 INTERDYP 72
I TE YP
Sbjct: 207 IKTEETYP 214
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 81.3 bits (199), Expect = 7e-14, Method: Composition-based stats.
Identities = 37/81 (45%), Positives = 53/81 (65%), Gaps = 3/81 (3%)
Query: 1 PHPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVI 60
P+P GSCW F+ V +EGI+KIVT L+ +S Q+L+DCD + S C GG+ T QYV+
Sbjct: 152 PNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR--SHGCKGGYQTTSLQYVV 209
Query: 61 QNRGINTERDYPNVGVMDNCK 81
N G++TE++YP C+
Sbjct: 210 DN-GVHTEKEYPYEKKQGKCR 229
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 81.3 bits (199), Expect = 7e-14, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI KI T LV ++ Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 239 GCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 298
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP CK
Sbjct: 299 LTTESSYPYTAADGKCK 315
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 81.3 bits (199), Expect = 7e-14, Method: Composition-based stats.
Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI +I T LV +S Q+LVDC +GES C+GG+++ ++++ + G
Sbjct: 146 GSCWAFSAVAATEGIHQITTGKLVPLSEQELVDC-VKGESEGCIGGYVDDAFEFIAKKGG 204
Query: 65 INTERDYPNVGVMDNCKV 82
I +E YP GV CKV
Sbjct: 205 IASETHYPYKGVNKTCKV 222
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 81.3 bits (199), Expect = 7e-14, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 52/76 (68%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EG++KIVT L+ +S Q+LVDCD + + C GG ++ +Q++I N G
Sbjct: 160 GSCWAFSTIAAVEGVNKIVTGELISLSEQELVDCD-RSYNAGCNGGLMDNAFQFIINNGG 218
Query: 65 INTERDYPNVGVMDNC 80
I+T++DYP V C
Sbjct: 219 IDTDKDYPYQAVDGKC 234
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 81.3 bits (199), Expect = 7e-14, Method: Composition-based stats.
Identities = 35/68 (51%), Positives = 48/68 (70%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD + E++ C GG +E ++++ N G
Sbjct: 147 GSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGG 205
Query: 65 INTERDYP 72
I TE YP
Sbjct: 206 IKTEETYP 213
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 81.3 bits (199), Expect = 7e-14, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 55/82 (67%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 150 GSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C ++ N
Sbjct: 209 IDTEDDYPYLGRDGRCDTYRKN 230
>gi|357518983|ref|XP_003629780.1| Cysteine proteinase [Medicago truncatula]
gi|355523802|gb|AET04256.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 81.3 bits (199), Expect = 8e-14, Method: Composition-based stats.
Identities = 39/76 (51%), Positives = 52/76 (68%), Gaps = 2/76 (2%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
S W F+V GAIEG++KIVT NL+++S Q+LVDCD S+ C GGF + YVI+N GI
Sbjct: 164 SHWAFSVTGAIEGLNKIVTGNLINLSAQELVDCDPA--SKGCAGGFYFNAFGYVIENGGI 221
Query: 66 NTERDYPNVGVMDNCK 81
+TE +YP + CK
Sbjct: 222 DTEANYPYLAKNGTCK 237
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 81.3 bits (199), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T L+ +S Q+LVDCD +G + C GG ++ ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 206
Query: 65 INTERDYPNVGVMDNC 80
++TE YP GV C
Sbjct: 207 LSTEAQYPYEGVDGTC 222
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 81.3 bits (199), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 57/82 (69%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI++IVT +L+ +S Q+LVDCD + C GG ++ ++++I+N G
Sbjct: 152 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 210
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+T++DYP GV C + N
Sbjct: 211 IDTDKDYPYKGVDGTCDQIRKN 232
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 80.9 bits (198), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T L+ +S Q+LVDCD +G + C GG ++ ++++IQN G
Sbjct: 148 GCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 207
Query: 65 INTERDYPNVGVMDNC 80
++TE YP GV C
Sbjct: 208 LSTEAQYPYEGVDGTC 223
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 80.9 bits (198), Expect = 8e-14, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 49/76 (64%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 179 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 236
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP +G C
Sbjct: 237 IDTEADYPFIGTDGTC 252
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 80.9 bits (198), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI ++ T LV +S Q+L+DCD +G + C GG ++ ++++IQN G
Sbjct: 96 GSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHG 155
Query: 65 INTERDYPNVGVMDNC 80
++TE YP GV C
Sbjct: 156 LSTEVQYPYEGVDGTC 171
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 80.9 bits (198), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 50/76 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI ++ T LV +S Q+L+DCD +G + C GG ++ ++++IQN G
Sbjct: 76 GSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHG 135
Query: 65 INTERDYPNVGVMDNC 80
++TE YP GV C
Sbjct: 136 LSTEVQYPYEGVDGTC 151
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 80.9 bits (198), Expect = 9e-14, Method: Composition-based stats.
Identities = 36/80 (45%), Positives = 49/80 (61%), Gaps = 1/80 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW FA V A+EGI +I T NLV +S QQ++DCD G + C GG+I+ +QY++ N G
Sbjct: 165 GCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNN-GCNGGYIDNAFQYIVGNGG 223
Query: 65 INTERDYPNVGVMDNCKVFQ 84
+ TE YP C+ Q
Sbjct: 224 LGTEDAYPYTAAQAMCQSVQ 243
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 80.9 bits (198), Expect = 9e-14, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI + L+ +S Q++VDCD +GE + C GGF++ ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHG 206
Query: 65 INTERDYPNVGVMDNC 80
+N E +YP V C
Sbjct: 207 LNNEPNYPYKAVDGKC 222
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 80.9 bits (198), Expect = 9e-14, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW FA V A+EG++KI LV +S QQLVDC + C GG T Y Y+ +N+G
Sbjct: 127 GCCWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQG 186
Query: 65 INTERDYPNVGVMDNCK 81
I +E +YP V CK
Sbjct: 187 ITSEENYPYQAVQQTCK 203
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 80.9 bits (198), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 55/82 (67%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 159 GSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 217
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C ++ N
Sbjct: 218 IDTEDDYPYLGRDGRCDTYRKN 239
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 80.9 bits (198), Expect = 9e-14, Method: Composition-based stats.
Identities = 37/82 (45%), Positives = 55/82 (67%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VG++EGI+ I T + + +S Q+LVDCD + ++ C GG ++ + +VIQN G
Sbjct: 155 GSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKK-YNQGCNGGLMDYAFDFVIQNGG 213
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE+DYP G C V + N
Sbjct: 214 IDTEKDYPYQGYDGRCDVNKMN 235
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 80.9 bits (198), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 36/71 (50%), Positives = 49/71 (69%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA + A+EGI+KIVT NL+ +S Q++VDC + + C GG + YQ++I N G
Sbjct: 155 GSCWTFASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGG 214
Query: 65 INTERDYPNVG 75
INTE +YP G
Sbjct: 215 INTEANYPYTG 225
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 80.9 bits (198), Expect = 1e-13, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI + L+ +S Q++VDCD +GE + C GGF++ ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHG 206
Query: 65 INTERDYPNVGVMDNC 80
+N E +YP V C
Sbjct: 207 LNNEPNYPYKAVDGKC 222
>gi|413956349|gb|AFW88998.1| hypothetical protein ZEAMMB73_678859 [Zea mays]
Length = 1140
Score = 80.9 bits (198), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 780 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 835
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI+TE+DYP G C V + N
Sbjct: 836 NGGIDTEKDYPYKGTDGRCDVNRKN 860
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 51/78 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT NL+ +S Q+LVDC ++ C GG++ ++++I N G
Sbjct: 145 GSCWAFSAIAAVEGINKIVTGNLISLSEQELVDCGRTQSTKGCDGGYMTDGFEFIINNGG 204
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C +
Sbjct: 205 INTEENYPYTAQEGQCDL 222
>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
Length = 396
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 30/77 (38%), Positives = 52/77 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI+ I T LV +S Q+LV C +G ++ C GG ++ ++++++N G
Sbjct: 187 GSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFEWIVENGG 246
Query: 65 INTERDYPNVGVMDNCK 81
+++E+ Y D+CK
Sbjct: 247 VDSEKQYQYKASFDDCK 263
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+ IVT V +S Q+LVDCD + + C GG ++ +Q++IQN G
Sbjct: 147 GSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYD-EGCNGGLMDYAFQFIIQNGG 205
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP G+ C
Sbjct: 206 IDTEEDYPYQGIDGTC 221
>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
Length = 299
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 167 GSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGG 225
Query: 65 INTERDYPNVGVMDNC 80
I++E DYP V C
Sbjct: 226 IDSEDDYPYKAVDGRC 241
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG K+ T LV +S QQLV CD +GE + C GG ++ + ++I+N G
Sbjct: 151 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 210
Query: 65 INTERDYPNVGVMDNC 80
+ E DYP D C
Sbjct: 211 LAAESDYPYTASDDKC 226
>gi|413945959|gb|AFW78608.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 289
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 35/68 (51%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI+KI T +LV +S Q+L+DCD S C GG ++ Y++VI+N G
Sbjct: 159 GACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNS-GCGGGLMDYAYKFVIKNGG 217
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 218 IDTEEDYP 225
>gi|157278117|ref|NP_001098157.1| cathepsin S precursor [Oryzias latipes]
gi|50251130|dbj|BAD27582.1| cathepsin S [Oryzias latipes]
Length = 327
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 36/77 (46%), Positives = 49/77 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L +S Q LVDC + + C GGF+ +QYVI+N+G
Sbjct: 134 GSCWAFSAVGALEGQLKKTTGILTSLSPQNLVDCSTKYGNYGCKGGFMSNAFQYVIKNQG 193
Query: 65 INTERDYPNVGVMDNCK 81
I+++ YP +G D CK
Sbjct: 194 ISSDAAYPYIGKRDKCK 210
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+ IVT V +S Q+LVDCD + + C GG ++ +Q++IQN G
Sbjct: 147 GSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYD-EGCNGGLMDYAFQFIIQNGG 205
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP G+ C
Sbjct: 206 IDTEEDYPYQGIDGTC 221
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI + L+ +S Q+LVDCD +G + C GG ++ Y+++IQN G
Sbjct: 243 GCCWAFSAVAATEGIHALSGGKLISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNHG 302
Query: 65 INTERDYPNVGVMDNC 80
+NTE +YP GV C
Sbjct: 303 LNTEANYPYKGVDGKC 318
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 53/76 (69%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD S C GG ++ ++++I N G
Sbjct: 117 GSCWAFSAIGAVEGINKIVTGDLITLSEQELVDCDTSYNS-GCDGGLMDYAFRFIINNGG 175
Query: 65 INTERDYPNVGVMDNC 80
I+T++DYP +C
Sbjct: 176 IDTDKDYPYKATDGSC 191
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+++ T L+ +S Q+LVDCD GE + C GG ++ + ++ QN G
Sbjct: 145 GCCWAFSAVAAMEGITQLKTGKLISLSEQELVDCDTNGEDQGCEGGLMDYAFDFIQQNHG 204
Query: 65 INTERDYPNVGVMDNC 80
++TE +YP G C
Sbjct: 205 LSTETNYPYSGTDGTC 220
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG K+ T LV +S QQLV CD +GE + C GG ++ + ++I+N G
Sbjct: 116 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 175
Query: 65 INTERDYPNVGVMDNC 80
+ E DYP D C
Sbjct: 176 LAAESDYPYTASDDKC 191
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI + L+ +S Q+LVDCD +G + C GG ++ ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 206
Query: 65 INTERDYPNVGVMDNC 80
+NTE +YP GV C
Sbjct: 207 LNTEANYPYKGVDGKC 222
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 38/78 (48%), Positives = 52/78 (66%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEGI +I T+ LV +S Q+LVDC +GES C GG++E +++V + G
Sbjct: 150 GSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDC-VKGESEGCNGGYMEDAFEFVAKKGG 208
Query: 65 INTERDYPNVGVMDNCKV 82
I +E YP G +CKV
Sbjct: 209 IASESYYPYKGKDKSCKV 226
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 30/71 (42%), Positives = 48/71 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EG+ ++ T L+ +S Q+LVDCD +GE C GG ++T + ++++N+G
Sbjct: 153 GCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKG 212
Query: 65 INTERDYPNVG 75
+ TE +YP G
Sbjct: 213 LTTEANYPYKG 223
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI KI T LV ++ Q+LVDCD GE + C GG ++ ++++I N G
Sbjct: 146 GCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIINNGG 205
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP CK
Sbjct: 206 LTTESSYPYTAADGKCK 222
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 38/85 (44%), Positives = 56/85 (65%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 157 GSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 212
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI+TE DYP G C V + N
Sbjct: 213 NGGIDTEEDYPYKGTDGRCDVNRKN 237
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 38/85 (44%), Positives = 56/85 (65%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 157 GSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 212
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI+TE DYP G C V + N
Sbjct: 213 NGGIDTEEDYPYKGTDGRCDVNRKN 237
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 55/82 (67%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD + + C GG ++ +Q++I N G
Sbjct: 163 GSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCD-RSYNEGCNGGLMDYAFQFIINNGG 221
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP + C ++ N
Sbjct: 222 IDSEEDYPYLARDGTCDTYRKN 243
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+K+ T NL+ +S Q+LVDCD +G + C GG ++ + ++I N+G
Sbjct: 145 GCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSFIINNKG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G +C
Sbjct: 205 LTTESNYPYQGTDGSC 220
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI + L+ +S Q++VDCD +GE + C GGF++ ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHG 206
Query: 65 INTERDYPNVGVMDNC 80
+N E +YP V C
Sbjct: 207 LNNEPNYPYKAVDGKC 222
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 80.1 bits (196), Expect = 1e-13, Method: Composition-based stats.
Identities = 31/68 (45%), Positives = 48/68 (70%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI+K+ T L+ +S Q++VDCD + + C GG ++ ++Y+I+N+G
Sbjct: 144 GSCWAFSAVAATEGITKLSTGKLISLSEQEVVDCDVTSDDQGCNGGEMDDAFEYIIKNKG 203
Query: 65 INTERDYP 72
I TE +YP
Sbjct: 204 ITTEANYP 211
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 80.1 bits (196), Expect = 1e-13, Method: Composition-based stats.
Identities = 35/82 (42%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 161 GSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 219
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP + C ++ N
Sbjct: 220 IDTEEDYPYLARDGRCDTYRKN 241
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 80.1 bits (196), Expect = 1e-13, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KI T LV +S Q+L+DC N GE+ C GG ++ +Q++ QN G
Sbjct: 154 GSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDC-NIGENDGCNGGLMDVAFQFIQQNGG 212
Query: 65 INTERDYPNVGVMDNC 80
I TE YP G ++C
Sbjct: 213 ITTEASYPYQGEQNSC 228
>gi|226821425|gb|ACO82388.1| cathepsin S [Lutjanus argentimaculatus]
Length = 337
Score = 80.1 bits (196), Expect = 1e-13, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG T LVD+S Q LVDC + + C GGF++ +QYVI N+G
Sbjct: 144 GSCWAFSAVGALEGQLAKKTGKLVDLSPQNLVDCSTKYGNHGCNGGFMDHAFQYVIDNQG 203
Query: 65 INTERDYPNVGVMDNC 80
I+++ YP G D C
Sbjct: 204 IDSDASYPYTGRSDQC 219
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 51/75 (68%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW FA + +E I++I+T +L+ +S Q+LVDC+ + C GGF++ Y+++I N GI
Sbjct: 149 SCWAFATIATVESINQIITGDLISLSEQELVDCNRTPINEGCKGGFMDDAYEFIINNGGI 208
Query: 66 NTERDYPNVGVMDNC 80
NTE +YP +G D C
Sbjct: 209 NTEENYPYIGQDDQC 223
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 80.1 bits (196), Expect = 1e-13, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG K+ T LV +S QQLV CD +GE + C GG ++ + ++I+N G
Sbjct: 116 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 175
Query: 65 INTERDYPNVGVMDNC 80
+ E DYP D C
Sbjct: 176 LAAESDYPYTASDDKC 191
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 55/82 (67%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT +L+ +S Q+LVDCDN + C GG ++ ++++I N G
Sbjct: 151 GSCWAFSAVAAVEGINKIVTGDLISLSEQELVDCDNS-YNEGCNGGLMDYGFEFIINNGG 209
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP + C ++ N
Sbjct: 210 IDSEEDYPYLARDGRCDTYRKN 231
>gi|310942960|pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 50/78 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KI T +L+ +S Q+LVDC +R C GGF+ +Q++I N G
Sbjct: 23 GSCWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQNTRGCDGGFMTDGFQFIINNGG 82
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C +
Sbjct: 83 INTEANYPYTAEEGQCNL 100
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 80.1 bits (196), Expect = 2e-13, Method: Composition-based stats.
Identities = 36/80 (45%), Positives = 49/80 (61%), Gaps = 1/80 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW FA V A+EGI +I T NLV +S QQ++DCD +G + C GG+I+ +QY+ N G
Sbjct: 165 GCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTEGNN-GCNGGYIDNAFQYIAGNGG 223
Query: 65 INTERDYPNVGVMDNCKVFQ 84
+ TE YP C+ Q
Sbjct: 224 LATEDAYPYTAAQAMCQSVQ 243
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 80.1 bits (196), Expect = 2e-13, Method: Composition-based stats.
Identities = 35/82 (42%), Positives = 53/82 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+++VT LV +S Q+LV+CD G+S C GG ++ + ++I N G
Sbjct: 167 GSCWAFSAVSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINNGG 226
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP + C + + N
Sbjct: 227 IDTEDDYPYKALDGKCDINRRN 248
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT LV +S Q+LVDCD + C GG +E ++++I N G
Sbjct: 167 GSCWAFSTIAAVEGINKIVTGELVSLSEQELVDCDRTVNA-GCDGGLMEYAFEFIINNGG 225
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+++ DYP GV C ++ N
Sbjct: 226 IDSDEDYPYRGVDGKCDQYKKN 247
>gi|223673161|gb|ACN12762.1| Cathepsin S precursor [Salmo salar]
Length = 330
Score = 80.1 bits (196), Expect = 2e-13, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG T L+D+S Q LVDC ++ ++ C GGF+ +QYVI N+G
Sbjct: 137 GSCWAFSSVGALEGQLMKTTGKLIDLSPQNLVDCSSKYGNKGCHGGFMTKAFQYVIDNQG 196
Query: 65 INTERDYPNVGVMDNC 80
I +++ YP GV C
Sbjct: 197 IASDQSYPYKGVQQQC 212
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 80.1 bits (196), Expect = 2e-13, Method: Composition-based stats.
Identities = 30/71 (42%), Positives = 49/71 (69%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG+ ++ T L+ +S Q+LVDCD +GE C GG ++T + ++++N+G
Sbjct: 153 GCCWAFSAVAAMEGLHQLKTGELIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKG 212
Query: 65 INTERDYPNVG 75
+ TE +YP G
Sbjct: 213 LTTEVNYPYKG 223
>gi|222625810|gb|EEE59942.1| hypothetical protein OsJ_12596 [Oryza sativa Japonica Group]
Length = 213
Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 34/72 (47%), Positives = 48/72 (66%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI T LV +S Q+LVDCD +GE + C GG ++T +QY+ + G
Sbjct: 19 GCCWAFSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGG 78
Query: 65 INTERDYPNVGV 76
+ E YP GV
Sbjct: 79 LAAESSYPYRGV 90
>gi|46576360|sp|P60994.1|ERVB_TABDI RecName: Full=Ervatamin-B; Short=ERV-B
gi|30749291|pdb|1IWD|A Chain A, Proposed Amino Acid Sequence And The 1.63 Angstrom X-ray
Crystal Structure Of A Plant Cysteine Protease Ervatamin
B: Insight Into The Structural Basis Of Its Stability
And Substrate Specificity
Length = 215
Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 53/81 (65%), Gaps = 2/81 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+KI T L+ +S Q+LVDCD S C GG++ +QY+I N G
Sbjct: 23 GSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDT--ASHGCNGGWMNNAFQYIITNGG 80
Query: 65 INTERDYPNVGVMDNCKVFQF 85
I+T+++YP V +CK ++
Sbjct: 81 IDTQQNYPYSAVQGSCKPYRL 101
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 80.1 bits (196), Expect = 2e-13, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A EGI +I T NLV +S Q+LVDCD+ + C GGF+E ++++I+N G
Sbjct: 149 GSCWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSVDD--GCEGGFMEDGFEFIIKNGG 206
Query: 65 INTERDYPNVGVMDNC 80
I +E +YP GV C
Sbjct: 207 ITSETNYPYKGVDGTC 222
>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
Length = 323
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 36/77 (46%), Positives = 49/77 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EG + T +LV +S Q LVDC + ++ C GG+ YQY+I NRG
Sbjct: 128 GSCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRG 187
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP + DNC+
Sbjct: 188 IDTESSYPYKAIDDNCR 204
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI+K+ T L+ +S Q+LVDCD +G + C GG ++ +++++QN+G
Sbjct: 145 GCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP G C
Sbjct: 205 LATEAIYPYEGFDGTC 220
>gi|414591546|tpg|DAA42117.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
gi|414591547|tpg|DAA42118.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 268
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 53/76 (69%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EG++KI+T LV +S Q+LVDCD+ +++ C GG ++ +QY+ +N G
Sbjct: 161 GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDV-DNQGCDGGLMDYAFQYIQRNGG 219
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP + +C
Sbjct: 220 VTTESNYPYLAEQRSC 235
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA V +IEG+ +I T LV +S Q++VDCD G C GG + ++V +N G
Sbjct: 183 GSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGG 242
Query: 65 INTERDYPNVGVMDNC 80
+ TE DYP VG C
Sbjct: 243 LTTESDYPYVGSQRQC 258
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI+K+ T L+ +S Q+LVDCD +G + C GG ++ +++++QN+G
Sbjct: 147 GCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKG 206
Query: 65 INTERDYPNVGVMDNC 80
+ E YP GV C
Sbjct: 207 LAAEAIYPYEGVDGTC 222
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 50/80 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ VGA+EG+ I T NLV +S QQ++DCD ++ C GG+++ +QYVI N G
Sbjct: 173 GCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVINNGG 232
Query: 65 INTERDYPNVGVMDNCKVFQ 84
+ TE YP V C+ Q
Sbjct: 233 VTTEDAYPYSAVQGTCQNVQ 252
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 38/82 (46%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+L+DCDNQ E++ C GG +E ++Y+ Q G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCDNQ-ENQGCNGGLMEYAFEYIKQKGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE YP +C + N
Sbjct: 209 ITTESYYPYTANDGSCDATKEN 230
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 36/79 (45%), Positives = 54/79 (68%), Gaps = 7/79 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ G++EG++KIVT +L+ +S Q+LV+CD NQG C GG ++ ++++I+
Sbjct: 162 GSCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTSYNQG----CNGGLMDYAFEFIIK 217
Query: 62 NRGINTERDYPNVGVMDNC 80
N GI+TE DYP G C
Sbjct: 218 NGGIDTEEDYPYTGKDGKC 236
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + + C GG ++ +QY++ N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RSFNNGCYGGLMDYAFQYIMSNSG 211
Query: 65 INTERDYP 72
+ E DYP
Sbjct: 212 LRKEEDYP 219
>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
Length = 337
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LVD+S Q LVDC + ++ C GGF++ +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSLKYGNKGCNGGFMDRAFQYVIDNKG 203
Query: 65 INTERDYPNVGVMDNC 80
I++E YP G + C
Sbjct: 204 IDSEASYPYRGQLQQC 219
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 52/76 (68%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KI T LV +S Q+LVDCD+ +++ C GG ++ +QY+ +N G
Sbjct: 160 GSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDV-DNQGCNGGLMDYAFQYIKRNGG 218
Query: 65 INTERDYPNVGVMDNC 80
I TE +YP + +C
Sbjct: 219 ITTESNYPYLAEQRSC 234
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 51/78 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT LV +S Q+LV+C G++ C GG ++ + ++ +N G
Sbjct: 178 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGG 237
Query: 65 INTERDYPNVGVMDNCKV 82
++TE DYP + C +
Sbjct: 238 LDTEEDYPYTAMDGKCNL 255
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + + C GG ++ +QY++ N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RSFNNGCYGGLMDYAFQYIMSNSG 211
Query: 65 INTERDYP 72
+ E DYP
Sbjct: 212 LRKEEDYP 219
>gi|214015305|gb|ACJ62269.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI+ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI++I T L+ +S Q+LVDCD +G C GG ++T ++++I N G
Sbjct: 148 GCCWAFSAVAAIEGITQISTGKLISLSEQELVDCDTKGIDHGCEGGLMDTAFEFIINNGG 207
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP G C
Sbjct: 208 LTTESNYPYKGEDGTC 223
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT L+ +S Q+LVDCD + C GG ++ ++++I+N G
Sbjct: 161 GSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 219
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP G C
Sbjct: 220 IDTEADYPYTGRYGRC 235
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 51/78 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT LV +S Q+LV+C G++ C GG ++ + ++ +N G
Sbjct: 178 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGG 237
Query: 65 INTERDYPNVGVMDNCKV 82
++TE DYP + C +
Sbjct: 238 LDTEEDYPYTAMDGKCNL 255
>gi|116788286|gb|ABK24823.1| unknown [Picea sitchensis]
Length = 294
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/82 (50%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ GAIEGI+KIVT +LV +S Q+L DCD S C GG ++ +Q+VI N G
Sbjct: 148 GDCWAFSATGAIEGINKIVTGSLVSLSEQELCDCDTSYNS-GCDGGLMDYAFQWVIVNGG 206
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP GV C + N
Sbjct: 207 IDTEVDYPYKGVQKACNSKKVN 228
>gi|214015353|gb|ACJ62293.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 254
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI+ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 54/76 (71%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++EGI+KIVT +L+ +S Q+LVDCDN+ S C GG ++ +Q+++ N G
Sbjct: 150 GSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNS-GCNGGSMDYAFQFIVSNGG 208
Query: 65 INTERDYPNVGVMDNC 80
I++E DYP GV C
Sbjct: 209 IDSESDYPYKGVGAVC 224
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA V +IEG+ +I T LV +S Q++VDCD G C GG + ++V +N G
Sbjct: 157 GSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGG 216
Query: 65 INTERDYPNVGVMDNC 80
+ TE DYP VG C
Sbjct: 217 LTTESDYPYVGSQRQC 232
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD + C GG ++ + ++I N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP G + C V + N
Sbjct: 210 IDTEDDYPYKGKDERCDVNRKN 231
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 55/82 (67%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +EGI+KIV+ LV +S Q+LVDCD ++ C GG ++ +Q+++ N G
Sbjct: 157 GSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDA-GCNGGLMDYAFQFIMDNGG 215
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE+DYP +G + C + N
Sbjct: 216 IDTEKDYPYLGFNNQCDPTKKN 237
>gi|75994616|gb|ABA33829.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI+ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 182 IDTEADYPFIGTDGTCDASKEN 203
>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
Length = 289
Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/79 (48%), Positives = 54/79 (68%), Gaps = 7/79 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ +GA+EGI+KIVT +L+ +S Q+LVDCD NQG C GG ++ ++++I+
Sbjct: 25 GSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIK 80
Query: 62 NRGINTERDYPNVGVMDNC 80
N GI+TE DYP C
Sbjct: 81 NGGIDTEEDYPYKAADGRC 99
>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
Length = 333
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG T L+D+S Q LVDC ++ ++ C GGF+ +QYVI N+G
Sbjct: 140 GSCWAFSSVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKG 199
Query: 65 INTERDYPNVGVMDNC 80
I+++ YP GV C
Sbjct: 200 IDSDTSYPYQGVQGTC 215
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 51/81 (62%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ V A+EGI+KIVT NL+ +S Q+LVDC +R C G++ +Q++I N GI
Sbjct: 151 SCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQRTRGCNRGYMNDAFQFIIDNGGI 210
Query: 66 NTERDYPNVGVMDNCKVFQFN 86
NTE +YP C ++ N
Sbjct: 211 NTEDNYPYTAQDGQCDWYRKN 231
>gi|219362839|ref|NP_001136636.1| uncharacterized protein LOC100216764 precursor [Zea mays]
gi|194696462|gb|ACF82315.1| unknown [Zea mays]
gi|413934556|gb|AFW69107.1| hypothetical protein ZEAMMB73_554980 [Zea mays]
Length = 361
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 36/77 (46%), Positives = 49/77 (63%), Gaps = 2/77 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA + A+EGI+ I T +LV +S QQLVDCDN C GG+I + ++++NRG
Sbjct: 169 GSCWAFAAIAAVEGINAIRTWSLVTLSEQQLVDCDNV--DHGCAGGWIPSALDFIVRNRG 226
Query: 65 INTERDYPNVGVMDNCK 81
I E YP +G C+
Sbjct: 227 IVPEGTYPYIGTQGRCR 243
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 34/83 (40%), Positives = 56/83 (67%), Gaps = 1/83 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT ++ +S Q+LVDCD + ++ C GG ++ ++++I N G
Sbjct: 162 GSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCD-RVQNSGCNGGLMDYAFEFIISNGG 220
Query: 65 INTERDYPNVGVMDNCKVFQFNW 87
++TE+ YP GV C + N+
Sbjct: 221 MDTEKHYPYRGVEGRCDPVRKNY 243
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EG+ K+ T LV +S Q+LVDCD +GE + C GG +E ++++ +N G
Sbjct: 146 GSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGEDKGCQGGLMEDAFKFIKRNGG 205
Query: 65 INTERDYPNVGVMDNC 80
I TE +Y G C
Sbjct: 206 ITTEANYAYRGRDGKC 221
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/80 (46%), Positives = 53/80 (66%), Gaps = 3/80 (3%)
Query: 3 PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
P GSCW F+ V +EGI+KIVT NL+ +S Q+L+DCD + S C GG+ T +YV+ N
Sbjct: 155 PCGSCWAFSTVATVEGINKIVTGNLISLSEQELLDCDRR--SHGCKGGYQTTSLKYVVDN 212
Query: 63 RGINTERDYPNVGVMDNCKV 82
G++TE++YP NC+
Sbjct: 213 -GVHTEKEYPYEKKQGNCRA 231
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 50/80 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ VGA+EG+ I T NLV +S QQ++DCD ++ C GG+++ +QYV+ N G
Sbjct: 172 GCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGG 231
Query: 65 INTERDYPNVGVMDNCKVFQ 84
+ TE YP V C+ Q
Sbjct: 232 VTTEDAYPYSAVQGTCQNVQ 251
>gi|214015295|gb|ACJ62264.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP VG C + N
Sbjct: 182 IDTEADYPFVGTDGTCDASKEN 203
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI ++ T L+ +S Q+LVDCD +G + C GG ++ ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 206
Query: 65 INTERDYPNVGVMDNC 80
++TE YP GV C
Sbjct: 207 LDTEAKYPYQGVDGTC 222
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 53/76 (69%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EG++KI+T LV +S Q+LVDCD+ +++ C GG ++ +QY+ +N G
Sbjct: 161 GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDV-DNQGCDGGLMDYAFQYIQRNGG 219
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP + +C
Sbjct: 220 VTTESNYPYLAEQRSC 235
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 38/85 (44%), Positives = 55/85 (64%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ V A+EGI++IVT NL +S Q+LVDCD NQG C GG ++ +Q++I
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQG----CNGGLMDYAFQFIIS 209
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N G+++E DYP +C ++ N
Sbjct: 210 NGGLDSEDDYPYKANNGSCDAYRKN 234
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT NL +S Q+L+DCD + C GG ++ ++Y+++N G
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGG 218
Query: 65 INTERDYP 72
+ E DYP
Sbjct: 219 LRKEEDYP 226
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI KI T L+ +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 38 GCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVYGEDQGCEGGLMDDAFKFIIKNGG 97
Query: 65 INTERDYPNVGVMDNCK 81
+ TE +YP CK
Sbjct: 98 LTTESNYPYTAADGKCK 114
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EG+++I LV +S Q+LVDCD E+ C GGF+ +++V+ N G
Sbjct: 233 GSCWAFSAVAAMEGLNQIKNGKLVSLSEQELVDCD--AEAVGCAGGFMSWAFEFVMANHG 290
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
+ TE YP G+ C+ + N
Sbjct: 291 LTTEASYPYKGINGACQTAKLN 312
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 54/78 (69%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ V A+EGI+KIVT +L+ +S Q+L+DCD + + + C GG ++ + ++I+N G
Sbjct: 177 GACWAFSAVAAVEGINKIVTGSLISLSEQELIDCD-KFQDQGCDGGLMDNAFVFMIKNGG 235
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP G C +
Sbjct: 236 IDTEADYPFTGHDGTCDL 253
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD + C GG ++ + ++I N G
Sbjct: 152 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 210
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP G + C V + N
Sbjct: 211 IDTEDDYPYKGKDERCDVNRKN 232
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD + C GG ++ + ++I N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP G + C V + N
Sbjct: 210 IDTEDDYPYKGKDERCDVNRKN 231
>gi|76574390|gb|ABA46965.1| cysteine protease Mir1 [Zea diploperennis]
Length = 256
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211
>gi|214015368|gb|ACJ62300.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 257
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 134 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 191
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 192 IDTEADYPFIGTDGTCDASKEN 213
>gi|76574402|gb|ABA46971.1| cysteine protease Mir1 [Zea diploperennis]
Length = 256
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 47/77 (61%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW FA V A+E I +I T NLV +S QQ++DCD G + C GG+I+ +QY+I N G
Sbjct: 159 GCCWAFAAVAAVESIHQITTGNLVSLSEQQVLDCDTDGNN-GCNGGYIDNAFQYIISNGG 217
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP C+
Sbjct: 218 LATEDAYPYAAAQGTCQ 234
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD + C GG ++ + ++I N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP G + C V + N
Sbjct: 210 IDTEDDYPYKGKDERCDVNRKN 231
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 47/76 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T L+ +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP C
Sbjct: 205 LTTESKYPYTAADGKC 220
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 47/76 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T L+ +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP C
Sbjct: 205 LTTESKYPYTAADGKC 220
>gi|76574394|gb|ABA46967.1| cysteine protease Mir1 [Zea diploperennis]
Length = 256
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211
>gi|214015297|gb|ACJ62265.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207
>gi|214015259|gb|ACJ62246.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211
>gi|76574404|gb|ABA46972.1| cysteine protease Mir1 [Zea diploperennis]
Length = 250
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 184 IDTEADYPFIGTDGTCDASKEN 205
>gi|214015355|gb|ACJ62294.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 252
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 129 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 186
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 187 IDTEADYPFIGTDGTCDASKEN 208
>gi|214015351|gb|ACJ62292.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 47/76 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T L+ +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP C
Sbjct: 205 LTTESKYPYTAADGKC 220
>gi|75994620|gb|ABA33831.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 255
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210
>gi|75994618|gb|ABA33830.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 255
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210
>gi|214015291|gb|ACJ62262.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP C + N
Sbjct: 209 ITTESNYPYKAQEGTCDASKVN 230
>gi|214015279|gb|ACJ62256.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211
>gi|214015331|gb|ACJ62282.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207
>gi|76574392|gb|ABA46966.1| cysteine protease Mir1 [Zea diploperennis]
gi|76574396|gb|ABA46968.1| cysteine protease Mir1 [Zea diploperennis]
gi|76574398|gb|ABA46969.1| cysteine protease Mir1 [Zea diploperennis]
gi|76574406|gb|ABA46973.1| cysteine protease Mir1 [Zea diploperennis]
Length = 250
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 184 IDTEADYPFIGTDGTCDASKEN 205
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q G
Sbjct: 149 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 207
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP C + N
Sbjct: 208 ITTESNYPYKAQEGTCDASKVN 229
>gi|75994632|gb|ABA33837.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 182 IDTEADYPFIGTDGTCDASKEN 203
>gi|214015339|gb|ACJ62286.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015343|gb|ACJ62288.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015347|gb|ACJ62290.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015349|gb|ACJ62291.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211
>gi|214015307|gb|ACJ62270.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015309|gb|ACJ62271.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015313|gb|ACJ62273.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015315|gb|ACJ62274.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015317|gb|ACJ62275.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015319|gb|ACJ62276.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015321|gb|ACJ62277.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015323|gb|ACJ62278.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 249
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 184 IDTEADYPFIGTDGTCDASKEN 205
>gi|214015269|gb|ACJ62251.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 182 IDTEADYPFIGTDGTCDASKEN 203
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+L+DCDNQ E++ C GG +E ++Y+ Q G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCDNQ-ENQGCNGGLMEYAFEYIKQKGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
+ TE YP +C + N
Sbjct: 209 VTTESYYPYTANDGSCDATKEN 230
>gi|214015289|gb|ACJ62261.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 254
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210
>gi|214015235|gb|ACJ62234.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015271|gb|ACJ62252.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015275|gb|ACJ62254.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 254
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+L+DCDNQ E++ C GG +E ++Y+ Q G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCDNQ-ENQGCNGGLMEYAFEYIKQKGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
+ TE YP +C + N
Sbjct: 209 VTTESYYPYTANDGSCDATKEN 230
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 35/78 (44%), Positives = 52/78 (66%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD + C GG ++ + ++I N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP G + C V
Sbjct: 210 IDTEDDYPYKGKDERCDV 227
>gi|214015357|gb|ACJ62295.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207
>gi|214015267|gb|ACJ62250.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 254
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210
>gi|75994622|gb|ABA33832.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 254
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 130 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 187
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 188 IDTEADYPFIGTDGTCDASKEN 209
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 36/75 (48%), Positives = 50/75 (66%), Gaps = 1/75 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI++I T LV +S Q+LVDCD + C GG ++ +Q++I N G
Sbjct: 145 GSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTS-YNNGCGGGLMDYAFQFIISNGG 203
Query: 65 INTERDYPNVGVMDN 79
I+TE DYP DN
Sbjct: 204 IDTEEDYPYTATDDN 218
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI K+ T NL+ +S QQLVDC ++ C GG ++T +QY+I+N G
Sbjct: 146 GCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDC--TAGNKGCQGGLMDTAFQYIIRNGG 203
Query: 65 INTERDYPNVGVMDNC 80
+ +E +YP GV C
Sbjct: 204 LTSEDNYPYQGVDGTC 219
>gi|214015327|gb|ACJ62280.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015329|gb|ACJ62281.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015345|gb|ACJ62289.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207
>gi|214015301|gb|ACJ62267.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 253
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 130 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 187
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 188 IDTEADYPFIGTDGTCDASKEN 209
>gi|214015241|gb|ACJ62237.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015243|gb|ACJ62238.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015255|gb|ACJ62244.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015261|gb|ACJ62247.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 254
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 188
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210
>gi|214015303|gb|ACJ62268.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 79.0 bits (193), Expect = 4e-13, Method: Composition-based stats.
Identities = 38/85 (44%), Positives = 55/85 (64%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ V A+EGI++IVT NL +S Q+LVDCD NQG C GG ++ +Q++I
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQG----CNGGLMDYAFQFIIN 209
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N G+++E DYP +C ++ N
Sbjct: 210 NGGLDSEDDYPYKANDGSCDAYRKN 234
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI + L+ +S Q++VDCD +G+ + C GGF++ ++++IQN G
Sbjct: 147 GCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQDQGCAGGFMDGAFKFIIQNHG 206
Query: 65 INTERDYPNVGVMDNC 80
+NTE +YP C
Sbjct: 207 LNTEPNYPYKAADGKC 222
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 79.0 bits (193), Expect = 4e-13, Method: Composition-based stats.
Identities = 33/71 (46%), Positives = 51/71 (71%), Gaps = 7/71 (9%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 154 GSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 209
Query: 62 NRGINTERDYP 72
N GI++E DYP
Sbjct: 210 NGGIDSEEDYP 220
>gi|214015366|gb|ACJ62299.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 182 IDTEADYPFIGTDGTCDASKEN 203
>gi|214015233|gb|ACJ62233.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015239|gb|ACJ62236.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015265|gb|ACJ62249.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 253
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 130 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 187
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 188 IDTEADYPFIGTDGTCDASKEN 209
>gi|297602258|ref|NP_001052246.2| Os04g0208200 [Oryza sativa Japonica Group]
gi|255675225|dbj|BAF14160.2| Os04g0208200, partial [Oryza sativa Japonica Group]
Length = 219
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG K+ T LV +S QQLV CD +GE + C GG ++ + ++I+N G
Sbjct: 21 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 80
Query: 65 INTERDYPNVGVMDNC 80
+ E DYP D C
Sbjct: 81 LAAESDYPYTASDDKC 96
>gi|214015247|gb|ACJ62240.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSGQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211
>gi|214015237|gb|ACJ62235.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015245|gb|ACJ62239.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015249|gb|ACJ62241.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015257|gb|ACJ62245.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015273|gb|ACJ62253.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015277|gb|ACJ62255.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211
>gi|214015253|gb|ACJ62243.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 190 IDTEADYPFIGTDGTCDASKEN 211
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT L+ +S Q+LVDCD + + C GG ++ +Q++I N G
Sbjct: 156 GSCWAFSTISAVEGINQIVTGELISLSEQELVDCD-KSYNMGCNGGLMDYGFQFIINNGG 214
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C F+ N
Sbjct: 215 IDTEEDYPYRAVDGTCDQFRKN 236
>gi|214015372|gb|ACJ62302.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 250
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 127 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 184
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 185 IDTEADYPFIGTDGTCDASKEN 206
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 79.0 bits (193), Expect = 4e-13, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EG+++I T ++ +S Q+LVDCD ++ C GG ++ ++++I N G
Sbjct: 158 GSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDA-GCNGGLMDYAFEFIINNGG 216
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP GV C
Sbjct: 217 IDTEEDYPYRGVDGTC 232
>gi|214015325|gb|ACJ62279.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207
>gi|214015361|gb|ACJ62297.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 182 IDTEADYPFIGTDGTCDASKEN 203
>gi|75994612|gb|ABA33827.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 254
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 130 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 187
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP VG C + N
Sbjct: 188 IDTEADYPFVGTDGTCDANKEN 209
>gi|214015380|gb|ACJ62306.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015386|gb|ACJ62309.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 249
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 184 IDTEADYPFIGTDGTCDASKEN 205
>gi|214015263|gb|ACJ62248.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 252
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 129 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 186
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 187 IDTEADYPFIGTDGTCDASKEN 208
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 53/76 (69%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +G++EG++ IVT L+ +S Q+LVDCD +G+++ C GG ++ + ++I+N G
Sbjct: 159 GSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCD-RGQNQGCNGGLMDYAFDFIIKNGG 217
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP C
Sbjct: 218 IDTEEDYPYKATDGQC 233
>gi|214015333|gb|ACJ62283.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015335|gb|ACJ62284.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015337|gb|ACJ62285.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015341|gb|ACJ62287.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 186 IDTEADYPFIGTDGTCDASKEN 207
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 78.6 bits (192), Expect = 4e-13, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 53/68 (77%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VG++EGI++IVT +L+ +S Q+LVDCD + ++ C GG ++ ++++I+N G
Sbjct: 164 GSCWAFSTVGSVEGINQIVTGDLISLSEQELVDCD-KAYNQGCNGGLMDYAFEFIIKNGG 222
Query: 65 INTERDYP 72
I++E DYP
Sbjct: 223 IDSEADYP 230
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 78.6 bits (192), Expect = 4e-13, Method: Composition-based stats.
Identities = 36/77 (46%), Positives = 49/77 (63%), Gaps = 2/77 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I T NLV +S QQLVDCD + C GG + T + +V++NRG
Sbjct: 174 GSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCDKL--NHGCNGGLMTTAFSFVVRNRG 231
Query: 65 INTERDYPNVGVMDNCK 81
+ E YP +G CK
Sbjct: 232 VVPEGAYPYMGREGRCK 248
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 78.6 bits (192), Expect = 4e-13, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 50/78 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT LV +S Q+LV+C G + C GG ++ + ++ +N G
Sbjct: 179 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGG 238
Query: 65 INTERDYPNVGVMDNCKV 82
++TE DYP + C +
Sbjct: 239 LDTEEDYPYTAMDGKCNL 256
>gi|214015378|gb|ACJ62305.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 254
Score = 78.6 bits (192), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ IVT NLV +S Q+++DCD Q C GG +E + +VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIVTGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFLFVIGNGG 188
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 78.6 bits (192), Expect = 4e-13, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 50/78 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT LV +S Q+LV+C G + C GG ++ + ++ +N G
Sbjct: 179 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGG 238
Query: 65 INTERDYPNVGVMDNCKV 82
++TE DYP + C +
Sbjct: 239 LDTEEDYPYTAMDGKCNL 256
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 78.6 bits (192), Expect = 4e-13, Method: Composition-based stats.
Identities = 34/68 (50%), Positives = 48/68 (70%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT LV +S Q+L+DCDN + C GG ++ + Y++ N+G
Sbjct: 165 GSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDNTF-NHGCRGGLMDFAFAYIMGNQG 223
Query: 65 INTERDYP 72
I TE DYP
Sbjct: 224 IYTEEDYP 231
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 78.6 bits (192), Expect = 4e-13, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EGI+ IVT +L+ +S Q+LVDCD + C GG+++ +++VI N G
Sbjct: 158 GSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVISNGG 215
Query: 65 INTERDYPNVGVMDNC 80
I++E DYP G C
Sbjct: 216 IDSESDYPYTGTDGTC 231
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 78.6 bits (192), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEGI+ IVT++L+ +S Q+LVDCD + C GG+++ +++VI N G
Sbjct: 155 GSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVINNGG 212
Query: 65 INTERDYPNVGVMDNC 80
I+TE +YP GV C
Sbjct: 213 IDTEANYPYTGVDGTC 228
>gi|255586666|ref|XP_002533962.1| cysteine protease, putative [Ricinus communis]
gi|223526059|gb|EEF28418.1| cysteine protease, putative [Ricinus communis]
Length = 417
Score = 78.6 bits (192), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 54/80 (67%), Gaps = 2/80 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEGI+ IVT +LV +S Q+L+DCD + C GG+++ +++VI N G
Sbjct: 163 GSCWAFSSTGAIEGINAIVTGDLVSLSEQELMDCDTT--NYGCDGGYMDYAFEWVINNGG 220
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+TE DYP GV C + +
Sbjct: 221 IDTEIDYPYTGVDGTCNIAK 240
>gi|214015293|gb|ACJ62263.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 256
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 133 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 190
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 191 IDTEADYPFIGTDGTCDANKEN 212
>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
gi|255645733|gb|ACU23360.1| unknown [Glycine max]
Length = 362
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 41/77 (53%), Positives = 48/77 (62%), Gaps = 2/77 (2%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
S W F+V GAIEGI+KIVT NLV +S QQ+VDCD S C GGF + YVI+N GI
Sbjct: 160 SHWAFSVTGAIEGINKIVTGNLVSLSVQQVVDCDPA--SHGCAGGFYFNAFGYVIENGGI 217
Query: 66 NTERDYPNVGVMDNCKV 82
+TE YP CK
Sbjct: 218 DTEAHYPYTAQNGTCKA 234
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 78.6 bits (192), Expect = 5e-13, Method: Composition-based stats.
Identities = 34/68 (50%), Positives = 48/68 (70%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT LV +S Q+L+DCDN + C GG ++ + Y++ N+G
Sbjct: 156 GSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDNTF-NHGCRGGLMDFAFAYIMGNQG 214
Query: 65 INTERDYP 72
I TE DYP
Sbjct: 215 IYTEEDYP 222
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 78.6 bits (192), Expect = 5e-13, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT NL +S Q+L+DCD + C GG ++ ++Y+++N G
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGG 218
Query: 65 INTERDYP 72
+ E DYP
Sbjct: 219 LRKEEDYP 226
>gi|214015287|gb|ACJ62260.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 253
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 130 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 187
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 188 IDTEADYPFIGTDGTCDANKEN 209
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 78.6 bits (192), Expect = 5e-13, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 53/78 (67%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+KIVT +L+ +S Q+L+DCD + + + C GG ++ + ++I+N G
Sbjct: 186 GGCWAFSAVAAVEGINKIVTGSLISLSEQELIDCD-KFQDQGCDGGLMDNAFVFMIKNGG 244
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP G C +
Sbjct: 245 IDTEADYPFTGHDGTCDL 262
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 49/77 (63%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI NLV +S QQL+DCD + + R C GG + + Y+IQNRG
Sbjct: 148 GCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDCDREYD-RGCDGGIMSDAFNYIIQNRG 206
Query: 65 INTERDYPNVGVMDNCK 81
I +E DY G C+
Sbjct: 207 IASENDYSYQGSDGRCR 223
>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
Length = 245
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 53/78 (67%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT L+ +S Q+LVDCD + + C GG ++ + ++I+N G
Sbjct: 28 GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYD-MGCNGGLMDYAFDFIIKNGG 86
Query: 65 INTERDYPNVGVMDNCKV 82
++TE+DYP G C +
Sbjct: 87 LDTEKDYPYTGFDGECNL 104
>gi|75994614|gb|ABA33828.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 251
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 127 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 184
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 185 IDTEADYPFIGTDGTCDANKEN 206
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 78.6 bits (192), Expect = 5e-13, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT NL +S Q+L+DCD + C GG ++ ++Y+++N G
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGG 218
Query: 65 INTERDYP 72
+ E DYP
Sbjct: 219 LRKEEDYP 226
>gi|214015370|gb|ACJ62301.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015384|gb|ACJ62308.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015392|gb|ACJ62312.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 182 IDTEADYPFIGTDGTCDANKEN 203
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 78.2 bits (191), Expect = 5e-13, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T LV +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 145 GCCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 204
Query: 65 INTERDYPNVGVMDNCK 81
+ E +YP CK
Sbjct: 205 LTQESNYPYDAADGKCK 221
>gi|214015374|gb|ACJ62303.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 182 IDTEADYPFIGTDGTCDANKEN 203
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 78.2 bits (191), Expect = 5e-13, Method: Composition-based stats.
Identities = 36/85 (42%), Positives = 56/85 (65%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ V A+EGI++IVT +L+ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 151 GSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 206
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N G+++E DYP +C ++ N
Sbjct: 207 NGGLDSEEDYPYTAYDGSCDSYRKN 231
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 78.2 bits (191), Expect = 5e-13, Method: Composition-based stats.
Identities = 34/68 (50%), Positives = 50/68 (73%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GAIEGI+KI T +L+ +S Q+L+DCD + + C GG ++ Y++VI+N G
Sbjct: 160 GACWSFSATGAIEGINKIKTGSLISLSEQELIDCD-RSYNAGCGGGLMDYAYRFVIKNGG 218
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 219 IDTEDDYP 226
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 47/68 (69%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEGI KI + NLV +S QQLVDCD G ++ C G + ++++++N G
Sbjct: 145 GSCWAFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGG 204
Query: 65 INTERDYP 72
I TE +YP
Sbjct: 205 IATEANYP 212
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 50/82 (60%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++EGI+ I T NLV +S QQLVDC E+ C GG ++T +QY+I N G
Sbjct: 155 GSCWAFSTVASVEGINYITTGNLVSLSEQQLVDCST--ENSGCNGGLMDTAFQYIINNGG 212
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP C + N
Sbjct: 213 IVTEDNYPYTAEATECSSTKIN 234
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 78.2 bits (191), Expect = 5e-13, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T NL+ +S Q+LVDCD C GG++++ +++VI+N G
Sbjct: 144 GCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 203
Query: 65 INTERDYPNVGVMDNCK 81
+ T YP V CK
Sbjct: 204 LATVSSYPYKAVDGKCK 220
>gi|214015299|gb|ACJ62266.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 251
Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 128 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 185
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 186 IDTEADYPFIGTDGTCDANKEN 207
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 78.2 bits (191), Expect = 5e-13, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 48/68 (70%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT L+ +S Q+LV CD + S C GG ++ +Q++I N G
Sbjct: 146 GSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNS-GCNGGLMDYAFQFIIDNGG 204
Query: 65 INTERDYP 72
++TE DYP
Sbjct: 205 LDTEEDYP 212
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 78.2 bits (191), Expect = 5e-13, Method: Composition-based stats.
Identities = 35/82 (42%), Positives = 56/82 (68%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT L+ +S Q+LVDCD + + C GG ++ ++++I N G
Sbjct: 158 GSCWAFSTVNAVEGINQIVTGELITLSEQELVDCD-KSYNEGCDGGLMDYGFEFIINNGG 216
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+T++DYP +G C ++ N
Sbjct: 217 IDTDKDYPYLGRDARCDQYRKN 238
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 78.2 bits (191), Expect = 5e-13, Method: Composition-based stats.
Identities = 37/81 (45%), Positives = 54/81 (66%), Gaps = 4/81 (4%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q G
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208
Query: 65 INTERDYPNV---GVMDNCKV 82
I TE +YP G D KV
Sbjct: 209 ITTESNYPYTAQEGTCDESKV 229
>gi|326520659|dbj|BAJ92693.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 289
Score = 78.2 bits (191), Expect = 5e-13, Method: Composition-based stats.
Identities = 33/71 (46%), Positives = 51/71 (71%), Gaps = 7/71 (9%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 154 GSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 209
Query: 62 NRGINTERDYP 72
N GI++E DYP
Sbjct: 210 NGGIDSEEDYP 220
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 56/85 (65%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI+ IVT +L+ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 162 GSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQG----CNGGLMDYAFEFIIS 217
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI+T+ DYP G +C ++ N
Sbjct: 218 NGGIDTDEDYPYTGRDGSCDQYRKN 242
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 78.2 bits (191), Expect = 6e-13, Method: Composition-based stats.
Identities = 37/81 (45%), Positives = 54/81 (66%), Gaps = 4/81 (4%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q G
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208
Query: 65 INTERDYP---NVGVMDNCKV 82
I TE +YP G D KV
Sbjct: 209 ITTESNYPYKAQEGTCDESKV 229
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 78.2 bits (191), Expect = 6e-13, Method: Composition-based stats.
Identities = 37/82 (45%), Positives = 51/82 (62%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + E+ C GG +E+ +Q++ Q G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTE-ENAGCNGGLMESAFQFIKQKGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE YP C + N
Sbjct: 209 ITTESYYPYTAQDGTCDASKAN 230
>gi|75994608|gb|ABA33825.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 250
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 183
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP VG C
Sbjct: 184 IDTEADYPFVGTDGTCDA 201
>gi|348525687|ref|XP_003450353.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 170
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 48/75 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+V GA+EG T LVD+S Q LVDC + + C GG+I +QYVI N+G
Sbjct: 64 GACWAFSVAGALEGQLAKTTGKLVDLSPQNLVDCSGKYGNHGCNGGYISRAFQYVIDNQG 123
Query: 65 INTERDYPNVGVMDN 79
I+++ YP G M+N
Sbjct: 124 IDSDASYPYTGRMEN 138
>gi|214015394|gb|ACJ62313.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 256
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 133 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 190
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +G C
Sbjct: 191 IDTEADYPFIGTDGTCDA 208
>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
Length = 258
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 49/80 (61%), Gaps = 1/80 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW FA V A+EGI +I T NLV +S QQ++DCD G + C GG+I+ +QY++ N G
Sbjct: 69 GCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDG-NNGCNGGYIDNAFQYIVGNGG 127
Query: 65 INTERDYPNVGVMDNCKVFQ 84
+ TE YP C+ Q
Sbjct: 128 LATEDAYPYTAAQAMCQSVQ 147
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 49/77 (63%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI NLV +S QQL+DCD + + R C GG + + YV+QNRG
Sbjct: 152 GCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYD-RGCDGGIMSDAFNYVVQNRG 210
Query: 65 INTERDYPNVGVMDNCK 81
I +E DY G C+
Sbjct: 211 IASENDYSYQGSDGGCR 227
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 78.2 bits (191), Expect = 6e-13, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 52/78 (66%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + S C GG ++ + ++++N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYSNGCNGGLMDYAFSFIVENGG 211
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 212 LHKEEDYPYIMEEGTCEM 229
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 49/77 (63%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI NLV +S QQL+DCD + + R C GG + + YV+QNRG
Sbjct: 152 GCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYD-RDCDGGIMSDAFNYVVQNRG 210
Query: 65 INTERDYPNVGVMDNCK 81
I +E DY G C+
Sbjct: 211 IASENDYSYQGSDGGCR 227
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 52/76 (68%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEGI+ IVT +L+ +S Q+LVDCD + C GG+++ +++VI N G
Sbjct: 163 GSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVINNGG 220
Query: 65 INTERDYPNVGVMDNC 80
I+TE +YP GV C
Sbjct: 221 IDTEANYPYTGVDGTC 236
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 78.2 bits (191), Expect = 7e-13, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI KI T L +S Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 153 GCCWAFSAVAATEGIVKISTGKLTSLSEQELVDCDVHGEDQGCNGGEMDDAFKFIIKNGG 212
Query: 65 INTERDYPNVGVMDNCK 81
+ TE +YP CK
Sbjct: 213 LTTESNYPYTAQDGQCK 229
>gi|214015283|gb|ACJ62258.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +G C
Sbjct: 190 IDTEADYPFIGTDGTCDA 207
>gi|214015231|gb|ACJ62232.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +G C
Sbjct: 190 IDTEADYPFIGTDGTCDA 207
>gi|214015382|gb|ACJ62307.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 249
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +G C
Sbjct: 184 IDTEADYPFIGTDGTCDA 201
>gi|214015396|gb|ACJ62314.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 246
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 123 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 180
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +G C
Sbjct: 181 IDTEADYPFIGTDGTCDA 198
>gi|214015251|gb|ACJ62242.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015281|gb|ACJ62257.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015285|gb|ACJ62259.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 255
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 132 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 189
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +G C
Sbjct: 190 IDTEADYPFIGTDGTCDA 207
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 52/77 (67%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI++I+T +L+ +S Q+L+DCD S C GG ++ YQ+VI N G
Sbjct: 136 GACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNS-GCGGGLMDYAYQFVISNHG 194
Query: 65 INTERDYPNVGVMDNCK 81
I+TE DYP +C+
Sbjct: 195 IDTENDYPYQARDGSCR 211
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + ++EGI+KIVT +L+ +S Q+LVDCD + C GG ++ +Q++I N G
Sbjct: 63 GSCWAFSTIASVEGINKIVTGDLISLSEQELVDCDKT-YNDGCNGGLMDYAFQFIIDNGG 121
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE+DYP C ++ N
Sbjct: 122 IDTEKDYPYTEQDGRCDSYRKN 143
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 77.8 bits (190), Expect = 7e-13, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +E I+KIVT LV +S Q+LVDCD + + C GG ++ ++++++N G
Sbjct: 147 GSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCD-RAFNEGCNGGLMDYAFEFIVENGG 205
Query: 65 INTERDYPNVGVMDNC 80
I+TE+DYP G C
Sbjct: 206 IDTEQDYPYKGFEGRC 221
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 77.8 bits (190), Expect = 7e-13, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 46/68 (67%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD S C GG ++ + Y++ N G
Sbjct: 128 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTSFNS-GCNGGLMDYAFDYIVNNGG 186
Query: 65 INTERDYP 72
++ E DYP
Sbjct: 187 LHKEEDYP 194
>gi|214015359|gb|ACJ62296.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +G C
Sbjct: 182 IDTEADYPFIGTDGTCDA 199
>gi|214015311|gb|ACJ62272.1| cysteine protease [Zea mays subsp. parviglumis]
gi|214015376|gb|ACJ62304.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +G C
Sbjct: 182 IDTEADYPFIGTDGTCDA 199
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 77.8 bits (190), Expect = 7e-13, Method: Composition-based stats.
Identities = 35/82 (42%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I T+ LV +S Q+LVDCD + E++ C GG +E+ ++++ Q G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP C + N
Sbjct: 209 ITTESNYPYTAQEGTCDASKVN 230
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 77.8 bits (190), Expect = 7e-13, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 167 GSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGG 225
Query: 65 INTERDYPNVGVMDNC 80
I++E DYP V C
Sbjct: 226 IDSEDDYPYKAVDGRC 241
>gi|214015388|gb|ACJ62310.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 249
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 126 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 183
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +G C
Sbjct: 184 IDTEADYPFIGTDGTCDA 201
>gi|75994610|gb|ABA33826.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +G C
Sbjct: 182 IDTEADYPFIGTDGTCDA 199
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 77.8 bits (190), Expect = 7e-13, Method: Composition-based stats.
Identities = 30/76 (39%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V ++EGI K+ T L+ +S Q+LVDCD +++ C GG ++ +++++ N G
Sbjct: 142 GCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQNKGCGGGLMDNAFEFIVNNGG 201
Query: 65 INTERDYPNVGVMDNC 80
++TE DYP G C
Sbjct: 202 LDTEADYPYTGADGTC 217
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 77.8 bits (190), Expect = 8e-13, Method: Composition-based stats.
Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD + + C GG ++ +++VI N G
Sbjct: 160 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNEGCDGGLMDYAFEFVINNGG 218
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP D C ++ N
Sbjct: 219 IDTEEDYPYKERNDVCDQYRKN 240
>gi|75994628|gb|ABA33835.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 77.8 bits (190), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP +G C
Sbjct: 182 IDTEADYPFIGTDGTCDA 199
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 77.8 bits (190), Expect = 8e-13, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EG++KI T LV +S Q+LVDCD G+++ C GG ++ +Q++ +N G
Sbjct: 165 GSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDT-GDNQGCDGGLMDYAFQFIKRNGG 223
Query: 65 INTERDYP 72
I TE +YP
Sbjct: 224 ITTESNYP 231
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 77.8 bits (190), Expect = 8e-13, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG + I T LV +S Q LVDCD + ++ C GGF+++ + +++ N G
Sbjct: 158 GSCWAFSTTGAVEGANAIATGKLVSLSEQMLVDCDREYDT-GCRGGFMDSAFDFIVNNGG 216
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 217 IDTEDDYP 224
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 77.8 bits (190), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 52/76 (68%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EGI++IVT NL +S Q+LVDCD + + C GG ++ + ++I+N G
Sbjct: 136 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCD-KTYNLGCNGGLMDYAFDFIIENGG 194
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP + C
Sbjct: 195 IDTEEDYPYKAIDSMC 210
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 77.8 bits (190), Expect = 8e-13, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EG++KI T LV +S Q+LVDCD G+++ C GG ++ +Q++ +N G
Sbjct: 165 GSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDT-GDNQGCDGGLMDYAFQFIKRNGG 223
Query: 65 INTERDYP 72
I TE +YP
Sbjct: 224 ITTESNYP 231
>gi|75994624|gb|ABA33833.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 255
Score = 77.8 bits (190), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 50/82 (60%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E + +VI N G
Sbjct: 131 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFLFVIGNGG 188
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 189 IDTEADYPFIGTDGTCDASKEN 210
>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
Length = 394
Score = 77.8 bits (190), Expect = 8e-13, Method: Composition-based stats.
Identities = 35/84 (41%), Positives = 50/84 (59%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + + T L+ +S QQLVDCD++ +S C GG + T YQ
Sbjct: 182 GSCWTFSTTGAMEGANFMKTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMTTAYQ 241
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y ++ G+ E DYP G+ +CK
Sbjct: 242 YALKAGGLQREEDYPYTGIDGSCK 265
>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
sativus]
Length = 235
Score = 77.8 bits (190), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +EGI+KIVT L+ +S Q+LVDCD + ++ C GG ++ +Q++++N G
Sbjct: 26 GSCWAFSTAAVVEGINKIVTGELISLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGG 84
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
+NTE+DYP G C N
Sbjct: 85 LNTEQDYPYRGSDGKCNSLLKN 106
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 77.8 bits (190), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 50/77 (64%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KIV NNLV +S QQL+DCD + ++ C GG + + Y+I+NRG
Sbjct: 161 GCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDN-GCNGGIMSDAFSYIIKNRG 219
Query: 65 INTERDYPNVGVMDNCK 81
I +E YP C+
Sbjct: 220 IASEASYPYQAAEGTCR 236
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 77.8 bits (190), Expect = 8e-13, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 48/77 (62%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+ IVT NL +S Q+L+DCD G + C GG ++ + Y+ N G
Sbjct: 161 GSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNN-GCSGGLMDYAFSYIAANGG 219
Query: 65 INTERDYPNVGVMDNCK 81
++TE YP + C+
Sbjct: 220 LHTEESYPYLMEEGTCR 236
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 77.8 bits (190), Expect = 9e-13, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 51/76 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+K+ T+ + +S Q+LVDCD G + C GG ++ ++++IQNRG
Sbjct: 144 GGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCEGGCMDDAFKFIIQNRG 203
Query: 65 INTERDYPNVGVMDNC 80
+N+E Y GV +C
Sbjct: 204 LNSEARYLYKGVEGHC 219
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 77.8 bits (190), Expect = 9e-13, Method: Composition-based stats.
Identities = 34/75 (45%), Positives = 50/75 (66%), Gaps = 2/75 (2%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ VGA+EG++KIVT LV +S Q L++C+ E+ C GG +ET Y++++ N G+
Sbjct: 175 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIVSNGGL 232
Query: 66 NTERDYPNVGVMDNC 80
T+ DYP V C
Sbjct: 233 GTDNDYPYKAVNGAC 247
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 77.8 bits (190), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 51/68 (75%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EGI++IVT L+ +S Q+LVDCD + ++ C GG ++ ++++I N G
Sbjct: 160 GSCWAFSTVGAVEGINQIVTGELISLSEQELVDCD-KSYNQGCNGGLMDYAFEFIINNGG 218
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 219 IDTEEDYP 226
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 77.8 bits (190), Expect = 9e-13, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI T NLV +S QQL+DCD + + C GG + + Y+ ++ G
Sbjct: 142 GSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIKSGNEGCEGGDMYIAFNYIKKHGG 201
Query: 65 INTERDYPNVGVMDNC 80
I T ++YP G NC
Sbjct: 202 IATAKEYPYKGRDGNC 217
>gi|75994630|gb|ABA33836.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 77.4 bits (189), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 38/78 (48%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI+ I T NLV +S Q+++DCD Q C GG +E + +VI N G
Sbjct: 124 GGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFLFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE DYP VG C
Sbjct: 182 IDTEADYPFVGTDGTCDA 199
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 77.4 bits (189), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 50/68 (73%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GAIEGI+KIVT +LV +S Q+L++CD + + C GG ++ +Q+VI N G
Sbjct: 136 GACWSFSATGAIEGINKIVTGSLVSLSEQELIECD-KSYNDGCGGGLMDYAFQFVINNHG 194
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 195 IDTEEDYP 202
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 77.4 bits (189), Expect = 9e-13, Method: Composition-based stats.
Identities = 35/68 (51%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI+KI T +LV +S Q+L+DCD S C GG ++ Y++VI+N G
Sbjct: 159 GACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNS-GCGGGLMDYAYKFVIKNGG 217
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 218 IDTEEDYP 225
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 77.4 bits (189), Expect = 9e-13, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+LVDCD +++ C GG ++ ++++ + G
Sbjct: 148 GSCWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTS-QNQGCNGGLMDMAFEFIKKKGG 206
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
INTE +YP + C + + N
Sbjct: 207 INTEENYPYMAEGGECDIQKRN 228
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 77.4 bits (189), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 52/68 (76%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI++IVT +L+ +S Q+L+DCD + + C GG ++ +++VI+N G
Sbjct: 138 GACWSFSATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHG 196
Query: 65 INTERDYP 72
I+TE+DYP
Sbjct: 197 IDTEKDYP 204
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 77.4 bits (189), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 52/68 (76%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI++IVT +L+ +S Q+L+DCD + + C GG ++ +++VI+N G
Sbjct: 140 GACWSFSATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHG 198
Query: 65 INTERDYP 72
I+TE+DYP
Sbjct: 199 IDTEKDYP 206
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 77.4 bits (189), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 50/77 (64%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KIV NNLV +S QQL+DCD + ++ C GG + + Y+I+NRG
Sbjct: 137 GCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDN-GCNGGIMSDAFSYIIKNRG 195
Query: 65 INTERDYPNVGVMDNCK 81
I +E YP C+
Sbjct: 196 IASEASYPYQAAEGTCR 212
>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332642714|gb|AEE76235.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 290
Score = 77.4 bits (189), Expect = 9e-13, Method: Composition-based stats.
Identities = 31/68 (45%), Positives = 48/68 (70%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EGI++I T L+ +S Q+LVDCD + C GG + ++++++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211
Query: 65 INTERDYP 72
I T++DYP
Sbjct: 212 IETDQDYP 219
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 77.4 bits (189), Expect = 9e-13, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 45/68 (66%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G+ EG I T NLV +S QQLVDC ++ C GG ++ ++Y+I N+G
Sbjct: 104 GSCWSFSTTGSTEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKG 163
Query: 65 INTERDYP 72
++TE DYP
Sbjct: 164 LDTEEDYP 171
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 77.4 bits (189), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 49/78 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +EGI+KIVT +L+ +S Q+LVDC +R C GG I +Q++I N G
Sbjct: 149 GSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRTQNTRGCDGGSITDGFQFIINNGG 208
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C +
Sbjct: 209 INTEANYPYTAEDGQCNL 226
>gi|33242886|gb|AAQ01147.1| cathepsin [Haplochromis chilotes]
Length = 334
Score = 77.4 bits (189), Expect = 9e-13, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LVD+S Q LVDC + + C GGF+ +QYVI N G
Sbjct: 141 GSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVIDNHG 200
Query: 65 INTERDYPNVGVMDNC 80
I+++ YP +G D C
Sbjct: 201 IDSDASYPYIGRDDQC 216
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 77.4 bits (189), Expect = 9e-13, Method: Composition-based stats.
Identities = 34/68 (50%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI+KI T +LV +S Q+L+DCD S C GG ++ Y++V++N G
Sbjct: 161 GACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNS-GCGGGLMDYAYKFVVKNGG 219
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 220 IDTEEDYP 227
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 77.4 bits (189), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 52/80 (65%), Gaps = 1/80 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIE I+ IVT +L+ +S Q+LVDCD + C GG +++ +Q+VI N G
Sbjct: 159 GSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTN-NYGCEGGDMDSAFQWVIGNGG 217
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+TE DYP GV C +
Sbjct: 218 IDTEADYPYTGVDGTCNTAK 237
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 77.4 bits (189), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 52/68 (76%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI++IVT +L+ +S Q+L+DCD + + C GG ++ +++VI+N G
Sbjct: 140 GACWSFSATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHG 198
Query: 65 INTERDYP 72
I+TE+DYP
Sbjct: 199 IDTEKDYP 206
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 77.4 bits (189), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 52/68 (76%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI++IVT +L+ +S Q+L+DCD + + C GG ++ +++VI+N G
Sbjct: 140 GACWSFSATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHG 198
Query: 65 INTERDYP 72
I+TE+DYP
Sbjct: 199 IDTEKDYP 206
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 77.4 bits (189), Expect = 9e-13, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 50/76 (65%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+KIVT V +S Q+LVDCD + + C GG ++ ++++IQN G
Sbjct: 150 GSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCD-RAYNEGCNGGLMDYAFEFIIQNGG 208
Query: 65 INTERDYPNVGVMDNC 80
I+T++DYP G C
Sbjct: 209 IDTDKDYPYRGFDGIC 224
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 52/68 (76%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VG++EG++ I T LV +S Q+LVDCD + +++ C GG ++ ++++I+N G
Sbjct: 153 GSCWAFSAVGSVEGVNAIKTGELVSLSEQELVDCDRK-QNQGCNGGLMDYAFEFIIKNGG 211
Query: 65 INTERDYP 72
I+TE+DYP
Sbjct: 212 IDTEKDYP 219
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 77.4 bits (189), Expect = 1e-12, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 167 GSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGG 225
Query: 65 INTERDYPNVGVMDNC 80
I++E DYP V C
Sbjct: 226 IDSEDDYPYKAVDGRC 241
>gi|214015364|gb|ACJ62298.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 252
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 50/82 (60%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V IEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 129 GGCWAFSAVAGIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMENAFRFVIGNGG 186
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 187 IDTEADYPFIGTDGTCDASKEN 208
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 50/80 (62%), Gaps = 3/80 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG I T NLV +S QQLVDC ++ C GG ++ ++Y+I N G
Sbjct: 128 GSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGG 187
Query: 65 INTERDYPNV---GVMDNCK 81
++TE+DYP GV D K
Sbjct: 188 LDTEQDYPYTARDGVCDKSK 207
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 77.4 bits (189), Expect = 1e-12, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+LVDCD +++ C GG ++ ++++ + G
Sbjct: 148 GSCWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTS-QNQGCNGGLMDMAFEFIKKKGG 206
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
INTE +YP + C + + N
Sbjct: 207 INTEENYPYMAEGGECDIQKRN 228
>gi|297740510|emb|CBI30692.3| unnamed protein product [Vitis vinifera]
Length = 377
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EGI+ IVT +L+ +S Q+LVDCD + C GG+++ +++VI N G
Sbjct: 34 GSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVISNGG 91
Query: 65 INTERDYPNVGVMDNC 80
I++E DYP G C
Sbjct: 92 IDSESDYPYTGTDGTC 107
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 50/80 (62%), Gaps = 3/80 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG I T NLV +S QQLVDC ++ C GG ++ ++Y+I N G
Sbjct: 138 GSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGG 197
Query: 65 INTERDYPNV---GVMDNCK 81
++TE+DYP GV D K
Sbjct: 198 LDTEQDYPYTARDGVCDKSK 217
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 1/75 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI++I T L+ +S Q+LVDCD + C GG ++ ++++I+N G
Sbjct: 151 GSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTS-YNGGCGGGLMDYAFKFIIENGG 209
Query: 65 INTERDYPNVGVMDN 79
I+TE DYP DN
Sbjct: 210 IDTEEDYPYTATDDN 224
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 33/82 (40%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 160 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 218
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP C ++ N
Sbjct: 219 IDSEEDYPYRAADQKCDQYRKN 240
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 32/71 (45%), Positives = 46/71 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI T LV +S QQLVDCD G+ + C GG ++ +QY+ + G
Sbjct: 161 GCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISRQGG 220
Query: 65 INTERDYPNVG 75
+ +E YP G
Sbjct: 221 LASESAYPYSG 231
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 49/77 (63%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD G + C GG ++ + Y+ N G
Sbjct: 175 GSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNN-GCNGGLMDYAFSYIAHNGG 233
Query: 65 INTERDYPNVGVMDNCK 81
++TE YP + C+
Sbjct: 234 LHTEEAYPYLMEEGTCQ 250
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 160 GSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 218
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP C ++ N
Sbjct: 219 IDSEEDYPYKASDGRCDQYRKN 240
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +EGI++I T L+ +S QQL+DCD + + C GG +E+ ++++ +N G
Sbjct: 150 GSCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCD-RSDDHGCNGGLMESAFEFIKKNGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP + C + + N
Sbjct: 209 ITTENNYPYKAKDERCDMLKMN 230
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +EGI++I T L+ +S QQL+DCD + + C GG +E+ ++++ +N G
Sbjct: 148 GSCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCD-RSDDHGCNGGLMESAFEFIKKNGG 206
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP + C + + N
Sbjct: 207 ITTENNYPYKAKDERCDMLKMN 228
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 2/75 (2%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ VGA+EG++KIVT LV +S Q L++C+ E+ C GG +ET Y+++++N G+
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMKNGGL 224
Query: 66 NTERDYPNVGVMDNC 80
T+ DYP V C
Sbjct: 225 GTDNDYPYKAVNGVC 239
>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
Length = 333
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 36/77 (46%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+E K+ T NLV +S Q LVDC + + C GG+I +QYVI N G
Sbjct: 139 GSCWAFSAVGALECQLKLKTGNLVSLSPQNLVDCSSAFGNHGCNGGYISAAFQYVIYNNG 198
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP G C+
Sbjct: 199 IDSEASYPYTGQSGTCR 215
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 162 GSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 220
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP C ++ N
Sbjct: 221 IDSEEDYPYKASDGRCDQYRKN 242
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 46/68 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA + A+E I++IVT NL+ +S QQ+VDC + + C GG YQ++I N G
Sbjct: 155 GSCWTFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDNGG 214
Query: 65 INTERDYP 72
INTE +YP
Sbjct: 215 INTEANYP 222
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 46/68 (67%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+LVDCD G + C GG ++ + Y+ N G
Sbjct: 178 GSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDGNN-GCNGGLMDYAFSYIAHNGG 236
Query: 65 INTERDYP 72
++TE YP
Sbjct: 237 LHTEEAYP 244
>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LVD+S Q LVDC + + C GGF+ +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHQAFQYVIDNQG 203
Query: 65 INTERDYPNVGVMDNCK 81
I+++ YP G C+
Sbjct: 204 IDSDASYPYTGRNGECR 220
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 51/78 (65%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + C GG ++ + +++QN G
Sbjct: 155 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIVQNGG 213
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 214 LHKEDDYPYIMEESTCEM 231
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 55/85 (64%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EG++++ T NL+ +S Q+LVDCD NQG C GG + +Q++I+
Sbjct: 160 GSCWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQG----CNGGDMGYAFQFIIK 215
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI++E DYP G C ++ N
Sbjct: 216 NGGIDSEEDYPYTGKDGKCDSYRQN 240
>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
Length = 369
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 36/84 (42%), Positives = 49/84 (58%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + + T LV +S QQLVDCD+ + C GG + T Y+
Sbjct: 159 GSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHLCDPEEAGACDSGCNGGLMTTAYE 218
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
YV+Q+ G+ E+DYP G CK
Sbjct: 219 YVLQSGGLEKEKDYPYTGKDGTCK 242
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++E +++IVT +V +S Q+LV+C G + C GG ++ + ++I+N G
Sbjct: 221 GSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGG 280
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 281 IDTEGDYPYKAVDGKCDINREN 302
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 49/78 (62%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ V A+EGI+KI+T NL+ +S Q+LVDC +R C G++ +Q++I N GI
Sbjct: 149 SCWAFSAVAAVEGINKIMTGNLLSLSEQELVDCGRTQSTRGCNRGYMTDAFQFIINNGGI 208
Query: 66 NTERDYPNVGVMDNCKVF 83
NTE +YP C +
Sbjct: 209 NTEDNYPYTAQDGQCNRY 226
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 48/78 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + +EGI+KIVT L+ +S Q+L+DC +R C GG+I +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C V
Sbjct: 209 INTEENYPYTAQDGECNV 226
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 50/78 (64%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+LVDCD + C GG ++ + Y+I N G
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTT-YNNGCNGGLMDYAFAYIISNGG 198
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 199 LHKEEDYPYIMEEGTCEM 216
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT L+ +S Q+LVDCD S C GG ++ Y+++I N G
Sbjct: 169 GSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNS-GCDGGLMDYAYEFIINNGG 227
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+T+ DYP C ++ N
Sbjct: 228 IDTDADYPYTAKDGKCDQYRKN 249
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 48/78 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + +EGI+KIVT L+ +S Q+L+DC +R C GG+I +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C V
Sbjct: 209 INTEENYPYTAQDGECNV 226
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI KI T LV ++ Q+LVDCD E + C GG ++ ++++I+N G
Sbjct: 39 GCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHDEDQGCEGGLMDDAFKFIIKNGG 98
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP CK
Sbjct: 99 LTTESSYPYTAADGKCK 115
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 48/78 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + +EGI+KIVT L+ +S Q+L+DC +R C GG+I +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C V
Sbjct: 209 INTEENYPYTAQDGECNV 226
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 2/75 (2%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ VGA+EG++KIVT LV +S Q L++C+ E+ C GG +ET Y+++++N G+
Sbjct: 160 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKLETAYEFIMKNGGL 217
Query: 66 NTERDYPNVGVMDNC 80
T+ DYP V C
Sbjct: 218 GTDNDYPYKAVNGVC 232
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 33/82 (40%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++E I++IVT +V +S Q+LV+C G + C GG ++ + ++I+N G
Sbjct: 167 GSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGG 226
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 227 IDTEDDYPYKAVDGKCDINRRN 248
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 47/68 (69%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I TN LV +S Q+LVDCD E++ C GG +E ++++ + RG
Sbjct: 150 GSCWAFSTIVAVEGINYIKTNELVSLSEQELVDCDTT-ENQGCNGGLMEYAFEFIKKKRG 208
Query: 65 INTERDYP 72
I TE YP
Sbjct: 209 ITTESTYP 216
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 48/77 (62%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I T NL +S QQLVDCD +G + C GG ++ +QY+ ++ G
Sbjct: 158 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNA-GCDGGLMDYAFQYIAKHGG 216
Query: 65 INTERDYPNVGVMDNCK 81
+ E YP +CK
Sbjct: 217 VAAEDAYPYKARQASCK 233
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI + LV +S Q+ DCD + ++ C GG ++T + ++ +N G
Sbjct: 108 GSCWAFSAVAAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGG 167
Query: 65 INTERDYPNVGVMDNC 80
+ T +DYP GV C
Sbjct: 168 LTTSKDYPYEGVDGTC 183
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + A+EGI K+ T NLV +S Q+ VDCD C GG+++ +++VI+N G
Sbjct: 113 GCCWAFSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGG 172
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP V CK
Sbjct: 173 LATESSYPYKVVDGKCK 189
>gi|327322928|gb|AEA48885.1| cathepsin S [Oplegnathus fasciatus]
Length = 337
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T L+D+S Q LVDC ++ + C GGF+ +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLLDLSPQNLVDCSSKYGNHGCNGGFMHRAFQYVIDNQG 203
Query: 65 INTERDYPNVGVMDNC 80
I+++ YP G C
Sbjct: 204 IDSDASYPYTGQSQQC 219
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG +I + N+V +S Q LVDC Q ++ C GG + ++Y+I N G
Sbjct: 139 GSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGG 198
Query: 65 INTERDYPNVGVMDNCK 81
I TE YP CK
Sbjct: 199 IATESSYPYTAAQGRCK 215
>gi|63101996|gb|AAH95694.1| Cathepsin S, b.1 [Danio rerio]
Length = 330
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 36/72 (50%), Positives = 47/72 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T LVD+S Q LVDC ++ ++ C GGF+ +QYVI N G
Sbjct: 137 GSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVIDNGG 196
Query: 65 INTERDYPNVGV 76
I ++ YP GV
Sbjct: 197 IASDSAYPYRGV 208
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LV +S Q LVDCD G+ C GG+++ +QYV N+G
Sbjct: 161 GSCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKG 220
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP G C+
Sbjct: 221 IDTEASYPYKGRDGRCR 237
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 2/75 (2%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ VGA+EG++KIVT LV +S Q L++C+ E+ C GG +ET Y+++++N G+
Sbjct: 153 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKLETAYEFIMKNGGL 210
Query: 66 NTERDYPNVGVMDNC 80
T+ DYP V C
Sbjct: 211 GTDNDYPYKAVNGVC 225
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEGI K+ T NL+ +S QQLVDC ++ C GG ++T +QY+I+N G
Sbjct: 111 GCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDC--TAGNKGCQGGLMDTAFQYIIRNGG 168
Query: 65 INTERDYPNVGVMDNC 80
+ +E +YP GV C
Sbjct: 169 LTSEDNYPYQGVDGTC 184
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD + C GG ++ ++++I+N G
Sbjct: 156 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 214
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP C ++ N
Sbjct: 215 IDTEEDYPYNARDGRCDQYRKN 236
>gi|62955291|ref|NP_001017661.1| cathepsin S, b.2 precursor [Danio rerio]
gi|62204682|gb|AAH93339.1| Cathepsin S, b.2 [Danio rerio]
gi|182891354|gb|AAI64362.1| Ctssb.2 protein [Danio rerio]
Length = 330
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG T LVD+S Q LVDC ++ + C GG++ +QYVI N G
Sbjct: 137 GSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGG 196
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP G +C+
Sbjct: 197 IDSESSYPYQGTQGSCR 213
>gi|75994626|gb|ABA33834.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
Length = 248
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKV 82
I++E DYP +G C
Sbjct: 182 IDSEADYPFIGTDGTCDA 199
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 36/78 (46%), Positives = 50/78 (64%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++I T LV +S QQLVDCD + E+ C GG +E ++++ QN G
Sbjct: 150 GSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCDTE-ENEGCNGGLMEYAFEFIKQN-G 207
Query: 65 INTERDYPNVGVMDNCKV 82
I TE +YP C V
Sbjct: 208 ITTESNYPYAAKDGTCDV 225
>gi|310942958|pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942959|pdb|3P5V|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942961|pdb|3P5X|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 49/78 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GS W F+ + A+EGI+KI T +L+ +S Q+LVDC +R C GGF+ +Q++I N G
Sbjct: 23 GSXWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQNTRGCDGGFMTDGFQFIINNGG 82
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C +
Sbjct: 83 INTEANYPYTAEEGQCNL 100
>gi|463046|gb|AAA49207.1| cysteine proteinase [Cyprinus carpio]
Length = 331
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 36/77 (46%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG T LVD+S Q LVDC + + C GG + +QYVI N G
Sbjct: 137 GSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSSYGNYGCGGGLMSAAFQYVIDNGG 196
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP GV C+
Sbjct: 197 IDSESSYPYEGVQGQCR 213
>gi|47213723|emb|CAF95154.1| unnamed protein product [Tetraodon nigroviridis]
Length = 334
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG+ T LVD+S Q LVDC + + C GG++ +QYVI N G
Sbjct: 143 GSCWAFSAAGALEGLLAKTTGKLVDLSPQNLVDCTRKYGNHGCNGGYMHHTFQYVIDNHG 202
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP G C+
Sbjct: 203 IDSEASYPYTGQEGVCR 219
>gi|312100382|gb|ADQ27799.1| mitogenic proteinase [Vasconcellea cundinamarcensis]
Length = 214
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 52/81 (64%), Gaps = 3/81 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ V +EGI+KIVT L+ +S Q+L+DCD + S C GG+ T QYV+
Sbjct: 20 NPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR--SHGCNGGYQTTSLQYVVD 77
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G++TE +YP NC+
Sbjct: 78 N-GVHTEYEYPYEKKQGNCRA 97
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 35/82 (42%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+E I++IVT +L+ +S Q+LVDCD + C GG ++ + ++I N G
Sbjct: 151 GSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP G + C V + N
Sbjct: 210 IDTEDDYPYKGKDERCDVNRKN 231
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 35/71 (49%), Positives = 50/71 (70%), Gaps = 1/71 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT L+ +S Q+LVDCD + + C GG ++ +Q++I N G
Sbjct: 171 GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCD-KSFNMGCNGGLMDYAFQFIIGNGG 229
Query: 65 INTERDYPNVG 75
I+TE DYP G
Sbjct: 230 IDTEEDYPYKG 240
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI+K+ T L+ +S Q+LVDCD G + C GG ++ + ++ N G
Sbjct: 144 GCCWAFSAVAATEGITKLTTGELISLSEQELVDCDTSGVDQGCEGGLMDNAFTFIQHNHG 203
Query: 65 INTERDYPNVGVMDNC 80
+ +E +YP GV C
Sbjct: 204 LASEANYPYKGVDGTC 219
>gi|224809458|ref|NP_001019580.2| cathepsin S, b.1 precursor [Danio rerio]
gi|63101450|gb|AAH95788.1| Cathepsin S, b.1 [Danio rerio]
gi|77748418|gb|AAI07613.1| Cathepsin S, b.1 [Danio rerio]
Length = 330
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 36/72 (50%), Positives = 47/72 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T LVD+S Q LVDC ++ ++ C GGF+ +QYVI N G
Sbjct: 137 GSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVIDNGG 196
Query: 65 INTERDYPNVGV 76
I ++ YP GV
Sbjct: 197 IASDSAYPYRGV 208
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 52/78 (66%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + + C GG ++ + ++++N G
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENDG 212
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 213 LHKEEDYPYIMEEGTCEM 230
>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 330
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LVD+S Q LVDC + + C GGF+ +QYVI N G
Sbjct: 137 GSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVIDNHG 196
Query: 65 INTERDYPNVGVMDNCK 81
I+++ YP G + C+
Sbjct: 197 IDSDASYPYTGRDEQCR 213
>gi|342305192|dbj|BAK55650.1| cathepsin S [Oplegnathus fasciatus]
Length = 337
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T L+D+S Q LVDC ++ + C GGF+ +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLLDLSPQNLVDCSSKYGNHGCNGGFMHRAFQYVIDNQG 203
Query: 65 INTERDYPNVGVMDNC 80
I+++ YP G C
Sbjct: 204 IDSDASYPYTGQSQQC 219
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 48/76 (63%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + C GG ++ + Y++ N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFAYIVANGG 211
Query: 65 INTERDYPNVGVMDNC 80
++ E DYP + C
Sbjct: 212 LHKEEDYPYIMEEGTC 227
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 51/78 (65%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + C GG ++ + ++++N G
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIVKNGG 212
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 213 LHKEEDYPYIMEESTCEM 230
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI T LV +S Q+L+DCD + C GG++ ++++ QN G
Sbjct: 151 GSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGG 210
Query: 65 INTERDYPNVGVMDNC 80
I T R+YP +G C
Sbjct: 211 ITTARNYPYIGEQGIC 226
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 52/76 (68%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EGI+ +VT +L+ +S Q+LV+CD + C GG+++ +++VI N G
Sbjct: 162 GSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS--NYGCEGGYMDYAFEWVINNGG 219
Query: 65 INTERDYPNVGVMDNC 80
I++E DYP GV C
Sbjct: 220 IDSESDYPYTGVDGTC 235
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 52/78 (66%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + + C GG ++ + ++++N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENGG 211
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 212 LHKEEDYPYIMEEGTCEM 229
>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
Length = 401
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 50/82 (60%), Gaps = 9/82 (10%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC-----DNQGESRSCVGGFIETIYQYV 59
GSCW F+ G+ EGI+ I T+ LV +S Q LVDC DN G C GGF++ ++Y+
Sbjct: 206 GSCWAFSTTGSTEGINAITTSRLVPLSEQNLVDCATAAYDNYG----CNGGFMDNAFRYI 261
Query: 60 IQNRGINTERDYPNVGVMDNCK 81
I N+GI++E YP V C+
Sbjct: 262 IDNKGIDSEASYPYVAADGQCR 283
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 48/77 (62%), Gaps = 2/77 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I++I T NL+ +S QQLVDC+ + + C GG YQY+I N G
Sbjct: 156 GSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKK--NHGCKGGAFVYAYQYIIDNGG 213
Query: 65 INTERDYPNVGVMDNCK 81
I+TE +YP V C+
Sbjct: 214 IDTEANYPYKAVQGPCR 230
>gi|410904753|ref|XP_003965856.1| PREDICTED: cathepsin S-like [Takifugu rubripes]
Length = 334
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LVD+S Q LVDC + + C GG++ +QYVI N+G
Sbjct: 141 GSCWAFSAAGALEGQLAKTTGRLVDLSPQNLVDCSGKYGNHGCNGGYMHRAFQYVIDNQG 200
Query: 65 INTERDYPNVGVMDNC 80
I++E YP G + C
Sbjct: 201 IDSEASYPYRGQVQQC 216
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 56/82 (68%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++E I++IVT +L+ +S Q+LVDCD + + C GG ++ ++++I+N G
Sbjct: 150 GSCWAFSTVASVEAINQIVTGDLIALSEQELVDCD-RSYNEGCNGGLMDYAFEFIIENGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
++TE DYP G +C ++ N
Sbjct: 209 LDTEEDYPYYGFDSSCIQYKKN 230
>gi|125592009|gb|EAZ32359.1| hypothetical protein OsJ_16569 [Oryza sativa Japonica Group]
Length = 480
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 31/82 (37%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+++VT ++ +S Q+LV+C G++ C GG ++ + ++I+N G
Sbjct: 177 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 236
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 237 IDTEDDYPYKAVDGKCDINREN 258
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI T LV +S Q+L+DCD + C GG++ ++++ QN G
Sbjct: 147 GSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGG 206
Query: 65 INTERDYPNVGVMDNC 80
I T R+YP +G C
Sbjct: 207 ITTARNYPYIGEQGIC 222
>gi|116666824|pdb|2BDZ|A Chain A, Mexicain From Jacaratia Mexicana
gi|116666825|pdb|2BDZ|B Chain B, Mexicain From Jacaratia Mexicana
gi|116666826|pdb|2BDZ|C Chain C, Mexicain From Jacaratia Mexicana
gi|116666827|pdb|2BDZ|D Chain D, Mexicain From Jacaratia Mexicana
Length = 214
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 52/81 (64%), Gaps = 3/81 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ V IEGI+KI+T L+ +S Q+L+DC+ + S C GG+ T QYV+
Sbjct: 20 NPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERR--SHGCDGGYQTTSLQYVVD 77
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G++TER+YP C+
Sbjct: 78 N-GVHTEREYPYEKKQGRCRA 97
>gi|351705687|gb|EHB08606.1| Cathepsin S [Heterocephalus glaber]
Length = 331
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 47/77 (61%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQG-ESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+EG K+ T LV +S Q LVDC + ++ C GGF+ +QYVI N
Sbjct: 137 GSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEKYRNKGCSGGFMTEAFQYVIDNN 196
Query: 64 GINTERDYPNVGVMDNC 80
GI++E YP + C
Sbjct: 197 GIDSETSYPYKATDEKC 213
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 56/82 (68%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++E I++IVT +L+ +S Q+LVDCD + + C GG ++ ++++I+N G
Sbjct: 150 GSCWAFSTVASVEAINQIVTGDLIALSEQELVDCD-RSYNEGCNGGLMDYAFEFIIENGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
++TE DYP G +C ++ N
Sbjct: 209 LDTEEDYPYYGFDSSCIQYKKN 230
>gi|157829826|pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+
Length = 218
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 48/78 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + +EGI+KIVT L+ +S Q+L+DC +R C GG+I +Q++I N G
Sbjct: 23 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 82
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C V
Sbjct: 83 INTEENYPYTAQDGECNV 100
>gi|440906717|gb|ELR56946.1| Cathepsin K [Bos grunniens mutus]
Length = 338
Score = 76.3 bits (186), Expect = 3e-12, Method: Composition-based stats.
Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 146 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 203
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG +NC
Sbjct: 204 IDSEDAYPYVGQDENC 219
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 76.3 bits (186), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ V +EGI+KIVT L+ +S Q+L+DCD + S C GG+ T QYV
Sbjct: 128 NPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR--SHGCKGGYQTTSLQYVAD 185
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G++TE++YP C+
Sbjct: 186 N-GVHTEKEYPYEKKQGKCRA 205
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 76.3 bits (186), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 52/78 (66%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + + C GG ++ + ++++N G
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENGG 212
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 213 LHKEEDYPYIMEEGTCEM 230
>gi|255636047|gb|ACU18368.1| unknown [Glycine max]
Length = 227
Score = 76.3 bits (186), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 48/68 (70%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + E+ C GG +E+ +Q++ Q G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTE-ENAGCNGGLMESAFQFIKQKGG 208
Query: 65 INTERDYP 72
I TE YP
Sbjct: 209 ITTESYYP 216
>gi|222632170|gb|EEE64302.1| hypothetical protein OsJ_19139 [Oryza sativa Japonica Group]
Length = 1105
Score = 76.3 bits (186), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI+KI T +L+ +S Q+L+DCD S C GG ++ Y++V++N G
Sbjct: 151 GACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSYNS-GCGGGLMDYAYKFVVKNGG 209
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 210 IDTEADYP 217
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/82 (37%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+++VT ++ +S Q+LV+C G++ C GG ++ + ++I+N G
Sbjct: 163 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 222
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 223 IDTEDDYPYKAVDGKCDINREN 244
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 48/78 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + +EGI+KIVT L+ +S Q+L+DC +R C GG+I +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C +
Sbjct: 209 INTEENYPYTAQDGECNL 226
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++E +++IVT +V +S Q+LV+C G + C GG ++ + ++I+N G
Sbjct: 161 GSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGG 220
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 221 IDTEGDYPYKAVDGKCDINREN 242
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 33/82 (40%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++I T L+ +S Q+LVDCD + + C GG ++ ++++I N G
Sbjct: 155 GSCWAFSTISAVEGINQIATGKLITLSEQELVDCD-RSYNEGCNGGLMDYAFEFIINNGG 213
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+T+ DYP G C ++ N
Sbjct: 214 IDTDVDYPYTGRDGKCDQYRKN 235
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 52/78 (66%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + + C GG ++ + ++++N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENGG 211
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 212 LHKEEDYPYIMEEGTCEM 229
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 33/71 (46%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEGI++I LV +S Q+LVDCD + + C GG++ +++V+ N G
Sbjct: 152 GSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEFVMNNSG 209
Query: 65 INTERDYPNVG 75
+ TER+YP G
Sbjct: 210 LTTERNYPYQG 220
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 35/82 (42%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + ++ C GG +E+ ++++ Q G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTK-KNAGCNGGLMESAFEFIKQKGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP C + N
Sbjct: 209 ITTESNYPYTAQDGTCDASKAN 230
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 48/78 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + +EGI+KIVT L+ +S Q+L+DC +R C GG+I +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C +
Sbjct: 209 INTEENYPYTAQDGECNL 226
>gi|77735825|ref|NP_001029607.1| cathepsin K precursor [Bos taurus]
gi|59858469|gb|AAX09069.1| cathepsin K preproprotein [Bos taurus]
gi|83638771|gb|AAI09854.1| Cathepsin K [Bos taurus]
gi|296489554|tpg|DAA31667.1| TPA: cathepsin K [Bos taurus]
Length = 334
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 142 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 199
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG +NC
Sbjct: 200 IDSEDAYPYVGQDENC 215
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 35/82 (42%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + ++ C GG +E+ ++++ Q G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTK-KNAGCNGGLMESAFEFIKQKGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP C + N
Sbjct: 209 ITTESNYPYTAQDGTCDASKAN 230
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++E +++IVT +V +S Q+LV+C G + C GG ++ + ++I+N G
Sbjct: 164 GSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGG 223
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 224 IDTEGDYPYKAVDGKCDINREN 245
>gi|47523662|ref|NP_999467.1| cathepsin K precursor [Sus scrofa]
gi|15213940|sp|Q9GLE3.1|CATK_PIG RecName: Full=Cathepsin K; Flags: Precursor
gi|10048286|gb|AAG12340.1|AF292030_1 cathepsin K precursor [Sus scrofa]
Length = 330
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG +NC
Sbjct: 196 IDSEDAYPYVGQDENC 211
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/68 (45%), Positives = 48/68 (70%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I TN L+ +S Q+LVDC N GE+ C GG ++ ++++ + +G
Sbjct: 148 GSCWAFSTIVAVEGINFIKTNKLISLSEQELVDC-NTGENHGCNGGLMDYAFEFITKQKG 206
Query: 65 INTERDYP 72
I TE +YP
Sbjct: 207 ITTEANYP 214
>gi|67678376|gb|AAH96862.1| Cathepsin S, b.2 [Danio rerio]
Length = 330
Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG T LVD+S Q LVDC ++ + C GG++ +QYVI N G
Sbjct: 137 GSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGG 196
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP G +C+
Sbjct: 197 IDSESSYPYQGTQGSCR 213
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/82 (37%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+++VT ++ +S Q+LV+C G++ C GG ++ + ++I+N G
Sbjct: 162 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 221
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 222 IDTEDDYPYKAVDGKCDINREN 243
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/82 (37%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+++VT ++ +S Q+LV+C G++ C GG ++ + ++I+N G
Sbjct: 158 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 217
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 218 IDTEDDYPYKAVDGKCDINREN 239
>gi|109940312|sp|Q5E968.2|CATK_BOVIN RecName: Full=Cathepsin K; Flags: Precursor
Length = 329
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG +NC
Sbjct: 195 IDSEDAYPYVGQDENC 210
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 81 GSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 139
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP C ++ N
Sbjct: 140 IDSEEDYPYKASDGRCDQYRKN 161
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 50/78 (64%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+LVDCD + C GG ++ + Y+I N G
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTT-NNYGCNGGLMDYAFSYIISNGG 198
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 199 LHKEVDYPYIMEEGTCEM 216
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI K+ T L+ +S Q+LVDCD G + C GG I+ +Q+++ N G
Sbjct: 156 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGG 215
Query: 65 INTERDYPNVGVMDNCK 81
+ E +YP CK
Sbjct: 216 LTAEANYPYTAEDGRCK 232
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 32/67 (47%), Positives = 48/67 (71%), Gaps = 2/67 (2%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ VGA+EG++KIVT LV +S Q L++C+ E+ C GG +ET Y++++ N G+
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMNNGGL 224
Query: 66 NTERDYP 72
T+ DYP
Sbjct: 225 GTDNDYP 231
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/82 (37%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+++VT ++ +S Q+LV+C G++ C GG ++ + ++I+N G
Sbjct: 190 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 249
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 250 IDTEDDYPYKAVDGKCDINREN 271
>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
Length = 328
Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KIVT +L+ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 46 GSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGG 104
Query: 65 INTERDYPNVGVMDNC 80
I++E DYP V C
Sbjct: 105 IDSEDDYPYKAVDGRC 120
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T LV +S Q+LVDCD GE + C GG ++ ++++I N G
Sbjct: 145 GCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNGG 204
Query: 65 INTERDYPNVGVMDNCK 81
+ E YP CK
Sbjct: 205 LTQESSYPYDAEDGKCK 221
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 44/76 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA GA+EG +I T N+V S Q LVDC + + C GG + + ++Y+I N G
Sbjct: 141 GSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDCSGRYGNNGCDGGLMTSAFKYIIDNDG 200
Query: 65 INTERDYPNVGVMDNC 80
I TE YP + C
Sbjct: 201 IATEEAYPYTATQNRC 216
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD + + C GG ++ +++VI+N G
Sbjct: 160 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-RSYNEGCDGGLMDYAFEFVIKNGG 218
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 219 IDTEEDYP 226
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T LV +S Q+LVDCD GE + C GG ++ ++++I N G
Sbjct: 145 GCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIISNGG 204
Query: 65 INTERDYPNVGVMDNCK 81
+ E YP CK
Sbjct: 205 LTQESSYPYDAEDGKCK 221
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 32/71 (45%), Positives = 50/71 (70%), Gaps = 1/71 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI++I T L+ +S Q+LVDCD + C GG ++ ++++I+N G
Sbjct: 151 GSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTS-YNDGCGGGLMDYAFKFIIENGG 209
Query: 65 INTERDYPNVG 75
I+TE DYP +
Sbjct: 210 IDTEEDYPYIA 220
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 52/78 (66%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + + C GG ++ + ++++N G
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENGG 212
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 213 LHKEEDYPYIMEEGACEM 230
>gi|351694420|gb|EHA97338.1| Cathepsin K [Heterocephalus glaber]
Length = 329
Score = 75.5 bits (184), Expect = 3e-12, Method: Composition-based stats.
Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV QNRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQQNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210
>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 75.5 bits (184), Expect = 3e-12, Method: Composition-based stats.
Identities = 36/77 (46%), Positives = 51/77 (66%), Gaps = 3/77 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG + + T LV +S QQLVDCD E C GGF++T ++YV++ +G
Sbjct: 129 GSCWAFSATGALEGGNFVATGKLVSLSEQQLVDCDT--EDAGCGGGFMDTAFEYVMK-KG 185
Query: 65 INTERDYPNVGVMDNCK 81
+ TE DYP ++CK
Sbjct: 186 LCTEEDYPYHAKDEDCK 202
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 75.5 bits (184), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++EG + T LV +S Q LVDC C GG+++ ++YVIQNRG
Sbjct: 143 GSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRG 202
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP + ++C+
Sbjct: 203 IDTEASYPYKAIDESCE 219
>gi|403302732|ref|XP_003942007.1| PREDICTED: cathepsin S isoform 2 [Saimiri boliviensis boliviensis]
Length = 289
Score = 75.5 bits (184), Expect = 3e-12, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N+G
Sbjct: 96 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKG 155
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP C+
Sbjct: 156 IDSEASYPYKATDQKCQ 172
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 75.5 bits (184), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++EG + T LV +S Q LVDC C GG+++ ++YVIQNRG
Sbjct: 143 GSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRG 202
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP + ++C+
Sbjct: 203 IDTEASYPYKAIDESCE 219
>gi|426216528|ref|XP_004002514.1| PREDICTED: cathepsin K [Ovis aries]
Length = 330
Score = 75.5 bits (184), Expect = 3e-12, Method: Composition-based stats.
Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG +NC
Sbjct: 196 IDSEDAYPYVGQDENC 211
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 75.5 bits (184), Expect = 3e-12, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 45/68 (66%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I T NL +S Q+L+DCD S C GG ++ +QY+I G
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNS-GCNGGLMDYAFQYIISTGG 217
Query: 65 INTERDYP 72
++ E DYP
Sbjct: 218 LHKEDDYP 225
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 75.5 bits (184), Expect = 4e-12, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T LV +S Q+LVDCD G + C GG ++ ++++I+N G
Sbjct: 145 GCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGVDQGCEGGEMDNAFKFIIKNGG 204
Query: 65 INTERDYPNVGVMDNCK 81
+ TE +YP CK
Sbjct: 205 LTTEANYPYTAQDGQCK 221
>gi|414591548|tpg|DAA42119.1| TPA: hypothetical protein ZEAMMB73_388689, partial [Zea mays]
Length = 229
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 53/76 (69%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EG++KI+T LV +S Q+LVDCD+ +++ C GG ++ +QY+ +N G
Sbjct: 13 GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDV-DNQGCDGGLMDYAFQYIQRNGG 71
Query: 65 INTERDYPNVGVMDNC 80
+ TE +YP + +C
Sbjct: 72 VTTESNYPYLAEQRSC 87
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 50/71 (70%), Gaps = 1/71 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+KIVT V +S Q+LVDCD + ++ C GG ++ ++++IQN G
Sbjct: 152 GSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCD-RAYNQGCNGGLMDYAFEFIIQNGG 210
Query: 65 INTERDYPNVG 75
I+T++DYP G
Sbjct: 211 IDTDKDYPYRG 221
>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 377
Score = 75.5 bits (184), Expect = 4e-12, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 49/84 (58%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
GSCW F+ G+IEG + I T L+++S QQLVDCD+Q + C+GG + Y+
Sbjct: 160 GSCWAFSTTGSIEGANFIATGKLLNLSEQQLVDCDSQCDITESTTCDNGCMGGLMTNAYK 219
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++Q+ G+ E YP G CK
Sbjct: 220 YLLQSGGLEEESSYPYTGAKGECK 243
>gi|313118766|gb|ADR32295.1| C14 cysteine protease [Solanum demissum]
gi|313118774|gb|ADR32299.1| C14 cysteine protease [Solanum verrucosum]
gi|313118776|gb|ADR32300.1| C14 cysteine protease [Solanum verrucosum]
gi|313118778|gb|ADR32301.1| C14 cysteine protease [Solanum verrucosum]
Length = 217
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD + + C GG ++ +++VI N G
Sbjct: 23 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNEGCDGGLMDYAFEFVINNGG 81
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP D C ++ N
Sbjct: 82 IDSEEDYPYKERNDVCDQYRKN 103
>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
Length = 330
Score = 75.5 bits (184), Expect = 4e-12, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N+G
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKG 196
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP + C+
Sbjct: 197 IDSEASYPYKAMDQKCQ 213
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 75.5 bits (184), Expect = 4e-12, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T LV +S Q+LVDCD GE + C GG ++ ++++I N G
Sbjct: 145 GCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNGG 204
Query: 65 INTERDYPNVGVMDNCK 81
+ E YP CK
Sbjct: 205 LTQESSYPYDAEDGKCK 221
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 75.5 bits (184), Expect = 4e-12, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 51/78 (65%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD S C GG ++ + ++++N G
Sbjct: 114 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNS-GCNGGLMDYAFSFIVENGG 172
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 173 LHKEDDYPYIMEEGTCEM 190
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 33/67 (49%), Positives = 46/67 (68%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ V A+EGI+KIVT NL+ +S Q+LVDC ++ C G + +Q++I N GI
Sbjct: 149 SCWAFSAVTAVEGINKIVTGNLISLSEQELVDCGRTQRTKGCNRGLMTDAFQFIINNGGI 208
Query: 66 NTERDYP 72
NTE +YP
Sbjct: 209 NTEDNYP 215
>gi|59798094|sp|P84347.1|MEX2_JACME RecName: Full=Chymomexicain
Length = 215
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 49/81 (60%), Gaps = 2/81 (2%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ V +EGI+KI T L+ +S Q+L+DCD + S C GG+ QYV
Sbjct: 20 NPCGSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRR--SHGCKGGYQTGSIQYVAD 77
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G++TE++YP C+
Sbjct: 78 NGGVHTEKEYPYEKKQGKCRA 98
>gi|313118762|gb|ADR32293.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD + + C GG ++ +++VI N G
Sbjct: 23 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNEGCDGGLMDYAFEFVINNGG 81
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP D C ++ N
Sbjct: 82 IDSEEDYPYKERNDVCDQYRKN 103
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 75.5 bits (184), Expect = 4e-12, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++E +++IVT +V +S Q+LV+C G + C GG ++ + ++I+N G
Sbjct: 172 GSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGG 231
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 232 IDTEDDYPYRAVDGKCDMNRKN 253
>gi|328872971|gb|EGG21338.1| cysteine proteinase 5 precursor [Dictyostelium fasciculatum]
Length = 358
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G+ EG +I T+NLV +S Q L+DC + + C GG ++ ++Y+I N G
Sbjct: 138 GSCWSFSATGSTEGAHQISTSNLVALSEQNLIDCSSSYGNDGCNGGLMDNAFKYIIANGG 197
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP V + CK
Sbjct: 198 IDTEASYPYVAKVQKCK 214
>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+ IVT +L+ +S Q+LVDCD + ++ C GG ++ +++VI N G
Sbjct: 23 GSCWAFSAVAAMESINAIVTGDLISLSEQELVDCD-KSYNQGCDGGLMDYAFEFVINNGG 81
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP D C ++ N
Sbjct: 82 IDTEEDYPYKERNDVCDQYRKN 103
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++I T L+ +S Q+LVDCD + + C GG ++ +Q++I N G
Sbjct: 154 GSCWAFSTISAVEGINQIATGKLITLSEQELVDCD-RSYNEGCNGGLMDDAFQFIINNGG 212
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+++ DYP G C ++ N
Sbjct: 213 IDSDADYPYTGRDGQCDQYRKN 234
>gi|313118760|gb|ADR32292.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD + + C GG ++ +++VI N G
Sbjct: 23 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNEGCDGGLMDYAFEFVINNGG 81
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I++E DYP D C ++ N
Sbjct: 82 IDSEEDYPYKERNDVCDQYRKN 103
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 51/80 (63%), Gaps = 1/80 (1%)
Query: 7 CWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGIN 66
CW F+ + A+EGI+KIVT NL +S Q+L+DCD + C GG ++ ++++I N GI+
Sbjct: 162 CWAFSAIAAVEGINKIVTGNLTALSEQELLDCDRTVNA-GCSGGLVDYAFEFIINNGGID 220
Query: 67 TERDYPNVGVMDNCKVFQFN 86
TE DYP G C ++ N
Sbjct: 221 TEEDYPFQGADGICDQYKIN 240
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 75.5 bits (184), Expect = 4e-12, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 46/68 (67%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IV NL +S QQL+DCD + C GG ++ +++++ N G
Sbjct: 153 GSCWAFSTVAAVEGINQIVAGNLTSLSEQQLIDCDTSFNN-GCNGGLMDYAFEFIVNNGG 211
Query: 65 INTERDYP 72
++ E DYP
Sbjct: 212 LHKEEDYP 219
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 75.5 bits (184), Expect = 4e-12, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI T LV +S Q+LVDCD GE + C GG ++ +Q++ + G
Sbjct: 159 GCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGG 218
Query: 65 INTERDYPNVGVMDNC 80
+ +E YP G +C
Sbjct: 219 LASESGYPYQGDDGSC 234
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 75.1 bits (183), Expect = 4e-12, Method: Composition-based stats.
Identities = 33/71 (46%), Positives = 51/71 (71%), Gaps = 7/71 (9%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 154 GSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 209
Query: 62 NRGINTERDYP 72
N GI++E DYP
Sbjct: 210 NGGIDSEEDYP 220
>gi|42794048|dbj|BAD11762.1| cahepsin L-like cysteine protease [Brugia malayi]
Length = 371
Score = 75.1 bits (183), Expect = 4e-12, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 50/78 (64%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC-DNQGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+EG + T LV++S Q L+DC D+ + C GG + ++YV++N
Sbjct: 165 GSCWTFSAVGALEGQHFLQTGKLVELSMQNLLDCSDDTYGNYGCDGGLMMEAFEYVVKND 224
Query: 64 GINTERDYPNVGVMDNCK 81
GI+TE+ YP G + C+
Sbjct: 225 GIDTEKSYPYQGYQNTCR 242
>gi|403302730|ref|XP_003942006.1| PREDICTED: cathepsin S isoform 1 [Saimiri boliviensis boliviensis]
Length = 339
Score = 75.1 bits (183), Expect = 4e-12, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N+G
Sbjct: 146 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKG 205
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP C+
Sbjct: 206 IDSEASYPYKATDQKCQ 222
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 75.1 bits (183), Expect = 4e-12, Method: Composition-based stats.
Identities = 32/67 (47%), Positives = 48/67 (71%), Gaps = 2/67 (2%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ VGA+EG++KIVT LV +S Q L++C+ E+ C GG +ET Y++++ N G+
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMNNGGL 224
Query: 66 NTERDYP 72
T+ DYP
Sbjct: 225 GTDNDYP 231
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 75.1 bits (183), Expect = 4e-12, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 45/68 (66%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I T NL +S Q+L+DCD S C GG ++ +QY+I G
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNS-GCNGGLMDYAFQYIISTGG 217
Query: 65 INTERDYP 72
++ E DYP
Sbjct: 218 LHKEDDYP 225
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 75.1 bits (183), Expect = 4e-12, Method: Composition-based stats.
Identities = 33/71 (46%), Positives = 51/71 (71%), Gaps = 7/71 (9%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 153 GSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 208
Query: 62 NRGINTERDYP 72
N GI++E DYP
Sbjct: 209 NGGIDSEEDYP 219
>gi|290997496|ref|XP_002681317.1| cysteine protease [Naegleria gruberi]
gi|284094941|gb|EFC48573.1| cysteine protease [Naegleria gruberi]
Length = 350
Score = 75.1 bits (183), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 53/90 (58%), Gaps = 8/90 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-----QGES---RSCVGGFIETIY 56
GSCW F+ G +EGI +I T LV +S QQLVDCD+ QG+ C GG + + +
Sbjct: 147 GSCWTFSTTGNVEGIHQIKTGKLVSLSEQQLVDCDHNCVTYQGQQACDAGCNGGLMWSAF 206
Query: 57 QYVIQNRGINTERDYPNVGVMDNCKVFQFN 86
QYVI+ G+ TE YP GV D C+ + N
Sbjct: 207 QYVIKTGGLVTEDSYPYEGVDDTCRFNKSN 236
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 50/78 (64%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + C GG ++ + ++ QN G
Sbjct: 155 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIGQNGG 213
Query: 65 INTERDYPNVGVMDNCKV 82
++ E DYP + C++
Sbjct: 214 LHKEEDYPYIMEESTCEM 231
>gi|62510453|sp|Q8HY82.1|CATS_SAIBB RecName: Full=Cathepsin S; Flags: Precursor
gi|27497536|gb|AAO13008.1| cathepsin S preproprotein [Saimiri boliviensis]
Length = 330
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N+G
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKG 196
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP C+
Sbjct: 197 IDSEASYPYKATDQKCQ 213
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 48/76 (63%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + + C GG ++ + ++I N G
Sbjct: 155 GSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCD-KPFNNGCNGGLMDYAFAFIISNGG 213
Query: 65 INTERDYPNVGVMDNC 80
+ E DYP V C
Sbjct: 214 LRKEEDYPYVMEEGTC 229
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 37/79 (46%), Positives = 50/79 (63%), Gaps = 7/79 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ VG++EGI+ I V +S Q+LVDCD NQG C GG ++ + ++IQ
Sbjct: 159 GSCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQG----CNGGLMDYAFDFIIQ 214
Query: 62 NRGINTERDYPNVGVMDNC 80
N GI+TE+DYP G C
Sbjct: 215 NGGIDTEKDYPYKGFDGRC 233
>gi|301612003|ref|XP_002935514.1| PREDICTED: cathepsin K-like [Xenopus (Silurana) tropicalis]
Length = 331
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 39/79 (49%), Positives = 50/79 (63%), Gaps = 6/79 (7%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC--DNQGESRSCVGGFIETIYQYVIQN 62
GSCW F+ VGA+EG T LV IS Q LVDC DN G C GG++ T ++YV +N
Sbjct: 139 GSCWAFSTVGALEGQLMKKTGKLVGISPQNLVDCVKDNFG----CGGGYMTTAFKYVKKN 194
Query: 63 RGINTERDYPNVGVMDNCK 81
+GI++E YP VG+ CK
Sbjct: 195 KGIDSEEAYPYVGMDQKCK 213
>gi|15145801|gb|AAK83567.1| cysteine proteinase CC23 [Vasconcellea cundinamarcensis]
Length = 176
Score = 75.1 bits (183), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ V +EGI+KIVT L+ +S Q+L+DCD + S C GG+ T QYV+
Sbjct: 19 NPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR--SHGCKGGYQTTSLQYVVD 76
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G++TE+ YP C+
Sbjct: 77 N-GVHTEKVYPYEKKQGKCRA 96
>gi|444515095|gb|ELV10757.1| Aryl hydrocarbon receptor nuclear translocator [Tupaia chinensis]
Length = 786
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 626 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 683
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 684 IDSEDAYPYVGQDESC 699
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI+KI T +L+ +S Q+L+DCD + + C GG + Y++VI+N G
Sbjct: 156 GACWSFSATGAMEGINKITTGSLLSLSEQELIDCD-RSYNTGCGGGLMTYAYKFVIKNGG 214
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 215 IDTEDDYP 222
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 75.1 bits (183), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 49/71 (69%), Gaps = 1/71 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+KIVT V +S Q+LVDCD + + C GG ++ ++++IQN G
Sbjct: 152 GSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCD-RAYNEGCNGGLMDYAFEFIIQNGG 210
Query: 65 INTERDYPNVG 75
I+T++DYP G
Sbjct: 211 IDTDKDYPYRG 221
>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
Length = 341
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG TN LV +S Q L+DC + C GG ++ ++Y+ NRG
Sbjct: 146 GSCWSFSSTGALEGQHYRRTNILVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNRG 205
Query: 65 INTERDYPNVGVMDNCK 81
I+TE+ YP G+ D C+
Sbjct: 206 IDTEKSYPYEGIDDKCR 222
>gi|118145|sp|P20721.1|CYSPL_SOLLC RecName: Full=Low-temperature-induced cysteine proteinase; Flags:
Precursor
gi|806314|gb|AAA66308.1| thiol protease, partial [Solanum lycopersicum]
Length = 346
Score = 75.1 bits (183), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD + + C GG ++ +++VI+N G
Sbjct: 40 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-RSYNEGCDGGLMDYAFEFVIKNGG 98
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 99 IDTEEDYP 106
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 45/68 (66%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I T NL +S Q+L+DCD S C GG ++ +QY+I G
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNS-GCNGGLMDYAFQYIISTGG 217
Query: 65 INTERDYP 72
++ E DYP
Sbjct: 218 LHKEDDYP 225
>gi|147903593|ref|NP_001080822.1| cathepsin S precursor [Xenopus laevis]
gi|33417128|gb|AAH56059.1| Ctss-a protein [Xenopus laevis]
Length = 333
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG + T LV +S Q LVDC ++ ++ C GGF+ + +QYVI N G
Sbjct: 140 GSCWAFSAVGALEGQLMLKTGKLVSLSPQNLVDCASKYGNKGCSGGFMTSAFQYVIDNNG 199
Query: 65 INTERDYPNVGVMDNC 80
I+++ YP + + C
Sbjct: 200 IDSDSYYPYHAMDEKC 215
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 50/76 (65%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+E I+KI T LV +S Q+L+DCDN + + C GG ++ +Q++ +N G
Sbjct: 159 GSCWAFSTIAAVESINKIRTGKLVSLSEQELMDCDNVND-QGCDGGLMDYAFQFIQKNGG 217
Query: 65 INTERDYPNVGVMDNC 80
+ +E +YP G + C
Sbjct: 218 VTSEANYPYQGQQNTC 233
>gi|403302736|ref|XP_003942009.1| PREDICTED: cathepsin K isoform 2 [Saimiri boliviensis boliviensis]
Length = 383
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 191 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 248
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 249 IDSEDAYPYVGQEESC 264
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 48/77 (62%), Gaps = 1/77 (1%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ V IEG+ +I LV +S Q+LVDC +G+S C GG++E ++++ + G+
Sbjct: 148 SCWAFSTVATIEGLHQITKGELVSLSEQELVDC-VKGDSEGCYGGYVEDAFEFIAKKGGV 206
Query: 66 NTERDYPNVGVMDNCKV 82
+E YP GV CKV
Sbjct: 207 ASETHYPYKGVNKTCKV 223
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 75.1 bits (183), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT +L+ +S Q+LVDCD + C GG ++ +Q++I N G
Sbjct: 152 GSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDT-AYNEGCNGGLMDYAFQFIISNGG 210
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 211 IDTEEDYP 218
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI+KI T +L+ +S Q+L+DCD + + C GG + Y++VI+N G
Sbjct: 156 GACWSFSATGAMEGINKITTGSLLSLSEQELIDCD-RSYNTGCGGGLMTYAYKFVIKNGG 214
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 215 IDTEDDYP 222
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 31/68 (45%), Positives = 44/68 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG I T LV +S QQL+DC + + C GG ++ ++YVI N G
Sbjct: 129 GSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCSTRYGNHGCNGGLMDYAFEYVIANGG 188
Query: 65 INTERDYP 72
++TE DYP
Sbjct: 189 LDTEEDYP 196
>gi|113120267|gb|ABI30273.1| VXH-B, partial [Vasconcellea x heilbornii]
Length = 266
Score = 75.1 bits (183), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 39/80 (48%), Positives = 51/80 (63%), Gaps = 3/80 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT LV +S Q+L+DC+ + S C GGF QYV QN G
Sbjct: 157 GSCWTFSSVAAVEGINKIVTGRLVSLSEQELLDCERR--SYGCRGGFPPYALQYVAQN-G 213
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+ ++YP GV C+ Q
Sbjct: 214 IHLRQNYPYEGVQRQCRARQ 233
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 31/82 (37%), Positives = 51/82 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+++VT ++ +S Q+LV+C G++ C GG + + ++I+N G
Sbjct: 162 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGG 221
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 222 IDTEDDYPYKAVDGKCDINREN 243
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 75.1 bits (183), Expect = 5e-12, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 46/76 (60%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + ++EGI+ I T LV +S QQLVDC E+ C GG ++ +QY+I N G
Sbjct: 158 GSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSK--ENAGCNGGLMDNAFQYIIDNGG 215
Query: 65 INTERDYPNVGVMDNC 80
I TE +YP C
Sbjct: 216 IVTEDEYPYTAEAGEC 231
>gi|255572401|ref|XP_002527138.1| cysteine protease, putative [Ricinus communis]
gi|223533498|gb|EEF35240.1| cysteine protease, putative [Ricinus communis]
Length = 96
Score = 75.1 bits (183), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 54/82 (65%), Gaps = 4/82 (4%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F++V A+EGI+KIVT L+ +S Q+LVDCD + + C G ++ +Q++I N GI
Sbjct: 9 SCWAFSIVAAVEGINKIVTGKLISLSDQELVDCD-RSYNAGCNGDLVDNAFQFIINNGGI 67
Query: 66 NTERDYPNVGV---MDNCKVFQ 84
+T++DYP V D KV Q
Sbjct: 68 DTDKDYPYQAVDGKRDMTKVLQ 89
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 75.1 bits (183), Expect = 6e-12, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEGI++I LV +S Q+LVDCD+ E+ C GG++ +++V+ N G
Sbjct: 144 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDD--EAVGCGGGYMSWAFEFVVGNHG 201
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
+ TE YP C+ + N
Sbjct: 202 LTTEASYPYHAANGACQAAKLN 223
>gi|5901663|gb|AAD55363.1| cysteine protease [Hordeum vulgare subsp. vulgare]
Length = 163
Score = 75.1 bits (183), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+++VT ++ +S Q+LV+C G++ C GG ++ + ++I+N G
Sbjct: 1 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 60
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 61 IDTEEDYPYKAVDGKCDINREN 82
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 75.1 bits (183), Expect = 6e-12, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEGI++I LV +S Q+LVDCD+ E+ C GG++ +++V+ N G
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDD--EAVGCGGGYMSWAFEFVVGNHG 202
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
+ TE YP C+ + N
Sbjct: 203 LTTEASYPYHAANGACQAAKLN 224
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V AIEGI++I LV +S Q+LVDCD+ E+ C GG++ +++V+ N G
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDD--EAVGCGGGYMSWAFEFVVGNHG 202
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
+ TE YP C+ + N
Sbjct: 203 LTTEASYPYHAANGACQAAKLN 224
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEGI+ + +L+ +S Q+LVDCD+ + C GG+++ +++V+ N G
Sbjct: 169 GSCWAFSSTGAIEGINALANGDLISLSEQELVDCDSTND--GCEGGYMDYAFEWVMSNGG 226
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP G C
Sbjct: 227 IDTETDYPYTGEDGTC 242
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T NLV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 145 GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGFMTKAFQYIIDNN 204
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + NC+
Sbjct: 205 GIDSEVSYPYKAMDGNCR 222
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 48/76 (63%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + ++EGI+KI TN LV +S QQLVDCD ++ C GG ++ ++++ N G
Sbjct: 150 GSCWAFSTIASVEGINKIKTNQLVPLSGQQLVDCDTD-QNEGCNGGLMDYAFEFIKSNGG 208
Query: 65 INTERDYPNVGVMDNC 80
I +E YP +C
Sbjct: 209 ITSESAYPYTAEQGSC 224
>gi|255626679|gb|ACU13684.1| unknown [Glycine max]
Length = 229
Score = 74.7 bits (182), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 47/71 (66%), Gaps = 1/71 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E +KIVT V +S Q+LVDCD R C GG ++ ++++IQN G
Sbjct: 152 GSCWAFSTVATVEATNKIVTGKFVSLSEQELVDCDRAYNER-CNGGLMDYAFEFIIQNGG 210
Query: 65 INTERDYPNVG 75
I+T++DYP G
Sbjct: 211 IDTDKDYPYRG 221
>gi|149510440|ref|XP_001518002.1| PREDICTED: cathepsin K-like [Ornithorhynchus anatinus]
Length = 618
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 38/79 (48%), Positives = 49/79 (62%), Gaps = 6/79 (7%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC--DNQGESRSCVGGFIETIYQYVIQN 62
GSCW F+ VGA+EG K T L+D+S Q LVDC N G C GG++ +QYV N
Sbjct: 426 GSCWAFSSVGALEGQLKKKTGRLLDLSPQNLVDCVASNDG----CGGGYMTNAFQYVHDN 481
Query: 63 RGINTERDYPNVGVMDNCK 81
RGI++E YP VG + C+
Sbjct: 482 RGIDSEDAYPYVGQDEPCR 500
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI+KI T +L+ +S Q+L+DCD S C GG ++ Y++V++N G
Sbjct: 147 GACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSYNS-GCGGGLMDYAYKFVVKNGG 205
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 206 IDTEADYP 213
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EGI+KI T +L+ +S Q+L+DCD S C GG ++ Y++V++N G
Sbjct: 148 GACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSYNS-GCGGGLMDYAYKFVVKNGG 206
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 207 IDTEADYP 214
>gi|402856109|ref|XP_003892642.1| PREDICTED: cathepsin K [Papio anubis]
Length = 348
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 156 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 213
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 214 IDSEDAYPYVGQEESC 229
>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
Length = 360
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 50/78 (64%), Gaps = 7/78 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-------RSCVGGFIETIYQ 57
GSCW F+ GA+EG + + T NLV +S QQLVDCD++ +S + C GG + T ++
Sbjct: 151 GSCWSFSAAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQGCNGGLMTTAFE 210
Query: 58 YVIQNRGINTERDYPNVG 75
Y++++ G+ E DYP G
Sbjct: 211 YILKSGGLEREADYPYTG 228
>gi|403302734|ref|XP_003942008.1| PREDICTED: cathepsin K isoform 1 [Saimiri boliviensis boliviensis]
Length = 329
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>gi|310975577|gb|ADP55137.1| cathepsin S [Miichthys miiuy]
Length = 338
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 33/71 (46%), Positives = 43/71 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LVD+S Q LVDC + + C GGF+ +QYVI N G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVIDNHG 203
Query: 65 INTERDYPNVG 75
I+++ YP G
Sbjct: 204 IDSDAAYPYTG 214
>gi|390476660|ref|XP_003735160.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin K [Callithrix jacchus]
Length = 329
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>gi|149030666|gb|EDL85703.1| cathepsin S [Rattus norvegicus]
Length = 291
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
GSCW F+ VGA+EG K+ T LV +S Q LVDC + + ++ C GGF+ +QY+I N
Sbjct: 96 GSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDN 155
Query: 63 RGINTERDYPNVGVMDNC 80
GI++E YP + + C
Sbjct: 156 GGIDSEASYPYKAMDEKC 173
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 74.7 bits (182), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 50/75 (66%), Gaps = 2/75 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EGI+ I T L+ +S Q+LVDCD E C GG+++ +++VI N G
Sbjct: 168 GSCWAFSSTGAMEGINAITTGELISLSEQELVDCDTTNE--GCDGGYMDYAFEWVINNGG 225
Query: 65 INTERDYPNVGVMDN 79
I++E +YP G D+
Sbjct: 226 IDSEANYPYTGQADS 240
>gi|18202414|sp|P82473.1|CPGP1_ZINOF RecName: Full=Zingipain-1; AltName: Full=Cysteine proteinase GP-I
Length = 221
Score = 74.7 bits (182), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 49/80 (61%), Gaps = 2/80 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F + A+EGI++IVT +L+ +S QQLVDC + + C GG+ +QY+I N G
Sbjct: 25 GSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTR--NHGCEGGWPYRAFQYIINNGG 82
Query: 65 INTERDYPNVGVMDNCKVFQ 84
IN+E YP G C +
Sbjct: 83 INSEEHYPYTGTNGTCDTKE 102
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 33/71 (46%), Positives = 51/71 (71%), Gaps = 7/71 (9%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD NQG C GG ++ ++++I
Sbjct: 153 GSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQG----CNGGLMDYAFEFIIN 208
Query: 62 NRGINTERDYP 72
N GI++E DYP
Sbjct: 209 NGGIDSEEDYP 219
>gi|432117576|gb|ELK37815.1| Cathepsin L1 [Myotis davidii]
Length = 299
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 43/77 (55%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + C GG ++ +QYV N G
Sbjct: 81 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCSGGLMDNAFQYVKDNEG 140
Query: 65 INTERDYPNVGVMDNCK 81
++TE YP G D CK
Sbjct: 141 LDTEESYPYYGTDDTCK 157
>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
Length = 331
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 35/78 (44%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQG-ESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 137 GSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNN 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP V CK
Sbjct: 197 GIDSEASYPYKAVDGKCK 214
>gi|348586441|ref|XP_003478977.1| PREDICTED: cathepsin K-like [Cavia porcellus]
Length = 329
Score = 74.7 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQENRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 74.7 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 49/77 (63%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI KI NL+ +S QQLVDC + +++ C GGF++ + Y+ +N G
Sbjct: 149 GSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCASNEQNQGCGGGFMDNAFSYITEN-G 207
Query: 65 INTERDYPNVGVMDNCK 81
I +E DY G C+
Sbjct: 208 IASENDYQYRGGAGTCQ 224
>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
Length = 328
Score = 74.7 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ VGA+E K+ T LV +S Q LVDC ++ C GGF+ +QY+I N G
Sbjct: 135 GACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNG 194
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP + C+
Sbjct: 195 IDSEESYPYMAQNGTCQ 211
>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 74.7 bits (182), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 48/68 (70%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD + + C GG ++ +++VI N G
Sbjct: 23 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNEGCDGGLMDYAFEFVINNGG 81
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 82 IDTEEDYP 89
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 74.7 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 35/82 (42%), Positives = 50/82 (60%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+ I TN LV +S Q+LVDCD E++ C GG + ++++ + G
Sbjct: 148 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTS-ENQGCNGGLMGYAFEFIKEKGG 206
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE+ YP C V + N
Sbjct: 207 ITTEQSYPYTAEDGTCDVSKVN 228
>gi|161172356|pdb|3BCN|A Chain A, Crystal Structure Of A Papain-Like Cysteine Protease
Ervatamin-A Complexed With Irreversible Inhibitor E-64
gi|161172357|pdb|3BCN|B Chain B, Crystal Structure Of A Papain-Like Cysteine Protease
Ervatamin-A Complexed With Irreversible Inhibitor E-64
Length = 209
Score = 74.7 bits (182), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 49/80 (61%), Gaps = 2/80 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I++I T NL+ +S QQLVDC + + C GG+ + YQY+I N G
Sbjct: 23 GSCWAFSTVTTVESINQIRTGNLISLSEQQLVDCSKK--NHGCKGGYFDRAYQYIIANGG 80
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+TE +YP C+ +
Sbjct: 81 IDTEANYPYKAFQGPCRAAK 100
>gi|395856027|ref|XP_003800444.1| PREDICTED: cathepsin K [Otolemur garnettii]
Length = 329
Score = 74.7 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 38/78 (48%), Positives = 51/78 (65%), Gaps = 6/78 (7%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC--DNQGESRSCVGGFIETIYQYVIQN 62
GSCW F+ VGA+EG K T L+++S Q LVDC DN G C GG++ +QYV +N
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSDNDG----CGGGYMTNAFQYVQKN 192
Query: 63 RGINTERDYPNVGVMDNC 80
RGI++E YP VG ++C
Sbjct: 193 RGIDSEDAYPYVGQDESC 210
>gi|431896622|gb|ELK06034.1| Cathepsin K [Pteropus alecto]
Length = 330
Score = 74.7 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 74.7 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 45/76 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G C F+ V A EGI KI T LV ++ Q+LVDCD GE + C GG ++ ++++I+N G
Sbjct: 120 GCCSAFSAVAATEGIVKISTGKLVSLADQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 179
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP C
Sbjct: 180 LTTESSYPYTAADGKC 195
>gi|60654335|gb|AAX29858.1| cathepsin K [synthetic construct]
gi|60654337|gb|AAX29859.1| cathepsin K [synthetic construct]
Length = 330
Score = 74.7 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>gi|395729888|ref|XP_002810309.2| PREDICTED: cathepsin K [Pongo abelii]
Length = 343
Score = 74.7 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 151 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 208
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 209 IDSEDAYPYVGQEESC 224
>gi|74136185|ref|NP_001027984.1| cathepsin K precursor [Macaca mulatta]
gi|47117667|sp|P61276.1|CATK_MACFA RecName: Full=Cathepsin K; Flags: Precursor
gi|47117668|sp|P61277.1|CATK_MACMU RecName: Full=Cathepsin K; Flags: Precursor
gi|3236470|gb|AAC23694.1| cathepsin K [Macaca fascicularis]
gi|4927694|gb|AAD33249.1| cathepsin K [Macaca mulatta]
gi|355558400|gb|EHH15180.1| hypothetical protein EGK_01237 [Macaca mulatta]
gi|355763132|gb|EHH62118.1| hypothetical protein EGM_20317 [Macaca fascicularis]
gi|380809978|gb|AFE76864.1| cathepsin K preproprotein [Macaca mulatta]
gi|383416065|gb|AFH31246.1| cathepsin K preproprotein [Macaca mulatta]
gi|384945478|gb|AFI36344.1| cathepsin K preproprotein [Macaca mulatta]
Length = 329
Score = 74.3 bits (181), Expect = 7e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>gi|49456399|emb|CAG46520.1| CTSK [Homo sapiens]
Length = 329
Score = 74.3 bits (181), Expect = 7e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>gi|332220191|ref|XP_003259241.1| PREDICTED: cathepsin K [Nomascus leucogenys]
Length = 329
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 47/77 (61%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I T NL +S QQLVDCD + + C GG ++ +QY+ ++ G
Sbjct: 270 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANA-GCNGGLMDYAFQYIAKHGG 328
Query: 65 INTERDYPNVGVMDNCK 81
+ E YP +CK
Sbjct: 329 VAAEDAYPYRARQASCK 345
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 47/77 (61%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I T NL +S QQLVDCD + + C GG ++ +QY+ ++ G
Sbjct: 162 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANA-GCNGGLMDYAFQYIAKHGG 220
Query: 65 INTERDYPNVGVMDNCK 81
+ E YP +CK
Sbjct: 221 VAAEDAYPYRARQASCK 237
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 47/77 (61%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I T NL +S QQLVDCD + + C GG ++ +QY+ ++ G
Sbjct: 163 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANA-GCNGGLMDYAFQYIAKHGG 221
Query: 65 INTERDYPNVGVMDNCK 81
+ E YP +CK
Sbjct: 222 VAAEDAYPYRARQASCK 238
>gi|4503151|ref|NP_000387.1| cathepsin K preproprotein [Homo sapiens]
gi|1168793|sp|P43235.1|CATK_HUMAN RecName: Full=Cathepsin K; AltName: Full=Cathepsin O; AltName:
Full=Cathepsin O2; AltName: Full=Cathepsin X; Flags:
Precursor
gi|562757|emb|CAA57649.1| Cathepsin O [Homo sapiens]
gi|606923|gb|AAA65233.1| cathepsin O [Homo sapiens]
gi|1195556|gb|AAB35521.1| cathepsin O2 [Homo sapiens]
gi|16359188|gb|AAH16058.1| Cathepsin K [Homo sapiens]
gi|49456311|emb|CAG46476.1| CTSK [Homo sapiens]
gi|60823594|gb|AAX36649.1| cathepsin K [synthetic construct]
gi|119573901|gb|EAW53516.1| cathepsin K (pycnodysostosis), isoform CRA_b [Homo sapiens]
gi|307685681|dbj|BAJ20771.1| cathepsin K [synthetic construct]
gi|312150424|gb|ADQ31724.1| cathepsin K [synthetic construct]
Length = 329
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>gi|426331364|ref|XP_004026652.1| PREDICTED: cathepsin K [Gorilla gorilla gorilla]
Length = 329
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>gi|836934|gb|AAA95998.1| cathepsin X [Homo sapiens]
Length = 329
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>gi|397492864|ref|XP_003817340.1| PREDICTED: cathepsin K [Pan paniscus]
Length = 343
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 151 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 208
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 209 IDSEDAYPYVGQEESC 224
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 74.3 bits (181), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 49/77 (63%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KIV NLV +S QQL+DCD + ++ C GG + + Y+I+NRG
Sbjct: 152 GCCWAFSSVAAVEGLTKIVGGNLVSLSEQQLLDCDRERDN-GCNGGIMSDAFSYIIKNRG 210
Query: 65 INTERDYPNVGVMDNCK 81
I +E YP C+
Sbjct: 211 IASEASYPYQETEGTCR 227
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 47/77 (61%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I T NL +S QQLVDCD + + C GG ++ +QY+ ++ G
Sbjct: 65 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANA-GCNGGLMDYAFQYIAKHGG 123
Query: 65 INTERDYPNVGVMDNCK 81
+ E YP +CK
Sbjct: 124 VAAEDAYPYRARQASCK 140
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 74.3 bits (181), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 35/71 (49%), Positives = 47/71 (66%), Gaps = 7/71 (9%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
G CW F+ V A+EGI+KI LV +S QQL+DCD NQG C GG + ++Y+I+
Sbjct: 150 GGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQG----CRGGIMSKAFEYIIK 205
Query: 62 NRGINTERDYP 72
N+GI TE +YP
Sbjct: 206 NQGITTEDNYP 216
>gi|42563538|gb|AAS20467.1| cysteine protease-like protein [Pelargonium x hortorum]
Length = 234
Score = 74.3 bits (181), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 53/85 (62%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
G CW F+ + A+EGI+ IVT L+ +S Q+LVDCD NQG C GG ++ ++++I+
Sbjct: 2 GRCWAFSTIAAVEGINHIVTGELISLSEQELVDCDRSYNQG----CNGGLMDYAFEFIIK 57
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N GI++E DYP V C + N
Sbjct: 58 NGGIDSEEDYPYKAVDGTCDPIRKN 82
>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LVD+S Q LVDC + + C GG + +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGLMHHAFQYVIDNQG 203
Query: 65 INTERDYPNVGVMDNCK 81
I+++ YP G C+
Sbjct: 204 IDSDASYPYTGRNGECR 220
>gi|449532567|ref|XP_004173252.1| PREDICTED: oryzain alpha chain-like [Cucumis sativus]
Length = 321
Score = 74.3 bits (181), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT L+ +S Q+LVDCD + + C GG ++ +Q++I N G
Sbjct: 13 GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCD-KSFNMGCNGGLMDYAFQFIIGNGG 71
Query: 65 INTERDYPNVGVMDNC 80
I+TE DYP G C
Sbjct: 72 IDTEEDYPYKGRDAAC 87
>gi|54020908|ref|NP_001005695.1| cathepsin S precursor [Xenopus (Silurana) tropicalis]
gi|49522293|gb|AAH75261.1| cathepsin S [Xenopus (Silurana) tropicalis]
Length = 333
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 49/76 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG + T LV +S Q LVDC ++ ++ C GGF+ +QYVI N+G
Sbjct: 140 GSCWAFSAVGALEGQLMLKTGKLVSLSPQNLVDCSSKYGNKGCGGGFMTQAFQYVIDNKG 199
Query: 65 INTERDYPNVGVMDNC 80
I+++ YP + + C
Sbjct: 200 IDSDSYYPYHAMDEKC 215
>gi|312381834|gb|EFR27484.1| hypothetical protein AND_05795 [Anopheles darlingi]
Length = 508
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 43/68 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG TN LV +S Q LVDC + ++ C GG I +QY+ QN G
Sbjct: 313 GSCWAFSSTGAVEGQHFRKTNKLVSLSEQNLVDCTSNYRNKGCKGGAIYRSFQYIEQNHG 372
Query: 65 INTERDYP 72
I+TE+ YP
Sbjct: 373 IDTEKSYP 380
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 74.3 bits (181), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LV +S Q LVDC Q ++ C GG ++ +QY+IQN+G
Sbjct: 136 GSCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQEGNKGCEGGDMDQGFQYIIQNKG 195
Query: 65 INTERDYPNVGVMDNCK 81
I+TE+ YP CK
Sbjct: 196 IDTEQCYPYKAKNHRCK 212
>gi|359492179|ref|XP_002280808.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|302142580|emb|CBI19783.3| unnamed protein product [Vitis vinifera]
Length = 365
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 49/84 (58%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
GSCW F+ GA+EG + T NLV +S QQLVDCD++ + R C GG + T ++
Sbjct: 157 GSCWSFSTTGALEGAHFLATGNLVSLSEQQLVDCDHECDPEEYGACDRGCNGGLMNTAFE 216
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y+++ G+ DYP G +CK
Sbjct: 217 YILKAGGVVRGEDYPYTGTDGHCK 240
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT +++ +S Q+LVDCD + C GG ++ ++++I N G
Sbjct: 152 GSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGG 210
Query: 65 INTERDYP 72
I++E DYP
Sbjct: 211 IDSEEDYP 218
>gi|6435586|pdb|7PCK|A Chain A, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435587|pdb|7PCK|B Chain B, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435588|pdb|7PCK|C Chain C, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435589|pdb|7PCK|D Chain D, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435592|pdb|1BY8|A Chain A, The Crystal Structure Of Human Procathepsin K
Length = 314
Score = 74.3 bits (181), Expect = 8e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 122 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 179
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 180 IDSEDAYPYVGQEESC 195
>gi|214015390|gb|ACJ62311.1| cysteine protease [Zea mays subsp. parviglumis]
Length = 247
Score = 74.3 bits (181), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 50/82 (60%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G W F+ V AIEG++ I T NLV +S Q+++DCD Q C GG +E +++VI N G
Sbjct: 124 GRYWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ--DSGCDGGQMEDAFRFVIGNGG 181
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP +G C + N
Sbjct: 182 IDTEADYPFIGTDGTCDANKEN 203
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 74.3 bits (181), Expect = 9e-12, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 48/76 (63%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD + + C GG ++ + ++I N G
Sbjct: 104 GSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCD-KPFNNGCNGGLMDYAFAFIISNGG 162
Query: 65 INTERDYPNVGVMDNC 80
+ E DYP V C
Sbjct: 163 LRKEEDYPYVMEEGTC 178
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 74.3 bits (181), Expect = 9e-12, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 45/76 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + + C GG ++ +QYV N+G
Sbjct: 149 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYVKDNKG 208
Query: 65 INTERDYPNVGVMDNC 80
I+TE+ YP + D C
Sbjct: 209 IDTEKAYPYEAIDDEC 224
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 74.3 bits (181), Expect = 9e-12, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI K+ T LV +S Q+LVDCD +G+ + C GG + ++++ ++ G
Sbjct: 146 GSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDCDVKGQDKGCQGGLMVDAFKFIKRHGG 205
Query: 65 INTERDYPNVGVMDNC 80
+ +E +YP G C
Sbjct: 206 MTSEANYPYQGRDGKC 221
>gi|148362116|gb|ABQ59635.1| ervatamin-A [Tabernaemontana divaricata]
Length = 184
Score = 74.3 bits (181), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 49/80 (61%), Gaps = 2/80 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I++I T NL+ +S QQLVDC ++ C GG+ + YQY+I N G
Sbjct: 12 GSCWAFSTVTTVESINQIRTGNLISLSEQQLVDCSK--KNHGCKGGYFDRAYQYIIANGG 69
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+TE +YP C+ +
Sbjct: 70 IDTEANYPYKAFQGPCRAAK 89
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 74.3 bits (181), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 48/68 (70%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EGI++I T L+ +S Q+LVDCD + C GG + ++++++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211
Query: 65 INTERDYP 72
I T++DYP
Sbjct: 212 IETDQDYP 219
>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
Length = 331
Score = 74.3 bits (181), Expect = 9e-12, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQG-ESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + + C GGF+ +QY+I N
Sbjct: 137 GSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEKYRNEGCHGGFMTEAFQYIIDNN 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + + C+
Sbjct: 197 GIDSEASYPYKAMDEKCQ 214
>gi|66354492|gb|AAY44882.1| papain family cysteine protease [Vigna unguiculata]
Length = 178
Score = 74.3 bits (181), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 50/78 (64%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V IEG+ I LV +S Q+LVDC +G+S C GG++E ++++ + G
Sbjct: 8 GSCWAFSAVATIEGLHHIKKGELVSLSEQELVDC-VRGDSEGCNGGYVEDAFEFLAKKGG 66
Query: 65 INTERDYPNVGVMDNCKV 82
I +E +YP GV +CKV
Sbjct: 67 IASETNYPYKGVNKSCKV 84
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 74.3 bits (181), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 48/68 (70%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EGI++I T L+ +S Q+LVDCD + C GG + ++++++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211
Query: 65 INTERDYP 72
I T++DYP
Sbjct: 212 IETDQDYP 219
>gi|334332716|ref|XP_001367365.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 74.3 bits (181), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 46/76 (60%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ VGAIEG T LV +S Q LVDC SC GGF++ +QYV N GI
Sbjct: 140 SCWAFSAVGAIEGQWFRKTGELVSLSIQNLVDCTTSDSISSCHGGFMDRAFQYVQDNGGI 199
Query: 66 NTERDYPNVGVMDNCK 81
+TE YP VG ++ CK
Sbjct: 200 DTEECYPYVGEVNECK 215
>gi|77404197|ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris]
gi|122056102|sp|Q3ZKN1.1|CATK_CANFA RecName: Full=Cathepsin K; Flags: Precursor
gi|58047562|gb|AAW65150.1| cathepsin K [Canis lupus familiaris]
Length = 330
Score = 74.3 bits (181), Expect = 9e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 74.3 bits (181), Expect = 9e-12, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 45/68 (66%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I TNNL +S QQLVDCD + + C GG ++ + Y+ ++ G
Sbjct: 166 GSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNA-GCDGGLMDDAFSYIAKHGG 224
Query: 65 INTERDYP 72
+ E+ YP
Sbjct: 225 VAAEKSYP 232
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 74.3 bits (181), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 35/71 (49%), Positives = 47/71 (66%), Gaps = 7/71 (9%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
G CW F+ V A+EGI+KI LV +S QQL+DCD NQG C GG + ++Y+I+
Sbjct: 149 GGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDTDYNQG----CHGGIMSKAFEYIIK 204
Query: 62 NRGINTERDYP 72
N+GI TE +YP
Sbjct: 205 NQGITTEDNYP 215
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 74.3 bits (181), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI++I T L+ +S Q+LVDCD + + C GG ++ + ++I+N G
Sbjct: 160 GSCWAFSAIGAVEGINQITTGELITLSEQELVDCD-RSYNEGCEGGLMDYAFNFIIKNGG 218
Query: 65 INTERDYPNVGVMDNC 80
I+++ DYP G C
Sbjct: 219 IDSDLDYPYTGRDGTC 234
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 74.3 bits (181), Expect = 1e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 49/78 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT LV +S Q+LV+C + C GG ++ + ++ +N G
Sbjct: 180 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFITRNGG 239
Query: 65 INTERDYPNVGVMDNCKV 82
++TE DYP + C +
Sbjct: 240 LDTEEDYPYTAMDGKCDL 257
>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 329
Score = 74.3 bits (181), Expect = 1e-11, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 42/68 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG +I T N V +S QQL+DC + C GG +++ Y+++ G
Sbjct: 134 GSCWAFSTTGAVEGAHQIATGNFVSLSEQQLMDCSRSYGNHGCQGGLMDSAMSYIVKQGG 193
Query: 65 INTERDYP 72
INTE YP
Sbjct: 194 INTEESYP 201
>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 400
Score = 74.3 bits (181), Expect = 1e-11, Method: Composition-based stats.
Identities = 35/79 (44%), Positives = 54/79 (68%), Gaps = 2/79 (2%)
Query: 4 LGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNR 63
+GSCW + V AIEGI +I T+ L+ +S Q+LVD +GES C+GG++E ++++++
Sbjct: 225 IGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVD-SVKGESEGCIGGYVEDAFEFIVKKG 283
Query: 64 GINTERDYPNVGVMDNCKV 82
GI +E YP GV + CKV
Sbjct: 284 GILSETHYPYKGV-NXCKV 301
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 74.3 bits (181), Expect = 1e-11, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG K+ T L+ +S Q+LVDCD G + C GG I+ +Q+++ N G
Sbjct: 156 GCCWAFSAVAAMEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGG 215
Query: 65 INTERDYPNVGVMDNCK 81
+ E +YP CK
Sbjct: 216 LTAEANYPYTAEDGRCK 232
>gi|12621903|gb|AAB60643.2|AAB60643 cathepsin S [Homo sapiens]
Length = 267
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214
>gi|281207557|gb|EFA81740.1| hypothetical protein PPL_05734 [Polysphondylium pallidum PN500]
Length = 387
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 44/68 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G+IEG +I T NLV +S Q L+DC ++ C GG + ++YVI+N G
Sbjct: 143 GSCWSFSTTGSIEGAHEIATGNLVSLSEQNLIDCSTAEGNQGCNGGLMTNAFEYVIKNGG 202
Query: 65 INTERDYP 72
I+TE YP
Sbjct: 203 IDTEASYP 210
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LV +S Q LVDC +Q + C GG ++ ++Y+I+N G
Sbjct: 130 GSCWSFSTTGSVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGG 189
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP CK
Sbjct: 190 IDTEASYPYTATTGTCK 206
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 49/76 (64%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G W F+ + A EGI +I T NLV +S Q+LVDCD+ + C GGF+E ++++I+N G
Sbjct: 149 GRFWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSVDD--GCEGGFMEDGFEFIIKNGG 206
Query: 65 INTERDYPNVGVMDNC 80
I +E +YP GV C
Sbjct: 207 ITSETNYPYKGVDGTC 222
>gi|119573900|gb|EAW53515.1| cathepsin K (pycnodysostosis), isoform CRA_a [Homo sapiens]
Length = 288
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 96 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 153
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 154 IDSEDAYPYVGQEESC 169
>gi|440893559|gb|ELR46281.1| Cathepsin L1 [Bos grunniens mutus]
Length = 330
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 45/76 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC +R C GGFI+ +QYV+ G
Sbjct: 133 GSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGG 192
Query: 65 INTERDYPNVGVMDNC 80
+++E YP G++ C
Sbjct: 193 LDSEESYPYTGLVGTC 208
>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 498
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/67 (49%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VG+IEG + + T LV +S QQLVDCD + C GG ++ ++YV+ N G
Sbjct: 155 GSCWAFSAVGSIEGANALATGQLVALSEQQLVDCDT-ASNMGCSGGLMDDAFKYVLDNGG 213
Query: 65 INTERDY 71
I+TE DY
Sbjct: 214 IDTEEDY 220
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++I TN LV +S Q+LVDCD +++ C GG ++ ++++ Q G
Sbjct: 148 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGG 206
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP C V + N
Sbjct: 207 ITTEANYPYEAYDGTCDVSKEN 228
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA V +E + +I T LV +S Q+LVDC +G+S C GG++E ++++ G
Sbjct: 145 GSCWAFATVATVESLHQITTGELVSLSEQELVDC-VRGDSEGCRGGYVENAFEFIANKGG 203
Query: 65 INTERDYPNVGVMDNCKV 82
I +E YP G +CKV
Sbjct: 204 ITSEAYYPYKGKDRSCKV 221
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 35/82 (42%), Positives = 49/82 (59%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +EGI+KI T LV +S Q+LVDC+ E C GG +E Y+++ ++ G
Sbjct: 148 GSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETDNE--GCNGGLMENAYEFIKKSGG 205
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TER YP +C + N
Sbjct: 206 ITTERLYPYKARDGSCDSSKMN 227
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KI T LV +S Q+L+DCDN ++ C GG ++ +Q+ IQ G
Sbjct: 154 GSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCEGGLMDYAFQF-IQKNG 211
Query: 65 INTERDYPNVGVMDNC 80
I TE +YP G +C
Sbjct: 212 ITTESNYPYQGEQGSC 227
>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
Length = 379
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 34/83 (40%), Positives = 45/83 (54%), Gaps = 6/83 (7%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE------SRSCVGGFIETIYQY 58
GSCW F G+IEG + + T LV +S QQLVDCDN+ + C GG + T Y Y
Sbjct: 162 GSCWAFTTTGSIEGANFLATGKLVSLSEQQLVDCDNKCDITKTSCDNGCNGGLMTTAYDY 221
Query: 59 VIQNRGINTERDYPNVGVMDNCK 81
+++ G+ E YP G CK
Sbjct: 222 LMEAGGLEEETSYPYTGAQGECK 244
>gi|410968296|ref|XP_003990643.1| PREDICTED: cathepsin K [Felis catus]
Length = 330
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211
>gi|149751227|ref|XP_001490649.1| PREDICTED: cathepsin K-like [Equus caballus]
Length = 329
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210
>gi|355681653|gb|AER96814.1| cathepsin K [Mustela putorius furo]
Length = 329
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211
>gi|301767944|ref|XP_002919404.1| PREDICTED: cathepsin K-like [Ailuropoda melanoleuca]
gi|281352889|gb|EFB28473.1| hypothetical protein PANDA_008011 [Ailuropoda melanoleuca]
Length = 330
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211
>gi|295321664|pdb|3H7D|A Chain A, The Crystal Structure Of The Cathepsin K Variant M5 In
Compl Chondroitin-4-Sulfate
gi|295321665|pdb|3H7D|E Chain E, The Crystal Structure Of The Cathepsin K Variant M5 In
Compl Chondroitin-4-Sulfate
Length = 215
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC + E+ C GG++ +QYV +NRG
Sbjct: 23 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 80
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 81 IDSEDAYPYVGQEESC 96
>gi|139947602|ref|NP_001077155.1| cathepsin L1 precursor [Bos taurus]
gi|134025180|gb|AAI34742.1| CTSL1 protein [Bos taurus]
gi|296484500|tpg|DAA26615.1| TPA: cathepsin L1 [Bos taurus]
Length = 333
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 45/76 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC +R C GGFI+ +QYV+ G
Sbjct: 136 GSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGG 195
Query: 65 INTERDYPNVGVMDNC 80
+++E YP G++ C
Sbjct: 196 LDSEESYPYTGLVGTC 211
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 49/77 (63%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT LV +S Q+L+DCD +++ C GG ++ ++++ N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTKELVPLSEQELIDCDTD-QNQGCSGGLMDYAFEFIKNNGG 209
Query: 65 INTERDYPNVGVMDNCK 81
I TE YP CK
Sbjct: 210 ITTEDVYPYQAEDATCK 226
>gi|130502110|ref|NP_001076110.1| cathepsin K precursor [Oryctolagus cuniculus]
gi|1168794|sp|P43236.1|CATK_RABIT RecName: Full=Cathepsin K; AltName: Full=Protein OC-2; Flags:
Precursor
gi|454187|dbj|BAA03125.1| OC-2 protein [Oryctolagus cuniculus]
Length = 329
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGGYMTNAFQYVQRNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 31/68 (45%), Positives = 45/68 (66%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD S C GG ++ + ++ N G
Sbjct: 511 GSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNS-GCNGGLMDYAFAFIASNGG 569
Query: 65 INTERDYP 72
++ E DYP
Sbjct: 570 LHKEDDYP 577
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I T LV +S Q+LVDCD E++ C GG ++ + ++ + G
Sbjct: 148 GSCWAFSTVVAVEGINQIKTKKLVSLSEQELVDCDTT-ENQGCNGGLMDPAFDFIKKRGG 206
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE YP D C + + N
Sbjct: 207 ITTEERYPYKAEDDKCDIQKRN 228
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 55/83 (66%), Gaps = 1/83 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+ GI++IVT ++ +S Q+LVDCD + ++ C GG ++ ++++I N G
Sbjct: 162 GSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCD-RVQNSGCNGGLMDYAFEFIISNGG 220
Query: 65 INTERDYPNVGVMDNCKVFQFNW 87
++TE+ YP GV C + N+
Sbjct: 221 MDTEKHYPYRGVEGRCDPVRKNY 243
>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
Length = 291
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ + +EGI KIVT LV +S Q+++DC S C GGF++ Y ++I
Sbjct: 143 NPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDC---AVSNGCDGGFVDNAYDFIIS 199
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G+ +E DYP +C
Sbjct: 200 NNGVASEADYPYQAYQGDCAA 220
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 35/79 (44%), Positives = 51/79 (64%), Gaps = 7/79 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ +G++EGI+ I T V +S Q+LVDCD NQG C GG ++ + ++++
Sbjct: 152 GSCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQG----CNGGLMDYAFDFILE 207
Query: 62 NRGINTERDYPNVGVMDNC 80
N GI+TE DYP G+ C
Sbjct: 208 NGGIDTENDYPYKGLDGRC 226
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEG++ I T LV +S Q+LV CD + C GG ++ + +VIQN G
Sbjct: 164 GSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACD--ATNYGCEGGDMDYAFTWVIQNGG 221
Query: 65 INTERDYPNVGVMDNC 80
I+TE+DY GV C
Sbjct: 222 IDTEKDYSYTGVDSTC 237
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EG+++IVT +L+ +S Q+LV+CD + C GG ++ ++++I+N G
Sbjct: 154 GSCWAFSAIAAVEGVNQIVTGDLISLSEQELVECDTS-YNDGCDGGLMDYAFEFIIKNEG 212
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+++ DYP G C + N
Sbjct: 213 IDSDEDYPYTGRDGRCDTNRKN 234
>gi|2914594|pdb|1MEM|A Chain A, Crystal Structure Of Cathepsin K Complexed With A Potent
Vinyl Sulfone Inhibitor
gi|28374044|pdb|1NL6|A Chain A, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With A Covalent Azepanone
Inhibitor
gi|28374045|pdb|1NL6|B Chain B, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With A Covalent Azepanone
Inhibitor
gi|28374047|pdb|1NLJ|A Chain A, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With A Covalent Azepanone
Inhibitor
gi|28374048|pdb|1NLJ|B Chain B, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With A Covalent Azepanone
Inhibitor
gi|47168617|pdb|1Q6K|A Chain A, Cathepsin K Complexed With T-butyl(1s)-1-cyclohexyl-2-
Oxoethylcarbamate
gi|55670045|pdb|1TU6|A Chain A, Cathepsin K Complexed With A Ketoamide Inhibitor
gi|55670046|pdb|1TU6|B Chain B, Cathepsin K Complexed With A Ketoamide Inhibitor
gi|62738654|pdb|1YK7|A Chain A, Cathepsin K Complexed With A Cyanopyrrolidine Inhibitor
gi|73535690|pdb|1YK8|A Chain A, Cathepsin K Complexed With A Cyanamide-Based Inhibitor
gi|73535721|pdb|1YT7|A Chain A, Cathepsin K Complexed With A Constrained Ketoamide
Inhibitor
gi|93278849|pdb|2BDL|A Chain A, Cathepsin K Complexed With A Pyrrolidine Ketoamide-Based
Inhibitor
gi|114793438|pdb|2ATO|A Chain A, Crystal Structure Of Human Cathepsin K In Complex With
Myocrisin
gi|114793448|pdb|2AUX|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
gi|114793451|pdb|2AUZ|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
gi|126030469|pdb|2FTD|A Chain A, Crystal Structure Of Cathepsin K Complexed With
7-Methyl- Substituted Azepan-3-One Compound
gi|126030470|pdb|2FTD|B Chain B, Crystal Structure Of Cathepsin K Complexed With
7-Methyl- Substituted Azepan-3-One Compound
gi|157830076|pdb|1ATK|A Chain A, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With The Covalent Inhibitor E-64
gi|157830085|pdb|1AU0|A Chain A, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With A Covalent Symmetric
Diacylaminomethyl Ketone Inhibitor
gi|157830086|pdb|1AU2|A Chain A, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With A Covalent Propanone
Inhibitor
gi|157830087|pdb|1AU3|A Chain A, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With A Covalent Pyrrolidinone
Inhibitor
gi|157830088|pdb|1AU4|A Chain A, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With A Covalent Pyrrolidinone
Inhibitor
gi|157830146|pdb|1AYU|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Symmetric Biscarbohydrazide
Inhibitor
gi|157830147|pdb|1AYV|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Thiazolhydrazide Inhibitor
gi|157830148|pdb|1AYW|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent
Benzyloxybenzoylcarbohydrazide Inhibitor
gi|157830300|pdb|1BGO|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Peptidomimetic Inhibitor
gi|197305045|pdb|3C9E|A Chain A, Crystal Structure Of The Cathepsin K : Chondroitin
Sulfate Complex.
gi|290560385|pdb|3KW9|A Chain A, X-Ray Structure Of Cathepsin K Covalently Bound To A
Triazine Ligand
gi|290560386|pdb|3KWZ|A Chain A, Cathepsin K In Complex With A Non-Selective 2-Cyano-
Pyrimidine Inhibitor
gi|290560387|pdb|3KX1|A Chain A, Cathepsin K In Complex With A Selective
2-Cyano-Pyrimidine Inhibitor
gi|293651910|pdb|3KWB|X Chain X, Structure Of Catk Covalently Bound To A Dioxo-Triazine
Inhibitor
gi|293651911|pdb|3KWB|Y Chain Y, Structure Of Catk Covalently Bound To A Dioxo-Triazine
Inhibitor
gi|308198615|pdb|3O1G|A Chain A, Cathepsin K Covalently Bound To A 2-Cyano Pyrimidine
Inhibitor With A Benzyl P3 Group.
gi|327200584|pdb|3O0U|A Chain A, Cathepsin K Covalently Bound To A Cyano-Pyrimidine
Inhibitor With Improved Selectivity Over Herg
gi|394986262|pdb|4DMX|A Chain A, Cathepsin K Inhibitor
gi|394986263|pdb|4DMY|A Chain A, Cathepsin K Inhibitor
gi|394986264|pdb|4DMY|B Chain B, Cathepsin K Inhibitor
Length = 215
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC + E+ C GG++ +QYV +NRG
Sbjct: 23 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 80
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 81 IDSEDAYPYVGQEESC 96
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/67 (47%), Positives = 46/67 (68%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ V A+EGI+KIVT NL+ +S Q+LVDC ++ C G + ++++I N GI
Sbjct: 149 SCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQITKGCNRGLMTDAFKFIINNGGI 208
Query: 66 NTERDYP 72
NTE +YP
Sbjct: 209 NTENNYP 215
>gi|281211531|gb|EFA85693.1| cysteine protease [Polysphondylium pallidum PN500]
Length = 366
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 44/68 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG +I T N+V++S Q LVDC + + C GG + + Y+I N G
Sbjct: 136 GSCWSFSTTGSVEGAHQIKTGNMVELSEQNLVDCSSAEGNMGCNGGLMNNAFDYIISNHG 195
Query: 65 INTERDYP 72
I+TE+ YP
Sbjct: 196 IDTEQSYP 203
>gi|85000505|ref|XP_954971.1| cysteine proteinase precursor, tacP [Theileria annulata strain
Ankara]
gi|65303117|emb|CAI75495.1| cysteine proteinase precursor, tacP, putative [Theileria annulata]
Length = 447
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 51/78 (65%), Gaps = 3/78 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA +G++E + KI + +D+S Q+LVDC+ + S+ C GGF +T QY IQN+G
Sbjct: 259 GSCWAFATIGSVESLYKIYRDVTLDLSEQELVDCETK--SKGCEGGFGDTALQY-IQNKG 315
Query: 65 INTERDYPNVGVMDNCKV 82
++ + D P V + C V
Sbjct: 316 VSNDNDIPYVAKKNTCVV 333
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 47/81 (58%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
H GSCW FA V AIEGI +I T LV +S Q+LVDC + C GG++E ++++
Sbjct: 145 HLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDACDFIVK 204
Query: 62 NRGINTERDYPNVGVMDNCKV 82
GI +E +YP V C V
Sbjct: 205 KGGITSETNYPYTRVDGKCNV 225
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ + +EGI KIVT LV +S Q+++DC S C GGF++ Y ++I
Sbjct: 143 NPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDC---AVSNGCDGGFVDNAYDFIIS 199
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G+ +E DYP +C
Sbjct: 200 NNGVASEADYPYQAYQGDCAA 220
>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
Length = 331
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/69 (47%), Positives = 45/69 (65%), Gaps = 1/69 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-RSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + S + C GGF+ + +QY+I N
Sbjct: 137 GSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYSNKGCNGGFMTSAFQYIIDNN 196
Query: 64 GINTERDYP 72
GI++E YP
Sbjct: 197 GIDSEASYP 205
>gi|313118764|gb|ADR32294.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 33/68 (48%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD + ++ C GG ++ +++VI N G
Sbjct: 23 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSYNQGCDGGLMDYAFEFVINNGG 81
Query: 65 INTERDYP 72
I++E DYP
Sbjct: 82 IDSEEDYP 89
>gi|75765285|pdb|1U9V|A Chain A, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With The Covalent Inhibitor
Nvp-Abe854
gi|75765286|pdb|1U9W|A Chain A, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With The Covalent Inhibitor
Nvp-Abi491
gi|75765287|pdb|1U9X|A Chain A, Crystal Structure Of The Cysteine Protease Human
Cathepsin K In Complex With The Covalent Inhibitor
Nvp-Abj688
gi|160286063|pdb|2R6N|A Chain A, Crystal Structure Of A Pyrrolopyrimidine Inhibitor In
Complex With Human Cathepsin K
Length = 217
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC + E+ C GG++ +QYV +NRG
Sbjct: 25 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 82
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 83 IDSEDAYPYVGQEESC 98
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 49/76 (64%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +E I+KIVT LV +S Q+LVDCD + + C GG ++ ++++I N G
Sbjct: 150 GSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCD-RAFNEGCNGGLMDYAFEFIIGNGG 208
Query: 65 INTERDYPNVGVMDNC 80
I+T++ YP G C
Sbjct: 209 IDTDQHYPYKGFEGRC 224
>gi|315075311|ref|NP_001186668.1| cathepsin S isoform 2 preproprotein [Homo sapiens]
gi|194376464|dbj|BAG62991.1| unnamed protein product [Homo sapiens]
Length = 281
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 87 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 146
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 147 GIDSDASYPYKAMDQKCQ 164
>gi|50513589|pdb|1SNK|A Chain A, Cathepsin K Complexed With Carbamate Derivatized
Norleucine Aldehyde
Length = 214
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC + E+ C GG++ +QYV +NRG
Sbjct: 22 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 79
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 80 IDSEDAYPYVGQEESC 95
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD+ + C GG + ++
Sbjct: 161 GSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE 220
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++Q+ G+ E+DYP G CK
Sbjct: 221 YILQSGGVQKEKDYPYTGRDGTCK 244
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N
Sbjct: 145 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNN 204
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 205 GIDSEASYPYKAMNGKCR 222
>gi|315364648|pdb|3OVZ|A Chain A, Cathepsin K In Complex With A Covalent Inhibitor With A
Ketoamide Warhead
Length = 213
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC + E+ C GG++ +QYV +NRG
Sbjct: 21 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 78
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 79 IDSEDAYPYVGQEESC 94
>gi|344275468|ref|XP_003409534.1| PREDICTED: cathepsin K-like [Loxodonta africana]
Length = 329
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210
>gi|71027309|ref|XP_763298.1| cysteine proteinase [Theileria parva strain Muguga]
gi|68350251|gb|EAN31015.1| cysteine proteinase, putative [Theileria parva]
Length = 460
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 35/78 (44%), Positives = 52/78 (66%), Gaps = 3/78 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA V ++E + KI N +D+S Q+LVDC+ S+ C GGF +T +Y IQN+G
Sbjct: 272 GSCWAFASVSSVESLYKIYRNVTLDLSEQELVDCETS--SKGCEGGFGDTALKY-IQNKG 328
Query: 65 INTERDYPNVGVMDNCKV 82
++T+ + P +G +NC V
Sbjct: 329 VSTDSEIPYLGKKNNCLV 346
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 49/76 (64%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIV NL +S Q+L+DCD + + C GG ++ + +++ + G
Sbjct: 152 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCD-RPYNNGCHGGLMDYAFSFIVSSGG 210
Query: 65 INTERDYPNVGVMDNC 80
++ E DYP + V C
Sbjct: 211 LHKEEDYPYLEVESTC 226
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 49/77 (63%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I T L +S Q+L+DCD + C GGF++ + Y++ N G
Sbjct: 155 GSCWAFSTVAAVEGINQIATGKLESLSEQELMDCDTTFD-HGCGGGFMDFAFAYIMGNLG 213
Query: 65 INTERDYPNVGVMDNCK 81
I+T+ DYP + CK
Sbjct: 214 IHTDDDYPYLMEEGYCK 230
>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
GSCW F+ GA+EG + I T NL+++S QQLVDCD+ + + C GG + Y+
Sbjct: 200 GSCWAFSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYK 259
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y+IQ+ G+ E YP G C
Sbjct: 260 YLIQSGGLEEESSYPYTGRSGQC 282
>gi|46948144|gb|AAT07054.1| cathepsin L-like cysteine proteinase [Brugia malayi]
Length = 368
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 50/78 (64%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC-DNQGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA++G + T LV++S Q L+DC D+ + C GG + ++YV++N
Sbjct: 162 GSCWTFSAVGALKGQHFLQTGKLVELSMQNLLDCSDDTYGNYGCDGGLMMEAFEYVVKND 221
Query: 64 GINTERDYPNVGVMDNCK 81
GI+TE+ YP G + C+
Sbjct: 222 GIDTEKSYPYQGYQNTCR 239
>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
Length = 367
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD+ + C GG + ++
Sbjct: 158 GSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE 217
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++Q+ G+ E+DYP G CK
Sbjct: 218 YILQSGGVQKEKDYPYTGRDGTCK 241
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 47/84 (55%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD+ + C GG + +
Sbjct: 149 GSCWAFSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAGCNGGLMNNAFD 208
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++Q G+ TE+DYP G + CK
Sbjct: 209 YILQAGGVQTEKDYPYSGRDETCK 232
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 49/76 (64%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIV NL +S Q+L+DCD + + C GG ++ + +++ + G
Sbjct: 155 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCD-RPYNNGCHGGLMDYAFSFIVSSGG 213
Query: 65 INTERDYPNVGVMDNC 80
++ E DYP + V C
Sbjct: 214 LHKEEDYPYLEVESTC 229
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KI T LV +S Q+L+DCDN ++ C GG ++ +Q+ IQ G
Sbjct: 154 GSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCDGGLMDYAFQF-IQKNG 211
Query: 65 INTERDYPNVGVMDNC 80
I TE +YP G +C
Sbjct: 212 ITTESNYPYQGEQGSC 227
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KI T LV +S Q+L+DCDN ++ C GG ++ +Q+ IQ G
Sbjct: 154 GSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCDGGLMDYAFQF-IQKNG 211
Query: 65 INTERDYPNVGVMDNC 80
I TE +YP G +C
Sbjct: 212 ITTESNYPYQGEQGSC 227
>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
Length = 331
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNN 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 197 GIDSEASYPYKAMNGKCR 214
>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
GSCW F+ GA+EG + I T NL+++S QQLVDCD+ + + C GG + Y+
Sbjct: 200 GSCWAFSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYK 259
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y+IQ+ G+ E YP G C
Sbjct: 260 YLIQSGGLEEESSYPYTGRSGQC 282
>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
Length = 332
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214
>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
Length = 341
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
GSCW F+ VGA+E K+ T LV +S Q LVDC + + ++ C GGF+ +QY+I N
Sbjct: 146 GSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCDGGFMTRAFQYIIDN 205
Query: 63 RGINTERDYPNVGVMDNC 80
GI+++ YP V + C
Sbjct: 206 GGIDSDASYPYKAVAEKC 223
>gi|229366026|gb|ACQ57993.1| Cathepsin H precursor [Anoplopoma fimbria]
Length = 247
Score = 73.6 bits (179), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 45/80 (56%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G +E ++ I T LV +S QQLVDC + C GG ++Y++ ++G
Sbjct: 130 GSCWTFSTTGCLESVTAISTGKLVPLSEQQLVDCAQDFNNHGCNGGLPSQAFEYIMYSKG 189
Query: 65 INTERDYPNVGVMDNCKVFQ 84
+ TE+DYP D C Q
Sbjct: 190 LMTEKDYPYTAFEDTCAYKQ 209
>gi|334324659|ref|XP_001371004.2| PREDICTED: cathepsin K-like [Monodelphis domestica]
Length = 332
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 140 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 197
Query: 65 INTERDYPNVGVMDNC 80
I++E YP +G ++C
Sbjct: 198 IDSEDAYPYIGEDESC 213
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 73.6 bits (179), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ + +EGI KIVT LV +S Q+++DC S C GGF++ Y ++I
Sbjct: 103 NPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDC---AVSNGCDGGFVDNAYDFIIS 159
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G+ +E DYP +C
Sbjct: 160 NNGVASEADYPYQAYQGDCAA 180
>gi|59798093|sp|P84346.1|MEX1_JACME RecName: Full=Mexicain
Length = 214
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ V IEGI+KI+T L+ +S Q+L+DC+ + S C GG+ QYV+
Sbjct: 20 NPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYR--SHGCDGGYQTPSLQYVVD 77
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G++TER+YP C+
Sbjct: 78 N-GVHTEREYPYEKKQGRCRA 97
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 31/71 (43%), Positives = 45/71 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI T LV +S Q+LVDCD G + C GG ++ +Q+V + G
Sbjct: 147 GCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGG 206
Query: 65 INTERDYPNVG 75
+ +E YP G
Sbjct: 207 LASESGYPYQG 217
>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
Length = 341
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG TN LV +S Q L+DC + C GG ++ ++Y+ N+G
Sbjct: 146 GSCWSFSATGALEGQHYRQTNILVSLSEQNLIDCSTAYGNNGCNGGLMDNAFKYIKDNKG 205
Query: 65 INTERDYPNVGVMDNCK 81
I+TE+ YP V D C+
Sbjct: 206 IDTEKSYPYEAVDDKCR 222
>gi|114559420|ref|XP_001171183.1| PREDICTED: cathepsin S isoform 1 [Pan troglodytes]
gi|397492868|ref|XP_003817342.1| PREDICTED: cathepsin S isoform 2 [Pan paniscus]
Length = 281
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 87 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 146
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP C+
Sbjct: 147 GIDSDASYPYKATDQKCQ 164
>gi|224106333|ref|XP_002333699.1| predicted protein [Populus trichocarpa]
gi|222837985|gb|EEE76350.1| predicted protein [Populus trichocarpa]
Length = 197
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 51/77 (66%), Gaps = 2/77 (2%)
Query: 4 LGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNR 63
+G CW F+ V AIEGI K+ T NL+ +S QQLV+ D ++ C GG ++T +QY+I+N
Sbjct: 2 VGCCWAFSAVAAIEGIIKLKTGNLISLSKQQLVNRDVG--NKGCHGGLMDTAFQYIIRNE 59
Query: 64 GINTERDYPNVGVMDNC 80
G+ +E +YP GV C
Sbjct: 60 GLTSEDNYPYQGVDGTC 76
>gi|113120273|gb|ABI30276.1| VXH-C [Vasconcellea x heilbornii]
Length = 282
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 50/80 (62%), Gaps = 3/80 (3%)
Query: 3 PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
P GSCW F+ V +EGI+KIVT L+ +S Q+L+DCD + S C GG+ T QYV+ N
Sbjct: 155 PCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR--SHGCDGGYQRTSLQYVVDN 212
Query: 63 RGINTERDYPNVGVMDNCKV 82
G++TE +Y NC+
Sbjct: 213 -GVHTEYEYQYEKKQGNCRA 231
>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
Length = 381
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 52/84 (61%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
GSCW F+ GA+EG + T L +S QQ+VDCD++ ESR+ C GG + T +
Sbjct: 167 GSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFS 226
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++++ G+ +E+DYP G + CK
Sbjct: 227 YLMKSGGLQSEKDYPYAGRENTCK 250
>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
Length = 331
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214
>gi|348513412|ref|XP_003444236.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 328
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ GA+EG K T L +STQ L+DC +R C GG I ++YV+ N+G
Sbjct: 135 GACWAFSAAGALEGQLKKSTGILRSLSTQNLIDCTTDYGNRGCNGGLIARAFKYVVDNQG 194
Query: 65 INTERDYPNVGVMDNCK 81
I +E YP +G + CK
Sbjct: 195 IASEDAYPYIGRHNQCK 211
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 47/76 (61%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+LVDC G + C GG ++ + Y+ + G
Sbjct: 174 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNN-GCNGGVMDNAFSYIASSGG 232
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP + +C
Sbjct: 233 LRTEEAYPYLMEEGDC 248
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI K+ T L+ +S Q+LVDCD G + C GG ++ ++++I+N G
Sbjct: 145 GCCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVDQGCEGGEMDDAFKFIIKNGG 204
Query: 65 INTERDYPNVGVMDNCK 81
+ TE +YP CK
Sbjct: 205 LTTEANYPYTAQDGQCK 221
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI ++ T LV +S Q+LVDCD G C GG ++ ++++I+N G
Sbjct: 153 GCCWAFSAVAATEGIVQLSTGKLVPLSEQELVDCDANGADHGCEGGEMDDAFEFIIKNGG 212
Query: 65 INTERDYPNVGVMDNCK 81
+ +E +YP CK
Sbjct: 213 LTSETNYPYTAQDGQCK 229
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 44/68 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I T+NL +S QQLVDCD + + C GG ++ +QY+ ++ G
Sbjct: 158 GSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGG 217
Query: 65 INTERDYP 72
+ YP
Sbjct: 218 VAASSAYP 225
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 44/68 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I T+NL +S QQLVDCD + + C GG ++ +QY+ ++ G
Sbjct: 142 GSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGG 201
Query: 65 INTERDYP 72
+ YP
Sbjct: 202 VAASSAYP 209
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 50/76 (65%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +E I+KIVT V +S Q+LVDCD + + C GG ++ ++++I+N G
Sbjct: 114 GSCWAFSTIATVEAINKIVTGKFVSLSEQELVDCD-RAFNEGCNGGLMDYAFEFIIRNGG 172
Query: 65 INTERDYPNVGVMDNC 80
I+T++DYP G C
Sbjct: 173 IDTDQDYPYNGFERKC 188
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 7/82 (8%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD--NQGESRSCVG-GFIETIYQYVIQN 62
SCW F+ V A+EG++KIVT L+ +S Q+LVDC+ N G C G G ++T +Q++I N
Sbjct: 156 SCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNLVNNG----CYGSGLMDTAFQFLINN 211
Query: 63 RGINTERDYPNVGVMDNCKVFQ 84
G+++E+DYP G +C Q
Sbjct: 212 NGLDSEKDYPYQGTQGSCNRKQ 233
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 50/77 (64%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT LV +S Q+LVDCD + C GG ++ + Y++ ++G
Sbjct: 157 GSCWAFSSVAAVEGINQIVTGKLVSLSEQELVDCDTTLD-HGCEGGTMDLAFAYMMGSQG 215
Query: 65 INTERDYPNVGVMDNCK 81
I+ E DYP + CK
Sbjct: 216 IHAEDDYPYLMEEGYCK 232
>gi|395535911|ref|XP_003769964.1| PREDICTED: cathepsin K [Sarcophilus harrisii]
Length = 332
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC ++ + C GG++ +QYV +NRG
Sbjct: 140 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSKND--GCGGGYMTNAFQYVQENRG 197
Query: 65 INTERDYPNVGVMDNC 80
I++E YP +G ++C
Sbjct: 198 IDSEDAYPYIGQDESC 213
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 44/68 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI T LV +S Q+LVDCD G + C GG ++ +Q+V + G
Sbjct: 147 GCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGG 206
Query: 65 INTERDYP 72
+ +E YP
Sbjct: 207 LASESGYP 214
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 50/77 (64%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT LV +S Q+L+DCD + C GG ++ + Y++ ++G
Sbjct: 155 GSCWAFSSVAAVEGINQIVTGKLVSLSEQELMDCDTMLD-HGCEGGLMDFAFAYIMGSQG 213
Query: 65 INTERDYPNVGVMDNCK 81
I+ E DYP + CK
Sbjct: 214 IHAEDDYPYLMEEGYCK 230
>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 365
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 33/83 (39%), Positives = 47/83 (56%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD++ +S C GG + T ++
Sbjct: 154 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFE 213
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y ++ G+ E+DYP G C
Sbjct: 214 YTLKAGGLQLEKDYPYTGKDGKC 236
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ + +EGI KIVT LV +S Q+++DC S C GGF++ Y ++I
Sbjct: 142 NPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDC---AVSNGCDGGFVDNAYDFIIS 198
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G+ +E DYP +C
Sbjct: 199 NNGVASEADYPYQAYEGDCTA 219
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 49/76 (64%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S QQLVDCD + + C GG ++ + ++ N G
Sbjct: 151 GSCWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTK--NSGCNGGLMDYAFDFIKNNGG 208
Query: 65 INTERDYPNVGVMDNC 80
+++E YP + +C
Sbjct: 209 LSSEDSYPYLAEQKSC 224
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 52/78 (66%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+KI T LV +S Q+L+DCDN ++ C GG ++ +Q++ +N G
Sbjct: 153 GSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCDGGLMDYAFQFIHKN-G 210
Query: 65 INTERDYPNVGVMDNCKV 82
I TE +YP G +C +
Sbjct: 211 ITTESNYPYQGEQGSCDL 228
>gi|150261413|pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of A Plant
Cysteine Protease Ervatamin-C Refinement With Cdna
Derived Amino Acid Sequence
gi|150261414|pdb|2PNS|B Chain B, 1.9 Angstrom Resolution Crystal Structure Of A Plant
Cysteine Protease Ervatamin-C Refinement With Cdna
Derived Amino Acid Sequence
gi|166007115|pdb|2PRE|A Chain A, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
Complexed With Irreversible Inhibitor E-64 At 2.7 A
Resolution
gi|166007116|pdb|2PRE|B Chain B, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
Complexed With Irreversible Inhibitor E-64 At 2.7 A
Resolution
Length = 208
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 49/80 (61%), Gaps = 2/80 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I++I T NL+ +S QQLVDC+ + + C GG YQY+I N G
Sbjct: 23 GSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKK--NHGCKGGAFVYAYQYIIDNGG 80
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+TE +YP V C+ +
Sbjct: 81 IDTEANYPYKAVQGPCRAAK 100
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 45/76 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + + C GG ++ +QY+ N+G
Sbjct: 149 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQKYGNNGCNGGMMDFAFQYIKDNKG 208
Query: 65 INTERDYPNVGVMDNC 80
I+TE+ YP + D C
Sbjct: 209 IDTEKSYPYEAIDDEC 224
>gi|149751225|ref|XP_001490531.1| PREDICTED: cathepsin S-like [Equus caballus]
Length = 332
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-RSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T NLV +S Q LVDC + S + C GGF+ +QY+I N
Sbjct: 138 GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYIIDNN 197
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 198 GIDSDASYPYKAMDGKCR 215
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + +R C GGF+ +QY+I N
Sbjct: 148 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSVGKYSNRGCNGGFMTEAFQYIIDNN 207
Query: 64 GINTERDYPNVGVMDNCK 81
GI +E YP + C+
Sbjct: 208 GIESEASYPYKAMDGKCQ 225
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG + T LV +S Q LVDC + C GG ++ +QY+ +N G
Sbjct: 140 GSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCSGSYGNNGCEGGLMDNAFQYIKENHG 199
Query: 65 INTERDYPNVGVMDNCK 81
I+TE+ YP G + C+
Sbjct: 200 IDTEKSYPYEGEDETCR 216
>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
Length = 331
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214
>gi|114559412|ref|XP_001171151.1| PREDICTED: cathepsin K isoform 4 [Pan troglodytes]
gi|410221358|gb|JAA07898.1| cathepsin K [Pan troglodytes]
gi|410248298|gb|JAA12116.1| cathepsin K [Pan troglodytes]
gi|410301088|gb|JAA29144.1| cathepsin K [Pan troglodytes]
gi|410351445|gb|JAA42326.1| cathepsin K [Pan troglodytes]
Length = 329
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ ++YV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFEYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 47/70 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEG + + T NLV +S QQLVDC ++ + +C GG ++ ++YV + G
Sbjct: 174 GSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNG 233
Query: 65 INTERDYPNV 74
I+TE YP V
Sbjct: 234 IDTEASYPYV 243
>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
Length = 331
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/68 (48%), Positives = 47/68 (69%), Gaps = 2/68 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA + A+EGI++IVT +L+ +S QQLVDC + + C GG+ +QY+I N G
Sbjct: 165 GSCWAFAAIAAVEGINQIVTGDLISLSEQQLVDCSTR--NYGCEGGWPYRAFQYIINNGG 222
Query: 65 INTERDYP 72
+N+E YP
Sbjct: 223 VNSEEHYP 230
>gi|148224682|ref|NP_001086670.1| cathepsin S [Xenopus laevis]
gi|50418223|gb|AAH77285.1| Ctss-prov protein [Xenopus laevis]
Length = 320
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 48/76 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG + T +V +S Q LVDC ++ ++ C GGF+ +QYVI N G
Sbjct: 127 GSCWAFSAVGALEGQLMLKTGKIVSLSPQNLVDCSSKYGNKGCSGGFMTRAFQYVIDNNG 186
Query: 65 INTERDYPNVGVMDNC 80
I+++ YP + + C
Sbjct: 187 IDSDTYYPYHAMDEKC 202
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++I T +L+ +S Q+LVDCD + ++ C GG ++ ++++I N G
Sbjct: 164 GSCWAFSTIAAVEGINQIATGDLISLSEQELVDCD-KSYNQGCNGGLMDYAFEFIINNGG 222
Query: 65 INTERDYPNVGVMDNC 80
I++E DYP C
Sbjct: 223 IDSEEDYPYRAADTTC 238
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT +L+ +S QQLVDC + C GG++ +Q+++ N G
Sbjct: 164 GSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGG 221
Query: 65 INTERDYPNVG 75
IN+E YP G
Sbjct: 222 INSEETYPYRG 232
>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
Length = 364
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 52/84 (61%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
GSCW F+ GA+EG + T L +S QQ+VDCD++ ESR+ C GG + T +
Sbjct: 150 GSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFS 209
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++++ G+ +E+DYP G + CK
Sbjct: 210 YLMKSGGLQSEKDYPYAGRENTCK 233
>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
Length = 385
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 47/70 (67%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEG + + T NLV +S QQLVDC ++ + +C GG ++ ++YV + G
Sbjct: 186 GSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNG 245
Query: 65 INTERDYPNV 74
I+TE YP V
Sbjct: 246 IDTEASYPYV 255
>gi|349604730|gb|AEQ00199.1| Cathepsin K-like protein, partial [Equus caballus]
Length = 219
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC + E+ C GG++ +QYV +NRG
Sbjct: 27 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRG 84
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 85 IDSEDAYPYVGQDESC 100
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 7/82 (8%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD--NQGESRSCVG-GFIETIYQYVIQN 62
SCW F+ V A+EG++KIVT L+ +S Q+LVDC+ N G C G G ++T +Q++I N
Sbjct: 156 SCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNLVNNG----CYGSGLMDTAFQFLINN 211
Query: 63 RGINTERDYPNVGVMDNCKVFQ 84
G+++E+DYP G +C Q
Sbjct: 212 NGLDSEKDYPYQGTQGSCNRKQ 233
>gi|93279455|pdb|2F7D|A Chain A, A Mutant Rabbit Cathepsin K With A Nitrile Inhibitor
Length = 215
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC + E+ C GG++ +QYV +NRG
Sbjct: 23 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQRNRG 80
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 81 IDSEDAYPYVGQDESC 96
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 31/82 (37%), Positives = 51/82 (62%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GS W F+ + A+E I++IVT L+ +S Q+L+DCD + C GG ++ ++++I N G
Sbjct: 156 GSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNA-GCDGGLMDDAFEFIISNGG 214
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+T+ DYP D+C + N
Sbjct: 215 IDTDEDYPYKARNDSCDANKRN 236
>gi|114559418|ref|XP_001171268.1| PREDICTED: cathepsin S isoform 3 [Pan troglodytes]
gi|397492866|ref|XP_003817341.1| PREDICTED: cathepsin S isoform 1 [Pan paniscus]
gi|410225070|gb|JAA09754.1| cathepsin S [Pan troglodytes]
gi|410251608|gb|JAA13771.1| cathepsin S [Pan troglodytes]
gi|410328325|gb|JAA33109.1| cathepsin S [Pan troglodytes]
gi|410328327|gb|JAA33110.1| cathepsin S [Pan troglodytes]
Length = 331
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP C+
Sbjct: 197 GIDSDASYPYKATDQKCQ 214
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT +L+ +S QQLVDC + C GG++ +Q+++ N G
Sbjct: 166 GSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGG 223
Query: 65 INTERDYPNVG 75
IN+E YP G
Sbjct: 224 INSEETYPYRG 234
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 53/76 (69%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ V A+E I+KIVT +LV +S Q+LVDCD + +++ C GG Y+++++N G
Sbjct: 143 GACWAFSAVAAVEAINKIVTGSLVSLSEQELVDCD-RTKNKGCNGGNQVNAYRFIVENGG 201
Query: 65 INTERDYPNVGVMDNC 80
++++ DYP +G C
Sbjct: 202 LDSQIDYPYLGRQSTC 217
>gi|297684916|ref|XP_002820055.1| PREDICTED: cathepsin L2 isoform 3 [Pongo abelii]
Length = 345
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + ++ C GGF++ +QYV +N G
Sbjct: 147 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMDKAFQYVKENGG 206
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V + + CK
Sbjct: 207 LDSEESYPYVAMDEICK 223
>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
Length = 366
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 35/83 (42%), Positives = 50/83 (60%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
GSCW F+ GA+EG + + T LV +S QQLVDCD++ ++RS C GG + + YQ
Sbjct: 161 GSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQ 220
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y +++ G+ E DYP G C
Sbjct: 221 YALKSGGLEKEEDYPYTGKDGTC 243
>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
Length = 366
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 35/83 (42%), Positives = 50/83 (60%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
GSCW F+ GA+EG + + T LV +S QQLVDCD++ ++RS C GG + + YQ
Sbjct: 161 GSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQ 220
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y +++ G+ E DYP G C
Sbjct: 221 YALKSGGLEKEEDYPYTGKDGTC 243
>gi|209731972|gb|ACI66855.1| Cathepsin H precursor [Salmo salar]
Length = 328
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 43/77 (55%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G +E ++ I T L+ +S QQLVDC + C GG ++Y+ N+G
Sbjct: 132 GSCWTFSTTGCLESVTAIATGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAFEYIKFNKG 191
Query: 65 INTERDYPNVGVMDNCK 81
I TE DYP D CK
Sbjct: 192 IMTEDDYPYTAHDDTCK 208
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 31/68 (45%), Positives = 46/68 (67%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG S I T L +S Q LVDCD + ++ C GG ++ ++++++N G
Sbjct: 147 GSCWAFSTTGAVEGASAIATGKLASLSEQMLVDCDRERDN-GCHGGLMDFAFEFIMKNGG 205
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 206 IDTEDDYP 213
>gi|410968392|ref|XP_003990691.1| PREDICTED: cathepsin S, partial [Felis catus]
Length = 310
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T NLV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 149 GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNN 208
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 209 GIDSEASYPYKAMDGKCQ 226
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++I T +L+ +S Q+LVDCD + ++ C GG ++ ++++I N G
Sbjct: 81 GSCWAFSTIAAVEGINQIATGDLISLSEQELVDCD-KSYNQGCNGGLMDYAFEFIINNGG 139
Query: 65 INTERDYPNVGVMDNC 80
I++E DYP C
Sbjct: 140 IDSEEDYPYRAADTTC 155
>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
Length = 366
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 35/83 (42%), Positives = 50/83 (60%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
GSCW F+ GA+EG + + T LV +S QQLVDCD++ ++RS C GG + + YQ
Sbjct: 161 GSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQ 220
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y +++ G+ E DYP G C
Sbjct: 221 YALKSGGLEKEEDYPYTGKDGTC 243
>gi|146386366|gb|ABQ23971.1| cathepsin K [Oryctolagus cuniculus]
Length = 183
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC + E+ C GG++ +QYV +NRG
Sbjct: 2 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENYGCGGGYMTNAFQYVQRNRG 59
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 60 IDSEDAYPYVGQDESC 75
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI +I T+ L+ +S Q+LVDCD+ C GG++E ++++I+N G
Sbjct: 143 GSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV--DHGCDGGYMEGGFEFIIKNGG 200
Query: 65 INTERDYPNVGVMDNC 80
I++E +YP V C
Sbjct: 201 ISSEANYPYTAVDGTC 216
>gi|297684914|ref|XP_002820054.1| PREDICTED: cathepsin L2 isoform 2 [Pongo abelii]
Length = 334
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 48/77 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + ++ C GGF++ +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMDKAFQYVKENGG 195
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V + + CK
Sbjct: 196 LDSEESYPYVAMDEICK 212
>gi|139002720|dbj|BAF51966.1| cathepsin K [Carassius auratus]
gi|139002725|dbj|BAF51967.1| tartrate-resistant acid phosphatase [Carassius auratus]
Length = 332
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 37/78 (47%), Positives = 47/78 (60%), Gaps = 6/78 (7%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDC--DNQGESRSCVGGFIETIYQYVIQN 62
GSCW F+ VGA+EG K LVD+S Q LVDC DN G C GG++ ++YV N
Sbjct: 139 GSCWAFSSVGALEGQLKKTKGQLVDLSPQNLVDCVTDNDG----CGGGYMTNAFRYVKDN 194
Query: 63 RGINTERDYPNVGVMDNC 80
+GI++E YP VG C
Sbjct: 195 QGIDSEEGYPYVGTDQQC 212
>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
Length = 367
Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats.
Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ--GESRS-----CVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD++ E +S C GG + T ++
Sbjct: 155 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEQKSECDAGCGGGLMTTAFE 214
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y ++ G+ E+DYP G C
Sbjct: 215 YTLKAGGLQREKDYPYTGRNGQC 237
>gi|30749499|pdb|1MS6|A Chain A, Dipeptide Nitrile Inhibitor Bound To Cathepsin S.
gi|163310952|pdb|2R9M|A Chain A, Cathepsin S Complexed With Compound 15
gi|163310953|pdb|2R9M|B Chain B, Cathepsin S Complexed With Compound 15
gi|163310954|pdb|2R9N|A Chain A, Cathepsin S Complexed With Compound 26
gi|163310955|pdb|2R9N|B Chain B, Cathepsin S Complexed With Compound 26
gi|163310956|pdb|2R9O|A Chain A, Cathepsin S Complexed With Compound 8
gi|163310957|pdb|2R9O|B Chain B, Cathepsin S Complexed With Compound 8
Length = 222
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 23 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 82
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 83 GIDSDASYPYKAMDQKCQ 100
>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
Length = 355
Score = 72.8 bits (177), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD++ + C GG + T ++
Sbjct: 154 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDSCDAGCSGGLMTTAFE 213
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y ++ G+ E+DYP G C
Sbjct: 214 YTLKAGGLQREKDYPYTGKXGKC 236
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 72.8 bits (177), Expect = 3e-11, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI +I T+ L+ +S Q+LVDCD+ C GG++E ++++I+N G
Sbjct: 143 GSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV--DHGCDGGYMEGGFEFIIKNGG 200
Query: 65 INTERDYPNVGVMDNC 80
I++E +YP V C
Sbjct: 201 ISSEANYPYTAVDGTC 216
>gi|46576373|sp|P83654.1|ERVC_TABDI RecName: Full=Ervatamin-C; Short=ERV-C
gi|46014979|pdb|1O0E|A Chain A, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
Protease Ervatamin C
gi|46014980|pdb|1O0E|B Chain B, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
Protease Ervatamin C
Length = 208
Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I++I T NL+ +S Q+LVDCD ++ C+GG YQY+I N G
Sbjct: 23 GSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDK--KNHGCLGGAFVFAYQYIINNGG 80
Query: 65 INTERDYPNVGVMDNCKV 82
I+T+ +YP V C+
Sbjct: 81 IDTQANYPYKAVQGPCQA 98
>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
Length = 348
Score = 72.8 bits (177), Expect = 3e-11, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 52/84 (61%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
GSCW F+ GA+EG + T L +S QQ+VDCD++ ESR+ C GG + T +
Sbjct: 134 GSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFS 193
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++++ G+ +E+DYP G + CK
Sbjct: 194 YLMKSGGLQSEKDYPYAGRENTCK 217
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 50/82 (60%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+ I TN LV +S Q+LVDCD E++ C GG + ++++ + G
Sbjct: 61 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTS-ENQGCNGGLMGYAFEFIKEKGG 119
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE+ YP C V + N
Sbjct: 120 ITTEQSYPYTAEDGTCDVSKVN 141
>gi|291398027|ref|XP_002715626.1| PREDICTED: cathepsin S [Oryctolagus cuniculus]
Length = 331
Score = 72.8 bits (177), Expect = 3e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 47/77 (61%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T NLV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 137 GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTTKYGNKGCNGGFMTEAFQYIIDNN 196
Query: 64 GINTERDYPNVGVMDNC 80
GI++E YP + C
Sbjct: 197 GIDSEASYPYKAMDQKC 213
>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
Length = 347
Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/90 (42%), Positives = 50/90 (55%), Gaps = 8/90 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN--------QGESRSCVGGFIETIY 56
GSCW F+ G +EG I LV +S QQLVDCD+ Q C GG + + +
Sbjct: 144 GSCWTFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGGLMWSAF 203
Query: 57 QYVIQNRGINTERDYPNVGVMDNCKVFQFN 86
QYVI+N G++TE YP GV D C+ + N
Sbjct: 204 QYVIKNGGLDTEDSYPYEGVDDTCRFNKSN 233
>gi|432114312|gb|ELK36240.1| Aryl hydrocarbon receptor nuclear translocator [Myotis davidii]
Length = 897
Score = 72.8 bits (177), Expect = 3e-11, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 49/76 (64%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 705 GSCWAFSSVGALEGQLMKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQRNRG 762
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 763 IDSEDAYPYVGQDESC 778
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG ++I L+ +S QQLVDCD C GG ++T +++++ G
Sbjct: 152 GCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCSGGLMDTAFEHIMATGG 209
Query: 65 INTERDYPNVGVMDNCKV 82
+ TE +YP G NCK+
Sbjct: 210 LTTESNYPYKGEDANCKI 227
>gi|293334313|ref|NP_001170085.1| hypothetical protein [Zea mays]
gi|224033359|gb|ACN35755.1| unknown [Zea mays]
gi|414589091|tpg|DAA39662.1| TPA: hypothetical protein ZEAMMB73_231678 [Zea mays]
Length = 385
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +EGI +I T LV +S Q+LVDCD + C GG +++ N G
Sbjct: 173 GSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDD--GCDGGISYRALRWIASNGG 230
Query: 65 INTERDYPNVGVMDNC 80
I TE DYP G D C
Sbjct: 231 ITTEADYPYTGTTDAC 246
>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 1471
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 33/70 (47%), Positives = 43/70 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEG TN LV++S QQLVDC + C GG + + ++YV N G
Sbjct: 170 GSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRDNEG 229
Query: 65 INTERDYPNV 74
I++E YP V
Sbjct: 230 IDSEISYPYV 239
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 31/68 (45%), Positives = 45/68 (66%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S Q+L+DCD S C GG ++ + ++ N G
Sbjct: 135 GSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNS-GCNGGLMDYAFAFIASNGG 193
Query: 65 INTERDYP 72
++ E DYP
Sbjct: 194 LHKEDDYP 201
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T NLV +S Q LVDC ++ C GG ++ +QYV N+G
Sbjct: 136 GSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCSRPQGNQGCNGGLMDFAFQYVKDNKG 195
Query: 65 INTERDYPNVGVMDNCK 81
+ E+ YP VG CK
Sbjct: 196 LEAEKSYPYVGKDGECK 212
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 46/77 (59%), Gaps = 2/77 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI LV +S QQL+DC E+ C GG + + Y+++N+G
Sbjct: 149 GCCWAFSAVAAVEGMTKIAKGELVSLSEQQLLDCST--ENDGCDGGIMWKAFDYIVENQG 206
Query: 65 INTERDYPNVGVMDNCK 81
I E +YP G C+
Sbjct: 207 ITAEDNYPYQGAQQTCE 223
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 47/82 (57%), Gaps = 1/82 (1%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
H CW F+ A+EGI +I T N V +S QQLVDC N + C G I+ Y+Y+ +
Sbjct: 133 HLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEK-CKAGEIDKAYEYIAR 191
Query: 62 NRGINTERDYPNVGVMDNCKVF 83
+ G+ ++DYP G C+V+
Sbjct: 192 SGGLVADQDYPYEGHSGTCRVY 213
>gi|315364646|pdb|3OVX|A Chain A, Cathepsin S In Complex With A Covalent Inhibitor With An
Aldehyde Warhead
gi|315364647|pdb|3OVX|B Chain B, Cathepsin S In Complex With A Covalent Inhibitor With An
Aldehyde Warhead
Length = 218
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 24 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 83
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 84 GIDSDASYPYKAMDQKCQ 101
>gi|300508731|pdb|3N3G|A Chain A, 4-(3-Trifluoromethylphenyl)-Pyrimidine-2-Carbonitrile As
Cathepsin S Inhibitors: N3, Not N1 Is Critically
Important
gi|300508732|pdb|3N3G|B Chain B, 4-(3-Trifluoromethylphenyl)-Pyrimidine-2-Carbonitrile As
Cathepsin S Inhibitors: N3, Not N1 Is Critically
Important
gi|327533626|pdb|3N4C|A Chain A, 6-Phenyl-1h-Imidazo[4,5-C]pyridine-4-Carbonitrile As
Cathepsin S Inhibitors
gi|327533627|pdb|3N4C|B Chain B, 6-Phenyl-1h-Imidazo[4,5-C]pyridine-4-Carbonitrile As
Cathepsin S Inhibitors
Length = 217
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 23 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 82
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 83 GIDSDASYPYKAMDQKCQ 100
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + + C GG ++ ++Y+ N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 65 INTERDYPNVGVMDNC 80
I+TE+ YP G+ D+C
Sbjct: 204 IDTEKSYPYEGIDDSC 219
>gi|93279887|pdb|2G6D|A Chain A, Human Cathepsin S Mutant With Vinyl Sulfone Inhibitor Cra-
14009
Length = 217
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 23 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 82
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 83 GIDSDASYPYKAMDQKCQ 100
>gi|114793879|pdb|2FYE|A Chain A, Mutant Human Cathepsin S With Irreversible Inhibitor Cra-
14013
Length = 217
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 23 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTKKYGNKGCNGGFMTTAFQYIIDNK 82
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 83 GIDSDASYPYKAMDQKCQ 100
>gi|328909405|gb|AEB61370.1| cathepsin S-like protein, partial [Equus caballus]
Length = 281
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-RSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T NLV +S Q LVDC + S + C GGF+ +QY+I N
Sbjct: 87 GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYIIDNN 146
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 147 GIDSDASYPYKAMDGKCR 164
>gi|145351119|ref|XP_001419933.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580166|gb|ABO98226.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 272
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 38/85 (44%), Positives = 49/85 (57%), Gaps = 9/85 (10%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD--------NQGESRSCVGGFIETIY 56
GSCW F+ GAIEG I T LV++S QQLVDCD N +S C GG
Sbjct: 66 GSCWTFSTTGAIEGAHFISTGKLVELSEQQLVDCDVGCDPDVPNACDS-GCNGGLPSNAM 124
Query: 57 QYVIQNRGINTERDYPNVGVMDNCK 81
+Y++++ GI+TE+ YP VG CK
Sbjct: 125 EYIVEHGGIDTEKSYPYVGEKGECK 149
>gi|93279396|pdb|2F1G|A Chain A, Cathepsin S In Complex With Non-Covalent
2-(Benzoxazol-2-Ylamino)- Acetamide
gi|93279397|pdb|2F1G|B Chain B, Cathepsin S In Complex With Non-Covalent
2-(Benzoxazol-2-Ylamino)- Acetamide
gi|114794366|pdb|2HH5|B Chain B, Crystal Structure Of Cathepsin S In Complex With A Zinc
Mediated Non-Covalent Arylaminoethyl Amide
gi|114794367|pdb|2HH5|A Chain A, Crystal Structure Of Cathepsin S In Complex With A Zinc
Mediated Non-Covalent Arylaminoethyl Amide
gi|118137884|pdb|2H7J|A Chain A, Crystal Structure Of Cathepsin S In Complex With A
Nonpeptidic Inhibitor.
gi|118137885|pdb|2H7J|B Chain B, Crystal Structure Of Cathepsin S In Complex With A
Nonpeptidic Inhibitor.
gi|118138002|pdb|2HXZ|A Chain A, Crystal Structure Of Cathepsin S In Complex With A
Nonpeptidic Inhibitor (hexagonal Spacegroup)
gi|118138003|pdb|2HXZ|B Chain B, Crystal Structure Of Cathepsin S In Complex With A
Nonpeptidic Inhibitor (hexagonal Spacegroup)
gi|118138004|pdb|2HXZ|C Chain C, Crystal Structure Of Cathepsin S In Complex With A
Nonpeptidic Inhibitor (hexagonal Spacegroup)
gi|149241966|pdb|2HHN|A Chain A, Cathepsin S In Complex With Non Covalent Arylaminoethyl
Amide.
gi|149241967|pdb|2HHN|B Chain B, Cathepsin S In Complex With Non Covalent Arylaminoethyl
Amide.
gi|149242657|pdb|2OP3|A Chain A, The Structure Of Cathepsin S With A Novel 2-
Arylphenoxyacetaldehyde Inhibitor Derived By The
Substrate Activity Screening (Sas) Method
gi|149242658|pdb|2OP3|B Chain B, The Structure Of Cathepsin S With A Novel 2-
Arylphenoxyacetaldehyde Inhibitor Derived By The
Substrate Activity Screening (Sas) Method
Length = 220
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 26 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 85
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 86 GIDSDASYPYKAMDQKCQ 103
>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase
GP-II
gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
Length = 221
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT +L+ +S QQLVDC + C GG++ +Q+++ N G
Sbjct: 25 GSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGG 82
Query: 65 INTERDYPNVG 75
IN+E YP G
Sbjct: 83 INSEETYPYRG 93
>gi|93279711|pdb|2FQ9|A Chain A, Cathepsin S With Nitrile Inhibitor
gi|93279712|pdb|2FQ9|B Chain B, Cathepsin S With Nitrile Inhibitor
gi|112490596|pdb|2FRA|A Chain A, Human Cathepsin S With Cra-27934, A Nitrile Inhibitor
gi|112490597|pdb|2FRA|B Chain B, Human Cathepsin S With Cra-27934, A Nitrile Inhibitor
gi|112490599|pdb|2FRQ|A Chain A, Human Cathepsin S With Inhibitor Cra-26871
gi|112490600|pdb|2FRQ|B Chain B, Human Cathepsin S With Inhibitor Cra-26871
gi|112490616|pdb|2FT2|A Chain A, Human Cathepsin S With Inhibitor Cra-29728
gi|112490617|pdb|2FT2|B Chain B, Human Cathepsin S With Inhibitor Cra-29728
gi|112490630|pdb|2FUD|A Chain A, Human Cathepsin S With Inhibitor Cra-27566
gi|112490631|pdb|2FUD|B Chain B, Human Cathepsin S With Inhibitor Cra-27566
gi|114793976|pdb|2G7Y|A Chain A, Human Cathepsin S With Inhibitor Cra-16981
gi|114793977|pdb|2G7Y|B Chain B, Human Cathepsin S With Inhibitor Cra-16981
Length = 225
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 24 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 83
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 84 GIDSDASYPYKAMDQKCQ 101
>gi|356984263|gb|AET43955.1| cathepsin L2, partial [Reishia clavigera]
Length = 278
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 30/77 (38%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LV +S Q LVDC + + C GG ++ ++Y+ +N+G
Sbjct: 107 GSCWAFSATGSLEGQHFKKTGTLVSLSEQNLVDCSKKEGNEGCEGGLMDQAFEYIKRNKG 166
Query: 65 INTERDYPNVGVMDNCK 81
I+TE+ YP V + C+
Sbjct: 167 IDTEQSYPYRAVDEKCR 183
>gi|3087790|emb|CAA75029.1| cathepsin L2 [Homo sapiens]
Length = 334
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC ++ C GGF+ +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V V + CK
Sbjct: 196 LDSEESYPYVAVDEICK 212
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 48/77 (62%), Gaps = 3/77 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +EGI+KIVT NL+++S Q+LVDCD S C GG+ T QYV N G
Sbjct: 157 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYV-ANNG 213
Query: 65 INTERDYPNVGVMDNCK 81
++T + YP C+
Sbjct: 214 VHTSKVYPYQAKQYKCR 230
>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
Length = 331
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYP 72
GI+++ YP
Sbjct: 197 GIDSDASYP 205
>gi|146386731|pdb|1VSN|A Chain A, Crystal Structure Of A Potent Small Molecule Inhibitor
Bound To Cathepsin K
Length = 215
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L++++ Q LVDC + E+ C GG++ +QYV +NRG
Sbjct: 23 GSCWAFSSVGALEGQLKKATGALLNLAPQNLVDCVS--ENDGCGGGYMTNAFQYVQRNRG 80
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 81 IDSEDAYPYVGQDESC 96
>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
Length = 371
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/84 (38%), Positives = 49/84 (58%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ G +EG + + T L+ ++ Q+LVDCD+ + + C GG + T Y+
Sbjct: 161 GSCWSFSTTGTLEGTNFLATGELLSLNEQELVDCDHLCDPKKAGACDAGCNGGLMTTAYE 220
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
YV+Q+ G+ E+DYP G CK
Sbjct: 221 YVLQSGGLEKEKDYPYTGRDGTCK 244
>gi|23110960|ref|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens]
gi|320118898|ref|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens]
gi|12644075|sp|O60911.2|CATL2_HUMAN RecName: Full=Cathepsin L2; AltName: Full=Cathepsin U; AltName:
Full=Cathepsin V; Flags: Precursor
gi|3107915|dbj|BAA25909.1| cathepsin V [Homo sapiens]
gi|3228672|gb|AAC23598.1| cathepsin U [Homo sapiens]
gi|3869129|dbj|BAA34365.1| cathepsin L2 [Homo sapiens]
gi|23958123|gb|AAH23504.1| CTSL2 protein [Homo sapiens]
gi|37182404|gb|AAQ89004.1| cathepsin L2 [Homo sapiens]
gi|83405150|gb|AAI10513.1| Cathepsin L2 [Homo sapiens]
gi|119579235|gb|EAW58831.1| cathepsin L2, isoform CRA_a [Homo sapiens]
gi|119579236|gb|EAW58832.1| cathepsin L2, isoform CRA_a [Homo sapiens]
Length = 334
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC ++ C GGF+ +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V V + CK
Sbjct: 196 LDSEESYPYVAVDEICK 212
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 47/77 (61%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I T L+ +S Q+LVDCD E + C GG ++ +++ I+ G
Sbjct: 145 GSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSEDQGCQGGLMDDAFKF-IEQHG 203
Query: 65 INTERDYPNVGVMDNCK 81
+ +E YP CK
Sbjct: 204 LASEATYPYDAADSTCK 220
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 48/77 (62%), Gaps = 3/77 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +EGI+KIVT NL+++S Q+LVDCD S C GG+ T QYV N G
Sbjct: 157 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYV-ANNG 213
Query: 65 INTERDYPNVGVMDNCK 81
++T + YP C+
Sbjct: 214 VHTSKVYPYQAKQYKCR 230
>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
Length = 363
Score = 72.4 bits (176), Expect = 4e-11, Method: Composition-based stats.
Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ--GESRS-----CVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD++ E +S C GG + T ++
Sbjct: 152 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEKSECDAGCNGGLMTTAFE 211
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y ++ G+ E+DYP G C
Sbjct: 212 YTLKAGGLQREKDYPYTGRDGKC 234
>gi|37905511|gb|AAO64477.1| cathepsin S precursor [Fundulus heteroclitus]
Length = 337
Score = 72.4 bits (176), Expect = 4e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T L ++S Q LVDC + + C GGF+ +QYVI N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKKTGKLQNLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVIDNQG 203
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP G C+
Sbjct: 204 IDSEDSYPYRGRDQQCQ 220
>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
Length = 588
Score = 72.4 bits (176), Expect = 4e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + ++ C GGF+ +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNNAFQYVKENGG 195
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V +CK
Sbjct: 196 LDSEASYPYVAKDGSCK 212
>gi|15826035|pdb|1FH0|A Chain A, Crystal Structure Of Human Cathepsin V Complexed With An
Irreversible Vinyl Sulfone Inhibitor
gi|15826036|pdb|1FH0|B Chain B, Crystal Structure Of Human Cathepsin V Complexed With An
Irreversible Vinyl Sulfone Inhibitor
Length = 221
Score = 72.4 bits (176), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC ++ C GGF+ +QYV +N G
Sbjct: 23 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 82
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V V + CK
Sbjct: 83 LDSEESYPYVAVDEICK 99
>gi|119573902|gb|EAW53517.1| cathepsin S, isoform CRA_a [Homo sapiens]
Length = 220
Score = 72.4 bits (176), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYP 72
GI+++ YP
Sbjct: 197 GIDSDASYP 205
>gi|115457680|ref|NP_001052440.1| Os04g0311400 [Oryza sativa Japonica Group]
gi|113564011|dbj|BAF14354.1| Os04g0311400, partial [Oryza sativa Japonica Group]
Length = 384
Score = 72.4 bits (176), Expect = 4e-11, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 52/84 (61%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
GSCW F+ GA+EG + T L +S QQ+VDCD++ ESR+ C GG + T +
Sbjct: 170 GSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFS 229
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++++ G+ +E+DYP G + CK
Sbjct: 230 YLMKSGGLQSEKDYPYAGRENTCK 253
>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
Length = 354
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 44/76 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + + C GG ++ ++Y+ +N G
Sbjct: 159 GSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENHG 218
Query: 65 INTERDYPNVGVMDNC 80
++TE YP VG C
Sbjct: 219 VDTEDSYPYVGRETKC 234
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 44/76 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + + C GG ++ ++Y+ +N G
Sbjct: 160 GSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENHG 219
Query: 65 INTERDYPNVGVMDNC 80
++TE YP VG C
Sbjct: 220 VDTEDSYPYVGRETKC 235
>gi|30749675|pdb|1NPZ|A Chain A, Crystal Structures Of Cathepsin S Inhibitor Complexes
gi|30749676|pdb|1NPZ|B Chain B, Crystal Structures Of Cathepsin S Inhibitor Complexes
gi|30749688|pdb|1NQC|A Chain A, Crystal Structures Of Cathepsin S Inhibitor Complexes
Length = 217
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 23 GACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 82
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 83 GIDSDASYPYKAMDQKCQ 100
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 46/76 (60%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +IE + T LV +S QQL+DCD C GG +ET +++V++N G
Sbjct: 149 GSCWAFSAIASIESAHFLATKELVSLSEQQLMDCDTV--DAGCDGGLMETAFKFVVKNGG 206
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP G + +C
Sbjct: 207 VTTEASYPYTGSVGSC 222
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 47/68 (69%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I T L+ +S Q+L+DCD E+ C GG ++ + ++ +N G
Sbjct: 153 GSCWAFSTVAAVEGINQIKTKKLLSLSEQELIDCDTD-ENNGCNGGLMDYAFDFIKKNGG 211
Query: 65 INTERDYP 72
I++E +YP
Sbjct: 212 ISSEAEYP 219
>gi|410904751|ref|XP_003965855.1| PREDICTED: cathepsin K-like [Takifugu rubripes]
Length = 331
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 47/77 (61%), Gaps = 2/77 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG+ T LVD+S Q LVDC E+ C GG++ ++YV NRG
Sbjct: 138 GSCWAFSSAGALEGMQAKKTGKLVDLSPQNLVDCVK--ENDGCGGGYMTNAFRYVATNRG 195
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP V +C+
Sbjct: 196 IDSEASYPYVAQEQSCQ 212
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 46/68 (67%), Gaps = 2/68 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA + +EGI++IVT +L+ +S QQLVDC + + C GG+ +QY+I N G
Sbjct: 156 GSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTR--NHGCEGGWPYRAFQYIINNGG 213
Query: 65 INTERDYP 72
+N+E YP
Sbjct: 214 VNSEEHYP 221
>gi|426331346|ref|XP_004026643.1| PREDICTED: cathepsin S [Gorilla gorilla gorilla]
Length = 220
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYP 72
GI+++ YP
Sbjct: 197 GIDSDASYP 205
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 46/78 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + +EGI+KIVT L+ +S Q+L+DC +R C G +I + ++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGSYITDGFPFIINNGG 208
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C V
Sbjct: 209 INTEENYPYTAQDGECNV 226
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 48/77 (62%), Gaps = 3/77 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +EGI+KIVT NL+++S Q+LVDCD S C GG+ T QYV N G
Sbjct: 157 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYV-ANNG 213
Query: 65 INTERDYPNVGVMDNCK 81
++T + YP C+
Sbjct: 214 VHTSKVYPCQAKQYKCR 230
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 47/68 (69%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EGI++I T L+ +S Q+L+DCD + C GG + ++++I N G
Sbjct: 98 GSCWAFSAVGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGG 157
Query: 65 INTERDYP 72
I +++DYP
Sbjct: 158 IESDQDYP 165
>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
Length = 388
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/67 (47%), Positives = 46/67 (68%), Gaps = 1/67 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EGI+ I T LV +S QQLVDCD++ + C GG ++ + Y+ +N G
Sbjct: 154 GSCWAFSATGAVEGINAIRTGKLVSLSEQQLVDCDSE-KDLGCGGGLMDFAFDYITKNGG 212
Query: 65 INTERDY 71
I++E DY
Sbjct: 213 IDSEDDY 219
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 46/76 (60%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S QQLVDC G + C GG ++ + ++ G
Sbjct: 180 GSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNN-GCSGGVMDNAFSFIATGAG 238
Query: 65 INTERDYPNVGVMDNC 80
+ +E YP + +C
Sbjct: 239 LRSEEAYPYLMEEGDC 254
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 47/77 (61%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++EGI+ I T LV +S Q+L+DCD ++ C GG ++ ++Y+ +N G
Sbjct: 160 GSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDT-ADNDGCEGGLMDNAFEYIKKNGG 218
Query: 65 INTERDYPNVGVMDNCK 81
+ TE YP CK
Sbjct: 219 LTTEAAYPYRAANGTCK 235
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T L+ +S Q LVDC + + C GG ++ ++Y+ N G
Sbjct: 145 GSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 204
Query: 65 INTERDYPNVGVMDNC 80
I+TE+ YP G+ D+C
Sbjct: 205 IDTEKSYPYEGIDDSC 220
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 146 GSCWAFSAVGALEAQLKLKTGKLVSLSVQNLVDCSTGKYSNKGCNGGFMTEAFQYIIDNN 205
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 206 GIDSEASYPYKAMDGKCQ 223
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 44/77 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +EG T LV +S Q LVDC ++ C GG ++ +QYVI+N G
Sbjct: 130 GSCWAFSAVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGG 189
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP V CK
Sbjct: 190 IDTEASYPYKAVDQKCK 206
>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
[Glycine max]
Length = 374
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
GSCW F+ G+IEG + + T LV +S QQL+DCDN+ E C GG + Y
Sbjct: 157 GSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCEITEKTSCDNGCNGGLMTNAYN 216
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++++ G+ E YP G CK
Sbjct: 217 YLLESGGLEEESSYPYTGERGECK 240
>gi|3929823|emb|CAA77184.1| cathepsin S [Mus musculus]
Length = 163
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
GSCW F+ VGA+EG K+ T L+ +S Q LVDC N+ + ++ C GG++ +QY+I N
Sbjct: 2 GSCWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 61
Query: 63 RGINTERDYPNVGVMDNC 80
GI + YP + C
Sbjct: 62 GGIEADASYPYKATDEKC 79
>gi|297663703|ref|XP_002810310.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin S [Pongo abelii]
Length = 330
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYP 72
GI+++ YP
Sbjct: 197 GIDSDASYP 205
>gi|56758090|gb|AAW27185.1| SJCHGC06231 protein [Schistosoma japonicum]
Length = 372
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/70 (45%), Positives = 44/70 (62%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEG TN LV++S QQL+DC + C GG ++ +QYV N+G
Sbjct: 172 GSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNKG 231
Query: 65 INTERDYPNV 74
I++E YP +
Sbjct: 232 IDSEISYPYI 241
>gi|359492709|ref|XP_002280798.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|147841854|emb|CAN73591.1| hypothetical protein VITISV_022889 [Vitis vinifera]
gi|302142582|emb|CBI19785.3| unnamed protein product [Vitis vinifera]
Length = 371
Score = 72.0 bits (175), Expect = 4e-11, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 46/78 (58%), Gaps = 7/78 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ +GA+EG + T NLV +STQQL+DCD + + C GG + ++
Sbjct: 161 GSCWSFSTIGALEGAHFLATGNLVSLSTQQLLDCDTECDPEEYDACDDGCNGGLMNNAFE 220
Query: 58 YVIQNRGINTERDYPNVG 75
Y+++ G+ E DYP G
Sbjct: 221 YILKAGGVAQEEDYPYTG 238
>gi|300175245|emb|CBK20556.2| unnamed protein product [Blastocystis hominis]
Length = 325
Score = 72.0 bits (175), Expect = 5e-11, Method: Composition-based stats.
Identities = 36/81 (44%), Positives = 46/81 (56%), Gaps = 3/81 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW FA V +IEG+ T ++D S QQLVDCD S C GG + Y+YV+ N G
Sbjct: 134 GACWAFAAVASIEGVYAQKTGKILDFSPQQLVDCDYS--SLGCSGGLMTYAYEYVMNN-G 190
Query: 65 INTERDYPNVGVMDNCKVFQF 85
I+ E DYP +CK F
Sbjct: 191 ISLESDYPYKASQGSCKKVDF 211
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 72.0 bits (175), Expect = 5e-11, Method: Composition-based stats.
Identities = 31/68 (45%), Positives = 44/68 (64%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+ IVT NL +S Q+L+DC G S C GG ++ + Y+ + G
Sbjct: 157 GSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNS-GCNGGLMDYAFSYIASSGG 215
Query: 65 INTERDYP 72
++TE YP
Sbjct: 216 LHTEEAYP 223
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 72.0 bits (175), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG+ T LV +S Q L+DC + + C GG ++ +QYV N G
Sbjct: 147 GSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTEEGNNGCNGGLMDQAFQYVRINGG 206
Query: 65 INTERDYPNVGVMDNCK 81
I+TER YP G D C+
Sbjct: 207 IDTERSYPYEGNNDVCR 223
>gi|328870624|gb|EGG18997.1| cysteine proteinase [Dictyostelium fasciculatum]
Length = 521
Score = 72.0 bits (175), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 44/76 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G+ EG + T NLV +S Q LVDC + C GG ++ + Y+I+N+G
Sbjct: 141 GSCWSFSTTGSTEGAHFLSTGNLVSLSEQNLVDCSGPEGNDGCNGGLMDQAFTYIIKNKG 200
Query: 65 INTERDYPNVGVMDNC 80
I+TE YP V C
Sbjct: 201 IDTESSYPYKAVQGKC 216
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 72.0 bits (175), Expect = 5e-11, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 49/77 (63%), Gaps = 1/77 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +IEGI +I T LV +S Q+L+DC +G S C GG++E ++++ + G
Sbjct: 147 GSCWAFSTVASIEGIHQITTGELVSLSEQELIDC-VRGNSSGCSGGYLEDAFKFIAKKGG 205
Query: 65 INTERDYPNVGVMDNCK 81
+ +E +YP + CK
Sbjct: 206 MASETNYPYKETDEKCK 222
>gi|359811751|emb|CCE67159.1| cysteine peptidase, partial [Vasconcellea quercifolia]
Length = 211
Score = 72.0 bits (175), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 50/80 (62%), Gaps = 3/80 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT L+ +S Q+L+DC+ + S C GGF QYV QN G
Sbjct: 21 GSCWTFSSVAAVEGINKIVTGQLLWLSEQELLDCERR--SYGCRGGFPPYALQYVAQN-G 77
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+ + YP GV C+ Q
Sbjct: 78 IHLRQYYPYEGVQRQCRASQ 97
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 72.0 bits (175), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 48/75 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EGI +I + NLV +S Q+LVD + C GG++ +++V++N G
Sbjct: 143 GSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVRSNWTNGCNGGYLIDAFEFVLENGG 202
Query: 65 INTERDYPNVGVMDN 79
I TE YP GV N
Sbjct: 203 IATEASYPYRGVKGN 217
>gi|1619903|gb|AAB16996.1| thiol protease isoform B, partial [Glycine max]
Length = 319
Score = 72.0 bits (175), Expect = 5e-11, Method: Composition-based stats.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD+ + C GG + ++
Sbjct: 110 GSCWSFSTTGALEGAYYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE 169
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++Q+ G+ E+DYP G CK
Sbjct: 170 YILQSGGVQKEKDYPYTGRDGTCK 193
>gi|334324657|ref|XP_003340546.1| PREDICTED: cathepsin S-like isoform 2 [Monodelphis domestica]
Length = 281
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC ++ ++ C GGF+ + +QYVI N
Sbjct: 87 GSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYVIDNN 146
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP C+
Sbjct: 147 GIDSDVSYPYKATDGKCQ 164
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 48/77 (62%), Gaps = 3/77 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +EG++KIVT NL+++S Q+LVDCD S C GG+ T QYV N G
Sbjct: 157 GSCWAFSTIATVEGVNKIVTGNLLELSEQELVDCDKN--SHGCKGGYQTTSLQYVADN-G 213
Query: 65 INTERDYPNVGVMDNCK 81
++T + YP C+
Sbjct: 214 VHTSKVYPYQAKAMQCR 230
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +EGI +I T LV +S Q+LVDCD + C GG +++ N G
Sbjct: 178 GSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDD--GCDGGISYRALRWIASNGG 235
Query: 65 INTERDYPNVGVMDNC 80
I TE DYP G D C
Sbjct: 236 ITTETDYPYTGTTDAC 251
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 46/76 (60%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +IE + T LV +S QQL+DCD C GG +ET +++V++N G
Sbjct: 145 GSCWAFSAIASIESAHFLATKELVSLSEQQLMDCDTV--DAGCDGGLMETAFKFVVKNGG 202
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP G + +C
Sbjct: 203 VTTEAAYPYTGSVGSC 218
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 46/76 (60%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT NL +S QQLVDC G + C GG ++ + ++ G
Sbjct: 194 GSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNN-GCSGGVMDNAFSFIATGAG 252
Query: 65 INTERDYPNVGVMDNC 80
+ +E YP + +C
Sbjct: 253 LRSEEAYPYLMEEGDC 268
>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
Length = 397
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 35/84 (41%), Positives = 46/84 (54%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GAIEG + I T L+ +S QQLVDCD+ + + C GG + T +
Sbjct: 190 GSCWAFSTTGAIEGANFIATGKLLSLSEQQLVDCDHMCDLKEKDDCDDGCSGGLMTTAFN 249
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y+I+ GI E YP G CK
Sbjct: 250 YLIEAGGIEEEVTYPYTGKRGECK 273
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 44/77 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LV +S Q LVDC ++ C GG ++ +QY+I N+G
Sbjct: 140 GSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKG 199
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP CK
Sbjct: 200 IDTEASYPYTAKDGTCK 216
>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LVD+S QQLVDC ++ C GG ++ +QY+ N G
Sbjct: 136 GSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYITANGG 195
Query: 65 INTERDYPNVGVMDN-CK 81
++TE YP D CK
Sbjct: 196 LDTEESYPYTATDDEPCK 213
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI +I T L+ +S Q+LVDCD+ C GG +E ++++I+N G
Sbjct: 149 GSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV--DHGCDGGLMEDGFEFIIKNGG 206
Query: 65 INTERDYPNVGVMDNC 80
I++E +YP V C
Sbjct: 207 ISSEANYPYTAVDGTC 222
>gi|351629617|gb|AEQ54772.1| KDEL-tailed cysteine proteinase CP4, partial [Coffea canephora]
Length = 215
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 49/82 (59%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +EGI+KI T LV +S Q+LVDC+ E C GG +E Y+++ ++ G
Sbjct: 4 GSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETDNE--GCNGGLMENAYEFIKKSGG 61
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TER YP +C + N
Sbjct: 62 ITTERLYPYKARDGSCDSSKMN 83
>gi|344275472|ref|XP_003409536.1| PREDICTED: cathepsin S-like isoform 2 [Loxodonta africana]
Length = 281
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-RSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + S + C GGF+ +QY+I N
Sbjct: 87 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTRAFQYIIDNN 146
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP C+
Sbjct: 147 GIDSEASYPYKATDGKCQ 164
>gi|281208825|gb|EFA83000.1| cysteine proteinase [Polysphondylium pallidum PN500]
Length = 531
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 45/68 (66%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G+IEG+ ++ T NLV +S Q L+DC ++ C GG + ++YVI+N G
Sbjct: 116 GSCWSFSTTGSIEGVHELQTGNLVALSEQNLIDCSVAEGNQGCNGGLMPNAFEYVIKNGG 175
Query: 65 INTERDYP 72
I+TE YP
Sbjct: 176 IDTEASYP 183
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +EGI +I T LV +S Q+LVDCD + C GG +++ N G
Sbjct: 178 GSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDD--GCDGGISYRALRWIASNGG 235
Query: 65 INTERDYPNVGVMDNC 80
I TE DYP G D C
Sbjct: 236 ITTEADYPYTGTTDAC 251
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 3/84 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW FA + +EGI KI T LV +S Q+++DC S C GG++ Y ++I
Sbjct: 142 NPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDC---AVSYGCKGGWVNKAYDFIIS 198
Query: 62 NRGINTERDYPNVGVMDNCKVFQF 85
N G+ TE +YP + C F
Sbjct: 199 NNGVTTEENYPYLAYQGTCNANSF 222
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++I TN LV +S Q+LVDCD +++ C GG ++ ++++ Q G
Sbjct: 24 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGG 82
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP C V + N
Sbjct: 83 ITTEANYPYEAYDGTCDVSKEN 104
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 31/71 (43%), Positives = 47/71 (66%), Gaps = 1/71 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+KI T LV +S QQL+DCDN+ + C GG +ET + ++ + G
Sbjct: 147 GSCWSFSAVATVEDINKIKTGKLVSLSEQQLIDCDNRNGNEGCNGGHMET-FTFITKRGG 205
Query: 65 INTERDYPNVG 75
+ T+++YP G
Sbjct: 206 LTTDKNYPYQG 216
>gi|67968401|dbj|BAE00562.1| unnamed protein product [Macaca fascicularis]
Length = 433
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + ++ C GGF+ + ++YV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSAFRYVKENGG 195
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V + CK
Sbjct: 196 LDSEESYPYVAMDGICK 212
>gi|444515096|gb|ELV10758.1| Cathepsin S [Tupaia chinensis]
Length = 240
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC Q ++ C GGF+ +QY+I N
Sbjct: 46 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCATIQYGNKGCNGGFMTRAFQYIIDNN 105
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 106 GIDSEASYPYKATDEKCQ 123
>gi|31982433|ref|NP_031828.2| cathepsin K precursor [Mus musculus]
gi|12644320|sp|P55097.2|CATK_MOUSE RecName: Full=Cathepsin K; Flags: Precursor
gi|3550487|emb|CAA06825.1| cathepsin K [Mus musculus]
gi|12834090|dbj|BAB22783.1| unnamed protein product [Mus musculus]
gi|28277388|gb|AAH46320.1| Cathepsin K [Mus musculus]
gi|74209960|dbj|BAE21279.1| unnamed protein product [Mus musculus]
gi|148706870|gb|EDL38817.1| cathepsin K, isoform CRA_a [Mus musculus]
Length = 329
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG K T L+ +S Q LVDC E+ C GG++ T +QYV QN G
Sbjct: 137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVT--ENYGCGGGYMTTAFQYVQQNGG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210
>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 36/84 (42%), Positives = 45/84 (53%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + + LV +S QQLVDCD+Q + C GGF+ YQ
Sbjct: 161 GSCWAFSTTGAVEGAHFLNSGKLVSLSEQQLVDCDHQCDREEADACDAGCNGGFMTNAYQ 220
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
YV G+ E DYP G CK
Sbjct: 221 YVEAAGGLELESDYPYEGRDGKCK 244
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI +I T NLV +S Q+LVDCD+ C GG +E ++++I+N G
Sbjct: 148 GICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSV--DHGCDGGLMEHGFEFIIKNGG 205
Query: 65 INTERDYPNVGVMDNC 80
I++E +YP V C
Sbjct: 206 ISSEANYPYTAVNGTC 221
>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
Length = 331
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGES-RSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + S + C GGF+ +QY+I N
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTRAFQYIIDNN 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP C+
Sbjct: 197 GIDSEASYPYKATDGKCQ 214
>gi|225706914|gb|ACO09303.1| Cathepsin H precursor [Osmerus mordax]
Length = 328
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 31/79 (39%), Positives = 43/79 (54%)
Query: 3 PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
P GSCW F+ G +E ++ I T L+ +S QQLVDC + C GG ++Y+ N
Sbjct: 132 PCGSCWTFSTTGCLESVTAISTGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAFEYIKYN 191
Query: 63 RGINTERDYPNVGVMDNCK 81
+G+ TE DYP CK
Sbjct: 192 KGLMTEDDYPYTAQDGTCK 210
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG +KI L+ +S QQLVDCD C GG ++T +++++ G
Sbjct: 153 GCCWAFSAVAAIEGATKIKKGKLISLSEQQLVDCDTN--DFGCSGGLMDTAFEHIMATGG 210
Query: 65 INTERDYPNVGVMDNCKV 82
+ TE +YP G CK+
Sbjct: 211 LTTESNYPYKGKDATCKI 228
>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 370
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 33/70 (47%), Positives = 43/70 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEG TN LV++S QQLVDC + C GG + + ++YV N G
Sbjct: 170 GSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRDNEG 229
Query: 65 INTERDYPNV 74
I++E YP V
Sbjct: 230 IDSEISYPYV 239
>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
Length = 352
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 43/76 (56%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + + C GG ++ ++Y+ N G
Sbjct: 157 GSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHG 216
Query: 65 INTERDYPNVGVMDNC 80
I+TE YP VG C
Sbjct: 217 IDTEEGYPYVGKEMRC 232
>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
Length = 347
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 43/76 (56%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + + C GG ++ ++Y+ N G
Sbjct: 152 GSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHG 211
Query: 65 INTERDYPNVGVMDNC 80
I+TE YP VG C
Sbjct: 212 IDTEEGYPYVGKEMRC 227
>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
Length = 347
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 43/76 (56%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + + C GG ++ ++Y+ N G
Sbjct: 152 GSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHG 211
Query: 65 INTERDYPNVGVMDNC 80
I+TE YP VG C
Sbjct: 212 IDTEEGYPYVGKEMRC 227
>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 30/76 (39%), Positives = 43/76 (56%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG + +V +S Q LVDC + + C GG ++ ++Y+ N G
Sbjct: 159 GSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHG 218
Query: 65 INTERDYPNVGVMDNC 80
I+TE YP VG C
Sbjct: 219 IDTEESYPYVGRETKC 234
>gi|229596403|ref|XP_001009843.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225565321|gb|EAR89598.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 324
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 44/76 (57%), Gaps = 3/76 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VG +EG I T NL S QQ+VDC + C GG + Y+YV+QN G
Sbjct: 137 GSCWAFSTVGGLEGAYAIATGNLTSFSEQQIVDCSK--ANAGCNGGDLPPAYKYVVQN-G 193
Query: 65 INTERDYPNVGVMDNC 80
I TE DYP GV C
Sbjct: 194 IETEADYPYKGVNQKC 209
>gi|13928758|ref|NP_113748.1| cathepsin K precursor [Rattus norvegicus]
gi|12585195|sp|O35186.1|CATK_RAT RecName: Full=Cathepsin K; Flags: Precursor
gi|2305208|gb|AAB65743.1| cathepsin K [Rattus norvegicus]
gi|50927597|gb|AAH78793.1| Cathepsin K [Rattus norvegicus]
gi|149030667|gb|EDL85704.1| cathepsin K, isoform CRA_a [Rattus norvegicus]
Length = 329
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG K T L+ +S Q LVDC E+ C GG++ T +QYV QN G
Sbjct: 137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDC--VSENYGCGGGYMTTAFQYVQQNGG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210
>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LVD+S QQLVDC ++ C GG ++ +QY+ N G
Sbjct: 136 GSCWAFSTTGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 195
Query: 65 INTERDYPNVGVMDN-CK 81
++TE YP D CK
Sbjct: 196 LDTEESYPYTATDDKPCK 213
>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
Length = 331
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC ++ ++ C GGF+ + +QYVI N
Sbjct: 137 GSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYVIDNN 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP C+
Sbjct: 197 GIDSDVSYPYKATDGKCQ 214
>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
Length = 381
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 33/84 (39%), Positives = 48/84 (57%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN------QGE-SRSCVGGFIETIYQ 57
GSCW F+ GA+EG + + T LVD+S QQLVDCD+ Q E + C GG + Y
Sbjct: 172 GSCWAFSTTGAVEGANFLATGELVDLSEQQLVDCDHTCSAVAQNECNNGCAGGLMTNAYS 231
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++++ G+ + YP G C+
Sbjct: 232 YLMESGGLMEQSAYPYTGAAGPCR 255
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 49/76 (64%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ V A EGI +I T NLV +S ++LVDCD+ C GG +E ++++I+N G
Sbjct: 148 GNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSV--DHGCDGGLMEHGFEFIIKNGG 205
Query: 65 INTERDYPNVGVMDNC 80
I++E +YP V C
Sbjct: 206 ISSEANYPYTAVNGTC 221
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 71.2 bits (173), Expect = 6e-11, Method: Composition-based stats.
Identities = 30/71 (42%), Positives = 44/71 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EGI+K+ LV +S Q+LVDCD G + C GG +E +Q++ + +G
Sbjct: 164 GCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFIEKRKG 223
Query: 65 INTERDYPNVG 75
+ E YP G
Sbjct: 224 LAAESVYPYTG 234
>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
Length = 318
Score = 71.2 bits (173), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 49/80 (61%), Gaps = 3/80 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT LV +S Q+L+DC+ + S C GGF QYV N G
Sbjct: 157 GSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR--SYGCRGGFPPYALQYV-ANSG 213
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+ + YP GV C+ Q
Sbjct: 214 IHLRQYYPYEGVQRQCRAAQ 233
>gi|330842502|ref|XP_003293216.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
gi|325076482|gb|EGC30264.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
Length = 376
Score = 71.2 bits (173), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 43/68 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG +I T LV +S Q LVDC + C GG ++ + Y+IQN+G
Sbjct: 144 GSCWSFSTTGSVEGAHQIKTGKLVSLSEQNLVDCSGAEGNLGCDGGLMDNAFIYIIQNKG 203
Query: 65 INTERDYP 72
I+TE YP
Sbjct: 204 IDTESSYP 211
>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 291
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 44/68 (64%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I T NL +S QQLVDCD + + C GG ++ +QY+ ++ G
Sbjct: 83 GSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCDTKSNA-GCNGGLMDYAFQYIAKHGG 141
Query: 65 INTERDYP 72
+ E YP
Sbjct: 142 VAAEDAYP 149
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 46/78 (58%), Gaps = 7/78 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ VGA+EG + T LV +S QQLVDCD++ + C GG + T ++
Sbjct: 156 GSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDHECDPEERGACDSGCNGGLMTTAFE 215
Query: 58 YVIQNRGINTERDYPNVG 75
Y +Q G+ E+DYP G
Sbjct: 216 YTLQAGGLMREKDYPYTG 233
>gi|348513249|ref|XP_003444155.1| PREDICTED: cathepsin K-like [Oreochromis niloticus]
Length = 330
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 43/68 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EG K T LV +S Q LVDC Q + C GG+I Y YVI+N G
Sbjct: 136 GSCWAFSSLGALEGQLKKRTGTLVSLSPQNLVDCSTQDGNLGCRGGYITKAYSYVIRNGG 195
Query: 65 INTERDYP 72
+++E YP
Sbjct: 196 VDSESFYP 203
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 30/76 (39%), Positives = 43/76 (56%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG + +V +S Q LVDC + + C GG ++ ++Y+ N G
Sbjct: 159 GSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHG 218
Query: 65 INTERDYPNVGVMDNC 80
I+TE YP VG C
Sbjct: 219 IDTEESYPYVGRETKC 234
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG + LV +S Q L+DC Q + C GG ++ ++Y+ N G
Sbjct: 146 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGG 205
Query: 65 INTERDYPNVGVMDNCK 81
I+TE+ YP GV D C+
Sbjct: 206 IDTEQTYPYEGVDDKCR 222
>gi|27462834|gb|AAO15606.1| cathepsin L-like protease [Sarcoptes scabiei type hominis]
Length = 245
Score = 71.2 bits (173), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 53/82 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++E + + T LV++S Q+LVDC + C GG++++ +++VI+ G
Sbjct: 142 GSCWAFSAVASMESQNALKTGQLVELSEQELVDCSVGEGNEGCDGGWMDSAFEFVIKADG 201
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE+ YP GV C+ +Q N
Sbjct: 202 IDTEKSYPYHGVNQVCRSYQKN 223
>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
Length = 318
Score = 71.2 bits (173), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 49/80 (61%), Gaps = 3/80 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT LV +S Q+L+DC+ + S C GGF QYV N G
Sbjct: 157 GSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR--SYGCRGGFPPYALQYV-ANSG 213
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+ + YP GV C+ Q
Sbjct: 214 IHLRQYYPYEGVQRQCRAAQ 233
>gi|308808478|ref|XP_003081549.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116060014|emb|CAL56073.1| Cysteine proteinase Cathepsin F (ISS), partial [Ostreococcus tauri]
Length = 293
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 37/85 (43%), Positives = 49/85 (57%), Gaps = 9/85 (10%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD--------NQGESRSCVGGFIETIY 56
GSCW F+ GAIEG I T LV++S QQL+DCD N +S C GG
Sbjct: 87 GSCWTFSTTGAIEGAHFISTGKLVELSEQQLLDCDVGCDPDVPNACDS-GCNGGLPSNAM 145
Query: 57 QYVIQNRGINTERDYPNVGVMDNCK 81
+Y++++ GI+TE+ YP VG CK
Sbjct: 146 EYIVEHGGIDTEKSYPYVGEKGECK 170
>gi|4469157|emb|CAB38316.1| chymopapain isoform IV [Carica papaya]
Length = 226
Score = 71.2 bits (173), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 48/78 (61%), Gaps = 3/78 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +EGI+KIVT NL+++S Q+LVDCD S C GG+ T QYV N G
Sbjct: 22 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDRH--SYGCKGGYQTTSLQYVANN-G 78
Query: 65 INTERDYPNVGVMDNCKV 82
++T + YP C+
Sbjct: 79 VHTSKVYPYQAKQYKCRA 96
>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 381
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 35/84 (41%), Positives = 49/84 (58%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQ---GESRS----CVGGFIETIYQ 57
GSCW F+ GA+EG + + T L +S QQLVDCD++ E R+ C GG + T +
Sbjct: 169 GSCWSFSTSGALEGANYLATGKLEVLSEQQLVDCDHECDPSEPRACDAGCNGGLMTTAFS 228
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y+ + G+ TE+DYP G CK
Sbjct: 229 YLAKAGGLETEKDYPYTGRNSACK 252
>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
Length = 331
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 137 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 197 GIDSEASYPYKAMDGKCQ 214
>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 31/81 (38%), Positives = 49/81 (60%), Gaps = 1/81 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQ-LVDCDNQGESRSCVGGFIETIYQYVIQNR 63
G W + V A EGI + L+ +S++Q LVDCD +G + C GG ++ ++++IQN
Sbjct: 133 GCFWALSAVAATEGIHALXAGKLILLSSEQELVDCDTKGVDQDCQGGLMDDAFKFIIQNH 192
Query: 64 GINTERDYPNVGVMDNCKVFQ 84
G+NTE +YP GV C ++
Sbjct: 193 GLNTEANYPYKGVDGKCNAYE 213
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 31/68 (45%), Positives = 44/68 (64%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+ IVT NL +S Q+L+DC G S C GG ++ + Y+ + G
Sbjct: 152 GSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNS-GCNGGMMDYAFSYIASSGG 210
Query: 65 INTERDYP 72
++TE YP
Sbjct: 211 LHTEEAYP 218
>gi|397499865|ref|XP_003820654.1| PREDICTED: cathepsin L2 isoform 1 [Pan paniscus]
gi|397499867|ref|XP_003820655.1| PREDICTED: cathepsin L2 isoform 2 [Pan paniscus]
Length = 334
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC ++ C GGF+ +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V + + CK
Sbjct: 196 LDSEESYPYVAMDEICK 212
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 29/76 (38%), Positives = 44/76 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V ++EG+ K+ T LV +S Q+LVDCD G + C GG ++ + +++ N G
Sbjct: 156 GCCWAFSAVASMEGVVKLSTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGNGG 215
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP C
Sbjct: 216 LTTESRYPYTASDGTC 231
>gi|19195|emb|CAA78403.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 361
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD++ + C GG + T ++
Sbjct: 150 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPVEKNDCDAGCNGGLMTTAFE 209
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y ++ G+ E+DYP G C
Sbjct: 210 YTLKAGGLQLEKDYPYTGRNGKC 232
>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
Length = 342
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 148 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 207
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 208 GIDSEASYPYKAMDGKCQ 225
>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD++ + C GG + T ++
Sbjct: 152 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFE 211
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y ++ G+ E+DYP G C
Sbjct: 212 YTLKAGGLQLEKDYPYTGKDGKC 234
>gi|222637029|gb|EEE67161.1| hypothetical protein OsJ_24244 [Oryza sativa Japonica Group]
Length = 309
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + + T NL+D+S QQLVDCD+ ++ C GG + Y
Sbjct: 160 GSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTECDSGCGGGLMTNAYA 219
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++ + G+ + YP G C+
Sbjct: 220 YLMSSGGLMEQSAYPYTGAQGTCR 243
>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD++ + C GG + T ++
Sbjct: 152 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFE 211
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y ++ G+ E+DYP G C
Sbjct: 212 YTLKAGGLQLEKDYPYTGKDGKC 234
>gi|115472081|ref|NP_001059639.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|27261016|dbj|BAC45132.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113611175|dbj|BAF21553.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|215693312|dbj|BAG88694.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 376
Score = 71.2 bits (173), Expect = 7e-11, Method: Composition-based stats.
Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + + T NL+D+S QQLVDCD+ ++ C GG + Y
Sbjct: 160 GSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTECDSGCGGGLMTNAYA 219
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++ + G+ + YP G C+
Sbjct: 220 YLMSSGGLMEQSAYPYTGAQGTCR 243
>gi|357446993|ref|XP_003593772.1| Cysteine proteinase [Medicago truncatula]
gi|355482820|gb|AES64023.1| Cysteine proteinase [Medicago truncatula]
Length = 339
Score = 71.2 bits (173), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 49/80 (61%), Gaps = 2/80 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGAIEGI+ I T L+++S Q+L+DCD S C G++ + +VI+N+G
Sbjct: 128 GSCWAFSAVGAIEGINAITTGKLINLSEQELLDCD--PISGGCNSGWVNKAFDWVIRNKG 185
Query: 65 INTERDYPNVGVMDNCKVFQ 84
+ + DYP CK Q
Sbjct: 186 VALDNDYPYTAEKGVCKASQ 205
>gi|384247445|gb|EIE20932.1| hypothetical protein COCSUDRAFT_18161 [Coccomyxa subellipsoidea
C-169]
Length = 387
Score = 71.2 bits (173), Expect = 8e-11, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 47/67 (70%), Gaps = 1/67 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG + + T +LV +S QQLVDCD + + + C GG ++ + Y+I+N G
Sbjct: 138 GSCWAFSTTGSVEGANFLATGDLVSLSEQQLVDCDTK-KDQGCGGGLMDYAFDYIIKNGG 196
Query: 65 INTERDY 71
++TE DY
Sbjct: 197 LDTEEDY 203
>gi|52546912|gb|AAU81589.1| cysteine proteinase [Petunia x hybrida]
Length = 257
Score = 71.2 bits (173), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 47/83 (56%), Gaps = 7/83 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESR-------SCVGGFIETIYQ 57
GSCW F+ GA+EG + T LV +S QQLVDCD++ ++ C GG + T ++
Sbjct: 46 GSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEQQNECDAGCGGGLMTTAFE 105
Query: 58 YVIQNRGINTERDYPNVGVMDNC 80
Y ++ G+ E+DYP G C
Sbjct: 106 YTLKAGGLQREKDYPYTGRDGKC 128
>gi|52546920|gb|AAU81593.1| cysteine proteinase [Petunia x hybrida]
Length = 210
Score = 71.2 bits (173), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 46/68 (67%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I T NL +S Q+L+DCD + C GG ++ +Q++I N G
Sbjct: 107 GSCWAFSTVAAVEGINQIKTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFQFIISNGG 165
Query: 65 INTERDYP 72
++ E DYP
Sbjct: 166 LHKEDDYP 173
>gi|281206749|gb|EFA80934.1| counting factor associated protein [Polysphondylium pallidum PN500]
Length = 530
Score = 71.2 bits (173), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F G++EG+S + T LV +S QQLVDC G+S+ C GGF +QY++ G
Sbjct: 333 GSCWTFGSTGSLEGVSCLATGKLVSLSEQQLVDCAYLGQSQGCNGGFASDAFQYIMNFGG 392
Query: 65 INTERDYPNVGVMDNCK 81
I E YP + CK
Sbjct: 393 IAYESTYPYLMQNGYCK 409
>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 71.2 bits (173), Expect = 8e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LVD+S QQLVDC ++ C GG ++ +QY+ N G
Sbjct: 136 GSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 195
Query: 65 INTERDYPNVGVMDN-CK 81
++TE YP D CK
Sbjct: 196 LDTEESYPYTATDDKPCK 213
>gi|47213724|emb|CAF95155.1| unnamed protein product [Tetraodon nigroviridis]
Length = 336
Score = 71.2 bits (173), Expect = 8e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 47/77 (61%), Gaps = 2/77 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG+ T LVD+S Q LVDC E+ C GG++ ++YV N+G
Sbjct: 143 GSCWAFSSAGALEGMLAKKTGKLVDLSPQNLVDCVK--ENSGCGGGYMTNAFKYVATNKG 200
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP VG C+
Sbjct: 201 LDSEAAYPYVGQEQPCQ 217
>gi|114625736|ref|XP_001153919.1| PREDICTED: cathepsin L2 isoform 2 [Pan troglodytes]
gi|114625742|ref|XP_520130.2| PREDICTED: cathepsin L2 isoform 5 [Pan troglodytes]
Length = 334
Score = 71.2 bits (173), Expect = 8e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC ++ C GGF+ +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V + + CK
Sbjct: 196 LDSEESYPYVAMDEICK 212
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 71.2 bits (173), Expect = 8e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +IE + T LV +S QQL+DCD + C GGF E +++V++N G
Sbjct: 110 GSCWAFSAIASIESAHFLATKELVSLSEQQLIDCDTV--DQGCQGGFPEDAFKFVVENGG 167
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP G +C
Sbjct: 168 VTTEEAYPYTGFAGSC 183
>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 71.2 bits (173), Expect = 8e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LVD+S QQLVDC ++ C GG ++ +QY+ N G
Sbjct: 136 GSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 195
Query: 65 INTERDYPNVGVMDN-CK 81
++TE YP D CK
Sbjct: 196 LDTEESYPYTATDDKPCK 213
>gi|348586359|ref|XP_003478936.1| PREDICTED: cathepsin S-like [Cavia porcellus]
Length = 344
Score = 71.2 bits (173), Expect = 8e-11, Method: Composition-based stats.
Identities = 31/69 (44%), Positives = 44/69 (63%), Gaps = 1/69 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 200 GACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNN 259
Query: 64 GINTERDYP 72
GI++E YP
Sbjct: 260 GIDSETSYP 268
>gi|426362423|ref|XP_004048364.1| PREDICTED: cathepsin L2 isoform 1 [Gorilla gorilla gorilla]
gi|426362425|ref|XP_004048365.1| PREDICTED: cathepsin L2 isoform 2 [Gorilla gorilla gorilla]
Length = 334
Score = 70.9 bits (172), Expect = 8e-11, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC ++ C GGF+ +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V + + CK
Sbjct: 196 LDSEESYPYVAMDEICK 212
>gi|356557734|ref|XP_003547166.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
Length = 369
Score = 70.9 bits (172), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 45/78 (57%), Gaps = 3/78 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEG S + T L+ +S Q+L+DC S C GG+I+ +VI NRG
Sbjct: 162 GSCWAFSATGAIEGASALATGKLISVSEQELLDC---AYSFGCGGGWIDKALDWVIGNRG 218
Query: 65 INTERDYPNVGVMDNCKV 82
I +E DYP C+
Sbjct: 219 IASEIDYPYTARKGTCRA 236
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 70.9 bits (172), Expect = 8e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +IE + T LV +S QQL+DCD + C GGF E +++V++N G
Sbjct: 110 GSCWAFSAIASIESAHFLATKELVSLSEQQLIDCDTV--DQGCQGGFPEDAFKFVVENGG 167
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP G +C
Sbjct: 168 VTTEEAYPYTGFAGSC 183
>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
proteinase II; Flags: Precursor
gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
Length = 337
Score = 70.9 bits (172), Expect = 8e-11, Method: Composition-based stats.
Identities = 30/78 (38%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSC+ F+ G++EG++ I T LV +S Q ++DC + + C GG + ++Y+I+N G
Sbjct: 143 GSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNG 202
Query: 65 INTERDYP-NVGVMDNCK 81
+N+E YP + V D CK
Sbjct: 203 LNSEEQYPYEMKVNDECK 220
>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 70.9 bits (172), Expect = 8e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LVD+S QQLVDC ++ C GG ++ +QY+ N G
Sbjct: 136 GSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 195
Query: 65 INTERDYPNVGVMDN-CK 81
++TE YP D CK
Sbjct: 196 LDTEESYPYTATDDKPCK 213
>gi|94421566|gb|ABF18890.1| cathepsin-L-like cysteine proteinase 2 [Lygus lineolaris]
Length = 216
Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 46/82 (56%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG K+ L +S QQLVDC + C GG ++ ++Y+ +N G
Sbjct: 93 GSCWAFSTTGSLEGQHKLKQGKLYSLSEQQLVDCSAAEGNMGCEGGLMDDGFKYIKKNGG 152
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE+ YP G C + N
Sbjct: 153 IDTEKSYPYTGEDGKCHATKKN 174
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q L+DC + C GG ++ ++Y+ N G
Sbjct: 146 GSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNGG 205
Query: 65 INTERDYPNVGVMDNCK 81
I+TE+ YP GV D C+
Sbjct: 206 IDTEKAYPYEGVDDKCR 222
>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
Length = 344
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG + LV +S Q L+DC Q + C GG ++ ++Y+ N G
Sbjct: 149 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGG 208
Query: 65 INTERDYPNVGVMDNCK 81
I+TE+ YP GV D C+
Sbjct: 209 IDTEQAYPYEGVDDKCR 225
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ GAIEGI+ IV+ +L+ +S +LVDCD + C GG ++ +++V+ N G
Sbjct: 159 GCCWAFSSTGAIEGINAIVSGDLISLSEPELVDCDRTND--GCDGGHMDYAFEWVMHNGG 216
Query: 65 INTERDYPNVGVMDNCKV 82
I+TE +YP G C V
Sbjct: 217 IDTETNYPYSGADGTCNV 234
>gi|41055337|ref|NP_956720.1| cathepsin S, a [Danio rerio]
gi|32451845|gb|AAH54668.1| Cathepsin S, a [Danio rerio]
Length = 239
Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 43/70 (61%)
Query: 3 PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
P GSCW F+ VG++E K T LV +S Q L+DC +R C GGF+ + YVIQN
Sbjct: 44 PCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQN 103
Query: 63 RGINTERDYP 72
RGI++ YP
Sbjct: 104 RGIDSSTFYP 113
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 44/76 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + + C GG ++ +QY+ N G
Sbjct: 144 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGG 203
Query: 65 INTERDYPNVGVMDNC 80
I+TE+ YP + D C
Sbjct: 204 IDTEKSYPYEAIDDTC 219
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 33/82 (40%), Positives = 48/82 (58%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD E++ C GG ++ + ++ + G
Sbjct: 149 GSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTL-ENQGCNGGLMDLAFDFIKKTGG 207
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
+ E YP C + N
Sbjct: 208 LTREDAYPYAAEDGKCDSNKMN 229
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 49/82 (59%), Gaps = 2/82 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI T L+ +S Q+LVDCD+ ++ C GG +E + ++ Q G
Sbjct: 149 GSCWAFSTVAAVEGINKIKTGELISLSEQELVDCDS--DNHGCNGGLMEDAFNFIKQIGG 206
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
+ +E YP + C + N
Sbjct: 207 LTSENTYPYRAKEEPCDSNKMN 228
>gi|89266543|gb|ABD65563.1| cathepsin S [Ictalurus punctatus]
Length = 165
Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 47/80 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG K T + +S Q LVDC ++ ++ C GGF+ +QYVI N G
Sbjct: 22 GSCWAFSAAGALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTQAFQYVIDNGG 81
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+++ YP + C+ Q
Sbjct: 82 IDSDEAYPYTAMDGQCRYDQ 101
>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 32/70 (45%), Positives = 43/70 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEG TN LV++S QQL+DC + C GG ++ +QYV N G
Sbjct: 172 GSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNEG 231
Query: 65 INTERDYPNV 74
I++E YP +
Sbjct: 232 IDSEISYPYI 241
>gi|226469954|emb|CAX70258.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 32/70 (45%), Positives = 43/70 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEG TN LV++S QQL+DC + C GG ++ +QYV N G
Sbjct: 172 GSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNEG 231
Query: 65 INTERDYPNV 74
I++E YP +
Sbjct: 232 IDSEISYPYI 241
>gi|440797325|gb|ELR18416.1| cathepsin Llike cysteine protease [Acanthamoeba castellanii str.
Neff]
Length = 345
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 40/68 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GAIEG + T LVD+S + L+DC C GG +QYVI N+G
Sbjct: 135 GSCWAFSAAGAIEGQQALRTGRLVDLSEENLIDCSWAQGDMGCGGGLPSQAFQYVIDNKG 194
Query: 65 INTERDYP 72
I+TE YP
Sbjct: 195 IDTEARYP 202
>gi|297596716|ref|NP_001042970.2| Os01g0347600 [Oryza sativa Japonica Group]
gi|255673204|dbj|BAF04884.2| Os01g0347600 [Oryza sativa Japonica Group]
Length = 211
Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 43/78 (55%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA V AIEG++KI T L +S Q+LVDCD S C GG + ++ V G
Sbjct: 15 GSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN--SNGCGGGHTDRAFELVASKGG 72
Query: 65 INTERDYPNVGVMDNCKV 82
I E DY G C+V
Sbjct: 73 ITAESDYRYEGFQGKCRV 90
>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
Length = 340
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
G+CW F+ VGA+EG K+ T L+ +S Q LVDC N+ + ++ C GG++ +QY+I N
Sbjct: 145 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 204
Query: 63 RGINTERDYPNVGVMDNC 80
GI + YP + + C
Sbjct: 205 GGIEADASYPYKAMDEKC 222
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 44/68 (64%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI+ I + NL +S QQLVDCD + + C GG ++ +QY+ ++ G
Sbjct: 159 GSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNA-GCNGGLMDYAFQYIAKHGG 217
Query: 65 INTERDYP 72
+ E YP
Sbjct: 218 VAAEDAYP 225
>gi|426216526|ref|XP_004002513.1| PREDICTED: cathepsin S isoform 2 [Ovis aries]
Length = 281
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 87 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAFQYIIDNN 146
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 147 GIDSEASYPYKAMDGRCQ 164
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 137 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAFQYIIDNN 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 197 GIDSEASYPYKAMDGRCQ 214
>gi|1085731|pir||S46476 cysteine proteinase (EC 3.4.22.-) III - mountain papaya
gi|926847|gb|AAB32657.1| cysteine proteinase CC-III [Carica candamarcensis=mountain
papaya, Hook, latex, Peptide, 214 aa]
Length = 214
Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 49/78 (62%), Gaps = 3/78 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +EGI+KIV NL +S Q+LVDCD + S C GG+ T +YV+ + G
Sbjct: 23 GSCWAFSTIATVEGINKIVHGNLTSLSEQELVDCDRR--SHGCKGGYQTTSLKYVV-DHG 79
Query: 65 INTERDYPNVGVMDNCKV 82
++TE++YP C+
Sbjct: 80 VHTEKEYPYEEKQYKCRA 97
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 146 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNN 205
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP C+
Sbjct: 206 GIDSEASYPYKATDGKCR 223
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 47/76 (61%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A EGI++I T L+ +S Q+LVDCD GE++ C GG ++ +++ I+ G
Sbjct: 144 GCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRF-IKIHG 202
Query: 65 INTERDYPNVGVMDNC 80
+ +E YP G C
Sbjct: 203 LASEATYPYEGDDGTC 218
>gi|442539990|gb|AGC54590.1| bromelain, partial [Ananas comosus]
Length = 241
Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 3/84 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW FA + +EGI KI T LV +S Q+++DC S C GG++ Y ++I
Sbjct: 32 NPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDC---AVSYGCKGGWVNKAYDFIIS 88
Query: 62 NRGINTERDYPNVGVMDNCKVFQF 85
N G+ TE +YP C F
Sbjct: 89 NNGVTTEENYPYQAYQGTCNANSF 112
>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
Length = 340
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
G+CW F+ VGA+EG K+ T L+ +S Q LVDC N+ + ++ C GG++ +QY+I N
Sbjct: 145 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 204
Query: 63 RGINTERDYPNVGVMDNC 80
GI + YP + + C
Sbjct: 205 GGIEADASYPYKAMDEKC 222
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG + T LV +S Q L+DC + + C GG ++ +QY+ N+G
Sbjct: 140 GSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKG 199
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP D C+
Sbjct: 200 IDTENTYPYEAEDDVCR 216
>gi|294938848|ref|XP_002782226.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239893730|gb|EER14021.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 334
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 1/80 (1%)
Query: 3 PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
P GSCW F+ GA+E I T L+ +S QQL+DC + + C GG +E Y Y I++
Sbjct: 132 PCGSCWAFSATGALEAQYAIATGKLLSLSEQQLIDCSSSYGNEGCSGGLMENAYTY-IKS 190
Query: 63 RGINTERDYPNVGVMDNCKV 82
G++ E YP + + C+V
Sbjct: 191 AGLDQESTYPYIAKNNACQV 210
>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
Length = 430
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 46/68 (67%), Gaps = 2/68 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EGI+KI T LV +S Q++V C Q + C GG ++ ++++++N G
Sbjct: 223 GSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQ--NMGCNGGLMDYAFRWIVKNGG 280
Query: 65 INTERDYP 72
I++E YP
Sbjct: 281 IDSEFQYP 288
>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LVD+S QQLVDC ++ C GG ++ +QY+ N G
Sbjct: 136 GSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 195
Query: 65 INTERDYPNVGVMDN-CK 81
++TE YP D CK
Sbjct: 196 LDTEESYPYTATDDKPCK 213
>gi|47086663|ref|NP_997853.1| cathepsin H precursor [Danio rerio]
gi|45709087|gb|AAH67615.1| Cathepsin H [Danio rerio]
Length = 330
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 28/79 (35%), Positives = 45/79 (56%)
Query: 3 PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
P GSCW F+ G +E ++ I T L+ ++ QQL+DC ++ C GG ++Y++ N
Sbjct: 132 PCGSCWTFSTTGCLESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYN 191
Query: 63 RGINTERDYPNVGVMDNCK 81
+G+ TE DYP C+
Sbjct: 192 KGLMTEDDYPYQAKGGQCR 210
>gi|146386360|gb|ABQ23968.1| cathepsin S [Oryctolagus cuniculus]
Length = 162
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/69 (46%), Positives = 45/69 (65%), Gaps = 1/69 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD-NQGESRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T NLV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 90 GACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTTKYGNKGCNGGFMTEAFQYIIDNN 149
Query: 64 GINTERDYP 72
GI++E YP
Sbjct: 150 GIDSEASYP 158
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 44/76 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + + C GG ++ +QY+ N G
Sbjct: 144 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGG 203
Query: 65 INTERDYPNVGVMDNC 80
I+TE+ YP + D C
Sbjct: 204 IDTEKSYPYEAIDDTC 219
>gi|410907221|ref|XP_003967090.1| PREDICTED: pro-cathepsin H-like [Takifugu rubripes]
Length = 324
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/76 (39%), Positives = 42/76 (55%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G +E ++ I + LV +S QQLVDC + C GG ++Y+ N+G
Sbjct: 130 GSCWTFSTTGCLESVTAINSGKLVPLSEQQLVDCAQDFNNHGCNGGLPSQAFEYIKYNKG 189
Query: 65 INTERDYPNVGVMDNC 80
+ TE DYP D C
Sbjct: 190 LMTESDYPYTAFEDKC 205
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 134 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNN 193
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP C+
Sbjct: 194 GIDSEASYPYKATDGKCR 211
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +IE + T LV +S QQL+DCD + C GGF E +++V++N G
Sbjct: 110 GSCWAFSAIASIESAHFLATKELVSLSEQQLIDCDTV--DQGCQGGFPEDAFKFVVENGG 167
Query: 65 INTERDYPNVGVMDNC 80
+ TE YP G +C
Sbjct: 168 VTTEEAYPYTGFAGSC 183
>gi|4469159|emb|CAB38317.1| chymopapain isoform V [Carica papaya]
Length = 227
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 48/78 (61%), Gaps = 3/78 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +EGI+KIVT NL+++S Q+LVDCD S C GG+ T QYV N G
Sbjct: 23 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYVANN-G 79
Query: 65 INTERDYPNVGVMDNCKV 82
++T + YP C+
Sbjct: 80 VHTSKVYPCQAKQYKCRA 97
>gi|313507179|pdb|2ACT|A Chain A, Crystallographic Refinement Of The Structure Of Actinidin
At 1.7 Angstroms Resolution By Fast Fourier
Least-Squares Methods
Length = 220
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 48/78 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G W F+ + +EGI+KI + +L+ +S Q+L+DC +R C GG+I +Q++I + G
Sbjct: 23 GGXWAFSAIATVEGINKITSGSLISLSEQELIDCGRTQNTRGCDGGYITDGFQFIINDGG 82
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP +C V
Sbjct: 83 INTEENYPYTAQDGDCDV 100
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +EGI +I T LV +S Q+LVDCD C GG +++ N G
Sbjct: 184 GSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTL--DAGCDGGISYRALRWITSNGG 241
Query: 65 INTERDYPNVGVMDNC 80
+ TE DYP G D C
Sbjct: 242 LTTEEDYPYTGTTDAC 257
>gi|197258082|gb|ACH56225.1| cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 282
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 42/76 (55%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG K T LV +S Q LVDC + C GG ++ ++YV QN G
Sbjct: 87 GSCWAFSATGSLEGQHKRATGKLVSLSEQNLVDCSADFGNNGCNGGLMDFAFEYVKQNHG 146
Query: 65 INTERDYPNVGVMDNC 80
I+TE YP C
Sbjct: 147 IDTEESYPYKAKQKKC 162
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+ IVT NL +S Q+L+DC G + C GG ++ + Y+ G
Sbjct: 163 GSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNN-GCNGGLMDYAFSYIASTGG 221
Query: 65 INTERDYP 72
+ TE YP
Sbjct: 222 LRTEEAYP 229
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 29/77 (37%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T L +S Q L+DC + + C GG ++ + Y+ N+G
Sbjct: 145 GSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCSGKYGNNGCSGGLMDNAFAYIKSNKG 204
Query: 65 INTERDYPNVGVMDNCK 81
I+TE+ YP G+ D C+
Sbjct: 205 IDTEQSYPYEGIDDKCR 221
>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
Length = 343
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
G+CW F+ VGA+EG K+ T L+ +S Q LVDC N+ + ++ C GG++ +QY+I N
Sbjct: 148 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 207
Query: 63 RGINTERDYPNVGVMDNC 80
GI + YP + C
Sbjct: 208 GGIEADASYPYKATDEKC 225
>gi|318054062|ref|NP_001187179.1| cathepsin S precursor [Ictalurus punctatus]
gi|190351079|gb|ACE75948.1| cathepsin S [Ictalurus punctatus]
Length = 329
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/80 (40%), Positives = 47/80 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG K T + +S Q LVDC ++ ++ C GGF+ +QYVI N G
Sbjct: 136 GSCWAFSAAGALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTEAFQYVIDNGG 195
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+++ YP + C+ Q
Sbjct: 196 IDSDEAYPYTAMDGQCRYDQ 215
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 44/76 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + + C GG ++ +QY+ N G
Sbjct: 145 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDNGG 204
Query: 65 INTERDYPNVGVMDNC 80
I+TE+ YP + D C
Sbjct: 205 IDTEKAYPYEAIDDTC 220
>gi|42744610|gb|AAH66625.1| Ctssa protein [Danio rerio]
Length = 321
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 43/70 (61%)
Query: 3 PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
P GSCW F+ VG++E K T LV +S Q L+DC +R C GGF+ + YVIQN
Sbjct: 126 PCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQN 185
Query: 63 RGINTERDYP 72
RGI++ YP
Sbjct: 186 RGIDSSTFYP 195
>gi|413933048|gb|AFW67599.1| hypothetical protein ZEAMMB73_513726 [Zea mays]
Length = 205
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 46/76 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI T LV +S Q+LVDCD G + C GG ++ +Q+V + G
Sbjct: 11 GCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGG 70
Query: 65 INTERDYPNVGVMDNC 80
+ +E YP G C
Sbjct: 71 LASESGYPYQGRDGPC 86
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 3/84 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW FA + +EGI KI T LV +S Q+++DC S C GG++ Y ++I
Sbjct: 143 NPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDC---AVSYGCKGGWVNKAYDFIIS 199
Query: 62 NRGINTERDYPNVGVMDNCKVFQF 85
N G+ TE +YP C F
Sbjct: 200 NNGVTTEENYPYQAYQGTCNANSF 223
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 3/84 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW FA + +EGI KI T LV +S Q+++DC S C GG++ Y ++I
Sbjct: 115 NPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDC---AVSYGCKGGWVNKAYDFIIS 171
Query: 62 NRGINTERDYPNVGVMDNCKVFQF 85
N G+ TE +YP C F
Sbjct: 172 NNGVTTEENYPYQAYQGTCNANSF 195
>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
Length = 336
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LVD+S QQLVDC ++ C GG ++ +QY+ N G
Sbjct: 138 GSCWAFSTTGSLEGQHANKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGG 197
Query: 65 INTERDYPNVGVMDN-CK 81
++TE YP D CK
Sbjct: 198 LDTEESYPYTATDDKPCK 215
>gi|353441136|gb|AEQ94152.1| drought-inducible cysteine proteinase [Elaeis guineensis]
Length = 252
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 48/79 (60%), Gaps = 7/79 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + + T L +S QQLVDCD++ +S C GG + T ++
Sbjct: 164 GSCWSFSASGALEGANFLATGQLESLSEQQLVDCDHECDSSEPDSCDSGCNGGLMTTAFE 223
Query: 58 YVIQNRGINTERDYPNVGV 76
Y++++ G+ E+DYP G
Sbjct: 224 YLLKSGGLELEKDYPYTGT 242
>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
Length = 340
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
G+CW F+ VGA+EG K+ T L+ +S Q LVDC N+ + ++ C GG++ +QY+I N
Sbjct: 145 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 204
Query: 63 RGINTERDYPNVGVMDNC 80
GI + YP + C
Sbjct: 205 GGIEADASYPYKATDEKC 222
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/76 (42%), Positives = 46/76 (60%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI++I T L+ +S Q+LVDCD GE++ C GG + +++ I G
Sbjct: 103 GSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGCSGGLXDDAFRF-IXIHG 161
Query: 65 INTERDYPNVGVMDNC 80
+ +E YP G C
Sbjct: 162 LASEATYPYEGDDGTC 177
>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
Length = 330
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
G+CW F+ VGA+EG K+ T L+ +S Q LVDC N+ + ++ C GG++ +QY+I N
Sbjct: 135 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 194
Query: 63 RGINTERDYPNVGVMDNC 80
GI + YP + + C
Sbjct: 195 GGIEADASYPYKAMDEKC 212
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 44/77 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T +VD+S Q LVDC + C GG + ++Y+ N+G
Sbjct: 151 GSCWAFSTTGSLEGQHMRKTGTMVDLSEQNLVDCSTSYGNDGCNGGLMTNAFKYIKDNKG 210
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP G +CK
Sbjct: 211 IDTEEAYPYAGRDGDCK 227
>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
Length = 342
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
G+CW F+ VGA+EG K+ T L+ +S Q LVDC N+ + ++ C GG++ +QY+I N
Sbjct: 147 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 206
Query: 63 RGINTERDYPNVGVMDNC 80
GI + YP + C
Sbjct: 207 GGIEADASYPYKATDEKC 224
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 28/80 (35%), Positives = 49/80 (61%), Gaps = 1/80 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG++ I T L+ +S ++L+ C G + C GG ++ +++++ NRG
Sbjct: 179 GSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNG-NMGCNGGLMDNGFEWIVNNRG 237
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+TE + V + C F+
Sbjct: 238 IDTEDGWEYVAKEEKCGFFR 257
>gi|74152091|dbj|BAE32077.1| unnamed protein product [Mus musculus]
Length = 245
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
G+CW F+ VGA+EG K+ T L+ +S Q LVDC N+ + ++ C GG++ +QY+I N
Sbjct: 50 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 109
Query: 63 RGINTERDYPNVGVMDNC 80
GI + YP + C
Sbjct: 110 GGIEADASYPYKATDEKC 127
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 44/68 (64%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG +I + N+V++S Q LVDC + C GG ++ ++Y+I N G
Sbjct: 288 GSCWSFSTTGSVEGAHQIKSGNMVELSEQNLVDCSTSEGNMGCNGGLMDYAFEYIITNNG 347
Query: 65 INTERDYP 72
I+TE YP
Sbjct: 348 IDTESSYP 355
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++EGI+ I T +LV +S Q+L+DCD ++ C GG ++ ++Y+ N G
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGG 214
Query: 65 INTERDYPNVGVMDNCKV 82
+ TE YP C V
Sbjct: 215 LITEAAYPYRAARGTCNV 232
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A EGI +I T L+ +S Q+LVDCD+ C GG +E ++++I+N G
Sbjct: 143 GSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV--DHGCDGGLMEDGFEFIIKNGG 200
Query: 65 INTERDYPNVGVMDNC 80
I++E +YP V C
Sbjct: 201 ISSEANYPYTAVDGTC 216
>gi|441593109|ref|XP_003260582.2| PREDICTED: cathepsin L2 isoform 1 [Nomascus leucogenys]
Length = 334
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC ++ C GGF+ +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMGKAFQYVKENGG 195
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V + + CK
Sbjct: 196 LDSEESYPYVAMDEICK 212
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG + LV +S Q L+DC + + C GG ++ ++Y+ N G
Sbjct: 147 GSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYIKDNGG 206
Query: 65 INTERDYPNVGVMDNCK 81
I+TE+ YP GV D C+
Sbjct: 207 IDTEKTYPYEGVDDKCR 223
>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
Length = 325
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 46/78 (58%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+V G +EG + T LV +S QQLVDCD Q C GG+ T Y +I+ G
Sbjct: 134 GSCWAFSVAGNVEGQWFLKTGQLVSLSKQQLVDCDVQ--DSGCDGGYPPTTYGEIIRMGG 191
Query: 65 INTERDYPNVGVMDNCKV 82
+ +RDYP VG CK+
Sbjct: 192 LEAQRDYPYVGREQPCKL 209
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 46/77 (59%), Gaps = 2/77 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A+EG++KI LV +S QQL+DC E+ C GG + + Y+ +N+G
Sbjct: 149 GCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDCST--ENNGCGGGIMWKAFDYIKENQG 206
Query: 65 INTERDYPNVGVMDNCK 81
I TE +YP G C+
Sbjct: 207 ITTEDNYPYQGAQQTCE 223
>gi|66814630|ref|XP_641494.1| cysteine protease [Dictyostelium discoideum AX4]
gi|118121|sp|P04989.1|CYSP2_DICDI RecName: Full=Cysteine proteinase 2; AltName: Full=Prestalk
cathepsin; Flags: Precursor
gi|167860|gb|AAA33240.1| pst-cathepsin [Dictyostelium discoideum]
gi|1834417|emb|CAA27050.1| cysteine proteinase 2 [Dictyostelium discoideum]
gi|60469522|gb|EAL67513.1| cysteine protease [Dictyostelium discoideum AX4]
gi|225484|prf||1304284A cathepsin,prestalk
Length = 376
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 41/68 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G+ EG + T LV +S Q LVDC E+ C GG + + Y+I+N+G
Sbjct: 145 GSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKG 204
Query: 65 INTERDYP 72
I+TE YP
Sbjct: 205 IDTESSYP 212
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 45/76 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VG++EG T LV +S Q LVDC + C GG+++ ++YV N G
Sbjct: 190 GSCWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHG 249
Query: 65 INTERDYPNVGVMDNC 80
I+TE YP VG +C
Sbjct: 250 IDTEDSYPYVGTDGSC 265
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V AIEG ++I L+ +S QQLVDCD C GG I+T +++++ G
Sbjct: 66 GCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCSGGLIDTAFEHIMATGG 123
Query: 65 INTERDYPNVGVMDNCKV 82
+ TE +YP G CK+
Sbjct: 124 LTTESNYPYKGEDATCKI 141
>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
Length = 321
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 45/84 (53%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG I T L+ +S QQLVDCD+ + R C GG + Y+
Sbjct: 114 GSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYK 173
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y+I+ G+ E YP G CK
Sbjct: 174 YLIEAGGLEEESSYPYTGKHGECK 197
>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
Length = 340
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
G+CW F+ VGA+EG K+ T L+ +S Q LVDC N+ + ++ C GG++ +QY+I N
Sbjct: 145 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 204
Query: 63 RGINTERDYPNVGVMDNC 80
GI + YP + C
Sbjct: 205 GGIEADASYPYKATDEKC 222
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F++V AIEGI +I T LV +S Q+LVDC +G+S C G+ E +++V +N G
Sbjct: 146 GSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDC-VKGKSEGCNFGYKEEAFEFVAKNGG 204
Query: 65 INTERDYPNVGVMDNCKV 82
+ +E YP C V
Sbjct: 205 LASEISYPYKANNKTCMV 222
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 44/77 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG + T LV +S Q LVDC + C GG + +QYV N+G
Sbjct: 122 GSCWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKG 181
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP +NC+
Sbjct: 182 IDTEASYPYEARENNCR 198
>gi|37788267|gb|AAO64473.1| cathepsin H precursor [Fundulus heteroclitus]
Length = 345
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 42/68 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G +E ++ I T LV +S QQLVDC + C GG ++Y++ N+G
Sbjct: 151 GSCWTFSTTGCLESVTAIATVKLVPLSEQQLVDCAQDFNNHGCNGGLPSQAFEYIMYNKG 210
Query: 65 INTERDYP 72
+ TE+DYP
Sbjct: 211 LMTEQDYP 218
>gi|358339356|dbj|GAA47436.1| cathepsin L [Clonorchis sinensis]
Length = 236
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FAV G IEG T LV +S QQL+DCD + E +C GGF E Y+ +++ G
Sbjct: 43 GSCWAFAVTGNIEGQWYKKTKKLVSLSEQQLLDCDKKDE--ACNGGFPEWAYESIVKMGG 100
Query: 65 INTERDYPNVGVMDNCKV 82
+ +E+DYP + C +
Sbjct: 101 LMSEKDYPYEAHKETCNL 118
>gi|391333248|ref|XP_003741031.1| PREDICTED: uncharacterized protein LOC100898636 [Metaseiulus
occidentalis]
Length = 642
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 43/77 (55%), Gaps = 2/77 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T L +S Q LVDC ES+ C GGF E +QY+ N G
Sbjct: 450 GSCWAFSATGAVEGQHFKATGRLESLSEQNLVDCVK--ESKGCDGGFFEQAFQYIKDNGG 507
Query: 65 INTERDYPNVGVMDNCK 81
INTE YP +C+
Sbjct: 508 INTEDSYPYEAFDGSCR 524
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 30/80 (37%), Positives = 39/80 (48%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW FA GAIEG T NLV +S Q ++DC S C GG + Y+ + G
Sbjct: 128 GACWTFAATGAIEGQHFKATGNLVSLSEQNILDCVKTATSNGCSGGLFVEAFDYLKNSGG 187
Query: 65 INTERDYPNVGVMDNCKVFQ 84
I+ E YP C+ Q
Sbjct: 188 IDAEESYPYEASGGTCRFRQ 207
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 46/78 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + +EGI+KIVT L+ +S Q+L+ C +R C GG+I +Q++I N G
Sbjct: 80 GGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGTQNTRGCNGGYITDGFQFIINNGG 139
Query: 65 INTERDYPNVGVMDNCKV 82
INT +YP C +
Sbjct: 140 INTGENYPYTAQDGECNL 157
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++EGI+ I T +LV +S Q+L+DCD ++ C GG ++ ++Y+ N G
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGG 214
Query: 65 INTERDYPNVGVMDNCKV 82
+ TE YP C V
Sbjct: 215 LITEAAYPYRAARGTCNV 232
>gi|417409774|gb|JAA51378.1| Putative cathepsin k, partial [Desmodus rotundus]
Length = 331
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 34/76 (44%), Positives = 49/76 (64%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ + YV +N+G
Sbjct: 139 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFHYVQKNQG 196
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 197 IDSEDAYPYVGQDESC 212
>gi|118136313|gb|ABK62794.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
gi|118136315|gb|ABK62795.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
Length = 335
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/77 (38%), Positives = 42/77 (54%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G+IEG K T L+ S QQLVDC + C GG ++ + Y+I N+G
Sbjct: 140 GSCWAFSSTGSIEGAVKRATGKLISFSEQQLVDCSTAFGNHGCNGGIMDNSFNYLIHNKG 199
Query: 65 INTERDYPNVGVMDNCK 81
+ +E YP C+
Sbjct: 200 LESEASYPYEAQKKECR 216
>gi|162815|gb|AAA30435.1| cathepsin S, partial [Bos taurus]
gi|312895|emb|CAA43971.1| cathepsin S [Bos taurus]
Length = 196
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 2 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 61
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 62 GIDSEASYPYKAMDGKCQ 79
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 47/77 (61%), Gaps = 2/77 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ VG++EG KI T NL++ S Q+L+DC + C GGF+ + ++I+N G
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGG 209
Query: 65 INTERDYPNVGVMDNCK 81
I+ E DY +G C+
Sbjct: 210 ISRESDYEYLGQQYTCR 226
>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
Length = 326
Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
G+CW F+ VGA+EG K+ T L+ +S Q LVDC N+ + ++ C GG++ +QY+I N
Sbjct: 131 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 190
Query: 63 RGINTERDYPNVGVMDNC 80
GI + YP + C
Sbjct: 191 GGIEADASYPYKATDEKC 208
>gi|255211|gb|AAB23202.1| cathepsin S [cattle, spleen, Peptide Partial, 217 aa]
gi|227966|prf||1714236A cathepsin S
Length = 217
Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 23 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 82
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 83 GIDSEASYPYKAMDGKCQ 100
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 28/69 (40%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNR- 63
GSCW F+ V ++EG++ I T +LV +S Q+L+DCD G+ C GG +E+ ++++ +
Sbjct: 158 GSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEFIAHSAG 217
Query: 64 GINTERDYP 72
G+ TE YP
Sbjct: 218 GLATEAAYP 226
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/77 (38%), Positives = 45/77 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG + T L+ +S Q L+DC + + C GG ++ +QY+ N+G
Sbjct: 144 GSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKG 203
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP D C+
Sbjct: 204 IDTENTYPYEAEDDVCR 220
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++EGI+ I T LV +S Q+L+DCD ++ C GG +E ++Y+ + G
Sbjct: 157 GSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDT-ADNSGCQGGLMENAFEYIKHSGG 215
Query: 65 INTERDYPNVGVMDNC 80
I TE YP C
Sbjct: 216 ITTESAYPYRAANGTC 231
>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
gi|1096153|prf||2111244A Cys protease
Length = 380
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/84 (38%), Positives = 46/84 (54%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
GSCW F+ G+IEG + + T LV +S QQL+DCDN+ + C GG + Y
Sbjct: 162 GSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYN 221
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y++++ G+ E YP G CK
Sbjct: 222 YLLESGGLEEESSYPYTGERGECK 245
>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 36/89 (40%), Positives = 47/89 (52%), Gaps = 10/89 (11%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG + T L+ +S QQLVDCD+Q + C GG + Y+
Sbjct: 161 GSCWAFSTTGAVEGAHFLATGKLLSLSEQQLVDCDHQCDPEEAQACDAGCGGGLMTNAYK 220
Query: 58 YVIQNRGINTERDYPNVGVMDNCKVFQFN 86
YV + G+ E DYP G C QFN
Sbjct: 221 YVEEAGGLELESDYPYKGRDGKC---QFN 246
>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
Length = 375
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 45/84 (53%), Gaps = 7/84 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRS-------CVGGFIETIYQ 57
GSCW F+ GA+EG I T L+ +S QQLVDCD+ + R C GG + Y+
Sbjct: 168 GSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYK 227
Query: 58 YVIQNRGINTERDYPNVGVMDNCK 81
Y+I+ G+ E YP G CK
Sbjct: 228 YLIEAGGLEEESSYPYTGKHGECK 251
>gi|405977173|gb|EKC41636.1| Cathepsin K [Crassostrea gigas]
Length = 942
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW FA G +EG T LV +S Q LVDC E+ C GG T Y+Y+ +N G
Sbjct: 750 GSCWAFATTGGLEGQHFRKTKKLVSLSEQNLVDCCK--ENLGCTGGLPVTAYKYIARNGG 807
Query: 65 INTERDYPNVGVMDNC 80
I+TE YP +G NC
Sbjct: 808 IDTEESYPYLGKNGNC 823
>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
Length = 367
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 29/78 (37%), Positives = 48/78 (61%), Gaps = 7/78 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-------SRSCVGGFIETIYQ 57
GSCW F+ +GA+EG + T NL+ +S QQLVDCD++ + + C GG + + ++
Sbjct: 157 GSCWSFSAIGALEGAHFLTTGNLISMSEQQLVDCDHECDPEEYGACDQGCNGGLMTSAFE 216
Query: 58 YVIQNRGINTERDYPNVG 75
Y+++ G+ E YP +G
Sbjct: 217 YILKAGGVEREETYPYIG 234
>gi|357127811|ref|XP_003565571.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 364
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 37/77 (48%), Positives = 43/77 (55%), Gaps = 2/77 (2%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW FA V AIEG++KI T LV +S QQLVDCD S C GG +T V + GI
Sbjct: 168 SCWAFAAVAAIEGMNKIRTGTLVSLSEQQLVDCDKG--SSGCAGGRTDTALDLVAKRGGI 225
Query: 66 NTERDYPNVGVMDNCKV 82
+E YP G C V
Sbjct: 226 TSEEKYPYGGFNGKCNV 242
>gi|163658591|gb|ABY28387.1| cathepsin L [Gnathostoma spinigerum]
Length = 398
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/72 (43%), Positives = 43/72 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T+ LV +S Q LVDC + + C GG ++ ++Y+ N G
Sbjct: 202 GSCWAFSATGALEGQHMRKTHQLVSLSEQNLVDCSRKYGNNGCNGGLMDNAFEYIKDNHG 261
Query: 65 INTERDYPNVGV 76
I+TE YP GV
Sbjct: 262 IDTEESYPYKGV 273
>gi|310656787|gb|ADP02216.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 195
Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 46/78 (58%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ V A E I K+ T LV +S Q+LVDCD G + C GG ++ ++++I+N G
Sbjct: 74 GCCWAFSAVAATECIVKLSTGKLVSLSEQELVDCDIHGVDQGCEGGEMDDAFKFIIKNGG 133
Query: 65 INTERDYPNVGVMDNCKV 82
+ TE +YP CK
Sbjct: 134 LTTEANYPYTAQDGQCKT 151
>gi|294878199|ref|XP_002768307.1| cryptopain precursor, putative [Perkinsus marinus ATCC 50983]
gi|239870555|gb|EER01025.1| cryptopain precursor, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 44/68 (64%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG+ K VT LVD+S QQL+DC + + C GG ++ Y+YV ++ G
Sbjct: 139 GSCWAFSTVGALEGLYKEVTGKLVDLSEQQLMDCSKEYGNEGCGGGNMDRAYEYV-EDHG 197
Query: 65 INTERDYP 72
I YP
Sbjct: 198 IKLNATYP 205
>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
endopeptidase; AltName: Full=Papaya peptidase B;
AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
Precursor
gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
Length = 348
Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/79 (46%), Positives = 47/79 (59%), Gaps = 3/79 (3%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ V +EGI+KI T NLV++S Q+LVDCD Q S C G+ T QYV QN GI
Sbjct: 156 SCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ--SYGCNRGYQSTSLQYVAQN-GI 212
Query: 66 NTERDYPNVGVMDNCKVFQ 84
+ YP + C+ Q
Sbjct: 213 HLRAKYPYIAKQQTCRANQ 231
>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 365
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 44/77 (57%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G++EG T LV +S Q LVDC + C GG ++ ++YV +N G
Sbjct: 170 GSCWAFSATGSLEGQWFRKTGKLVSLSEQNLVDCSTAQGNSGCQGGLMDNAFEYVKENGG 229
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP + D C+
Sbjct: 230 IDTEESYPYIAADDTCQ 246
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG L+ +S Q LVDC + + C GG ++ ++Y+ N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 65 INTERDYPNVGVMDNC 80
I+TE+ YP G+ D+C
Sbjct: 204 IDTEKSYPYEGIDDSC 219
>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
Length = 331
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 32/79 (40%), Positives = 44/79 (55%)
Query: 3 PLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQN 62
P GSCW F+ GA+EG T LV +S Q LVDC + C GG ++ +QYV N
Sbjct: 134 PCGSCWAFSATGALEGQMFRKTKRLVSLSEQNLVDCSQAEGNEGCSGGLMDYAFQYVKDN 193
Query: 63 RGINTERDYPNVGVMDNCK 81
G+++E YP ++CK
Sbjct: 194 GGLDSEESYPYRAQDESCK 212
>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC + ++ C GGF+ + ++YV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSAFRYVKENGG 195
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V + CK
Sbjct: 196 LDSEESYPYVAMDGICK 212
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 46/68 (67%)
Query: 13 VGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYP 72
V A+EGI+++ T L+ +S Q++VDCD +GE + C GG ++ ++++ QN+G+ TE +YP
Sbjct: 111 VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 170
Query: 73 NVGVMDNC 80
G C
Sbjct: 171 YTGTDGTC 178
>gi|225707828|gb|ACO09760.1| Cathepsin S precursor [Osmerus mordax]
Length = 282
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 43/68 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EG K +LV +S Q LVDC + + C GG++ Y YVI NRG
Sbjct: 139 GSCWAFSSIGALEGQMKRRNGSLVPLSPQNLVDCSTRFGNHGCKGGYLSKSYLYVISNRG 198
Query: 65 INTERDYP 72
I++E YP
Sbjct: 199 IDSESFYP 206
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+ IVT NL +S Q+L+DC G + C GG ++ + Y+ G
Sbjct: 141 GSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNN-GCNGGLMDYAFSYIASTGG 199
Query: 65 INTERDYP 72
+ TE YP
Sbjct: 200 LRTEEAYP 207
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 43/68 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
SCW F+ V A+EGI +I ++NLV +STQQL+DC + C G ++ ++Y+ N G
Sbjct: 157 ASCWAFSAVAAVEGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGG 216
Query: 65 INTERDYP 72
I E DYP
Sbjct: 217 IAAESDYP 224
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.140 0.458
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,409,333,940
Number of Sequences: 23463169
Number of extensions: 48738852
Number of successful extensions: 92530
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3846
Number of HSP's successfully gapped in prelim test: 810
Number of HSP's that attempted gapping in prelim test: 86587
Number of HSP's gapped (non-prelim): 4694
length of query: 88
length of database: 8,064,228,071
effective HSP length: 58
effective length of query: 30
effective length of database: 6,703,364,269
effective search space: 201100928070
effective search space used: 201100928070
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 69 (31.2 bits)